Shaoxiong Ji
shaoxiong.ji@utu.fi | +358 29 450 2147 | +358 50 430 4151 | ORCID iD: https://orcid.org/0000-0003-3281-8002
Natural language processing; large language models; multilingual NLP; AI for health; multimodal AI
Dr. Shaoxiong Ji is an Assistant Professor at the University of Turku and a Principal Investigator at the ELLIS Institute Finland. He received his Ph.D. from Aalto University and was then a postdoctoral researcher in high-performance language technology at the University of Helsinki. Before taking up his current roles, he was an independent research group leader at the Technical University of Darmstadt. Over his academic career, he has been a visiting researcher at several international institutions, including the University of Technology Sydney (UTS), the University of Queensland (UQ), Nanyang Technological University (NTU), the University of Munich (LMU), Shanghai AI Lab, and the Finnish Institute for Health and Welfare (THL).
- Graph2text or Graph2token: A Perspective of Large Language Models for Graph Learning (2026). ACM Transactions on Information Systems. (A1 Peer-reviewed original article in a scientific journal)
- GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models (2025). Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. Luo, Hengyu; Li, Zihao; Attieh, Joseph; Devkota, Sawal; de Gibert, Ona; Huang, Xu; Ji, Shaoxiong; Lin, Peiqin; Mantina, Bhavani Sai Praneeth Varma; Sreenidhi, Ananda; Vázquez, Raúl; Wang, Mengjie; Yusofi, Samea; Yuan, Fei; Tiedemann, Jörg. (A4 Peer-reviewed article in conference proceedings)
- How Many Languages Make Good Multilingual Instruction Tuning? A Case Study on BLOOM (2025). Proceedings of the 31st International Conference on Computational Linguistics, COLING 2025, Abu Dhabi, UAE, January 19-24, 2025. Ji S; Chen P. (A4 Peer-reviewed article in conference proceedings)
- Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources (2025). Proceedings of the Second Conference on Language Modeling, COLM 2025. Li, Zihao; Ji, Shaoxiong; Luo, Hengyu; Tiedemann, Jörg. (D3 Article in professional conference proceedings)
- A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives (2024). Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024, Miami, FL, USA, November 12-16, 2024. Li Z; Ji S; Mickus T; Segonne V; Tiedemann J. (A4 Peer-reviewed article in conference proceedings)
- A New Massive Multilingual Dataset for High-Performance Language Technologies (2024). LREC Proceedings. (A4 Peer-reviewed article in conference proceedings)
- A Unified Review of Deep Learning for Automated Medical Coding (2024). ACM Computing Surveys. (A1 Peer-reviewed original article in a scientific journal)
- Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning? (2024). Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC/COLING 2024, 20-25 May, 2024, Torino, Italy. Ji S; Mickus T; Segonne V; Tiedemann J. (A4 Peer-reviewed article in conference proceedings)
- Emerging trends in federated learning: from model fusion to federated X learning (2024). International Journal of Machine Learning and Cybernetics. (A1 Peer-reviewed original article in a scientific journal)
- Knowledge-augmented Graph Neural Networks with Concept-aware Attention for Adverse Drug Event Detection (2024). Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC/COLING 2024, 20-25 May, 2024, Torino, Italy. Gao Y; Ji S; Marttinen P. (A4 Peer-reviewed article in conference proceedings)
- MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki (2024). Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2024 - System Demonstrations, St. Julians, Malta, March 17-22, 2024. Mickus T; Grönroos S; Attieh J; Boggia M; Gibert Bonet O; Ji S; Lopi NA; Raganato A; Vázquez R; Tiedemann J. (A4 Peer-reviewed article in conference proceedings)
- Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca (2024). Chen, Pinzhen; Ji, Shaoxiong; Bogoychev, Nikolay; Kutuzov, Andrey; Haddow, Barry; Heafield, Kenneth. (A1 Peer-reviewed original article in a scientific journal)
- Risk adjustment for regional healthcare funding allocations with ensemble methods: an empirical study and interpretation (2024). European Journal of Health Economics. (A1 Peer-reviewed original article in a scientific journal)
- SuicidEmoji: Derived Emoji Dataset and Tasks for Suicide-Related Social Content (2024). Zhang, Tianlin; Yang, Kailai; Ji, Shaoxiong; Liu, Boyang; Xie, Qianqian; Ananiadou, Sophia. (A4 Peer-reviewed article in conference proceedings)
- TransFOL: A Logical Query Model for Complex Relational Reasoning in Drug-Drug Interaction (2024). IEEE Journal of Biomedical and Health Informatics. (A1 Peer-reviewed original article in a scientific journal)
- A Bipartite Graph is All We Need for Enhancing Emotional Reasoning with Commonsense Knowledge (2023). Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, CIKM 2023, Birmingham, United Kingdom, October 21-25, 2023. Yang K; Zhang T; Ji S; Ananiadou S. (A1 Peer-reviewed original article in a scientific journal)
- Contextualized Graph Embeddings for Adverse Drug Event Detection (2023). Lecture Notes in Computer Science. (A4 Peer-reviewed article in conference proceedings)
- Emotion fusion for mental illness detection from social media: A survey (2023). Zhang T; Yang K; Ji S; Ananiadou S. (A1 Peer-reviewed original article in a scientific journal)
- Ensemble Hybrid Learning Methods for Automated Depression Detection (2023). Ansari L; Ji S; Chen Q; Cambria E. (A1 Peer-reviewed original article in a scientific journal)
- Multitask Balanced and Recalibrated Network for Medical Code Prediction (2023). ACM Transactions on Intelligent Systems and Technology. (A1 Peer-reviewed original article in a scientific journal)



