Shaoxiong Ji
shaoxiong.ji@utu.fi | +358 29 450 2147 | +358 50 430 4151 | ORCID identifier: https://orcid.org/0000-0003-3281-8002
Natural language processing; large language models; multilingual NLP; AI for health; multimodal AI
Dr. Shaoxiong Ji is an Assistant Professor at the University of Turku and a Principal Investigator at the ELLIS Institute Finland. He received his Ph.D. from Aalto University, after which he was a postdoctoral researcher in high-performance language technology at the University of Helsinki. Prior to his current roles, he worked as an independent research group leader at the Technical University of Darmstadt. Over the course of his academic career, he has been a visiting researcher at several international institutions, including the University of Technology Sydney (UTS), the University of Queensland (UQ), Nanyang Technological University (NTU), the University of Munich (LMU), Shanghai AI Lab, and the Finnish Institute for Health and Welfare (THL).
- Graph2text or Graph2token: A Perspective of Large Language Models for Graph Learning (2026)
  ACM Transactions on Information Systems
  (A1 Refereed original research article in a scientific journal)
- GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models (2025)
  Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
  Luo, Hengyu; Li, Zihao; Attieh, Joseph; Devkota, Sawal; de Gibert, Ona; Huang, Xu; Ji, Shaoxiong; Lin, Peiqin; Mantina, Bhavani Sai Praneeth Varma; Sreenidhi, Ananda; Vázquez, Raúl; Wang, Mengjie; Yusofi, Samea; Yuan, Fei; Tiedemann, Jörg
  (A4 Refereed article in a conference publication)
- How Many Languages Make Good Multilingual Instruction Tuning? A Case Study on BLOOM (2025)
  Proceedings of the 31st International Conference on Computational Linguistics, COLING 2025, Abu Dhabi, UAE, January 19-24, 2025
  Ji S; Chen P
  (A4 Refereed article in a conference publication)
- Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources (2025)
  Proceedings of the Second Conference on Language Modeling, COLM 2025
  Li, Zihao; Ji, Shaoxiong; Luo, Hengyu; Tiedemann, Jörg
  (D3 Article in a professional conference publication)
- A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives (2024)
  Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024, Miami, FL, USA, November 12-16, 2024
  Li Z; Ji S; Mickus T; Segonne V; Tiedemann J
  (A4 Refereed article in a conference publication)
- A New Massive Multilingual Dataset for High-Performance Language Technologies (2024)
  LREC Proceedings
  (A4 Refereed article in a conference publication)
- A Unified Review of Deep Learning for Automated Medical Coding (2024)
  ACM Computing Surveys
  (A1 Refereed original research article in a scientific journal)
- Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning? (2024)
  Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC/COLING 2024, 20-25 May, 2024, Torino, Italy
  Ji S; Mickus T; Segonne V; Tiedemann J
  (A4 Refereed article in a conference publication)
- Emerging trends in federated learning: from model fusion to federated X learning (2024)
  International Journal of Machine Learning and Cybernetics
  (A1 Refereed original research article in a scientific journal)
- Knowledge-augmented Graph Neural Networks with Concept-aware Attention for Adverse Drug Event Detection (2024)
  Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC/COLING 2024, 20-25 May, 2024, Torino, Italy
  Gao Y; Ji S; Marttinen P
  (A4 Refereed article in a conference publication)
- MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki (2024)
  Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2024 - System Demonstrations, St. Julians, Malta, March 17-22, 2024
  Mickus T; Grönroos S; Attieh J; Boggia M; Gibert Bonet O; Ji S; Lopi NA; Raganato A; Vázquez R; Tiedemann J
  (A4 Refereed article in a conference publication)
- Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca (2024)
  Chen, Pinzhen; Ji, Shaoxiong; Bogoychev, Nikolay; Kutuzov, Andrey; Haddow, Barry; Heafield, Kenneth
  (A1 Refereed original research article in a scientific journal)
- Risk adjustment for regional healthcare funding allocations with ensemble methods: an empirical study and interpretation (2024)
  European Journal of Health Economics
  (A1 Refereed original research article in a scientific journal)
- SuicidEmoji: Derived Emoji Dataset and Tasks for Suicide-Related Social Content (2024)
  Zhang, Tianlin; Yang, Kailai; Ji, Shaoxiong; Liu, Boyang; Xie, Qianqian; Ananiadou, Sophia
  (A4 Refereed article in a conference publication)
- TransFOL: A Logical Query Model for Complex Relational Reasoning in Drug-Drug Interaction (2024)
  IEEE Journal of Biomedical and Health Informatics
  (A1 Refereed original research article in a scientific journal)
- A Bipartite Graph is All We Need for Enhancing Emotional Reasoning with Commonsense Knowledge (2023)
  Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, CIKM 2023, Birmingham, United Kingdom, October 21-25, 2023
  Yang K; Zhang T; Ji S; Ananiadou S
  (A1 Refereed original research article in a scientific journal)
- Contextualized Graph Embeddings for Adverse Drug Event Detection (2023)
  Lecture Notes in Computer Science
  (A4 Refereed article in a conference publication)
- Emotion fusion for mental illness detection from social media: A survey (2023)
  Zhang T; Yang K; Ji S; Ananiadou S
  (A1 Refereed original research article in a scientific journal)
- Ensemble Hybrid Learning Methods for Automated Depression Detection (2023)
  Ansari L; Ji S; Chen Q; Cambria E
  (A1 Refereed original research article in a scientific journal)
- Multitask Balanced and Recalibrated Network for Medical Code Prediction (2023)
  ACM Transactions on Intelligent Systems and Technology
  (A1 Refereed original research article in a scientific journal)



