A1 Refereed original research article in a scientific journal

A Data-Driven Approach to Compare the Syntactic Difficulty of Programming Languages




AuthorsLokkila Erno, Christopoulos Athanasios, Laakso Mikko-Jussi

PublisherJournal of Information Systems Education

Publication year2023

JournalJournal of Information Systems Education

Journal acronymJISE

Volume34

Issue1

First page 84

Last page93

eISSN2574-3872

Web address https://jise.org/Volume34/n1/JISE2023v34n1pp84-93.html


Abstract

Educators who teach programming subjects are often wondering “which programming language should I teach first?”. The debate behind this question has a long history and coming up with a definite answer to this question would be farfetched. Nonetheless, several efforts can be identified in the literature wherein pros and cons of mainstream programming languages are examined, analysed, and discussed in view of their potential to facilitate the didactics of programming concepts especially to novice programmers. In line with these efforts, we explore the latter question by comparing the syntactic difficulty of two modern, but fundamentally different, programming languages: Java and Python. To achieve this objective, we introduce a standalone and purely data-driven method which stores the code submissions and clusters the errors occurred under the aid of a custom transition probability matrix. For the evaluation of this model a total of 219 454 submissions, made by 715 first-year undergraduate students, in 259 unique programming exercises were gathered and analysed. The results indicate that Python is an easier-to-grasp programming language and is, therefore, highly recommended as the steppingstone in introductory courses. Besides, the adoption of the described method enables educators to not only identify those students who struggle with coding (syntax-wise) but further paves the pathway for the adoption of personalised and adaptive learning practices.



Last updated on 2024-26-11 at 11:41