A1 Refereed original research article in a scientific journal

Seeking the real item difficulty: bias-corrected item difficulty and some consequences in Rasch and IRT modeling




AuthorsMetsämuuronen Jari

PublisherBehaviormetric Society of Japan

Publication year2022

Journal: Behaviormetrika

eISSN1349-6964

DOIhttps://doi.org/10.1007/s41237-022-00169-9

Publication's open availability at the time of reportingOpen Access

Publication channel's open availability Partially Open Access publication channel

Web address https://link.springer.com/article/10.1007/s41237-022-00169-9

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/176017020


Abstract

When the response pattern in a test item deviates from the deterministic pattern, the percentage of correct answers (p) is shown to be a biased estimator for the latent item difficulty (π). This is specifically true with the items of medium item difficulty. Four elements of impurities in p are formalized in the binary settings and four new estimators of π are proposed and studied. Algebraic reasons and a simulation suggest that, except the case of deterministic item discrimination, the real item difficulty is almost always more extreme than what p indicates. This characteristic of p to be biased toward a medium-leveled item difficulty has a strict consequence to item response theory (IRT) and Rasch modeling. Because the classical estimator of item difficulty p is a biased estimator of the latent difficulty level, the item parameters A and B and the person parameter θ within IRT modeling are, consequently, biased estimators of item discrimination and item difficulty as well as ability levels of the test takers.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 26/11/2024 02:00:16 PM