Item Response Theory (IRT)
Item Response Theory (IRT) is a paradigm in psychometrics and education that deals with the design, analysis, and scoring of questionnaires, tests, and other instruments for measuring abilities, attitudes, and other variables. Unlike Classical Test Theory, which focuses on test-level measures, IRT concentrates on individual items in a test to examine how different items function across varying levels of the attribute being measured.
IRT began to gain prominence in the mid-20th century as computer technology advanced, allowing for more complex statistical models. Early contributions were made by psychologists and statisticians like Frederic M. Lord and Georg Rasch.
- Parameters: In IRT, items are described by one or more parameters that explain different aspects like difficulty, discrimination, and guessing. These parameters are used to construct the Item Characteristic Curve (ICC).
- Item Characteristic Curve (ICC): The ICC is a graphical representation showing the probability of a correct response to an item across different levels of the latent trait being measured. The curve is usually S-shaped (sigmoidal).
- 1PL Model: The One-Parameter Logistic (1PL) model, also known as the Rasch model, is the simplest IRT model, considering only the difficulty parameter.
- 2PL Model: The Two-Parameter Logistic (2PL) model considers both difficulty and discrimination parameters. It is more flexible than the 1PL model but requires more data.
- 3PL Model: The Three-Parameter Logistic (3PL) model adds a guessing parameter to the 2PL model, making it suitable for multiple-choice tests where guessing is a factor.
- Educational Testing
- Personality Assessment
- Clinical Diagnosis
- Job Performance Measurement
- Adaptive Testing
- R packages like ltm and mirt
- Commercial software like Winsteps and IRTPro
- Requires large sample sizes
- Complex to understand and implement
- Assumes unidimensionality, which might not always be realistic