Mathematical Psychology
About

Two-Parameter Model

The two-parameter logistic (2PL) model in item response theory characterizes each item by its difficulty and discrimination, modeling the probability of a correct response as a logistic function of the latent trait.

P(θ) = 1 / (1 + exp(−a(θ − b)))

The two-parameter logistic (2PL) model is a cornerstone of item response theory (IRT). Proposed by Birnbaum (1968), it models the probability that an examinee with latent ability θ answers an item correctly as a function of two item parameters: difficulty (b) and discrimination (a). The model provides a richer characterization of item behavior than the one-parameter Rasch model by allowing items to differ in how sharply they discriminate among examinees of different ability levels.

The Model

2PL Item Response Function P_i(θ) = 1 / (1 + exp(−a_i(θ − b_i)))

where θ = latent ability
a_i = discrimination parameter (slope)
b_i = difficulty parameter (location)

The item characteristic curve (ICC) is an S-shaped logistic function. The difficulty parameter b locates the curve on the ability continuum: it is the value of θ at which the probability of a correct response is exactly 0.50. The discrimination parameter a controls the steepness of the curve at the inflection point. Higher values of a indicate that the item differentiates sharply between examinees just above and just below the difficulty level; lower values indicate a more gradual transition.

Parameter Estimation

Parameters are estimated by marginal maximum likelihood (MML), in which the ability distribution is integrated out and item parameters are estimated from the marginal likelihood of the response patterns. Ability estimates for individual examinees are then obtained conditional on the estimated item parameters, using maximum likelihood estimation (MLE) or expected a posteriori (EAP) methods. The EM algorithm is the standard computational approach for MML estimation, iterating between computing expected counts (E-step) and maximizing the log-likelihood (M-step).

Log-Likelihood ln L = Σ_j ln ∫ ∏_i P_i(θ)^u_ji × Q_i(θ)^(1−u_ji) × g(θ) dθ

where u_ji = response of person j to item i
Q_i(θ) = 1 − P_i(θ), g(θ) = prior ability distribution
2PL vs. Rasch Model

The Rasch (1PL) model constrains all discrimination parameters to be equal, yielding the property of specific objectivity: item parameters can be estimated independently of the ability distribution, and ability comparisons are independent of which items are administered. The 2PL relaxes this constraint, gaining flexibility at the cost of losing specific objectivity. The choice between models reflects a tension between measurement principles and empirical fit that has been debated for decades in the psychometric community.

Applications

The 2PL model is widely used in educational and psychological testing. It forms the basis for item banking — calibrating large pools of items on a common scale — and is the default model in many computerized adaptive testing systems. The discrimination parameter enables item selection algorithms to choose the most informative item for each examinee's current ability estimate. In test development, items with very low discrimination provide little measurement information and are typically revised or discarded.

Model fit is evaluated at both the item and test levels. At the item level, chi-square statistics compare observed and expected response proportions across ability groups. At the test level, likelihood-ratio tests compare the 2PL to more restricted (1PL) or more general (3PL) models. Graphical inspection of empirical item characteristic curves against the fitted model remains an essential diagnostic tool. The 2PL model strikes a balance between parsimony and flexibility that has made it the workhorse of modern psychometrics.

Interactive Calculator

Each row provides theta (person ability) and b (item difficulty). The calculator computes the probability of a correct response under the Rasch model: P = exp(θ−b) / (1+exp(θ−b)).

Click Calculate to see results, or Animate to watch the statistics update one record at a time.

Related Topics

References

  1. Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee's ability. In F. M. Lord & M. R. Novick, Statistical theories of mental test scores (pp. 397–479). Addison-Wesley.
  2. Baker, F. B., & Kim, S.-H. (2004). Item response theory: Parameter estimation techniques (2nd ed.). Marcel Dekker. doi:10.1201/9781482276725
  3. Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. Erlbaum. doi:10.1037/10519-000
  4. Bock, R. D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46(4), 443–459. doi:10.1007/BF02293801

External Links