A Correct Response Model in knowledge structure theory

doi:10.1016/j.jmp.2021.102519

Journal of Mathematical Psychology

Volume 102, June 2021, 102519

https://doi.org/10.1016/j.jmp.2021.102519 Get rights and content

Highlights

•
The Correct Response Model predicts the probability of a correct answer to any item
•
It is more straightforward than the classical Basic Local Independence Model
•
We investigate its testability, identifiability and characterizability

Abstract

In knowledge space theory, the (latent) knowledge state of a student consists of the subset of test items that he masters in principle. Even at a given stage of apprenticeship, the student’s knowledge state may vary in a given collection of subsets. The collection of all possible states of all potential students forms a knowledge structure. In the modeling of student answer production, the probabilities governing the knowledge state are parameters. Moreover, for each item, two additional parameters capture the probabilities of careless errors and lucky guesses in the answers to the item. From all the latter parameters, the Correct Response Model (CRM) predicts the probability of a correct answer to any isolated item. From the same parameters, the Basic Local Independence Model (BLIM) predicts the probability of any possible pattern of correct responses. Here, a particular pattern records at a given time all the items to which a given student would provide correct answers. While general properties of the BLIM (such as identifiability of parameters) have been investigated, the simpler Correct Response Model still requires scrutiny. The present paper investigates the CRM as regards testability, identifiability and characterizability. It either provides explicit answers or points out serious difficulties garnered from various mathematical disciplines.

Introduction

A toy example serves here to introduce the questions investigated in the paper. Consider a domain $Q$ consisting of three items: $Q = {a, b, c} .$ We view each item as an elementary piece in some corpus of knowledge, or equally well as an item belonging to an examination test about the corpus. Moreover, we identify the knowledge of a person in the domain as the collection of items he masters. Because of dependencies (which may be based on the logical organization of the corpus, or just statistically observed in a given population of students), not any subset of the domain can be a state of knowledge. Suppose there are only four (potential) knowledge states, constituting the collection $K = {\emptyset, {a, b}, {a, c}, Q} .$ In principle, the correctness of the answer provided by a student to any item depends only on his knowledge state. However, variations are routinely observed in such answers. This observation motivates the introduction of stochastic effects governed by probability values. So, assume the knowledge state of a student varies in the collection $K$ according to (unknown) probabilities assigned to the states and denoted as $π (\emptyset), π ({a, b}), π ({a, c}), π (Q) .$ Moreover, when a student prepares an answer to an item in his actual knowledge state, he might by inattention provide a false answer; or to the contrary, for an item outside of his state, he could by chance provide a correct answer. We thus introduce probabilities $β_{a}$ , $β_{b}$ , $β_{c}$ for careless errors in answering the three items, as well as probabilities $η_{a}$ , $η_{b}$ , $η_{c}$ for lucky guesses in those answers.

A first model (implicit in Doignon & Falmagne, 1999, page 270) defines the probability $τ (q)$ of a correct answer to any isolated item $q$ ; we call it the Correct Response Model (CRM). It first conditions the probability of a correct answer to item $q$ on the state of the student: $τ (q) = \sum_{K \in K} P r (q ∣ K) \cdot π (K) .$ Then, it specifies each conditional probability $P r (q ∣ K)$ by taking into account the careless error probabilities $β_{q}$ and the lucky guess probabilities $η_{q}$ . In the case of the item $b$ in our toy example, the CRM sets $τ (b) \overset{def}{=} (1 - β_{b}) \cdot π ({a, b}) + (1 - β_{b}) \cdot π (Q) + η_{b} \cdot π (\emptyset) + η_{b} \cdot π ({a, c}) = (1 - β_{b}) \cdot (π ({a, b}) + π (Q)) + η_{b} \cdot (π (\emptyset) + π ({a, c})) .$ In general, with the notations $K_{q} = {K \in K ∣ q \in K}, K_{\bar{q}} = {K \in K ∣ q \notin K},$ the probability of a correct answer to the item $q$ equals $τ (q) = (1 - β_{q}) \cdot π (K_{q}) + η_{q} \cdot π (K_{\bar{q}}) .$

A second model, the Basic Local Independence Model (BLIM), is due to Falmagne and Doignon (1988a) (see also Doignon & Falmagne, 1999 and Falmagne & Doignon, 2011). It defines the probability of each response pattern, where a particular response pattern collects the items from the domain to which a student would provide a correct answer at a given time of his apprenticeship. For instance, $a$ correct, $b$ false, $c$ correct is summarized by the pattern ${a$ , $c}$ (a subset of $Q$ ). According to the BLIM, the latter pattern is observed with a probability equal to $\begin{matrix} ρ ({a, c}) & = & \sum_{K \in K} P r ({a, c} ∣ K) \cdot π (K) \\ \overset{def}{=} & η_{a} (1 - η_{b}) η_{c} \cdot π (\emptyset) \\ + (1 - β_{a}) β_{b} η_{c} \cdot π ({a, b}) \\ + (1 - β_{a}) (1 - η_{b}) (1 - β_{c}) \cdot π ({a, c}) \\ + (1 - β_{a}) β_{b} (1 - β_{c}) \cdot π (Q) . \end{matrix}$ Both models rest on independence assumptions underlying their fundamental formulas in Eqs. (6), (7) (the formulation of the latter equation for a general pattern should be clear—anyway see later Eq. (9)). The models respectively define the probability of a correct response to each isolated item (CRM) or the probability of a whole pattern of responses (BLIM). Both rely on a fixed collection $K$ of states (some specified subsets of a domain $Q$ ) together with the values of the various parameters: the probabilities $π (K)$ of the states $K$ in $K$ and the probabilities $β_{q}$ of careless errors and $η_{q}$ of lucky guesses for the items $q$ in the domain $Q$ . A particular case plays an important role in previous investigations (Falmagne and Doignon, 1988a, Falmagne and Doignon, 1988b), as well as here: by definition, the straight case obtains when $β_{q} = η_{q} = 0$ for all items $q$ , that is, guessing and distraction are stochastically excluded.

Both the CRM and the BLIM are instances of probabilistic models. According to some formal process or formula, such a model predicts from any parameter point a distribution of probability values for some observables. Let us use the term predicted distribution or predicted point to abbreviate “any distribution of probability values for the observables that the model predicts”. Bamber and van Santen (2000) precisely state two fundamental questions about a probabilistic model, (i) model testability: is there any distribution of probability values for the observables that the model does not predict? (ii) model identifiability: is each predicted distribution produced from at most one parameter point? It is also common to investigate a third question, (iii) model characterizability: are the predicted distributions susceptible of an effective characterization (without reference to the underlying parameter values)?

Recently, several researchers have produced nice results about identifiability of the BLIM, the model for pattern probabilities. For instance, Stefanutti, Heller, Anselmi, and Robusto (2012) offer results on local identifiability of the model (this concept, as well as the related ones we need about models, will be precisely defined in Section 3). Spoto, Stefanutti, and Vidotto (2013) construct families of collections $K$ of states for which the BLIM is not identifiable. Heller (2017) further investigates local identifiability and so-called trade-offs among parameters. Stefanutti, Spoto, and Vidotto (2018) derive non-identifiability of the BLIM in case certain parameter transformations do exist. More recently, Stefanutti and Spoto (2020) further investigate groups of such transformations. Here, we consider the three fundamental questions for the first model, namely the correct response model (CRM). We reformulate (i) testability and (ii) identifiability, and investigate (iii) characterizability. We also point out serious difficulties for providing answers in the form of practically efficient criteria: we exhibit appropriate subcases which in other areas of mathematical research are deemed to be hopeless. Our results are often stated for the straight case. One of them indicates how to restrict in a natural way the parameter domain in order to make the modified CRM identifiable, at least for a large family of knowledge structures. Doignon, Heller, and Stefanutti (2018) is a survey chapter on the fundamental questions on probabilistic models, with the CRM and the BLIM as prominent examples. Here we give all proofs for results on the CRM, and we expose in a much simpler way how to restore identifiability.

Section snippets

Some basic concepts and two models in knowledge space theory

We briefly review the concepts from Knowledge Space Theory that we will need later on. A more detailed approach is offered in Doignon and Falmagne (1999) and Falmagne and Doignon (2011).

Definition 1 Knowledge Structure

A knowledge structure $(Q, K)$ consists of a finite set $Q$ of items together with a collection $K$ of subsets of $Q$ called knowledge states. It is assumed $\emptyset \in K$ and $Q \in K$ .

We recall two convenient notations where $q$ is an item of the knowledge structure $(Q, K)$ : $K_{q} = {K \in K ∣ q \in K}, K_{\bar{q}} = {K \in K ∣ q \notin K} .$

Additional axioms define particular

Fundamental questions on models

We now recall basic terminology about probabilistic models. Such a model predicts, in accordance with a general rule, the probabilistic behavior of some observable quantities from the specific values taken by parameters.

Definition 6 Model

A (probabilistic) model $(D, f, O)$ consists of a parameter domain $D$ , an outcome space $O$ , and a prediction function $f : D \to O,$ with $f (D)$ the predicted range. An element of $D$ is a parameter point, an element of $f (D)$ a predicted point.

Definition 7 Testability

The model $(D, f, O)$ is testable if $f (D) \subset O$ (strict

The correct response model

According to Definition 5, the Correct Response Model (CRM) is based on a knowledge structure $(Q, K)$ . Its parameters are as follows: for any state $K$ in $K$ , the probability $π (K)$ ; moreover, for any item $q$ in $Q$ , the probabilities $β_{q}$ of a careless error and $η_{q}$ of a lucky guess.

To get a formal definition of the CRM, we consider the real vector space $R^{K}$ (of all functions from $K$ to $R$ ) in which each vector $x$ has one real component $x (K)$ , also denoted $x_{K}$ , for each state $K$ in $K$ . The simplex $Λ_{K}$ consists of

Testability and identifiability of the CRM

Let us first consider the CRM in the straight case, denoting again by $(Q, K)$ the knowledge structure on which the model is based, and by $f$ the prediction function of the model. Thus the parameter domain is the simplex $D = Λ_{K}$ and the prediction function is $f : Λ_{K} \to {[0, 1]}^{Q} : π \mapsto τ with τ (q) = π (K_{q}) = \sum_{K \in K_{q}} π (K) .$ To summarize, an instance of the straight CRM is specified with the notation $(Λ_{K}, f, {[0, 1]}^{Q})$ . Clearly, the function $f$ in (15) is linear, or more adequately said: $f$ is the restriction to the simplex $Λ_{K}$ of a

Characterizability of the CRM

As we saw in the previous section, the prediction function $f$ of the straight CRM $(Λ_{K}, f, {[0, 1]}^{Q})$ extends to the linear mapping (still denoted by $f$ ) $f : R^{K} \to R^{Q} : π \mapsto τ with τ (q) = π (K_{q}) = \sum_{K \in K_{q}} π (K) .$ The matrix of $f$ (in the canonical basis of $R^{K}$ and $R^{Q}$ ) is the incidence matrix $M_{K}$ of $K$ . According to Proposition 1, the predicted range of the CRM $(Λ_{K}, f, {[0, 1]}^{Q})$ is the $0 ∕ 1$ -polytope $P_{K}$ having as vertices all characteristic vectors $χ_{K}$ for $K$ in $K$ (that is, all columns of $M_{K}$ ). As a consequence:

Proposition 6

Any linear description of

Restoring identifiability for the straight CRM

The final paragraph of Section 4 states that, in any case, some appropriate restriction of the domain ensures identifiability of a model and preserves the predicted range: it suffices that the restricted domain be a minimal transversal of the collection of counter-images of the prediction function. In this section we find a canonical restriction in the context of the straight CRM $(Λ_{K}, f, {[0, 1]}^{Q})$ based on a quasi ordinal space. We start with an illustrative example.

Example 5

Let $Q = {a, b}$ and $K_{1} = {\emptyset, {a}, {a, b}}$

Conclusion

The paper investigates the fundamental properties of the correct response model (CRM), a probabilistic model implicit in earlier publications on knowledge space theory (see for instance Doignon & Falmagne, 1999 page 270). In the straight case (no careless error or lucky guess), we provide in Section 5 a simple criterion for testability of the model (Proposition 2), as well as an intricate one for numerical testability (Proposition 3). We also state another intricate criterion for

Acknowledgments

The author thanks the Editor Jürgen Heller and two anonymous Reviewers for their helpful comments on preliminary versions of the text.

References (36)

BamberD. et al.
How many parameters can a model have and still be testable?
Journal of Mathematical Psychology
(1985)
BamberD. et al.
How to assess a model’s testability and identifiability
Journal of Mathematical Psychology
(2000)
CosynE. et al.
Note on two necessary and sufficient axioms for a well-graded knowledge space
Journal of Mathematical Psychology
(2009)
Davis-StoberC. et al.
Extended formulations for order polytopes through network flows
Journal of Mathematical Psychology
(2018)
DoignonJ.-P. et al.
Facets of the linear ordering polytope: a unification for the fence family through weighted graphs
Journal of Mathematical Psychology
(2006)
DoignonJ.-P. et al.
Weighted graphs defining facets: a connection between stable set and linear ordering polytopes
Discrete Optimization
(2009)
DoignonJ.-P. et al.
Primary facets of order polytopes
Journal of Mathematical Psychology
(2016)
FalmagneJ.-C.
A representation theorem for finite random scale systems
Journal of Mathematical Psychology
(1978)
FalmagneJ.-C. et al.
A Markovian procedure for assessing the state of a system
Journal of Mathematical Psychology
(1988)
FioriniS.
A short proof of a theorem of Falmagne
Journal of Mathematical Psychology
(2004)

HellerJ.

Identifiability in probabilistic knowledge structures

Journal of Mathematical Psychology

(2017)

HellerJ. et al.

Minimum discrepancy estimation in probabilistic knowledge structures

Electronic Notes in Discrete Mathematics

(2013)

SpotoA. et al.

Considerations about the identification of forward-and backward-graded knowledge structures

Journal of Mathematical Psychology

(2013)

StefanuttiL. et al.

Blim’s identifiability and parameter invariance under backward and forward transformations

Journal of Mathematical Psychology

(2020)

StefanuttiL. et al.

Detecting and explaining blim’s unidentifiability: Forward and backward parameter transformation groups

Journal of Mathematical Psychology

(2018)

ÜnlüA.

Estimation of careless error and lucky guess probabilities for dichotomous test items: a psychometric application of a biometric latent class model with random effects

Journal of Mathematical Psychology

(2006)

de ChiusoleD. et al.

Modeling missing data in knowledge space theory

Psychological Methamphetamine

(2015)

ConfortiM. et al.

Extended formulations in combinatorial optimization

4OR

(2010)

Cited by (0)

View full text

A Correct Response Model in knowledge structure theory

Highlights

Abstract

Introduction

Section snippets

Some basic concepts and two models in knowledge space theory

Fundamental questions on models

The correct response model

Testability and identifiability of the CRM

Characterizability of the CRM

Restoring identifiability for the straight CRM

Conclusion

Acknowledgments

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Discrete Optimization

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Electronic Notes in Discrete Mathematics

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Modeling missing data in knowledge space theory

Psychological Methamphetamine

Extended formulations in combinatorial optimization

4OR