Unobserved and dropped categories

Top Up Down  A A

If you have data in which a category is not observed, then you must make an assertion about the unobserved category. There are several options:

 

For intermediate categories: either

(a) this category will never be observed (this is called a "structural zero"). Generally, these categories are collapsed or recoded out of the rating scale hierarchy. This happens automatically with STKEEP=No.

or (b) this category didn't happen to be observed this time (an "incidental" or "sampling" zero). These categories can be maintained in the rating scale hierarchy (using STKEEP=Yes), but are estimated to be observed with a probability of zero.

 

For extreme categories:

(a) if this category will never be observed, the rating scale is analyzed as a shorter scale. This is the Winsteps standard.

(b) if this category may be observed, then introduce a dummy record into the data set which includes the unobserved extreme category, and also extreme categories for all other items except the easiest (or hardest) item. This forces the rare category into the category hierarchy.

(c) If an extreme (top or bottom) category is only observed for persons with extreme scores, then that category will be dropped from the rating (or partial credit) scales. This can lead to apparently paradoxical or incomplete results. This is particularly noticeable with ISGROUPS=0.

 

In order to account for unobserved extreme categories, a dummy data record needs to be introduced. If there is a dropped bottom category, then append to the data file a person data record which has bottom categories for all items except the easiest, or if the easiest item is in question, except for the second easiest.

 

If there is a dropped top category, then append to the data file a person data record which has top categories for all items except the most difficult, or if the most difficult item is in question, except for the second most difficult.

 

This extra person record will have very little impact on the relative measures of the non-extreme persons, but will make all categories of all items active in the measurement process.

 

If it is required to produce person statistics omitting the dummy record, then use PSELECT= to omit it, and regenerate Table 3.