Item difficulty
The proportion of a reference sample that answers a particular item correctly. Easy items (proportion correct > 0.7) discriminate poorly among above-average test-takers; hard items (proportion correct < 0.3) discriminate poorly among below-average test-takers. Modern adaptive tests select items at difficulty levels matched to the test-taker's estimated ability.
The proportion of a reference sample that answers a particular item correctly. Easy items (proportion correct > 0.7) discriminate poorly among above-average test-takers; hard items (proportion correct < 0.3) discriminate poorly among below-average test-takers. Modern adaptive tests select items at difficulty levels matched to the test-taker's estimated ability.
This term appears throughout the cognitive ability literature and across this site's articles. Understanding it is essential for interpreting any IQ score or cognitive subtest result. Modern psychometric textbooks (such as those by Anne Anastasi or Susan Embretson) cover the term in significant additional depth and document the empirical findings that justify its prominence in the field.
In the context of online IQ testing, the implications of this term are usually that the test-taker should be cautious about over-interpreting brief screener results. Most of the published precision claims for major IQ batteries do not transfer directly to short online instruments, and the relevant adjustments — wider confidence intervals, more conservative band assignments — are best made explicitly rather than ignored.
For further reading on this term, consult the related entries in this glossary and the deep-dive articles linked in the Related Reading section. The American Psychological Association's task force report 'Intelligence: Knowns and Unknowns' (1995) and its follow-ups remain the most authoritative summary at an accessible technical level.
Other glossary entries
Gifted
A designation typically used in school placement contexts for students scoring at or above the 95th to 98th percentile o…
Validity
The degree to which a test measures what it claims to measure. Multiple flavors: construct validity (does the test measu…
Floor effect
The phenomenon where test-takers below a certain ability level all score at the minimum possible score, losing the abili…
Crystallized intelligence (Gc)
The breadth and depth of knowledge accumulated through education, reading, and life experience. Measured by vocabulary t…
Flynn effect
The systematic rise in average IQ scores across the 20th century — about 3 IQ points per decade in industrialized countr…
Working memory
The cognitive system that holds and manipulates information over short timescales (seconds). Distinct from passive short…