Validity
The degree to which a test measures what it claims to measure. Multiple flavors: construct validity (does the test measure the underlying construct), criterion validity (does the test predict relevant outcomes), content validity (does the test sample the relevant content domain), face validity (does the test appear to measure what it claims). Modern measurement theory treats validity as a property of inferences from test scores, not of tests themselves.
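Criterion validity is commonly summarized as a validity coefficient: the correlation between test scores and the outcome the test is meant to predict. The sketch below shows that calculation under stated assumptions. The helper `pearson_r` and both data lists are hypothetical and made up for illustration, so the size of the resulting coefficient says nothing about any real test.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length sequences of numbers."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of cross-products and sums of squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return cov / sqrt(var_x * var_y)


# Made-up illustrative data: one test score and one criterion value
# (for example, a later performance rating) per person.
test_scores = [95, 102, 110, 88, 120, 105, 99, 115]
criterion = [2.8, 3.0, 3.4, 2.5, 3.9, 3.1, 3.2, 3.5]

validity_coefficient = pearson_r(test_scores, criterion)
print(f"validity coefficient r = {validity_coefficient:.2f}")
```

A coefficient like this is evidence about one particular inference (these scores predict this outcome in this sample), which fits the view that validity belongs to inferences from scores and not to the test itself.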
Validity appears throughout the cognitive ability literature and across this site's articles, and understanding it is essential for interpreting any IQ score or cognitive subtest result. Psychometric textbooks (such as those by Anne Anastasi or Susan Embretson) treat the concept in considerably more depth and document the empirical findings that justify its prominence in the field.
For online IQ testing, the practical implication is that test-takers should be cautious about over-interpreting brief screener results. Most of the published precision claims for major IQ batteries do not transfer directly to short online instruments, and the relevant adjustments (wider confidence intervals, more conservative band assignments) are best made explicitly rather than ignored.
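To make the "wider confidence intervals" adjustment concrete, here is a minimal sketch using the classical test theory formula SEM = SD × sqrt(1 − reliability). The function names and the two reliability values (0.95 for a long battery, 0.80 for a short screener) are assumptions chosen for illustration, not figures taken from any specific instrument. The interval is centered on the observed score, which is a common simplification.

```python
from math import sqrt
from statistics import NormalDist


def standard_error_of_measurement(sd, reliability):
    """Classical test theory SEM: sd * sqrt(1 - reliability)."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must be between 0 and 1")
    return sd * sqrt(1.0 - reliability)


def score_confidence_interval(score, sd, reliability, level=0.95):
    """Symmetric interval around an observed score, assuming normal error."""
    if not 0.0 < level < 1.0:
        raise ValueError("level must be strictly between 0 and 1")
    # Two-sided critical value, e.g. about 1.96 for a 95% interval.
    z = NormalDist().inv_cdf(0.5 + level / 2.0)
    half_width = z * standard_error_of_measurement(sd, reliability)
    return score - half_width, score + half_width


# Hypothetical reliabilities, for illustration only.
observed_score = 110
battery_ci = score_confidence_interval(observed_score, sd=15, reliability=0.95)
screener_ci = score_confidence_interval(observed_score, sd=15, reliability=0.80)

print(f"long battery:   {battery_ci[0]:.1f} to {battery_ci[1]:.1f}")
print(f"short screener: {screener_ci[0]:.1f} to {screener_ci[1]:.1f}")
```

Under these assumed values the screener's interval is about twice as wide as the battery's, which is the kind of adjustment worth stating explicitly next to a reported score.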
For further reading on this term, consult the related entries in this glossary and the deep-dive articles linked in the Related Reading section. The American Psychological Association's task force report 'Intelligence: Knowns and Unknowns' (1995) and its follow-ups remain the most authoritative summary at an accessible technical level.
Other glossary entries
Crystallized intelligence (Gc)
The breadth and depth of knowledge accumulated through education, reading, and life experience. Measured by vocabulary t…
Floor effect
The phenomenon where test-takers below a certain ability level all score at the minimum possible score, losing the abili…
ICAR (International Cognitive Ability Resource)
A public-domain catalog of validated cognitive ability items, developed by William Revelle and colleagues at Northwester…
Cattell-Horn-Carroll model (CHC)
The dominant contemporary framework for organizing cognitive ability research. Three strata: g at the top, ten broad abi…
Standard error of measurement (SEM)
The expected variability in a measured score across repeated administrations of the same test, due to measurement error …
Raven's Progressive Matrices
A nonverbal IQ test developed by John Raven in 1938, consisting of 60 multiple-choice items of increasing difficulty. Ea…