Machine learning psychometrics
Improved cognitive ability validity from supervised training on item level data
Andrew Cutler, Boston University (USA)
Shane McLoughlin, University of Chester (UK)
Curtis Dunkel, Western Illinois University (USA)
Emil O. W. Kirkegaard, Ulster Institute for Social Research (UK/DK)
Predict or Understand?
Let’s try! First, the data
Dataset | N | Questions | Age | Education | Income | Sex |
ENEM (Brazilian national exam) | 551438 | 185 | n | y | y | y |
Estonia Raven's | 2738 | 60 | y | n | n | y |
NLSY97 | 1109 | 182 | n | y | y | y |
Vietnam Experience Study | 4376 | 202 | y | y | y | n |
American National Election Study | 5790 | 10 | y | y | y | y |
British Cohort Study (1970) | 9433 | 120 | n | n | y | n |
Online Vocab Test | 9278 | 45 | y | y | n | y |
Example: online vocabulary test
Binary vs. categorical item coding
Methods
Meta-analysis approach
All datasets: boxplot results
All datasets: average results
Shorter tests
Method | Test | Outcome | r | n_questions |
g_irt | ENEM | parent_edu | 0.323 | 185 |
Regression | ENEM | parent_edu | 0.323 | 14 |
g_sum | ENEM | g_irt | 0.956 | 185 |
Regression | ENEM | g_irt | 0.958 | 42 |
Conclusion: implications
Thank you
Random Forest
Predict weight from: age, gender, height
Male?
>5’8?
>5’2?
>28 yo?
>40 yo?
Similar Samples
Similar Samples
Similar Samples
Similar Samples
Similar Samples
Similar Samples
Two cultures: understanding and prediction
Scoring of cognitive data
Cognitive tests are used for prediction in practice