Test Development

Nathan Thompson earned his PhD in Psychometrics from the University of Minnesota, with a focus on computerized adaptive testing. His undergraduate degree was from Luther College with a triple major of Mathematics, Psychology, and Latin. He is primarily interested in the use of AI and software automation to augment and replace the work done by psychometricians, which has provided extensive experience in software design and programming. Dr. Thompson has published over 100 journal articles and conference presentations, but his favorite remains https://scholarworks.umass.edu/pare/vol16/iss1/1/ .
three standard errors
PsychometricsTest Development

The Story of the Three Standard Errors

One of my graduate school mentors once said in class that there are three standard errors that everyone in the assessment or I/O Psychology field needs to know: the standard error of the mean, the

digital literacy assessment
EducationTest Development

Digital Literacy Assessment and its Role in Modern Education

Digital literacy assessments are a critical aspect of modern educational and workforce development initiatives, given today’s fast-paced and technology-driven world, where digital literacy is essential in one’s education, occupation, and even in daily life. Defined

students discussing formative summative assessment
PsychometricsTest Development

Summative vs Formative Assessment in Education

Summative and formative assessment are a crucial component of the educational process.  If you work in the educational assessment field or even in educational generally, you have probably encountered these terms.  What do they mean? 

FastTest - Situational Judgment Test SJT example
PsychometricsTest Development

Situational Judgment Tests: Higher Fidelity in Pre-Employment Testing

Situational judgment tests (SJTs) are a type of assessment typically used in a pre-employment context to assess candidates’ soft skills and decision-making abilities. As the name suggests, we are not trying to assess something like

z-score-avatar
Test Development

What is a z-score?

A z-score measures the distance between a raw score and a mean in standard deviation units, conveying the location of an observation in a normal distribution, of which scores on a test are just one

confidence-intervals-avatar
PsychometricsTest Development

Confidence Intervals in Assessment and Psychometrics

Confidence intervals (CIs) are a fundamental concept in statistics, used extensively in assessment and measurement to estimate the reliability and precision of data. Whether in scientific research, business analytics, or health studies, confidence intervals provide

general-intelligence-avatar
PsychometricsTest Development

General Intelligence and Its Role in Assessment and Measurement

General intelligence, often symbolized as “g,” is a concept that has been central to psychology and cognitive science since the early 20th century. First introduced by Charles Spearman, general intelligence represents an individual’s overall cognitive

factor analysis
PsychometricsTest Development

Factor Analysis: Evaluating Dimensionality in Assessment

Factor analysis is a statistical technique widely used in research to understand and evaluate the underlying structure of assessment data. In fields such as education, psychology, and medicine, this approach to unsupervised machine learning helps

Test response function 10 items Angoff
CredentialingPsychometricsTest Development

Setting a Cutscore to Item Response Theory

Setting a cutscore on a test scored with item response theory (IRT) requires some psychometric knowledge.  This post will get you started. How do I set a cutscore with item response theory? There are two

Equation editor item type
EducationTest Development

What are technology enhanced items?

Technology-enhanced items are assessment items (questions) that utilize technology to improve the interaction of a test question in digital assessment, over and above what is possible with paper.  Tech-enhanced items can improve examinee engagement (important

modified-Angoff Beuk compromise
CredentialingPsychometricsTest Development

Modified-Angoff Method Study

A modified-Angoff method study is one of the most common ways to set a defensible cutscore on an exam.  It therefore means that the pass/fail decisions made by the test are more trustworthy than if

test response functions
PsychometricsTest Development

What is Item Response Theory (IRT)?

Item response theory (IRT) is a family of machine learning models in the field of psychometrics, which are used to design, analyze, validate, and score assessments.  It is a very powerful psychometric paradigm that allows

parcc ebsr items
EducationPsychometricsTest Development

Why PARCC EBSR Items Provide Bad Data

The Partnership for Assessment of Readiness for College and Careers (PARCC) is a consortium of US States working together to develop educational assessments aligned with the Common Core State Standards.  This is a daunting task,

test-scaling
CredentialingEducationPsychometricsTest Development

What is Test Scaling?

Scaling is a psychometric term regarding the establishment of a score metric for a test, and it often has two meanings. First, it involves defining the method to operationally scoring the test, establishing an underlying

certification exam development construction
CredentialingPsychometricsTest Development

Certification Exam Development and Delivery

Certification exams are a critical component of workforce development for many professions and play a significant role in the global Testing, Inspection, and Certification (TIC) market, which was valued at approximately $359.35 billion in 2022

three-parameter-irt-model
Test Development

Innovation in Assessment: Learning from Other Industries

One of my favorite quotes is from Mark Twain: “There is no such thing as a new idea. It is impossible. We simply take a lot of old ideas and put them into a sort

T scores
Test Development

What is a T score?

A T Score is a conversion of scores on a test to a standardized scale with a mean of 50 and standard deviation of 10.  This is a common example of a scaled score in

Spearman-Brown
PsychometricsTest Development

What is the Spearman-Brown formula?

The Spearman-Brown formula, also known as the Spearman-Brown Prophecy Formula or Correction, is a method used in evaluating test reliability.  It is based on the idea that split-half reliability has better assumptions than coefficient alpha

Next