Test Development

Nathan Thompson earned his PhD in Psychometrics from the University of Minnesota, with a focus on computerized adaptive testing. His undergraduate degree was from Luther College with a triple major of Mathematics, Psychology, and Latin. He is primarily interested in the use of AI and software automation to augment and replace the work done by psychometricians, which has provided extensive experience in software design and programming. Dr. Thompson has published over 100 journal articles and conference presentations, but his favorite remains https://scholarworks.umass.edu/pare/vol16/iss1/1/ .
parcc ebsr items
EducationPsychometricsTest Development

Why PARCC EBSR Items Provide Bad Data

The Partnership for Assessment of Readiness for College and Careers (PARCC) is a consortium of US States working together to develop educational assessments aligned with the Common Core State Standards.  This is a daunting task,

test-scaling
CredentialingEducationPsychometricsTest Development

What is Test Scaling?

Scaling is a psychometric term regarding the establishment of a score metric for a test, and it often has two meanings. First, it involves defining the method to operationally scoring the test, establishing an underlying

Test development unicycle
CredentialingPsychometricsTest Development

Intro to the Exam & Test Development Process

The exam and test development cycle the process of building a high-quality assessment in accordance to international standards like NCCA or AERA/APA/NCME, so that it has validity. If there are stakes associated with the test,

three-parameter-irt-model
Test Development

Innovation in Assessment: Learning from Other Industries

One of my favorite quotes is from Mark Twain: “There is no such thing as a new idea. It is impossible. We simply take a lot of old ideas and put them into a sort

T scores
Test Development

What is a T score in Assessment?

A T Score is a conversion of scores on a test to a standardized scale with a mean of 50 and standard deviation of 10.  This is a common example of a scaled score in

Spearman-Brown
PsychometricsTest Development

What is the Spearman-Brown formula?

The Spearman-Brown formula, also known as the Spearman-Brown Prophecy Formula or Correction, is a method used in evaluating test reliability.  It is based on the idea that split-half reliability has better assumptions than coefficient alpha

item-writing-tips
PsychometricsTest Development

Item Writing: Tips for Authoring Better Test Questions

Item writing (aka item authoring) is a science as well as an art, and if you have done it, you know just how challenging it can be!  You are experts at what you do, and

woman-taking-test
Test Development

Item Review Workflow for Exam Development

Item review is the process of ensuring that newly-written test questions go through a rigorous peer review, to ensure that they are high quality and meet industry standards. What is an item review workflow? Developing

job-task-analysis
PsychometricsTest Development

Job Task Analysis Study: Essential for Certification

Job Task Analysis (JTA) is a formal process to define what is being done in a profession, especially what is most important or frequent, and then using this information to make data-driven decisions. It is

laptop screen demonstrating automated test assembly
Test Development

Question and Test Interoperability (QTI)

Question and Test Interoperability® (QTI®) is a set of standards around the format of import/export files for test questions in educational assessment and HR/credentialing exams.  This facilitates the movement of questions from one software platform

Iteman45-quantile-plot
PsychometricsTest Development

Classical Test Theory: Item Statistics

Classical Test Theory (CTT) is a psychometric approach to analyzing, improving, scoring, and validating assessments.  It is based on relatively simple concepts, such as averages, proportions, and correlations.  One of the most frequently used aspects

Classical Test Theory vs. Item Response Theory
PsychometricsTest Development

Classical Test Theory vs. Item Response Theory

Classical Test Theory and Item Response Theory (CTT & IRT) are the two primary psychometric paradigms.  That is, they are mathematical approaches to how tests are analyzed and scored.  They differ quite substantially in substance

Coefficient cronbachs alhpa interpretation
PsychometricsTest Development

Coefficient Alpha Reliability (Cronbach’s alpha): A Quick Index of Score Consistency

Coefficient alpha reliability, sometimes called Cronbach’s alpha, is a statistical index that is used to evaluate the internal consistency or reliability of an assessment. That is, it quantifies how consistent we can expect scores to

Juggling-statistics
PsychometricsTest Development

“Dichotomous” Vs “Polytomous” in IRT?

What is the difference between the terms dichotomous and polytomous in psychometrics?  Well, these terms represent two subcategories within item response theory (IRT) which is the dominant psychometric paradigm for constructing, scoring and analyzing assessments.

group working on meta-analysis
Test Development

Meta-analysis in Assessment

Meta-analysis is a research process of collating data from multiple independent but similar scientific studies in order to identify common trends and findings by means of statistical methods. To put it simply, it is a

test-validation
Test Development

Test validation: How to determine if a test score is supported?

Test validation is the process of verifying whether the specific requirements to test development stages are fulfilled or not, based on solid evidence. In particular, test validation is an ongoing process of developing an argument

Psychometric software
EducationPsychometricsTest Development

Automated Item Generation: Making Test Development More Efficient

Automated item generation (AIG) is a paradigm for developing assessment items (test questions), utilizing principles of artificial intelligence and automation. As the name suggests, it tries to automate some or all of the effort involved

borderline method educational assessment
Test Development

Borderline group method standard setting

The borderline group method of standard setting is one of the most common approaches to establishing a cutscore for an exam.  In comparison with the item-centered standard setting methods such as modified-Angoff, Nedelsky, and Ebel,