assessment systems logo

CITAS: Classical Item and Test Analysis Spreadsheet

CITAS is a simple spreadsheet for non-psychometricians to evaluate the quality of assessments with classical test theory.

CITAS is an easy-to-use tool for implementing classical test theory on small data sets, designed to provide a straightforward and no-cost way for non-psychometricians to evaluate the quality of assessments. If you are using assessments but not evaluating their statistics, you have no way of knowing whether your test is reliable.

CITAS is intended for anyone that is new to psychometrics and wants a simple tool to help learn the analytics. It’s limited to 100 examinees and 100 items.

Download CITAS

CITAS is available for free. Just complete this form and we will send you a link to download.  Note that because it is free, we do not provide any support.

Need more power?  For professional-level analytics or larger data sets, check out Iteman.  Iteman also comes with a free version for up to 100 examinees.

Request Download of CITAS

First name*

Last name*

You must enter a valid email to receive the download link.

Name of your company, university, or other organization.

citas screenshot

What can CITAS do?

Simply type or paste your data into an Excel spreadsheet and the statistics populate in real time – no coding or “running” needed. But while simple to use, it still provides essential output with industry-standard Classical Test Theory for small samples.

  • Mean: The average score
  • Standard deviation: An index of the variation in scores
  • Reliability: An index of test quality on a scale of 0 to 1, using coefficient alpha (aka KR20)
  • Standard Error of Measurement (SEM): An index of score error that can be used to create confidence intervals, with a classical test theory approach
  • Item P values: An item difficulty statistic, calculated as the proportion of examinees that responded correctly
  • Item point-biserials: An item discrimination statistic, calculated as the correlation between item score and total score; weak items are <0.20 and bad items are <0.0
  • Distractor analysis: Frequencies of each item response, do determine if there are confusing options.

More information on the psychometric terms can be found by reading the CITAS Manual.