Return to search

A Comparison of Three Item Selection Methods in Criterion-Referenced Tests

This study compared three methods of selecting the best discriminating test items and the resultant test reliability of mastery/nonmastery classifications. These three methods were (a) the agreement approach, (b) the phi coefficient approach, and (c) the random selection approach.
Test responses from 1,836 students on a 50-item physical science test were used, from which 90 distinct data sets were generated for analysis. These 90 data sets contained 10 replications of the combination of three different sample sizes (75, 150, and 300) and three different numbers of test items (15, 25, and 35).
The results of this study indicated that the agreement approach was an appropriate method to be used for selecting criterion-referenced test items at the classroom level, while the phi coefficient approach was an appropriate method to be used at the district and/or state levels. The random selection method did not have similar characteristics in selecting test items and produced the lowest reliabilities, when compared with the agreement and the phi coefficient approaches.

Identiferoai:union.ndltd.org:unt.edu/info:ark/67531/metadc332327
Date08 1900
CreatorsLin, Hui-Fen
ContributorsYoung, Jon I., Yang, Chao-Chih, Wilson, Michael J., Beyerlein, Michael Martin, Moore, Alan D.
PublisherUniversity of North Texas
Source SetsUniversity of North Texas
LanguageEnglish
Detected LanguageEnglish
TypeThesis or Dissertation
Formatviii, 135 leaves: ill., Text
RightsPublic, Lin, Hui-Fen, Copyright, Copyright is held by the author, unless otherwise noted. All rights reserved.

Page generated in 0.0017 seconds