This study compared three methods of selecting the best discriminating test items and the resultant test reliability of mastery/nonmastery classifications. These three methods were (a) the agreement approach, (b) the phi coefficient approach, and (c) the random selection approach.
Test responses from 1,836 students on a 50-item physical science test were used, from which 90 distinct data sets were generated for analysis. These 90 data sets contained 10 replications of the combination of three different sample sizes (75, 150, and 300) and three different numbers of test items (15, 25, and 35).
The results of this study indicated that the agreement approach was an appropriate method to be used for selecting criterion-referenced test items at the classroom level, while the phi coefficient approach was an appropriate method to be used at the district and/or state levels. The random selection method did not have similar characteristics in selecting test items and produced the lowest reliabilities, when compared with the agreement and the phi coefficient approaches.
Identifer | oai:union.ndltd.org:unt.edu/info:ark/67531/metadc332327 |
Date | 08 1900 |
Creators | Lin, Hui-Fen |
Contributors | Young, Jon I., Yang, Chao-Chih, Wilson, Michael J., Beyerlein, Michael Martin, Moore, Alan D. |
Publisher | University of North Texas |
Source Sets | University of North Texas |
Language | English |
Detected Language | English |
Type | Thesis or Dissertation |
Format | viii, 135 leaves: ill., Text |
Rights | Public, Lin, Hui-Fen, Copyright, Copyright is held by the author, unless otherwise noted. All rights reserved. |
Page generated in 0.0017 seconds