Global ETD Search

1	序貫方法於電腦化效標參照測驗之應用 / Sequential Methods in Computerized Criterion-referenced Test 李佳紋, Lee, Chia-Wen Unknown Date (has links) 在一場競爭性的考試中，我們如何決定要錄取或是淘汰這個考生？傳統的紙筆測驗方式固定題目總數，考生回答相同的題目，60分以上為及格。隨著電腦科技的快速發展，測驗型式也由紙筆轉換成電腦操作，也就是電腦化測驗。所謂電腦化效標參照測驗（computerized criterion-referenced test）即是把考生能力分成兩個以上的程度區間，藉由考生的答題狀況來判斷考生應歸屬於哪個區間。這種測驗方式與傳統測驗不同的是：電腦化測驗是依據考生的答題表現來給題，考生能力越偏離分段點（thresholds），需要的題數就越少；越接近分段點，需要的題數就越多。在這篇論文中，我們運用兩個參數的羅吉斯模型（two-parameter logistic model）來估計考生之於試題的答對機率。藉由電腦模擬來探討結合貝它保護（beta-protection）方法和適性測驗對平均測驗題數及誤判率（亦即考生真正的能力與電腦判斷的區間不同）的影響。在模擬過程中，我們也介紹了試題參數的選擇情形，估計考生能力的方法以及在貝它保護下，停止選題的規則。根據這些原則，電腦模擬結果證明使用適性測驗加上貝它保護方法能夠有效地控制誤判率在規定的範圍內，程度不同的考生也能控制有不同的測驗題數。 / In a traditional Paper-and-Pencil (p-and-p) test, all examinees have same test items and the number of items is fixed. The examinee fails or passes the exam depends on if his/her test score exceeds a predetermined scores, say, 60 out of 100. However, with the rapid advancement of modern computer technology, the test form has been converted from p-and-p to computer terminal. Computerized criterion-referenced classify the examinees into more than two categories according to his/her answers to the items. It differs from the conventional standardized test in that the selection of test items is tailored to each examinee’s ability level. Typically, those examinees with high ability or low ability will have shorter average test length (ATL) than examinees with ability that close to thresholds. In this thesis, we assume that the probability of choosing correct response to an item follows a two-parameter logistic (2-PL) model. Our goal is to study the performance of ATL and misclassification rate (MR) using beta-protection method and adaptive sequential item selection. On the simulation procedures, we also introduce the selection rule of item parameter, the methods used to estimate an examinee’s ability, and the stopping rule with beta-protection. Simulation results show that using adaptive test and beta-protection method can control the MR within specified level and the number of test items required depends on the examinee’s ability. 電腦化效標參照測驗試題反應理論貝它保護 Computerized Criterion-referenced Test Item Response Theory beta-protection
2	混合試題與受試者模型於試題差異功能分析之研究 / A Mixture Items-and-Examinees Model Analysis on Differential Item Functioning 黃馨瑩, Huang, Hsin Ying Unknown Date (has links) 依據「多層次混合試題反應理論」與「隨機試題混合模型」，本研究提出「混合試題與受試者模型」。本研究旨在評估此模型在不同樣本數、不同試題差異功能的試題數下，偵測試題差異功能的表現，以及其參數回復性情形。研究結果顯示，「混合試題與受試者模型」在樣本數大、試題差異功能試題數較多之情境下，具有正確的參數回復性，能正確判斷出試題是否存在試題差異功能，且具有良好的難度估計值，並能將樣本正確地分群，其也與「隨機試題混合模型」的估計表現頗為相近。建議未來可將「混合試題與受試者模型」應用於大型教育資料庫相關研究上，並加入其他變項後進一步探討。 / Drawing upon the framework of the multilevel mixture item response theory model and the random item mixture model, the study attempts to propose one model, called the mixture items and examinees model(MIE model). The purpose of this study was to assess the respective performances of the model on different sample-sizes and differential item functioning (DIF) items. Particularly, the study assessed the model performances in the detection of DIF items, and the accurate parameters recovery. The results of the study revealed that with large sample-sizes and more DIF items, the MIE model had the good parameters recovery, the accurate detection of the DIF items, the good estimate of the item difficulty, and the accurate classifications of the sub-samples. These model performances appeared similar to those of the random item mixture model. The findings suggest that future studies should apply the MIE model to the analyses on large-scale education databases, and should add more variables to the MIE model. 混合試題反應理論隨機試題試題差異功能 mixture item response theory random item differential item functioning
3	線上題庫與適性測驗證合系統之發展研究 / A reserach in the development of an integrated on-line item bank and computerized adaptive testing system 陳新豐 Unknown Date (has links) 論文名稱：線上題庫與適性測驗整合系統之發展研究頁數：337 校所系別：國立政治大學教育學系畢業時間及摘要別：九十學年度第二學期博士論文摘要指導教授：林邦傑博士、余民寧博士研究生：陳新豐論文摘要內容本研究係結合工具研發、理論驗證與效能評估的研究，旨在開發一個建構在全球資訊網的「線上題庫與適性測驗整合系統」，以提供教師在網際網路環境下的輔助教學評量系統，除了可動態新增題庫之外，並能針對學生提供童身訂做的適性測驗。因此，研究的兩個核心主軸為「線上題庫與適性測驗整合系統」之開發與「線上題庫建置」的理論驗證。依循這兩個核心主軸，本研究的研究目的有三：（一）開發線上題庫與適性測驗整合系統。（二）驗證建置題庫的相關理論。（三）評估整合系統運作效能與使用者滿意程度。為達成這三個研究目的，研究者採用結構分析中，Sehlly、Cashmen和Rosenblatt (2001)所提出的「系統開發生命週期」，將「線上題庫與適性測驗整合系統」開發過程分為「系統規劃」、「系統分析」、「系統設計」、「系統建置」、「系統運行與支援」等五個階段逐步開發。研究樣本方面，第一次預試選用台南市崇明國中三年級學生115人，第二次預試選用台南市建興國中三年級學生191人，正式施測樣本則是台灣地區北、中、南、東、離島等共計九校2567位國中三年級學生為研究對象。此外，在需求調查報告部分，共調查十五位專家對開發系統的意見。就研究工具來說，本研究主要研究工具為「線上題庫與適性測驗之整合系統」，另外，「功能需求調查問卷」、「硬體設備」、「軟體工具」、「系統評估量表」也是本研究的研究工具。就資料處理來說，本研究運用ITEMAN、BILOG、MatLab和SPSS套裝軟體進行資料處理，所採用的統計方法包含古典測驗理論與試題反應理論等理論來分析，計有試題分析、IRT三參數估計、因素結構分析等。研究結果部分，本研究得到如下結論：（一）線上題庫與通性測驗兩個系統可整合為一。（二）線上題庫與遍性測驗之整合系統具有多項功能。（三）結構化分析中之系統開發生命週期是開發整合系統的理想方法。（四）題庫等化轉換常數方法以Mean/Mean和Haebara等方法較佳。（五）線上測驗與紙筆測驗的試題訊息量相近，但難度偏高。（六）線上測驗連結效益良好。（七）本整合系統運作效能良好。（八）使用者對整合系統之功能感到滿意。根據研究結論，本研究針對工具研發、題庫建置、效能評估等提出具體建議。關鍵字：試題反應理論、題庫、等化、電腦化適性測驗、系統開發生命週期 / A Reserach In The Development of An Integrated On-Line Item Bank and Computerized Adaptive Testing System Abstract This research is to develop an integrated internet system of on-line item bank and computerized adaptive testing (the "System"), which is comprised of the teaching tool development, theory verification, and efficiency evaluation. Except for the addition of new item bank dynamically, the System, an auxiliary teaching evaluation system for teachers, can also provide customarily made adaptive testing for students. Therefore, to develop an integrated on-line item bank and computerized adaptive testing system and to verify the theory of on-line item bank development constitute the two core spindles of this research. Following the aforementioned research spindles, the main purposes of this research are going to: (A). Develop an integrated on-line item bank and computerized adaptive testing system. (B). Verify the related theories concerning the development of on-line item bank. (C). Evaluate the operating efficiency of such System and the degree of users' satisfaction. The "systems development life cycle" (Sehily, Cashmen Rosenblatt, 2001), a structured analysis method, is adopted to conduct the research. The development process of an integrated on-line item bank and computerized adaptive testing system is divided into 5 separate and successive stages, starting from system planning, system analysis, system design, system development, to system operation and support. In terms of research sampling, the selected samples in the first preliminary testing are 115 ninth-grade students of Chiung-Ming High School in Tainan City, Taiwan. The selected samples in the second preliminary testing are 191 ninth-grade students of Cheng-Sing High School in Tainan City, Taiwan. The third and official sampling is 2,567 ninth-grade students who were selected from a total of 9 high schools ranging from Northern, Central, Southern, and Eastern Taiwan, and islands adjacent to Taiwan. Furthermore, in the demand side, an investigation has been conducted to consult with 15 teaching experts for their professional opinions in regard to such System development. As far as research tools are concerned, except for the main research tool - the integrated on-line item bank and computerized adaptive testing system, other research tools employed consist of functional demand questionnaires, hardware equipments, software tools, and scales for system evaluation. In the aspect of data processing, ITEMAN, BILOQ MatLab, and SPSSapplication softwares are used to perform the data processing. The statistical method,like classical true score theory and item response theory and etc., is applied to conduct the following analyses: item analysis, IRT three-parameter estimate, structured analysis of elements, and etc. The results of this research lead to the following conclusion: 1. The on-line item bank: system and the adaptive testing system, the two separate systems, can be integrated into one system. 2. The integrated on-line item bank and adaptive testing system can play multiple functions. 3. "Systems development life cycle" in the structured analysis is an ideal manner to develop an integrated system. 4. "Mean/Mean", "Haebara" and, etc. are the better methods to perform the item bank equating and constant conversion. 5. For users, the information volume provided by on-line testing and traditional written testing are quite similar, but the degree of difficulty of on-line testing is higher than that of traditional written testing. 6. The linking effect of on-line testing is fair. 7. The operating efficiency of the integrated system is fair. 8. Users are satisfied with the functions of the integrated system. Based on research conclusions drawn thereon, suggestions for tool development, item bank development, and efficiency evaluation are also provided. Keywords: item response theory, item bank, equating, computerized adaptive testing,systems development life cycle 試題反應理論題庫等化電腦化適性測驗系統開發生命週期 Item response theory Item bank Equating Computerized adaptive testing Systems development life cycle
4	廣義估計方程式在題組式測驗的應用 / Generalized estimation equation in Testlet-based educational testing 李介中, Lee, Chieh Chung Unknown Date (has links) 在測驗含有題組(testlet)結構時，由於違反了試題反應理論(Item Response Theory, IRT)中局部獨立性的假設，使得IRT的估計方法產生偏誤，過去研究的解決方式為在IRT模型中多加入一個參數，將題組的影響力納入模型中，此即為題組反應理論(Testlet Response Theory, TRT)，在貝氏(Bayesian)的架構下，此方法的計算則可透過SCORIGHT軟體來達成。本研究旨在透過另一種方法，即廣義方程式(Generalized Estimation Equation, GEE)去處理測驗中的題組效果。GEE過去常被使用於分析縱貫式(longitudinal)的資料，本研究使用此方法來捕捉題組測驗下作答結果的相關性，並經重新參數化調整係數後使其能對受試者能力值進行估計。電腦模擬的結果顯示GEE能有效的處理題組效果帶來的影響。在GEE和貝氏題組模型的比較上，GEE對於程度好和程度差的受試者有較佳的估計效果；而貝氏題組模型則對於程度中等的受試者表現較好，此外我們也針對GEE的估計效率進行了實驗，結果顯示先將受試者依能力分組再進行GEE估計能提升GEE的估計效率。在文章中，我們也展示了使用GEE計算題組訊息量的方式，做為題組式測驗下評估該測驗對於各能力區間的受試者在估計準確度上的參考。 / If the tests have testlet structure, the bias may arise when using traditional Item Response Theory(IRT) estimation methods due to the violations to the assumption of local independence. To deal with the testlet effect, previous studies introduced a new parameter to the classical IRT model which called Testlet Response Theory(TRT). Under the Bayesian framework, the estimation can be accomplished on the SCORIGHT program. The purpose of this paper is to use another method named Generalized Estimation Equation(GEE) to model testlet response data. GEE was commonly used to analyze the longitudinal data. We use this method to capture the information from the correlated items and estimated ability of the examinees through re-parametrization. Simulation results indicate that GEE can deal with the testlet effect effectively. On the comparison between GEE and Bayesian testlet model, GEE does better on estimation of the examinees who have high or low ability level. In contrast, Bayesian testlet model does better on estimation of medium ability level. In addition, we design the experiment to test the efficiency of GEE. The results show that group the examinees according to their ability before doing the GEE estimation can improve the efficiency of GEE. In this paper, we also demonstrate the method to calculate testlet information using GEE which can be taken as reference for assessing estimation accuracy of each ability level in testlet-based testing. 試題反應理論試題訊息量題組反應理論題組式測驗廣義估計方程式 SCORIGHT
5	自變數有測量誤差的羅吉斯迴歸模型之序貫設計探討及其在教育測驗上的應用 / Sequential Designs with Measurement Errors in Logistic Models with Applications to Educational Testing 盧宏益, Lu, Hung-Yi Unknown Date (has links) 本論文探討當自變數存在測量誤差時，羅吉斯迴歸模型的估計問題，並將此結果應用在電腦化適性測驗中的線上校準問題。在變動長度電腦化測驗的假設下，我們證明了估計量的強收斂性。試題反應理論被廣泛地使用在電腦化適性測驗上，其假設受試者在試題的表現情形與本身的能力，可以透過試題特徵曲線加以詮釋，羅吉斯迴歸模式是最常見的試題反應模式。藉由適性測驗的施行，考題的選取可以依據不同受試者，選擇最適合的題目。因此，相較於傳統測驗而言，在適性測驗中，題目的消耗量更為快速。在題庫的維護與管理上，新試題的補充與試題校準便為非常重要的工作。線上試題校準意指在線上測驗進行中，同時進行試題校準。因此，受試者的能力估計會存在測量誤差。從統計的觀點，線上校準面臨的困難，可以解釋為在非線性模型下，當自變數有測量誤差時的實驗設計問題。我們利用序貫設計降低測量誤差，得到更精確的估計，相較於傳統的試題校準，可以節省更多的時間及成本。我們利用處理測量誤差的技巧，進一步應用序貫設計的方法，處理在線上校準中，受試者能力存在測量誤差的問題。 / In this dissertation, we focus on the estimate in logistic regression models when the independent variables are subject to some measurement errors. The problem of this dissertation is motivated by online calibration in Computerized Adaptive Testing (CAT). We apply the measurement error model techniques and adaptive sequential design methodology to the online calibration problem of CAT. We prove that the estimates of item parameters are strongly consistent under the variable length CAT setup. In an adaptive testing scheme, examinees are presented with different sets of items chosen from a pre-calibrated item pool. Thus the speed of attrition in items will be very fast, and replenishing of item pool is essential for CAT. The online calibration scheme in CAT refers to estimating the item parameters of new, un-calibrated items by presenting them to examinees during the course of their ability testing together with previously calibrated items. Therefore, the estimated latent trait levels of examinees are used as the design points for estimating the parameter of the new items, and naturally these designs, the estimated latent trait levels, are subject to some estimating errors. Thus the problem of the online calibration under CAT setup can be formulated as a sequential estimation problem with measurement errors in the independent variables, which are also chosen sequentially. Item Response Theory (IRT) is the most commonly used psychometric model in CAT, and the logistic type models are the most popular models used in IRT based tests. That's why the nonlinear design problem and the nonlinear measurement error models are involved. Sequential design procedures proposed here can provide more accurate estimates of parameters, and are more efficient in terms of sample size (number of examinees used in calibration). In traditional calibration process in paper-and-pencil tests, we usually have to pay for the examinees joining the pre-test calibration process. In online calibration, there will be less cost, since we are able to assign new items to the examinees during the operational test. Therefore, the proposed procedures will be cost-effective as well as time-effective. 電腦化適性測驗線上校準測量誤差序貫設計變動長度試題反應理論試題校準 Item Response Theory Computerized Adaptive Testing online calibration measurement error sequential design sequential estimation stopping time variable length item calibration

1

Page generated in 0.0236 seconds