191 |
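Teoria de resposta ao item : aplicação do modelo Rasch em desenvolvimento e validação de instrumentos em saúde mental / Item response theory: application of the Rasch model in the development and validation of mental health instruments
Chachamovich, Eduardo January 2007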
Background: Item Response Theory represents a novel group of psychometric models for constructing, validating and evaluating health outcome instruments. Among these models, the Rasch Model stands out because it can test the Theory of Conjoint Measurement and thus develop scales with an interval structure. Indeed, the robust statistical strategies applied in recent years require variables with interval properties in order to yield reliable results. In the mental health field, the Rasch Model adds the possibility of testing constructs for dimensionality. Instruments may be explored with regard to item invariance, targeting, adequacy of the response scales and item fit to the underlying theoretical model. Objectives: To explore the application of the Rasch Model to the development of quality of life and attitudes to aging instruments; to apply the Rasch Model in the cultural validation of a geriatric depression scale; and to test the suitability of a 5-point Likert response scale according to educational level. Method: International data on quality of life, attitudes to aging and depression were analyzed in these studies. Data were drawn from the WHOQOL-OLD and AAQ Project, coordinated by the World Health Organization in 20 countries. A combination of Classical Test Theory and the Rasch Model was used for data analysis. The Rasch Model was implemented with the RUMM2020 software. Results: Applying the Rasch Model allowed the original instruments to be refined, yielding more adequate versions. In the Portuguese version of the WHOQOL-OLD instrument, two of the six domains showed insufficient item-total interaction. Rescoring the response scale and deleting poorly performing items improved the instrument. The Attitudes to Aging Questionnaire (Portuguese version) also required item deletion and the collapsing of response categories to reach adequate psychometric performance. Regarding educational level, the Rasch analysis of WHOQOL-BREF data from an international sample of older adults indicated that a 5-point Likert scale is not suitable for illiterate subjects, in contrast to the findings for university-educated older adults. The Geriatric Depression Scale (GDS) was also analyzed in a Brazilian sample. Its original 15-item format proved inadequate, and item exclusion is crucial for the instrument to perform satisfactorily. Finally, the impact of clinical and subclinical depressive symptoms on the quality of life of older adults was assessed. Even subclinical levels of depression were significantly associated with lower quality of life and poorer attitudes to aging in a sample of 5566 older adults. Conclusion: Applying the Rasch Model to the development and validation of mental health instruments proved highly relevant. The model makes it possible to examine scales in depth. Assessing each item individually makes it possible to determine item fit and thus to refine the instruments according to the modifications suggested by the Rasch analyses. Finally, the Rasch Model allows item invariance to be tested, which is particularly important when instruments are used in cross-cultural mental health research.
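As a rough illustration of why a fitting Rasch scale is described as having an interval structure, the following minimal sketch evaluates the dichotomous Rasch model. The function names and the ability and difficulty values are hypothetical, chosen only for illustration; the thesis itself used RUMM2020 and polytomous response scales, which this sketch does not reproduce.

```python
import math


def rasch_probability(theta: float, b: float) -> float:
    """Probability of endorsing a dichotomous item under the Rasch model.

    theta is the person location and b the item location, both in logits.
    """
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def log_odds(p: float) -> float:
    """Convert a probability to log-odds (logits)."""
    return math.log(p / (1.0 - p))


# Hypothetical person and item locations, for illustration only.
abilities = [-1.0, 0.0, 1.0]
difficulties = [-0.5, 0.5]

for b in difficulties:
    probabilities = [round(rasch_probability(t, b), 3) for t in abilities]
    print(f"item location {b:+.1f}: {probabilities}")
```

Under this model the log-odds gap between two persons equals the difference of their locations whichever item is used, which is one way to read the interval claim in the abstract.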
|
192 |
The application and interpretation of the two-parameter item response model in the context of replicated preference testing
Button, Zach January 1900
Master of Science / Statistics / Suzanne Dubnicka / Preference testing is a popular method of determining consumer preferences for a variety of products in areas such as sensory analysis, animal welfare, and pharmacology. However, many prominent models for this type of data do not allow the probability of preferring one product over the other to differ between individual consumers. Such consumer-to-consumer variation, called overdispersion, intuitively exists in real-world situations. We investigate the Two-Parameter variation of the Item Response Model (IRM) in the context of replicated preference testing. Because the IRM is most commonly applied to multiple-choice testing, our primary focus is the interpretation of the model parameters with respect to preference testing and the evaluation of the model's usefulness in this context. We fit a Bayesian version of the Two-Parameter Probit IRM (2PP) to two real-world datasets, Raisin Bran and Cola, as well as five hypothetical datasets constructed with specific parameter properties in mind. The values of the parameters are sampled via the Gibbs Sampler and examined using various plots of the posterior distributions. Next, several different models and prior distribution specifications are compared over the Raisin Bran and Cola datasets using the Deviance Information Criterion (DIC). The Two-Parameter IRM is a useful tool in the context of replicated preference testing, due to its ability to accommodate overdispersion, its intuitive interpretation, and its flexibility in terms of parameterization, link function, and prior specification. However, we find that this model brings computational difficulties in certain situations, some of which require creative solutions. Although the IRM can be interpreted for replicated preference testing scenarios, these data typically contain few replications, while the model was designed for exams with many items.
We conclude that the IRM may provide little evidence for marketing decisions, and that it is better suited to exploring the nature of consumer preferences early in product development.
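To make the overdispersion point concrete, here is a minimal sketch of a two-parameter probit response function. The parameterization `Phi(a * theta - b)`, the function names and the numeric values are assumptions made for illustration; the abstract does not state which parameterization or priors the thesis used, and no Gibbs sampling is attempted here.

```python
import math


def normal_cdf(x: float) -> float:
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def two_pp_probability(theta: float, a: float, b: float) -> float:
    """Probability that a consumer prefers product A in one replication.

    theta: consumer-specific latent preference (assumed meaning).
    a, b: discrimination and location of the replication (assumed meaning).
    """
    return normal_cdf(a * theta - b)


# Hypothetical consumers with different latent preferences.
consumers = {"consumer_1": -1.0, "consumer_2": 0.0, "consumer_3": 1.5}

for name, theta in consumers.items():
    p = two_pp_probability(theta, a=1.2, b=0.0)
    print(f"{name}: P(prefers A) = {p:.3f}")
```

Because each consumer has their own `theta`, the preference probability varies across consumers, which is the kind of heterogeneity that a single common binomial probability cannot represent.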
|
193 |
The Accuracy of Accuracy Estimates for Single Form Dichotomous Classification Exams
January 2013
abstract: The use of exams for classification purposes has become prevalent across many fields including professional assessment for employment screening and standards based testing in educational settings. Classification exams assign individuals to performance groups based on the comparison of their observed test scores to a pre-selected criterion (e.g. masters vs. nonmasters in dichotomous classification scenarios). The successful use of exams for classification purposes assumes at least minimal levels of accuracy of these classifications. Classification accuracy is an index that reflects the rate of correct classification of individuals into the same category which contains their true ability score. Traditional methods estimate classification accuracy via methods which assume that true scores follow a four-parameter beta-binomial distribution. Recent research suggests that Item Response Theory may be a preferable alternative framework for estimating examinees' true scores and may return more accurate classifications based on these scores. Researchers hypothesized that test length, the location of the cut score, the distribution of items, and the distribution of examinee ability would impact the recovery of accurate estimates of classification accuracy. The current simulation study manipulated these factors to assess their potential influence on classification accuracy. Observed classification as masters vs. nonmasters, true classification accuracy, estimated classification accuracy, BIAS, and RMSE were analyzed. In addition, Analysis of Variance tests were conducted to determine whether an interrelationship existed between levels of the four manipulated factors. Results showed small values of estimated classification accuracy and increased BIAS in accuracy estimates with few items, mismatched distributions of item difficulty and examinee ability, and extreme cut scores. A significant four-way interaction between manipulated variables was observed. 
In addition to interpretations of these findings and explanation of potential causes for the recovered values, recommendations that inform practice and avenues of future research are provided. / Dissertation/Thesis / M.A. Educational Psychology 2013
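The quantities named in this abstract can be sketched as follows, assuming the usual definitions: classification accuracy as the proportion of examinees whose observed and true classifications agree, BIAS as the mean error of the accuracy estimates, and RMSE as the root of their mean squared error. The function names and numbers are hypothetical and do not come from the study.

```python
import math


def classification_accuracy(true_scores, observed_scores, cut_score):
    """Proportion of examinees placed in the same category (master vs.
    nonmaster) by their observed score as by their true score."""
    agreements = sum(
        (t >= cut_score) == (o >= cut_score)
        for t, o in zip(true_scores, observed_scores)
    )
    return agreements / len(true_scores)


def bias(estimates, true_value):
    """Mean signed error of the estimates across replications."""
    return sum(e - true_value for e in estimates) / len(estimates)


def rmse(estimates, true_value):
    """Root mean squared error of the estimates across replications."""
    return math.sqrt(
        sum((e - true_value) ** 2 for e in estimates) / len(estimates)
    )


# Hypothetical scores for four examinees and a cut score of 25.
true_scores = [10, 20, 30, 40]
observed_scores = [12, 26, 29, 41]
print(classification_accuracy(true_scores, observed_scores, cut_score=25))

# Hypothetical accuracy estimates from three simulation replications.
estimates = [0.80, 0.90, 1.00]
print(bias(estimates, true_value=0.90), rmse(estimates, true_value=0.90))
```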
|
195 |
Teoria da resposta do item : um estudo inicial dos dados GERES Campinas / Item response theory: an initial study of the GERES Campinas data
Stevão, Christiane Bellório Gennari de Andrade, 1965- 15 February 2008
Advisors: Luiz Carlos de Freitas, Dalton Francisco de Andrade / Dissertation (master's) - Universidade Estadual de Campinas, Faculdade de Educação
Abstract: This work presents the results obtained by applying Item Response Theory to the data from the GERES Campinas Project, which uses all the items of a test as anchors to build the proficiency scale, and compares these results with the classical way of constructing scales, which uses specific criteria for accepting an item as an anchor item. To this end, we worked with data from the GERES survey, a longitudinal study of 1st- to 4th-grade elementary school students in the city of Campinas, begun in March 2005 and covering the three school systems (state, municipal and private). We present the students' performance on the mathematics test in the first three applications and show that the two techniques diverge. / Master's / Teaching, Evaluation and Teacher Training / Master in Education
|
196 |
Um estudo da GQT por meio do Modelo de Resposta Gradual da Teoria da Resposta ao Item / Item Response Theory, Total Quality Management; Graded Response Model
Kátia Michelle Matos de Oliveira 16 September 2010
Fundação de Amparo à Pesquisa do Estado do Ceará / Aiming to evaluate the degree of maturity in the use of Total Quality Management (TQM) tools and techniques, and its evolution over the past 10 years, in medium and large manufacturing and civil construction companies in the State of Ceará, this study used the Graded Response Model of Item Response Theory (IRT). This model was designed to analyze graded responses that can be ordered. The work of Alexandre (1999) was used as the basis of comparison to check for a possible evolution in maturity. Data were collected through a quantitative survey, using a structured questionnaire on a Likert scale with five response categories. The study used the Graded Response Model of IRT to assess maturity by creating a standardized measurement scale, which made it possible to compare the results and to verify that the maturity of the companies evolved very little, especially considering the importance of quality as a competitive differentiator for keeping companies in the globalized market. The constructed scale may help companies identify practices not yet implemented that could improve product quality, and monitor the development of their maturity level.
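For readers unfamiliar with the Graded Response Model, the sketch below computes category probabilities for one five-category Likert item using the standard logistic form of the model. The discrimination and threshold values are hypothetical, and the function names are illustrative rather than taken from the study.

```python
import math


def logistic(x: float) -> float:
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def grm_category_probabilities(theta, a, thresholds):
    """Category probabilities under the Graded Response Model.

    theta: latent trait (here, a company's maturity level).
    a: item discrimination.
    thresholds: ordered boundary locations; K categories need K - 1 values.
    """
    # P(response >= k) for each boundary, padded with 1 (lowest) and 0 (highest).
    cumulative = [1.0] + [logistic(a * (theta - b)) for b in thresholds] + [0.0]
    # Probability of each category is the difference of adjacent boundaries.
    return [cumulative[k] - cumulative[k + 1] for k in range(len(thresholds) + 1)]


# Hypothetical item: five Likert categories, so four ordered thresholds.
probabilities = grm_category_probabilities(theta=0.5, a=1.3,
                                           thresholds=[-2.0, -1.0, 1.0, 2.0])
for category, p in enumerate(probabilities, start=1):
    print(f"category {category}: {p:.3f}")
```

Placing all items on a common `theta` scale in this way is what lets maturity estimates from different survey waves be compared.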
|
197 |
Contribuições ao estudo do modelo de resposta nominal / Contributions to the study of the nominal response model
Pereira, Sheila Regina dos Santos, 1981- 02 October 2012
Advisors: Caio Lucidius Neberezny Azevedo, Hildete Prisco Pinheiro / Dissertation (master's) - Universidade Estadual de Campinas, Instituto de Matemática, Estatística e Computação Científica
Abstract: In the educational field there is growing interest in applying techniques derived from Item Response Theory (IRT), since this methodology has been used in qualitative processes of psychological and educational assessment. However, in most assessments that use multiple-choice items, it is common to reduce the responses to right-or-wrong patterns in order to use these models. Dichotomizing an individual's responses ignores any partial knowledge they may have of the correct answer, because it implicitly assumes that the individual either has the knowledge to choose the correct alternative, or lacks it and randomly selects one of the alternatives. As a result, the information from partial knowledge is not used in the estimation of the latent traits. In this sense, the objective of this work is to show the efficiency of the Nominal Response Model in estimating the latent traits of individuals tested with multiple-choice items, as well as to analyze and interpret the item parameters estimated by this model. / Master's / Statistics / Master in Statistics
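The idea that distractors carry information about partial knowledge can be sketched with the Nominal Response Model in its common multinomial-logit form. The slopes and intercepts below are hypothetical values for a four-alternative item, and the function name is illustrative only.

```python
import math


def nominal_response_probabilities(theta, slopes, intercepts):
    """Probability of choosing each alternative under the Nominal Response Model.

    Each alternative k has a slope a_k and an intercept c_k, and
    P(k | theta) is proportional to exp(a_k * theta + c_k).
    """
    logits = [a * theta + c for a, c in zip(slopes, intercepts)]
    largest = max(logits)  # subtract the maximum for numerical stability
    weights = [math.exp(z - largest) for z in logits]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical item: A is correct, B reflects partial knowledge,
# C and D are implausible distractors.
slopes = [1.5, 0.5, -0.8, -1.2]
intercepts = [0.0, 0.3, 0.0, -0.3]

for theta in (-2.0, 0.0, 2.0):
    probabilities = nominal_response_probabilities(theta, slopes, intercepts)
    print(theta, [round(p, 3) for p in probabilities])
```

Scoring the same item as simply right or wrong would treat alternatives B, C and D identically, whereas here choosing B is associated with a different region of the latent trait than choosing C or D.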
|
198 |
Assessment of Competencies Among Doctoral Trainees in Psychology
Price, Samantha 08 1900
The recent shift to a culture of competence has permeated several areas of professional psychology, including competency identification, competency-based education and training, and competency assessment. A competency framework has also been applied to various programs and specialty areas within psychology, such as clinical, counseling, clinical health, school, cultural diversity, neuro-, gero-, child, and pediatric psychology. Despite the spread of competency focus throughout psychology, few standardized measures of competency assessment have been developed. To the authors' knowledge, only four published studies on measures of competency assessment in psychology currently exist. While these measures represent significant steps in advancing the assessment of competence, three of them were designed for use with individual programs, and two of them are international (i.e., UK and Taiwan). The current study applied the seminal Competency Benchmarks, via a recently adapted benchmarks form (i.e., Practicum Evaluation form; PEF), to practicum students at the University of North Texas. In addition to traditional supervisor ratings, the present study also involved self-, peer supervisor, and peer supervisee ratings to provide 360-degree evaluations. Item response theory (IRT) was used to evaluate the psychometric properties of the PEF and inform potential revisions of this form. Supervisor ratings of competency were found to fit the Rasch model specified, lending support to use of the benchmarks framework as assessed by this form. Self- and peer-ratings were significantly correlated with supervisor ratings, indicating that there may be some utility to 360-degree evaluations. Finally, as predicted, foundational competencies were rated as significantly higher than functional competencies, and competencies improved significantly with training.
Results of the current study provide clarity about the utility of the PEF and inform our understanding of practicum-level competencies.
|
199 |
Rule-based Risk Monitoring Systems for Complex Datasets
Haghighi, Mona 28 June 2016
In this dissertation we present rule-based machine learning methods for solving problems with high-dimensional or complex datasets. We apply decision tree methods to blood-based biomarkers and neuropsychological tests to predict Alzheimer's disease in its early stages. We also use tree-based methods to identify disparities in dementia-related biomarkers among three female ethnic groups. In another part of this research, we use rule-based methods to identify homogeneous subgroups of subjects who share the same risk patterns within a heterogeneous population. Finally, we apply a network-based method to reduce the dimensionality of a clinical dataset while capturing the interactions among variables. The results show that the proposed methods are efficient and easy to use in comparison with current machine learning methods.
|
200 |
Item Analysis for the Development of the Shirts and Shoes Test for 6-Year-Olds
Tucci, Alexander January 2017
The development of a standardized assessment can, in general, be broken into multiple stages. In the first, items to be used in the assessment are generated according to the skills and abilities that are to be assessed and the needs of the developers. These items are then, ideally, tested in the field on members of the population for which the assessment is intended. Item Response Theory (IRT) analysis is used to reveal items in the item pool which are unusable due to measurement error, redundancy in the level of item difficulty, or bias. More potential items may be generated and tested until there is a set of valid items with which the developers can move forward. The present study focused on the steps of item tryout and analysis for the establishment of demonstrable item-level validity. Fifty-one potential test items were analyzed for a version of the Shirts and Shoes Test (Plante & Vance, 2012) for 6-year-olds. A total of 23 items were discarded due to error in one or more of the measures mentioned above, and one item was discarded due to its low difficulty. The remaining 27 items were deemed suitable for the 6-year-old population.
|