Global ETD Search

231	Detection and Classification of DIF Types Using Parametric and Nonparametric Methods: A comparison of the IRT-Likelihood Ratio Test, Crossing-SIBTEST, and Logistic Regression Procedures Lopez, Gabriel E. 01 January 2012 The purpose of this investigation was to compare the efficacy of three methods for detecting differential item functioning (DIF). The performance of the crossing simultaneous item bias test (CSIBTEST), the item response theory likelihood ratio test (IRT-LR), and logistic regression (LOGREG) was examined across a range of experimental conditions including different test lengths, sample sizes, DIF and differential test functioning (DTF) magnitudes, and mean differences in the underlying trait distributions of comparison groups, herein referred to as the reference and focal groups. In addition, each procedure was implemented using both an all-other anchor approach, in which the IRT-LR baseline model, CSIBTEST matching subtest, and LOGREG trait estimate were based on all test items except for the one under study, and a constant anchor approach, in which the baseline model, matching subtest, and trait estimate were based on a predefined subset of DIF-free items. Response data for the reference and focal groups were generated using known item parameters based on the three-parameter logistic item response theory model (3-PLM). Various types of DIF were simulated by shifting the generating item parameters of select items to achieve desired DIF and DTF magnitudes based on the area between the groups' item response functions. Power, Type I error, and Type III error rates were computed for each experimental condition based on 100 replications, and effects were analyzed via ANOVA. Results indicated that the procedures varied in efficacy, with LOGREG, when implemented using an all-other approach, providing the best balance of power and Type I error rate. However, none of the procedures was effective at identifying the type of DIF that was simulated.
crossing simultaneous item bias test; DIF; differential item functioning; item bias; logistic regression; American Studies; Arts and Humanities; Psychology
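The data-generation step described in entry 231 (3-PLM responses for a reference and a focal group, with DIF induced by shifting item parameters and sized by the area between the groups' item response functions) can be sketched roughly as follows. This is only an illustration, not the author's code: the item parameters, the group sizes, the +0.5 difficulty shift, the seed, and all function names are hypothetical, and only uniform DIF on a single item is shown.

```python
import math
import random


def irf_3pl(theta, a, b, c):
    """Probability of a correct response under the three-parameter logistic model."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


def simulate_responses(thetas, items, rng):
    """Draw a 0/1 response for every examinee (row) and item (column)."""
    return [
        [1 if rng.random() < irf_3pl(theta, a, b, c) else 0 for (a, b, c) in items]
        for theta in thetas
    ]


def area_between_irfs(item_ref, item_foc, lo=-10.0, hi=10.0, steps=4000):
    """Unsigned area between two item response functions (trapezoidal rule)."""
    width = (hi - lo) / steps
    gaps = [
        abs(irf_3pl(lo + i * width, *item_ref) - irf_3pl(lo + i * width, *item_foc))
        for i in range(steps + 1)
    ]
    return width * (sum(gaps) - 0.5 * (gaps[0] + gaps[-1]))


rng = random.Random(2012)  # fixed seed so the sketch is reproducible

# Hypothetical (a, b, c) generating parameters for the reference group.
reference_items = [(1.0, 0.0, 0.2), (1.2, -0.5, 0.2), (0.8, 0.5, 0.2)]
# Uniform DIF on the first item: difficulty shifted by +0.5 for the focal group.
focal_items = [(1.0, 0.5, 0.2)] + reference_items[1:]

# Both groups drawn from N(0, 1); a mean difference could be added to the focal group.
ref_thetas = [rng.gauss(0.0, 1.0) for _ in range(500)]
foc_thetas = [rng.gauss(0.0, 1.0) for _ in range(500)]
ref_data = simulate_responses(ref_thetas, reference_items, rng)
foc_data = simulate_responses(foc_thetas, focal_items, rng)

dif_area = area_between_irfs(reference_items[0], focal_items[0])
print(round(dif_area, 3))
```

When the two groups share the same discrimination and guessing parameters, the area should come out close to (1 - c) times the difficulty shift, here 0.8 × 0.5 = 0.4, which gives a quick sanity check on a chosen DIF magnitude.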
232	混合試題與受試者模型於試題差異功能分析之研究 / A Mixture Items-and-Examinees Model Analysis on Differential Item Functioning 黃馨瑩, Huang, Hsin Ying Unknown Date Drawing on the multilevel mixture item response theory model and the random item mixture model, this study proposes the mixture items-and-examinees (MIE) model. The study assessed how well the model detects differential item functioning (DIF) and recovers its parameters under different sample sizes and different numbers of DIF items. The results showed that, with large samples and a larger number of DIF items, the MIE model recovered parameters accurately, correctly identified whether items exhibited DIF, produced good item difficulty estimates, and classified examinees into the correct groups. Its estimation performance was also close to that of the random item mixture model. The findings suggest that future studies apply the MIE model to large-scale education databases and extend it with additional variables. mixture item response theory; random item; differential item functioning
233	The Differential Item Functioning (dif) Analysis Of Mathematics Items In The International Assessment Programs Yildirim, Huseyin Husnu 01 April 2006 Cross-cultural studies, like TIMSS and PISA 2003, have been conducted since the 1960s with the idea that these assessments can provide a broad perspective for evaluating and improving education. In addition, countries can assess their relative positions in mathematics achievement among their competitors in the global world. However, because of the different cultural and language settings of different countries, these international tests may not function as expected across all the countries. Thus, tests may not be equivalent, or fair, linguistically and culturally across the participating countries. In this context, the present study aimed at assessing the equivalence of mathematics items of TIMSS 1999 and PISA 2003 across cultures and languages, to find out whether mathematics achievement possesses any culture-specific aspects. For this purpose, the present study assessed Turkish and English versions of TIMSS 1999 and PISA 2003 mathematics items with respect to (a) psychometric characteristics of items and (b) possible sources of Differential Item Functioning (DIF) between these two versions. The study used Restricted Factor Analysis, Mantel-Haenszel statistics, and Item Response Theory Likelihood Ratio methodologies to determine DIF items. The results revealed that there were adaptation problems in both the TIMSS and PISA studies. However, it was still possible to determine a subtest of items functioning fairly between cultures, to form a basis for a cross-cultural comparison. In PISA, there was a high rate of agreement among the DIF methodologies used. However, in TIMSS, the agreement rate decreased considerably, possibly because the rate of differentially functioning items within TIMSS was higher, and differential guessing and differential discrimination were also issues in the test.
The study also revealed that items requiring competencies of reproduction of practiced knowledge, knowledge of facts, performance of routine procedures, and application of technical skills were less likely to be biased against Turkish students relative to American students at the same ability level. On the other hand, items requiring students to communicate mathematically, items where various results must be compared, and items that had real-world context were less likely to be in favor of Turkish students.
234	Estudo sobre construção de escalas com base na Teoria da Resposta ao Item: avaliação de proficiência em conteúdos matemáticos básicos / Study on scale construction based on Item Response Theory: assessment of proficiency in basic mathematical contents Fujii, Tânia Robaskiewicz Coneglian 07 May 2018 Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) / This work studies scale construction based on Item Response Theory (IRT), resulting in the construction and pedagogical interpretation of a knowledge scale for measuring proficiency in the mathematical content that incoming students in exact-sciences programs need in order to follow calculus and similar courses. The mathematical model adopted in this research was the unidimensional three-parameter logistic model.
The item parameters and respondents' proficiencies were estimated with a Bayesian approach using the Gibbs sampler, a Markov chain Monte Carlo (MCMC) algorithm, implemented in the OpenBUGS software (Bayesian inference Using Gibbs Sampling), which is aimed at Bayesian analysis of complex models. The BILOG-MG software was also used to compare the results. The instrument used to measure knowledge was a test of thirty-six multiple-choice items, each with five alternatives, only one of them correct. The items were written from a reference matrix built for this purpose and divided into three themes: “space and form”, “quantities and measures”, and “numbers and operations/algebra and functions”. Each theme is composed of competencies, and each competency describes a skill to be measured. The proposed scale was given a mean of 250 and a standard deviation of 50, and levels were selected for interpretation in the range from 75 to 425. To interpret the proposed scale, several methods for positioning anchor items at the selected levels were compared. To interpret the scale over its full range, hierarchical cluster analysis was used to segment it into groups, that is, into proficiency bands. The scale was divided into five groups, each characterized by the items positioned as anchors, based on their probabilities of correct response and their discrimination parameter values. Although the results are consistent, they point to the need for continuous improvement of the item bank and the proficiency scale. Item Response Theory; Bayesian inference; Scale segmentation
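The scale in entry 234 (mean 250, standard deviation 50, interpreted between 75 and 425) is consistent with a linear rescaling of a standardized proficiency, under which the interpreted range corresponds to 3.5 standard deviations on either side of the mean. The sketch below assumes that proficiencies are estimated on a mean-0, SD-1 metric before rescaling, which the abstract does not state; the function names and the 25-point spacing between levels are hypothetical.

```python
def theta_to_scale(theta, mean=250.0, sd=50.0):
    """Map a standardized proficiency (mean 0, SD 1) onto the reporting scale."""
    return mean + sd * theta


def scale_to_theta(score, mean=250.0, sd=50.0):
    """Inverse mapping: reporting-scale score back to the standardized metric."""
    return (score - mean) / sd


def interpretation_levels(low=75, high=425, step=25):
    """Candidate levels for anchor-item interpretation (the step is an assumption)."""
    return list(range(low, high + 1, step))


# End points of the interpreted range, expressed in standard deviations.
print(scale_to_theta(75), scale_to_theta(425))
print(interpretation_levels())
```

The same two functions can be reused to place an item's difficulty estimate on the reporting scale before checking which level it might anchor.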
235	Teoria da resposta ao item : aplicação na avaliação da intensidade de sintomas depressivos / Item response theory: application to the assessment of the intensity of depressive symptoms Castro, Stela Maris de Jezus January 2008 CONTEXT: Depression is a disease with high prevalence worldwide that manifests itself through various observable symptoms, the so-called depressive symptoms. Determining the intensity of depressive symptoms can be important for establishing the stage of depression and evaluating its outcome, and the more accurate and rapid this measurement is, the more benefits can be achieved. The intensity of depressive symptoms is a latent trait that can be measured by instruments composed of items representing these observable symptoms, such as the Beck Depression Inventory (BDI). It is important that the methodology for analyzing instruments such as the BDI take into account that not all depressive symptoms have the same importance in relation to the latent trait they are intended to measure.
Item Response Theory (IRT) comprises a group of generalized linear models and associated statistical procedures that describe the association between an individual's level on the latent trait and the probability of a response to an item. A special characteristic of these models is that the estimated levels of the latent trait incorporate the differences in discrimination and severity of each item in the measuring instrument; that is, the items enter with different weights in the estimation of the latent trait of the individuals evaluated. OBJECTIVES: This work aims to show the potential of IRT models, and the full use of information they allow, in the analysis of BDI data for measuring the intensity of depressive symptoms. METHOD: The data come from a cross-sectional study conducted for the adaptation, standardization, and validation of the Beck scales for Portuguese, led by Dr. Jurema Alcides Cunha (PUCRS) and published in 2001; the IRT models used in the analysis were the Graded Response Model of Samejima (1969) and the model for embarrassing items of Cúri (2006). RESULTS: The depressive symptoms that best discriminate the population with respect to the intensity of depressive symptoms are feelings of failure, dissatisfaction, sadness, self-dislike, indecision, difficulty working, and pessimism; those that discriminate least are weight loss, irritability, and self-accusation. The most severe symptoms are weight loss, social withdrawal, suicidal ideas, feelings of failure only for women, and loss of libido only for men (the latter two are items with differential functioning).
CONCLUSIONS: This study showed the many gains from using IRT models to assess the intensity of depressive symptoms: their use takes full advantage of the information, considering the profile of each individual who responds to the instrument, and helps identify those with depressive potential. Depression; Epidemiology; Validation studies; Psychiatric rating scales; Statistical models; Statistical interpretation of data; Intensity of depressive symptoms; Beck Depression Inventory; Item response theory; IRT model for embarrassing items
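To make the role of item discrimination and severity in entry 235 concrete, the following minimal sketch computes category probabilities under Samejima's Graded Response Model for a single four-category item (BDI items are scored 0 to 3). The discrimination and threshold values are hypothetical, not estimates from the study, and the function names are illustrative only.

```python
import math


def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under the Graded Response Model.

    thresholds must be increasing; m thresholds give m + 1 ordered categories.
    """
    # Cumulative probabilities P(X >= k), padded with 1 (k = 0) and 0 (k = m + 1).
    cumulative = [1.0]
    cumulative += [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    cumulative.append(0.0)
    # Each category probability is the difference of adjacent cumulative curves.
    return [cumulative[k] - cumulative[k + 1] for k in range(len(thresholds) + 1)]


def expected_item_score(theta, a, thresholds):
    """Expected item score at a given latent trait level."""
    probs = grm_category_probs(theta, a, thresholds)
    return sum(k * p for k, p in enumerate(probs))


# Hypothetical item: discrimination 1.5, severity thresholds at -1, 0 and 1.
probs = grm_category_probs(0.0, 1.5, [-1.0, 0.0, 1.0])
print([round(p, 3) for p in probs])
print(round(expected_item_score(0.0, 1.5, [-1.0, 0.0, 1.0]), 3))
```

A larger discrimination value steepens the cumulative curves, so the item separates neighbouring trait levels more sharply; higher thresholds mean the upper categories are endorsed only at higher symptom intensity.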
236	Počítačové adaptivní testování v kinantropologii: Monte Carlo simulace s využitím physical self description questionnaire / Computerized Adaptive Testing In Kinanthropology: Monte Carlo Simulations Using The Physical Self Description Questionnaire Komarc, Martin January 2017 This thesis introduces the use of computerized adaptive testing (CAT), a novel and increasingly used method of test administration, in the field of kinanthropology. By adapting a test to an individual respondent's latent trait level, CAT offers numerous theoretical and methodological improvements that can significantly advance testing procedures. The first part of the thesis presents the theoretical and conceptual basis of CAT, along with a brief overview of its historical origins and basic general principles. The discussion necessarily includes some description of Item Response Theory (IRT), since IRT is almost exclusively used as the mathematical model in today's CAT applications. The practical application of CAT is then evaluated using Monte Carlo simulations involving adaptive administration of the Physical Self-Description Questionnaire (PSDQ) (Marsh, Richards, Johnson, Roche, & Tremayne, 1994), an instrument widely used to assess physical self-concept in sport and exercise psychology. The Monte Carlo simulation of the PSDQ adaptive administration used a real item pool (N = 70) calibrated with a Graded Response Model (GRM; see Samejima, 1969, 1997). The responses to test items were generated based on item...
237	Teoria da resposta ao item : aplicação na avaliação de orientação para atenção primária à saúde / Item response theory: application to the assessment of orientation toward primary health care Oliveira, Mônica Maria Celestina de January 2013 The PCATool-Brazil instrument (Primary Care Assessment Tool) allows the presence and extent of the structuring attributes of Primary Health Care (PHC) to be measured in health services. It is important that the methodology for analyzing instruments like PCATool-Brazil take into account that not all items describing the characteristics of structure and process are equally important in relation to the latent trait "PHC Orientation" that this instrument can measure. Item Response Theory (IRT) is a set of mathematical models that seek to represent the probability of an individual giving a response to an item as a function of the item parameters and the latent trait. One feature of these models is that they incorporate differences in discrimination and favorability, which are the parameters of each item, into the estimation of the latent trait. This work aims to make explicit the potential of IRT for obtaining the latent trait "Orientation toward Primary Care" for services of the basic network, as perceived by users, comparing the results with validation via classical test theory, and to propose a reduced version of PCATool-Brazil-Adults.
From a subsample of a population-based cross-sectional study of adults enrolled in the public PHC network of Porto Alegre, a two-parameter logistic model was fitted to explore the characteristics of structure and process that represent the attributes of PHC and to estimate the latent trait "PHC Orientation". The results revealed the contribution of each PCATool item in terms of its discrimination and favorability. Some items performed poorly, given their low discrimination and high variability. Even so, the PHC orientation score obtained via IRT proved compatible with the score obtained via Classical Test Theory. Based on the items that best discriminate the latent trait, it was possible to structure a reduced version of the PCATool composed of 23 items. The study found that items from the attributes Longitudinality and Comprehensiveness (Services Provided) are the most discriminating for the latent trait. However, items from the other essential and derived attributes make at least a moderate contribution. These results supported a reduced version of the PCATool, which showed internal consistency and agreement measures indicative of good fit and association with the IRT score of the full version, and which meets the need to quickly evaluate features important for improving PHC services. Measures, methods and theories; Health services evaluation; Primary health care; Family health; Statistical analysis; Epidemiology; Assessment Services; Family Health Strategy; Latent Trait; Item Response Theory
238	Aplicação da TRI às avaliações dos anos iniciais do Ensino Médio com o objetivo de detectar possíveis falhas no conhecimento de matemática / Application of IRT to assessments in the early years of high school with the aim of detecting possible gaps in mathematics knowledge Tanabe, Roberto Setsuo January 2016 Advisor: Prof. Dr. Valdecir Marvulle / Dissertation (master's) - Universidade Federal do ABC, Programa de Pós-Graduação em Mestrado Profissional em Matemática em Rede Nacional, 2016. / This work analyzes gaps in the mathematical knowledge of first-year high school students, using Item Response Theory (IRT) as the analysis tool, in a test given to students of a state school in the East Zone of São Paulo. We also present the results of applying IRT to a questionnaire given to university students, covering various situations that would involve good or bad luck. To make IRT applicable, we converted these questions into dichotomous items with two parameters and used R to perform the calculations. Item Response Theory; Statistics
239	Mapeamento de repertórios de leitura e escrita em escolas com baixos índices na Prova Brasil / Mapping reading and writing repertoires in schools with low rates at Prova Brasil assessment Silveira, Carolina Coury 05 March 2015 Financiadora de Estudos e Projetos / The Brazilian Ministry of Education has been effectively measuring text comprehension through the national assessment Prova Brasil. However, the recurrent publication of low average performance for schools raises two hypotheses: first, that students may not have fully learned earlier, more basic repertoires, which may influence their performance on text comprehension measures; and second, that Prova Brasil is informative in character and does not have as its main objective providing specific teaching procedures linked to the measures used. This feature of Prova Brasil can make it hard for teachers and coordinators to plan interventions that improve this complex repertoire. The two studies in this work aimed to contribute to the discussion of these hypotheses. In Study 1, basic reading and writing repertoires for regular and irregular isolated words were characterized for 5th-grade students from three schools with low Prova Brasil scores. The results indicated deficits in various basic repertoires in the three schools; performance was most impaired for writing under dictation and for irregular words. In Study 2, the construction of items for a reading comprehension assessment, based on a behavioral interpretation, was described. Evidence for the validity and reliability of the items was also investigated using Item Response Theory (IRT).
The results not only corroborated the medium-to-low performance shown by students on Prova Brasil but also made it possible to verify the validity and consistency of the items for assessing intermediate reading comprehension repertoires. Together, the two studies mapped reading and writing repertoires from basic to complex and showed that the two assessments can feasibly be used to prescribe specific teaching procedures aimed at remediating the deficits identified. Behavioral assessment; Reading; Writing; Text comprehension; Item Response Theory (IRT); Prova Brasil; Human Sciences::Psychology
240	Teoria da resposta ao item : aplicação na avaliação da intensidade de sintomas depressivos [Item response theory: application to assessing the intensity of depressive symptoms] Castro, Stela Maris de Jezus January 2008 (has links) CONTEXT: Depression is a highly prevalent disease worldwide that manifests itself through various observable symptoms, the so-called depressive symptoms. Determining the intensity of depressive symptoms can be important for establishing the stage of depression and evaluating its outcome, and the more accurate and rapid this measurement is, the greater the benefits that can be achieved. The intensity of depressive symptoms is a latent trait that can be measured by instruments composed of items representing these observable symptoms, such as the Beck Depression Inventory (BDI). The methodology used to analyze instruments such as the BDI should take into account that not all depressive symptoms have the same importance in relation to the latent trait they are intended to measure. Item Response Theory (IRT) comprises a group of generalized linear models and associated statistical procedures that describe the association between an individual's level on the latent trait and the probability of a response to an item. A special characteristic of these models is that the estimated levels of the latent trait incorporate the differences in discrimination and severity of each item in the measuring instrument; that is, the items enter with different weights into the estimate of the latent trait of the individuals evaluated. OBJECTIVES: This work aims to show the potential of IRT models, and the full use of information they allow, in the analysis of BDI data to measure the intensity of depressive symptoms. METHOD: The data come from a cross-sectional study carried out to adapt, standardize, and validate the Beck Scales for Portuguese, conducted by Dr. Jurema Alcides Cunha (PUCRS) and published in 2001; the IRT models used to analyze these data were the Graded Response model of Samejima (1969) and the model for embarrassing items of Cúri (2006). RESULTS: The depressive symptoms that best discriminate the population with respect to the intensity of depressive symptoms are sense of failure, dissatisfaction, sadness, self-dislike, indecision, work difficulty, and pessimism; those that discriminate least are weight loss, irritability, and self-accusation. The most severe symptoms are weight loss, social withdrawal, suicidal ideas, sense of failure for women only, and loss of libido for men only (these last two items show differential functioning). CONCLUSIONS: This study showed the many gains that come from using IRT models to assess the intensity of depressive symptoms: they make full use of the information by considering the profile of each individual who responds to the instrument, which helps to identify those with depressive potential. Depression Epidemiology Validation studies Psychiatric rating scales Statistical models Statistical interpretation of data Intensity of depressive symptoms Beck Depression Inventory Item response theory IRT model for embarrassing items
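As an illustration of the graded response model named in the abstract above, the following is a minimal sketch of how category probabilities for one polytomous item can be computed from a discrimination parameter and ordered severity thresholds. It is not the thesis's implementation: the function name `grm_category_probs` and all parameter values are hypothetical, and the four response categories simply mirror the 0 to 3 scoring of a BDI item.

```python
import math

def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under a graded response model.

    theta      -- latent trait level of the respondent
    a          -- item discrimination (must be positive)
    thresholds -- strictly increasing severity thresholds b_1 < ... < b_{K-1}
    Returns a list of K probabilities, one per response category 0..K-1.
    """
    if a <= 0:
        raise ValueError("discrimination must be positive")
    if any(b2 <= b1 for b1, b2 in zip(thresholds, thresholds[1:])):
        raise ValueError("thresholds must be strictly increasing")

    # Cumulative curves: P(X >= k | theta) for k = 1..K-1 (logistic form).
    cumulative = [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]

    # Pad with P(X >= 0) = 1 and P(X >= K) = 0, then difference adjacent curves.
    bounds = [1.0] + cumulative + [0.0]
    return [bounds[k] - bounds[k + 1] for k in range(len(bounds) - 1)]

# Hypothetical item: illustrative values, not estimates from the thesis.
probs = grm_category_probs(theta=0.5, a=1.8, thresholds=[-0.5, 0.8, 2.0])
print([round(p, 3) for p in probs])
```

In this sketch a larger `a` makes the item separate respondents more sharply around its thresholds, and higher thresholds mean that a category is endorsed only at higher trait levels, which is the sense in which items contribute with different weights to the trait estimate.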
