Spelling suggestions: "subject:"tem desponse"" "subject:"tem coresponse""
381 |
MODELAGEM PARA O SOFTWARE VIRTUAL-TANEB BASEADO NA TEORIA DA RESPOSTA AO ITEM PARA AVALIAR O RENDIMENTO DOS ALUNOS / MODELING SOFTWARE FOR VIRTUAL Taneb-BASED THEORY THE RESPONSE TO ITEM TO EVALUATE THE PERFORMANCE OF STUDENTSVeras, Jaclason Machado 26 February 2010 (has links)
Made available in DSpace on 2016-08-17T14:53:07Z (GMT). No. of bitstreams: 1
Jaclason Machado Veras.pdf: 1682090 bytes, checksum: 035fe122eaaac20d2dba05af3e7eab78 (MD5)
Previous issue date: 2010-02-26 / This work proposes a general modeling for an educational software which will
serve to verify the knowledge of geometry of fourth grade students of elementary
school. The Virtual-Taneb will be based on Basic Education Evaluation System
(SAEB) which is an evaluative instrument used in Brazil originated from the
guidelines of the National Assessment of Educational Progress (NAEP), which is
currently applied manually by written tests. One of the motivations for developing
this tool was to reduce the delivery time of obtained results through the resolution of
written tests, obeying Basic Education Courseworks. Another motivation was the
inclusion of the Item Response Theory (IRT) as an educational evaluation quantifier
and qualifier, allowing a statistical analysis from items such as central elements of
tests, since the need to monitor the development process of educational evaluation
systems. However, there are factors that suppress the use of IRT on a large scale,
such as customizing each application and the mathematical complexity of the
methods involved. This prototype of educational software will be applied to
elementary schools in the State of Maranhão, Brazil. It will be adapted to the
cognitive level of students through the inclusion of recourses from the area of
Artificial Intelligence and besides it will enable an effective analysis of students
knowledge using the IRT on the responses made. Virtual-Taneb will provide some
benefits to the Brazilian society, as to favor a long-term reduction of economic
investment in the evaluation process, resulting in improvements for present and
future generations. Another benefit is the adequation possibility of this software to
other evaluation systems, as an example, the International Program of Students
Assessment (PISA), the National Examination of High School (ENEM), Prova Brasil,
among others. In the future, though the adaptation of the Virtual-Taneb software,
there will be the possibility of adding other subjects to the test. / Esse trabalho propõe uma modelagem geral para um software educacional
que atuará na verificação do conhecimento de geometria de alunos da quarta série
do ensino fundamental. O Virtual-TANEB estará baseado no Sistema de Avaliação
da Educação Básica (SAEB) que é um instrumento avaliativo usado no Brasil
originado a partir das diretrizes do National Assessment of Educational Progress
(NAEP), que, atualmente, é aplicado de forma manual por meio de cadernos de
provas. Uma das motivações para o desenvolvimento dessa ferramenta foi reduzir o
tempo de entrega dos resultados obtidos, através da resolução dos cadernos de
provas, obedecendo à Matriz Curricular da Educação Básica. Outra motivação foi a
inclusão da Teoria da Resposta ao Item (TRI) como quantificadora e qualificadora da
avaliação educacional, permitindo uma análise estatística a partir de itens como
elementos centrais da prova, visto a necessidade de acompanhar o processo de
evolução dos sistemas de avaliação educacional. Porém, existem fatores que
suprimem a utilização da TRI em larga escala, tais como: a personalização de cada
aplicação e a complexidade matemática dos métodos envolvidos. Este protótipo de
software educacional será aplicado às escolas do ensino fundamental no Estado do
Maranhão, Brasil. Ele se adaptará ao nível cognitivo dos alunos através da inclusão
de recursos provenientes da área de Inteligência Artificial e, além disso, possibilitará
uma análise mais efetiva do conhecimento dos alunos usando a TRI sobre as
respostas efetuadas. O Virtual-TANEB irá proporcionar alguns benefícios para a
sociedade, como favorecer a longo prazo a redução de investimento econômico no
processo de avaliação, resultando em melhorias para gerações presentes e futuras.
Outro beneficio é a adequação desse software a outros sistemas de avaliação, por
exemplo, Programa Internacional de Avaliação de Alunos (PISA), Exame Nacional
do Ensino Médio (ENEM), Prova Brasil, dentre outros. No futuro, mediante uma
adaptação no software Virtual-TANEB, haverá a possibilidade de realizar a inclusão
de outras disciplinas à prova.
|
382 |
Planejamento de experimentos no ensino da estatística e probabilidade nas séries finais do ensino fundamental II / Designs of experiments in the teaching of statistics and probability for the last years of elementary schoolHomailson Lopes Passos 04 April 2018 (has links)
Este trabalho apresenta uma proposta para o ensino da Estatística e Probabilidade nas séries finais do Ensino Fundamental II. Seu objetivo é mostrar que a metodologia aqui adotada possibilita a aquisição de conceitos estatísticos e probabilísticos, assim como o desenvolvimento de habilidades pessoais e interpessoais. Trata de um projeto fundamentado em Planejamento de Experimentos com respaldo na aprendizagem ativa. Na sequência didática do projeto os alunos realizaram um experimento com aviões de papel no qual tiveram que responder, na prática, a seguinte questão \"Quais alterações podem ser feitas em um modelo de avião de papel para que ele permaneça mais tempo no ar?\". Para atestar a efetividade da sequência didática, foi construído e validado um Teste de Proficiência em Estatística e Probabilidade (TEPEP) com base nos fundamentos da Psicometria. A análise das características do teste foi feita por meio da Teoria Clássica dos Testes e da Teoria de Resposta ao Item. Foram sujeitos da pesquisa 391 alunos de escolas públicas e particulares da região do Vale do Paraíba, Estado de São Paulo. Desse total, 374 auxiliaram na validação do instrumento e os 17 alunos restantes participaram do projeto. Os resultados deste trabalho mostraram que o uso de Planejamento de Experimentos favoreceu a aprendizagem da Estatística e Probabilidade, desenvolvendo também outras competências. Em relação à validação do TEPEP, concluiu-se que os métodos psicométricos empregados têm grande potencial e devem ser mais explorados. Esta pesquisa apresenta, como produtos finais, a metodologia desenvolvida e o teste de proficiência construído, oferecendo ambos a professores e pesquisadores. / This work presents a proposal for teaching of Statistics and Probability, for the last years of Elementary School. Its objective is to show that the methodology adopted here allows the acquisition of statistical and probabilistic concepts, as well as the development of personal and interpersonal skills. It\'s a project with a didactic sequence grounded in Designs of Experiments, supported in active learning. In the didactic sequence of the project the students carried out an experiment with paper airplanes in which they had to answer, in practice, the following question \"What changes can be made to a paper airplane model so that it stays longer in the air?\". To attest the effectiveness of the didactic sequence, it was developed and validated a Proficiency Test in Statistics and Probability (PTSP), this using Psychometry. The analysis of the characteristics of the test was made through the Classical Test Theory and the Item Response Theory. The research subjects were a total of 391 students from public and private schools in the Vale do Paraíba region, State of São Paulo. Of this total, 17 students participated in the project. The other students (374) assisted in the test validation. The results of this research showed that the use of Design of Experiments favored the learning of Statistic and Probability, also to develop others competences. Regarding the validation of PTSP, it could be concluded that the psychometric methods used have potential and they should be more explored. This research acclaims, as final products, the developed methodology and the Proficiency test validated, both offered to teachers and researchers.
|
383 |
Transposição da Teoria da Resposta ao Item: uma abordagem pedagógica / Transposition of Item Response Theory: a pedagogical approachEder Alencar Silva 23 June 2017 (has links)
Este trabalho tem por objetivo apresentar a Teoria da Resposta ao Item (TRI), por meio de uma abordagem pedagógica, aos professores da educação básica, que mencionaram esta necessidade por meio de pesquisa realizada pelo autor. Levar parte do conhecimento teórico que embasa esta teoria ao conhecimento do docente, principalmente a construção da curva de probabilidade de acerto do item, favorecerá a compreensão, a análise e o monitoramento do processo avaliativo educacional. Este material apresenta as principais definições e conceitos da avaliação externa em larga escala, além de fornecer insumos para a compreensão das suposições realizadas para aplicação da metodologia. Neste sentido, o texto foi estruturado de forma a apresentar didaticamente as etapas do processo de implementação de uma avaliação, desde a construção do item até a apuração e divulgação dos resultados. Todo enfoque será dado à construção do modelo da TRI com um parâmetro (dificuldade do item), também conhecido como modelo de Rasch, o que simplifica e facilita a compreensão da metodologia. O modelo utilizado nas avaliações externas em larga escala (modelo com três parâmetros) será introduzido a partir das considerações realizadas na abordagem que explicita o pensamento da construção do modelo de um parâmetro. Acredita-se que esta compreensão possa colaborar com o professor na exploração das habilidades/competências dos alunos durante os anos escolares. / This study aims to present the Item Response Theory (IRT), through a pedagogical approach, to teachers of basic education, which mentioned this necessity through research conducted by the author. To take part of the theoretical knowledge that supports this theory to the teacher\'s knowledge, especially the construction of probability curve of item correct response, it will favor for understanding, analysis and monitoring the evaluation educational process. This material presents the main definitions and concepts of the external evaluation in large scale, besides providing inputs for understanding the assumptions made to apply the methodology. In this sense, the text was structured in order to present the implementation process stages of a large scale assessment, from the item construction to the results calculation and dissemination. The focus will be given to the IRT model construction of one-parameter (difficulty of the item), also known as Rasch model, since it simplifies and facilitates the understanding of methodology. The model used in the external assessment on a large scale (three-parameter model) will be introduced from the considerations made in the approach that explicit the thought of one-parameter model construction. It is believed that understanding can collaborate with teacher in exploration of the students\' skills/competences during the school year.
|
384 |
Uma análise do desempenho dos estudantes no exame nacional do ensino médio e as contribuições para o ensino-aprendizagem de física / An analysis of student performance in national high school examination and contributions to the physical education-learningCarlos, Pablo Rafael de Oliveira 15 December 2016 (has links)
Submitted by Renata Lopes (renatasil82@gmail.com) on 2017-04-18T13:40:02Z
No. of bitstreams: 1
pablorafaeldeoliveiracarlos.pdf: 29554334 bytes, checksum: 0246060899d919f9c710cf3521c9644b (MD5) / Approved for entry into archive by Adriana Oliveira (adriana.oliveira@ufjf.edu.br) on 2017-04-18T14:01:49Z (GMT) No. of bitstreams: 1
pablorafaeldeoliveiracarlos.pdf: 29554334 bytes, checksum: 0246060899d919f9c710cf3521c9644b (MD5) / Approved for entry into archive by Adriana Oliveira (adriana.oliveira@ufjf.edu.br) on 2017-04-18T14:02:00Z (GMT) No. of bitstreams: 1
pablorafaeldeoliveiracarlos.pdf: 29554334 bytes, checksum: 0246060899d919f9c710cf3521c9644b (MD5) / Made available in DSpace on 2017-04-18T14:02:00Z (GMT). No. of bitstreams: 1
pablorafaeldeoliveiracarlos.pdf: 29554334 bytes, checksum: 0246060899d919f9c710cf3521c9644b (MD5)
Previous issue date: 2016-12-15 / No presente trabalho são abordados aspectos importantes das avaliações de larga escala, principalmente as avaliações de Física, os tipos, as características e também a elaboração das matrizes de referência para tais avaliações e dos itens que compõem os instrumentos de avaliação. Discutiremos a Taxonomia de Bloom, a Teoria Clássica dos Testes e a Teoria de Resposta ao Item. É feita uma análise sobre o desempenho dos estudantes concluintes do Ensino Médio que participaram do Exame Nacional do Ensino Médio (ENEM) nas edições de 2009 a 2014, para esse objetivo foi utilizado como fonte de dados os microdados do ENEM fornecida pelo INEP. Primeiramente é feita uma breve discussão sobre a base teórica que envolve esse modelo de exame, a Teoria de Resposta ao Item e Teoria Clássica dos Testes. Os alunos foram distribuídos em sete níveis de proficiência, buscou-se uma série de informações estatísticas entre outras medidas psicrométricas relevantes. Esse estudo revelou um baixo desempenho em Física dos estudantes que estão concluindo o Ensino Médio no Brasil, evidenciando a necessidade do acompanhamento do ensino nessa disciplina, para assim possibilitar ações políticas e pedagógicas que busquem melhorar a qualidade do ensino científico, sendo assim as avaliações educacionais mostram-se uma excelente ferramenta. Por fim, iremos discorrer a respeito dos aspectos positivos de se avaliar e como os resultados das avaliações podem ser usados para melhorar as práticas pedagógicas no ensino de Física. O produto educacional desenvolvido nesse trabalho de mestrado foi a construção de um guia de elaboração, revisão e aplicação de instrumentos de avaliação no ensino de Física cuja finalidade é dar subsídios aos professores para desenvolver um olhar crítico sobre essas avaliações, além de auxiliar no tratamento dos dados gerados e na utilização desses dados para desenvolver atividades em sala de aula que busquem potencializar o ensino-aprendizagem em Física. / In this work we address important aspects of large-scale assessments, especially the Physic assessments, the types, characteristics and also the preparation of the reference matrices for such assessments and items that make up the evaluation instruments. We discuss Bloom's Taxonomy, the classical theory of tests and the Item Response Theory. We analyse the performance of graduating high school students who participated in the National High School Exam (ENEM) in the 2009 to 2014 editions, for this purpose we used as a data source ENEM microdata provided by INEP. A brief discussion of the theoretical basis is primarily made involving this test model, item response theory and classical theory of tests. The students were divided into seven levels of proficiency, sought a series of statistical information between other relevant psychometric measures. This study revealed a poor performance in Physics of students that are completing high school in Brazil, highlighting the need for monitoring of teaching this discipline, so as to enable political and pedagogical actions that seek to improve the quality of science education, so the educational assessments show is an excellent tool. Finally, we will discuss the positive aspects of assessing and how the results of the evaluations can be used to improve teaching practices in teaching physics. The educational product developed in this master's work was the construction of a preparation guide, review and application of assessment tools in teaching physics whose purpose is to assist teachers to develop a critical eye on these assessments, to help in the treatment of the data generated and to use these data to develop activities in the classroom that seek to enhance the teaching and learning in physics.
|
385 |
[en] NONLINEAR MODELS IN ASSESSMENT IN THE SOCIAL SCIENCES: ESTIMATION BY STOCHASTIC APPROXIMATION, A FREQUENTIST MCMC / [pt] MODELOS NÃO LINEARES EM AVALIAÇÃO NAS CIÊNCIAS SOCIAIS: ESTIMAÇÃO POR APROXIMAÇÃO ESTOCÁSTICA UMA MCMC FREQÜENTISTACARLOS ALBERTO QUADROS COIMBRA 19 July 2005 (has links)
[pt] Neste trabalho apresentamos algumas contrubuições ao
estudo dos modelos
de avaliação estatística usados nas ciências sociais. As
contribuições
originais são: i ) uma descrição unificada sobre como a
teoria da medição
evoluiu nas diversas disciplinas científicas; ii ) uma
resenha abrangente sobre
os métodos de estimação por máxima verossimilhança
empregados na
medição estatística; iii ) uma formulação geral do métodos
da máxima verossimilhan
ça tendo em vista a aplicação em modelos não-lineares; e
principalmente,
iv ) a apresentação do método da aproximação estocástica na
estimação dos modelos estatísticos de avaliação e medição.
Os modelos não-lineares ocorrem freqüentemente nas
ciências sociais onde
é importante a modelagem de variáveis de resposta
dicotômicas ou ordinais.
Em particular, este trabalho trata dos modelos da teoria
da resposta
ao item, dos modelos de regressão logística e dos modelos
de componentes
aleatórias em geral. A estimação destes modelos ainda é
objeto de intensa
pesquisa. Não se pode afirmar que exista um método de
estimação
inteiramente confiável. Os métodos aproximados produzem
estimativas com
viés acentuado nas componentes de variância, enquanto os
métodos de integração numérica e os métodos bayesianos
podem apresentar problemas de
convergência em muitos casos. O método da aproximação
estocástica se baseia
na maximização da verossimilhança e emprega o algoritmo de
Robbins-
Monro para resolver a equação do escore. Como um método
estocástico ele
gera um processo de Markov que se aproxima das estimativas
desejadas e
portanto pode ser considerado um MCMC (Monte Carlo Markov
chain)
freqüentista. Nas simulações realizadas o método
apresentou um bom desempenho,
produzindo estimativas com viés pequeno, precisão razoável
e
raros problemas de convergência. / [en] This work presents a study of statistical models used for
assessment and
measurement in the social sciences. The main contributions
are: i ) a unified
description of how evaluation, assessment, and the theory
of measurement
evolved within several branches of science; ii ) a review
of estimation
methods currently employed in nonlinear models; iii ) a
general formulation
of the maximum likelihood estimation method; and
particularly, iv the
presentation of the stochastic approximation method for
estimation of non
linear statistical models in measurement and assessment.
Non linear models occurs frequently in the social sciences
where it is
important to model binary or ordinal response variables.
This work deals
with item response theory models, logistic regression
models and general
models with random components. The estimation of these
models has been
the subject of several recent simulation studies. One
cannot say there is a
best estimation method. The approximate methods are known
to produce
biased estimates, numerical integration methods and
bayesian methods can
present convergence problems in many cases. Stochastic
approximation
method is a maximum likelihood method that uses the
Robbins-Monro
algorithm to solve the score equation. As a stochastic
approximation method
it generates a Markov chain that converges to the desired
estimates and can
be considered a frequentist MCMC. A simulation study and a
comparative
estimation study show a good performance, the method
producing small
bias for the estimates, good precision, and very rare
convergence problems.
|
386 |
The translation and validation of the Postpartum Depression Screening Scale (PDSS) : towards improving screening for postpartum depression in English- and Afrikaans-speaking South African womenStruik, Melony 16 June 2012 (has links)
Postpartum depression is an illness that is frequently unreported and undetected for a variety of reasons and may be potentially devastating for the mother affected as well as her family. Routine screening of postpartum women enables health practitioners to detect symptoms of PPD early and provides an opportunity for early intervention which may improve the outcome and increase the mother’s chances of an earlier recovery. It is therefore important that reliable and convenient screening tools are available to health practitioners who have contact with postpartum women. The primary objective of this research was to make an Afrikaans version of an existing screening scale available – the Postpartum Depression Screening Scale (PDSS), designed specifically to encompass the multifaceted phenomenon of PPD. In accordance with this objective, the validity and reliability of the PDSS and its Afrikaans version was investigated in English- and Afrikaans-speaking South African mothers. A further objective of this study was to compare the performance of the PDSS with the Edinburgh Postnatal Depression Scale (EPDS) and the Quick Inventory of Depressive Symptomatology (QIDS-SR16). Various factors have been reported to be associated with the development of PPD. The final objective of this study was to explore the relationship between known risk factors for PPD and high scores on the PDSS amongst women in South African. A total of 365 South African mothers, between 4 and 16 weeks postpartum participated in this study. English-speaking mothers (n = 187) completed the PDSS, EPDS, QIDS, and a demographic and psychosocial questionnaire, while Afrikaans speaking mothers (n = 178) completed the respective Afrikaans versions of these questionnaires. A multiple translation method – Brislin’s back-translation method and the committee approach – was used to translate the PDSS and the QIDS into Afrikaans. An item response theory (IRT), Rasch analysis, was used to examine dimensionality, item difficulty, differential item functioning, and category functioning of the PDSS and the Afrikaans PDSS. Results reveal excellent person reliability estimates for the Afrikaans PDSS as well as for the PDSS in a South African sample. Both language versions performed reasonably well and the majority of items in the PDSS dimensions and the Afrikaans PDSS dimensions demonstrated fit statistics that supported the underlying constructs of each dimension. Some items were identified as problematic, namely Item 2, Item 25, Item 28, and Item 30. The item person construct maps show reasonably good spread of items. There were, however, persons that scored higher than the items could measure and an overrepresentation of items at the mean level. The Likert response categories proved to be effective for all the Afrikaans PDSS items and almost all the PDSS items. Results indicate that 49.7% of mothers screened positive for major PPD using the PDSS. A further 17.3% of mothers obtained scores indicating the presence of significant symptoms of PPD. Statistically significant correlations were obtained between total scores on the PDSS, the EPDS, and the QIDS-SR16. Stepwise multiple regression analysis identified 11 variables that were significantly associated with a high PDSS total score. These were a history of psychiatric illness, postpartum blues, feeling negative or ambivalent about expecting this baby, fearful of childbirth, infant temperament, antenatal depression in recent pregnancy, lack of support from the baby’s father, concern about health related issues regarding the infant, lack of support from friends, difficulty conceiving, and life stress. / Thesis (PhD)--University of Pretoria, 2012. / Psychology / unrestricted
|
387 |
Exploring a meta-theoretical framework for dynamic assessment and intelligenceMurphy, Raegan 30 September 2007 (has links)
Dynamic assessment, as manner of alternative process-based assessment, is currently at a cross-roads chiefly characterised by, at times, vague conceptualisation of terminology, blurred demarcation as to its model and theory status and at times ill-defined fundamental philosophy. As a movement in modern psychological assessment within the broader field of intelligence, dynamic assessment does not present with a coherent unifying theory as such and due to its lack of clarity in a number of key areas its eventual disuse might well be the final outcome of this method and its unique history and methodology. In pursuit of this study’s main goal, dynamic assessment models and theories are critically explored by means of a meta-theory largely inspired by the work K.B. Madsen, a Danish meta-theorist and pioneer in theoretical psychology. Madsen’s meta-theory is attenuated in order to suit the nature and purposes of this study; so as to better analyse dynamic assessment within intelligence research and assessment. In its primary aim, this study builds on a foundation of epistemological and ontological considerations within science in general, the social sciences and psychology in particular. In keeping with Madsen’s method of meta-theory analysis, the author’s predilections are stated at the outset in order to place the progression of analyses of the various models and theories within dynamic assessment. Dynamic assessment and intelligence are discussed and a brief digression into the history of Soviet psychology is offered as it is pertinent to the work of Lev Vygotsky and its subsequent influence within process-based assessment. Theory and model development within science and the social sciences are described from a philosophy-of-science vantage point. Psychological assessment’s prime considerations are critically explored and the discussion highlights the role played by the philosophical aspects of mathematics and statistical foundations as leveraging measurement within assessment. Particular attention is paid to the perennial controversy surrounding null hypothesis significance testing and the possible future directions that can be explored by and within dynamic assessment which lends itself to approaches less restrictive than those offered by mainstream statistics. The obvious and not so obvious aspects within the mathematical, statistical and measurement foundations are critically explored in terms of how best dynamic assessment can manoeuvre within the current mainstream psychological assessment system and how new models of item response theory suited to change-based assessment can be explored as possible manner of handling the gain score issue; itself a paradoxical state of affairs within classical and modern test theory. Dynamic assessment’s past has in large part been dictated by mainstream considerations in the areas mentioned and in order to place itself on an alternative path these considerations are critically assessed in terms of dynamic assessment’s future path. Dynamic assessment and its place within the broader intelligence assessment field is then investigated by means of the meta-theory developed. It is envisaged that the intuitive appeal of dynamic assessment will continue to garner support from practitioners across the globe, specifically those trained in countries outside the traditional stronghold of Western psychological theory. However, the position taken in this argument is that in order to ensure its survival it will need to make a decision in terms of its future progress: either to branch off from mainstream assessment altogether or to become fused within mainstream assessment. The “best of both worlds” scenario has obviously not worked out as it was originally hoped. The study concludes with the meta-theoretical exploration of dynamic assessment within intelligence by utilising a small selection of current models. The application of the attenuated Madsenian framework seeks to explore, place and ascertain the nature of each model regarding the ontological and philosophical status of the approach; the nature of the hypothetical terminology, scientific hypotheses and hypothesis system utilised and lastly the nature of the abstract data, concrete data and prime considerations as implicit concerns within the varied approaches. An HQ score is calculated for each such model and is a partial indicator of the testability (verifiability or falsifiability) of the model in question. The models are thus couched in meta, hypothetical and data strata and can be positioned on a continuum of sorts according to which tentative claims can be made regarding the veracity of the approach behind each model. The study concludes with two appendices; a meta-analysis which was conducted on South African research in the field of dynamic assessment (1961-2002) and which cumulated in a significant effect size evidencing an overall positive effect that dynamic assessment has had as an alternative intervention technique in comparison to conventional or static based assessment models. In order to encourage replication of this study, all details pertaining to the studies included for consideration in the meta-analyses are attached in section 2 of this appendix. Secondly, an informal content analysis was conducted on eleven responses to questionnaires that were originally delivered to one hundred dynamic assessment practitioners and researchers across the globe. The purpose of the questionnaire was to ascertain information on core issues within dynamic assessment, as these fundamental issues were considered as pivotal in the future of this approaches’ eventual development or stagnation. The analysis concluded that dynamic assessment is indeed perceived to be at a crossroads of sorts and thus supported the initial hypothesis stated above. It is hoped that this theoretical study will aid in aligning dynamic assessment in a manner such that its eventual place in psychological assessment will be solidly grounded, theoretically defensible and viable as alternative manner of assessment. / Thesis (PhD (Psychology))--University of Pretoria, 2007. / Psychology / PhD / PhD / unrestricted
|
388 |
Berufsbezogene Handlungs- vs. Lageorientierung: Skalierbarkeit und Beziehung zu beruflicher Arbeitsleistung: Occupational action state orientation: Scalability and its relation to job performanceStadelmaier, Ulrich W. 01 December 2016 (has links)
Die vorliegende Arbeit verknüpft die Theorie der Interaktion psychischer Systeme von Kuhl (2000, 2001) mit dem Modell beruflicher Arbeitsleistung von Tett und Burnett (2003). Unter Anwendung reizorientierter arbeitspsychologischer Stressmodelle werden Hypothesen über einen durch das subjektive Bedrohungs- und Belastungspotenzial der Arbeitssituation moderierten Zusammenhang zwischen dem berufsbezogenen und nach Maßgabe der Item Response Theorie skalierbaren Persönlichkeitsmerkmal Handlungs- vs. Lageorientierung und beruflicher Arbeitsleistung aufgestellt. In drei Befragungen an N = 415, N = 331 sowie N = 49 Berufstätigen wurden querschnittliche Daten zur Hypothesenprüfung erhoben. Berufsbezogene Handlungs- vs. Lageorientierung zeigt sich als valides Subkonstrukt der allgemeinen Handlungs- vs. Lageorientierung, welches gemäß Graded Response Modell von Samejima (1969, 1997) mit 14 Items skalierbar ist. Prospektive berufsbezogene Handlungs- vs. Lageorientierung erklärt in multiplen hierarchischen Regressionsanalysen, im Gegensatz zu allgemeiner Handlungs- vs. Lageorientierung, inkrementell zu Gewissenhaftigkeit, Extraversion und Neurotizismus Anteile kontextueller und aufgabenbezogener Arbeitsleistung. Hypothesenkonträr werden diese Zusammenhänge nur marginal vom subjektiven Belastungspotential der Arbeitssituation moderiert. Die Prädiktorfunktion prospektiver berufsbezogener Handlungs- vs. Lageorientierung für berufliche Arbeitsleistung bleibt auch unter pfadanalytischer Kontrolle eines vorhandenen Common Method Bias erhalten. Die dispositionelle Fähigkeit, durch berufliche Hindernisse gehemmten positiven Affekt vorbewusst gegenregulieren zu können, scheint demnach ein bedeutender Prädiktor beruflicher Arbeitsleistung zu sein, insbesondere bei Führungskräften. Für die berufliche Eignungsbeurteilung ist es damit von diagnostischem Mehrwert, Handlungs- vs. Lageorientierung kontextualisiert zu erheben. Der Einsatz probabilistisch- testtheoretisch konstruierter Skalen steigert dabei die Effizienz des Beurteilungsprozesses. / The current paper combines personality systems interaction theory (Kuhl, 2000, 2001) with the model of job performance by Tett and Burnett (2003). Using established stress models from work psychology it is hypothesized that there is a relation between occupational action state orientation, scalable by means of items response theory, and job performance, which is moderated by the subjective stress level of job characteristics. Three surveys among samples of N = 415, N = 331, and N = 49 professionals yielded cross sectional data for investigating the hypotheses. Occupational action state orientation proves a valid construct which is compatible with Samejima’s (1969, 1997) Graded Response Model using a 14-item scale. As a result of multiple hierarchical regression analyses, the hesitation dimension of specifically occupational, in contrast to general action state orientation is a predictor of both contextual and task performance, incremental to conscientiousness, extraversion, and neuroticism. Contrary to expectations this relation is only marginally moderated by stress-relevant job characteristics. Even when controlling for an occurring common method bias by means of path analysis the occupational hesitation dimension’s predictor role perseveres. Therefore, the dispositional ability in subconsciously regulating inhibited positive affect due to occupational obstacles, seems to be a crucial predictor of job performance, especially regarding leaders. Hence, professional aptitude assessment benefits from assessing action state orientation in a contextualized manner. Application of item response theory-based scales further enhances assessment process efficiency.
|
389 |
Adaptivní testování pro odhad znalostí / Computerized adaptive testing in knowledge assessmentTělupil, Dominik January 2018 (has links)
In this thesis, we describe and analyze computerized adaptive tests (CAT), the class of psychometrics tests in which items are selected based on the actual estimate of respondent's ability. We focus on the tests based on di- chotomic IRT (item response theory) models. We present critera for item selection, methods for ability estimation and termination criteria, as well as methods for exposure rate control and content balancing. In the analytical part, the effect of CAT settings on the average length of the test and on absoulute bias of ability estimates is investigated using linear regression mo- dels. We provide post hoc analysis of real data coming from real admission test with unknown true values of abilities, as well as simulation study based on the simulated answers of respondents with known true values of ability. In the last chapter of the thesis we investigate the possibilities of analysing adaptive tests in R software and of creating a real CAT. 1
|
390 |
Modèles de tests adaptatifs pour le diagnostic de connaissances dans un cadre d'apprentissage à grande échelle / Cognitive diagnostic computerized adaptive testing models for large-scale learningVie, Jill-Jênn 05 December 2016 (has links)
Cette thèse porte sur les tests adaptatifs dans les environnements d’apprentissage. Elle s’inscrit dans les contextes de fouille de données éducatives et d’analytique de l’apprentissage, où l’on s’intéresse à utiliser les données laissées par les apprenants dans des environnements éducatifs pour optimiser l’apprentissage au sens large.L’évaluation par ordinateur permet de stocker les réponses des apprenants facilement, afin de les analyser et d’améliorer les évaluations futures. Dans cette thèse, nous nous intéressons à un certain type de test par ordinateur, les tests adaptatifs. Ceux-ci permettent de poser une question à un apprenant, de traiter sa réponse à la volée, et de choisir la question suivante à lui poser en fonction de ses réponses précédentes. Ce processus réduit le nombre de questions à poser à un apprenant tout en conservant une mesure précise de son niveau. Les tests adaptatifs sont aujourd’hui implémentés pour des tests standardisés tels que le GMAT ou le GRE, administrés à des centaines de milliers d’étudiants. Toutefois, les modèles de tests adaptatifs traditionnels se contentent de noter les apprenants, ce qui est utile pour l’institution qui évalue, mais pas pour leur apprentissage. C’est pourquoi des modèles plus formatifs ont été proposés, permettant de faire un retour plus riche à l’apprenant à l’issue du test pour qu’il puisse comprendre ses lacunes et y remédier. On parle alors de diagnostic adaptatif.Dans cette thèse, nous avons répertorié des modèles de tests adaptatifs issus de différents pans de la littérature. Nous les avons comparés de façon qualitative et quantitative. Nous avons ainsi proposé un protocole expérimental, que nous avons implémenté pour comparer les principaux modèles de tests adaptatifs sur plusieurs jeux de données réelles. Cela nous a amenés à proposer un modèle hybride de diagnostic de connaissances adaptatif, meilleur que les modèles de tests formatifs existants sur tous les jeux de données testés. Enfin, nous avons élaboré une stratégie pour poser plusieursquestions au tout début du test afin de réaliser une meilleure première estimation des connaissances de l’apprenant. Ce système peut être appliqué à la génération automatique de feuilles d’exercices, par exemple sur un cours en ligne ouvert et massif (MOOC). / This thesis studies adaptive tests within learning environments. It falls within educational data mining and learning analytics, where student educational data is processed so as to optimize their learning.Computerized assessments allow us to store and analyze student data easily, in order to provide better tests for future learners. In this thesis, we focus on computerized adaptive testing. Such adaptive tests which can ask a question to the learner, analyze their answer on the fly, and choose the next question to ask accordingly. This process reduces the number of questions to ask to a learner while keeping an accurate measurement of their level. Adaptive tests are today massively used in practice, for example in the GMAT and GRE standardized tests, that are administered to hundreds of thousands of students. Traditionally, models used for adaptive assessment have been mostly summative : they measure or rank effectively examinees, but do not provide any other feedback. Recent advances have focused on formative assessments, that provide more useful feedback for both the learner and the teacher ; hence, they are more useful for improving student learning.In this thesis, we have reviewed adaptive testing models from various research communities. We have compared them qualitatively and quantitatively. Thus, we have proposed an experimental protocol that we have implemented in order to compare the most popular adaptive testing models, on real data. This led us to provide a hybrid model for adaptive cognitive diagnosis, better than existing models for formative assessment on all tried datasets. Finally, we have developed a strategy for asking several questions at the beginning of a test in order to measure the learner more accurately. This system can be applied to the automatic generation of worksheets, for example on a massive online open course (MOOC).
|
Page generated in 0.0837 seconds