Spelling suggestions: "subject:"tem response theory"" "subject:"stem response theory""
51 |
Mindfulness In Parenting Questionnaire (MIPQ): Development and validation of a measure of mindful parentingMcCaffrey, Stacey 01 January 2015 (has links)
Mindful parenting has been defined as “paying attention to your child and your parenting in a particular way: intentionally, here and now, and non-judgmentally” (Kabat-Zinn & Kabat-Zinn, 1997). Although it is hypothesized that increasing mindful parenting improves parent and child functioning, the development of a measure of mindful parenting is needed to support this assumption. The aim of the present study was to develop and psychometrically evaluate a measure of mindful parenting (the Mindfulness In Parenting Questionnaire: MIPQ) for use with mothers and fathers of both children and adolescents, ranging in age from 2- to 16-years-old. The current study contained three phases. First, content experts in the area of mindfulness and parenting provided content for preliminary items. Second, parents participated in cognitive interviewing in order to reduce measurement error and increase the psychometrics of the measure. The third and final phase consisted of large-scale data collection to explore the psychometrics of the new MIPQ. Two-hundred and three parents recruited from academic and after-school programs in South Florida completed the MIPQ, along with measures of intrapersonal mindfulness, parenting behavior, parenting style, and a demographics questionnaire. The Partial Credit Model, which evidenced significantly better fit than the Rating Scale Model, was used to evaluate the MIPQ using WINSTEPS 3.74.01. The MIPQ was iteratively refined based on statistical and clinical considerations, resulting in a 28-item measure with 4 response categories. Further, results supported a 2 factor mindful parenting construct. The first factor (Parental Self-Efficacy) reflects a parent’s self-efficacy, as well as nonreactivity and awareness within the parenting role, while the second factor (Being in the Moment with the Child) pertains to the child, and reflects present-centered attention, empathic understanding, and acceptance of the child. Factors were correlated (r = .67) and explained 42.3% and 43.4% of the variance, respectively. Correlations between the MIPQ and parenting style, parenting practices, practice of mindfulness, and participant demographics provided support for convergent and discriminant validity. The MIPQ exhibited a positive and weak correlation with the MAAS, indicating that interpersonal and intrapersonal mindfulness are related, but separate and distinct constructs. Limitations and directions for future research are discussed.
|
52 |
En Raschanalys för att jämföra två svenska översättningar av en enkät som mäter hälsorelaterad livskvalitetKielén, Martina, Wallentinsson, Emma January 2016 (has links)
During the 1980’s the non-profit organisation RAND Corporation conducted the two-year Medical Outcomes Study with the goal of creating a comprehensive medical questionnaire. The resulting 116-item questionnaire measures health related quality of life (HRQoL) topics such as physical, mental and general health. The questionnaire is available as a free resource on their web page. SF-36, which contains 36 of these questions, is distributed for a fee by the US company Quality Metric Inc. The company has translated the questionnaire into several languages, including Swedish, and has also taken license for the translations. Registercentrum sydost has made a new Swedish translation of the same questions as in the SF-36. This survey is called RAND-36 and is license free. Because Quality Metric Inc has taken license for its Swedish translation, the surveys are similar but not identical. This study aims to compare the aforementioned HRQoL-instruments to determine whether it is possible to replace the licensed questionnaire SF-36 with the license free RAND-36. The distribution of items with response options according ordinal scale were compared with Mann-Whitney U-test. The test yielded a significant difference for eight items in the measure PF(physical functioning), MH(mental health), VT (vitality) and GH (general health perceptions). The distribution of items with response options according dichotomous scale were compared with X2-test. The test yielded significant difference for an item in the measure RE (emotional role functioning). The reliability of questionnaire was compared with ordinal alpha. In the selection the reliability between MH and VT is equivalent. The biggest difference between the surveys is the measure RP (physical role functioning) where the RAND-36 meets the requirement that the measure can be used for reliable conclusions on the individual level, which is a condition that SF-36 can’t met. The probability of entering an answer, given the respondent's ability, was compared with Rasch analysis. Wald's test gave DIF between most items within the measures PF, MH, VT and GH.
|
53 |
The Teacher Attitudes toward Homeless Students Scale: Development and ValidationBrown, Jessica January 2012 (has links)
Thesis advisor: Larry H. Ludlow / Recent estimates suggest there are roughly 1.6 million homeless children and this number is growing (National Center on Family Homelessness, 2011). This trend is particularly worrisome given that homeless children face a number of obstacles within society and education, not the least of which is negative teacher attitudes (Swick, 2000; U.S. Department of Education, 2002). This study's primary research question addressed whether a set of underlying dimensions could be identified and used to effectively measure teacher attitudes toward homeless students. A necessary part of answering this research question involved the development of a measurement scale. Both Classical Test Theory and Item Response Theory analyses aided in the elimination process of items in order to create the final Teacher Attitudes toward Homeless Students (TAHS) assessment, which includes an attitudes scale and subscales, and a related knowledge scale. The final outcome was a set of 43 items, across eight dimensions, which could effectively be used to measure teacher attitudes toward homeless students. Additionally, the findings upheld the principles of Rasch measurement, including unidimensionality, a hierarchical ordering of items, and a continuum of the construct definition. In other words, the findings indicate that the TAHS scale was successfully developed according to explicit a priori measurement criteria. Moreover, additional correlational and regression analyses provided empirical construct and convergent validity evidence for the TAHS scale. It was also found that attitudes differed slightly for teachers of various backgrounds and experiences, but when analyzed collectively these variables were not significantly related to teacher attitudes toward homeless students. Additionally, there was only a weak relationship between teachers' attitudes and their knowledge about homelessness. Overall the TAHS scale allows for reliable and accurate measurement of teacher attitudes toward homeless students from which valid inferences can be made. The TAHS scale scores and score descriptors can be used to help teacher interpret their attitude. This has the potential for a direct impact in creating equal educational opportunities for homeless students as teachers become aware of their attitude and make positive changes. / Thesis (PhD) — Boston College, 2012. / Submitted to: Boston College. Lynch School of Education. / Discipline: Educational Research, Measurement, and Evaluation.
|
54 |
Stability and sensitivity of a model-based person-fit index in detecting item pre-knowledge in computerized adaptive test. / 特定模型個人擬合指數在探測預見題目時的穩定性及靈敏度 / CUHK electronic theses & dissertations collection / Te ding mo xing ge ren ni he zhi shu zai tan ce yu jian ti mu shi de wen ding xing ji ling min duJanuary 2008 (has links)
After the stability and sensitivity of FLOR were investigated, the application of it in the CAT environment had become the main concern. The present studies found that both the test length and the number of exposed items affect the final value of FLOR. In the fixed length CAT, the FLOR has a much stronger sensitivity than lz and CUSUM in detecting item pre-knowledge. The sensitivity of FLOR in the fixed length CAT was the same as that in the fixed length fixed items test. If the test length could vary, the sensitivity of FLOR in CAT would be slightly weakened. The Adjusted FLOR index could increase the sensitivity. Concerning about the effect of ability on the sensitivity of FLOR in CAT, it was found that the abilities of the test takers in CAT did not affect the sensitivity of FLOR and Adjusted FLOR. / Item response theory is a modern test theory. It focuses on the performance of each item. Under this framework, the performance of test takers on a test item can be predicted by a set of abilities. The relationship between the test takers' item performances and the set of abilities underlying item performances can be described by a monotonically increasing function called an item characteristic curve. Due to various personal reasons, the performances of the test takers may depart from the response patterns predicted by the underlying test model. In order to calculate the extent of departure of these aberrant response patterns, a number of methods have been developed under the theme "person-fit statistics". The degree of aberration is calculated as an index called person-fit index. Inside the computerized adaptive testing (CAT), test takers with different abilities will answer different numbers of questions and the difficulties of the items administered to them are usually clustered at the abilities of the test takers. Due to this reason, the application of person-fit indices in the computerized adaptive testing environment to measure misfit is difficult. / The present study also found that FLOR has a much superior sensitivity over other indices in detecting item pre-knowledge. Concerning about the sensitivity over different abilities of test takers, it was found that the sensitivity of FLOR was the highest among low ability test takers and the weakest among strong ability test takers in the fixed length and fixed items tests. However, the sensitivities of FLOR became the same among different abilities of test takers if items with difficulties matching their abilities were used in the tests. The number of beneficiaries among the test takers did not affect the sensitivity of FLOR. Moreover, in a simulation to test the differentiating power of FLOR, it was found that FLOR could differentiate item pre-knowledge from other reasons of personal misfits (test anxiety, player, random response and challenger) effectively. / The present study assessed the stability of FLOR over other variables, which were unrelated to item pre-knowledge. It found that FLOR was stable over the discrimination and difficulty parameters of test items. It was also stable over positions of the exposed items in the test and the initial assignment of prior probability of item pre-knowledge. However, the asymptotes (guessing factor) and the probabilities of item exposure did affect the final values of FLOR seriously. / The present study used the hf plot to access the sensitivity of the person-fit indices. hf plot is a plot of hit rate against false alarm rate. For a higher hit rate, usually a higher false alarm rate is followed. hf plot provides a good tools for comparison between indices by inspection of the speed of rise of the curves. A sensitive index should give a faster rise of the curve. In this study, sensitivity of an index was defined as the speed of rise of the hf plot, which is represented by a parameter hftau estimated from the data obtained from hf plot. / When the frequent accesses to the item bank has become feasible, test takers may memorize blocks of test items and share these items with future test takers. Individuals with prior knowledge of some items may use that information to get high scores, in the sense that their test scores have been artificially inflated. FLOR is an index of posterior log-odds ratio used for detecting the use of item pre-knowledge. It can be applied both in the fixed item, fixed length test and the CAT environment. It is a model-based index in which aberrant models are defined in the situation of item pre-knowledge. FLOR describes the likelihood that a response pattern arises from the aberrant models. / Hui Hing-fai. / Adviser: Kit-tai Hau. / Source: Dissertation Abstracts International, Volume: 70-09, Section: A, page: . / Thesis (Ed.D.)--Chinese University of Hong Kong, 2008. / Includes bibliographical references (leaves 108-111). / Electronic reproduction. Hong Kong : Chinese University of Hong Kong, [2012] System requirements: Adobe Acrobat Reader. Available via World Wide Web. / Electronic reproduction. [Ann Arbor, MI] : ProQuest Information and Learning, [200-] System requirements: Adobe Acrobat Reader. Available via World Wide Web. / Abstracts in English and Chinese. / School code: 1307.
|
55 |
Potential test information for multidimensional testsJonas, Katherine Grace 01 August 2017 (has links)
Test selection in psychological assessment is guided, both explicitly and implicitly, by how informative tests are with regard to a trait of interest. Most existing formulations of test information are sensitive to subpopulation variation, with the result that test information will vary from sample to sample. Recently, measures of test information have been developed that quantify the potential informativeness of the test. These indices are defined by the properties of the test, as distinct from the properties of the sample or examinee. As of yet, however, measures of potential information have been developed only for unidimensional tests. In practice, psychological tests are often multidimensional. Furthermore, multidimensional tests are often used to estimate one specific trait among many. This study develops measures of potential test information for multidimensional tests, as well as measures of marginal potential test information---test information with regard to one trait within a multidimensional test. In Study 1, the performance of the metrics was tested in data simulated from unidimensional, first-order multidimensional, second-order, and bifactor models. In Study 2, measures of marginal and multidimensional potential test information are applied to a set of neuropsychological data collected as part of Rush University's Memory and Aging Project. In simulated data, marginal and multidimensional potential test information were sensitive to the changing dimensionality of the test. In observed neuropsychological data, five traits were identified. Verbal abilities were most closely correlated with probable dementia. Both indices of marginal potential test information identify the Mini Mental Status Exam as the best measure of that trait. More broadly, greater marginal potential test information calculated with regard to verbal abilities was associated with greater criterion validity. These measures allow for the direct comparison of two multidimensional tests that assess the same trait, facilitating test selection and improving the precision and validity of psychological assessment.
|
56 |
Schizotypy's shape: structure, items, and dependabilityStringer, Deborah Michele 01 December 2012 (has links)
Dimensional models of schizotypy and associated traits have taken on current relevance in the DSM-5 (http://www.dsm5.org) proposal for personality disorder (PD), which includes a personality trait initially conceptualized as a five-facet schizotypy domain and then simplified into a three-facet psychoticism domain that has yet to be evaluated extensively. In this study, I (1) reviewed the literature to develop hypotheses about the content and boundaries of the schizotypy domain, and (2) measured this content in a mixed sample of students and patents with 657 usable protocols at Time 1 (193 Notre Dame students, 301 University of Iowa students; 163 outpatients) and 263 usable protocols (74 Notre Dame students, 76 University of Iowa students, 113 outpatients) at Time 2, at least 1.5 weeks later. I then (3) evaluated confirmatory models including DSM-5 schizotypy and psychoticism facet models and other empirically grounded models and (4) used the best confirmatory model to provide item pools for classically constructing scales of schizotypy facets. This four-factor structure provided content pools used to create four corresponding scales: Unusual Perceptions, Unusual Beliefs, Dissociation Proneness, and Cognitive and Communicative Peculiarity. Additionally, (5) I used item response theory (IRT)-based analyses to evaluate items in these facet scales, both in terms of the level of schizotypy they best measure and the strength of their relations to the schizotypy construct. I also (6) examined the short-term test-retest reliability of the schizotypy scales, as well as that of the established measures used in this study; new and existing measures were comparably stable. Finally, (7) I evaluated schizotypy's convergent and discriminant validity in relation to three other types of traits: (a) those correlated with the domain (e.g., Obsessive Compulsive Disorder [OCD] and non-delusional mistrust), (b) other higher level traits (i.e., measures of the 3 factor and 5-factor models of higher order personality/temperament), and (c) familially related traits (e.g., social anxiety). Overall, the schizotypy facet measures appeared to assess moderate amounts of variance that were unexplained by the established measures of personality, temperament, and psychopathology that were included in this study. The implications of adding a schizotypy trait to the overall personality trait taxonomy are discussed.
|
57 |
Assessing Invariance of Factor Structures and Polytomous Item Response Model Parameter EstimatesReyes, Jennifer McGee 2010 December 1900 (has links)
The purpose of the present study was to examine the
invariance of the factor structure and item response model
parameter estimates obtained from a set of 27 items
selected from the 2002 and 2003 forms of Your First College
Year (YFCY). The first major research question of the
present study was: How similar/invariant are the factor
structures obtained from two datasets (i.e., identical
items, different people)? The first research question was
addressed in two parts: (1) Exploring factor structures
using the YFCY02 dataset; and (2) Assessing factorial
invariance using the YFCY02 and YFCY03 datasets.
After using exploratory and confirmatory and factor
analysis for ordered data, a four-factor model using 20
items was selected based on acceptable model fit for the YFCY02 and YFCY03 datasets. The four factors (constructs)
obtained from the final model were: Overall Satisfaction,
Social Agency, Social Self Concept, and Academic Skills.
To assess factorial invariance, partial and full factorial
invariance were examined. The four-factor model fit both
datasets equally well, meeting the criteria for partial and
full measurement invariance.
The second major research question of the present
study was: How similar/invariant are person and item
parameter estimates obtained from two different datasets
(i.e., identical items, different people) for the
homogenous graded response model (Samejima, 1969) and the
partial credit model (Masters, 1982)?
To evaluate measurement invariance using IRT methods,
the item discrimination and item difficulty parameters
obtained from the GRM need to be equivalent across
datasets. The YFCY02 and YFCY03 GRM item discrimination
parameters (slope) correlation was 0.828. The YFCY02 and
YFCY03 GRM item difficulty parameters (location)
correlation was 0.716. The correlations and scatter plots
indicated that the item discrimination parameter estimates
were more invariant than the item difficulty parameter
estimates across the YFCY02 and YFCY03 datasets.
|
58 |
Item and person parameter estimation using hierarchical generalized linear models and polytomous item response theory modelsWilliams, Natasha Jayne. January 2003 (has links)
Thesis (Ph. D.)--University of Texas at Austin, 2003. / Vita. Includes bibliographical references. Available also from UMI Company.
|
59 |
A New Item Response Theory Model for Estimating Person Ability and Item Parameters for Multidimensional Rank Order ResponsesSeybert, Jacob 01 January 2013 (has links)
The assessment of noncognitive constructs poses a number of challenges that set it apart from traditional cognitive ability measurement. Of particular concern is the influence of response biases and response styles that can influence the accuracy of scale scores. One strategy to address these concerns is to use alternative item presentation formats (such as multidimensional forced choice (MFC) pairs, triads, and tetrads) that may provide resistance to such biases. A variety of strategies for constructing and scoring these forced choice measured have been proposed, though they often require large sample sizes, are limited in the way that statements can vary in location, and (in some cases) require a separate precalibration phase prior to the scoring of forced-choice responses. This dissertation introduces new item response theory models for estimating item and person parameters from rank-order responses indicating preferences among two or more alternatives representing, for example, different personality dimensions. Parameters for this new model, called the Hyperbolic Cosine Model for Rank order responses (HCM-RANK), can be estimated using Markov chain Monte Carlo (MCMC) methods that allow for the simultaneous evaluation of item properties and person scores. The efficacy of the MCMC parameter estimation procedures for these new models was examined via three studies. Study 1 was a Monte Carlo simulation examining the efficacy of parameter recovery across levels of sample size, dimensionality, and approaches to item calibration and scoring. It was found that estimation accuracy improves with sample size, and trait scores and location parameters can be estimated reasonably well in small samples. Study 2 was a simulation examining the robustness of trait estimation to error introduced by substituting subject matter expert (SME) estimates of statement location for MCMC item parameter estimates and true item parameters. Only small decreases in accuracy relative to the true parameters were observed, suggesting that using SME ratings of statement location for scoring might be a viable short-term way of expediting MFC test deployment in field settings. Study 3 was included primarily to illustrate the use of the newly developed IRT models and estimation methods with real data. An empirical investigation comparing validities of personality measures using different item formats yielded mixed results and raised questions about multidimensional test construction practices that will be explored in future research. The presentation concludes with a discussion of MFC methods and potential applications in educational and workforce contexts.
|
60 |
Item and person parameter estimation using hierarchical generalized linear models and polytomous item response theory modelsWilliams, Natasha Jayne 27 July 2011 (has links)
Not available / text
|
Page generated in 0.0871 seconds