Return to search

A Bayesian Hierarchical Mixture Approach to Model Timing Data with Application to Writing Assessment

Deane (2011) proposed a multi-layer cognitive writing model. The 2009 Cognitively Based Assessment of, for, and as learning (CBAL) Writing pilot assessment was designed to support that multi-layer model of writing. One of the features of the assessment was that the keystroke activity of students writing essays were logged to computer files. The preliminary analysis (Almond, et al., 2012) developed an algorithm to classify the pauses in writing based on the keystroke and suggested the distribution for pause events is a mixture of lognormal distribution. This early research was a promising effort to tie the mixture components to the layers in the multi-layer writing model. However, the preliminary analysis with sample size of 68 needed to be repeated with the large data set. Moreover, the data needed to be hierarchically modeled so that the data can fit mixture components when the number of pause events is small per essay. To address these problems, the first part of this thesis aims to fit the large data set (CBAL Writing pilot 2009) in a mixture of lognormal distribution. Then, a distributional analysis was carried out to evaluate the fit of the model to the data. The result confirms the preliminary analysis result. Namely, the two-component mixture model provides an adequate description of the data. The second part of the thesis aims to estimate mixture parameters by using the Bayesian hierarchical model suggested in the preliminary analysis (Almond, et al., 2012). In the Bayesian framework, the hierarchical model is useful when the number of observed pause events is small per essay. Therefore, the proposed model becomes a mixture of k univariate lognormal heteroscedastic components in the Bayesian hierarchical framework. The evaluation of Bayesian model implementation primarily requires the convergence of the Markov chain Monte Carlo (MCMC) sampler. Diagnostic tools were used to assess the convergence of the MCMC sampler. The results show that the MCMC sampler converged for both the two- and the three-component models. / A Thesis submitted to the Department of Educational Psychology and Learning Systems in partial fulfillment of the requirements for the degree of Master of Science. / Fall Semester, 2013. / June 26, 2013. / Bayesian statistics, Mixture Model, Timing Data, Writing Assessment / Includes bibliographical references. / Russell George Almond, Professor Directing Thesis; Young-Suk Kim, Committee Member; Yanyun Yang, Committee Member.

Identiferoai:union.ndltd.org:fsu.edu/oai:fsu.digital.flvc.org:fsu_185129
ContributorsLi, Tingxuan (authoraut), Almond, Russell George (professor directing thesis), Kim, Young-Suk (committee member), Yang, Yanyun (committee member), Department of Educational Psychology and Learning Systems (degree granting department), Florida State University (degree granting institution)
PublisherFlorida State University, Florida State University
Source SetsFlorida State University
LanguageEnglish, English
Detected LanguageEnglish
TypeText, text
Format1 online resource, computer, application/pdf
RightsThis Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). The copyright in theses and dissertations completed at Florida State University is held by the students who author them.

Page generated in 0.0016 seconds