21

Extracting Quantitative Information from Nonnumeric Marketing Data: An Augmented Latent Semantic Analysis Approach

Arroniz, Inigo 01 January 2007 (has links)
Despite the widespread availability and importance of nonnumeric data, marketers do not have the tools to extract information from large amounts of nonnumeric data. This dissertation attempts to fill this void: I developed a scalable methodology that is capable of extracting information from extremely large volumes of nonnumeric data. The proposed methodology integrates concepts from information retrieval and content analysis to analyze textual information. This approach avoids a pervasive difficulty of traditional content analysis, namely the classification of terms into predetermined categories, by creating a linear composite of all terms in the document and then weighting the terms according to their inferred meaning. In the proposed approach, meaning is inferred from the collocation of the term across all the texts in the corpus. It is assumed that there is a lower-dimensional space of concepts that underlies word usage. The semantics of each word are inferred by identifying its various contexts in a document and across documents (i.e., in the corpus). After the semantic similarity space is inferred from the corpus, the words in each document are weighted to obtain their representation in the lower-dimensional semantic similarity space, effectively mapping the terms to the concept space and ultimately creating a score that measures the concept of interest. I propose an empirical application of the outlined methodology. For this empirical illustration, I revisit an important marketing problem: the effect of movie critics on the performance of movies. In the extant literature, researchers have used an overall numerical rating of the review to capture the content of movie reviews. I contend that valuable information present in the textual materials remains uncovered. I use the proposed methodology to extract this information from the nonnumeric text contained in a movie review. The proposed setting is particularly attractive for validating the methodology because it allows a simple test of the text-derived metrics by comparing them to the numeric ratings provided by the reviewers. I empirically show the application of this methodology and traditional computer-aided content-analytic methods to study an important marketing topic, the effect of movie critics on movie performance. In the empirical application of the proposed methodology, I use two datasets that, combined, contain more than 9,000 movie reviews nested in more than 250 movies. I restudy this marketing problem in light of directly obtaining information from the reviews instead of following the usual practice of using an overall rating or a classification of the review as either positive or negative. I find that the addition of the direct content and structure of the review adds a significant amount of explanatory power as a determinant of movie performance, even in the presence of actual reviewer overall ratings (stars) and other controls. This effect is robust across distinct operationalizations of both the review content and the movie performance metrics. In fact, my findings suggest that as we move from sales to profitability to financial return measures, the role of the content of the review, and therefore the critic's role, becomes increasingly important.
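The weighting scheme this abstract describes (mapping terms into a lower-dimensional concept space inferred from co-occurrence, then scoring each document against a concept of interest) is the core of latent semantic analysis. A minimal sketch of such a pipeline using scikit-learn; the toy reviews, seed terms, and variable names are illustrative assumptions, not the dissertation's actual implementation:

```python
# Minimal LSA sketch: map review texts into a low-dimensional concept
# space and score each document against a seed "concept" direction.
# Corpus and seed terms are made-up placeholders.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import TruncatedSVD

reviews = [
    "A masterful film with superb acting and a gripping plot.",
    "Dull, overlong, and poorly edited; the plot never cohered.",
    "Gorgeous cinematography, though the acting felt flat.",
]

# Term-document weighting: a linear composite of all terms per document.
vectorizer = TfidfVectorizer(stop_words="english")
X = vectorizer.fit_transform(reviews)            # docs x terms

# Infer the lower-dimensional concept space from term co-occurrence.
svd = TruncatedSVD(n_components=2, random_state=0)
doc_vecs = svd.fit_transform(X)                  # docs x concepts

# Score each review against a concept defined by seed terms
# (hypothetical seeds for a "positive evaluation" concept).
seed = vectorizer.transform(["superb gripping masterful"])
concept_vec = svd.transform(seed)[0]

scores = doc_vecs @ concept_vec / (
    np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(concept_vec)
)
for text, s in zip(reviews, scores):
    print(f"{s:+.2f}  {text[:40]}")
```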
22

Accuracy and Interpretability Testing of Text Mining Methods

Ashton, Triss A. 08 1900 (has links)
Extracting meaningful information from large collections of text data is problematic because of the sheer size of the database. However, automated analytic methods capable of processing such data have emerged. These methods, collectively called text mining, first began to appear in 1988. A number of additional text mining methods quickly developed in independent research silos, each based on unique mathematical algorithms. How good each of these methods is at analyzing text is unclear. Method development typically evolves from some silo-centric requirement, with the success of the method measured by a custom requirement-based metric. Results of the new method are then compared to another method that was similarly developed. The proposed research introduces an experimentally designed testing method for text mining that eliminates research-silo bias and simultaneously evaluates methods from all of the major context-region text mining method families. The proposed research method follows a randomized block factorial design with two treatments consisting of three and five levels (RBF-35), with repeated measures. The contribution of the research is threefold. First, the users perceived a difference in the effectiveness of the various methods. Second, while still not fully clear, there are characteristics within the text collection that affect the algorithms' ability to extract meaningful results. Third, this research develops an experimental design process for testing the algorithms that is adaptable to other areas of software development and algorithm testing. This design eliminates the biased practices historically employed by algorithm developers.
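The RBF-35 layout named in the abstract (a randomized block design crossing a three-level and a five-level treatment, with repeated measures) can be generated programmatically. A small illustrative sketch; the factor names, levels, and block count are invented for illustration and are not taken from the dissertation:

```python
# Sketch of a randomized block factorial (RBF-35) layout:
# every block (subject) sees all 3 x 5 treatment combinations
# in an independently randomized order. Factor names are hypothetical.
import itertools
import random

methods = ["LSA", "LDA", "NMF"]                        # 3-level treatment (hypothetical)
collections = ["news", "email", "abstracts",
               "reviews", "forums"]                    # 5-level treatment (hypothetical)
cells = list(itertools.product(methods, collections))  # 15 treatment cells

random.seed(42)
design = []
for block in range(1, 4):                              # e.g. 3 subject blocks
    order = cells[:]
    random.shuffle(order)                              # randomize within block
    for run, (method, coll) in enumerate(order, 1):
        design.append((block, run, method, coll))

for row in design[:5]:
    print(row)  # (block, run order, method, collection)
```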
23

Investigating the relationship between the business performance management framework and the Malcolm Baldrige National Quality Award framework.

Hossain, Muhammad Muazzem 08 1900 (has links)
The business performance management (BPM) framework helps an organization continuously adjust and successfully execute its strategies. BPM helps increase flexibility by providing managers with an early alert about changes and, as a result, allows faster response to such changes. The Malcolm Baldrige National Quality Award (MBNQA) framework provides a basis for self-assessment and a systems perspective for managing an organization's key processes for achieving business results. The MBNQA framework is a more comprehensive framework and encapsulates the underlying constructs in the BPM framework. The objectives of this dissertation are fourfold: (1) to validate the underlying relationships presented in the 2008 MBNQA framework; (2) to explore the MBNQA framework at the dimension level, and to develop and test constructs measured at that level in a causal model; (3) to validate and create a common general framework for the business performance model by integrating the practitioner literature with basic theory, including existing MBNQA theory; and (4) to integrate the BPM framework and the MBNQA framework into a new framework (the BPM-MBNQA framework) that can guide organizations in their journey toward achieving and sustaining competitive and strategic advantages. The purpose of this study is to achieve these objectives by means of a combination of methodologies, including literature reviews, expert opinions, interviews, presentation feedback, content analysis, and latent semantic analysis. An initial BPM framework was developed based on reviews of the literature and expert opinions. There is a paucity of academic research on business performance management; therefore, this study reviewed the practitioner literature on BPM and, from the numerous organization-specific BPM models, developed a generic, conceptual BPM framework. With the intent of obtaining valuable feedback, this initial BPM framework was presented to Baldrige Award recipients (BARs) and selected academicians from across the United States who participated in the Fall Summit 2007, held at the Caterpillar Financial headquarters in Nashville, TN on October 1 and 2, 2007. Incorporating the feedback from that group made it possible to refine and improve the proposed BPM framework. This study developed a variant of traditional latent semantic analysis (LSA), called causal latent semantic analysis (cLSA), that enables the testing of causal models using textual data. This method was used to validate the 2008 MBNQA framework based on abstracts of articles on the Baldrige Award and program published in both practitioner and academic journals from 1987 to 2009. The cLSA was also used to validate the BPM framework using the full body text of all articles published in the practitioner journal Business Performance Management Magazine since its inception in 2003. The results provide the first cLSA study of these frameworks. This is also the first study to examine all the causal relationships within the MBNQA and BPM frameworks.
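The abstract gives no implementation details for cLSA. One plausible reading is that construct scores are derived from text via LSA and directed paths between constructs are then tested statistically. A speculative sketch under exactly those assumptions; the corpus, construct seed terms, and the tested path are all invented and may differ from the dissertation's method:

```python
# Speculative cLSA-style sketch: score texts on two constructs via LSA,
# then test a directed path between the construct scores with OLS.
# Construct seed terms and the toy corpus are invented placeholders.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import TruncatedSVD
from sklearn.linear_model import LinearRegression

abstracts = [
    "Leadership commitment drove process improvement and customer focus.",
    "Strategic planning aligned operations with measurable business results.",
    "Workforce engagement improved quality outcomes and financial results.",
    "Customer satisfaction rose after leadership restructured key processes.",
]

vec = TfidfVectorizer(stop_words="english")
X = vec.fit_transform(abstracts)
lsa = TruncatedSVD(n_components=2, random_state=0).fit(X)
docs = lsa.transform(X)

def construct_score(seed_terms):
    """Project documents onto an LSA direction defined by seed terms."""
    direction = lsa.transform(vec.transform([seed_terms]))[0]
    return docs @ direction

leadership = construct_score("leadership planning strategic")
results = construct_score("results outcomes satisfaction financial")

# Test the hypothesized path: leadership -> business results.
model = LinearRegression().fit(leadership.reshape(-1, 1), results)
print("path coefficient:", round(model.coef_[0], 3))
```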
24

Visualization of Knowledge Spaces to Enable Concurrent, Embedded and Transformative Input to Knowledge Building Processes

Teplovs, Christopher 01 September 2010 (has links)
This thesis focuses on the creation of a systems architecture to help inform the development of next-generation knowledge-building environments. The architectural model consists of three components: an infrastructure layer, a discourse layer, and a visualization layer. The Knowledge Space Visualizer (KSV), which defines the top visualization layer, is a prototype system for showing reconstructed representations of discourse-based artifacts and facilitating assessment in light of patterns of interactivity among participants and their ideas. The KSV uses Latent Semantic Analysis to extend techniques from Social Network Analysis, making it possible to infer relationships among note contents. Thus, idea networks can be studied in conjunction with social networks in online discourse. Further, benchmark corpora can be used to determine knowledge advances and the systems of interactivity leading to them. Results can then provide feedback to students and teachers to support them in attaining continually higher levels of achievement. In addition to visual representations, the KSV provides quantitative network metrics such as degree and density. Data drawn from 9- and 10-year-old students working on a six-week unit on optics were used to illustrate some of the functionality of the KSV. Three studies show ways in which the new visualizations can be used: (a) to highlight relationships among notes, (b) to track the development of discourse over time, and (c) as an assessment tool. Implications for the design of knowledge-building environments, assessment tools, and design-based research are discussed.
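The combination described here, LSA-based similarity between notes feeding a graph analyzed with social-network metrics such as degree and density, can be sketched briefly. The notes, the similarity threshold, and the library choices below are assumptions, not the KSV's actual code:

```python
# Sketch of an idea network: link notes whose LSA vectors are similar,
# then compute degree and density as in social network analysis.
# Notes and the 0.5 threshold are illustrative assumptions.
import networkx as nx
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import TruncatedSVD
from sklearn.metrics.pairwise import cosine_similarity

notes = [
    "Light bends when it passes from air into water.",
    "Refraction happens because light changes speed in water.",
    "Shadows form where an object blocks the light.",
    "A prism splits white light into colours by refraction.",
]

X = TfidfVectorizer(stop_words="english").fit_transform(notes)
vecs = TruncatedSVD(n_components=2, random_state=0).fit_transform(X)
sim = cosine_similarity(vecs)

G = nx.Graph()
G.add_nodes_from(range(len(notes)))
for i in range(len(notes)):
    for j in range(i + 1, len(notes)):
        if sim[i, j] > 0.5:                 # link semantically related notes
            G.add_edge(i, j, weight=sim[i, j])

print("degrees:", dict(G.degree()))
print("density:", round(nx.density(G), 3))
```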
26

Generalized Hebbian Algorithm for Dimensionality Reduction in Natural Language Processing

Gorrell, Genevieve January 2006 (has links)
The current surge of interest in search and comparison tasks in natural language processing has brought with it a focus on vector space approaches and vector space dimensionality reduction techniques. Presenting data as points in hyperspace provides opportunities to use a variety of well-developed tools pertinent to this representation. Dimensionality reduction allows data to be compressed and generalised. Eigen decomposition and related algorithms are one category of approaches to dimensionality reduction, providing a principled way to reduce data dimensionality that has time and again shown itself capable of enabling access to powerful generalisations in the data. Issues with the approach, however, include computational complexity and limitations on the size of dataset that can reasonably be processed in this way. Large datasets are a persistent feature of natural language processing tasks. This thesis focuses on two main questions. Firstly, in what ways can eigen decomposition and related techniques be extended to larger datasets? Secondly, this having been achieved, of what value is the resulting approach to information retrieval and to statistical language modelling at the n-gram level? The applicability of eigen decomposition is shown to be extendable through the use of an extant algorithm, the Generalized Hebbian Algorithm (GHA), and through a novel extension of this algorithm to paired data, the Asymmetric Generalized Hebbian Algorithm (AGHA). Several original extensions to these algorithms are also presented, improving their applicability in various domains. The applicability of GHA to Latent Semantic Analysis-style tasks is investigated. Finally, AGHA is used to investigate the value of singular value decomposition, an eigen decomposition variant, to n-gram language modelling. A sizeable perplexity reduction is demonstrated.
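The GHA itself is compact: it estimates the leading eigenvectors of a data stream's covariance one sample at a time, without ever forming the covariance matrix, which is what makes it attractive for large datasets. A minimal sketch of the underlying update rule (Sanger's rule); the toy data and learning rate are illustrative, and this is not the thesis's implementation:

```python
# Generalized Hebbian Algorithm (Sanger's rule): streaming estimation of
# the top-k eigenvectors of the input covariance, one sample at a time.
# Update: dW = lr * (y x^T - lower_triangular(y y^T) W)
import numpy as np

rng = np.random.default_rng(0)

def gha(samples, k=2, lr=0.01, epochs=50):
    dim = samples.shape[1]
    W = rng.normal(scale=0.1, size=(k, dim))   # rows converge to eigenvectors
    for _ in range(epochs):
        for x in samples:
            y = W @ x
            W += lr * (np.outer(y, x) - np.tril(np.outer(y, y)) @ W)
    return W

# Toy data with a dominant direction; compare GHA to the exact answer.
data = rng.normal(size=(500, 5)) * np.array([3.0, 2.0, 1.0, 0.5, 0.1])
W = gha(data)
_, exact = np.linalg.eigh(np.cov(data.T))      # columns ascend by eigenvalue
alignment = abs(W[0] / np.linalg.norm(W[0]) @ exact[:, -1])
print("alignment with top eigenvector:", round(alignment, 3))
```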
27

Using Topic Models to Study Journalist-Audience Convergence and Divergence: The Case of Human Trafficking Coverage on British Online Newspapers

Papadouka, Maria Eirini 08 1900 (has links)
Despite the accessibility of online news and the availability of sophisticated methods for analyzing news content, no previous study has focused on the simultaneous examination of news coverage of human trafficking and audiences' interpretations of this coverage. In my research, I have examined both journalists' and commenters' topic choices in the coverage and discussion of human trafficking on the online platforms of three British newspapers covering the period 2009–2015. I used latent semantic analysis (LSA) to identify emergent topics in my corpus of newspaper articles and readers' comments, and I then quantitatively investigated topic preferences to identify convergence and divergence in the topics discussed by journalists and their readers. I addressed my research questions in two distinct studies. The first, a case study, applied topic modelling techniques and further quantitative analyses to article and comment paragraphs from The Guardian. The second, more extensive study included article and comment paragraphs from the online platforms of three British newspapers: The Guardian, The Times and the Daily Mail. The findings indicate that the theories of "agenda setting" and of the "active audience" are not mutually exclusive, and the scope of explanation of each depends partly on the specific topic or subtopic that is analyzed. Taking into account further theoretical concepts related to agenda setting, four additional research questions were addressed. Topic convergence and divergence were further identified when taking into account the newspapers' political orientation and the articles' and comments' year of publication.
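One way to operationalize the convergence and divergence described here is a shared topic space with group-level topic prevalence compared between journalists and commenters. A sketch of that idea; the texts, labels, and topic count are placeholders, not the study's data or code:

```python
# Sketch: extract shared LSA topics from article and comment paragraphs,
# then compare each group's mean topic loadings to spot convergence or
# divergence. Texts and topic count are illustrative placeholders.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import TruncatedSVD

articles = [
    "Police rescued trafficking victims in a London raid.",
    "New legislation targets traffickers and protects victims.",
]
comments = [
    "The police should do more raids like this one.",
    "Victims need long-term support, not just rescue.",
]
texts = articles + comments
labels = np.array([0, 0, 1, 1])            # 0 = journalist, 1 = commenter

vec = TfidfVectorizer(stop_words="english")
topics = TruncatedSVD(n_components=2, random_state=0)
loadings = topics.fit_transform(vec.fit_transform(texts))

for t in range(loadings.shape[1]):
    j = loadings[labels == 0, t].mean()    # journalists' topic prevalence
    c = loadings[labels == 1, t].mean()    # commenters' topic prevalence
    print(f"topic {t}: journalists {j:+.2f}, commenters {c:+.2f}, gap {j - c:+.2f}")
```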
28

An Analysis of Educational Technology Publications: Who, What and Where in the Last 20 Years

Natividad Beltrán del Río, Gloria Ofelia 05 1900 (has links)
This exploratory and descriptive study examines research articles published in ten of the top journals in the broad area of educational technology during the last 20 years: 1) Educational Technology Research and Development (ETR&D); 2) Instructional Science; 3) Journal of the Learning Sciences; 4) TechTrends; 5) Educational Technology: The Magazine for Managers of Change in Education; 6) Journal of Educational Technology & Society; 7) Computers and Education; 8) British Journal of Educational Technology (BJET); 9) Journal of Educational Computing Research; and 10) Journal of Research on Technology in Education. To discover research trends in the articles published from 1995 to 2014, abstracts from all contributing articles published in those ten prominent journals were analyzed to extract a latent semantic space of broad research areas, top authors, and top-cited publications. Concepts that have emerged, grown, or diminished in the field were noted in order to identify the most dominant ones of the last two decades, and the most frequent contributors to each journal, as well as those who contributed to more than one of the journals studied, were identified.
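A simple way to flag concepts that have emerged, grown, or diminished is to compare a term's document frequency across periods. A toy sketch of that idea; the abstracts, years, and terms are invented, and the study's actual latent-semantic method is richer than this:

```python
# Sketch: flag emerging vs. diminishing concepts by comparing a term's
# document frequency between two decades of abstracts. The abstracts,
# years, and terms here are invented placeholders.
from collections import Counter

abstracts = [
    (1996, "hypermedia instruction and CD-ROM courseware design"),
    (1999, "web-based instruction and hypermedia learning"),
    (2008, "online learning communities and mobile learning"),
    (2013, "mobile learning analytics and MOOC participation"),
]

def doc_freq(docs):
    """Fraction of documents mentioning each term."""
    counts = Counter()
    for _, text in docs:
        counts.update(set(text.lower().split()))
    return {term: n / len(docs) for term, n in counts.items()}

early = doc_freq([d for d in abstracts if d[0] < 2005])
late = doc_freq([d for d in abstracts if d[0] >= 2005])

for term in ["hypermedia", "mobile", "learning"]:
    e, l = early.get(term, 0.0), late.get(term, 0.0)
    trend = "emerging" if l > e else "diminishing" if l < e else "stable"
    print(f"{term:12s} 1995-2004: {e:.2f}  2005-2014: {l:.2f}  -> {trend}")
```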
29

Enhancing User Search Experience in Digital Libraries with Rotated Latent Semantic Indexing

Polyakov, Serhiy 08 1900 (has links)
This study investigates a semi-automatic method for the creation of topical labels representing the topical concepts in information objects. The method is called rotated latent semantic indexing (rLSI). rLSI has found application in text mining but has not been used for topical label generation in digital libraries (DLs). The present study proposes a theoretical model and an evaluation framework, both based on the LSA theory of meaning, and investigates rLSI in a DL environment. The proposed evaluation framework for rLSI topical labels is focused on human information-search behavior and satisfaction measures. Experimental systems that utilize those topical labels were built to evaluate user satisfaction with the search process. A new instrument was developed for this study, and the experiment showed high reliability of the measurement scales and confirmed their construct validity. Data were collected through information search tasks performed by 122 participants using two experimental systems. A quantitative method of analysis, partial least squares structural equation modeling (PLS-SEM), was used to test a set of research hypotheses and to answer the research questions. The results showed a nonsignificant indirect effect of topical label type on both guidance and satisfaction. The conclusion of the study is that topical labels generated using rLSI provide the same levels of alignment, guidance, and satisfaction with the search process as topical labels created by professional indexers using best practices.
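Rotated LSI applies a factor rotation to the LSA dimensions so that each dimension loads heavily on a small set of terms, which can then serve as a topical label. A minimal sketch using a textbook varimax rotation; the corpus is a placeholder, and the abstract does not say which rotation the study used, so varimax is an assumption:

```python
# Sketch of rotated LSI: run LSA, then varimax-rotate the term loadings
# so each dimension concentrates on a few terms, yielding candidate
# topical labels. Corpus is a placeholder; rotation choice is assumed.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import TruncatedSVD

docs = [
    "digital library search interface and user satisfaction",
    "users search digital collections with topical labels",
    "indexing and metadata quality in library catalogs",
    "metadata standards improve catalog indexing",
]

vec = TfidfVectorizer(stop_words="english")
X = vec.fit_transform(docs)
svd = TruncatedSVD(n_components=2, random_state=0).fit(X)
terms = np.array(vec.get_feature_names_out())

def varimax(L, q=50, tol=1e-6):
    """Textbook varimax rotation of a loading matrix L (terms x k)."""
    p, k = L.shape
    R, d = np.eye(k), 0.0
    for _ in range(q):
        Lam = L @ R
        u, s, vt = np.linalg.svd(
            L.T @ (Lam**3 - Lam @ np.diag((Lam**2).sum(0)) / p)
        )
        R, d_old = u @ vt, d
        d = s.sum()
        if d_old != 0 and d / d_old < 1 + tol:
            break
    return L @ R

rotated = varimax(svd.components_.T)        # terms x k, sparser columns
for t in range(rotated.shape[1]):
    top = terms[np.argsort(-np.abs(rotated[:, t]))[:3]]
    print(f"label {t}: {', '.join(top)}")   # candidate topical label terms
```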
30

Evaluating Semantic Internalization Among Users of an Online Review Platform

Zaras, Dimitrios 08 1900 (has links)
The present study draws on recent sociological literature that argues that the study of cognition and culture can benefit from theories of embodied cognition. The concept of semantic internalization is introduced, which is conceptualized as the ability to perceive and articulate the topics that are of most concern to a community as they are manifested in social discourse. Semantic internalization is partly an application of emotional intelligence in the context of community-level discourse. Semantic internalization is measured through the application of Latent Semantic Analysis. Furthermore, it is investigated whether this ability is related to an individual’s social capital and habitus. The analysis is based on data collected from the online review platform yelp.com.
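A speculative way to score semantic internalization, understood here as how closely an individual's discourse aligns with the community's dominant topics in an LSA space, is sketched below. The reviews and the cosine-to-centroid measure are invented illustrations, and the study's actual measure may differ:

```python
# Speculative sketch of a semantic-internalization score: how closely a
# user's reviews align with the community's discourse in an LSA space.
# Reviews and the scoring choice are invented placeholders.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import TruncatedSVD
from sklearn.metrics.pairwise import cosine_similarity

community = [
    "Great service and friendly staff, the brunch menu is excellent.",
    "Slow service but the food quality made up for the wait.",
    "Cozy atmosphere, attentive staff, reasonable prices.",
]
user_reviews = ["Staff were friendly and the service was quick."]

vec = TfidfVectorizer(stop_words="english")
X = vec.fit_transform(community + user_reviews)
space = TruncatedSVD(n_components=2, random_state=0).fit_transform(X)

centroid = space[: len(community)].mean(axis=0)   # community discourse
user_vec = space[len(community):].mean(axis=0)    # user's discourse

score = cosine_similarity([user_vec], [centroid])[0, 0]
print("semantic internalization score:", round(score, 3))
```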
