Global ETD Search

171	Novel document representations based on labels and sequential information Kim, Seungyeon 21 September 2015 (has links) A wide variety of text analysis applications are based on statistical machine learning techniques. The success of those applications is critically affected by how we represent a document. Learning an efficient document representation has two major challenges: sparsity and sequentiality. The sparsity often causes high estimation error, and text's sequential nature, interdependency between words, causes even more complication. This thesis presents novel document representations to overcome the two challenges. First, I employ label characteristics to estimate a compact document representation. Because label attributes implicitly describe the geometry of dense subspace that has substantial impact, I can effectively resolve the sparsity issue while only focusing the compact subspace. Second, while modeling a document as a joint or conditional distribution between words and their sequential information, I can efficiently reflect sequential nature of text in my document representations. Lastly, the thesis is concluded with a document representation that employs both labels and sequential information in a unified formulation. The following four criteria are utilized to evaluate the goodness of representations: how close a representation is to its original data, how strongly a representation can be distinguished from each other, how easy to interpret a representation by a human, and how much computational effort is needed for a representation. While pursuing those good representation criteria, I was able to obtain document representations that are closer to the original data, stronger in discrimination, and easier to be understood than traditional document representations. Efficient computation algorithms make the proposed approaches largely scalable. This thesis examines emotion prediction, temporal emotion analysis, modeling documents with edit histories, locally coherent topic modeling, and text categorization tasks for possible applications. Representation learning Topic modeling Supervised learning Sequential document modeling Sentiment analysis Mood analysis Matrix factorization Machine learning Artificial intelligence
172	Application of common sense computing for the development of a novel knowledge-based opinion mining engine Erik, Cambria January 2011 (has links) The ways people express their opinions and sentiments have radically changed in the past few years thanks to the advent of social networks, web communities, blogs, wikis and other online collaborative media. The distillation of knowledge from this huge amount of unstructured information can be a key factor for marketers who want to create an image or identity in the minds of their customers for their product, brand, or organisation. These online social data, however, remain hardly accessible to computers, as they are specifically meant for human consumption. The automatic analysis of online opinions, in fact, involves a deep understanding of natural language text by machines, from which we are still very far. Hitherto, online information retrieval has been mainly based on algorithms relying on the textual representation of web-pages. Such algorithms are very good at retrieving texts, splitting them into parts, checking the spelling and counting their words. But when it comes to interpreting sentences and extracting meaningful information, their capabilities are known to be very limited. Existing approaches to opinion mining and sentiment analysis, in particular, can be grouped into three main categories: keyword spotting, in which text is classified into categories based on the presence of fairly unambiguous affect words; lexical affinity, which assigns arbitrary words a probabilistic affinity for a particular emotion; statistical methods, which calculate the valence of affective keywords and word co-occurrence frequencies on the base of a large training corpus. Early works aimed to classify entire documents as containing overall positive or negative polarity, or rating scores of reviews. Such systems were mainly based on supervised approaches relying on manually labelled samples, such as movie or product reviews where the opinionist’s overall positive or negative attitude was explicitly indicated. However, opinions and sentiments do not occur only at document level, nor they are limited to a single valence or target. Contrary or complementary attitudes toward the same topic or multiple topics can be present across the span of a document. In more recent works, text analysis granularity has been taken down to segment and sentence level, e.g., by using presence of opinion-bearing lexical items (single words or n-grams) to detect subjective sentences, or by exploiting association rule mining for a feature-based analysis of product reviews. These approaches, however, are still far from being able to infer the cognitive and affective information associated with natural language as they mainly rely on knowledge bases that are still too limited to efficiently process text at sentence level. In this thesis, common sense computing techniques are further developed and applied to bridge the semantic gap between word-level natural language data and the concept-level opinions conveyed by these. In particular, the ensemble application of graph mining and multi-dimensionality reduction techniques on two common sense knowledge bases was exploited to develop a novel intelligent engine for open-domain opinion mining and sentiment analysis. The proposed approach, termed sentic computing, performs a clause-level semantic analysis of text, which allows the inference of both the conceptual and emotional information associated with natural language opinions and, hence, a more efficient passage from (unstructured) textual information to (structured) machine-processable data. The engine was tested on three different resources, namely a Twitter hashtag repository, a LiveJournal database and a PatientOpinion dataset, and its performance compared both with results obtained using standard sentiment analysis techniques and using different state-of-the-art knowledge bases such as Princeton’s WordNet, MIT’s ConceptNet and Microsoft’s Probase. Differently from most currently available opinion mining services, the developed engine does not base its analysis on a limited set of affect words and their co-occurrence frequencies, but rather on common sense concepts and the cognitive and affective valence conveyed by these. This allows the engine to be domain-independent and, hence, to be embedded in any opinion mining system for the development of intelligent applications in multiple fields such as Social Web, HCI and e-health. Looking ahead, the combined novel use of different knowledge bases and of common sense reasoning techniques for opinion mining proposed in this work, will, eventually, pave the way for development of more bio-inspired approaches to the design of natural language processing systems capable of handling knowledge, retrieving it when necessary, making analogies and learning from experience. 006.3
173	K lingvistické struktuře emocionálního významu v češtině / On the Linguistic Structure of Emotional Meaning in Czech Veselovská, Kateřina January 2015 (has links) Title: On the Linguistic Structure of Emotional Meaning in Czech Author: Mgr. Kateřina Veselovská Department: Institute of Formal and Applied Linguistics Supervisor: Prof. PhDr. Eva Hajičová, DrSc., Institute of Formal and Applied Linguistics Keywords: emotional meaning, linguistic structure, sentiment analysis, opinion mining, evaluative language Abstract: This thesis has two main goals. First, we provide an analysis of language means which together form an emotional meaning of written utterances in Czech. Sec- ond, we employ the findings concerning emotional language in computational applications. We provide a systematic overview of lexical, morphosyntactic, semantic and pragmatic aspects of emotional meaning in Czech utterances. Also, we propose two formal representations of emotional structures within the framework of the Prague Dependency Treebank and Construction Grammar. Regarding the computational applications, we focus on sentiment analysis, i.e. automatic extraction of emotions from text. We describe a creation of manually annotated emotional data resources in Czech and perform two main sentiment analysis tasks, polarity classification and opinion target identification on Czech data. In both of these tasks, we reach the state-of-the-art results.
174	Social media sentiment analysis for firm's revenue prediction Dimadi, Ioanna January 2018 (has links) The advent of the Internet and its social media platforms have affected people’s daily life. More and more people use it as a tool in order to communicate, exchange opin-ions and share information with others. However, those platforms have not only been used for socializing but also for expressing people’s product preferences. This wide spread of social networking sites has enabled companies to take advantage of them as an important way of approaching their target audience. This thesis focuses on study-ing the influence of social media platforms on the revenue of a single organization like Nike that uses them actively. Facebook and Twitter, two widely-used social me-dia platforms, were investigated with tweets and comments produced by consumer’s online discussions in brand’s hosted pages being gathered. This unstructured social media data were collected from 26 Nike official pages, 13 fan pages from each plat-form and their sentiment was analyzed. The classification of those comments had been done by using the Valence Aware Dictionary and Sentiment Reasoner (VADER), a lexicon-based approach that is implemented for social media analysis. After gathering the five-year Nike’s revenue, the degree to which these could be affected by the clas-sified data was examined by using multiple stepwise linear regression analysis. The findings showed that the fraction of positive/total for both Facebook and Twitter ex-plained 84.6% of the revenue’s variance. Fitting this data on the multiple regression model, Nike’s revenue could be forecast with a root mean square error around 287 billion. Social media Facebook Twitter Sentiment analysis VADER Linear regression Information Systems, Social aspects
175	Ontology Based Framework for Conceptualizing Human Affective States and Their Influences Abaalkhail, Rana 12 November 2018 (has links) The study of human affective states and their influences has been a research interest in psychology for some time. Fortunately, the presence of an affective computing paradigm allows us to use theories and findings from the discipline of psychology in the representation and development of human affective applications. However, because of the complexity of the subject, it is possible to misunderstand concepts that are shared via human and/or computer communications. With the appearance of technological innovations in our lives, for instance the SemanticWeb and the Web Ontology Language (OWL), there is a stronger need for computers to better understand human affective states and their influences. The use of an ontology can be beneficial in order to represent human affective states and their influences in a machine-understandable format. Truly, ontologies provide powerful tools to make sense of data. Our thesis proposes HASIO, a Human Affective States and their Influences Ontology, designed based on existing psychological theories. HASIO was developed to represent the knowledge that is necessary to model affective states and their influences in a computerized format. It describes the human affective states (Emotion, Mood and Sentiment) and their influences (Personality, Need and Subjective well-being) and conceptualizes their models and recognition methods. HASIO also represents the relationships between affective states and the factors that influence them. We surveyed and analyzed existing ontologies regarding human affective states and their influences to realize the significance and profit of developing our proposed ontology (HASIO). We follow the Methontology approach, a comprehensive engineering methodology for ontology building, to design and build HASIO. An important aspect in determining the ontology scope is Competency Questions (CQs). We configure HASIO CQs by analyzing the resources from psychology theories, available lexicons and existing ontologies. In this thesis, we present the development, modularization and evaluation of HASIO. HASIO can profit from the modularization process by dividing the whole ontology in self-contained modules that are easy to reuse and maintain. The ontology is evaluated through Question Answering system (HASIOQA), a task-based evaluation system, for validation. We design and develop a natural language interface system for this purpose. Moreover, the proposed ontology was evaluated through the Ontology Pitfall Scanner for verification and correctness against several criteria. Furthermore, HASIO was used in sentiment analysis on diffrent Twitter dataset. We designed and developed a tweet polarity calculation algorithm. Additionally, we compare our ontology result with machine learning technique. We demonstrate and highlight the advantage of using ontology in sentiment analysis. Ontology Human Affective States Human Influnces Ontology Evaluation Ontology sentiment analysis Ontology Development Ontology modularization Ontology Competency Questions
176	深度學習於中文句子之表示法學習 / Deep learning techniques for Chinese sentence representation learning 管芸辰, Kuan, Yun Chen Unknown Date (has links) 本篇論文主要在探討如何利用近期發展之深度學習技術在於中文句子分散式表示法學習。近期深度學習受到極大的注目，相關技術也隨之蓬勃發展。然而相關的分散式表示方式，大多以英文為主的其他印歐語系作為主要的衡量對象，也據其特性發展。除了印歐語系外，另外漢藏語系及阿爾泰語系等也有眾多使用人口。還有獨立語系的像日語、韓語等語系存在，各自也有其不同的特性。中文本身屬於漢藏語系，本身具有相當不同的特性，像是孤立語、聲調、量詞等。近來也有許多論文使用多語系的資料集作為評量標準，但鮮少去討論各語言間表現的差異。本論文利用句子情緒分類之實驗，來比較近期所發展之深度學習之技術與傳統詞向量表示法的差異，我們將以TF-IDF為基準比較其他三個PVDM、Siamese-CBOW及Fasttext的表現差異，也深入探討此些模型對於中文句子情緒分類之表現。 / The paper demonstrates how the deep learning methods published in recent years applied in Chinese sentence representation learning. Recently, the deep learning techniques have attracted the great attention. Related areas also grow enormously. However, the most techniques use Indo-European languages mainly as evaluation objective and developed corresponding to their properties. Besides Indo-European languages, there are Sino-Tibetan language and Altaic language, which also spoken widely. There are only some independent languages like Japanese or Korean, which have their own properties. Chinese itself is belonged to Sino-Tibetan language family and has some characters like isolating language, tone, count word...etc.Recently, many publications also use the multilingual dataset to evaluate their performance, but few of them discuss the differences among different languages. This thesis demonstrates that we perform the sentiment analysis on Chinese Weibo dataset to quantize the effectiveness of different deep learning techniques. We compared the traditional TF-IDF model with PVDM, Siamese-CBOW, and FastText, and evaluate the model they created. 深度學習分散式表示情緒分類 Deep learning Distributed representation Sentiment analysis
177	Extending Game User Experience - Exploring Player Feedback and Satisfaction : The Birth of the Playsona Strååt, Björn January 2017 (has links) Video games are experience-based products and user satisfaction is key for their popularity. To design for as strong an experience as possible, game developers incorporate evaluation methods that help to discover their users’ expectations and needs. Despite such efforts, problems still occur with the game design that lower the user experience. To counter these problems, the evaluation methods should be investigated and improved. To address this need, I have explored various design tools and user experience theories. Applying these in a game evaluation context, I have analyzed user-created game reviews and conducted longitudinal user interview- and game diary studies in connection to playing a newly released game, in other words different methods to take advantage of users' expectations, opinions, attitudes and experiences. One result of the analysis of the obtained data is a set of “slogans” that illustrate how and why users lose interest in a game. A second result is a method for extracting user attitudes from pre-produced user reviews and how this can be used in game development. Thirdly, I introduce an alternative model, aimed at game user experience development, the Playsona. The Playsona is a lightweight tool that introduces a variant of the Persona-method, specifically for video game design. / <p>At the time of the doctoral defense, the following paper was unpublished and had a status as follows: Paper 4: Manuscript.</p> video game design user experience game user experience playsona aspect based sentiment analysis focused player diaries Media and Communications Medie- och kommunikationsvetenskap
178	Statistical Dialog Management for Health Interventions Yasavur, Ugan 09 July 2014 (has links) Research endeavors on spoken dialogue systems in the 1990s and 2000s have led to the deployment of commercial spoken dialogue systems (SDS) in microdomains such as customer service automation, reservation/booking and question answering systems. Recent research in SDS has been focused on the development of applications in different domains (e.g. virtual counseling, personal coaches, social companions) which requires more sophistication than the previous generation of commercial SDS. The focus of this research project is the delivery of behavior change interventions based on the brief intervention counseling style via spoken dialogue systems. Brief interventions (BI) are evidence-based, short, well structured, one-on-one counseling sessions. Many challenges are involved in delivering BIs to people in need, such as finding the time to administer them in busy doctors' offices, obtaining the extra training that helps staff become comfortable providing these interventions, and managing the cost of delivering the interventions. Fortunately, recent developments in spoken dialogue systems make the development of systems that can deliver brief interventions possible. The overall objective of this research is to develop a data-driven, adaptable dialogue system for brief interventions for problematic drinking behavior, based on reinforcement learning methods. The implications of this research project includes, but are not limited to, assessing the feasibility of delivering structured brief health interventions with a data-driven spoken dialogue system. Furthermore, while the experimental system focuses on harmful alcohol drinking as a target behavior in this project, the produced knowledge and experience may also lead to implementation of similarly structured health interventions and assessments other than the alcohol domain (e.g. obesity, drug use, lack of exercise), using statistical machine learning approaches. In addition to designing a dialog system, the semantic and emotional meanings of user utterances have high impact on interaction. To perform domain specific reasoning and recognize concepts in user utterances, a named-entity recognizer and an ontology are designed and evaluated. To understand affective information conveyed through text, lexicons and sentiment analysis module are developed and tested. spoken dialog systems health interventions reinforcement learning dialog management embodied conversational agents ECAs IVAs semantic networks sentiment analysis
179	Using Social Media Networks for Measuring Consumer Confidence: Problems, Issues and Prospects Igboayaka, Jane-Vivian Chinelo Ezinne January 2015 (has links) This research examines the confluence of consumers’ use of social media to share information with the ever-present need for innovative research that yields insight into consumers’ economic decisions. Social media networks have become ubiquitous in the new millennium. These networks, including, among others: Facebook, Twitter, Blog, and Reddit, are brimming with conversations on an expansive array of topics between people, private and public organizations, governments and global institutions. Preliminary findings from initial research confirms the existence of online conversations and posts related to matters of personal finance and consumers’ economic outlook. Meanwhile, the Consumer Confidence Index (CCI) continues to make headline news. The issue of consumer confidence (or sentiment) in anticipating future economic activity generates significant interest from major players in the news media industry, who scrutinize its every detail and report its implications for key players in the economy. Though the CCI originated in the United States in 1946, variants of the survey are now used to track and measure consumer confidence in nations worldwide. In light of the fact that the CCI is a quantified representation of consumer sentiments, it is possible that the level of confidence consumers have in the economy could be deduced by tracking the sentiments or opinions they express in social media posts. Systematic study of these posts could then be transformed into insights that could improve the accuracy of an index like the CCI. Herein lies the focus of the current research—to analyze the attributes of data from social media posts, in order to assess their capacity to generate insights that are novel and/or complementary to traditional CCI methods. The link between data gained from social media and the survey-based CCI is perhaps not an obvious one. But our research will use a data extraction tool called NetBase Insight Workbench to mine data from the social media networks and then apply natural language processing to analyze the social media content. Also, KH Coder software will be used to perform a set of statistical analyses on samples of social media posts to examine the co-occurrence and clustering of words. The findings will be used to expose the strengths and weaknesses of the data and to assess the validity and cohesion of the NetBase data extraction tool and its suitability for future research. In conclusion, our research findings support the analysis of opinions expressed in social media posts as a complement to traditional survey-based CCI approaches. Our findings also identified a key weakness with regards to the degree of ‘noisiness’ of the data. Although this could be attributed to the ‘modeling’ error of the data mining tool, there is room for improvement in the area of association—of discerning the context and intention of posts in online conversations. Consumer Confidence Index Natural Language Processing Social Media Networks NetBase Data Mining KH Coder Consumer Confidence Survey Sentiment Analysis
180	Konkurenční analýza předních ICT firem na českém trhu / Competitive analysis of leading ICT companies on the Czech market Dvořák, Oskar January 2012 (has links) This thesis deals with the field of Competitive Intelligence in relation to the possibilities of application of its methods and tools for competitive analysis of the market environment using modern virtual social networks. Theoretical part focuses on the characteristics of the market environment of ICT companies by using Porter's analysis and then it is focused on the description of selected tools and methods used to processing unstructured data and social networks analysis. The practical part is based on a real project which ran from early March 2013 at IBM Company. Practical part demonstrates current possibilities of information technology in the field of Competitive Intelligence.

Search results