• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 201
  • 43
  • 17
  • 14
  • 10
  • 9
  • 7
  • 7
  • 5
  • 3
  • 2
  • 2
  • 2
  • 2
  • 1
  • Tagged with
  • 345
  • 345
  • 97
  • 96
  • 88
  • 74
  • 72
  • 71
  • 67
  • 61
  • 58
  • 58
  • 58
  • 55
  • 52
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
1

An empirical analysis of lexical polarity and contextual valence shifters for opinion classification

Longton, Adam 11 1900 (has links)
This work is concerned with the automatic understanding of evaluative text. We investigate sentence level opinion polarity prediction by assigning lexical polarities and deriving sentence polarity from these with the use of contextual valence shifters. A methodology for iterative failure analysis is developed and used to refine our lexicon and identify new contextual shifters. Algorithms are presented that employ these new shifters to improve sentence polarity prediction accuracy beyond that of a state-of-the-art existing algorithm in the domain of consumer product reviews. We then apply the best configuration of our algorithm to the domain of movie reviews.
2

An empirical analysis of lexical polarity and contextual valence shifters for opinion classification

Longton, Adam 11 1900 (has links)
This work is concerned with the automatic understanding of evaluative text. We investigate sentence level opinion polarity prediction by assigning lexical polarities and deriving sentence polarity from these with the use of contextual valence shifters. A methodology for iterative failure analysis is developed and used to refine our lexicon and identify new contextual shifters. Algorithms are presented that employ these new shifters to improve sentence polarity prediction accuracy beyond that of a state-of-the-art existing algorithm in the domain of consumer product reviews. We then apply the best configuration of our algorithm to the domain of movie reviews.
3

An empirical analysis of lexical polarity and contextual valence shifters for opinion classification

Longton, Adam 11 1900 (has links)
This work is concerned with the automatic understanding of evaluative text. We investigate sentence level opinion polarity prediction by assigning lexical polarities and deriving sentence polarity from these with the use of contextual valence shifters. A methodology for iterative failure analysis is developed and used to refine our lexicon and identify new contextual shifters. Algorithms are presented that employ these new shifters to improve sentence polarity prediction accuracy beyond that of a state-of-the-art existing algorithm in the domain of consumer product reviews. We then apply the best configuration of our algorithm to the domain of movie reviews. / Science, Faculty of / Computer Science, Department of / Graduate
4

A framework and practical implementation for sentiment analysis and aspect exploration

Qin, Zhenxin January 2017 (has links)
With the upsurge of Web 2.0, customers are able to share their opinions and feelings about products and services, politics, economic shifts, current events and any number of other topics on the Web. This information, if leveraged effectively, can provide rich and valuable insights, such as: input for vendors to create successful marketing strategies, understanding of areas of improvement in products and services and tracking political opinion. The problem with this information is that it is unorganised and unstructured, therefore, it is difficult to assess automatically and in bulk. Studies in the field of sentiment analysis aim to provide a solution to determining the polarities of, and gain an overview of, the wider public opinion behind certain topics in a large volume of textual data. This research provides a novel framework and a solid, practical implementation of the proposed framework for fine-grained sentiment analysis. The framework supports mixed-opinion text and multiword expressions when analysing the sentiments expressed and the aspects that those sentiments relate to. This research uses datasets across two domains in the customer reviews area (phone products and hotel services) to evaluate the proposed framework for its reliability and validity. A sizeable performance improvement was noted whereby the proposed methodology yielded a result of 91.3% accuracy in sentiment classification, as compared to the baseline (SentiWordNet), which had a result of 71.0%. In addition, an accuracy of 92.5% was observed for the aspect analysis automatically generated across the two domains tested.
5

Sentiment Analysis of Data from Online Forums on the Newborn Genome Sequencing

Poursepanj, Hamid January 2015 (has links)
In this thesis, we classified user comments posted on online forums related to “Newborn Genome Sequencing” (NGS). User comments were annotated as irrelevant, positive, negative, or mixed by two annotators. The objective was to create a classification model that could predict the sentiment of each user comment with a high accuracy. To compare classifiers, a baseline classifier (Accuracy 52%) was created. We created a single classifier (called flat comment-level classifier with accuracy of 65.14%) to classify comments into irrelevant, positive, negative, or mixed. A more sophisticated classifier, named two-level comment classifier, consisting of two classifiers, was created (Accuracy 69.81%): - The first classifier that classified each comment into relevant or irrelevant ones. - The second classifier that classified each relevant comment (predicted by the first classifier) as positive, negative, or mixed. 18 extra features were generated to improve the accuracy of the flat classification compared to baseline classifier (from 52% to 65.14% for flat comment classification, and 69.48% to 69.81% for two-level comment classification). Attempts were made to enhance the result of the two-level comment classifier by using the discourse structure of each sentence in a comment. The accuracy achieved by this enhanced two-level classifier was 64.24%. Therefore, removing irrelevant EDUs did not improve the accuracy. To achieve the above-mentioned enhancement, all comments were segmented into their consisting elementary discourse units (EDUs). We removed irrelevant EDUs from the relevant comments before running the second classifier. Furthermore, we performed EDU-level classification by creating two classifiers: - A flat classifier: classified all EDUs into irrelevant, positive, negative, or neutral - A two-level EDU: classified EDUs, first, into relevant or irrelevant and then classified the relevant EDUs (predicted by the first classifier) into positive, negative, or neutral ones. The accuracy achieved for the flat EDU-level classifier was 81.84%. However, due to the highly imbalanced nature of the EDU dataset, the F-measure for positive, negative, and neutral class was very low. Under-sampling was performed to improve the F-measure for positive, negative, and neutral class. Another topic investigated was to know why forum users supported or rejected NGS. To extract the arguments, the comments were segmented into EDUs. Following segmenting, each EDU was annotated as relevant or irrelevant to NGS. Each relevant EDU was annotated as for or against NGS. Topic related EDUs were selected as well as two EDUs before and after the topic-related EDUs. Bigrams, trigrams, four-grams and five-grams were created from extracted EDUs. Five-grams were more meaningful for human annotators, and were therefore favoured and ranked based on frequency in the dataset. Following ranking of the five grams, the top five were selected as the possible arguments.
6

Outsider trading: trading on twitter sentiment

Stevens, Joshua 20 April 2023 (has links) (PDF)
This study aims to establish if a relationship between the investor sentiment generated from social media posts, such as Tweets, and the return on securities exists. If a relationship exists, one would be able to obtain an informational advantage from public information and outperform the market on a risk-adjusted basis. This would give the “outsider” information processed the predictive power of insider information, hence the title of the paper. The study makes use of Bloomberg's social activity data, which through natural language processing, allows for investor sentiment to be obtained by analysing a combination of Twitter and Stock Twits posts. This paper makes use of a three-prong approach, firstly examining if investor sentiment is a predictor of next-day returns. Next, an event study methodology is used to examine the optimal holding period, which can further be expanded to test market efficiency. Lastly, this paper considers the asymmetric risk aversion as outlined by Kahneman and Tversky (1979). Results show that there is little to no correlation between sentiment and next day returns. There is evidence for a multi-day holding period being optimal but statistically insignificant and there is no evidence found for asymmetric risk aversion.
7

Predicting sentiment-mention associations in product reviews

Vaswani, Vishwas January 1900 (has links)
Master of Science / Department of Computing and Information Sciences / Doina Caragea / With the rising trend in social networking, more people express their opinions on the web. As a consequence, there has been an increase in the number of blogs where people write reviews about the products they buy or services they experience. These reviews can be very helpful to other potential customers who want to know the pros and cons of a product, and also to manufacturers who want to get feedback from customers about their products. Sentiment analysis of online data (such as review blogs) is a rapidly growing field of research in Machine Learning, which can leverage online reviews and quickly extract the sentiment of a whole blog. The accuracy of a sentiment analyzer relies heavily on correctly identifying associations between a sentiment (opinion) word and the targeted mention (token or object) in blog sentences. In this work, we focus on the task of automatically identifying sentiment-mention associations, in other words, we identify the target mention that is associated with a sentiment word in a sentence. Support Vector Machines (SVM), a supervised machine learning algorithm, was used to learn classifiers for this task. Syntactic and semantic features extracted from sentences were used as input to the SVM algorithm. The dataset used in the work has reviews from car and camera domain. The work is divided into two phases. In the first phase, we learned domain specific classifiers for the car and camera domains, respectively. To further improve the predictions of the domain specific classifiers we investigated the use of transfer learning techniques in the second phase. More precisely, the goal was to use knowledge from a source domain to improve predictions for a target domain. We considered two transfer learning approaches: a feature level fusion approach and a classifier level fusion approach. Experimental results show that transfer learning can help to improve the predictions made using the domain specific classifier approach. While both the feature level and classifier level fusion approaches were shown to improve the prediction accuracy, the classifier level fusion approach gave better results.
8

Recommender systems based on online social networks : an Implicit Social Trust And Sentiment analysis approach

Alahmadi, Dimah January 2017 (has links)
Recommender systems (RSs) provide personalised suggestions of information or products relevant to user's needs. RSs are considered as powerful tools that help users to find interesting items matching their own taste. Although RSs have made substantial progress in theory and algorithm development and have achieved many commercial successes, how to utilise the widely available information on Online Social Networks (OSNs) has largely been overlooked. Noticing this gap in existing research on RSs and taking into account a user's selection being greatly influenced by his/her trusted friends and their opinions, this thesis proposes a novel personalised Recommender System framework, so-called Implicit Social Trust and Sentiment (ISTS) based RSs. The main motivation was to overcome the overlooked use of OSNs in Recommender Systems and to utilise the widely available information from such networks. This work also designs solutions to a number of challenges inherent to the RSs domain, such as accuracy, cold-start, diversity and coverage. ISTS improves the existing recommendation approaches by exploring a new source of data from friends' short posts in microbloggings. In the case of new users who have no previous preferences, ISTS maps the suggested recommendations into numerical rating scales by applying the three main components. The first component is measuring the implicit trust between friends based on their intercommunication activities and behaviour. Owing to the need to adapt friends' opinions, the implicit social trust model is designed to include the trusted friends and give them the highest weight of contribution in recommendation encounter. The second component is inferring the sentiment rating to reflect the knowledge behind friends' short posts, so-called micro-reviews. The sentiment behind micro-reviews is extracted using Sentiment Analysis (SA) techniques. To achieve the best sentiment representation, our approach considers the special natural environment in OSNs brief posts. Two Sentiment Analysis methodologies are used: a bag of words method and a probabilistic method. The third ISTS component is identifying the impact degree of friends' sentiments and their level of trust by using machine learning algorithms. Two types of machine learning algorithms are used: classification models and regressions models. The classification models include Naive Bayes, Logistic Regression and Decision Trees. Among the three classification models, Decision Trees show the best Mean absolute error (MAE) at 0.836. Support Vector Regression performed the best among all models at 0.45 of MAE. This thesis also proposes an approach with further improvement over ISTS, namely Hybrid Implicit Social Trust and Sentiment (H-ISTS). The enhanced approach applies improvements by optimising trust parameters to identify the impact of the features (re-tweets and followings/followers list) on recommendation results. Unlike the ISTS which allocates equal weight to trust features, H-ISTS provides different weights to determine the different effects of the two trust features. As a result, we found that H-ISTS improved the MAE to be 0.42 which is based on Support Vector Regression. Further, it increases the number of trust features from two to five features in order to include the influence of these features in rating predictions. The integration of the new approach H-ISTS with a Collaborative Filtering recommender system, in particular memory-based, is investigated next. Therefore, existing users with a history of ratings can receive recommendations by fusing their own tastes and their friends' preferences using the two type of memory-based methods: user-based and item-based. H-ISTSitem is the integration of H-ISTS and item-based which provides the lowest error at 0.7091. The experiments show that diversity is better achieved using the H-ISTSuser which is the integration of H-ISTS and user-based technique. To evaluate the performance of these approaches, two real social datasets are collected from Twitter. To verify the proposed framework, the experiments are conducted and the results are compared against the most relevant baselines which confirmed that RSs have been successfully improved using OSNs. These enhancements demonstrate the effectiveness and promises of the proposed approach in RSs.
9

Aspect Based Sentiment Analysis On Review Data

Xue, Wei 04 December 2017 (has links)
With proliferation of user-generated reviews, new opportunities and challenges arise. The advance of Web technologies allows people to access a large amount of reviews of products and services online. Knowing what others like and dislike becomes increasingly important for their decision making in online shopping. The retailers also care more than ever about online reviews, because a vast pool of reviews enables them to monitor reputations and collect feedbacks efficiently. However, people often find difficult times in identifying and summarizing fine-grained sentiments buried in the opinion-rich resources. The traditional sentiment analysis, which focuses on the overall sentiments, fails to uncover the sentiments with regard to the aspects of the reviewed entities. This dissertation studied the research problem of Aspect Based Sentiment Analysis (ABSA), which is to reveal the aspect-dependent sentiment information of review text. ABSA consists of several subtasks: 1) aspect extraction, 2) aspect term extraction, 3) aspect category classification, and 4) sentiment polarity classification at aspect level. We focused on the approach of topic models and neural networks for ABSA. First, to extract the aspects from a collection of reviews and to detect the sentiment polarity regarding the aspects in each review, we proposed a few probabilistic graphical models, which can model words distribution in reviews and aspect ratings at the same time. Second, we presented a multi-task learning model based on long-short term memory and convolutional neural network for aspect category classification and aspect term extraction. Third, for aspect-level sentiment polarity classification, we developed a gated convolution neural network, which can be applied to aspect category sentiment analysis as well as aspect target sentiment analysis.
10

Unsupervised Aspect Discovery from Online Consumer Reviews

Suleman, Kaheer 18 March 2104 (has links)
The success of on-line review websites has led to an overwhelming number of on-line consumer reviews. These reviews have become an important tool for consumers when making a decision to purchase a product. This growth has led to the need for applications that enable this information to be presented in a way that is meaningful. These applications often rely on domain specific semantic lexicons which are both expensive and time consuming to make. The following thesis proposes an unsupervised approach for product aspect discovery in on-line consumer reviews. We apply a two step hierarchical clustering process in which we first cluster based on the semantic similarity of the contexts of terms and then on the similarity of the hypernyms of the cluster members. The method also includes a process for assigning class labels to each of the clusters. Finally an experiment showing how the proposed methods can be used to measure aspect based sentiment is performed. The methods proposed in this thesis are evaluated on a set of 157,865 reviews from a major commercial website and found that the two-step clustering process increases cluster F-scores over a single round of clustering. Finally, the proposed methods are compared to a state of the art topic modelling approach by Titov and McDonald (2008).

Page generated in 0.1126 seconds