  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
1

The Insignificance of Feature Frequency in Classifying Gender of Twitter Tweets

Kroft, Amanda Marie 11 April 2013
In 2011, Internet users spent almost 23% of their time on social media sites such as Twitter and Facebook. Twitter alone was estimated to have over 200 million active users. With social media being such a popular online pastime, a tremendous amount of information becomes available from the posts that users put on social media sites. This information has the potential to reveal details about the social media users, such as the relationship between characteristics of the users and what they post. This relationship is a hot research topic, and one of the most frequently studied characteristics is the gender of a user. Feature frequency is often included in such a task, but this thesis shows that for Twitter tweets it either does not contribute significantly to gender classification or hinders classification. / McAnulty College and Graduate School of Liberal Arts; / Computational Mathematics / MS; / Thesis;
2

Database for Storing and Analyzing Tweets Posted During Disasters

Saha, Debarshi January 1900
Master of Science / Department of Computer Science / Doina Caragea / In the last few decades, we have witnessed many natural disasters that have shaken nations across the world. Millions of people have lost their lives, cities have been destroyed, and many people have been injured or left homeless. Sometimes hours or even days after a disaster, people are still stuck at the disaster sites, powerless, homeless and without food, as the rescue teams do not always get information about people in need in a timely manner. Whenever there is a natural disaster like a hurricane or an earthquake, people start tweeting about it. Most of the tweets are posted by users who are at the disaster sites, and may contain information about victims of the disaster: where they are and what the problem is, which areas the rescue teams should focus on, or whether someone needs special help. Such information can be very useful for the response teams, which can leverage it in the recovery or rescue process. However, rescue teams are faced with an information overload problem, due to the large number of tweets they need to sift through. To help with this issue, computational approaches can be used to analyze and prioritize information that may be useful to the rescue teams. In this project, we have crawled tweets related to natural disasters and extracted useful information into CSV files. Then, we have designed and developed a database to store the tweets. The design of the database is such that it will help us to query and gain information about a natural disaster. We have also performed some statistical analysis, such as deriving word clouds of the tweets posted during natural disasters. The analysis shows the topics about which users who tweet about disasters are most concerned.
The word cloud analysis can help in comparing multiple natural disasters to understand patterns that are common or specific to disasters in terms of how Twitter users talk about them.
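A word cloud of the kind described in this abstract is driven by per-word frequency counts. The following is a minimal sketch of that counting step only; the sample tweets, the short stop-word list, and the helper name `word_frequencies` are hypothetical and are not taken from the thesis.

```python
import re
from collections import Counter

# Hypothetical stop-word list; a real analysis would use a fuller one.
STOP_WORDS = {"the", "a", "is", "in", "at", "and", "we", "are", "to", "of"}

def word_frequencies(tweets):
    """Count lowercase word occurrences across tweets, skipping stop words."""
    counts = Counter()
    for tweet in tweets:
        # Keep alphabetic tokens (optionally hashtag-prefixed); drop punctuation.
        tokens = re.findall(r"#?[a-z]+", tweet.lower())
        counts.update(t for t in tokens if t not in STOP_WORDS)
    return counts

# Hypothetical sample tweets, invented for illustration only.
sample = [
    "Flood water rising in the shelter, we need water and food",
    "Shelter at the school is open, food available",
]
freqs = word_frequencies(sample)
print(freqs.most_common(3))
```

Computing one such frequency table per disaster and comparing the top-ranked words is one simple way to look for terms that are shared across disasters or specific to one of them.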
3

A Large Collection Learning Optimizer Framework

Chakravarty, Saurabh 30 June 2017
Content is generated on the web at an increasing rate. The type of content varies from text on a traditional webpage to text on social media portals (e.g., social network sites and microblogs). One such example of social media is the microblogging site Twitter. Twitter is known for its high level of activity during live events, natural disasters, and events of global importance. Challenges with the data in the Twitter universe include the limit of 140 characters on the text length. Because of this limitation, the vocabulary in the Twitter universe includes short abbreviations of sentences, emojis, hashtags, and other non-standard usage. Consequently, traditional text classification techniques are not very effective on tweets. Fortunately, sophisticated text processing techniques like cleaning, lemmatizing, and removal of stop words and special characters will give us clean text which can be further processed to derive richer word semantic and syntactic relationships using state of the art feature selection techniques like Word2Vec. Machine learning techniques, using word features that capture semantic and context relationships, can be of benefit regarding classification accuracy. Improving text classification results on Twitter data would pave the way to categorize tweets relative to human defined real world events. This would allow diverse stakeholder communities to interactively collect, organize, browse, visualize, analyze, summarize, and explore content and sources related to crises, disasters, human rights, inequality, population growth, resiliency, shootings, sustainability, violence, etc. Having the events classified into different categories would help us study causality and correlations among real world events. To check the efficacy of our classifier, we would compare our experimental results with an Association Rules (AR) classifier. This classifier composes its rules around the most discriminating words in the training data. 
The hierarchy of rules, along with an ability to tune to a support threshold, makes it an effective classifier for scenarios where short text is involved. Traditionally, developing classification systems for these purposes requires a great degree of human intervention. Constantly monitoring new events, and curating training and validation sets, is tedious and time intensive. Significant human capital is required for such annotation endeavors. Also, involved efforts are required to tune the classifier for best performance. Developing and tuning classifiers manually using human intervention would not be a viable option if we are to monitor events and trends in real-time. We want to build a framework that would require very little human intervention to build and choose the best among the available performing classification techniques in our system. Another challenge with classification systems is related to their performance with unseen data. For the classification of tweets, we are continually faced with a situation where a given event contains a certain keyword that is closely related to it. If a classifier, built for a particular event, due to overfitting to what is a biased sample with limited generality, is faced with new tweets with different keywords, accuracy may be reduced. We propose building a system that will use very little training data in the initial iteration and will be augmented with automatically labelled training data from a collection that stores all the incoming tweets. A system that is trained on incoming tweets that are labelled using sophisticated techniques based on rich word vector representation would perform better than a system that is trained on only the initial set of tweets. We also propose to use sophisticated deep learning techniques like Convolutional Neural Networks (CNN) that can capture the combination of the words using an n-gram feature representation. 
Such sophisticated feature representation could account for the instances when the words occur together. We divide our case studies into two phases: preliminary and final case studies. The preliminary case studies focus on selecting the best feature representation and classification methodology out of the AR and the Word2Vec based Logistic Regression classification techniques. The final case studies focus on developing the augmented semi-supervised training methodology and the framework to develop a large collection learning optimizer to generate a highly performant classifier. For our preliminary case studies, we are able to achieve an F1 score of 0.96 that is based on Word2Vec and Logistic Regression. The AR classifier achieved an F1 score of 0.90 on the same data. For our final case studies, we are able to show improvements of F1 score from 0.58 to 0.94 in certain cases based on our augmented training methodology. Overall, we see improvement in using the augmented training methodology on all datasets. / Master of Science
4

Analysis of Scientific Conference Tweets Using a Codebook and the Tweet Classifier Software

Lemke, Steffen, Mazarakis, Athanasios 26 March 2018
With its focused mode of operation, the microblogging service Twitter has achieved a considerable presence as a communication medium in various areas of life over the past decade. One particular way in which Twitter's increased visibility in everyday communication often manifests itself is the deliberate use of hashtags. Companies use hashtags to bundle the discussions about their products that take place on Twitter, while organizers of major events and television programs announce their own official hashtags to encourage viewers to use the service as a discussion platform in parallel with the actual event. [... from the introduction]
5

Tweet Interaction with Beliebers: A Text Analysis of How Justin Bieber Constructs Community with an Intended Audience through Tweets on Twitter

Söderström, Mimmi January 2013
The purpose of this essay is to examine, through a text analysis of tweets on the microblog Twitter, how interaction is created and maintained between idol and fans. The example used is the pop star Justin Bieber and how tweets are constructed on his Twitter page to confirm questions of pseudo-interaction, community, and presence with his followers, who are often called "Beliebers". I want to find out which communication codes are used and how theories of interaction can be linked to the tweets I examine more closely. The main method is a qualitative text analysis, used to see whether clear indications can be found in language use, forms of address, and content that can be linked to theories of how interaction with the audience is presented, and whether the audience is seen as unknown or observable. The results of the study show that the central communication model used in the star's Twitter feed focuses on community and togetherness in the message conveyed rather than on the actual transfer of information between sender (Bieber) and receivers (fans, "Beliebers", and followers).
6

@therealDonaldTrump EFFECT: DONALD TRUMP’S SOCIAL INFLUENCE THROUGH THE USE OF TWITTER

Schuhmeier, Phoenisha 01 June 2019
There has been a recent rise in the use of social media as a platform for political communication. President Donald Trump, who is very influential due in part to his celebrity status as well as his presidential position, has had the power to influence his millions of followers on Twitter. For this research, I used a content analysis and comparative analysis approach on eight tweets made by President Donald Trump, which targeted Mexican immigration, Maxine Waters, LeBron James, Don Lemon, the National Football League (NFL) national anthem protesters and Elizabeth Warren, and three tweets made by Senator Ted Cruz, which targeted Mexican immigration. I found that for Mexican immigration, Twitter commenters on Trump’s tweets were more prone to agree with him, whereas commenters on Cruz’s tweets disagreed with him.
7

Monitoring Tweets for Depression to Detect At-Risk Users

Jamil, Zunaira January 2017
According to the World Health Organization, mental health is an integral part of health and well-being. Mental illness can affect anyone, rich or poor, male or female. One such example of mental illness is depression. In Canada 5.3% of the population had presented a depressive episode in the past 12 months. Depression is difficult to diagnose, resulting in high under-diagnosis. Diagnosing depression is often based on self-reported experiences, behaviors reported by relatives, and a mental status examination. Currently, authorities use surveys and questionnaires to identify individuals who may be at risk of depression. This process is time-consuming and costly. We propose an automated system that can identify at-risk users from their public social media activity. More specifically, we identify at-risk users from Twitter. To achieve this goal we trained a user-level classifier using Support Vector Machine (SVM) that can detect at-risk users with a recall of 0.8750 and a precision of 0.7778. We also trained a tweet-level classifier that predicts if a tweet indicates distress. This task was much more difficult due to the imbalanced data. In the dataset that we labeled, we came across 5% distress tweets and 95% non-distress tweets. To handle this class imbalance, we used undersampling methods. The resulting classifier uses SVM and performs with a recall of 0.8020 and a precision of 0.1237. Our system can be used by authorities to find a focused group of at-risk users. It is not a platform for labeling an individual as a patient with depression, but only a platform for raising an alarm so that the relevant authorities could take necessary interventions to further analyze the predicted user to confirm his/her state of mental health. We respect the ethical boundaries relating to the use of social media data and therefore do not use any user identification information in our research.
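The abstract reports precision and recall separately for the two classifiers. To compare them on a single number, one can combine each pair into an F1 score (the harmonic mean of precision and recall). The sketch below is only that arithmetic; the F1 values it produces are derived here and are not reported in the abstract, and the function name `f1_score` is a local helper, not a library call.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Precision and recall as reported in the abstract.
user_level = f1_score(precision=0.7778, recall=0.8750)
tweet_level = f1_score(precision=0.1237, recall=0.8020)

print(f"user-level F1:  {user_level:.3f}")
print(f"tweet-level F1: {tweet_level:.3f}")
```

The derived values are roughly 0.82 for the user-level classifier and 0.21 for the tweet-level classifier, which is consistent with the abstract's remark that the tweet-level task was much harder: its high recall comes at the cost of many false alarms.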
8

When Tweets Are Embedded, Who Gains the Upper Hand? : The Discursive Power Struggle on Finnish Digital News Articles before the 2019 Parliamentary Election

Lehtinen, Don January 2021
This Master’s thesis uses multimodal discourse analysis to examine the discursive power struggle between politicians and journalists in Finnish digital news articles in which politicians’ tweets are embedded or quoted. Embedded and quoted tweets are one of the premier links between Twitter and digital news platforms but have for the most part been left out of the field of discourse analysis. This research tries to fill that gap, focusing on a period of one month before the 2019 parliamentary election in Finland. The research material consists of 18 articles from two of the biggest digital news platforms in Finland, Iltalehti and Ilta-Sanomat. They are analyzed using Machin and Mayr’s seven-part scheme for critical discourse analysis, focusing on the embedded and quoted tweets in relation to the text’s discourse, and also on the intertwined textual and visual sides of the articles. The analysis shows that in most articles, the discourse portrayed in the tweets is not challenged by the journalist, meaning that the politicians most often come out on top in the discursive power struggle. The analysis also shows that there are multiple ways of challenging the discourse, but they are seldom used in the power struggle. In conclusion, as the tweets’ discourses often go unchallenged, both the politicians and Twitter as a platform arguably have disproportionate power to influence the discourse on digital news platforms as well as the readers of those platforms.
9

A deep multi-modal neural network for informative Twitter content classification during emergencies

Kumar, A., Singh, J.P., Dwivedi, Y.K., Rana, Nripendra P. 03 January 2020
People start posting tweets containing texts, images, and videos as soon as a disaster hits an area. The analysis of these disaster-related tweet texts, images, and videos can help humanitarian response organizations make better decisions and prioritize their tasks. Finding the informative content that can help in decision-making within the massive volume of Twitter content is a difficult task and requires a system to filter out the informative content. In this paper, we present a multi-modal approach to identify disaster-related informative content from Twitter streams using text and images together. Our approach is based on long short-term memory (LSTM) and VGG-16 networks, which show significant improvement in performance, as evident from the validation results on seven different disaster-related datasets. The F1-score ranged from 0.74 to 0.93 when tweet texts and images were used together, whereas with tweet text only it ranged from 0.61 to 0.92. From this result, it is evident that the proposed multi-modal system performs well in identifying disaster-related informative social media content.
10

Analysis of Scientific Conference Tweets Using a Codebook and the Tweet Classifier Software

Lemke, Steffen, Mazarakis, Athanasios January 2017
With its focused mode of operation, the microblogging service Twitter has achieved a considerable presence as a communication medium in various areas of life over the past decade. One particular way in which Twitter's increased visibility in everyday communication often manifests itself is the deliberate use of hashtags. Companies use hashtags to bundle the discussions about their products that take place on Twitter, while organizers of major events and television programs announce their own official hashtags to encourage viewers to use the service as a discussion platform in parallel with the actual event. [... from the introduction]
