Global ETD Search

Return to search

Iterative Matrix Factorization Method for Social Media Data Location Prediction

Since some of the location of where the users posted their tweets collected by social media company have varied accuracy, and some are missing. We want to use those tweets with highest accuracy to help fill in the data of those tweets with incomplete information. To test our algorithm, we used the sets of social media data from a city, we separated them into training sets, where we know all the information, and the testing sets, where we intentionally pretend to not know the location. One prediction method that was used in (Dukler, Han and Wang, 2016) requires appending one-hot encoding of the location to the bag of words matrix to do Location Oriented Nonnegative Matrix Factorization (LONMF). We improve further on this algorithm by introducing iterative LONMF. We found that when the threshold and number of iterations are chosen correctly, we can predict tweets location with higher accuracy than using LONMF.

Statistics and Probability

Identifer	oai:union.ndltd.org:CLAREMONT/oai:scholarship.claremont.edu:hmc_theses-1109
Date	01 January 2018
Creators	Suaysom, Natchanon
Publisher	Scholarship @ Claremont
Source Sets	Claremont Colleges
Detected Language	English
Type	text
Format	application/pdf
Source	HMC Senior Theses
Rights	© 2017 Natchanon Suaysom, default

Page generated in 0.0026 seconds

Iterative Matrix Factorization Method for Social Media Data Location Prediction

Description

Links & Downloads

Tags

Additional Fields