The purpose of this thesis was to investigate whether some successful ideas in NLP, such as word2vec, are applicable to time series prob- lems or not. More specifically, we are interested to assess a combina- tion of previously proven methods such as SAX and Word2vec. Based on a rolling window approach, we applied SAX to create words for each window. These words formed a corpus on which we performed Word2vec, which served as inputs in a time series forecasting setting. We found that for forecasting horizons of longer length, our proposed method showed an improvement over statistical models under certain conditions. The findings suggest that bringing tools from the natural language processing domain into the time series domain may be an ef- fective idea. Further research is necessary to broaden the knowledge of these types of methods by testing alternative options for the cre- ation of words. Hopefully, this work will motivate other researchers to investigate this type of solution further.
Identifer | oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:hh-51133 |
Date | January 2023 |
Creators | Janerdal, Erik, Dimovski, David |
Publisher | Högskolan i Halmstad, Akademin för informationsteknologi |
Source Sets | DiVA Archive at Upsalla University |
Language | English |
Detected Language | English |
Type | Student thesis, info:eu-repo/semantics/bachelorThesis, text |
Format | application/pdf |
Rights | info:eu-repo/semantics/openAccess |
Page generated in 0.0013 seconds