Global ETD Search

Return to search

Bayesian Analysis of Binary Sales Data for Several Industries

The analysis of big data is now very popular. Big data may be very important for companies, societies or even human beings if we can take full advantage of them. Data scientists defined big data with four Vs: volume, velocity, variety and veracity. In a short, the data have large volume, grow with high velocity, represent with numerous varieties and must have high quality. Here we analyze data from many sources (varieties). In small area estimation, the term ``big data' refers to numerous areas. We want to analyze binary for a large number of small areas. Then standard Markov Chain Monte Carlo methods (MCMC) methods do not work because the time to do the computation is prohibitive. To solve this problem, we use numerical approximations. We set up four methods which are MCMC, method based on Beta-Binomial model, Integrated Nested Normal Approximation Model (INNA) and Empirical Logistic Transform (ELT) method. We compare the processing time and accuracies of these four methods in order to find the fastest and reasonable accurate one. Last but not the least, we combined the empirical logistic transform method, the fastest and accurate method, with time series to explore the sales data over time.

https://digitalcommons.wpi.edu/etd-theses/596

inna

empirical logistic transform

Identifer	oai:union.ndltd.org:wpi.edu/oai:digitalcommons.wpi.edu:etd-theses-1595
Date	30 April 2015
Creators	Chen, Zhilin
Contributors	Balgobin Nandram, Advisor, ,
Publisher	Digital WPI
Source Sets	Worcester Polytechnic Institute
Detected Language	English
Type	text
Source	Masters Theses (All Theses, All Years)

Page generated in 0.0017 seconds

Bayesian Analysis of Binary Sales Data for Several Industries

Description

Links & Downloads

Tags

Additional Fields