361 |
Exploring the Noise Resilience of Combined Sturges AlgorithmAgarwal, Akrita January 2015 (has links)
No description available.
|
362 |
A Massively Parallel Algorithm for Cell Classification Using CUDASchmidt, Samuel January 2015 (has links)
No description available.
|
363 |
Bayesian Nonparametric Models for Ranked Set SamplingGemayel, Nader M. 30 July 2010 (has links)
No description available.
|
364 |
Calibrated Bayes factors for model selection and model averagingLu, Pingbo 24 August 2012 (has links)
No description available.
|
365 |
Using sentiment analysis to craft a narrative of the COVID-19 pandemic from the perspective of social mediaRay, Taylor Breanna 06 August 2021 (has links)
Throughout the COVID-19 pandemic, people have turned to social media to share their experiences with the coronavirus and their feelings regarding subjects like social distancing, mask-wearing, COVID-19 vaccines, and other related topics. The publicly available nature of these social media posts provides researchers the chance to obtain a consensus on an array of issues, topics, people, and entities. For the COVID-19 pandemic, this is valuable information that can prepare communities and governing bodies for future epidemics or events of a similar magnitude. However, clearly defining such a consensus can be difficult, especially if researchers want to limit the amount of bias they introduce. The process of sentiment analysis helps to address this need by categorizing text sources into one of three distinct polarities. Namely, those polarities are often positive, neutral, and negative. While sentiment analysis can take form as a completely manual task, this becomes incredibly burdensome for projects that involve substantial amounts of data. This thesis attempts to overcome this challenge by programmatically classifying the sentiment of COVID-19 posts from 10 social media and web-based forums using a multinomial Naive Bayes classifier. The unique and contrasting qualities of the social networks being analyzed provide a robust take on the public's perception of the pandemic that has not yet been offered up to the present.
|
366 |
Generalized Empirical Bayes: Theory, Methodology, and ApplicationsFletcher, Douglas January 2019 (has links)
The two key issues of modern Bayesian statistics are: (i) establishing a principled approach for \textit{distilling} a statistical prior distribution that is \textit{consistent} with the given data from an initial believable scientific prior; and (ii) development of a \textit{consolidated} Bayes-frequentist data analysis workflow that is more effective than either of the two separately. In this thesis, we propose generalized empirical Bayes as a new framework for exploring these fundamental questions along with a wide range of applications spanning fields as diverse as clinical trials, metrology, insurance, medicine, and ecology. Our research marks a significant step towards bridging the ``gap'' between Bayesian and frequentist schools of thought that has plagued statisticians for over 250 years. Chapters 1 and 2---based on \cite{mukhopadhyay2018generalized}---introduces the core theory and methods of our proposed generalized empirical Bayes (gEB) framework that solves a long-standing puzzle of modern Bayes, originally posed by Herbert Robbins (1980). One of the main contributions of this research is to introduce and study a new class of nonparametric priors ${\rm DS}(G, m)$ that allows exploratory Bayesian modeling. However, at a practical level, major practical advantages of our proposal are: (i) computational ease (it does not require Markov chain Monte Carlo (MCMC), variational methods, or any other sophisticated computational techniques); (ii) simplicity and interpretability of the underlying theoretical framework which is general enough to include almost all commonly encountered models; and (iii) easy integration with mainframe Bayesian analysis that makes it readily applicable to a wide range of problems. Connections with other Bayesian cultures are also presented in the chapter. Chapter 3 deals with the topic of measurement uncertainty from a new angle by introducing the foundation of nonparametric meta-analysis. We have applied the proposed methodology to real data examples from astronomy, physics, and medical disciplines. Chapter 4 discusses some further extensions and application of our theory to distributed big data modeling and the missing species problem. The dissertation concludes by highlighting two important areas of future work: a full Bayesian implementation workflow and potential applications in cybersecurity. / Statistics
|
367 |
Social media analysis for product safety using text mining and sentiment analysisIsa, H., Trundle, Paul R., Neagu, Daniel January 2014 (has links)
No / The growing incidents of counterfeiting and associated economic and health consequences necessitate the development of active surveillance systems capable of producing timely and reliable information for all stake holders in the anti-counterfeiting fight. User generated content from social media platforms can provide early clues about product allergies, adverse events and product counterfeiting. This paper reports a work in progress with contributions including: the development of a framework for gathering and analyzing the views and experiences of users of drug and cosmetic products using machine learning, text mining and sentiment analysis; the application of the proposed framework on Facebook comments and data from Twitter for brand analysis, and the description of how to develop a product safety lexicon and training data for modeling a machine learning classifier for drug and cosmetic product sentiment prediction. The initial brand and product comparison results signify the usefulness of text mining and sentiment analysis on social media data while the use of machine learning classifier for predicting the sentiment orientation provides a useful tool for users, product manufacturers, regulatory and enforcement agencies to monitor brand or product sentiment trends in order to act in the event of sudden or significant rise in negative sentiment.
|
368 |
Flight Data Processing Techniques to Identify Unusual EventsMugtussids, Iossif B. 26 June 2000 (has links)
Modern aircraft are capable of recording hundreds of parameters during flight. This fact not only facilitates the investigation of an accident or a serious incident, but also provides the opportunity to use the recorded data to predict future aircraft behavior. It is believed that, by analyzing the recorded data, one can identify precursors to hazardous behavior and develop procedures to mitigate the problems before they actually occur. Because of the enormous amount of data collected during each flight, it becomes necessary to identify the segments of data that contain useful information. The objective is to distinguish between typical data points, that are present in the majority of flights, and unusual data points that can be only found in a few flights. The distinction between typical and unusual data points is achieved by using classification procedures.
In this dissertation, the application of classification procedures to flight data is investigated. It is proposed to use a Bayesian classifier that tries to identify the flight from which a particular data point came. If the flight from which the data point came is identified with a high level of confidence, then the conclusion that the data point is unusual within the investigated flights can be made.
The Bayesian classifier uses the overall and conditional probability density functions together with a priori probabilities to make a decision. Estimating probability density functions is a difficult task in multiple dimensions. Because many of the recorded signals (features) are redundant or highly correlated or are very similar in every flight, feature selection techniques are applied to identify those signals that contain the most discriminatory power. In the limited amount of data available to this research, twenty five features were identified as the set exhibiting the best discriminatory power. Additionally, the number of signals is reduced by applying feature generation techniques to similar signals.
To make the approach applicable in practice, when many flights are considered, a very efficient and fast sequential data clustering algorithm is proposed. The order in which the samples are presented to the algorithm is fixed according to the probability density function value. Accuracy and reduction level are controlled using two scalar parameters: a distance threshold value and a maximum compactness factor. / Ph. D.
|
369 |
Stochastic Motion Planning for Applications in Subsea Survey and Area ProtectionBays, Matthew Jason 24 April 2012 (has links)
This dissertation addresses high-level path planning and cooperative control for autonomous vehicles. The objective of our work is to closely and rigorously incorporate classication and detection performance into path planning algorithms, which is not addressed with typical approaches found in literature. We present novel path planning algorithms for two different applications in which autonomous vehicles are tasked with engaging targets within a stochastic environment. In the first application an autonomous underwater vehicle (AUV) must reacquire and identify clusters of discrete underwater objects. Our planning algorithm ensures that mission objectives are met with a desired probability of success. The utility of our approach is verified through field trials. In the second application, a team of vehicles must intercept mobile targets before the targets enter a specified area. We provide a formal framework for solving the second problem by jointly minimizing a cost function utilizing Bayes risk. / Ph. D.
|
370 |
Rogue Access Point Detection through Statistical AnalysisKanaujia, Swati 26 May 2010 (has links)
The IEEE 802.11 based Wireless LAN (WLAN) has become increasingly ubiquitous in recent years. However, due to the broadcast nature of wireless communication, attackers can exploit the existing vulnerabilities in IEEE 802.11 to launch various types of attacks in wireless and wired networks.
This thesis presents a statistical based hybrid Intrusion Detection System (IDS) for Rogue Access Point (RAP) detection, which employs distributed monitoring devices to monitor on 802.11 link layer activities and a centralized detection module at a gateway router to achieve higher accuracy in detection of rogue devices. This detection approach is scalable, non-intrusive and does not require any specialized hardware. It is designed to utilize the existing wireless LAN infrastructure and is independent of 802.11a/b/g/n. It works on passive monitoring of wired and wireless traffic, and hence is easy to manage and maintain. In addition, this approach requires monitoring a smaller number of packets for detection as compared to other detection approaches in a heterogeneous network comprised of wireless and wired subnets.
Centralized detection is done at a gateway router by differentiating wired and wireless TCP traffic using Weighted Sequential Hypothesis Testing on inter-arrival time of TCP ACK-pairs. A decentralized module takes care of detection of MAC spoofing and totally relies on 802.11 beacon frames. Detection is done through analysis of the clock skew and the Received Signal Strength (RSS) as fingerprints using a naïve Bayes classifier to detect presence of rogue APs.
Analysis of the system and extensive experiments in various scenarios on a real system have proven the efficiency and accuracy of the approach with few false positives/negatives and low computational and storage overhead. / Master of Science
|
Page generated in 0.0321 seconds