Return to search

Probabilistic models for classification of bioacoustic data

Probabilistic models have been successfully applied for a wide variety of problems,
such as but not limited to information retrieval, computer vision, bio-informatics
and speech processing. Probabilistic models allow us to encode our assumptions
about the data in an elegant fashion and enable us to perform machine learning
tasks such as classification and clustering in a principled manner. Probabilistic
models for bio-acoustic data help in identifying interesting patterns in the data (for instance, the species-specific vocabulary), as well as species identification (classification) in recordings where the label is not available.

The focus of this thesis is to develop efficient inference techniques for existing
models, as well as develop probabilistic models tailored to bioacoustic data.
First, we develop inference algorithms for the supervised latent Dirichlet allocation (LDA) model. We present collapsed variational Bayes, collapsed Gibbs sampling and maximum-a-posteriori (MAP) inference for parameter estimation and classification in supervised LDA. We provide an empirical evaluation of the trade-off between computational complexity and classification performance of the inference methods for supervised LDA, on audio classification (species identification in this context)as well as image classification and document classification tasks. Next, we present novel probabilistic models for bird sound recordings, that can capture temporal structure at different hierarchical levels, and model additional information such as the duration and frequency of vocalizations. We present a non-parametric density estimation technique for parameter estimation and show that the MAP classifier for our models can be interpreted as a weighted nearest neighbor classifier. We provide an experimental comparison between the proposed models and a support vector machine based approach, using bird sound recordings from the Cornell Macaulay library. / Graduation date: 2011 / Access restricted to the OSU Community at author's request from Dec. 30, 2010 - Dec. 30, 2011

Identiferoai:union.ndltd.org:ORGSU/oai:ir.library.oregonstate.edu:1957/19655
Date30 December 2010
CreatorsLakshminarayanan, Balaji
ContributorsRaich, Raviv
Source SetsOregon State University
Languageen_US
Detected LanguageEnglish
TypeThesis/Dissertation

Page generated in 0.002 seconds