Global ETD Search

Return to search

Speech/music discrimination : novel features in time domain

This research aimed to find novel features that can be used to discriminate between speech and music in the time domain for the purpose of data retrieval. The study used speech and music data that were recorded in standard anechoic chambers and sampled at 44.1 kHz. Two types of new features were found and thoroughly examined: the Ratio of Silent Frames (RSF) feature and the Time Series Events (TSE) set of features. The Receiver Operating Characteristics (ROC) curves were used to assess each one of the proposed features as well as certain relevant features from the literature for the purpose of comparison. The RSF feature introduced up to 8% enhancement when compared to a couple of relevant features from the literature. One of the TSE set of features provided close to 100% speech/music discrimination.

http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.512895

005.3

Identifer	oai:union.ndltd.org:bl.uk/oai:ethos.bl.uk:512895
Date	January 2010
Creators	Alnadabi, Muhammad Saeid Muhammad
Publisher	Durham University
Source Sets	Ethos UK
Detected Language	English
Type	Electronic Thesis or Dissertation
Source	http://etheses.dur.ac.uk/206/

Page generated in 0.0097 seconds

Speech/music discrimination : novel features in time domain

Description

Links & Downloads

Tags

Additional Fields