Global ETD Search

Return to search

Separation of Vocal and Non-Vocal Components from Audio Clip Using Correlated Repeated Mask (CRM)

Extraction of singing voice from music is one of the ongoing research topics in the field of speech recognition and audio analysis. In particular, this topic finds many applications in the music field, such as in determining music structure, lyrics recognition, and singer recognition. Although many studies have been conducted for the separation of voice from the background, there has been less study on singing voice in particular.
In this study, efforts were made to design a new methodology to improve the separation of vocal and non-vocal components in audio clips using REPET [14]. In the newly designed method, we tried to rectify the issues encountered in the REPET method, while designing an improved repeating mask which is used to extract the non-vocal component in audio. The main reason why the REPET method was preferred over previous methods for this study is its independent nature. More specifically, the majority of existing methods for the separation of singing voice from music were constructed explicitly based on one or more assumptions.

Signal Processing

Identifer	oai:union.ndltd.org:uno.edu/oai:scholarworks.uno.edu:td-3502
Date	09 August 2017
Creators	Kanuri, Mohan Kumar
Publisher	ScholarWorks@UNO
Source Sets	University of New Orleans
Detected Language	English
Type	text
Format	application/pdf
Source	University of New Orleans Theses and Dissertations

Page generated in 0.0065 seconds

Separation of Vocal and Non-Vocal Components from Audio Clip Using Correlated Repeated Mask (CRM)

Description

Links & Downloads

Tags

Additional Fields