Global ETD Search

Return to search

A design of text-independent medium-size speaker recognition system

This paper presents text-independent speaker identification results for medium-size speaker population sizes up to 400 speakers for TV speech and TIMIT database . A system based on Gaussian mixture speaker models is used for speaker identification, and experiments are conducted on the TV database and TIMIT database. The TV-Database results show medium-size population performance under TV conditions. These are believed to be the first speaker identification experiments on the complete 400 speaker TV databases and the largest text-independent speaker identification task reported to date. Identification accuracies of 94.5% on the TV databases, respectively and 98.5% on the TIMIT database .

http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0913102-043719

Gaussian mixture model

Vector quantization

Speaker recognition

Mel-frequency cepstrum coefficients

Identifer	oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0913102-043719
Date	13 September 2002
Creators	Zheng, Shun-De
Contributors	Chih-Chien Chen, Chii-Maw Uang, Tsung Lee
Publisher	NSYSU
Source Sets	NSYSU Electronic Thesis and Dissertation Archive
Language	Cholon
Detected Language	English
Type	text
Format	application/pdf
Source	http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0913102-043719
Rights	restricted, Copyright information available at source archive

Page generated in 0.0017 seconds

A design of text-independent medium-size speaker recognition system

Description

Links & Downloads

Tags

Additional Fields