Return to search

A Design of Multi-session Text-independent Digital Camcorder Audio-Video Database for Speaker Recognition

In this thesis, an audio-video database for speaker recognition is constructed using a digital camcorder. Motion pictures of fifteen hundred speakers are recorded in three different sessions in the database. For each speaker, 20 still images per session are also derived from the video data. It is hoped that this database can provide an appropriate training and testing mechanism for person identification using both voice and face features.

Identiferoai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0905108-021052
Date05 September 2008
CreatorsChen, Chun-chi
ContributorsChii-Maw Uang, Chih-Chien Chen, Tsung Lee
PublisherNSYSU
Source SetsNSYSU Electronic Thesis and Dissertation Archive
LanguageCholon
Detected LanguageEnglish
Typetext
Formatapplication/pdf
Sourcehttp://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0905108-021052
Rightsnot_available, Copyright information available at source archive

Page generated in 0.0018 seconds