• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 337
  • 40
  • 24
  • 14
  • 10
  • 10
  • 9
  • 9
  • 9
  • 9
  • 9
  • 9
  • 8
  • 3
  • 3
  • Tagged with
  • 498
  • 498
  • 498
  • 180
  • 124
  • 97
  • 89
  • 48
  • 48
  • 42
  • 41
  • 41
  • 39
  • 38
  • 37
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
41

Model-based speech separation and enhancement with single-microphone input. / CUHK electronic theses & dissertations collection

January 2008 (has links)
Experiments were carried out for continuous real speech mixed with either competitive speech source or broadband noise. Results show that separation outputs bear similar spectral trajectories as the ideal source signals. For speech mixtures, the proposed algorithm is evaluated in two ways: segmental signal-to-interference ratio (segSIR) and Itakura-Saito distortion ( dIS). It is found that (1) interference signal power is reduced in term of segSIR improvement, even under harsh condition of similar target speech and interference powers; and (2) dIS between the estimated source and the clean speech source is significantly smaller than before processing. These assert the capability of the proposed algorithm to extract individual sources from a mixture signal by reducing the interference signal and generating appropriate spectral trajectory for individual source estimates. / Our approach is based on the findings of psychoacoustics. To separate individual sound sources in a mixture signal, human exploits perceptual cues like harmonicity, continuity, context information and prior knowledge of familiar auditory patterns. Furthermore, the application of prior knowledge of speech for top-down separation (called schema-based grouping) is found to be powerful, yet unexplored. In this thesis, a bi-directional, model-based speech separation and enhancement algorithm is proposed by utilizing speech schemas, in particular. As model patterns are employed to generate subsequent spectral envelopes in an utterance, output speech is expected to be natural and intelligible. / The proposed separation algorithm regenerates a target speech source by working out the corresponding spectral envelope and harmonic structure. In the first stage, an optimal sequence of Wiener filtering is determined for subsequent interference removal. Specifically, acoustic models of speech schemas represented by possible line spectrum pair (LSP) patterns, are manipulated to match the input mixture and the given transcription if available, in a top-down manner. Specific LSP patterns are retrieved to constitute a spectral evolution that synchronizes with the target speech source. With this evolution, the mixture spectrum is then filtered to approximate the target source in an appropriate signal level. In the second stage, irrelevant harmonic structure from interfering sources is eliminated by comb filtering. These filters are designed according to the results of pitch tracking. / This thesis focuses on speech source separation problem in a single-microphone scenario. Possible applications of speech separation include recognition, auditory prostheses and surveillance systems. Sound signals typically reach our ears as a mixture of desired signals, other competing sounds and background noise. Example scenarios are talking with someone in crowd with other people speaking or listening to an orchestra with a number of instruments playing concurrently. These sounds are often overlapped in time and frequency. While human attends to individual sources remarkably well under these adverse conditions even with a single ear, the performance of most speech processing system is easily degraded. Therefore, modeling how human auditory system performs is one viable way to extract target speech sources from the mixture before any vulnerable processes. / Lee, Siu Wa. / "April 2008." / Adviser: Chung Ching. / Source: Dissertation Abstracts International, Volume: 70-03, Section: B, page: 1846. / Thesis (Ph.D.)--Chinese University of Hong Kong, 2008. / Includes bibliographical references (p. 233-252). / Electronic reproduction. Hong Kong : Chinese University of Hong Kong, [2012] System requirements: Adobe Acrobat Reader. Available via World Wide Web. / Electronic reproduction. [Ann Arbor, MI] : ProQuest Information and Learning, [200-] System requirements: Adobe Acrobat Reader. Available via World Wide Web. / Abstracts in English and Chinese. / School code: 1307.
42

Clustering wide-contexts and HMM topologies for spontaneous speech recognition /

Shafran, Izhak. January 2001 (has links)
Thesis (Ph. D.)--University of Washington, 2001. / Includes bibliographical references (p. 80-95).
43

Speech recognition software : an alternative to reduce ship control manning /

Kuffel, Robert F. January 2004 (has links) (PDF)
Thesis (M.S. in Information Systems and Operations)--Naval Postgraduate School, March 2004. / Thesis advisor(s): Russell Gottfried, Monique P. Fargues. Includes bibliographical references (p. 43-45). Also available online.
44

Effects of transcription errors on supervised learning in speech recognition

Sundaram, Ramasubramanian H. January 2003 (has links)
Thesis (M.S.)--Mississippi State University. Department of Electrical and Computer Engineering. / Title from title screen. Includes bibliographical references.
45

Speaker-independent recognition of Putonghua finals /

Chan, Chit-man. January 1987 (has links)
Thesis (Ph. D.)--University of Hong Kong, 1988.
46

A study of some variations on the hidden Markov modelling approach to speaker independent isolated word speech recognition

梁舜德, Leung, Shun Tak Albert. January 1990 (has links)
published_or_final_version / Electrical and Electronic Engineering / Master / Master of Philosophy
47

Analysis and compensation of stressed and noisy speech with application to robust automatic recognition

Hansen, John H. L. 08 1900 (has links)
No description available.
48

Modeling speech using a partially observable Markov decison process /

Jonas, Michael. January 1900 (has links)
Thesis (Ph.D.)--Tufts University, 2003. / Adviser: James G. Schmolze. Submitted to the Dept. of Computer Science. Includes bibliographical references (leaves 103-109). Access restricted to members of the Tufts University community. Also available via the World Wide Web;
49

Transformation sharing strategies for MLLR speaker adaptation /

Mandal, Arindam. January 2007 (has links)
Thesis (Ph. D.)--University of Washington, 2007. / Vita. Includes bibliographical references (p. 102-115).
50

General description and recognition of 37 Chinese speech sounds /

Cheng, Ping-yung. January 1979 (has links) (PDF)
Thesis (M.Eng.Sc.) -- University of Adelaide, Dept. of Electrical Engineering, 1980. / Typescript (photocopy).

Page generated in 0.1048 seconds