Spelling suggestions: "subject:"[een] SPEECH PROCESSING"" "subject:"[enn] SPEECH PROCESSING""
181 |
An analysis of blind signal separation for real time applicationSmith, Daniel. January 2006 (has links)
Thesis (Ph.D.)--University of Wollongong, 2006. / Typescript. Includes bibliographical references: leaf 236-258.
|
182 |
Non-intrusive speech quality assessment in VoIP /Radwan, Ayman, January 1900 (has links)
Thesis (M. App. Sc.)--Carleton University, 2003. / Includes bibliographical references (p. 114-122). Also available in electronic format on the Internet.
|
183 |
Parametric mixing for centralized VoIP conferencing using ITU-T recommendation G.722.2 /Agnello, Giuseppe, January 1900 (has links)
Thesis (M.App.Sc.) - Carleton University, 2006. / Includes bibliographical references (p. 114-119). Also available in electronic format on the Internet.
|
184 |
The effectiveness of voice recognition technology as used by persons with disabilitiesJohnson, Joanna. January 1998 (has links) (PDF)
Thesis--PlanB (M.S.)--University of Wisconsin--Stout, 1998. / Includes bibliographical references.
|
185 |
FRIC : an expert system to recognize fricatives /Atkinson, Karen A. January 1987 (has links)
Thesis (M.S.)--Rochester Institute of Technology, 1987. / Typescript. Includes bibliographical references (leaves 54-56).
|
186 |
Usability of interactive voice response systems in real-life tasks performed by old and young women.Dulude, Louise, Carleton University. Dissertation. Psychology. January 2000 (has links)
Thesis (M.A.)--Carleton University, 2001. / Also available in electronic format on the Internet.
|
187 |
Building a prosodically sensitive diphone database for a Korean text-to-speech synthesis systemYoon, Kyuchul, January 2005 (has links)
Thesis (Ph. D.)--Ohio State University, 2005. / Title from first page of PDF file. Document formatted into pages; contains xxii, 291 p.; also includes graphics (some col.) Includes bibliographical references (p. 210-216). Available online via OhioLINK's ETD Center
|
188 |
Analysis and implementation of the speaker adaptation techniques : MAP, MLLR, and MLEDFanner, Robert M. 12 1900 (has links)
Thesis (MScEng)--University of Stellenbosch, 2002. / ENGLISH ABSTRACT: The topic of this thesis is speaker adaptation, whereby speaker-independent speech models
are adapted to more closely match individual speakers by utilising a small amount of
data from the targeted individual. Speaker adaptation methods - specifically, the MAP,
MLLR and MLED speaker adaptation methods - are critically evaluated and compared.
Two novel extensions of the MLED adaptation method are introduced, derived and
evaluated. The first incorporates the explicit modelling of the mean speaker model in
the speaker-space into the MLED framework. The second extends MLED to use basis
vectors modelling inter-class variance for classes of speech models, instead of basis vectors
modelling inter-speaker variance.
An evaluation of the effect of two different types of feature vector - PLP-cepstra and
LPCCs - on the performance of speaker adaptation is made, to determine which feature
vector is optimal for speaker-independent systems and the adaptation thereof. / AFRIKAANSE OPSOMMING: Die onderwerp van hierdie tesis is spreker-aanpassing, dit wil sê, die verandering van
'n spreker-onafhanklike spraakmodel om nader aan 'n spreker-afhanklike model vir 'n
individu te wees, gegewe 'n klein hoeveelheid spraakdata van die individu. Die volgende
sprekeraanpassing-metodes word geëvalueer: MAP, MLLR en MLED.
Twee nuwe uitbreidings vir die MLED-metode word beskryf, afgelei en geëvalueer.
Die eerste inkorporeer die eksplisiete modellering van die gemiddelde sprekermodel van
die sprekerruimte in die MLED metode. Die tweede uitbreiding maak gebruik van basisvektore
vir MLED wat vanaf die interklas-variansie tussen 'n stel sprekerklasse in plaas
van die interspreker-variansie afgelei is.
Die effek van twee tipes kenmerk-vektore - PLP-kepstra en LPCC's - op die prestasie
van sprekeraanpassings-metodes word ondersoek, sodat die optimale tipe kenmerk-vektor
vir spreker-onafhanklike modelle en hul aanpassing gevind kan word.
|
189 |
Investigation of the impact of high frequency transmitted speech on speaker recognitionPool, Jan 04 1900 (has links)
Thesis (MScEng)--Stellenbosch University, 2002. / Some digitised pages may appear illegible due to the condition of the original hard copy. / ENGLISH ABSTRACT: Speaker recognition systems have evolved to a point where near perfect performance can be
obtained under ideal conditions, even if the system must distinguish between a large number
of speakers. Under adverse conditions, such as when high noise levels are present or when the
transmission channel deforms the speech, the performance is often less than satisfying.
This project investigated the performance of a popular speaker recognition system, that use
Gaussian mixture models, on speech transmitted over a high frequency channel. Initial experiments
demonstrated very unsatisfactory results for the base line system.
We investigated a number of robust techniques. We implemented and applied some of them in
an attempt to improve the performance of the speaker recognition systems. The techniques we
tested showed only slight improvements.
We also investigates the effects of a high frequency channel and single sideband modulation on
the speech features of speech processing systems. The effects that can deform the features, and
therefore reduce the performance of speech systems, were identified.
One of the effects that can greatly affect the performance of a speech processing system is
noise. We investigated some speech enhancement techniques and as a result we developed a
new statistical based speech enhancement technique that employs hidden Markov models to
represent the clean speech process. / AFRIKAANSE OPSOMMING: Sprekerherkenning-stelsels het 'n punt bereik waar nabyaan perfekte resultate verwag kan word
onder ideale kondisies, selfs al moet die stelsel tussen 'n groot aantal sprekers onderskei. Wanneer
nie-ideale kondisies, soos byvoorbeeld hoë ruisvlakke of 'n transmissie kanaal wat die
spraak vervorm, teenwoordig is, is die resultate gewoonlik nie bevredigend nie.
Die projek ondersoek die werksverrigting van 'n gewilde sprekerherkenning-stelsel, wat gebruik
maak van Gaussiese mengselmodelle, op spraak wat oor 'n hoë frekwensie transmissie
kanaal gestuur is. Aanvanklike eksperimente wat gebruik maak van 'n basiese stelsel het nie
goeie resultate opgelewer nie.
Ons het 'n aantal robuuste tegnieke ondersoek en 'n paar van hulle geïmplementeer en getoets
in 'n poging om die resultate van die sprekerherkenning-stelsel te verbeter. Die tegnieke wat
ons getoets het, het net geringe verbetering getoon.
Die studie het ook die effekte wat die hoë-frekwensie kanaal en enkel-syband modulasie op
spraak kenmerkvektore, ondersoek. Die effekte wat die spraak kenmerkvektore kan vervorm en
dus die werkverrigting van spraak stelsels kan verlaag, is geïdentifiseer.
Een van die effekte wat 'n groot invloed op die werkverrigting van spraakstelsels het, is ruis.
Ons het spraak verbeterings metodes ondersoek en dit het gelei tot die ontwikkeling van 'n
statisties gebaseerde spraak verbeteringstegniek wat gebruik maak van verskuilde Markov modelle
om die skoon spraakproses voor te stel.
|
190 |
Chaos Computing: From Theory to ApplicationJanuary 2011 (has links)
abstract: In this thesis I introduce a new direction to computing using nonlinear chaotic dynamics. The main idea is rich dynamics of a chaotic system enables us to (1) build better computers that have a flexible instruction set, and (2) carry out computation that conventional computers are not good at it. Here I start from the theory, explaining how one can build a computing logic block using a chaotic system, and then I introduce a new theoretical analysis for chaos computing. Specifically, I demonstrate how unstable periodic orbits and a model based on them explains and predicts how and how well a chaotic system can do computation. Furthermore, since unstable periodic orbits and their stability measures in terms of eigenvalues are extractable from experimental times series, I develop a time series technique for modeling and predicting chaos computing from a given time series of a chaotic system. After building a theoretical framework for chaos computing I proceed to architecture of these chaos-computing blocks to build a sophisticated computing system out of them. I describe how one can arrange and organize these chaos-based blocks to build a computer. I propose a brand new computer architecture using chaos computing, which shifts the limits of conventional computers by introducing flexible instruction set. Our new chaos based computer has a flexible instruction set, meaning that the user can load its desired instruction set to the computer to reconfigure the computer to be an implementation for the desired instruction set. Apart from direct application of chaos theory in generic computation, the application of chaos theory to speech processing is explained and a novel application for chaos theory in speech coding and synthesizing is introduced. More specifically it is demonstrated how a chaotic system can model the natural turbulent flow of the air in the human speech production system and how chaotic orbits can be used to excite a vocal tract model. Also as another approach to build computing system based on nonlinear system, the idea of Logical Stochastic Resonance is studied and adapted to an autoregulatory gene network in the bacteriophage λ. / Dissertation/Thesis / Ph.D. Electrical Engineering 2011
|
Page generated in 0.057 seconds