Spelling suggestions: "subject:"markov codels"" "subject:"markov 2models""
51 |
Synchronous HMMs for audio-visual speech processingDean, David Brendan January 2008 (has links)
Both human perceptual studies and automaticmachine-based experiments have shown that visual information from a speaker's mouth region can improve the robustness of automatic speech processing tasks, especially in the presence of acoustic noise. By taking advantage of the complementary nature of the acoustic and visual speech information, audio-visual speech processing (AVSP) applications can work reliably in more real-world situations than would be possible with traditional acoustic speech processing applications. The two most prominent applications of AVSP for viable human-computer-interfaces involve the recognition of the speech events themselves, and the recognition of speaker's identities based upon their speech. However, while these two fields of speech and speaker recognition are closely related, there has been little systematic comparison of the two tasks under similar conditions in the existing literature. Accordingly, the primary focus of this thesis is to compare the suitability of general AVSP techniques for speech or speaker recognition, with a particular focus on synchronous hidden Markov models (SHMMs). The cascading appearance-based approach to visual speech feature extraction has been shown to work well in removing irrelevant static information from the lip region to greatly improve visual speech recognition performance. This thesis demonstrates that these dynamic visual speech features also provide for an improvement in speaker recognition, showing that speakers can be visually recognised by how they speak, in addition to their appearance alone. This thesis investigates a number of novel techniques for training and decoding of SHMMs that improve the audio-visual speech modelling ability of the SHMM approach over the existing state-of-the-art joint-training technique. Novel experiments are conducted within to demonstrate that the reliability of the two streams during training is of little importance to the final performance of the SHMM. Additionally, two novel techniques of normalising the acoustic and visual state classifiers within the SHMM structure are demonstrated for AVSP. Fused hidden Markov model (FHMM) adaptation is introduced as a novel method of adapting SHMMs from existing wellperforming acoustic hidden Markovmodels (HMMs). This technique is demonstrated to provide improved audio-visualmodelling over the jointly-trained SHMMapproach at all levels of acoustic noise for the recognition of audio-visual speech events. However, the close coupling of the SHMM approach will be shown to be less useful for speaker recognition, where a late integration approach is demonstrated to be superior.
|
52 |
Statistical Analysis of Wireless Systems Using Markov ModelsAkbar, Ihsan Ali 06 March 2007 (has links)
Being one of the fastest growing fields of engineering, wireless has gained the attention of researchers and commercial businesses all over the world. Extensive research is underway to improve the performance of existing systems and to introduce cutting edge wireless technologies that can make high speed wireless communications possible.
The first part of this dissertation deals with discrete channel models that are used for simulating error traces produced by wireless channels. Most of the time, wireless channels have memory and we rely on discrete time Markov models to simulate them. The primary advantage of using these models is rapid experimentation and prototyping. Efficient estimation of the parameters of a Markov model (including its number of states) is important to reproducing and/or forecasting channel statistics accurately. Although the parameter estimation of Markov processes has been studied extensively, its order estimation problem has been addressed only recently. In this report, we investigate the existing order estimation techniques for Markov chains and hidden Markov models. Performance comparison with semi-hidden Markov models is also discussed. Error source modeling in slow and fast fading conditions is also considered in great detail.
Cognitive Radio is an emerging technology in wireless communications that can improve the utilization of radio spectrum by incorporating some intelligence in its design. It can adapt with the environment and can change its particular transmission or reception parameters to execute its tasks without interfering with the licensed users. One problem that CR network usually faces is the difficulty in detecting and classifying its low power signal that is present in the environment. Most of the time traditional energy detection techniques fail to detect these signals because of their low SNRs. In the second part of this thesis, we address this problem by using higher order statistics of incoming signals and classifying them by using the pattern recognition capabilities of HMMs combined with cased-based learning approach.
This dissertation also deals with dynamic spectrum allocation in cognitive radio using HMMs. CR networks that are capable of using frequency bands assigned to licensed users, apart from utilizing unlicensed bands such as UNII radio band or ISM band, are also called Licensed Band Cognitive Radios. In our novel work, the dynamic spectrum management or dynamic frequency allocation is performed by the help of HMM predictions. This work is based on the idea that if Markov models can accurately model spectrum usage patterns of different licensed users, then it should also correctly predict the spectrum holes and use these frequencies for its data transmission. Simulations have shown that HMMs prediction results are quite accurate and can help in avoiding CR interference with the primary licensed users and vice versa. At the same time, this helps in sending its data over these channels more reliably. / Ph. D.
|
53 |
Efficient Mixed-Order Hidden Markov Model InferenceSchwardt, Ludwig 12 1900 (has links)
Thesis (PhD (Electrical and Electronic Engineering))--University of Stellenbosch, 2007. / Higher-order Markov models are more powerful than first-order models, but
suffer from an exponential increase in model parameters with order, which leads
to data scarcity problems during training. A more efficient approach is to use
mixed-order Markov models, which model data sequences with contexts of different
lengths.
This study proposes two algorithms for inferring mixed-order Markov chains
and hidden Markov models (HMMs), respectively. The basis of these algorithms
is the prediction suffix tree (PST), an efficient representation of a mixed-order
Markov chain.
The smallest encoded context tree (SECT) algorithm constructs PSTs from
data, based on the minimum description length principle. It has no user-specifiable
parameters to tune, and will expand the depth of the resulting PST as far as
the data set allows it, making it a self-bounded algorithm. It is also faster than
the original PST inference algorithm.
The hidden SECT algorithm replaces the underlying Markov chain of an
HMM with a prediction suffix tree, which is inferred using SECT. The algorithm
is efficient and integrates well with standard techniques.
The properties of the SECT and hidden SECT algorithms are verified on synthetic
data. The hidden SECT algorithm is also compared with a fixed-order
HMM training algorithm on an automatic language recognition task, where the
resulting mixed-order HMMs are shown to be smaller and train faster than the
fixed-order models, for similar classification accuracies.
|
54 |
Speech-driven animation using multi-modal hidden Markov modelsHofer, Gregor Otto January 2010 (has links)
The main objective of this thesis was the synthesis of speech synchronised motion, in particular head motion. The hypothesis that head motion can be estimated from the speech signal was confirmed. In order to achieve satisfactory results, a motion capture data base was recorded, a definition of head motion in terms of articulation was discovered, a continuous stream mapping procedure was developed, and finally the synthesis was evaluated. Based on previous research into non-verbal behaviour basic types of head motion were invented that could function as modelling units. The stream mapping method investigated in this thesis is based on Hidden Markov Models (HMMs), which employ modelling units to map between continuous signals. The objective evaluation of the modelling parameters confirmed that head motion types could be predicted from the speech signal with an accuracy above chance, close to 70%. Furthermore, a special type ofHMMcalled trajectoryHMMwas used because it enables synthesis of continuous output. However head motion is a stochastic process therefore the trajectory HMM was further extended to allow for non-deterministic output. Finally the resulting head motion synthesis was perceptually evaluated. The effects of the “uncanny valley” were also considered in the evaluation, confirming that rendering quality has an influence on our judgement of movement of virtual characters. In conclusion a general method for synthesising speech-synchronised behaviour was invented that can applied to a whole range of behaviours.
|
55 |
Linear dynamic models for automatic speech recognitionFrankel, Joe January 2004 (has links)
The majority of automatic speech recognition (ASR) systems rely on hidden Markov models (HMM), in which the output distribution associated with each state is modelled by a mixture of diagonal covariance Gaussians. Dynamic information is typically included by appending time-derivatives to feature vectors. This approach, whilst successful, makes the false assumption of framewise independence of the augmented feature vectors and ignores the spatial correlations in the parametrised speech signal. This dissertation seeks to address these shortcomings by exploring acoustic modelling for ASR with an application of a form of state-space model, the linear dynamic model (LDM). Rather than modelling individual frames of data, LDMs characterize entire segments of speech. An auto-regressive state evolution through a continuous space gives a Markovian model of the underlying dynamics, and spatial correlations between feature dimensions are absorbed into the structure of the observation process. LDMs have been applied to speech recognition before, however a smoothed Gauss-Markov form was used which ignored the potential for subspace modelling. The continuous dynamical state means that information is passed along the length of each segment. Furthermore, if the state is allowed to be continuous across segment boundaries, long range dependencies are built into the system and the assumption of independence of successive segments is loosened. The state provides an explicit model of temporal correlation which sets this approach apart from frame-based and some segment-based models where the ordering of the data is unimportant. The benefits of such a model are examined both within and between segments. LDMs are well suited to modelling smoothly varying, continuous, yet noisy trajectories such as found in measured articulatory data. Using speaker-dependent data from the MOCHA corpus, the performance of systems which model acoustic, articulatory, and combined acoustic-articulatory features are compared. As well as measured articulatory parameters, experiments use the output of neural networks trained to perform an articulatory inversion mapping. The speaker-independent TIMIT corpus provides the basis for larger scale acoustic-only experiments. Classification tasks provide an ideal means to compare modelling choices without the confounding influence of recognition search errors, and are used to explore issues such as choice of state dimension, front-end acoustic parametrization and parameter initialization. Recognition for segment models is typically more computationally expensive than for frame-based models. Unlike frame-level models, it is not always possible to share likelihood calculations for observation sequences which occur within hypothesized segments that have different start and end times. Furthermore, the Viterbi criterion is not necessarily applicable at the frame level. This work introduces a novel approach to decoding for segment models in the form of a stack decoder with A* search. Such a scheme allows flexibility in the choice of acoustic and language models since the Viterbi criterion is not integral to the search, and hypothesis generation is independent of the particular language model. Furthermore, the time-asynchronous ordering of the search means that only likely paths are extended, and so a minimum number of models are evaluated. The decoder is used to give full recognition results for feature-sets derived from the MOCHA and TIMIT corpora. Conventional train/test divisions and choice of language model are used so that results can be directly compared to those in other studies. The decoder is also used to implement Viterbi training, in which model parameters are alternately updated and then used to re-align the training data.
|
56 |
Extracting motion primitives from natural handwriting dataWilliams, Ben H. January 2009 (has links)
Humans and animals can plan and execute movements much more adaptably and reliably than current computers can calculate robotic limb trajectories. Over recent decades, it has been suggested that our brains use motor primitives as blocks to build up movements. In broad terms a primitive is a segment of pre-optimised movement allowing a simplified movement planning solution. This thesis explores a generative model of handwriting based upon the concept of motor primitives. Unlike most primitive extraction studies, the primitives here are time extended blocks that are superimposed with character specific offsets to create a pen trajectory. This thesis shows how handwriting can be represented using a simple fixed function superposition model, where the variation in the handwriting arises from timing variation in the onset of the functions. Furthermore, it is shown how handwriting style variations could be due to primitive function differences between individuals, and how the timing code could provide a style invariant representation of the handwriting. The spike timing representation of the pen movements provides an extremely compact code, which could resemble internal spiking neural representations in the brain. The model proposes an novel way to infer primitives in data, and the proposed formalised probabilistic model allows informative priors to be introduced providing a more accurate inference of primitive shape and timing.
|
57 |
Analysis of Nanopore Detector Measurements using Machine Learning Methods, with Application to Single-Molecule KineticsLandry, Matthew 18 May 2007 (has links)
At its core, a nanopore detector has a nanometer-scale biological membrane across which a voltage is applied. The voltage draws a DNA molecule into an á-hemolysin channel in the membrane. Consequently, a distinctive channel current blockade signal is created as the molecule flexes and interacts with the channel. This flexing of the molecule is characterized by different blockade levels in the channel current signal. Previous experiments have shown that a nanopore detector is sufficiently sensitive such that nearly identical DNA molecules were classified successfully using machine learning techniques such as Hidden Markov Models and Support Vector Machines in a channel current based signal analysis platform [4-9]. In this paper, methods for improving feature extraction are presented to improve both classification and to provide biologists and chemists with a better understanding of the physical properties of a given molecule.
|
58 |
In vivo Analysis and Modeling Reveals that Transient Interactions of Myosin XI, its Cargo, and Filamentous Actin Overcome Diffusion Limitations to Sustain Polarized Cell GrowthBibeau, Jeffrey Philippe 19 February 2018 (has links)
Tip growth is a ubiquitous process throughout the plant kingdom in which a single cell elongates in one direction in a self-similar manner. To sustain tip growth in plants, the cell must regulate the extensibility of the wall to promote growth and avoid turgor-induced rupture. This process is heavily dependent on the cytoskeleton, which is thought to coordinate the delivery and recycling of vesicles containing cell wall materials at the cell tip. Although significant work has been done to elucidate the various molecular players in this process, there remains a need for a more mechanistic understanding of the cytoskeletonÂ’s role in tip growth. For this reason, specific emphasis should be placed on understanding the dynamics of the cytoskeleton, its associated motors, and their cargo. Since the advent of fluorescence fusion technology, various quantitative fluorescence dynamics techniques have emerged. Among the most prominent of these techniques is fluorescence recovery after photobleaching (FRAP). Despite its prominence, it is unclear how to interpret fluorescence recoveries in confined cellular geometries such as tip-growing cells. Here we developed a digital confocal microscope simulation of FRAP in tip-growing cells. With this simulation, we determined that fluorescence recoveries are significantly influenced by cell boundaries. With this FRAP simulation, we then measured the diffusion of VAMP72-labeled vesicles in the moss Physcomitrella patens. Using finite element modeling of polarized cell growth, and the measured VAMP72-labeled vesicle diffusion coefficient, we were able to show that diffusion alone cannot support the required transport of wall materials to the cell tip. This indicates that an actin-based active transport system is necessary for vesicle clustering at the cell tip to support growth. This provides one essential function of the actin cytoskeleton in polarized cell growth. After establishing the requirement for actin-based transport, we then sought to characterize the in vivo binding interactions of myosin XI, vesicles, and filamentous actin. Particle tracking evidence from P. patens protoplasts suggests that myosin XI and VAMP72-labeled vesicles exhibit fast transient interactions. Hidden Markov modeling of particle tracking indicates that myosin XI and VAMP72- labeled vesicles move along actin filaments in short-lived linear trajectories. These fast transient interactions may be necessary to achieve the rapid dynamics of the apical actin, important for growth. This work advances the fieldÂ’s understanding of fluorescence dynamics, elucidates a necessary function of the actin cytoskeleton, and provides insight into how the components of the cytoskeleton interact in vivo.
|
59 |
Stohastički dinamički opis ISI vremenskih nizova: Markovljevi modeli / Stochastic Dynamical description of the ISI time series: Markov modelsMinich Janoš 11 September 2018 (has links)
<p>Cilj: Brzina ispaljivanja neuralnih impulsa u kori velikog mozga je veoma promenljiva što ukazuje da bi Poasonov tačkasti proces mogao da bude pogodan za modelirqnje takvog procesa. Međutim, brojna istraživanja su pokazala da statistika ispaljivanja ne sledi Poasona. Uprkos tome, još uvek se nije iskristalisao ni alternativni mehanizam koji bi opisao generisanje spajkova, ni raspodela koja bi opisala raspodelu intervala između spajkova (ISI). Ključni cilj ove disertacije je statistička analiza koja će omogućiti modelovanje ISI vremenskih nizova snimljenih u različitim delovima kore velikog mozga dok su majmuni rešavali različite probleme.<br />Metoda: Primenjena je robusna neparametarska statistika da bi se odredila funkcija gustine raspodele (PDF) ISI vremenskih nizova. Rezultati su verifikovani butstrep metodom i iskorišćeni za kreiranje Markovljevog modela.<br />Rezultati: Pokazalo se da se raspodela ISI intervala ne može opisati samo jednom funkcijom i da se statistika ne može da poveže isključivo sa već postojećim modelima, uključujući i eksponencijalni. Pokazalo se, zatim, da ISI statistika ne zavisi od regije u kori velikog mozga, niti, unutar jedne regije, od problema koji je budni majmun rešavao. Međutim, ISI mizovi snimani dok je majmun rešavao isti problem ali u različitim vremenskim intervalima nisu statistički slični, što ukazuje na postojanje varijabiliteta u ISI vremenskim nizovima u zavisnosti od problema koji se rešava.<br />Zaključak: Rezultati analize signala ukazuju da je neuralna aktivnost posledica komplesnih generišućih mehanizama sa značajnom međuzavisnošću i da process zavisi od zadatka koji se rešava.</p> / <p>Objectives: High variability of neuronal firing patterns in the cerebral cortex points towards spiking activity models based on Poisson point processes. In spite of growing evidence that firing behavior may fail Poisson statistics, an alternate spike generating mechanisms and the resulting inter-spike interval (ISI) distributions have not been clarified yet. The key objective of this thesis is to perform a statistical analysis that would yield a model of ISI time series recorded from different from different cortical areas of awake monkeys performing various behavioral tasks.<br />Methods: A robust and non-parametrical statistics to determine ISI probability density functions (PDF-s) of extracellularly recorded cerebral cortical neurons of behaving macaque monkeys is performed. The results were validated using the bootstrap method. The obtained statistics were used to create a Markov model of ISI time series.<br />Results: It turned out that there is no single ISI distribution, but many, and that the underlying statistics is not associated exclusively to the current established models including the exponential. Distribution of types of ISI statistics obtained from different cortical areas are statistically similar and the same applies to the statistics obtained from the same cortical area by ignoring ongoing behavior. However, particular ISI time series observed during the time epochs of the same behavioral task did not show statistical similarity, suggesting a task dependent variation of spike generating dynamics.<br />Conclusion: In summary, the results indicate that neuronal firing activity is resulted by complex generative mechanisms with significant dependency and that this process is contingent upon the behavior.</p>
|
60 |
Contribution à l'évaluation de la fiabilité des chaînes polyphasées de conversion électromécanique d'énergie / Contribution to the reliability assessment of the electromechanical energy conversion multi-phase systems.Olmi, Christophe 07 May 2019 (has links)
Les machines électriques polyphasées présentent des avantages intrinsèques (fractionnement de la puissance, faible ondulation du couple) par rapport à leurs équivalents triphasés qui sont appréciés notamment pour la propulsion navale. Structurellement, ces machines disposent également de capacités de reconfiguration du fait des redondances offertes par leur grand nombre de phases. L'exploitation de ces capacités est susceptible d'augmenter leur sûreté de fonctionnement en adoptant des modes de marche dégradée. Les travaux présentés proposent une méthode permettant de quantifier la fiabilité de toute la chaine de conversion. Le convertisseur statique y est particulièrement étudié car ses composants constituent un point faible en matière de fiabilité dans le système polyphasé. Des bases virtuelles continues de ces composants sont développées afin de s'affranchir des effets de quantification. Les principaux facteurs de stress sont identifiés et intégrés dans l'évaluation des taux de défaillance des différents éléments du système. Les modèles de Markov sont exploités pour prendre en compte les effets des reconfigurations sur la fonction de fiabilité. Un critère couplant la performance et à la fiabilité est introduit afin de caractériser les modes de marche dégradée dans l'évaluation de la fiabilité du système. Des exemples d'application de la méthode sur des systèmes issus essentiellement de l'environnement maritime sont exposés en intégrant leur topologie, leur profil de mission et leur stratégie de commande, ceux-ci influençant fortement les facteurs de stress. Enfin une étude de sensibilité de l'impact de la variabilité des données d'entrée sur la fonction de fiabilité est proposée. / Electrical multi-phase machines exhibit intrinsic advantages (power subdivision, weak torque ripple) compared to 3-phase machines. Multi-phase machines are appreciated for marine propulsion. They own reconfiguration capabilities due to redundancy because of their high number of phases. Those capabilities are able to improve multi-phase machines reliability by using degraded modes. Presented work proposes a methodology to quantify the multi-phase system reliability. Static converter is particularly investigated as its components are a weak point in the system. Continuous virtual bases of the components are developed to prevent quantification effects. Main stressors are identified and included in the failure rates assessment of the different system components. Markov models are used to take into account the reconfiguration consequences onto the reliability function. A coupled criterion performance-reliability is introduced to characterize degraded modes into the reliability assessment. Examples of the method application from marine environment are exhibited including their topology, mission profile and control strategy, which strongly influence the stressors. A sensitivity analysis is proposed showing the input data scattering effect onto the reliability function.
|
Page generated in 0.0602 seconds