Global ETD Search

631	Loan Default Prediction using Supervised Machine Learning Algorithms / Fallissemangprediktion med hjälp av övervakade maskininlärningsalgoritmer Granström, Daria, Abrahamsson, Johan January 2019 (has links) It is essential for a bank to estimate the credit risk it carries and the magnitude of its exposure in case of non-performing customers. This kind of risk has been estimated with statistical methods for decades, and recent developments in machine learning have raised interest in investigating whether machine learning techniques can quantify the risk better. The aim of this thesis is to examine which method from a chosen set of machine learning techniques exhibits the best performance in default prediction with regard to chosen model evaluation parameters. The investigated techniques were Logistic Regression, Random Forest, Decision Tree, AdaBoost, XGBoost, Artificial Neural Network and Support Vector Machine. An oversampling technique called SMOTE was implemented in order to treat the imbalance between classes for the response variable. The results showed that XGBoost without implementation of SMOTE obtained the best result with respect to the chosen model evaluation metric. / Det är nödvändigt för en bank att ha en bra uppskattning på hur stor risk den bär med avseende på kunders fallissemang. Olika statistiska metoder har använts för att estimera denna risk, men med den nuvarande utvecklingen inom maskininlärningsområdet har det väckt ett intresse att utforska om maskininlärningsmetoder kan förbättra kvaliteten på riskuppskattningen. Syftet med denna avhandling är att undersöka vilken metod av de implementerade maskininlärningsmetoderna presterar bäst för modellering av fallissemangprediktion med avseende på valda modellvalideringsparametrar. De implementerade metoderna var Logistisk Regression, Random Forest, Decision Tree, AdaBoost, XGBoost, Artificiella neurala nätverk och Stödvektormaskin. 
En översamplingsteknik, SMOTE, användes för att behandla obalansen i klassfördelningen för svarsvariabeln. Resultatet blev följande: XGBoost utan implementering av SMOTE visade bäst resultat med avseende på den valda metriken. Machine Learning Deep Learning Credit Risk Default Prediction Logistic Regression Random Forest Decision Tree AdaBoost XGBoost Artificial Neural Network Support Vector Machine SMOTE Maskininlärning Djupinlärning Kreditrisk Fallissemangprediktion Logistisk Regression Random Forest Decision Tree AdaBoost XGBoost Artificiella neurala nätverk Stödvektormaskin SMOTE Probability Theory and Statistics Sannolikhetsteori och statistik
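The oversampling step mentioned in record 631 can be illustrated with a minimal sketch of SMOTE-style interpolation. This is not the thesis's implementation: the function name `smote_like`, the parameters `k` and `seed`, and the toy data are assumptions made for illustration, and the sketch assumes at least two minority samples.

```python
import random

def smote_like(minority, n_new, k=2, seed=0):
    """Create n_new synthetic minority points by interpolating between a
    randomly chosen minority point and one of its k nearest minority neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        x = rng.choice(minority)
        # Rank the other minority points by squared Euclidean distance to x.
        others = [p for p in minority if p is not x]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(x, p)))
        neighbour = rng.choice(others[:k])
        u = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(tuple(a + u * (b - a) for a, b in zip(x, neighbour)))
    return synthetic

# Hypothetical minority-class samples (two features each).
minority = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
new_points = smote_like(minority, n_new=5)
```

Each synthetic point lies on the line segment between two existing minority samples, so the minority class grows without exact duplicates being added.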
632	The impact of parsing methods on recurrent neural networks applied to event-based vehicular signal data / Påverkan av parsningsmetoder på återkommande neuronnät applicerade på händelsebaserad signaldata från fordon Max, Lindblad January 2018 (has links) This thesis examines two different approaches to parsing event-based vehicular signal data to produce input to a neural network prediction model: event parsing, where the data is kept unevenly spaced over the temporal domain, and slice parsing, where the data is made to be evenly spaced over the temporal domain instead. The dataset used as a basis for these experiments consists of a number of vehicular signal logs taken at Scania AB. Comparisons between the parsing methods have been made by first training long short-term memory (LSTM) recurrent neural networks (RNN) on each of the parsed datasets and then measuring the output error and resource costs of each such model after having validated them on a number of shared validation sets. The results from these tests clearly show that slice parsing compares favourably to event parsing. / Denna avhandling jämför två olika tillvägagångssätt vad gäller parsningen av händelsebaserad signaldata från fordon för att producera indata till en förutsägelsemodell i form av ett neuronnät, nämligen händelseparsning, där datan förblir ojämnt fördelad över tidsdomänen, och skivparsning, där datan är omgjord till att istället vara jämnt fördelad över tidsdomänen. Det dataset som används för dessa experiment är ett antal signalloggar från fordon som kommer från Scania. Jämförelser mellan parsningsmetoderna gjordes genom att först träna ett lång korttidsminne (LSTM) återkommande neuronnät (RNN) på vardera av de skapade dataseten för att sedan mäta utmatningsfelet och resurskostnader för varje modell efter att de validerats på en delad uppsättning av valideringsdata. Resultaten från dessa tester visar tydligt på att skivparsning står sig väl mot händelseparsning. 
neural network artificial neural network ANN recurrent neural network RNN long-short term memory LSTM event slice parsing method slice parsing event parsing time-slice event-based slice-based signal data temporal data temporal sequence multivariate time-series unequally spaced unevenly spaced irregularly spaced Scania SICS SAGA Computer Sciences Datavetenskap (datalogi)
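The difference between the two parsing approaches in record 632 can be sketched as follows. The abstract does not say how values are assigned to slices, so holding the last seen value is an assumption here, as are the function name `slice_parse`, the integer millisecond timestamps and the sample events. The events are assumed to be sorted by timestamp.

```python
from bisect import bisect_right

def slice_parse(events, start, stop, step):
    """Resample (timestamp, value) events onto an evenly spaced grid by
    holding the most recent value at or before each grid point."""
    times = [t for t, _ in events]
    slices = []
    for t in range(start, stop, step):
        i = bisect_right(times, t)  # number of events with timestamp <= t
        value = events[i - 1][1] if i > 0 else None
        slices.append((t, value))
    return slices

# Event parsing keeps the data as logged: unevenly spaced in time.
events = [(0, 10), (400, 12), (2500, 7)]
# Slice parsing produces one sample per fixed-length time slice.
evenly_spaced = slice_parse(events, start=0, stop=4000, step=1000)
```

In this sketch the evenly spaced output has a fixed length for a given time span regardless of how many events were logged, which is one reason such a representation can be convenient as input to a recurrent network.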
633	Spectral Portfolio Optimisation with LSTM Stock Price Prediction / Spektralportföljsoptimering med LSTM aktieprispredikering Wang, Nancy January 2020 (has links) Nobel Prize-winning modern portfolio theory (MPT) is considered one of the most important and influential economic theories in finance and investment management. MPT assumes investors to be risk-averse and uses the variance of asset returns as a proxy for risk when maximising the performance of a portfolio. Successful portfolio management thus relies on accurate risk estimates and asset return predictions. Risk estimates are commonly obtained through traditional asset pricing factor models, which allow the systematic risk to vary over the time domain but not in the frequency space. This approach can impose limitations in, for instance, risk estimation. To tackle this shortcoming, interest in applications of spectral analysis to financial time series has increased lately. Examples include the novel spectral portfolio theory and the spectral factor model, which demonstrate enhanced portfolio performance through spectral risk estimation [1][11]. Moreover, stock price prediction has always been a challenging task due to the non-linearity and non-stationarity of stock prices. Meanwhile, machine learning has been successfully implemented in a wide range of applications where it is infeasible to accomplish the needed tasks with traditional methods. Recent research has demonstrated significant results in single stock price prediction with LSTM artificial neural networks [6][34]. This study aims to evaluate the combined effect of these two advancements in a portfolio optimisation problem and optimise a spectral portfolio with stock prices predicted by LSTM neural networks. To do so, we began with mathematical derivation and theoretical presentation and then evaluated the portfolio performance generated by the spectral risk estimates and the LSTM stock price predictions, as well as the combination of the two. 
The result demonstrates that the LSTM predictions alone performed better than the combination, which in turn performed better than the spectral risk alone. / Den nobelprisvinnande moderna portföjlteorin (MPT) är utan tvekan en av de mest framgångsrika investeringsmodellerna inom finansvärlden och investeringsstrategier. MPT antar att investerarna är mindre benägna till risktagande och approximerar riskexponering med variansen av tillgångarnasränteavkastningar. Nyckeln till en lyckad portföljförvaltning är därmed goda riskestimat och goda förutsägelser av tillgångspris. Riskestimering görs vanligtvis genom traditionella prissättningsmodellerna som tillåter risken att variera i tiden, dock inte i frekvensrummet. Denna begränsning utgör bland annat ett större fel i riskestimering. För att tackla med detta har intresset för tillämpningar av spektraanalys på finansiella tidsserier ökat de senast åren. Bland annat är ett nytt tillvägagångssätt för att behandla detta den nyintroducerade spektralportföljteorin och spektralfaktormodellen som påvisade ökad portföljenprestanda genom spektralriskskattning [1][11]. Samtidigt har prediktering av aktierpriser länge varit en stor utmaning på grund av dess icke-linjära och icke-stationära egenskaper medan maskininlärning har kunnat använts för att lösa annars omöjliga uppgifter. Färska studier har påvisat signifikant resultat i aktieprisprediktering med hjälp av artificiella LSTM neurala nätverk [6][34]. Detta arbete undersöker kombinerade effekten av dessa två framsteg i ett portföljoptimeringsproblem genom att optimera en spektral portfölj med framtida avkastningar predikterade av ett LSTM neuralt nätverk. Arbetet börjar med matematisk härledningar och teoretisk introduktion och sedan studera portföljprestation som genereras av spektra risk, LSTM aktieprispredikteringen samt en kombination av dessa två. 
Resultaten visar på att LSTM-predikteringen ensam presterade bättre än kombinationen, vilket i sin tur presterade bättre än enbart spektralriskskattningen. Artificial Neural Network LSTM Spectral factor model Portfolio optimisation Stock price prediction Time series analysis Risk estimation Spectral risk Frequency-specific beta decomposition Artificiella neurala nätverk LSTM Spektralfaktormodell Portföljoptimering Aktieprispredikering Tidsserieranalys Riskestimering Spektra risk Frekvensspecifik beta dekomposition Probability Theory and Statistics Sannolikhetsteori och statistik
634	Unsupervised Detection of Interictal Epileptiform Discharges in Routine Scalp EEG : Machine Learning Assisted Epilepsy Diagnosis Shao, Shuai January 2023 (has links) Epilepsy affects more than 50 million people, is one of the most prevalent neurological disorders and has a high impact on the quality of life of those suffering from it. However, 70% of epilepsy patients can live seizure free with proper diagnosis and treatment. Patients are evaluated using scalp EEG recordings, which are cheap and non-invasive. Diagnostic yield is however low, and qualified personnel need to process large amounts of data in order to accurately assess patients. MindReader is an unsupervised classifier which detects spectral anomalies and generates a hypothesis of the underlying patient state over time. The aim is to highlight abnormal, potentially epileptiform states, which could expedite analysis of patients and let qualified personnel attest the results. It was used to evaluate 95 scalp EEG recordings from healthy adults and adult patients with epilepsy. Interictal epileptiform discharges (IEDs) occurring in the samples had been retroactively annotated, along with the patient state and maneuvers performed by personnel, to enable characterization of the classifier’s detection performance. The performance was worse than previous benchmarks on pediatric scalp EEG recordings, with a 7% and 33% drop in specificity and sensitivity, respectively. Electrode positioning and partial spatial extent of events had a notable impact on performance. However, no correlation between annotated disturbances and reduction in performance could be found. Additional explorative analysis was performed on serialized intermediate data to evaluate the analysis design. Hyperparameters and electrode montage options were exposed to optimize for the average Matthews correlation coefficient (MCC) per electrode per patient, on a subset of the patients with epilepsy. 
An increased window length and a reduced amount of training, along with a common average montage, proved most successful. The Euclidean distance of cumulative spectra (ECS), a metric suitable for spectral analysis, and homologous L2 and L1 loss functions were implemented, of which the ECS further improved the average performance for all samples. Four additional analyses, featuring new time-frequency transforms and multichannel convolutional autoencoders, were evaluated, and an analysis using the continuous wavelet transform (CWT) and a convolutional autoencoder (CNN) performed the best, with an average MCC score of 0.19 and 56.9% sensitivity with approximately 13.9 false positives per minute. EEG electroencephalography IED interictal epileptiform discharges spike detection epilepsy unsupervised Fourier transform STFT short-time Fourier transform CWT continuous wavelet transform DWT discrete wavelet transform ML machine learning ANN artificial neural network CNN convolutional neural network autoencoder HMM hidden Markov model ECS Bioinformatics (Computational Biology) Bioinformatik (beräkningsbiologi) Neurology Neurologi
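For readers unfamiliar with the metrics quoted in record 634, the sketch below computes the Matthews correlation coefficient, sensitivity and specificity from confusion-matrix counts using their standard definitions. The function names and the example counts are illustrative assumptions and are not taken from the thesis; returning 0.0 for an undefined MCC is a common convention chosen here.

```python
from math import sqrt

def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts.
    Returns 0.0 when the denominator is zero (a common convention)."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom

def sensitivity(tp, fn):
    """Fraction of true events that were detected."""
    return tp / (tp + fn)

def specificity(tn, fp):
    """Fraction of non-events correctly left unflagged."""
    return tn / (tn + fp)

# Hypothetical counts for one electrode of one recording.
score = mcc(tp=40, tn=900, fp=60, fn=30)
```

MCC ranges from -1 to 1 and uses all four counts, which makes it informative when events such as IEDs are rare compared with background activity.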
635	A Probabilistic Formulation of Keyword Spotting Puigcerver I Pérez, Joan 18 February 2019 (has links) [ES] La detección de palabras clave (Keyword Spotting, en inglés), aplicada a documentos de texto manuscrito, tiene como objetivo recuperar los documentos, o partes de ellos, que sean relevantes para una cierta consulta (query, en inglés), indicada por el usuario, entre una gran colección de documentos. La temática ha recogido un gran interés en los últimos 20 años entre investigadores en Reconocimiento de Formas (Pattern Recognition), así como bibliotecas y archivos digitales. Esta tesis, en primer lugar, define el objetivo de la detección de palabras clave a partir de una perspectiva basada en la Teoría de la Decisión y una formulación probabilística adecuada. Más concretamente, la detección de palabras clave se presenta como un caso particular de Recuperación de la Información (Information Retrieval), donde el contenido de los documentos es desconocido, pero puede ser modelado mediante una distribución de probabilidad. Además, la tesis también demuestra que, bajo las distribuciones de probabilidad correctas, el marco de trabajo desarrollada conduce a la solución óptima del problema, según múltiples medidas de evaluación utilizadas tradicionalmente en el campo. Más tarde, se utilizan distintos modelos estadísticos para representar las distribuciones necesarias: Redes Neuronales Recurrentes o Modelos Ocultos de Markov. Los parámetros de estos son estimados a partir de datos de entrenamiento, y las respectivas distribuciones son representadas mediante Transductores de Estados Finitos con Pesos (Weighted Finite State Transducers). Con el objetivo de hacer que el marco de trabajo sea práctico en grandes colecciones de documentos, se presentan distintos algoritmos para construir índices de palabras a partir de modelos probabilísticos, basados tanto en un léxico cerrado como abierto. 
Estos índices son muy similares a los utilizados por los motores de búsqueda tradicionales. Además, se estudia la relación que hay entre la formulación probabilística presentada y otros métodos de gran influencia en el campo de la detección de palabras clave, destacando cuáles son las limitaciones de los segundos. Finalmente, todas la aportaciones se evalúan de forma experimental, no sólo utilizando pruebas académicas estándar, sino también en colecciones con decenas de miles de páginas provenientes de manuscritos históricos. Los resultados muestran que el marco de trabajo presentado permite construir sistemas de detección de palabras clave muy rápidos y precisos, con una sólida base teórica. / [CA] La detecció de paraules clau (Keyword Spotting, en anglès), aplicada a documents de text manuscrit, té com a objectiu recuperar els documents, o parts d'ells, que siguen rellevants per a una certa consulta (query, en anglès), indicada per l'usuari, dintre d'una gran col·lecció de documents. La temàtica ha recollit un gran interés en els últims 20 anys entre investigadors en Reconeixement de Formes (Pattern Recognition), així com biblioteques i arxius digitals. Aquesta tesi defineix l'objectiu de la detecció de paraules claus a partir d'una perspectiva basada en la Teoria de la Decisió i una formulació probabilística adequada. Més concretament, la detecció de paraules clau es presenta com un cas concret de Recuperació de la Informació (Information Retrieval), on el contingut dels documents és desconegut, però pot ser modelat mitjançant una distribució de probabilitat. A més, la tesi també demostra que, sota les distribucions de probabilitat correctes, el marc de treball desenvolupat condueix a la solució òptima del problema, segons diverses mesures d'avaluació utilitzades tradicionalment en el camp. Després, diferents models estadístics s'utilitzen per representar les distribucions necessàries: Xarxes Neuronal Recurrents i Models Ocults de Markov. 
Els paràmetres d'aquests són estimats a partir de dades d'entrenament, i les corresponents distribucions són representades mitjançant Transductors d'Estats Finits amb Pesos (Weighted Finite State Transducers). Amb l'objectiu de fer el marc de treball útil per a grans col·leccions de documents, es presenten distints algorismes per construir índexs de paraules a partir dels models probabilístics, tan basats en un lèxic tancat com en un obert. Aquests índexs són molt semblants als utilitzats per motors de cerca tradicionals. A més a més, s'estudia la relació que hi ha entre la formulació probabilística presentada i altres mètodes de gran influència en el camp de la detecció de paraules clau, destacant algunes limitacions dels segons. Finalment, totes les aportacions s'avaluen de forma experimental, no sols utilitzant proves acadèmics estàndard, sinó també en col·leccions amb desenes de milers de pàgines provinents de manuscrits històrics. Els resultats mostren que el marc de treball presentat permet construir sistemes de detecció de paraules clau molt acurats i ràpids, amb una sòlida base teòrica. / [EN] Keyword Spotting, applied to handwritten text documents, aims to retrieve the documents, or parts of them, that are relevant for a query, given by the user, within a large collection of documents. The topic has gained a large interest in the last 20 years among Pattern Recognition researchers, as well as digital libraries and archives. This thesis, first defines the goal of Keyword Spotting from a Decision Theory perspective. Then, the problem is tackled following a probabilistic formulation. More precisely, Keyword Spotting is presented as a particular instance of Information Retrieval, where the content of the documents is unknown, but can be modeled by a probability distribution. 
In addition, the thesis also proves that, under the correct probability distributions, the framework provides the optimal solution under many of the evaluation measures traditionally used in the field. Later, different statistical models are used to represent the probability distribution over the content of the documents. These models, Hidden Markov Models or Recurrent Neural Networks, are estimated from training data, and the corresponding distributions over the transcripts of the images can be efficiently represented using Weighted Finite State Transducers. In order to make the framework practical for large collections of documents, this thesis presents several algorithms to build probabilistic word indexes, using both lexicon-based and lexicon-free models. These indexes are very similar to the ones used by traditional search engines. Furthermore, we study the relationship between the presented formulation and other seminal approaches in the field of Keyword Spotting, highlighting some limitations of the latter. Finally, all the contributions are evaluated experimentally, not only on standard academic benchmarks, but also on collections including tens of thousands of pages of historical manuscripts. The results show that the proposed framework and algorithms make it possible to build very accurate and very fast Keyword Spotting systems with a solid underlying theory. / Puigcerver I Pérez, J. (2018). A Probabilistic Formulation of Keyword Spotting [Tesis doctoral]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/116834 Keyword Spotting Handwritten Text Recognition Information Retrieval Pattern Recognition Image Retrieval Probabilistic Text Indexing Historical Manuscripts Weighted Finite State Transducer Hidden Markov Model Artificial Neural Network LENGUAJES Y SISTEMAS INFORMATICOS
636	Design of an electrocardiographic lead reconstruction algorithm using machine learning in the context of ambulatory monitoring Grande Fidalgo, Alejandro 16 January 2025 (has links) [ES] Esta tesis doctoral presenta un algoritmo para reconstruir el registro electrocardiográfico (ECG) estándar del sistema de 12 derivaciones utilizando un sistema reducido de derivaciones independientes mediante el uso de modelos de aprendizaje automático, centrándose en su integración en un sistema de monitorización ambulatoria. Los métodos tradicionales de reconstrucción de ECG se basan en enfoques basados en combinaciones lineales, con una exploración limitada de los métodos de evaluación y de las posiciones de los electrodos. Esta tesis evalúa la eficacia de nuevas redes neuronales artificiales y algoritmos basados en fuzzy c-means en comparación con los métodos clásicos de regresión lineal, destacando un rendimiento superior y subrayando la importancia de la explicabilidad del modelo. Se exploran otras mejoras, como comités de expertos y modelos difusos, para aumentar la precisión y la eficacia. La validación clínica realizada en el Hospital Clínico Universitario de València y en el Hospital General Universitario de València demuestran la eficacia del algoritmo en la reconstrucción precisa de derivaciones, facilitando el camino para aplicaciones de monitorización ambulatoria. El estudio también aborda los retos que plantean dispositivos implantables como marcapasos y desfibriladores; un estudio posterior propone una estrategia para eliminar pulsos distorsionados durante la reconstrucción, mejorando la calidad de la señal en cualquier condición. En conjunto, la tesis contribuye al avance de las metodologías de reconstrucción de derivaciones de ECG para mejorar la atención al paciente. 
/ [CA] Aquesta tesi doctoral presenta un algoritme per a reconstruir el registre electrocardiogràfic (ECG) estàndard del sistema de 12 derivacions utilitzant un sistema reduït de derivacions independents mitjançant l'ús de models d'aprenentatge automàtic, centrant-se en la seua integració en un sistema de monitoratge ambulatori. Els mètodes tradicionals de reconstrucció de ECG es basen en enfocaments basats en combinacions lineals, amb una exploració limitada dels mètodes d'avaluació i de les posicions dels elèctrodes. Aquesta tesi avalua l'eficàcia de noves xarxes neuronals artificials i algoritmes basats en fuzzy c-means en comparació amb els mètodes clàssics de regressió lineal, destacant un rendiment superior i subratllant la importància de la explicabilitat del model. S'exploren altres millores, com a comités d'experts i models difusos, per a augmentar la precisió i l'eficàcia. La validació clínica realitzada a l'Hospital Clínic Universitari de València i a l'Hospital General Universitari de València demostren l'eficàcia de l'algoritme en la reconstrucció precisa de derivacions, facilitant el camí per a aplicacions de monitoratge ambulatori. L'estudi també aborda els reptes que plantegen dispositius implantables com a marcapassos i desfibril·ladors; un estudi posterior proposa una estratègia per a eliminar polsos distorsionats durant la reconstrucció, millorant la qualitat del senyal en qualsevol condició. En conjunt, la tesi contribueix a l'avanç de les metodologies de reconstrucció de derivacions de ECG per a millorar l'atenció al pacient. / [EN] This PhD Thesis presents an algorithm for reconstructing the standard 12-lead system electrocardiographic (ECG) register using a reduced system of independent leads supported by machine learning models, with a focus on its integration into an ambulatory monitoring system. 
Traditional ECG lead reconstruction methods have relied on approaches based on linear combinations, with limited exploration of evaluation methods and electrode positions. This thesis evaluates the effectiveness of new artificial neural networks and fuzzy c-means based algorithms compared to classical linear regression methods, highlighting superior performance and emphasizing the importance of model explainability. Further enhancements, including expert committees and fuzzy models, are explored to improve accuracy and efficiency. Clinical validation at the Hospital Clínico Universitario de València and Hospital General Universitario de València demonstrates the algorithm's effectiveness in accurate lead reconstruction, paving the way for ambulatory monitoring applications. The study also addresses challenges posed by implantable devices such as pacemakers and defibrillators; a subsequent study proposes a strategy to eliminate distorted pulses during reconstruction, improving signal quality under any condition. Overall, the thesis contributes to advancing ECG lead reconstruction methodologies for improved patient care. / Grande Fidalgo, A. (2024). Design of an electrocardiographic lead reconstruction algorithm using machine learning in the context of ambulatory monitoring [Tesis doctoral]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/214023 Cardiología Electrocardiograma Monitorización ambulatoria Inteligencia Artificial (IA) Redes neuronales artificiales Fuzzy C-means Cardiovascular diseases Electrocardiogram Ambulatory monitoring Lead reconstruction Artificial neural network Standard 12-lead system
637	Applying Artificial Neural Networks to Reduce the Adaptation Space in Self-Adaptive Systems : an exploratory work Buttar, Sarpreet Singh January 2019 (has links) Self-adaptive systems have limited time to adjust their configurations whenever their adaptation goals, i.e., quality requirements, are violated due to runtime uncertainties. Within the available time, they need to analyze their adaptation space, i.e., a set of configurations, to find the best adaptation option, i.e., configuration, that can achieve their adaptation goals. Existing formal analysis approaches find the best adaptation option by analyzing the entire adaptation space. However, exhaustive analysis requires time and resources and is therefore only efficient when the adaptation space is small. The adaptation space often contains hundreds or thousands of options, which makes formal analysis approaches inefficient in large-scale self-adaptive systems. In this thesis, we tackle this problem by presenting an online learning approach that enables formal analysis approaches to analyze large adaptation spaces efficiently. The approach integrates with the standard feedback loop and reduces the adaptation space to a subset of adaptation options that are relevant to the current runtime uncertainties. The subset is then analyzed by the formal analysis approaches, which allows them to complete the analysis faster and more efficiently within the available time. We evaluate our approach on two different instances of an Internet of Things application. The evaluation shows that our approach dramatically reduces the adaptation space and analysis time without compromising the adaptation goals. 
Self-Adaptive Systems Self-Adaptation Architecture-Based Adaptation Autonomous Systems Cyber-Physical Systems CPS DeltaIoT IoT ActivFORMS MAPE-K Feedback Loop Runtime Uncertainties Adaptation Space Analysis Machine Learning Artificial Neural Network ANN Online Learning Deep Learning Online Supervised Learning Incremental Learning Classification Multi-Layer Perceptron MLP Computer Sciences Datavetenskap (datalogi) Control Engineering Reglerteknik Software Engineering Programvaruteknik
638	Porovnání klasifikačních metod / Comparison of Classification Methods Dočekal, Martin January 2019 (has links) This thesis deals with a comparison of classification methods. First, classification methods based on machine learning are described; then a classifier comparison system is designed and implemented. The thesis also describes several classification tasks and datasets on which the designed system is tested. The classification tasks are evaluated according to standard metrics. The thesis also presents the design and implementation of a classifier based on the principle of evolutionary algorithms.
639	Analýza dat síťové komunikace mobilních zařízení / Analysis of Mobile Devices Network Communication Data Abraham, Lukáš January 2020 (has links) The thesis first describes the DNS and SSL/TLS protocols, focusing mainly on communication between devices using these protocols. It then covers data preprocessing and data cleaning, followed by basic data mining techniques such as data classification, association rules, information retrieval, regression analysis and cluster analysis. The next chapter explains how to identify mobile devices on the network. The data sets used in the practical part, which contain data collected from communication over the above-mentioned protocols, are then evaluated. Finally, the thesis presents the design of a system for analyzing network communication data, describes the libraries used and the entire system implementation, and reports and evaluates a large number of experiments.
640	Detekce logopedických vad v řeči / Detection of Logopaedic Defects in Speech Pešek, Milan January 2009 (has links) The thesis deals with the design and implementation of software for detecting logopaedic speech defects. Because such defects need to be detected early, the software is aimed at child speakers. The introductory part describes the theory of speech production, modelling of speech production for numerical processing, phonetics, logopaedics and basic logopaedic speech defects. It also describes the methods used for feature extraction, for segmentation of words into speech sounds and for classification of features into a correct or incorrect pronunciation class. The next part of the thesis presents the results of testing the selected methods. MFCC and PLP features are extracted for recognising logopaedic speech defects. Words are segmented into speech sounds using the Differential Function method. The extracted features of a sound are classified into a correct or incorrect pronunciation class with one of the tested pattern recognition methods: k-NN, SVM, ANN and GMM.
