• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 2
  • 1
  • Tagged with
  • 3
  • 3
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
1

Vocal Frequency Estimation and Voicing State Prediction with Surface EMG Pattern Recognition

De Armas, Winston 11 July 2013 (has links)
Most electrolarynges do not allow hands-free use or pitch modulation. This study presents the potential of pattern recognition to support electrolarynx use by predicting fundamental frequency (F0) and voicing state (VS) from neck surface EMG and respiratory trace. Respiratory trace and neck surface EMG were collected from 10 normal, adult males (18-60 years old) during different vocal tasks. Time-domain features were extracted from both signals, and a Support Vector Machine (SVM) classifier was employed to model F0 and VS. An average mean-squared-error (MSE) of 8.21 ± 3.5 semitones2 was achieved for the estimation of vocal frequency. An average classification accuracy of 78.05 ± 6.3 % was achieved for the prediction of voicing state from EMG and 65.24 ± 7.8 % from respiratory trace. Our results show that pattern classification of neck-muscle EMG and respiratory trace has merit in the prediction of F0 and VS during vocalization.
2

Vocal Frequency Estimation and Voicing State Prediction with Surface EMG Pattern Recognition

De Armas, Winston 11 July 2013 (has links)
Most electrolarynges do not allow hands-free use or pitch modulation. This study presents the potential of pattern recognition to support electrolarynx use by predicting fundamental frequency (F0) and voicing state (VS) from neck surface EMG and respiratory trace. Respiratory trace and neck surface EMG were collected from 10 normal, adult males (18-60 years old) during different vocal tasks. Time-domain features were extracted from both signals, and a Support Vector Machine (SVM) classifier was employed to model F0 and VS. An average mean-squared-error (MSE) of 8.21 ± 3.5 semitones2 was achieved for the estimation of vocal frequency. An average classification accuracy of 78.05 ± 6.3 % was achieved for the prediction of voicing state from EMG and 65.24 ± 7.8 % from respiratory trace. Our results show that pattern classification of neck-muscle EMG and respiratory trace has merit in the prediction of F0 and VS during vocalization.
3

Analysis of vocal tremor in normophonic and dysphonic speakers

Mertens, Christophe 09 October 2015 (has links)
The study concerns the analysis of vocal cycle length perturbations in normophonic and dysphonic speakers.A method for tracking cycle lengths in voiced speech is proposed. The speech cycles are detected via the saliences of the speech signal samples, defined as the length of the temporal interval over which a sample is a maximum. The tracking of the cycle lengths is based on a dynamic programming algorithm that does not request that the signal is locally periodic and the average period length known a priori.The method is validated on a corpus of synthetic stimuli. The results show a good agreement between the extracted and the synthetic reference length time series. The method is able to track accurately low-frequency modulations and ast cycle-to-cycle perturbations of up to 10% and 4% respectively over the whole range of vocal frequencies. Robustness with regard to the background noise has lso been tested. The results indicate that the tracking is reliable for signal-to-noise ratios higher than 15dB.A method for analyzing the size of the cycle length perturbations as well as their frequency is proposed. The cycle length time series is decomposed into a sum of oscillating components by empirical mode decomposition the instantaneous envelopes and frequencies of which are obtained via AM-FM decomposition. Based on their average instantaneous frequencies, the empirical modes are then assigned to four categories (declination, physiological tremor, neurological tremor as well as cycle length jitter) and added within each. The within-category size of the cycle length perturbations is estimated via the standard deviation of the empirical mode sum divided by the average cycle length. The neurological tremor modulation frequency and bandwidth are obtained via the instantaneous frequencies and amplitudes of empirical modes in the neurological tremor category and summarized via a weighted instantaneous frequency probability density, compensating for the effects of mode mixing.The method is applied to two corpora of vowels comprising 123 and 74 control and 456 and 205 Parkinson speaker recordings respectively. The results indicate that the neurological tremor modulation depth is statistically significantly higher for female Parkinson speakers than for female control speakers. Neurological tremor frequency differs statistically significantly between male and female speakers and increases statistically significantly for the pooled Parkinson speakers compared to the pooled control speakers. Finally, the average vocal frequency increases for male Parkinson speakers and decreases for female Parkinson speakers, compared to the control speakers. / Doctorat en Sciences de l'ingénieur et technologie / info:eu-repo/semantics/nonPublished

Page generated in 0.4039 seconds