• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 337
  • 167
  • 52
  • 30
  • 25
  • 23
  • 12
  • 12
  • 12
  • 12
  • 12
  • 11
  • 11
  • 7
  • 6
  • Tagged with
  • 795
  • 311
  • 183
  • 181
  • 163
  • 117
  • 117
  • 98
  • 96
  • 58
  • 57
  • 56
  • 53
  • 50
  • 50
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
601

Perception auditive, visuelle et audiovisuelle des voyelles nasales par les adultes devenus sourds. Lecture labiale, implant cochléaire, implant du tronc cérébral. / Auditory, visual and auditory-visual perception of nasal vowels by deafened adults : Speechareading, Cochlear Implant, Auditory Brainstem Implant

Borel, Stéphanie 14 January 2015 (has links)
Cette thèse porte sur la perception visuelle, auditive et audiovisuelle des voyelles nasales [ɑ̃] (« lent »),[ɔ̃] (« long ») et [ɛ̃] (« lin ») par des adultes devenus sourds, implantés cochléaires et implantés dutronc cérébral. L’étude sur la perception visuelle des voyelles, auprès de 22 adultes devenus sourds,redéfinit les sosies labiaux des voyelles nasales et propose une mise à jour de la classification desvisèmes. Trois études sur l’identification auditive des voyelles nasales auprès de 82, 15 et 10 adultesimplantés cochléaires mettent en évidence leur difficulté à reconnaitre les trois voyelles nasales, qu’ilsperçoivent comme des voyelles orales. Les analyses acoustiques et perceptives suggèrent que lesadultes implantés cochléaires s’appuient sur les informations fréquentielles des deux premiers picsspectraux mais négligent les informations d’intensité relative de ces pics. D’après l’étude menéeauprès de 13 adultes implantés du tronc cérébral, des informations acoustiques linguistiques sonttransmises par l’implant du tronc cérébral mais la fusion entre les informations auditives et visuellespourrait être optimisée pour l’identification des voyelles. Enfin, une enquête auprès de 179orthophonistes pointe le besoin d’une information sur la définition phonétique articulatoire actualiséedes voyelles [ɑ̃] et [ɛ̃]. / This thesis focuses on the visual, auditory and auditory-visual perception of french nasal vowels [ɑ̃](« lent »), [ɔ̃] (« long ») and [ɛ̃] (« lin ») by Cochlear Implant (CI) and Auditory Brainstem Implant(ABI) adults users. The study on visual perception of vowels, with 22 deafened adults, redefines thelip configuration of french nasal vowels and provides an update of the classification of vocalic visualphonemes. Three studies on auditory identification of nasal vowels with 82, 15 and 10 CI usershighlight their difficulty in recognizing the three nasal vowels, which they perceive as oral vowels.Acoustic and perceptual analyzes suggest that adults with CI rely on frequency informations of thefirst two spectral peaks but miss the informations of relative intensity of these peaks. The study with13 ABI users show that some linguistic acoustic cues are transmitted by the ABI but the fusion ofauditory and visual features could be optimized for the identification of vowels. Finally, a survey of179 Speech Language and Hearing Therapists show the need of an update on the phonetic articulationof french nasal vowels [ɑ̃] and [ɛ̃].
602

The punctuation and intonation of parentheticals

Bodenbender, Christel 17 May 2010 (has links)
From a historical perspective, punctuation marks are often assumed to only represent some of the phonetic structure of the spoken form of that text. It has been argued recently that punctuation today is a linguistic system that not only represents some of the phonetic sentence structure but also syntactic as well as semantic information. One case in point is the observation that the semantic difference in differently punctuated parenthetical phrases is not reflected in the intonation contour. This study provides the acoustic evidence for this observation. Furthermore, this study makes recommendations to achieve natural-sounding text-to-speech output for English parentheticals by incorporating the study's findings with respect to parenthical intonation. The experiment conducted for this study involved three male and three female native speakers of Canadian English reading aloud a set of 20 sentences with parenthetical and non-parenthetical phrases. These sentences were analyzed with respect to acoustic characteristics due to differences in punctuation as well as due to differences between parenthetical and non-parenthetical phrases. A number of conclusions were drawn based on the results of the experiment: (1) a difference in punctuation, although entailing a semantic difference, is not reflected in the intonation pattern; (2) in contrast to the general understanding that parenthetical phrases are lower-leveled and narrower in pitch range than the surrounding sentence, this study shows that it is not the parenthetical phrase itself that is implemented differently from its non-parenthetical counterpart; rather, the phrase that precedes the parenthetical exhibits a lower baseline and with that a wider pitch range than the corresponding phrase in a non-parenthetical sentence; (3) sentences with two adjacent parenthetical phrases or one embedded in the other exhibit the same pattern for the parenthetical-preceding phrase as the sentences in (2) above and a narrowed pitch range for the parenthetical phrases that are not in the final position of the sequence of parentheticals; (4) no pausing pattern could be found; (5) the characteristics found for parenthetical phrases can be implemented in synthesized speech through the use of SABLE speech markup as part of the SABLE speech synthesis system. This is the first time that the connection between punctuation and intonation in parenthetical sentences has been investigated; it is also the first look at sentences with more than one parenthetical phrase. This study contributes to our understanding of the intonation of parenthetical phrases in English and their implementation in text-to-speech systems, by providing an analysis of their acoustic characteristics.
603

A nativização de termos de informática do inglês no português brasileiro: uma análise fonológica

Cardoso, João Henrique da Costa 08 March 2005 (has links)
This work deals with the issue of nativization. That is to say we observe the interference caused by the system of sounds of Portuguese (L1) on the pronunciation, by native speakers of Portuguese, of some terms of English (L2). The words in question were drawn out of the technical lexicon of Informatics. In order to carry on the contrastive study between the two languages, we utilized the Contrastive Analysis (CA) methods and postulates. The description and specific analysis of the transfer of traits of the native system to the pronunciation of the loanwords, mostly to explain the differences between the two phonological inventories, is founded on the presuppositions and concepts of Phonology, basically of a structural point of view, but also by taking in account some findings of the Standard Generative Phonology. When we refer to phenomena related to the syllabic structure, the framework utilized is that of the Metrical Phonology. The results of the study point to the fact that the transfer of traits is due to the differences between the two systems of sounds, both in terms of inventories of phonemes and differences in their syllabic patterns. In the very conclusion of this work, we present some rules that intend formalize the main patterns of nativization from English to Portuguese. / Este trabalho trata da questão da interferência que o sistema fonológico de uma língua nativa causa no desempenho de termos de uma língua estrangeira que foram nativizados. Procura observar, especificamente, a interferência do sistema de sons do Português (L1) no desempenho, por falantes nativos, de termos do Inglês (L2) recortados do vocabulário técnico de informática. Para a realização do estudo contrastivo entre as duas línguas foram utilizados os métodos da Análise Contrastiva (Contrastive Analysis CA), e para a análise dos fatos observados levou-se em conta os pressupostos da fonologia gerativa para explicar as diferenças entre os inventários fonológicos das duas línguas e os pressupostos da fonologia métrica para explicar os fenômenos referentes à sílaba. Os resultados da pesquisa indicam que a interferência se dá pelo fato de os inventários das duas línguas serem diferentes e por elas possuírem diferentes padrões silábicos. Como conclusão, são apresentadas algumas regras que têm a intenção de formalizar os principais modelos de nativização do Inglês para o Português.
604

Pronunciar para comunicar: uma investigação do efeito do ensino explícito da pronúncia na sala de aula de LE

LIMA JÚNIOR, Ronaldo Mangueira January 2008 (has links)
LIMA JÚNIOR, Ronaldo Mangueira. Pronunciar para comunicar: uma investigação do efeito do ensino explícito da pronúncia na sala de aula de LE. 2008. 243f. – Dissertação (Mestrado) – Universidade de Brasília, Programa de Mestrado em Linguística Aplicada, Brasília (DF), 2008. / Submitted by anizia almeida (aniziaalmeida80@gmail.com) on 2016-06-23T12:20:43Z No. of bitstreams: 1 2008_dis_rmlimajr.pdf: 3211739 bytes, checksum: 52f93705e746f46fb3e140662fee9ec4 (MD5) / Approved for entry into archive by Márcia Araújo (marcia_m_bezerra@yahoo.com.br) on 2016-06-27T21:14:18Z (GMT) No. of bitstreams: 1 2008_dis_rmlimajr.pdf: 3211739 bytes, checksum: 52f93705e746f46fb3e140662fee9ec4 (MD5) / Made available in DSpace on 2016-06-27T21:14:18Z (GMT). No. of bitstreams: 1 2008_dis_rmlimajr.pdf: 3211739 bytes, checksum: 52f93705e746f46fb3e140662fee9ec4 (MD5) Previous issue date: 2008 / This study aimed at investigating the effects of explicit instruction, as well as the durability of such effects, in foreign language teaching. It is believed pronunciation instruction ought to be planned taking into consideration the specif difficulties that leaners' native language imposes, especially in Brasilian teaching context in which this research was carried out, since most foreign language classrooms in Brazil have homogeneity concerning the students' monthe tongue. Therefore, the approach chosen was the interventionist aqction research, which had as participants two classes of basic level, teenager learners of english as a foreign language at a binational center, where students have english classes as an extra-curricular activity. In one of the classes there was intervention of weekly explicit lessons of pronunciation for one semester. All participants were recorded once before and twice after the intervenions, one shortly after, so that both the immediate and the long-term effects of the explicit instruction could be assessed. All recordings were phonetically transcribed an analyzed having as basic second language acquisition and phenetics and phonology theories. The results indicate that, among other conclusions, there are positive effcts of explicit pronunciation teaching and that these effects are durable. / O presente estudo visou a investigar os efeitos, assim como a durabilidade desses efeitos, do ensino explicito dos aspectos fonético-fonológicos em aulas de língua estrangeira. Partiu do pressuposto de que a aula de pronúncia deve ser planejada considerando-se as dificuldades específicas que a língua nativa dos aprendizes, dispõem, principalmente no contexto brasileiro, no qual a presente pesquisa foi conduzida, visto que a maioria das salas de aula de língua estrangeira no Brasil apresenta uniformidade quanto a língua mãe dos aprendizes. Foi conduzida, portanto, uma pesquisa-ação intervencionista que teve como participantes de pesquisa duas turmas de aprendizes pré-adolescentes de nível básico de um centro binacional, onde os alunos tem aula de inglês como atividade extra-curricular. Em uma das turmas houve intervenção de aulas explícitas de pronúncias semanais durante o semestre. Todos os participantes foi gravando uma vez antes e duas vezes após as intervenções, uma logo em seguida e outra 11 meses depois , para que pudessem ser avaliados os efeitos imediatos e de longo prazo da instrução explicita conduzida. Todas as gravações foram transcritas foneticamente e analisadas a luz de teorias de Aquisição de segunda língua e de Fonética e Fonologia. Os resultados indicam, entre outras conclusões, que há efeitos positivos na instrução explicita da pronúncia e que esses efeitos são duráveis.
605

Identifikace mluvčího v temporální doméně řeči / Speaker identification in the temporal domain of speech

Weingartová, Lenka January 2015 (has links)
This thesis aims to thoroughly describe the temporal characteristics of spoken Czech by means of phone durations and their changes under the influence of several prosodic and segmental factors, such as position in a higher unit (syllable, word or prosodic phrase), length of the higher unit, segmental environment, structure of the syllable or phrase-final lengthening. The speech material comes from a semi-spontaneous corpus of scripted dialogues comprising 4046 utterances by 34 speakers. The descriptions are afterwards used for the creation of a rule-based temporal model, which provides a baseline for analysing local articulation rate contours and their speaker-specificity. The results indicate, that systematic speaker-specific differences can be found in the segmental domain, as well as in the temporal contours. Moreover, speaker identification potential of articulation rate and global temporal features is also assessed. Keywords: temporal characteristics, temporal modelling, phone duration, speaker identification, Czech
606

Sociophonologie de l'anglais contemporain en Nouvelle-Zélande : corpus et dynamique des systèmes / Sociophonology of contemporary New Zealand English : corpus and system dynamics

Viollain, Cécile 28 November 2014 (has links)
La présente thèse propose une description multidimensionnelle (phonologique, phonéticoacoustique et sociolinguistique) des caractéristiques phonético-phonologiques de l’anglais néo-zélandais (NZE) contemporain ainsi qu’une étude théorique et empirique de l’évolution de cette variété. Notre travail de recherche s’inscrit dans le cadre du programme PAC (Phonologie de l’Anglais Contemporain : usages, variétés et structure) et se fonde sur les données authentiques et récentes du corpus PAC Nouvelle-Zélande que nous avons constitué à Dunedin, la capitale de l’Otago, au sud de l’île du Sud de la Nouvelle-Zélande. Notre analyse se concentre sur deux phénomènes qui permettent d’étudier la variation et le changement en NZE : la rhoticité et le ‘r’ de sandhi, ainsi que les changements vocaliques impliquant notamment les voyelles antérieures brèves des ensembles lexicaux KIT, DRESS et TRAP. En nous appuyant sur une étude phonético-acoustique des voyelles produites par les locuteurs du corpus PAC-NZ, nous proposons une modélisation des changements impliquant ces voyelles dans le cadre de la Phonologie de Dépendance. Nous intégrons également une réflexion théorique sur les modélisations linguistiques et sociolinguistiques qui ont été proposées dans la littérature sur le changement linguistique en général, et sur l’évolution du NZE en particulier, et montrons la nécessité d’intégrer des facteurs internes et externes pour rendre compte de l’évolution d’une variété comme le NZE contemporain. / This thesis offers a multidimensional description (phonological, phonetic-acoustic and sociolinguistic) of the phonetic and phonological characteristics of contemporary New Zealand English (NZE) as well as a theoretical and empirical study of its evolution. Our work fits into the framework of the PAC program (Phonology of Contemporary English: usage, varieties and structure) and is based on the recent and authentic data collected for the PAC New Zealand corpus recorded in Dunedin, the capital of Otago, in the south of the South island of New Zealand. Our analysis focuses on two phenomena that allow us to study variation and change in NZE: rhoticity and sandhi-r, as well as vocalic shifts, which notably involve the short front vowels in the lexical sets of KIT, DRESS and TRAP. On the basis of a phonetic-acoustic study of the vowels produced by the PAC-NZ informants, we provide an account of the shifts involving these vowels within the framework of Dependency Phonology. We also integrate a theoretical reflection on the linguistic and sociolinguistic accounts that have been presented in the literature on linguistic change generally and on the evolution of NZE specifically, and show that it is necessary to take internal as well as external factors into account when modeling the evolution of a variety such as contemporary NZE.
607

Aquisição fonológica de fricativas por crianças com transtorno fonológico: uma investigação acústica

Corrêa, Alessandra Pagliuso dos Santos [UNESP] 26 August 2013 (has links) (PDF)
Made available in DSpace on 2014-08-27T14:36:47Z (GMT). No. of bitstreams: 0 Previous issue date: 2013-08-26Bitstream added on 2014-08-27T15:57:14Z : No. of bitstreams: 1 000731266.pdf: 2249574 bytes, checksum: 40b26d7fb5eb25c7c6d095bb7962b473 (MD5) / O presente trabalho versa sobre a presença de contrastes encobertos na fala de crianças com transtorno fonológico. Na literatura, os contrastes encobertos são descritos como produções que, apesar de apresentarem resultados perceptivo-auditivos idênticos/semelhantes, revelam, a partir da análise acústica, diferenças sutis. De maneira mais específica, este estudo busca observar se há preferência, por parte das crianças com transtorno fonológico, pela manipulação de pistas acústicas que não são robustas para o Português Brasileiro, na tentativa de distinguir os fones fricativos. Para tanto, foram utilizadas cinco gravações em áudio, da fala de cinco crianças entre 4 e 5 anos com transtorno fonológico, que apresentavam as chamadas “substituições fônicas” envolvendo a classe de sons das fricativas. Os dados foram coletados utilizando-se o Instrumento para Avaliação de Fala para Análise Acústica – IAFAC, gravados em cabine acústica, socilitando a cada criança cinco repetições das 96 palavras que compõem o instrumento. Os dados foram editados e analisados com o uso do software PRAAT. Foi realizada uma transcrição fonética da primeira repetição (R1) de cada criança, por três juízes, e considerada a concordância de 66%. A partir desta transcrição, foi realizado o cálculo do grau de severidade do transtorno fonológico por meio do PCC-R. Em seguida, realizaram-se a análise fonológica contrastiva da produção das cinco crianças e a análise acústica de todas as “substituições” envolvendo a classe de sons das fricativas. Para a análise acústica, os seguintes parâmetros foram adotados: limite inferior do pico espectral, centróide, variância, assimetria, curtose e duração. Após a análise acústica, verificou-se a existência de contrastes encobertos nas produções tidas como homófonas auditivamente, representando um total de 54% do total das ... / The present study focuses on the presence of covert contrasts in the speech of children with phonological disorder. The covert contrasts are described in the literature as productions that, despite showing auditory perception results identical/similar reveal, from the acoustic analysis, subtle differences. More specifically, this study to observe whether there is a preference, on the part of children with phonological disorders, on the manipulation of acoustic cues that are not robust to Brazilian Portuguese in an attempt to distinguish the fricative phones. Five audio recordings of the speech of five children with speech disorder between 4 and 5 years old who presented the so-called phonic substitution involving the sound class of the fricatives were used. These data were collected using the Speech Assessment Instrument for Acoustic Analysis – SAIAA (IAFAC), recorded in a soundproof booth, requesting each child five repetitions out of the 96 words that make up the instrument. The data were edited and analyzed using the software PRAAT. A phonetic transcription of the first repetition (R1) of each child were performed by three judges and considered the agreement of 66%. From this transcription the degree of severity of phonological disorder was calculated through the PCC-R. Posteriorly, contrastive phonological analysis of the production of the five children was carried out and, finally, acoustic analysis of all the substitutions was performed involving the sound class of the fricatives. For the acoustic analysis the following parameters were used: the lower limit of the spectral peak, centre of gravity, variance, skewness, kurtosis and duration. After acoustic analysis, we could verify the existence of covert contrast in the productions as homophones aurally taken by the judges, representing a total of 54% of total substitutions identified through impressionistic approach by the judges ...
608

O português brasileiro cantado: normas de 1938 e 2007, análise compárativa para interpretação de obras vocais em idioma brasileiro

Stolagli, Juliana Starling [UNESP] 20 December 2010 (has links) (PDF)
Made available in DSpace on 2014-06-11T19:23:08Z (GMT). No. of bitstreams: 0 Previous issue date: 2010-12-20Bitstream added on 2014-06-13T20:50:07Z : No. of bitstreams: 1 stolagli_js_me_ia.pdf: 6632356 bytes, checksum: c37a676a36f64f7a6822fe446b416d7f (MD5) / Universidade Estadual Paulista (UNESP) / Buscou-se neste trabalho a recuperação histórica da pronúncia do português brasileiro cantado, tal como proposta nas normas expostas nos Anais do Primeiro Congresso da Língua Nacional Cantada, de 1938, bem como a realização de uma análise prático-comparativa destas com as normas atuais, publicadas em 2007, destacando os principais pontos que as distinguem e elementos que proporcionam modificações na interpretação de canções em idioma brasileiro. O estudo baseado em documentos históricos e na investigação das circunstâncias que favoreceram a normalização do português brasileiro cantado teve, em sua fase prática, a realização de dois recitais e a gravação de um CD demonstrativo, com a execução de peças cujas pronúncias estão fundamentadas nas normas de 1938 e 2007, buscando evidenciar os elementos de divergência entre elas / The aim of this research is to recover the historical pronunciation of Brazilian Portuguese sung as proposed in the standards set out in the Annals of the First Congress of the National Language as Sung (1° Congresso da Língua Nacional Cantada), in 1938, and the realization of a practical-comparative analysis between these and the current norms, published in 2007, highlighting the main points that distinguish them and evidences that provide changes in the interpretation of songs in Brazilian Portuguese. The study, based on historical documents and investigation of the circumstances which furthered the standardization of Brazilian Portuguese as sung, included the of two concerts and the recording of a demo CD with the execution of pieces whose pronunciations are based on norms from 1938 and 2007, seeking evidences of diverging elements between them
609

A elevação da vogal média anterior átona em Flores da Cunha (RS)

Guzzo, Natália Brambatti 21 June 2010 (has links)
A elevação variável da vogal média anterior átona /e/, como em cidade::cidadi, segunda::sigunda e me chama::mi chama, foi investigada, na fala de 32 informantes de Flores da Cunha (RS), por meio de análise quantitativa, nos moldes da Teoria da Variação Linguística, de Labov (1994, 2008 [1972]), e por meio de análise qualitativa, nos moldes da Teoria da Variação como Prática Social, de Eckert (2000). Houve aplicação da regra de elevação em 50,7% dos 25708 contextos obtidos. As variáveis controladas – Presença de coda na sílaba, Presença de onset na sílaba, Vogal da Sílaba Seguinte, Posição de /e/ na palavra, Contexto fonológico precedente, Contexto fonológico seguinte, Gênero, Idade e Local de residência – foram consideradas significativas pelo programa GoldvarbX, usado na análise estatística. A elevação é condicionada favoravelmente pelos fatores sílaba sem onset, sílaba com coda, vogal alta na sílaba seguinte, vogal /e/ em clítico, consoante velar ou zero em contexto precedente, vogal ou zero em contexto seguinte, zona urbana e idade entre 18 e 30 anos. Sendo os jovens os introdutores da regra de elevação na comunidade, o fenômeno caracteriza-se como mudança linguística em progresso. Para verificar em que medida as práticas sociais desses jovens estão relacionadas a seus índices de elevação de /e/, foi realizada análise de conteúdo (BARDIN, 2000; FREITAS; JANISSEK, 2000) de entrevistas de oito jovens florenses. Essa análise revelou que os jovens que adotam práticas sociais tradicionais, ligadas à história da imigração italiana, têm frequência de aplicação da regra menor do que aqueles que se engajam em práticas inovadoras. Enquanto que as práticas tradicionais orientam-se para a vida na comunidade, as inovadoras orientam-se para fora da comunidade. Os jovens que desejam permanecer na localidade elevam menos a vogal /e/, ao passo que aqueles que desejam dela sair, a fim de adequar-se ao modo de falar mais corrente em outras regiões brasileiras, passam a aplicar a regra de elevação com mais frequência. / Submitted by Ana Guimarães Pereira (agpereir@ucs.br) on 2015-09-30T14:26:55Z No. of bitstreams: 1 Dissertacao Natalia Brambatti Guzzo.pdf: 6386152 bytes, checksum: 1899166e04dfc0e94a7bdcad4f74c853 (MD5) / Made available in DSpace on 2015-09-30T14:26:55Z (GMT). No. of bitstreams: 1 Dissertacao Natalia Brambatti Guzzo.pdf: 6386152 bytes, checksum: 1899166e04dfc0e94a7bdcad4f74c853 (MD5) / The variable raising of the unstressed mid front vowel /e/, in contexts such as cidade::cidadi (city), segunda::sigunda (second) and me chama::mi chama (call me), was studied in the speech of 32 informants from Flores da Cunha (RS, Brazil). The process was analyzed quantitatively, according to Labov’s (1994, 2008 [1972]) Theory of Language Variation, and qualitatively, according to Eckert’s (2000) Theory of Language Variation as Social Practice. 25708 contexts were obtained, and the variable rule – the raising of /e/ – was applied in 50,7% of them. All of the controlled variables – Syllable with coda, Syllable with onset, Type of vowel of the following syllable, Position of /e/ in the word, Preceding phonological context, Following phonological context, Gender, Age and Place of living – were considered to be significant by the statistic program GoldvarbX. The raising of /e/ is favorably conditioned by the factors syllable without onset, syllable with coda, high vowel in the following syllable, /e/ in clitics, preceding velar consonant or no preceding context, following vowel or no following context, informants who live in the city (not in the rural areas) and age between 18 and 30 years old. Since young people are introducing the raising of /e/ in the community, this phenomenon may be considered change in progress. In order to verify how the social practices of young people are related to the raising, a content analysis was performed (BARDIN, 2000; FREITAS; JANISSEK, 2000), based on the speech of eight people from Flores da Cunha whose ages ranged from 18 to 30 years old. The content analysis revealed that young people who adopt traditional social practices which are linked to the history of Italian immigration apply the variable rule less frequently than those who engage in innovative practices. Traditional practices are oriented to life inside the community, whereas innovative practices are oriented to life outside the community. Young people who wish to remain in the community do not raise /e/ as often as those who wish to leave the place; young people who want to leave the community tend to apply the rule more frequently in order to fit in with the pronunciation that is more usual in other Brazilian regions.
610

Ambiente independente de idioma para suporte a identificação de tuplas duplicadas por meio da similaridade fonética e numérica: otimização de algoritmo baseado em multithreading /

Andrade, Tiago Luís de. January 2011 (has links)
Resumo: Com o objetivo de garantir maior confiabilidade e consistência dos dados armazenados em banco de dados, a etapa de limpeza de dados está situada no início do processo de Descoberta de Conhecimento em Base de Dados (Knowledge Discovery in Database - KDD). Essa etapa tem relevância significativa, pois elimina problemas que refletem fortemente na confiabilidade do conhecimento extraído, como valores ausentes, valores nulos, tuplas duplicadas e valores fora do domínio. Trata-se de uma etapa importante que visa a correção e o ajuste dos dados para as etapas posteriores. Dentro dessa perspectiva, são apresentadas técnicas que buscam solucionar os diversos problemas mencionados. Diante disso, este trabalho tem como metodologia a caracterização da detecção de tuplas duplicadas em banco de dados, apresentação dos principais algoritmos baseados em métricas de distância, algumas ferramentas destinadas para tal atividade e o desenvolvimento de um algoritmo para identificação de registros duplicados baseado em similaridade fonética e numérica independente de idioma, desenvolvido por meio da funcionalidade multithreading para melhorar o desempenho em relação ao tempo de execução do algoritmo. Os testes realizados demonstram que o algoritmo proposto obteve melhores resultados na identificação de registros duplicados em relação aos algoritmos fonéticos existentes, fato este que garante uma melhor limpeza da base de dados / Abstract: In order to ensure greater reliability and consistency of data stored in the database, the data cleaning stage is set early in the process of Knowledge Discovery in Database - KDD. This step has significant importance because it eliminates problems that strongly reflect the reliability of the knowledge extracted as missing values, null values, duplicate tuples and values outside the domain. It is an important step aimed at correction and adjustment for the subsequent stages. Within this perspective, techniques are presented that seek to address the various problems mentioned. Therefore, this work is the characterization method of detecting duplicate tuples in the database, presenting the main algorithms based on distance metrics, some tools designed for such activity and the development of an algorithm to identify duplicate records based on phonetic similarity numeric and language-independent, developed by multithreading functionality to improve performance over the runtime of the algorithm. Tests show that the proposed algorithm achieved better results in identifying duplicate records regarding phonetic algorithms exist, a fact that ensures better cleaning of the database / Orientador: Carlos Roberto Valêncio / Coorientador: Maurizio Babini / Banca: Pedro Luiz Pizzigatti Corrêa / Banca: José Márcio Machado / Mestre

Page generated in 0.0413 seconds