Global ETD Search

1	[pt] CLASSES DE PALAVRAS - DA GRÉCIA ANTIGA AO GOOGLE: UM ESTUDO MOTIVADO PELA CONVERSÃO DE TAGSETS / [en] PART OF SPEECH - FROM ANCIENT GREECE TO GOOGLE: A STUDY MOTIVATED BY TAGSET CONVERSION LUIZA FRIZZO TRUGO 10 November 2016 (has links) [pt] A dissertação Classes de palavras — da Grécia Antiga ao Google: um estudo motivado pela conversão de tagsets consiste em um estudo linguístico sobre classes gramaticais. A pesquisa tem como motivação uma tarefa específica da Linguística Computacional: a anotação de classes gramaticais (POS, do inglês part of speech ). Especificamente, a dissertação relata desafios e opções linguísticas decorrentes da tarefa de alinhamento entre dois tagsets: o tagset utilizado na anotação do corpus Mac-Morpho, um corpus brasileiro de 1.1 milhão de palavras, e o tagset proposto por uma equipe dos laboratórios Google e que vem sendo utilizado no âmbito do projeto Universal Dependencies (UD). A dissertação tem como metodologia a investigação por meio da anotação de grandes corpora e tematiza sobretudo o alinhamento entre as formas participiais. Como resultado, além do estudo e da documentação das opções linguísticas, a presente pesquisa também propiciou um cenário que viabiliza o estudo do impacto de diferentes tagsets em sistemas de Processamento de Linguagem Natural (PLN) e possibilitou a criação e a disponibilização de mais um recurso para a área de processamento de linguagem natural do português: o corpus Mac-Morpho anotado com o tagset e a filosofia de anotação do projeto UD, viabilizando assim estudos futuros sobre o impacto de diferentes tagsets no processamento automático de uma língua. / [en] The present dissertation, Part of speech — from Ancient Greece to Google: a study motivated by tagset conversion, is a linguistic study regarding gramatical word classes. This research is motivated by a specific task from Computational Linguistics: the annotation of part of speech (POS). Specifically, this dissertation reports the challenges and linguistic options arising from the task of aligning two tagsets: the first used in the annotation of the Mac-Morpho corpus — a Brazilian corpus with 1.1 million words — and the second proposed by Google research lab, which has been used in the context of the Universal Dependencies (UD) project. The present work adopts the annotation of large corpora as methodology and focuses mainly on the alignment of the past participle forms. As a result, in addition to the study and the documentation of the linguistic choices, this research provides a scenario which enables the study of the impact different tagsets have on Natural Language Processing (NLP) systems and presents another Portuguese NLP resource: the Mac-Morpho corpus annotated with project UD s tagset and consistent with its annotation philosophy, thus enabling future studies regarding the impact of different tagsets in the automatic processing of a language. [pt] LINGUISTICA COMPUTACIONAL [pt] PARTICIPIO [pt] ANOTACAO [pt] CORPUS [pt] CLASSE DE PALAVRAS [en] COMPUTATIONAL LINGUISTICS [en] PARTICIPLE [en] ANNOTATION [en] CORPORA [en] PART OF SPEECH
2	[en] SUPPORT NOUNS: OPERATIONAL CRITERIA FOR CHARACTERIZATION / [pt] O SUBSTANTIVO-SUPORTE: CRITÉRIOS OPERACIONAIS DE CARACTERIZAÇÃO CLAUDIA MARIA GARCIA MEDEIROS DE OLIVEIRA 06 March 2007 (has links) [pt] Este trabalho tem por objetivo prover um critério operacional para caracterizar substantivos em combinações de substantivo seguido de adjetivo, em que o substantivo apresenta situação análoga à dos chamados verbos leves ou verbos-suporte, largamente estudados em Lingüística e Processamento de Linguagem Natural nos últimos anos. O trabalho se situa na confluência entre estudos lingüísticos, lexicográficos e computacionais e pretende explorar a potencialidade da análise automática de corpora e instrumentos quantitativos em busca de uma maior objetividade na fundamentação de conceitos que norteiam a atividade de análise lingüística. O desenvolvimento da pesquisa alia a pesquisa em corpus ao dicionário tradicional para realizar o levantamento das principais propriedades das combinações S - Adj, particularizado para o caso de ocorrência de adjetivos denominais. A partir das informações lexicográficas e contextuais demonstra-se a existência de um conjunto de substantivos que participam das construções estudadas de maneira semelhante aos verbos- suporte em combinações V - SN. Um método automático de reconhecimento dos substantivos-suporte em textos é elaborado, com o objetivo de fornecer aos estudiosos um instrumento capaz de produzir evidências convincentes, dada a insuficiência de julgamentos intuitivos para justificar a delimitação de expressões de aparente irregularidade. / [en] The main goal of this work is to provide operational criteria for characterizing nouns in Noun - Adjective combinations, in which the noun occurs in an analogous way to so called light verbs or support verbs, widely studied in recent years in both Linguistics and Natural Language Processing. In the work, linguistic, lexicographic and computational studies converge in order to explore the potential for automatic analysis of corpora, whose aim is to provide quantitative tools and methods which would lead to a more objective way of establishing concepts which underlie linguistic analysis. The work unites corpus-based research with traditional lexicography in order to elicit the main properties of the N-Adj combinations occurring with denominal adjectives. The lexicographic and contextual data reveal the existence of a set of nouns that occur in the studied constructions in a way similar to light verbs in V-Noun phrasal combinations. An automatic method for recognizing support nouns in texts is developed, which will provide language specialists with an instrument capable of bringing solid evidence to add to intuitive judgments in the task of justifying the delimitation of expressions that are apparently irregular [pt] LINGUISTICA [en] LINGUISTICS [pt] LEXICOGRAFIA DE CORPUS [en] CORPUS LEXICOGRAPHY [pt] SUBSTANTIVO-SUPORTE [en] SUPPORT NOUN [pt] ADJETIVO DENOMINAL [en] DENOMINAL ADJECTIVE [pt] CLASSE DE PALAVRAS [en] PART OF SPEECH
3	[en] SEMANTIC TYPOLOGIES OF ADVERBS: A COMPARATIVE STUDY / [pt] TIPOLOGIAS SEMÂNTICAS DE ADVÉRBIOS: UM ESTUDO COMPARATIVO ZENAIDE DIAS TEIXEIRA 02 June 2008 (has links) [pt] Este trabalho teve por objetivo descrever, analisar e discutir comparativamente tipologias semânticas de advérbios propostas em duas vertentes dos estudos da linguagem: a Gramática Tradicional, de um lado, e a Lingüística de orientação funcionalista, de outro. Para tal, mapeamos tipologias encontradas em um conjunto representativo de gramáticas tradicionais do português e em uma amostra não menos representativa de trabalhos de lingüistas brasileiros que se debruçaram sobre o tema adotando uma abordagem funcionalista. Propusemos dois quadros tipológicos resumitivos das duas vertentes de classificação, nos quais buscamos identificar as principais classes semânticas estabelecidas em cada uma das duas vertentes. Aplicamos, então esses dois instrumentos de classificação a um mesmo corpus de frases autênticas do português (extraído do centro de recursos distribuídos Linguateca/Frases PB), e analisamos os resultados comparando as duas classificações quanto aos seguintes critérios: (a) abrangência; (b) explicitude; e (c) adequação aos propósitos norteadores (normativo-didáticos ou teórico-descritivos). Tendo em vista tais critérios, apontamos vantagens e desvantagens relativas dos dois tipos de classificação e destacamos alguns problemas enfrentados igualmente na tradição gramatical e na lingüística funcionalista no que tange a caracterização do comportamento semântico dos advérbios. / [en] This work aims to describe, analyze and discuss comparatively semantic typologies of adverbs proposed in two srands of linguistic studies: Traditional Grammar on the one hand and Functional Linguistics on the other. For that, we analyzed and mapped typologies found in a representative set of Portuguese traditional grammars as well as in an equally significant sample of funtictionally-oriented Brazilian linguistic studies. Two typological schemes were proposed to represent the two approaches, identifying the main semantic categories established in each one of them. These two classification instruments were then applied to the analysis of a corpus of Portuguese authentic sentences (an excerpt from the corpus made availble by Linguateca/Frases PB). The classifications were analyzed and compared according to the following criteria: a) range; b) explicitude; c) suitability to purposes (didactic-normative and/or descriptive-theoretical). Using these criteria, it was possible to point out relative advantages and disadvantages in both types of approach and to expose some problems faced equally by traditional grammar and functional linguistics concerning the semantic behaviour of adverbs. [pt] FUNCIONALISMO [en] FUNCTIONALISM [pt] CLASSE DE PALAVRAS [en] PART OF SPEECH [pt] TIPOLOGIAS SEMANTICAS [en] SEMANTICS TYPES [pt] ADVERBIOS [en] ADVERBS [pt] GRAMATICA TRADICIONAL [en] TRADITIONAL GRAMMAR

1

Page generated in 0.1756 seconds