221 |
The interplay between non-verbal and verbal interaction in synthetic worlds which supports verbal participation and production in a foreign language. / Le rapport entre le verbal et le non verbal dans des mondes synthétiques et son rôle de soutien pour la production et la participation verbales en langue étrangère. Wigham, Ciara 16 November 2012 (has links)
This research focuses on multimodal pedagogical communication in synthetic (virtual) worlds. The study investigates the interplay between verbal and nonverbal interaction which supports verbal participation and production in a foreign language. This is analysed from a socio-semiotic perspective of multimodality within the context of a course held in the synthetic world Second Life, which adopted a Content and Language Integrated Learning (CLIL) approach. The course, entitled 'Building Fragile Spaces', formed part of the European project ARCHI21. It was designed for higher education students of Architecture whose foreign language was either French or English.
The interplay between verbal and nonverbal interaction is examined from three different angles. Firstly, considering the opportunities synthetic worlds offer for the co-creation of the environment through building activities and for collaboration, the role played by the nonverbal mode during a collaborative building activity is investigated. The study asks whether the use of the nonverbal mode has an impact on the participation of students in the verbal mode, and whether any interplay between these two modes influences verbal production. Secondly, the use of the nonverbal mode by students in inworld identity construction is addressed and considered with reference to their verbal interaction and participation in the foreign language. Thirdly, the research concentrates on the interplay between the audio and textchat modalities in the verbal mode. More specifically, the focus is on whether the textchat plays a role during interaction, considering that it is in competition not only with the audio modality but also with several nonverbal modalities, and on whether the textchat modality can serve for feedback provision on language form.
This thesis seeks to contribute to the methodological considerations needed for research to move beyond speculative and anecdotal examples of multimodal pedagogical communication in synthetic worlds. A typology of nonverbal and verbal modalities is proposed and then drawn upon to extend a previously suggested methodology for multimodal transcription to interactions in synthetic worlds. In response to the more general research problem, within the fields of Social Sciences and Language Sciences, of making the research data used for analyses visible and publicly accessible, the study adopts a LEarning and TEaching Corpus (LETEC) methodological approach. Constituting a structured corpus allows for contextualised analyses of the data collected during the 'Building Fragile Spaces' course.
This research offers insights into how verbal participation increases with reference to the proxemic organisation of students, the customization of students' avatar appearance and an increased use of nonverbal acts. Concerning verbal production, the study shows how avatar movement in the nonverbal mode was used as a strategy to overcome verbal miscommunication when expressing direction and orientation. It also shows the benefits of using the textchat modality for feedback on language form in order to support learners' production in the audio modality. In light of these results, the study suggests some considerations concerning the design of pedagogical activities for language learning within synthetic worlds.
|
222 |
Anotação e classificação automática de entidades nomeadas em notícias esportivas em Português Brasileiro / Automatic named entity recognition and classification for Brazilian Portuguese sport news Rodrigo Constantin Ctenas Zaccara 11 July 2012 (has links)
The main goal of this research is to develop a platform for the automatic annotation and classification of named entities in news written in Brazilian Portuguese. To narrow the scope of training and analysis, only sports news about the 2011 São Paulo Championship (Campeonato Paulista) from the UOL (Universo Online) portal was used. The first artefact developed was the WebCorpus tool, which aims to ease the process of adding meta-information to words through a rich web interface designed to make the work quick and simple. Using this tool, the named entities in the news are annotated and classified manually. The database used by the tool was fed by the content acquisition and extraction (crawler) tool, also developed during this research. The second artefact developed was the UOLCP2011 corpus (UOL Campeonato Paulista 2011). This corpus was manually annotated and classified with the WebCorpus tool using seven entity types: person, place, organization, team, championship, stadium and fans. To develop the engine for automatic annotation and classification of named entities, three different techniques were used: maximum entropy, inverted indexes, and methods merging the two. For each technique, three steps were carried out: algorithm development, training using machine learning techniques, and analysis of the best results.
|
223 |
Present but not perfect : A study of problems Brazilian students encounter when learning the English present perfect tense Sklar, Fabiana Andrioli January 2011 (has links)
Most Brazilian students learning English face difficulty when studying the present perfect. It is one of the most challenging aspects of the English language for Portuguese speakers because its form is similar to the Portuguese one while its semantic value diverges. The Brazilian Portuguese present perfect forces iteration, and the student automatically transfers the same meaning when translating an English sentence literally. Brazilian learners get confused about when and in what situations to use the English present perfect and frequently are not able to distinguish it from the simple past use. This study comprises two parts. First, a comparative study was done to investigate which Portuguese tense translators of famous literary books consider to be equivalent to the English present perfect according to the message which is being conveyed. The database used was the bidirectional parallel corpus of English and Portuguese COMPARA. Second, textbooks developed to teach English in Brazil were analyzed in order to verify from what perspective students were being instructed concerning the present perfect and whether the semantic differences between the two languages were pointed out. According to the translation corpus, the English present perfect is mostly equivalent to the Brazilian Portuguese simple past. Adverbs are also often needed to express the English present perfect meaning in Portuguese. The textbooks were found to present poor explanations and seem not to call the learners' attention to the source of the problems. Textbooks do not stress the importance of the semantic value and the context, and do not call attention to the different meanings between the Brazilian and the English present perfect.
|
224 |
Det tredje språket : Tolkspråk och normalisering i teckenspråkstolkning / The third language : interpretese and normalisation in Sign Language Interpreting Hassel Borowski, Frida January 2016 (has links)
This essay is concerned with the subject of interpretese: the idea that interpreted language differs from non-interpreted language. Within translation studies, the corresponding term is translationese, and this essay draws upon much of the research in this field, as it is more developed. One particular area of research into translationese revolves around so-called translation universals, or universal features of translation. They could be described as rules or laws that define translated language. One of those universals is called normalisation. This essay seeks to answer whether normalisation also exists in Sign Language interpreting, starting from the claim that normalisation can be visible in translated text as an overrepresentation of typical target language patterns. Two comparable corpora were used: the Swedish Sign Language Corpus (SSLC), with non-interpreted Sign Language texts, and Korpus för simultantolkade teckenspråkstexter (KST), with interpreted Sign Language texts. The typical target language pattern that was chosen for the investigation is the Swedish sign glossed KOPPLA. Instances of the sign were investigated in both corpora in order to detect any overrepresentation in KST. The results show that KOPPLA is in fact overrepresented in KST, but that one should be careful about generalizing, as several limiting factors were at play.
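The comparison described above, checking whether a sign is overrepresented in one corpus relative to another, is commonly done on frequencies normalised by corpus size. The following is only a minimal sketch under that assumption, not the procedure used in the essay; the function names and all counts are hypothetical and chosen purely for illustration.

```python
def per_thousand(count: int, corpus_tokens: int) -> float:
    """Relative frequency of an item per 1,000 tokens (signs) of a corpus."""
    if corpus_tokens <= 0:
        raise ValueError("corpus_tokens must be positive")
    return 1000 * count / corpus_tokens


def overrepresentation_ratio(count_a: int, tokens_a: int,
                             count_b: int, tokens_b: int) -> float:
    """Ratio of relative frequencies; a value above 1 means the item is
    relatively more frequent in corpus A than in corpus B."""
    freq_b = per_thousand(count_b, tokens_b)
    if freq_b == 0:
        raise ValueError("item is unattested in corpus B; ratio is undefined")
    return per_thousand(count_a, tokens_a) / freq_b


# Invented counts, NOT taken from SSLC or KST.
ratio = overrepresentation_ratio(count_a=30, tokens_a=10_000,
                                 count_b=20, tokens_b=20_000)
print(f"relative-frequency ratio: {ratio:.1f}")
```

A ratio alone says nothing about statistical significance or about the limiting factors the essay mentions, so it would normally be reported alongside the raw counts.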
|
225 |
Aplikace pro řízení paralelního zpracování dat / Application for Parallel Data Processing Control Grepl, Filip January 2021 (has links)
This work deals with the design and implementation of a system for the parallel execution of tasks in the Knowledge Technology Research Group. The goal is to create a web application that allows users to control the processing of these tasks and to monitor their runs, including their use of system resources. The work first analyzes the current method of parallel data processing and the shortcomings of this solution. It then describes the existing tools, including the problems that their test deployment revealed. Based on this knowledge, the requirements for a new application are defined and the design of the entire system is created. Finally, selected parts of the implementation and the testing of the whole system are described, together with a comparison of its efficiency with that of the original system.
|
226 |
Etude de la concession dans une perspective contrastive français - vietnamien à partir de corpus oraux / Research of the concession from a contrastive perspective French - Vietnamese from oral corpora Do, Thi Thu Trang 16 December 2016 (has links)
This thesis studies the expression of concession in spoken language from a contrastive French/Vietnamese perspective. Starting from a corpus of radio programmes in French and following three complementary approaches (linguistic, logical and interactional), the functioning and properties of concessions were analysed in order to classify them into categories and to propose a model of their expression. Concessions in Vietnamese were studied on the basis of comparable data in order to highlight the similarities and the differences.
|
227 |
Caractérisation différentielle de forums de discussion sur le VIH en vietnamien et en français : Éléments pour la fouille comportementale du web social / Differential characterization of discussion forums on HIV in Vietnamese and French : Elements for behaviour mining on the social web Hô Dinh, Océane 22 December 2017 (has links)
The standard discourse produced by official organisations is confronted with the unofficial or informal discourse of the social web. Empowering people to express themselves results in a new balance of authority when it comes to knowledge, and changes the way people learn. Social web discourse is available to everyone and its size is growing fast, which opens up new fields for both the humanities and the social sciences to investigate. These disciplines, however, still lack methodologies for engaging with such complex and little-described data. The aim of this dissertation is to investigate how far social web discourse can help supplement official discourse. In it we set out a method to collect and analyse data that is in line with the characteristics of natively digital discourse, namely data size, anonymity, transience and structure. We focus on forums, where such discourse is built, and test our method on a specific social issue, i.e. the HIV/AIDS epidemic in Vietnam. This field of investigation encompasses several related questions that have to do with health, society, the evolution of morals, and the competition between different kinds of discourse. Our study is complemented by the analysis of a comparable French corpus dealing with the same topic, whose genre and discourse characteristics are equivalent to those of the Vietnamese one: this two-pronged research highlights the specific features of different socio-cultural environments.
|
228 |
Collocazioni avverbo + aggettivo in un corpus orale di discenti: un approccio quantitativo e qualitativo / Adverb + adjective collocations in a spoken learner corpus: A quantitative and qualitative approach POLI, FRANCESCA 21 July 2021 (has links)
In the last 70 years, there has been an increase in English studies and research on collocations (Firth 1957; Hoey, 2005; Moon, 1998; Sinclair 1991; 2004; Stubbs, 1996; 2001), which have documented that phraseology is pervasive in language (Altenberg, 1998; Biber et al., 1999; Cowie, 1991; 1992; Howarth, 1998). This also indicates that a good command of collocations is needed if learners aim to achieve native-like fluency in the L2. Indeed, research on learner production of formulaic language has shown that collocations are essential in second language acquisition (Cowie, 1998; Pawley & Syder, 1983; Peters, 1983) and are a key component for the development of fluency (Ellis, 2002; 2003; Ellis et al., 2015; Howarth, 1998).
Despite the surge in studies on collocations, the majority of scholars have focused on written data and on a restricted set of combinations, such as verb + noun collocations. The poor availability of spoken learner corpora and the greater attention paid to more error-prone formulaic sequences have led researchers to neglect collocations such as adverb + adjective. Intensification is an intricate part of foreign language learning (Lorenz, 1999) and deserves further attention, especially as regards spoken data, which is a better reflection of learner language (Myles, 2015).
The present work investigates adverb + adjective collocations in a newly compiled spoken learner corpus of advanced Italian learners of English L2. The thesis adopts a Contrastive Interlanguage Analysis (Granger, 1998) approach to verify whether: a) there are any differences between the collocation production of Italian learners of English and that of native-speaker peers; b) there are any differences between the Italian learners' collocations and the native speakers' in terms of syntactic patterns and lexical meaning; c) L1 congruency has a transfer effect on the learner production of infrequent and/or unattested collocations.
In order to address the three overarching research questions, quantitative and qualitative analyses were carried out on the Italian Spoken Learner Corpus (ISLC) and the sister corpus of LINDSEI, LOCNEC. LOCNEC was used as the native-speaker reference corpus for its high level of comparability to ISLC. For the quantitative analyses, Durrant and Schmitt's (2009) approach was followed for the calculation of the collocations' association measure scores (t-score and MI) based on the large reference corpus BNC. The collocations were then divided into three categories based on their scores: collocations (t-score and MI equal to or greater than 2 and 3 respectively), infrequent/unattested collocations (t-score and MI scores unavailable due to infrequency), and grey area collocations (t-score and MI lower than 2 and 3 respectively). T-tests and Wilcoxon rank sum tests were computed on the collocations extracted from ISLC and LOCNEC and effect sizes were calculated. In addition, the tests were employed to assess the average individual t-score and MI values of learners and native speakers.
As regards the qualitative analyses, a three-fold scheme was employed to analyse two sets of collocations: the first set comprises 11 collocations with t-score and MI equal to or greater than 2 and 3 respectively and a frequency equal to or greater than 5 in the ISLC; the second set includes 9 infrequent/unattested collocations with a frequency equal to or greater than 2 in ISLC. Following the scheme, the two sets of collocations extracted from both ISLC and LOCNEC were analysed by taking into account their collocational background (etymology, CEFR level, L1 congruence), the learner variables (gender, stay-abroad experience, university course, other languages), and the text variables (attributive vs predicative function of the adjective, pronouns vs it-sentences, tense, affirmative vs negative, positive vs negative connotation).
The results of the statistical tests were all significant with medium to large effect sizes and, together with the qualitative analyses, indicated that: Italian learners of English produce fewer collocations and a higher number of non-collocations; their combinations are less collocational than native speakers' (i.e., their association measure scores are on average lower than the natives'); there are no marked differences in terms of lexico-grammatical patterns between the learners' collocations and the native speakers', but the learners tend to assign more pragmatically creative functions to their collocations; no evidence of L1 (negative) transfer was found in relation to the learners' production of infrequent/unattested collocations, thus further supporting the previous finding.
The findings further corroborate the literature on learners' collocations and add another piece to the puzzle of spoken language: collocational lag, that is, the slower development of collocation performance, can also be found in spoken data, and learners also seem to produce fewer collocations identified by t-score. This has two important, though simple, implications: that learners should probably be exposed to more spoken language input, and that second language acquisition theories might usefully review the phraseological acquisition processes of EFL learners. Another finding is that the lexico-grammatical patterns of learners' collocations were not markedly different from native speakers', but they were less varied and displayed pragmatic creativity. This could inform scholars about potential fossilisation processes (Selinker, 1972) in phraseology and/or simplification or avoidance strategies (Farghal & Obiedat, 1995). Lastly, although mainstream studies have found that L1 congruency plays a key role in the production of collocations (cf. Bahns, 1993; Granger, 1998b; Nesselhauf, 2005; Wang, 2016), no evidence of L1 congruency was found as regards spoken data, which is an interesting counter-finding. Overall, this thesis has underlined that collocation production, both quantitatively and pragmatically, lags behind collocation competence and, although this gap may be very thin and not significant in written texts, it widens in spoken language.
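The three-way categorisation by association score described in the abstract can be illustrated with a short sketch. It assumes the textbook definitions of the two measures, with expected frequency E = f(adverb) × f(adjective) / N, t-score = (O − E) / √O and MI = log2(O / E); the thesis may compute them differently (for example with a window-size correction). The function names and the example frequencies are hypothetical, and treating every attested pair that misses either threshold as "grey area" is a simplifying assumption.

```python
import math
from typing import Optional, Tuple


def association_scores(pair_freq: int, adv_freq: int, adj_freq: int,
                       corpus_size: int) -> Optional[Tuple[float, float]]:
    """Return (t_score, mi) for an adverb + adjective pair, or None when the
    pair is unattested in the reference corpus and no score can be computed."""
    if pair_freq == 0:
        return None
    expected = adv_freq * adj_freq / corpus_size  # frequency expected by chance
    t_score = (pair_freq - expected) / math.sqrt(pair_freq)
    mi = math.log2(pair_freq / expected)
    return t_score, mi


def categorise(scores: Optional[Tuple[float, float]],
               t_min: float = 2.0, mi_min: float = 3.0) -> str:
    """Apply the thresholds quoted in the abstract (t-score >= 2, MI >= 3)."""
    if scores is None:
        return "infrequent/unattested"
    t_score, mi = scores
    if t_score >= t_min and mi >= mi_min:
        return "collocation"
    # Assumption: any attested pair below either threshold is "grey area".
    return "grey area"


# Invented reference-corpus frequencies, not taken from the BNC.
print(categorise(association_scores(100, 1_000, 2_000, 1_000_000)))
print(categorise(association_scores(4, 1_000, 2_000, 1_000_000)))
print(categorise(association_scores(0, 1_000, 2_000, 1_000_000)))
```

Requiring both thresholds reflects the usual complementarity of the two measures: the t-score tends to favour frequent pairs, while MI tends to favour rare but exclusive ones.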
|
229 |
Gestikulace a eventuálnost: mezijazyková studie / Gesture and eventuality: a cross-linguistic study Jehlička, Jakub January 2021 (has links)
Speakers of typologically different languages choose different strategies when describing the same event, depending on the language-specific grammatical means available to them. These strategies manifest themselves, for example, in different ways of conceptualizing event frames during linguistic expression, but also in non-linguistic cognition. One of the phenomena noted in this connection is language-specific ways of gesturing that accompany spoken descriptions of events and reflect (or manifest) the embodied conceptual schemas on which our perception of events rests. The topic of this thesis is the multimodal construal of events in Czech and English. Specifically, the thesis focuses on the connections between the formal features of gestures (the manner of movement and its ending) and the semantic features that constitute the so-called aspectual contours of events (the construal of the temporal and qualitative course of an event). The first part of the research presented consists of an analysis of material from a Czech and an English multimodal corpus. Both corpora contain recordings of spontaneous speech in interactions captured during work meetings in an academic setting. Quantitative analyses (using so-called classification trees and random forests) showed that a) in English, the aktionsart category achievement is a significant predictor of the occurrence of gestures with the feature of boundedness...
|
230 |
The Effectiveness of Corpus-Aided Instruction Using Lexical Bundles to Improve Academic Writing in Instructed Second Language Acquisition: A Multimethod Research Design Park, Eun Jeong 27 September 2019 (has links)
No description available.
|