Global ETD Search

Return to search

Measuring coselectional constraint in learner corpora: A graph-based approach

Die korpuslinguistische Arbeit untersucht den Erwerb von Koselektionsbeschränkungen bei Lerner*innen des Deutschen als Fremdsprache in einem quasi-longitudinalen Forschungsdesign anhand des Kobalt-Korpus. Neben einigen statistischen Analysen wird vordergründig eine graphbasierte Analyse entwickelt, die auf der Graphmetrik Louvain-Modularität aufbaut. Diese wird für diverse Subkorpora nach verschiedenen Kriterien berechnet und mit Hilfe verschiedener Samplingtechniken umfassend intern validiert. Im Ergebnis zeigen sich eine Abhängigkeit der gemessenen Modularitätswerte vom Sprachstand der Teilnehmer*innen, eine höhere Modularität bei Muttersprachler*innen, niedrigere Modularitätswerte bei weißrussischen vs. chinesischen Lerner*innen sowie ein U-Kurven-förmiger Erwerbsverlauf bei weißrussischen, nicht aber chinesischen Lerner*innen. Unterschiede zwischen den Gruppen werden aus typologischer, kognitiver, diskursiv-kultureller und Registerperspektive diskutiert. Abschließend werden Vorschläge für den Einsatz von graphbasierten Modellierungen in kernlinguistischen Fragestellungen entwickelt. Zusätzlich werden theoretische Lücken in der gebrauchsbasierten Beschreibung von Koselektionsphänomenen (Phraseologie, Idiomatizität, Kollokation) aufgezeigt und ein multidimensionales funktionales Modell als Alternative vorgeschlagen. / The thesis located in corpus linguistics analyzes the acquisition of coselectional constraint in learners of German as a second language in a quasi-longitudinal design based on the Kobalt corpus. Supplemented by a number of statistical analyses, the thesis primarily develops a graph-based analysis making use of Louvain modularity. The graph metric is computed for a range of subcorpora chosen by various criteria. Extensive internal validation is performed through a number of sampling techniques. Results robustly indicate a dependency of modularity on language acquisition progress, higher modularity in L1 vs. L2, lower modularity in Belarusian vs. Chinese learners, and a u-shaped learning development in Belarusian, but not in Chinese learners. Group differences are discussed from a typological, cognitive, cultural and cultural discourse, and register perspective. Finally, future applications of graph-based modeling in core-linguistic research are outlined. In addition, some gaps in the theoretical discussion of coselection phenomena (phraseology, idiomaticity, collocation) in usage-based linguistics are discussed and a multidimensional and functional model is proposed as an alternative.

gebrauchsbasierte Linguistik

Koselektion

idiomatisches Prinzip

Fremdsprachenerwerb

quantitative Linguistik

Korpuslinguistik

Kollokation

usage-based linguistics

coselection

second language acquisition

idiom principle

quantitative linguistics

corpus linguistics

collocation

410 Linguistik

430 Deutsch und verwandte Sprachen

006 Spezielle Computerverfahren

Identifer	oai:union.ndltd.org:HUMBOLT/oai:edoc.hu-berlin.de:18452/22356
Date	24 July 2020
Creators	Shadrova, Anna Valer'evna
Contributors	Lüdeling, Anke, Zeldes, Amir
Publisher	Humboldt-Universität zu Berlin
Source Sets	Humboldt University of Berlin
Language	English
Detected Language	German
Type	doctoralThesis, doc-type:doctoralThesis
Format	application/pdf
Rights	(CC BY-NC 4.0) Attribution-NonCommercial 4.0 International, https://creativecommons.org/licenses/by-nc/4.0/
Relation	https://doi.org/10.5281/zenodo.3584091

Page generated in 0.0022 seconds

Measuring coselectional constraint in learner corpora: A graph-based approach

Description

Links & Downloads

Tags

Additional Fields