Do Free Word Order Languages Need More Treebank Data? Investigating Dative Alternation in German, English, and Russian

  • We investigate whether non-configurational languages, which display more word order variation than configurational ones, require more training data for a phenomenon to be parsed successfully. We perform a tightly controlled study comparing the dative alternation for English (a configurational language), German, and Russian (both non-configurational). More specifically, we compare the performance of a dependency parser when only canonical word order is present with its performance on data sets when all word orders are present. Our results show that for all languages, canonical data not only is easier to parse, but there exists no direct correspondence between the size of training sets containing free(er) word order variation and performance.

Author:Daniel Dakota, Timur Gilmanov, Wen Li, Christopher Kuzma, Evgeny Kim, Noor Abo Mokh, Sandra Kübler
Parent Title (English):Proceedings of the 6th Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2015). July 23rd in Bilbao, Basque Country, Spain
Editor:Marie Candito, Jinho Choi, Yannick Versley
Document Type:Conference Proceeding
Year of first Publication:2015
Date of Publication (online):2017/05/23
GND Keyword:Dative; German; English; Russian; Syntactic analysis; Word order
First Page:14
Last Page:20
Dewey Decimal Classification:400 Language / 400 Language, Linguistics
Open Access?:Yes
Licence (German):Es gilt das UrhG