Refine
Document Type
- Article (1)
- Part of a Book (1)
Has Fulltext
- yes (2)
Is part of the Bibliography
- no (2)
Keywords
- XSLT (2) (remove)
Publicationstate
Publisher
To build a comparable Wikipedia corpus of German, French, Italian, Norwegian, Polish and Hungarian for contrastive grammar research, we used a set of XSLT stylesheets to transform the mediawiki anntations to XML. Furthermore, the data has been amnntated with word class information using different taggers. The outcome is a corpus with rich meta data and linguistic annotation that can be used for multilingual research in various linguistic topics.