CMLC-5 + BigNLP / 5th Workshop on Challenges in the Management of Large Corpora and Big Data and Natural Language Processing
Refine
Year of publication
- 2017 (12)
Document Type
- Conference Proceeding (9)
- Part of a Book (2)
- Book (1)
Language
- English (12)
Has Fulltext
- yes (12)
Keywords
- Korpus <Linguistik> (12)
- Corpus linguistics (11)
- Corpus technology (6)
- Texttechnologie (5)
- Datenmanagement (4)
- Internet (4)
- Web corpora (3)
- Corpus management (2)
- Englisch (2)
- Kontrastive Linguistik (2)
Publicationstate
Reviewstate
- Peer-Review (12)
Publisher
This paper outlines the broad research context and rationale for a new international comparable corpus (ICC). The ICC is to be largely modelled on the text categories and their quantities the International Corpus of English with only a few changes. The corpus will initially begin with nine European languages but others may join in due course. The paper reports on those and other agreements made at the inaugural planning meeting in Prague on 22-23 June 2017. It also sets out the project’s goals for its first two years.
Many (modernist) works of literature can be understood by their associativeness, be it constructed or “free”. This network-like character of (modernist) literature has often been addressed by terms like “free association”, connotation”, “context” or “intertext”. This paper proposes an experimental and exemplary approach to intraconnect a literary corpus of the Austrian writer Ilse Aichinger with semantic web-technologies to enable interactive explorations of word-associations.