Volltext-Downloads (blau) und Frontdoor-Views (grau)

Proceedings of the LREC 2018 Workshop “Challenges in the Management of Large Corpora (CMLC-6)” 07 May 2018 – Miyazaki, Japan

  • Contents: 1. Christoph Kuras, Thomas Eckart, Uwe Quasthoff and Dirk Goldhahn: Automation, management and improvement of text corpus production, S. 1 2. Thomas Krause, Ulf Leser, Anke Lüdeling and Stephan Druskat: Designing a re-usable and embeddable corpus search library, S. 6 3. Radoslav Rábara, Pavel Rychlý and Ondřej Herman: Distributed corpus search, S. 10 4. Adrien Barbaresi and Antonio Ruiz Tinoco: Using elasticsearch for linguistic analysis of tweets in time and space, S. 14 5. Marc Kupietz, Nils Diewald and Peter Fankhauser: How to Get the Computation Near the Data: Improving data accessibility to, and reusability of analysis functions in corpus query platforms, S. 20 6. Roman Schneider: Example-based querying for specialist corpora, S. 26 7. Paul Rayson: Increasing interoperability for embedding corpus annotation pipelines in Wmatrix and other corpus retrieval tools, S. 33

Export metadata

Additional Services

Search Google Scholar


Publisher:European language resources association (ELRA)
Place of publication:Paris
Editor:Piotr Bański, Marc Kupietz, Adrien Barbaresi, Hanno Biber, Evelyn Breiteneder, Simon Clematide, Andreas Witt
Document Type:Book
Year of first Publication:2018
Date of Publication (online):2018/06/11
GND Keyword:Automatische Sprachanalyse; Korpus <Linguistik>; Technologie
Page Number:VI, 36
DDC classes:400 Sprache / 400 Sprache, Linguistik
Open Access?:ja
Leibniz-Classification:Sprache, Linguistik
Program areas:Digitale Sprachwissenschaft
Licence (English):License LogoCreative Commons - Attribution-NonCommercial 4.0 International