TY  - BOOK
U1  - Book
ED  - Bański, Piotr
ED  - Kupietz, Marc
ED  - Barbaresi, Adrien
ED  - Biber, Hanno
ED  - Breiteneder, Evelyn
ED  - Clematide, Simon
ED  - Witt, Andreas
T1  - Proceedings of the LREC 2018 Workshop “Challenges in the Management of Large Corpora (CMLC-6)” 07 May 2018 – Miyazaki, Japan
N2  - Contents: 1. Christoph Kuras, Thomas Eckart, Uwe Quasthoff and Dirk Goldhahn: Automation, management and improvement of text corpus production, p. 1 2. Thomas Krause, Ulf Leser, Anke Lüdeling and Stephan Druskat: Designing a re-usable and embeddable corpus search library, p. 6 3. Radoslav Rábara, Pavel Rychlý and Ondřej Herman: Distributed corpus search, p. 10 4. Adrien Barbaresi and Antonio Ruiz Tinoco: Using elasticsearch for linguistic analysis of tweets in time and space, p. 14 5. Marc Kupietz, Nils Diewald and Peter Fankhauser: How to Get the Computation Near the Data: Improving data accessibility to, and reusability of analysis functions in corpus query platforms, p. 20 6. Roman Schneider: Example-based querying for specialist corpora, p. 26 7. Paul Rayson: Increasing interoperability for embedding corpus annotation pipelines in Wmatrix and other corpus retrieval tools, p. 33
KW  - Corpus
KW  - Automatic language analysis
KW  - Technology
Y1  - 2018
U6  - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-75227
UN  - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-75227
UR  - http://lrec-conf.org/workshops/lrec2018/W17/pdf/book_of_proceedings.pdf
SN  - 979-10-95546-14-6
SB  - 979-10-95546-14-6
SP  - VI, 36
S1  - VI, 36
PB  - European Language Resources Association (ELRA)
CY  - Paris
ER  -