Proceedings of the LREC 2018 Workshop “Challenges in the Management of Large Corpora (CMLC-6)” 07 May 2018 – Miyazaki, Japan
- Contents: 1. Christoph Kuras, Thomas Eckart, Uwe Quasthoff and Dirk Goldhahn: Automation, management and improvement of text corpus production, S. 1 2. Thomas Krause, Ulf Leser, Anke Lüdeling and Stephan Druskat: Designing a re-usable and embeddable corpus search library, S. 6 3. Radoslav Rábara, Pavel Rychlý and Ondřej Herman: Distributed corpus search, S. 10 4. Adrien Barbaresi and Antonio Ruiz Tinoco: Using elasticsearch for linguistic analysis of tweets in time and space, S. 14 5. Marc Kupietz, Nils Diewald and Peter Fankhauser: How to Get the Computation Near the Data: Improving data accessibility to, and reusability of analysis functions in corpus query platforms, S. 20 6. Roman Schneider: Example-based querying for specialist corpora, S. 26 7. Paul Rayson: Increasing interoperability for embedding corpus annotation pipelines in Wmatrix and other corpus retrieval tools, S. 33
URN: | urn:nbn:de:bsz:mh39-75227 |
---|---|
URL: | http://lrec-conf.org/workshops/lrec2018/W17/pdf/book_of_proceedings.pdf |
ISBN: | 979-10-95546-14-6 |
Publisher: | European language resources association (ELRA) |
Place of publication: | Paris |
Editor: | Piotr Bański, Marc Kupietz, Adrien Barbaresi, Hanno Biber, Evelyn Breiteneder, Simon Clematide, Andreas Witt |
Document Type: | Book |
Language: | English |
Year of first Publication: | 2018 |
Date of Publication (online): | 2018/06/11 |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | Peer-Review |
GND Keyword: | Automatische Sprachanalyse; Korpus <Linguistik>; Technologie |
Page Number: | VI, 36 |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
Leibniz-Classification: | Sprache, Linguistik |
Linguistics-Classification: | Korpuslinguistik |
Program areas: | Digitale Sprachwissenschaft |
Licence (English): | ![]() |