Proceedings of the LREC 2018 Workshop “Challenges in the Management of Large Corpora (CMLC-6)” 07 May 2018 – Miyazaki, Japan

Contents:
1. Christoph Kuras, Thomas Eckart, Uwe Quasthoff and Dirk Goldhahn: Automation, management and improvement of text corpus production, p. 1
2. Thomas Krause, Ulf Leser, Anke Lüdeling and Stephan Druskat: Designing a re-usable and embeddable corpus search library, p. 6
3. Radoslav Rábara, Pavel Rychlý and Ondřej Herman: Distributed corpus search, p. 10
4. Adrien Barbaresi and Antonio Ruiz Tinoco: Using Elasticsearch for linguistic analysis of tweets in time and space, p. 14
5. Marc Kupietz, Nils Diewald and Peter Fankhauser: How to Get the Computation Near the Data: Improving data accessibility to, and reusability of analysis functions in corpus query platforms, p. 20
6. Roman Schneider: Example-based querying for specialist corpora, p. 26
7. Paul Rayson: Increasing interoperability for embedding corpus annotation pipelines in Wmatrix and other corpus retrieval tools, p. 33

Metadata
URN:	urn:nbn:de:bsz:mh39-75227
URL:	http://lrec-conf.org/workshops/lrec2018/W17/pdf/book_of_proceedings.pdf
ISBN:	979-10-95546-14-6
Publisher:	European Language Resources Association (ELRA)
Place of publication:	Paris
Editor:	Piotr Bański, Marc Kupietz, Adrien Barbaresi, Hanno Biber, Evelyn Breiteneder, Simon Clematide, Andreas Witt
Document Type:	Book
Language:	English
Year of first Publication:	2018
Date of Publication (online):	2018/06/11
Publication state:	Published version
Review state:	Peer review
GND Keyword:	Automatic language analysis; Corpus <Linguistics>; Technology
Page Number:	VI, 36
DDC classes:	400 Language / 400 Language, Linguistics
Open Access?:	yes
Leibniz-Classification:	Language, Linguistics
Linguistics-Classification:	Corpus linguistics
Program areas:	Digital Linguistics
Licence (English):	Creative Commons - Attribution-NonCommercial 4.0 International
