Proceedings of the Workshop on Challenges in the Management of Large Corpora (CMLC-9) 2021. Limerick, 12 July 2021 (Online-Event)
- Contents: 1. Julien Abadji, Pedro Javier Ortiz Suárez, Laurent Romary and Benoît Sagot: "Ungoliant: An Optimized Pipeline for the Generation of a Very Large-Scale Multilingual Web Corpus", S.1-9. 2. Markus Gärtner, Felicitas Kleinkopf, Melanie Andresen and Sibylle Hermann: "Corpus Reusability and Copyright - Challenges and Opportunities", S.10-19. 3. Nils Diewald, Eliza Margaretha and Marc Kupietz: "Lessons learned in Quality Management for Online Research Software Tools in Linguistics", S.20-26.
| URN: | urn:nbn:de:bsz:mh39-104676 |
|---|---|
| DOI: | https://doi.org/10.14618/ids-pub-10467 |
| Publisher: | Leibniz-Institut für Deutsche Sprache |
| Place of publication: | Mannheim |
| Editor: | Harald LüngenGND, Marc KupietzORCiDGND, Piotr BańskiORCiDGND, Adrien BarbaresiORCiDGND, Simon ClematideORCiDGND, Ines Pisetta |
| Document Type: | Book |
| Language: | English |
| Year of first Publication: | 2021 |
| Date of Publication (online): | 2021/06/23 |
| Publicationstate: | Veröffentlichungsversion |
| Reviewstate: | Peer-Review |
| Tag: | corpus linguistics; corpus management systems; corpus reusability; large corpora; linguistic research software; software quality management |
| GND Keyword: | Computerlinguistik; Datenmanagement; Forschungsdaten; Korpus <Linguistik>; Urheberrecht |
| Page Number: | 26 |
| DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
| Open Access?: | ja |
| Leibniz-Classification: | Sprache, Linguistik |
| Linguistics-Classification: | Korpuslinguistik |
| Program areas: | G2: Sprachinformationssysteme |
| Program areas: | S1: Korpuslinguistik |
| Conferences, Workshops: | Proceedings of the Workshop on Challenges in the Management of Large Corpora (CMLC-9) 2021. Limerick, 12 July 2021 (Online-Event) |
| Licence (German): | Creative Commons - CC BY - Namensnennung 4.0 International |


