When size matters. Legal perspective(s) on N-grams
- N-grams are of utmost importance for modern linguistics and language technology. The legal status of n-grams, however, raises many practical questions. Traditionally, text snippets are considered copyrightable if they meet the originality criterion, but no clear indicators as to the minimum length of original snippets exist; moreover, the solutions adopted in some EU Member States (the paper cites German and French law as examples) are considerably different. Furthermore, recent developments in EU law (the CJEU's Pelham decision and the new right of press publishers) also provide interesting arguments in this debate. The paper presents the existing approaches to the legal protection of n-grams and tries to formulate some clear guidelines as to the length of n-grams that can be freely used and shared.
Author: | Paweł KamockiGND |
---|---|
URN: | urn:nbn:de:bsz:mh39-105191 |
DOI: | https://doi.org/10.3384/ecp18014 |
ISBN: | 978-91-7929-609-4 |
ISSN: | 1650-3740 |
Parent Title (English): | Selected Papers from the CLARIN Annual Conference 2020. Virtual Event, 2020, 5-7 October |
Series (Serial Number): | Linköping Electronic Conference Proceedings (180) |
Publisher: | Linköping University Electronic Press |
Place of publication: | Linköping |
Editor: | Costanza Navarretta, Maria Eskevich |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2021 |
Date of Publication (online): | 2021/07/23 |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | Peer-Review |
Tag: | copyright; legal issues; n-grams |
GND Keyword: | Computerlinguistik; Korpus <Linguistik>; Rechtsfrage; Rechtsschutz; Rechtsstellung; Urheberrecht |
First Page: | 122 |
Last Page: | 128 |
Note: | A previous version of this article was published in: "Proceedings of CLARIN Annual Conference 2020. 05 – 07 October 2020, Online Edition", see http://nbn-resolving.de/urn:nbn:de:bsz:mh39-100805. |
DDC classes: | 300 Sozialwissenschaften / 340 Recht |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
Leibniz-Classification: | Sprache, Linguistik |
Linguistics-Classification: | Computerlinguistik |
Linguistics-Classification: | Korpuslinguistik |
Program areas: | S2: Forschungskoordination und –infrastrukturen |
Licence (English): | ![]() |