The New IDS Corpus Analysis Platform: Challenges and Prospects

  • The present article describes the first stage of the KorAP project, launched recently at the Institut für Deutsche Sprache (IDS) in Mannheim, Germany. The aim of this project is to develop an innovative corpus analysis platform to tackle the increasing demands of modern linguistic research. The platform will facilitate new linguistic findings by making it possible to manage and analyse primary data and annotations in the petabyte range, while at the same time allowing an undistorted view of the primary linguistic data, and thus fully satisfying the demands of a scientific tool. An additional important aim of the project is to make corpus data as openly accessible as possible in light of unavoidable legal restrictions, for instance through support for distributed virtual corpora, user-defined annotations and adaptable user interfaces, as well as interfaces and sandboxes for user-supplied analysis applications. We discuss our motivation for undertaking this endeavour and the challenges that face it. Next, we outline our software implementation plan and describe development to date.

Metadata
Author:Piotr Bański, Peter M. Fischer, Elena Frick, Erik Ketzan, Marc Kupietz, Carsten Schnober, Oliver Schonefeld, Andreas Witt
URN:urn:nbn:de:bsz:mh39-44974
URL:http://www.lrec-conf.org/proceedings/lrec2012/index.html
ISBN:978-2-9517408-7-7
Parent Title (English):Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC’12). Istanbul, Turkey, May 2012
Publisher:European Language Resources Association (ELRA)
Place of publication:Paris
Editor:Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Document Type:Conference Proceeding
Language:English
Year of first Publication:2012
Date of Publication (online):2015/12/16
Publication state:Published version
Review state:(Publisher's) editorial review
Tag:Institut für Deutsche Sprache <Mannheim>; Corpus analysis platform (KorAP); Text linguistics
GND Keyword:Corpus <Linguistics>
First Page:2905
Last Page:2911
Dewey Decimal Classification:400 Language / 410 Linguistics
Leibniz-Classification:Language, Linguistics
Linguistics-Classification:Computational linguistics
Open Access?:Yes
Licence (German):German copyright law (UrhG) applies