Recent Developments in the Czech National Corpus

  • The Czech National Corpus (CNC) is a long-term project striving for extensive and continuous mapping of the Czech language. This effort consists mainly of compiling, maintaining, and providing free public access to a range of corpora, with the aim of offering diverse, representative, and high-quality data for empirical research, mainly in linguistics. Since 2012, the CNC has been officially recognized as a research infrastructure funded by the Czech Ministry of Education, Youth and Sports, which has caused a recent shift towards user-service-oriented operation of the project. All project-related resources are now integrated into the CNC research portal at http://www.korpus.cz/. Currently, the CNC has an established and growing user community of more than 4,500 active users in the Czech Republic and abroad, who submit almost 1,900 queries per day using one of the user interfaces. The paper discusses the main CNC objectives for each particular domain, aiming at an overview of the current situation supplemented by an outline of future plans.

Author:Michal Křen
Parent Title (English):Proceedings of the 3rd Workshop on Challenges in the Management of Large Corpora (CMLC-3), Lancaster, 20 July 2015.
Publisher:Institut für Deutsche Sprache
Place of publication:Mannheim
Editor:Piotr Bański, Hanno Biber, Evelyn Breiteneder, Marc Kupietz, Harald Lüngen, Andreas Witt
Document Type:Conference Proceeding
Year of first Publication:2015
Date of Publication (online):2015/07/02
Tag:Corpus annotation; Corpus management; Corpus technology; Czech; National corpus
GND Keyword:Annotation; Datenbanksystem; Korpus <Linguistik>; Tschechisch
First Page:1
Last Page:4
Dewey Decimal Classification:400 Language / 410 Linguistics
Conferences, Workshops:CMLC-3 / 3rd Workshop on Challenges in the Management of Large Corpora
Open Access?:Yes
Licence (German):Creative Commons - Attribution-NonCommercial-NoDerivs 3.0 Germany