Accelerating corpus search using multiple cores
- The Manatee corpus management system, on which the Sketch Engine is built, is efficient but unable to harness the power of today’s multiprocessor machines. We describe a new, compatible implementation of Manatee, which we developed in the Go language, and report on the performance gains that we obtained.
Author: | Radoslav Rábara, Pavel Rychlý, Ondřej Herman, Miloš Jakubíček |
---|---|
URN: | urn:nbn:de:bsz:mh39-62629 |
Parent Title (English): | Proceedings of the Workshop on Challenges in the Management of Large Corpora and Big Data and Natural Language Processing (CMLC-5+BigNLP) 2017 including the papers from the Web-as-Corpus (WAC-XI) guest section. Birmingham, 24 July 2017 |
Publisher: | Institut für Deutsche Sprache |
Place of publication: | Mannheim |
Editor: | Piotr Bański, Marc Kupietz, Harald Lüngen, Paul Rayson, Hanno Biber, Evelyn Breiteneder, Simon Clematide, John Mariani, Mark Stevenson, Theresa Sick |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2017 |
Date of Publication (online): | 2017/07/05 |
Publication state: | Published version |
Review state: | Peer-reviewed |
Tag: | Corpus technology; Large corpora; Sketch Engine; colonial language contact |
GND Keyword: | Datenmanagement; Korpus <Linguistik>; Suchmaschine; Texttechnologie |
Page Number: | 5 |
First Page: | 30 |
Last Page: | 34 |
DDC classes: | 400 Language |
Open Access?: | yes |
Leibniz-Classification: | Language, Linguistics |
Linguistics-Classification: | Corpus linguistics |
Conferences, Workshops: | CMLC-5 + BigNLP / 5th Workshop on Challenges in the Management of Large Corpora and Big Data and Natural Language Processing |
Licence (German): | Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Germany |