Accelerating corpus search using multiple cores
- The Manatee corpus management system on which the Sketch Engine is built is efficient, but unable to harness the power of today’s multiprocessor machines. We describe a new, compatible implementation of Manatee which we develop in the Go language and report on the performance gains that we obtained.
| Author: | Radoslav Rábara, Pavel Rychlý, Ondřej Herman, Miloš Jakubíček |
|---|---|
| URN: | urn:nbn:de:bsz:mh39-62629 |
| Parent Title (English): | Proceedings of the Workshop on Challenges in the Management of Large Corpora and Big Data and Natural Language Processing (CMLC-5+BigNLP) 2017 including the papers from the Web-as-Corpus (WAC-XI) guest section. Birmingham, 24 July 2017 |
| Publisher: | Institut für Deutsche Sprache |
| Place of publication: | Mannheim |
| Editor: | Piotr BańskiORCiDGND, Marc KupietzORCiDGND, Harald LüngenGND, Paul Rayson, Hanno Biber, Evelyn BreitenederGND, Simon Clematide, John Mariani, Mark Stevenson, Theresa Sick |
| Document Type: | Conference Proceeding |
| Language: | English |
| Year of first Publication: | 2017 |
| Date of Publication (online): | 2017/07/05 |
| Publicationstate: | Veröffentlichungsversion |
| Reviewstate: | Peer-Review |
| Tag: | Corpus technology; Large corpora; Sketch engine; colonial language contact |
| GND Keyword: | Datenmanagement; Korpus <Linguistik>; Suchmaschine; Texttechnologie |
| Page Number: | 5 |
| First Page: | 30 |
| Last Page: | 34 |
| DDC classes: | 400 Sprache |
| Open Access?: | ja |
| Leibniz-Classification: | Sprache, Linguistik |
| Linguistics-Classification: | Korpuslinguistik |
| Conferences, Workshops: | CMLC-5 + BigNLP / 5th Workshop on Challenges in the Management of Large Corpora and Big Data and Natural Language Processing |
| Licence (German): | Creative Commons - Namensnennung-Nicht kommerziell-Keine Bearbeitung 3.0 Deutschland |


