Designing a verb guesser for part of speech tagging in Northern Sotho
- The aim of this article is to describe the design and implementation of a verb guesser that will enhance the results of statistical part of speech (POS) tagging of verbs in Northern Sotho. It will be illustrated that verb stems in Northern Sotho can successfully be recognised by examining their suffixes and combinations of suffixes. Two approaches to verbal derivation analysis will be utilised, namely morphological analysis and corpus querying of suffixes and combinations of suffixes.
Author: | Danie J Prinsloo, Gertrud FaaßORCiD, Elsabé TaljardORCiD, Ulrich HeidORCiDGND |
---|---|
URN: | urn:nbn:de:bsz:mh39-124766 |
DOI: | https://doi.org/10.2989/SALALS.2008.26.2.1.565 |
ISSN: | 1727-9461 |
Parent Title (English): | Southern African Linguistics and Applied Language Studies |
Publisher: | Taylor & Francis |
Place of publication: | London |
Document Type: | Article |
Language: | English |
Year of first Publication: | 2008 |
Date of Publication (online): | 2024/01/31 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) [Zweitveröffentlichung] |
Publicationstate: | Zweitveröffentlichung |
Publicationstate: | Postprint |
Reviewstate: | Peer-Review |
Tag: | part of speech tagging; verb guesser; verbal derivation |
GND Keyword: | Computerlinguistik; Datenanalyse; Pedi-Sprache; Suffix; Verb; Verbalstamm; Wortart |
Volume: | 26 |
Issue: | 2 |
First Page: | 185 |
Last Page: | 196 |
Note: | This is an Accepted Manuscript of an article published by Taylor & Francis in Southern African Linguistics and Applied Language Studies in October 2008, available at: https://doi.org/10.2989/SALALS.2008.26.2.1.565. |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
Linguistics-Classification: | Computerlinguistik |
Licence (German): | ![]() |