Refine
Year of publication
- 2021 (6) (remove)
Document Type
- Conference Proceeding (6) (remove)
Language
- English (6)
Has Fulltext
- yes (6)
Keywords
- Beleidigung (3)
- Beschimpfung (3)
- abusive language (3)
- Automatische Sprachanalyse (2)
- Datensatz (2)
- Deutsch (2)
- Semantik (2)
- Ambiguität (1)
- Ausrichten <Technik> (1)
- Automatische Spracherkennung (1)
Publicationstate
Reviewstate
- Peer-Review (6)
Publisher
- Association for Computational Linguistics (6) (remove)
We describe a simple procedure for the automatic creation of word-level alignments between printed documents and their respective full-text versions. The procedure is unsupervised, uses standard, off-the-shelf components only, and reaches an F-score of 85.01 in the basic setup and up to 86.63 when using pre- and post-processing. Potential areas of application are manual database curation (incl. document triage) and biomedical expression OCR.