Refine
Year of publication
- 2021 (1) (remove)
Document Type
Language
- English (1)
Has Fulltext
- yes (1)
Is part of the Bibliography
- no (1)
Keywords
- Ausrichten <Technik> (1)
- Computerlinguistik (1)
- Optische Zeichenerkennung (1)
- Volltext (1)
- XML (1)
- biomedical language processing (1)
- document triage (1)
- manual database curation (1)
- word-level alignment (1)
Publicationstate
- Veröffentlichungsversion (1) (remove)
Reviewstate
- Peer-Review (1)
Publisher
We describe a simple procedure for the automatic creation of word-level alignments between printed documents and their respective full-text versions. The procedure is unsupervised, uses standard, off-the-shelf components only, and reaches an F-score of 85.01 in the basic setup and up to 86.63 when using pre- and post-processing. Potential areas of application are manual database curation (incl. document triage) and biomedical expression OCR.