TY - CHAP U1 - Konferenzveröffentlichung A1 - Müller, Mark-Christoph A1 - Ghosh, Sucheta A1 - Rey, Maja A1 - Wittig, Ulrike A1 - Müller, Wolfgang A1 - Strube, Michael ED - Chandrasekaran, Muthu Kumar ED - de Waard, Anita ED - Feigenblat, Guy ED - Freitag, Dayne ED - Ghosal, Tirthankar ED - Hovy, Eduard ED - Knoth, Petr ED - Konopnicki, David ED - Mayr, Philipp ED - Patton, Robert M. ED - Shmueli-Scheuer, Michal T1 - Reconstructing manual information extraction with DB-to-document backprojection: Experiments in the life science domain T2 - Proceedings of the First Workshop on Scholarly Document Processing. Online, November 19, 2020 N2 - We introduce a novel scientific document processing task for making previously inaccessible information in printed paper documents available to automatic processing. We describe our data set of scanned documents and data records from the biological database SABIO-RK, provide a definition of the task, and report findings from preliminary experiments. Rigorous evaluation proved challenging due to lack of gold-standard data and a difficult notion of correctness. Qualitative inspection of results, however, showed the feasibility and usefulness of the task. KW - Computerlinguistik KW - Information Extraction KW - Schriftstück KW - Experiment KW - Datenanalyse KW - Qualitative Inhaltsanalyse KW - manual information extraction KW - life science KW - document processing KW - automatic processing KW - SABIO-RK Y1 - 2020 UN - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-110854 SN - 978-1-952148-70-5 SS - 978-1-952148-70-5 U6 - https://doi.org/10.18653/v1/2020.sdp-1.9 DO - https://doi.org/10.18653/v1/2020.sdp-1.9 SP - 81 EP - 90 PB - Association for Computational Linguistics CY - Stroudsburg, Pennsylvania ER -