OPUS 4 | Search

Das amtliche Regelwerk zur neuen Rechtschreibung als Hypertext. HTML-Version des amtlichen Regelwerks (1997)

Zur Lexikon-Grammatik-Schnittstelle in einem hypermedialen Informationssystem (1998)

Der Beitrag beschreibt Konzeption und Umsetzung der Anbindung von lexikalischen Datenbanken an das grammatische Informationssystem grammis, das seit Mitte 1993 am Institut für deutsche Sprache (IDS) entwickelt wird. Im Rahmen dieses Projekts wird erforscht, wie grammatisches Wissen mit moderner Computertechnik anschaulich dargestellt und verständlich vermittelt werden kann.

Texttechnologie und Grammatik (2006)

Schneider, Roman

Eine Ontologie für die Grammatik. Modellierung und Einsatzgebiete domänspezifischer Wissensstrukturen (2006)

Schneider, Roman

Texttechnologie und Grammatik (2006)

Schneider, Roman

A database-driven ontology for German grammar (2007)

Schneider, Roman

The main objective of this article is to describe the current activities at the Mannheim Institute for German Language regarding the implementation of a domain-specific ontology for German grammar. We differentiate ontology bases from ontology management Systems, point out the benefits of database-driven Solutions, and go Step by Step through all phases of the ontology lifecycle. In Order to demonstrate the practical use of our approach, we outline the interface between our ontology and the grammis web Information System, and compare the ontology-based retrieval mechanism with traditional full text search.

E-VALBU: Advanced SQL/XML processing of dictionary data using an object-relational XML database (2008)

Schneider, Roman

Contemporary practical lexicography uses a wide range of advanced technological aids,most prominently database systems for the administration of dictionary content. Since XML has become a de facto standard for the coding of lexicographic articles, integrated markup functionality – such as query, update, or transformation of instances – is of particular importance. Even the multi-channel distribution of dictionary data benefits from powerful XML database services. Exemplified by E-VALBU, the most comprehensive electronic dictionary on German verb valency, we outline an integrated approach for advanced XML storing and processing within an object-relational database, and for a public retrieval frontend using Web Services and AJAX technology.

Vorwort (2008)

Schneider, Roman

Web 3.0 ante portas? Integration von Social Web und Semantic Web (2008)

Schneider, Roman

Das Medium Internet ist im Wandel, und mit ihm ändern sich seine Publikations- und Rezeptionsbedingungen. Welche Chancen bieten die momentan parallel diskutierten Zukunftsentwürfe von Social Web und Semantic Web? Zur Beantwortung dieser Frage beschäftigt sich der Beitrag mit den Grundlagen beider Modelle unter den Aspekten Anwendungsbezug und Technologie, beleuchtet darüber hinaus jedoch auch deren Unzulänglichkeiten sowie den Mehrwert einer mediengerechten Kombination. Am Beispiel des grammatischen Online-Informationssystems grammis wird eine Strategie zur integrativen Nutzung der jeweiligen Stärken skizziert.

Ein XML-basiertes Datenbanksystem für digitale Wörterbücher- Ein Werkstattbericht aus dem Institut für Deutsche Sprache (2009)

Müller-Spitzer, Carolin ; Schneider, Roman

Das Online-Wortschatz-Informationssystem Deutsch (OWID) ist ein digitales Wörterbuchportal des Instituts für Deutsche Sprache. Alle darin zusammengeführten lexikografischen Daten sind auf XML-Basis feingranular strukturiert. Speicherung, Verwaltung und Retrieval dieser Daten übernimmt das Orade-basierte Electronic Dictionary Administration System (EDAS). Der vorliegende Beitrag erläutert die XML-basierte Modellierung der Daten, XML-spezifische Fragen der Speicherung, sowie das Retrieval mit XPath und SQL/XML.

Vorwort (2010)

Schmitz, Hans-Christian ; Schneider, Roman ; Schröder, Bernhard

A Functional Database Framework for Querying Very Large Multi-Layer Corpora (2011)

Schneider, Roman

Linguistic query systems are special purpose IR applications. We present a novel state-of-the-art approach for the efficient exploitation of very large linguistic corpora, combining the advantages of relational database management systems (RDBMS) with the functional MapReduce programming model. Our implementation uses the German DEREKO reference corpus with multi-layer linguistic annotations and several types of text-specific metadata, but the proposed strategy is language-independent and adaptable to large-scale multilingual corpora.

“… jetzt auch im praktischen Hypertextformat” – Die Anfänge der Grammatik mit der Maus (2011)

Schneider, Roman

Evaluating DBMS-based Access Strategies to Very Large Multi-layer Corpora (2012)

Schneider, Roman

Linguistic query systems are special purpose IR applications. As text sizes, annotation layers, and metadata schemes of language corpora grow rapidly, performing complex searches becomes a highly computational expensive task. We evaluate several storage models and indexing variants in two multi-processor/multi-core environments, focusing on prototypical linguistic querying scenarios. Our aim is to reveal modeling and querying tendencies – rather than absolute benchmark results – when using a relational database management system (RDBMS) and MapReduce for natural language corpus retrieval. Based on these findings, we are going to improve our approach for the efficient exploitation of very large corpora, combining advantages of state-of-the-art database systems with decomposition/parallelization strategies. Our reference implementation uses the German DeReKo reference corpus with currently more than 4 billion word forms, various multi-layer linguistic annotations, and several types of text-specific metadata. The proposed strategy is language-independent and adaptable to large-scale multilingual corpora.

KoGra-DB: Using MapReduce for language corpora (2013)

Schneider, Roman

Linguistic query systems are special purpose IR applications. We present a novel state-of-the-art approach for the efficient exploitation of very large linguistic corpora, combining the advantages of relational database management systems (RDBMS) with the functional MapReduce programming model. Our implementation uses the German DEREKO reference corpus with multi-layer linguistic annotations and several types of text-specific metadata, but the proposed strategy is language-independent and adaptable to large-scale multilingual corpora.

Editorial (2013)

Schneider, Roman ; Storrer, Angelika ; Mehler, Alexander

Hypertext, Wissensnetz und Datenbank: die Webinformationssysteme Grammis und ProGr@mm (2014)

Schneider, Roman ; Schwinn, Horst

Re-designing Online Terminology Resources for German Grammar. Project Report (2016)

Suchowolec, Karolina ; Lang, Christian ; Schneider, Roman

The compilation of terminological vocabularies plays a central role in the organization and retrieval of scientific texts. Both simple keyword lists as well as sophisticated modellings of relationships between terminological concepts can make a most valuable contribution to the analysis, classification, and finding of appropriate digital documents, either on the Web or within local repositories. This seems especially true for long-established scientific fields with various theoretical and historical branches, such as linguistics, where the use of terminology within documents from different origins is sometimes far from being consistent. In this short paper, we report on the early stages of a project that aims at the re-design of an existing domain-specific KOS for grammatical content grammis. In particular, we deal with the terminological part of grammis and present the state-of-the-art of this online resource as well as the key re-design principles. Further, we propose questions regarding ramifications of the Linked Open Data and Semantic Web approaches for our re-design decisions.

Die grammatischen Online-Angebote des IDS aus Sicht der Germanistik im Ausland. Gegenwart und Zukunft (2018)

Dalmas, Martine ; Schneider, Roman

Seit Mitte der 1990er Jahre wird am Institut für deutsche Sprache (IDS) in Mannheim erforscht, wie der hochkomplexe Gegenstandsbereich „Grammatik“ unter Ausnutzung digitaler Sprachressourcen und hypertextueller Navigationsstrukturen gleichermaßen wissenschaftlich fundiert und anschaulich vermittelt werden kann. Die grammatischen Online-Informationssysteme des IDS wenden sich nicht allein an Forscher und die interessierte Öffentlichkeit in Deutschland, sondern in gleichem Maße an Germanisten und Deutsch-Lernende in der ganzen Welt. Der vorliegende Beitrag beschreibt die damit verbundenen Hoffnungen und Anspruche. Daran anschließend thematisiert er praktische Einsatzmöglichkeiten und skizziert die funktionale und inhaltliche Weiterentwicklung der digitalen Grammatik-Angebote.

GeCoTagger: annotation of German verb complements with conditional random fields (2018)

Fürbacher, Monica ; Schneider, Roman

Complement phrases are essential for constructing well-formed sentences in German. Identifying verb complements and categorizing complement classes is challenging even for linguists who are specialized in the field of verb valency. Against this background, we introduce an ML-based algorithm which is able to identify and classify complement phrases of any German verb in any written sentence context. We use a large training set consisting of example sentences from a valency dictionary, enriched with POS tagging, and the ML-based technique of Conditional Random Fields (CRF) to generate the classification models.

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

Publisher

35 search hits