OPUS 4 | Search

Refine

Has Fulltext

yes (7)

7 search hits

1 to 7

Sort by

Relating Words. A Model of Base Recognition. Part 1 (1993)

Raffelsiefen, Renate

Framenet in Action: The Case of Attaching (2003)

Fillmore, Charles J. ; Schwarzer-Petruck, Myriam ; Ruppenhofer, Josef ; Wright, Abby

Unification of XML Documents with Concurrent Markup (2005)

Witt, Andreas ; Goecke, Daniela ; Sasaki, Felix ; Lüngen, Harald

An approach to the unification of XML (Extensible Markup Language) documents with identical textual content and concurrent markup in the framework of XML-based multi-layer annotation is introduced. A Prolog program allows the possible relationships between element instances on two annotation layers that share PCDATA to be explored and also the computing of a target node hierarchy for a well-formed, merged XML document. Special attention is paid to identity conflicts between element instances, for which a default solution that takes into account metarelations that hold between element types on the different annotation layers is provided. In addition, rules can be specified by a user to prescribe how identity conflicts should be solved for certain element types.

Multilingual language resources and interoperability (2009)

Witt, Andreas ; Heid, Ulrich ; Sasaki, Felix ; Sérasset, Gilles

This article introduces the topic of ‘‘Multilingual language resources and interoperability’’. We start with a taxonomy and parameters for classifying language resources. Later we provide examples and issues of interoperatability, and resource architectures to solve such issues. Finally we discuss aspects of linguistic formalisms and interoperability.

Sustainability of annotated resources in linguistics: A web-platform for exploring, querying, and distributing linguistic corpora and other resources (2009)

Rehm, Georg ; Schonefeld, Oliver ; Witt, Andreas ; Hinrichs, Erhard ; Reis, Marga

We report on finished work in a project that is concerned with providing methods, tools, best practice guidelines, and solutions for sustainable linguistic resources. The article discusses several general aspects of sustainability and introduces an approach to normalizing corpus data and metadata records. Moreover, the architecture of the sustainability platform implemented by the authors is described.

SusTEInability of linguistic resources through feature structures (2009)

Witt, Andreas ; Rehm, Georg ; Hinrichs, Erhard ; Lehmberg, Timm ; Stegmann, Jens

This article shows that the TEI tag set for feature structures can be adopted to represent a heterogeneous set of linguistic corpora. The majority of corpora is annotated using markup languages that are based on the Annotation Graph framework, the upcoming Linguistic Annotation Format ISO standard, or according to tag sets defined by or based upon the TEI guidelines. A unified representation comprises the separation of conceptually different annotation layers contained in the original corpus data (e.g. syntax, phonology, and semantics) into multiple XML files. These annotation layers are linked to each other implicitly by the identical textual content of all files. A suitable data structure for the representation of these annotations is a multi-rooted tree that again can be represented by the TEI and ISO tag set for feature structures. The mapping process and representational issues are discussed as well as the advantages and drawbacks associated with the use of the TEI tag set for feature structures as a storage and exchange format for linguistically annotated data.

Grammatical Categories and the Methodology of Linguistics (1994)

Meyer, Peter

1 to 7

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

Publisher

7 search hits