OPUS 4 | 400 Language, Linguistics

400 Language, Linguistics

400 Language (135)
401 Philosophy of language, language theory (2)
402 Miscellany
403 Dictionaries, encyclopedias
404 Special topics (1)
405 Serial publications
406 Organizations, management
407 Education, research, related topics (1)
408 Treatment by groups of people
409 Geographic and person-related treatment

42 search hits

1 to 10

Lessons learned from joining forces across disparate disciplines in the NFDI: the conference on research on text analytics. Presented at the 1st Conference on Research Data Infrastructure (CoRDI), Karlsruhe, 12. – 14. September 2023 (2023)

Krieger, Ulrich ; Trippel, Thorsten

This contribution summarizes the lessons learned from the organization of a joint conference on text analytics research by the Business, Economic, and Related Data (BERD@NFDI) and Text+ consortia within the National Research Data Infrastructure (NFDI) in Germany. The collaboration aimed to identify common ground and foster interdisciplinary dialogue between scholars in the humanities and in the business domain. The lessons learned include the importance of presenting research questions using textual data to establish common ground, similarities in methodology for processing textual data between the consortia, similarities in research data management, and the need for regular interconsortial discussions on textual analysis methods and data. The collaboration proved valuable for interdisciplinary dialogue within the NFDI, and further collaboration between the consortia is planned.

Data for my research: Where can I get it, where do I take it, what can I do with it, how do I use it in my resume? Looking at the research data infrastructures CLARIN in Europe and Text+ in Germany. Presented at the CLEOPATRA final public workshop, Hannover, 2023-05-15 (2023)

Trippel, Thorsten

"Reproducibility crisis" and "empirical turn" are only two of the keywords that come up when giving reasons for research data management. Research data are omnipresent, and as data processing procedures become increasingly automated, they become even more important. However, the fact that new methods require and produce data does not mean that data are easily accessible or reusable, or that they make a difference in a researcher's CV, even if a large portion of research effort goes into data creation, acquisition, preparation, and analysis. In this talk I will present where we find data in the research process and where we may find appropriate support for data management, and I will advocate for a procedure for including data in research publications and resumes. This presentation relies on work within the BMBF-funded project CLARIN-D. It also builds on work within the German National Research Data Infrastructure (NFDI) consortium Text+, DFG project number 460033370.

Increasing CMDI’s semantic interoperability with schema.org (2022)

Meisinger, Nino ; Trippel, Thorsten ; Zinn, Claus

The CLARIN Concept Registry (CCR) is the common semantic ground for most CMDI-based profiles to describe language-related resources in the CLARIN universe. While the CCR supports semantic interoperability within this universe, it does not extend beyond it. The flexibility of CMDI, however, allows users to use other term or concept registries when defining their metadata components. In this paper, we describe our use of schema.org, a light ontology used by many parties across disciplines.

Text+ und die GND – Community-Hub und Wissensgraph (2022)

Kett, Jürgen ; Kudella, Christoph ; Rapp, Andrea ; Stein, Regine ; Trippel, Thorsten

In Text+, the NFDI consortium focused on the research data of language- and text-based disciplines, authority data play a central role in the interoperable description and semantic linking of distributed data sources. The Gemeinsame Normdatei (GND, Integrated Authority File) in particular is an important hub at the centre of an emerging cross-domain knowledge graph. Within Text+, this function is to be developed further and expanded by establishing a GND agency for language- and text-based research data. The aim is to create low-threshold, quality-assured opportunities for researchers to participate and, at the same time, to increase the degree to which the GND is interlinked, including through terminology mappings. Specific requirements and usage practices are illustrated using the data domains of Text+.

CLARIAH-DE work package 5 - community engagement: outreach/dissemination and liaison (2021)

Walker, Nathalie ; Werthmann, Antonina ; Trippel, Thorsten ; Buddenbohm, Stefan ; Weimer, Lukas ; Friedrichs, Sonja

This poster summarizes the results of the CLARIAH-DE Work Package 5 - Community Engagement: Outreach/Dissemination and Liaison. Work package 5 engages with the community through dissemination activities, outreach and liaison. The work package set itself the following sub goals:
- Combining the existing dissemination and outreach activities of CLARIN-D and DARIAH-DE in a meaningful way and elaborating on them. In some cases this meant continuity, in other cases a new appearance for resources.
- Providing a web portal as a gateway to the CLARIAH-DE project.
- Creating a common identity and corporate identity and maintaining the established level of trust users already put into CLARIN-D and DARIAH-DE.
- Providing a social media presence as well as a physical presence at workshops, conferences and other meetings in the Digital Humanities.

The CLARIN infrastructure as an interoperable language technology platform for SSH and beyond (2023)

Branco, António ; Eskevich, Maria ; Frontini, Francesca ; Hajič, Jan ; Hinrichs, Erhard ; de Jong, Franciska ; Kamocki, Paweł ; König, Alexander ; Lindén, Krister ; Navarretta, Constanza ; Piasecki, Maciej ; Piperidis, Stelios ; Pitkänen, Olli ; Simov, Kiril ; Skadiņa, Inguna ; Trippel, Thorsten ; Witt, Andreas ; Zinn, Claus

CLARIN is a European Research Infrastructure Consortium developing and providing a federated and interoperable platform to support scientists in the field of the Social Sciences and Humanities in carrying out language-related research. This contribution provides an overview of the entire infrastructure with a particular focus on tool interoperability, ease of access to research data, tools and services, the importance of sharing knowledge within and across (national) communities, and community building. By taking into account FAIR principles from the very beginning, CLARIN has become a successful example of a research infrastructure that is actively used by its members. The benefits CLARIN members reap from their infrastructure secure a future for their common good that is both sustainable and attractive to partners beyond the original target groups.

How to connect language resources, infrastructures, and communities (2022)

Draxler, Christoph ; Geyken, Alexander ; Hinrichs, Erhard ; Klosa-Kückelhaus, Annette ; Teich, Elke ; Trippel, Thorsten

This chapter will present lessons learned from CLARIN-D, the German CLARIN national consortium. Members of the CLARIN-D communities and of the CLARIN-D consortium have been engaged in innovative, data-driven, and community-based research, using language resources and tools in the humanities and neighbouring disciplines. We will present different use cases and users’ stories that demonstrate the innovative research potential of large digital corpora and lexical resources for the study of language change and variation, for language documentation, for literary studies, and for the social sciences. We will emphasize the added value of making language resources and tools available in the CLARIN distributed research infrastructure and will discuss legal and ethical issues that need to be addressed in the use of such an infrastructure. Innovative technical solutions for accessing digital materials still under copyright and for data mining such materials will be presented. We will outline the need for close interaction with communities of interest in the areas of curriculum development, data management, and training the next generation of digital humanities scholars. The importance of community-supported standards for encoding language resources and the practice of community-based quality control for digital research data will be presented as a crucial step toward the provisioning of high quality research data. The chapter will conclude with a discussion of important directions for innovative research and for supporting infrastructure development over the next decade and beyond.

The use of graphs as a data structure for reusing lexical resources (2007)

Trippel, Thorsten

Lexical resources are often represented in table form, e.g., in relational databases, or in specially marked-up texts, for example, in document-based XML models. This paper describes how lexical structures can be modeled as graphs, how this model can be used to exploit existing lexical resources, and how different types of lexical resources can be combined.

Interoperable language resources (2007)

Declerck, Thierry ; Ide, Nancy ; Trippel, Thorsten

In this contribution we present some work of the European R&D project “LIRICS” and of the ISO/TC 37/SC 4 committee related to the topic of interoperability and re-use of language resources. We introduce some basic mechanisms of the standardization work in ISO and describe in more detail the general approach to the annotation of language data within ISO.

Lexicography (2008)

Trippel, Thorsten
