Refine
Year of publication
- 2008 (101) (remove)
Document Type
- Part of a Book (56)
- Article (20)
- Conference Proceeding (16)
- Book (5)
- Doctoral Thesis (3)
- Working Paper (1)
Is part of the Bibliography
- no (101) (remove)
Keywords
- Deutsch (33)
- Korpus <Linguistik> (9)
- Wörterbuch (7)
- Gesprochene Sprache (6)
- Englisch (5)
- Automatische Sprachanalyse (4)
- Deutschland <DDR> (4)
- Europa (4)
- Konversationsanalyse (4)
- Spanisch (4)
Publicationstate
- Veröffentlichungsversion (101) (remove)
Reviewstate
- (Verlags)-Lektorat (72)
- Peer-Review (14)
- Qualifikationsarbeit (Dissertation, Habilitationsschrift) (3)
- Verlags-Lektorat (3)
- (Verlag)-Lektorat (1)
- (Verlags-) Lektorat (1)
- (Verlags-)Lektorat (1)
- Peer-Revied (1)
- Peer-review (1)
Publisher
- de Gruyter (20)
- Narr (8)
- European Language Resources Association (ELRA) (5)
- iudicium (4)
- BBAW (3)
- Lang (3)
- Stauffenburg (3)
- University of Oulu (3)
- Academia (2)
- Institut für Deutsche Sprache (2)
Lingvistiskās ainavas metode – netradicionāls ceļš multilingvisma jautājumu izpētē un mācīšanā
(2008)
Šī raksta mērķis ir iepazīstināt ar lingvistiskās ainavas metodi un izskaidrot tās priekšrocības ne tikai valodnieku pētījumos, bet arī tās ieviešanā mācību procesā skolās un augstskolās. Pēc šī nelielā ievada vēlamies jums parādīt ne tikai metodes ieviešanas gaitu, bet arī pašreizējo attīstības stadiju. Mēs iepazīstināsim arī ar 2008. gada sākumā izstrādāto projektu ,,Latvijas lingvistiskā ainava Baltijas valstu kontekstā”, kuru arī šobrīd realizējam Rēzeknes Augstskolā (maģistra studiju programmas ,,Filoloģija” studenti un divi docētāji). Tāpat tiks dots neliels ieskats par projektā gūtajiem rezultātiem un problēmām, ar kurām saskārāmies pētījuma laikā, kā arī iepazīstināsim ar jauniegūto pieredzi.
Current Natural Language Processing (NLP) systems feature high-complexity processing pipelines that require the use of components at different levels of linguistic and application specific processing. These components often have to interface with external e.g. machine learning and information retrieval libraries as well as tools for human annotation and visualization. At the UKP Lab, we are working on the Darmstadt Knowledge Processing Software Repository (DKPro) (Gurevych et al., 2007a; Müller et al., 2008) to create a highly flexible, scalable and easy-to-use toolkit that allows rapid creation of complex NLP pipelines for semantic information processing on demand. The DKPro repository consists of several main parts created to serve the purposes of different NLP application areas
In this paper we investigate the coverage of the two knowledge sources WordNet and Wikipedia for the task of bridging resolution. We report on an annotation experiment which yielded pairs of bridging anaphors and their antecedents in spoken multi-party dialog. Manual inspection of the two knowledge sources showed that, with some interesting exceptions, Wikipedia is superior to WordNet when it comes to the coverage of information necessary to resolve the bridging anaphors in our data set. We further describe a simple procedure for the automatic extraction of the required knowledge from Wikipedia by means of an API, and discuss some of the implications of the procedure’s performance.
The thesis describes a fully automatic system for the resolution of the pronouns 'it', 'this', and 'that' in English unrestricted multi-party dialog. Referential relations considered include both normal NP-antecedence as well as discourse-deictic pronouns. The thesis contains a theoretical part with a comprehensive empiricial study, and a practical part describing machine learning experiments.
In this paper, we present a suite of flexible UIMA-based components for information retrieval research which have been successfully used (and re-used) in several projects in different application domains. Implementing the whole system as UIMA components is beneficial for configuration management, component reuse, implementation costs, analysis and visualization.
Lexicography
(2008)
Lexicon schemas and their use are discussed in this paper from the perspective of lexicographers and field linguists. A variety of lexicon schemas have been developed, with goals ranging from computational lexicography (DATR) through archiving (LIFT, TEI) to standardization (LMF, FSR). A number of requirements for lexicon schemas are given. The lexicon schemas are introduced and compared to each other in terms of conversion and usability for this particular user group, using a common lexicon entry and providing examples for each schema under consideration. The formats are assessed and the final recommendation is given for the potential users, namely to request standard compliance from the developers of the tools used. This paper should foster a discussion between authors of standards, lexicographers and field linguists.
Our research task consists in the study of the way in which multilingual resources are mobilized in team work within collaborative activities; how they are exploited in a specific way in order both to enhance collaboration and to respect the specificities of the members’ linguistic competences and practices within the team. Central to our analytical work, which is inspired by ethnomethodological conversation analysis, is the relationship between multilingual resources and the situated organization of linguistic uses and of social practices. These two aspects are reflexively articulated, multilingual resources being shaped by the very contexts of their use and activities being constrained and thus structured by the available resources.
Lors de la négociation située de l'alternance des tours de parole en interaction (Sacks, Schegloff et Jefferson, 1974), les participants s'orientent vers la complétude possible des unités de construction de tour. Grâce à une complétion différée d'un tour de parole précédent, un locuteur peut revendiquer son droit à la parole au-delà d'un tour intercalaire d'un autre locuteur. Cet article exploite différentes formes de cette "delayed completion" (Lerner, 1989) en français parlé. À l'aide du cadre théorique de l'Analyse conversationnelle (ten Have, 1999), nous démontrerons que ce procédé ne relève pas uniquement d'une alternance de tour de parole problématique, mais aussi de séquences collaboratives, qui sont en lien étroit avec le phénomène des constructions syntaxiques collaboratives. En s'intéressant à ces structures syntaxiques émergentes, il est possible de démontrer la négociation située et locale - tour par tour – du droit à la parole et de la dynamique de l'alternance des tours en conversation ordinaire. A base d'une collection d'extraits issus d'interactions naturelles enregistrées en audio ou en vidéo, différentes manières de revendiquer ou de partager son tour seront illustrées. Lors des analyses, une attention particulière sera dédiée à quelques phénomènes récurrents dans les séquences de complétion différée. Ainsi, l'exploitation de certaines conjonctions en tant que marqueurs discursifs ou la présence d'allongements vocaliques en fin du premier segment semblent indiquer des co-occurrences de ressources audibles spécifiques à différents types de complétion différée en conversation française.
Cet article se fonde sur une collection de répétitions suite à un chevauchement, tirée de données vidéo en allemand et en français. La description systématique de cet outil de reprise de tour articule une comparaison entre cas clairs et cas déviants de ce phénomène. Il est démontré que le recyclage est aussi bien une ressource du locuteur suivant que du locuteur en cours.