The NottDeuYTSch corpus contains over 33 million words from approximately 3 million YouTube comments on videos published between 2008 and 2018 and targeted at a young, German-speaking demographic. It represents an authentic snapshot of the language of young German speakers. For representativeness and balance, the corpus was sampled proportionally by video category and year from a database of 112 popular German-language YouTube channels in the DACH region. Each comment carries a considerable amount of associated metadata, which enables further longitudinal and cross-sectional analyses.
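Proportional sampling by category and year can be illustrated with a minimal sketch. This is not the corpus authors' procedure: the function name `proportional_sample`, the record fields, and the toy data are all hypothetical, and the sketch only shows the general idea that each (category, year) stratum keeps roughly its share of the whole.

```python
import random
from collections import defaultdict


def proportional_sample(items, key, n, seed=0):
    """Draw about n items so that each stratum keeps its share of the whole.

    Hypothetical helper for illustration; rounding means the result
    can differ from n by a few items on uneven data.
    """
    rng = random.Random(seed)

    # Group the items into strata, e.g. by (category, year).
    strata = defaultdict(list)
    for item in items:
        strata[key(item)].append(item)

    total = len(items)
    sample = []
    for group in strata.values():
        # Allocate slots in proportion to the stratum's size.
        k = round(n * len(group) / total)
        sample.extend(rng.sample(group, min(k, len(group))))
    return sample


# Toy data (assumed, not taken from the corpus): 100 comment records.
comments = (
    [{"category": "Gaming", "year": 2016}] * 60
    + [{"category": "Music", "year": 2016}] * 30
    + [{"category": "Music", "year": 2017}] * 10
)

subset = proportional_sample(
    comments, key=lambda c: (c["category"], c["year"]), n=10
)
```

On this toy data the three strata hold 60, 30, and 10 percent of the records, so a sample of 10 draws 6, 3, and 1 records from them respectively.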
CorpusExplorer
(2018)
Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 45 interactive visualizations under a user-friendly interface. Routine tasks such as text acquisition, cleaning, and tagging are fully automated. The simple interface supports use in university teaching and helps users and students reach substantial results quickly. The CorpusExplorer supports many standard formats (XML, CSV, JSON, R, etc.) and also offers its own software development kit (SDK).