Lecture Notes in Computer Science
Refine
Year of publication
- 2017 (1) (remove)
Document Type
Language
- English (1)
Has Fulltext
- yes (1)
Is part of the Bibliography
- no (1) (remove)
Keywords
- Computerlinguistik (1)
- Deep learning (1)
- Maschinelles Lernen (1)
- Semantik (1)
- Veröffentlichung (1)
- author name disambiguation (1)
- classification (1)
- clustering (1)
- deep learning (1)
- machine learning (1)
Publicationstate
- Postprint (1)
- Zweitveröffentlichung (1)
Reviewstate
- Peer-Review (1)
Publisher
- Springer (1)
10450
We present a supervised machine learning AND system which tackles semantic similarity between publication titles by means of word embeddings. Word embeddings are integrated as external components, which keeps the model small and efficient, while allowing for easy extensibility and domain adaptation. Initial experiments show that word embeddings can improve the Recall and F score of the binary classification sub-task of AND. Results for the clustering sub-task are less clear, but also promising and overall show the feasibility of the approach.