Authors: Robert Schwarzenberg,Lisa Raithel,David Harbecke
Where published:
WS 2019 6
ArXiv: 1904.01500
Document:
PDF
DOI
Artifact development version:
GitHub
Abstract URL: http://arxiv.org/abs/1904.01500v1
Distributed word vector spaces are considered hard to interpret which hinders
the understanding of natural language processing (NLP) models. In this work, we
introduce a new method to interpret arbitrary samples from a word vector space.
To this end, we train a neural model to conceptualize word vectors, which means
that it activates higher order concepts it recognizes in a given vector.
Contrary to prior approaches, our model operates in the original vector space
and is capable of learning non-linear relations between word vectors and
concepts. Furthermore, we show that it produces considerably less entropic
concept activation profiles than the popular cosine similarity.