Learning to Generate Word Representations using Subword Information

lib:85a8003a5e3f2a59 (v1.0.0)

Authors: Yeachan Kim, Kang-Min Kim, Ji-Min Lee, SangKeun Lee
Where published: COLING 2018 8

Abstract URL: https://www.aclweb.org/anthology/C18-1216/

Distributed representations of words play a major role in the field of natural language processing by encoding semantic and syntactic information of words. However, most existing works on learning word representations typically regard words as individual atomic units and thus are blind to subword information in words. This further gives rise to a difficulty in representing out-of-vocabulary (OOV) words. In this paper, we present a character-based word representation approach to deal with this limitation. The proposed model learns to generate word representations from characters. In our model, we employ a convolutional neural network and a highway network over characters to extract salient features effectively. Unlike previous models that learn word representations from a large corpus, we take a set of pre-trained word embeddings and generalize it to word entries, including OOV words. To demonstrate the efficacy of the proposed model, we perform both an intrinsic and an extrinsic task which are word similarity and language modeling, respectively. Experimental results show clearly that the proposed model significantly outperforms strong baseline models that regard words or their subwords as atomic units. For example, we achieve as much as 18.5% improvement on average in perplexity for morphologically rich languages compared to strong baselines in the language modeling task.
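
The pipeline the abstract describes (characters, then a convolutional network, then a highway network, then a word vector fitted to a pre-trained embedding) can be illustrated with a minimal NumPy sketch of the forward pass. This is not the authors' implementation. The alphabet, all dimensions, the filter widths, the tanh and ReLU activations, the final linear projection, and the squared-distance objective are assumptions chosen for illustration, and the weights are random rather than trained.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical hyperparameters; the abstract does not specify any of these.
ALPHABET = "abcdefghijklmnopqrstuvwxyz"
CHAR_TO_ID = {c: i + 1 for i, c in enumerate(ALPHABET)}  # id 0 = padding/unknown
CHAR_DIM, N_FILTERS, WIDTHS, WORD_DIM = 8, 16, (2, 3, 4), 50
FEAT_DIM = N_FILTERS * len(WIDTHS)

# Randomly initialised (untrained) parameters.
char_emb = rng.normal(0.0, 0.1, (len(ALPHABET) + 1, CHAR_DIM))
conv_filters = {w: rng.normal(0.0, 0.1, (N_FILTERS, w * CHAR_DIM)) for w in WIDTHS}
W_h, b_h = rng.normal(0.0, 0.1, (FEAT_DIM, FEAT_DIM)), np.zeros(FEAT_DIM)
W_t, b_t = rng.normal(0.0, 0.1, (FEAT_DIM, FEAT_DIM)), np.full(FEAT_DIM, -2.0)
W_out = rng.normal(0.0, 0.1, (WORD_DIM, FEAT_DIM))


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def generate_embedding(word):
    """Map a character sequence to a word vector (CNN + highway, forward pass only)."""
    ids = [CHAR_TO_ID.get(c, 0) for c in word.lower()]
    # Pad short words so the widest filter has at least one window.
    ids += [0] * max(0, max(WIDTHS) - len(ids))
    x = char_emb[ids]  # shape: (n_chars, CHAR_DIM)

    pooled = []
    for width, filt in conv_filters.items():
        # Each window is `width` consecutive character vectors, flattened.
        windows = np.stack(
            [x[i:i + width].ravel() for i in range(len(ids) - width + 1)]
        )
        feature_maps = np.tanh(windows @ filt.T)  # (n_windows, N_FILTERS)
        pooled.append(feature_maps.max(axis=0))   # max-over-time pooling
    y = np.concatenate(pooled)                    # (FEAT_DIM,)

    # Highway layer: the gate t mixes a transformed copy of y with y itself.
    t = sigmoid(W_t @ y + b_t)
    z = t * np.maximum(0.0, W_h @ y + b_h) + (1.0 - t) * y

    return W_out @ z  # (WORD_DIM,)


def squared_distance_loss(generated, target):
    """Assumed objective: squared Euclidean distance to a pre-trained vector."""
    return float(np.sum((generated - target) ** 2))


# Usage: any string gets a vector, including words absent from the vocabulary.
oov_vector = generate_embedding("unfathomableness")
pretrained_stub = np.zeros(WORD_DIM)  # stand-in for a real pre-trained embedding
loss = squared_distance_loss(oov_vector, pretrained_stub)
```

Because the vector is computed from characters alone, the same function applies to in-vocabulary and OOV words. In a trained model, the parameters would be fitted so that generated vectors for in-vocabulary words approximate their pre-trained embeddings.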
