Authors: Christo Kirov, Ryan Cotterell, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Patrick Xia, Manaal Faruqui, Sabrina J. Mielke, Arya D. McCarthy, Sandra Kübler, David Yarowsky, Jason Eisner, Mans Hulden
Where published:
LREC 2018 5
ArXiv: 1810.11101
Abstract URL: https://arxiv.org/abs/1810.11101v2
The Universal Morphology (UniMorph) project is a collaborative effort to improve how NLP handles complex morphology across the world's languages. The project releases annotated morphological data using a universal tagset, the UniMorph schema. Each inflected form is associated with a lemma, which typically carries its underlying lexical meaning, and a bundle of morphological features from our schema. Additional supporting data and tools are also released on a per-language basis when available. UniMorph is based at the Center for Language and Speech Processing (CLSP) at Johns Hopkins University in Baltimore, Maryland, and is sponsored by the DARPA LORELEI program. This paper details advances made to the collection, annotation, and dissemination of project resources since the initial UniMorph release described at LREC 2016.
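To illustrate the pairing of an inflected form with a lemma and a feature bundle, here is a minimal Python sketch. It assumes, without confirmation from the abstract, that an entry is stored as one tab-separated line (lemma, inflected form, features) with feature tags joined by semicolons. The `Entry` type, the `parse_line` helper, and the sample line are hypothetical names and data chosen for illustration only.

```python
from typing import FrozenSet, NamedTuple


class Entry(NamedTuple):
    """One annotated word form: lemma, inflected form, and feature bundle."""
    lemma: str
    form: str
    features: FrozenSet[str]


def parse_line(line: str) -> Entry:
    # Assumed layout: lemma <TAB> inflected form <TAB> semicolon-joined tags.
    lemma, form, tags = line.rstrip("\n").split("\t")
    # Store the bundle as a set, since the order of tags carries no meaning here.
    return Entry(lemma, form, frozenset(t for t in tags.split(";") if t))


# Hypothetical sample entry, not taken from the released data.
sample = "run\trunning\tV;V.PTCP;PRS"
entry = parse_line(sample)
print(entry.lemma, entry.form, sorted(entry.features))
```

Keeping the features in a set rather than a string makes it straightforward to test for a single tag (for example, `"V" in entry.features`) regardless of the order in which tags were written.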