Summary of the paper

Title: Enriching ODIN
Authors: Fei Xia, William Lewis, Michael Wayne Goodman, Joshua Crowgey and Emily M. Bender
Abstract: In this paper, we describe the expansion of the ODIN resource, a database containing many thousands of instances of Interlinear Glossed Text (IGT) for over a thousand languages harvested from scholarly linguistic papers posted to the Web. A database containing a large number of instances of IGT, which are effectively richly annotated and heuristically aligned bitexts, provides a unique resource for bootstrapping NLP tools for resource-poor languages. To make the data in ODIN more readily consumable by tool developers and NLP researchers, we propose a new XML format for IGT, called Xigt. We call the updated release ODIN-II.
Topics: Endangered Languages, Tools, Systems, Applications
Full paper: Enriching ODIN
