Rethinking Reusable Resources
David M. de Matos, Ricardo Ribeiro, Nuno J. Mamede
L2F - Spoken Language Systems Laboratory - INESC ID Lisboa
We address the common and recurring problem of data reuse, focusing on the following topics: (i) the current state of affairs (in particular, problems with data); (ii) requirements for change; (iii) the proposed solution (its problems and advantages, as well as related work in this area), including the canonical-, I/O-, and data transformation models; (iv) maintenance issues; (v) implementation and deployment aspects; (vi) conclusions and future directions, including results from work done so far and aspects that merit future work.
Data reuse, language resources, linguistic data repositories, canonical models