Summary of the paper

Title Estimating the Resource Adaption Cost from a Resource Rich Language to a Similar Resource Poor Language
Authors Anil Kumar Singh, Kiran Pala and Harshit Surana
Abstract Developing resources which can be used for Natural Language Processing is an extremely difficult task for any language, but is even more so for less privileged (or less computerized) languages. One way to overcome this difficulty is to adapt the resources of a linguistically close resource rich language. In this paper we discuss how the cost of such adaption can be estimated using subjective and objective measures of linguistic similarity for allocating financial resources, time, manpower etc. Since this is the first work of its kind, the method described in this paper should be seen as only a preliminary method, indicative of how better methods can be developed. Corpora of several less computerized languages had to be collected for the work described in the paper, which was difficult because for many of these varieties there is not much electronic data available. Even if it is, it is in non-standard encodings, which means that we had to build encoding converters for these varieties. The varieties we have focused on are some of the varieties spoken in the South Asian region.
Language Multiple languages
Topics LR national/international projects, organizational/policy issues, Multilinguality, Typological databases
Full paper Estimating the Resource Adaption Cost from a Resource Rich Language to a Similar Resource Poor Language
Slides -
Bibtex @InProceedings{SINGH08.894,
  author = {Anil Kumar Singh, Kiran Pala and Harshit Surana},
  title = {Estimating the Resource Adaption Cost from a Resource Rich Language to a Similar Resource Poor Language},
  booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
  year = {2008},
  month = {may},
  date = {28-30},
  address = {Marrakech, Morocco},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-4-0},
  note = {http://www.lrec-conf.org/proceedings/lrec2008/},
  language = {english}
  }

Powered by ELDA © 2008 ELDA/ELRA