Summary of the paper

Title A General Method for Creating a Bilingual Transliteration Dictionary
Authors Amit Kirschenbaum and Shuly Wintner
Abstract Transliteration is the rendering in one language of terms from another language (and, possibly, another writing system), approximating spelling and/or phonetic equivalents between the two languages. A transliteration dictionary is a crucial resource for a variety of natural language applications, most notably machine translation. We describe a general method for creating bilingual transliteration dictionaries from Wikipedia article titles. The method can be applied to any language pair with Wikipedia presence, independently of the writing systems involved, and requires only a single simple resource that can be provided by any literate bilingual speaker. It was successfully applied to extract a Hebrew-English transliteration dictionary which, when incorporated in a machine translation system, indeed improved its performance.
Topics Lexicon, lexical database, Machine Translation, SpeechToSpeech Translation, Multilinguality
Full paper A General Method for Creating a Bilingual Transliteration Dictionary
Slides -
Bibtex @InProceedings{KIRSCHENBAUM10.16,
  author = {Amit Kirschenbaum and Shuly Wintner},
  title = {A General Method for Creating a Bilingual Transliteration Dictionary},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
Powered by ELDA © 2010 ELDA/ELRA