Summary of the paper

Title Using Linear Interpolation and Weighted Reordering Hypotheses in the Moses System
Authors Marta R. Costa-jussà and José A. R. Fonollosa
Abstract This paper proposes to introduce a novel reordering model in the open-source Moses toolkit. The main idea is to provide weighted reordering hypotheses to the SMT decoder. These hypotheses are built using a first-step Ngram-based SMT translation from a source language into a third representation that is called reordered source language. Each hypothesis has its own weight provided by the Ngram-based decoder. This proposed reordering technique offers a better and more efficient translation when compared to both the distance-based and the lexicalized reordering. In addition to this reordering approach, this paper describes a domain adaptation technique which is based on a linear combination of an specific in-domain and an extra out-domain translation models. Results for both approaches are reported in the Arabic-to-English 2008 IWSLT task. When implementing the weighted reordering hypotheses and the domain adaptation technique in the final translation system, translation results reach improvements up to 2.5 BLEU compared to a standard state-of-the-art Moses baseline system.
Topics Machine Translation, SpeechToSpeech Translation, Statistical and machine learning methods, Language modelling
Full paper Using Linear Interpolation and Weighted Reordering Hypotheses in the Moses System
Slides -
Bibtex @InProceedings{RCOSTAJUSS10.23,
  author = {Marta R. Costa-jussà and José A. R. Fonollosa},
  title = {Using Linear Interpolation and Weighted Reordering Hypotheses in the Moses System},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA