Summary of the paper

Title Using a Grammar Checker for Evaluation and Postprocessing of Statistical Machine Translation
Authors Sara Stymne and Lars Ahrenberg
Abstract One problem in statistical machine translation (SMT) is that the output often is ungrammatical. To address this issue, we have investigated the use of a grammar checker for two purposes in connection with SMT: as an evaluation tool and as a postprocessing tool. To assess the feasibility of the grammar checker on SMT output, we performed an error analysis, which showed that the precision of error identification in general was higher on SMT output than in previous studies on human texts. Using the grammar checker as an evaluation tool gives a complementary picture to standard metrics such as Bleu, which do not account well for grammaticality. We use the grammar checker as a postprocessing tool by automatically applying the error correction suggestions it gives. There are only small overall improvements of the postprocessing on automatic metrics, but the sentences that are affected by the changes are improved, as shown both by automatic metrics and by a human error analysis. These results indicate that grammar checker techniques are a useful complement to SMT.
Topics Machine Translation, SpeechToSpeech Translation, Evaluation methodologies, Grammar and Syntax
Full paper Using a Grammar Checker for Evaluation and Postprocessing of Statistical Machine Translation
Slides Using a Grammar Checker for Evaluation and Postprocessing of Statistical Machine Translation
Bibtex @InProceedings{STYMNE10.426,
  author = {Sara Stymne and Lars Ahrenberg},
  title = {Using a Grammar Checker for Evaluation and Postprocessing of Statistical Machine Translation},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA