Summary of the paper

Title Terra: a Collection of Translation Error-Annotated Corpora
Authors Mark Fishel, Ondřej Bojar and Maja Popović
Abstract Recently the first methods of automatic diagnostics of machine translation have emerged; since this area of research is relatively young, the efforts are not coordinated. We present a collection of translation error-annotated corpora, consisting of automatically produced translations and their detailed manual translation error analysis. Using the collected corpora we evaluate the available state-of-the-art methods of MT diagnostics and assess, how well the methods perform, how they compare to each other and whether they can be useful in practice.
Topics Evaluation methodologies, Corpus (creation, annotation, etc.), Machine Translation, SpeechToSpeech Translation
Full paper Terra: a Collection of Translation Error-Annotated Corpora
Bibtex @InProceedings{FISHEL12.481,
  author = {Mark Fishel and Ondřej Bojar and Maja Popović},
  title = {Terra: a Collection of Translation Error-Annotated Corpora},
  booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)},
  year = {2012},
  month = {may},
  date = {23-25},
  address = {Istanbul, Turkey},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-7-7},
  language = {english}
Powered by ELDA © 2012 ELDA/ELRA