Summary of the paper

Title The WAW Corpus: The First Corpus of Interpreted Speeches and their Translations for English and Arabic
Authors Ahmed Abdelali, Irina Temnikova, Samy Hedaya and Stephan Vogel
Abstract This article presents the WAW Corpus, an interpreting corpus for English/Arabic, which can be used for teaching interpreters, studying the characteristics of interpreters’ work, and training machine translation systems. The corpus contains recordings of lectures and speeches from international conferences, their interpretations, the transcripts of the original speeches and of their interpretations, and human translations of both kinds of transcripts into the opposite language of the language pair. The article presents the corpus curation, statistics, and assessment, as well as a case study of the corpus use.
Topics Corpus (Creation, Annotation, Etc.), Other
Bibtex @InProceedings{ABDELALI18.507,
  author = {Ahmed Abdelali and Irina Temnikova and Samy Hedaya and Stephan Vogel},
  title = "{The WAW Corpus: The First Corpus of Interpreted Speeches and their Translations for English and Arabic}",
  booktitle = {Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)},
  year = {2018},
  month = {May 7-12, 2018},
  address = {Miyazaki, Japan},
  editor = {Nicoletta Calzolari (Conference chair) and Khalid Choukri and Christopher Cieri and Thierry Declerck and Sara Goggi and Koiti Hasida and Hitoshi Isahara and Bente Maegaard and Joseph Mariani and Hélène Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis and Takenobu Tokunaga},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {979-10-95546-00-9},
  language = {english}
  }