Summary of the paper

Title A Bilingual Discourse Corpus and Its Applications
Authors Yang Liu, Jiajun Zhang, Chengqing Zong, Yating Yang and Xi Zhou
Abstract Existing discourse research focuses only on monolingual settings, and the inconsistency between languages limits the power of discourse theory in multilingual applications such as machine translation. To address this issue, we design and build a bilingual discourse corpus in which we are currently defining and annotating bilingual elementary discourse units (BEDUs). The BEDUs are then organized into hierarchical structures. Using this discourse style, we have annotated nearly 20K LDC sentences. Finally, we design a bilingual discourse-based method for machine translation evaluation and show the effectiveness of our bilingual discourse annotations.
Topics Discourse Annotation, Representation and Processing, Machine Translation, SpeechToSpeech Translation, Corpus (Creation, Annotation, etc.)
Full paper A Bilingual Discourse Corpus and Its Applications
