Annotation of anaphoric expressions in an aligned bilingual corpus
Agnès Tutin (1), Meriam Haddara (1), Ruslan Mitkov (2), Constantin Orasan (2)
(1) LIDILEM, Université Grenoble 3-Stendhal BP 25 F-38040 Grenoble Cédex 9; (2) Computational Linguistic Group, School of Humanities, Languages and Social Sciences, Stafford Street, Wolverhampton WV1 1SB, UK
This paper discusses a French-English corpus annotated and aligned at anaphoric level. It also presents an annotation scheme based on the study of a detailed corpus featuring different types of correspondences and mismatches. The scheme which is adapted from EAGLES recommendations, supports the alignment at anaphoric level and caters for the different kinds of mismatches.
French-English corpus, parallel corpora, anaphora resolution, alignment