A powerful and versatile XML format for representing role-semantic annotation
Katrin Erk, Sebastian Padó
Computational Linguistics, Saarland University, Saarbrücken, Germany
We present two XML formats for the description and encoding of semantic role information in corpora. The TIGER/SALSA XML format provides a modular representation for semantic roles and syntactic structure. The Text-SALSA XML format is a lightweight version of TIGER/SALSA XML designed for manual annotation with an XML editor rather than a special tool. Both formats can deal with underspecification, roles crossing the sentence boundary, compound splitting, and whole-sentence tags for meta-level comments.
semantic roles, XML, representation, multi-level annotation, corpora