Summary of the paper

Title Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing
Authors Željko Agić, Daša Berović, Danijela Merkler and Marko Tadić
Abstract We present a new version of the Croatian Dependency Treebank. It constitutes a slight departure from the previously closely observed Prague Dependency Treebank syntactic layer annotation guidelines as we introduce a new subset of syntactic tags on top of the existing tagset. These new tags are used in explicit annotation of subordinate clauses via subordinate conjunctions. Introducing the new annotation to Croatian Dependency Treebank, we also modify head attachment rules addressing subordinate conjunctions and subordinate clause predicates. In an experiment with data-driven dependency parsing, we show that implementing these new annotation guidelines leeds to a statistically significant improvement in parsing accuracy. We also observe a substantial improvement in inter-annotator agreement, facilitating more consistent annotation in further treebank development.
Topics Parsing, Grammar and Syntax
Full paper Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing
Bibtex @InProceedings{AGI14.694,
  author = {Željko Agić and Daša Berović and Danijela Merkler and Marko Tadić},
  title = {Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing},
  booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)},
  year = {2014},
  month = {may},
  date = {26-31},
  address = {Reykjavik, Iceland},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-8-4},
  language = {english}
 }
Powered by ELDA © 2014 ELDA/ELRA