Summary of the paper

Title Active Learning for Building a Corpus of Questions for Parsing
Authors Jordi Atserias, Giuseppe Attardi, Maria Simi and Hugo Zaragoza
Abstract This paper describes how we built a dependency Treebank for questions. The questions for the Treebank were drawn from questions from the TREC 10 QA task and from Yahoo! Answers. Among the uses for the corpus is to train a dependency parser achieving good accuracy on parsing questions without hurting its overall accuracy. We also explore active learning techniques to determine the suitable size for a corpus of questions in order to achieve adequate accuracy while minimizing the annotation efforts.
Topics Corpus (creation, annotation, etc.), Parsing
Full paper Active Learning for Building a Corpus of Questions for Parsing
Slides Active Learning for Building a Corpus of Questions for Parsing
Bibtex @InProceedings{ATSERIAS10.656,
  author = {Jordi Atserias and Giuseppe Attardi and Maria Simi and Hugo Zaragoza},
  title = {Active Learning for Building a Corpus of Questions for Parsing},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA