Summary of the paper

Title WikiWoods: Syntacto-Semantic Annotation for English Wikipedia
Authors Dan Flickinger, Stephan Oepen and Gisle Ytrestøl
Abstract WikiWoods is an ongoing initiative to provide rich syntacto-semantic annotations for English Wikipedia. We sketch an automated processing pipeline to extract relevant textual content from Wikipedia sources, segment documents into sentence-like units, parse and disambiguate using a broad-coverage precision grammar, and support the export of syntactic and semantic information in various formats. The full parsed corpus is accompanied by a subset of Wikipedia articles for which gold-standard annotations in the same format were produced manually. This subset was selected to represent a coherent domain, Wikipedia entries on the broad topic of Natural Language Processing.
Topics Corpus (creation, annotation, etc.), Grammar and Syntax, Semantics
Full paper WikiWoods: Syntacto-Semantic Annotation for English Wikipedia
Slides WikiWoods: Syntacto-Semantic Annotation for English Wikipedia
Bibtex @InProceedings{FLICKINGER10.432,
  author = {Dan Flickinger and Stephan Oepen and Gisle Ytrestøl},
  title = {WikiWoods: Syntacto-Semantic Annotation for English Wikipedia},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
Powered by ELDA © 2010 ELDA/ELRA