Summary of the paper

Title Acquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation
Authors Daisuke Kawahara and Sadao Kurohashi
Abstract We present a method for acquiring reliable predicate-argument structures from raw corpora for automatic compilation of case frames. Such lexicon compilation requires highly reliable predicate-argument structures to practically contribute to Natural Language Processing (NLP) applications, such as paraphrasing, text entailment, and machine translation. However, to precisely identify predicate-argument structures, case frames are required. This issue is similar to the question ""what came first: the chicken or the egg?"" In this paper, we propose the first step in the extraction of reliable predicate-argument structures without using case frames. We first apply chunking to raw corpora and then extract reliable chunks to ensure that high-quality predicate-argument structures are obtained from the chunks. We conducted experiments to confirm the effectiveness of our approach. We successfully extracted reliable chunks of an accuracy of 98% and high-quality predicate-argument structures of an accuracy of 97%. Our experiments confirmed that we succeeded in acquiring highly reliable predicate-argument structures that can be used to compile case frames.
Topics Acquisition, Lexicon, lexical database, Knowledge Discovery/Representation
Full paper Acquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation
Slides Acquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation
Bibtex @InProceedings{KAWAHARA10.733,
  author = {Daisuke Kawahara and Sadao Kurohashi},
  title = {Acquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA