Summary of the paper

Title A Publicly Available Indonesian Corpora for Automatic Abstractive and Extractive Chat Summarization
Authors Fajri Koto
Abstract In this paper we report our effort to construct the first ever Indonesian corpora for chat summarization. Specifically, we utilized documents of multi-participant chat from a well known online instant messaging application, WhatsApp. We construct the gold standard by asking three native speakers to manually summarize 300 chat sections (152 of them contain images). As result, three reference summaries in extractive and either abstractive form are produced for each chat sections. The corpus is still in its early stage of investigation, yielding exciting possibilities of future works.
Topics Corpus (Creation, Annotation, etc.), Summarisation, Text Mining
Full paper A Publicly Available Indonesian Corpora for Automatic Abstractive and Extractive Chat Summarization
Bibtex @InProceedings{KOTO16.366,
  author = {Fajri Koto},
  title = {A Publicly Available Indonesian Corpora for Automatic Abstractive and Extractive Chat Summarization},
  booktitle = {Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)},
  year = {2016},
  month = {may},
  date = {23-28},
  location = {Portorož, Slovenia},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Sara Goggi and Marko Grobelnik and Bente Maegaard and Joseph Mariani and Helene Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  address = {Paris, France},
  isbn = {978-2-9517408-9-1},
  language = {english}
 }
Powered by ELDA © 2016 ELDA/ELRA