Summary of the paper

Title A Multi-domain Corpus of Swedish Word Sense Annotation
Authors Richard Johansson, Yvonne Adesam, Gerlof Bouma and Karin Hedberg
Abstract We describe the word sense annotation layer in Eukalyptus, a freely available five-domain corpus of contemporary Swedish with several annotation layers. The annotation uses the SALDO lexicon to define the sense inventory, and allows word sense annotation of compound segments and multiword units. We give an overview of the new annotation tool developed for this project, and finally present an analysis of the inter-annotator agreement between two annotators.
Topics Word Sense Disambiguation, Lexicon, Lexical Database, Corpus (Creation, Annotation, etc.)
Full paper A Multi-domain Corpus of Swedish Word Sense Annotation
Bibtex @InProceedings{JOHANSSON16.899,
  author = {Richard Johansson and Yvonne Adesam and Gerlof Bouma and Karin Hedberg},
  title = {A Multi-domain Corpus of Swedish Word Sense Annotation},
  booktitle = {Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)},
  year = {2016},
  month = {may},
  date = {23-28},
  location = {Portorož, Slovenia},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Sara Goggi and Marko Grobelnik and Bente Maegaard and Joseph Mariani and Helene Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  address = {Paris, France},
  isbn = {978-2-9517408-9-1},
  language = {english}
}