Summary of the paper

Title AraNLP: a Java-based Library for the Processing of Arabic Text.
Authors Maha Althobaiti, Udo Kruschwitz and Massimo Poesio
Abstract We present a free, Java-based library named "AraNLP" that covers various Arabic text preprocessing tools. Although a good number of tools for processing Arabic text already exist, integration and compatibility problems continually occur. AraNLP is an attempt to gather most of the vital Arabic text preprocessing tools into one library that can be accessed easily by integrating or accurately adapting existing tools and by developing new ones when required. The library includes a sentence detector, tokenizer, light stemmer, root stemmer, part-of-speech tagger (POS-tagger), word segmenter, normalizer, and a punctuation and diacritic remover.
Topics Text Mining, Information Extraction, Information Retrieval
Full paper AraNLP: a Java-based Library for the Processing of Arabic Text.
Bibtex @InProceedings{ALTHOBAITI14.621,
  author = {Maha Althobaiti and Udo Kruschwitz and Massimo Poesio},
  title = {AraNLP: a Java-based Library for the Processing of Arabic Text.},
  booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)},
  year = {2014},
  month = {may},
  date = {26-31},
  address = {Reykjavik, Iceland},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-8-4},
  language = {english}
 }