Summary of the paper

Title Tools for Arabic Natural Language Processing: a Case Study in Qalqalah Prosody
Authors Claire Brierley, Majdi Sawalha and Eric Atwell
Abstract "In this paper, we focus on the prosodic effect of qalqalah or ""vibration"" applied to a subset of Arabic consonants under certain constraints during correct Qur'anic recitation or taǧwīd, using our Boundary-Annotated Qur’an dataset of 77430 words (Brierley et al 2012; Sawalha et al 2014). These qalqalah events are rule-governed and are signified orthographically in the Arabic script. Hence they can be given abstract definition in the form of regular expressions and thus located and collected automatically. High frequency qalqalah content words are also found to be statistically significant discriminators or keywords when comparing Meccan and Medinan chapters in the Qur'an using a state-of-the-art Visual Analytics toolkit: Semantic Pathways. Thus we hypothesise that qalqalah prosody is one way of highlighting salient items in the text. Finally, we implement Arabic transcription technology (Brierley et al under review; Sawalha et al forthcoming) to create a qalqalah pronunciation guide where each word is transcribed phonetically in IPA and mapped to its chapter-verse ID. This is funded research under the EPSRC ""Working Together"" theme."
Topics Text Mining, Corpus (Creation, Annotation, etc.)
Full paper Tools for Arabic Natural Language Processing: a Case Study in Qalqalah Prosody
Bibtex @InProceedings{BRIERLEY14.119,
  author = {Claire Brierley and Majdi Sawalha and Eric Atwell},
  title = {Tools for Arabic Natural Language Processing: a Case Study in Qalqalah Prosody},
  booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)},
  year = {2014},
  month = {may},
  date = {26-31},
  address = {Reykjavik, Iceland},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-8-4},
  language = {english}
 }
Powered by ELDA © 2014 ELDA/ELRA