Generating an Arabic full-form lexicon for bidirectional morphology lookup
Abdelhadi Soudi, Andreas Eisele
(1) CLC and CS Department, Ecole Nationale de L'Industrie Minérale, Rabat, Morocco, email@example.com; (2) Computational Linguistics Department, Saarland University, P.O. Box 151150, D-66041 Saarbrücken, Germany, firstname.lastname@example.org
We describe the generation of an Arabic full-form lexicon and its conversion into a two-level finite-state transducer (FST) for morphological analysis and generation. Morphological lookup is implemented by representing the relevant data as an FST, for which generic implementations exist that facilitate integration into larger natural language processing systems. We show the feasibility of our encoding and of analysing both vowelled and unvowelled Arabic words.
Arabic, morphology, lexicon, analysis, generation
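
As a rough illustration of what bidirectional lookup over a full-form lexicon involves, the following Python sketch maps surface forms to analyses and back. It is not the implementation described here: plain dictionaries stand in for the transducer, and the class name `FullFormLexicon`, the two toy entries, and the tag strings are all hypothetical. Unvowelled input is handled by stripping diacritics, which is only one possible strategy and is assumed for the example.

```python
import re
from collections import defaultdict

# Arabic diacritics from fathatan (U+064B) to sukun (U+0652).
DIACRITICS = re.compile("[\u064B-\u0652]")

def strip_vowels(word):
    """Remove short-vowel diacritics, giving the unvowelled spelling."""
    return DIACRITICS.sub("", word)

KATABA = "\u0643\u064E\u062A\u064E\u0628\u064E"  # vowelled "kataba"
KUTUB = "\u0643\u064F\u062A\u064F\u0628"         # vowelled "kutub"

# Hypothetical toy entries: (vowelled surface form, analysis string).
TOY_ENTRIES = [
    (KATABA, "katab+Verb+Perf+3MascSg"),
    (KUTUB, "kitAb+Noun+Pl"),
]

class FullFormLexicon:
    """Dictionary-based stand-in for a surface <-> analysis transducer."""

    def __init__(self, entries):
        self.analyses = defaultdict(set)    # vowelled form -> analyses
        self.unvowelled = defaultdict(set)  # stripped form -> analyses
        self.forms = defaultdict(set)       # analysis -> vowelled forms
        for surface, analysis in entries:
            self.analyses[surface].add(analysis)
            self.unvowelled[strip_vowels(surface)].add(analysis)
            self.forms[analysis].add(surface)

    def analyze(self, word):
        """Surface form to analyses; falls back to unvowelled matching."""
        if word in self.analyses:
            return sorted(self.analyses[word])
        return sorted(self.unvowelled.get(strip_vowels(word), ()))

    def generate(self, analysis):
        """Analysis string to vowelled surface forms."""
        return sorted(self.forms.get(analysis, ()))
```

In this toy setup, analysing the vowelled form `KATABA` returns only the verb reading, while the unvowelled spelling (the same three consonants without diacritics) returns both the verb and the noun reading, which shows the ambiguity that unvowelled text introduces. A transducer encodes the same relation compactly and can be traversed in either direction, which a pair of dictionaries only imitates.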