| Title | Unsupervised acquisition of concatenative morphology | 
  
  | Authors | Lionel Nicolas, Jacques Farré and Cécile Darme | 
  
  | Abstract | Among the linguistic resources that formalize a language, morphological rules are among those that can be built in a reasonable amount of time. Nevertheless, since constructing such a resource can require linguistic expertise, morphological rules are still lacking for many languages. The automated acquisition of morphology is thus an open topic of interest within the NLP field. We present an approach that automatically computes, from raw corpora, a data-representative description of the concatenative mechanisms of a morphology. Our approach takes advantage of phenomena that are observable in all languages using morphological inflection and derivation but are easier to exploit when dealing with concatenative mechanisms. Since it was developed with the objective of being usable on as many languages as possible, applying this approach to a varied set of languages requires very little expert work. The results obtained from our first participation in the 2010 edition of MorphoChallenge have confirmed both the practical interest and the potential of the method. | 
  
  | Topics | Morphology, Acquisition, Language modelling | 
  
  | Full paper  | Unsupervised acquisition of concatenative morphology | 
  
  | Bibtex | @InProceedings{NICOLAS12.450, author =  {Lionel Nicolas and Jacques Farré and Cécile Darme},
 title =  {Unsupervised acquisition of concatenative morphology},
 booktitle =  {Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)},
 year =  {2012},
 month =  {may},
 date =  {23-25},
 address =  {Istanbul, Turkey},
 editor =  {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
 publisher =  {European Language Resources Association (ELRA)},
 isbn =  {978-2-9517408-7-7},
 language =  {english}
 }
 |