| Title | Using an ASR database to design a pronunciation evaluation system in Basque | 
  
  | Authors | Igor Odriozola, Eva Navas, Inma Hernáez, Iñaki Sainz, Ibon Saratxaga, Jon Sánchez and Daniel Erro | 
  
  | Abstract | This paper presents a method to build CAPT systems for under resourced languages, as Basque, using a general purpose ASR speech database. More precisely, the proposed method consists in automatically determine the threshold of GOP (Goodness Of Pronunciation) scores, which have been used as pronunciation scores in phone-level. Two score distributions have been obtained for each phoneme corresponding to its correct and incorrect pronunciations. The distribution of the scores for erroneous pronunciation has been calculated inserting controlled errors in the dictionary, so that each changed phoneme has been randomly replaced by a phoneme from the same group. These groups have been obtained by means of a phonetic clustering performed using regression trees. After obtaining both distributions, the EER (Equal Error Rate) of each distribution pair has been calculated and used as a decision threshold for each phoneme. The results show that this method is useful when there is no database specifically designed for CAPT systems, although it is not as accurate as those specifically designed for this purpose. | 
  
  | Topics | Tools, systems, applications, Speech Recognition/Understanding, Speech resource/database | 
  
  | Full paper  | Using an ASR database to design a pronunciation evaluation system in Basque | 
  
  | Bibtex | @InProceedings{ODRIOZOLA12.822, author =  {Igor Odriozola and Eva Navas and Inma Hernáez and Iñaki Sainz and Ibon Saratxaga and Jon Sánchez and Daniel Erro},
 title =  {Using an ASR database to design a pronunciation evaluation system in Basque},
 booktitle =  {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)},
 year =  {2012},
 month =  {may},
 date =  {23-25},
 address =  {Istanbul, Turkey},
 editor =  {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
 publisher =  {European Language Resources Association (ELRA)},
 isbn =  {978-2-9517408-7-7},
 language =  {english}
 }
 |