LREC 2000 2nd International Conference on Language Resources & Evaluation

Title SALA: SpeechDat across Latin America. Results of the First Phase
Authors Moreno Asunción (Universitat Politècnica de Catalunya, Jordi Girona 1-3, 08034 Barcelona, Spain)
Comeyne Robrecht (Lernout & Hauspie, Ieper, Belgium)
Haslam Keith (Vocalis, Cambridge, UK)
van den Heuvel Henk (SPEX, Nijmegen, Netherlands)
Höge Harald (Siemens AG, München, Germany)
Horbach Sabine (Philips, Aachen, Germany)
Micca Giorgio (CSELT, Via G. Reiss Romoli 274, 10148 Torino, Italy)
Keywords Latin America, Oral Databases, Spanish and Portuguese, Speech Recognition, Telephone Speech
Session Session SP3 - Spoken Language Resources' Projects
Full Paper, 10.pdf
Abstract The objective of the SALA (SpeechDat across Latin America) project is to record large SpeechDat-like databases for training telephone speech recognisers for any country in Latin America. The SALA consortium is composed of several European companies (CSELT, Italy; Lernout & Hauspie, Belgium; Philips, Germany; Siemens AG, Germany; Vocalis, U.K.) and universities (UPC, Spain; SPEX, The Netherlands). This paper gives an overview of the project, introduces the definition of the databases, shows the dialectal distribution in the countries where recordings take place, and reports on validation issues, the current status of the project, and practical experience in recruiting speakers and annotating such large databases in Latin America.