Summary of the paper

Title HESITA(te) in Portuguese
Authors Sara Candeias, Dirce Celorico, Jorge Proença, Arlindo Veiga, Carla Lopes and Fernando Perdigão
Abstract Hesitations, also called disfluencies, are characteristic of spontaneous speech. They play a primary role in its structure and reflect aspects of language production and the management of inter-communication. In this paper we present HESITA, a database of hesitations in European Portuguese speech, as a relevant basis for studying a variety of speech phenomena. Patterns of hesitations, the distribution of hesitations according to speaking style, and phonetic properties of the fillers are some of the characteristics we derived from the HESITA database. This database is also an important resource for improving the naturalness of synthetic speech and for robust acoustic modelling in automatic speech recognition. The HESITA database is the output of a speech-processing project for European Portuguese carried out by an interdisciplinary group that closely combines engineering tools and experience with a linguistic approach.
Topics Discourse Annotation, Representation and Processing, Corpus (Creation, Annotation, etc.)
Full paper HESITA(te) in Portuguese
Bibtex @InProceedings{CANDEIAS14.587,
  author = {Sara Candeias and Dirce Celorico and Jorge Proença and Arlindo Veiga and Carla Lopes and Fernando Perdigão},
  title = {HESITA(te) in Portuguese},
  booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)},
  year = {2014},
  month = {may},
  date = {26-31},
  address = {Reykjavik, Iceland},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-8-4},
  language = {english}
}