Summary of the paper

Title Bootstrapping Named Entity Extraction for the Creation of Mobile Services
Authors Joseph Polifroni, Imre Kiss and Mark Adler
Abstract As users become more accustomed to using their mobile devices to organize and schedule their lives, there is more of a demand for applications that can make that process easier. Automatic speech recognition technology has already been developed to enable essentially unlimited vocabulary in a mobile setting. Understanding the words that are spoken is the next challenge. In this paper, we describe efforts to develop a dataset and classifier to recognize named entities in speech. Using sets of both real and simulated data, in conjunction with a very large set of real named entities, we created a challenging corpus of training and test data. We use these data to develop a classifier to identify names and locations on a word-by-word basis. In this paper, we describe the process of creating the data and determining a set of features to use for named entity recognition. We report on our classification performance on these data, as well as point to future work in improving all aspects of the system.
Topics Named Entity recognition, Statistical and machine learning methods, Speech Recognition/Understanding
Full paper Bootstrapping Named Entity Extraction for the Creation of Mobile Services
Slides -
Bibtex @InProceedings{POLIFRONI10.280,
  author = {Joseph Polifroni and Imre Kiss and Mark Adler},
  title = {Bootstrapping Named Entity Extraction for the Creation of Mobile Services},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
Powered by ELDA © 2010 ELDA/ELRA