Title

The Present Status of Speech Database in Japan: Development, Management, and Application to Speech Research

Authors

Hisao Kuwabara (Department of Electronics and Information Science, Teikyo University of Science & Technology, Uenohara, Kitatsuru-gun, Yamanashi 409-0193, Japan)

Shuichi Itahashi (Institute of Information Sciences and Electronics, Tsukuba University, Tennodai, Tsukuba, Ibaraki 305-8573, Japan)

Mikio Yamamoto (Institute of Information Sciences and Electronics, Tsukuba University, Tennodai, Tsukuba, Ibaraki 305-8573, Japan)

Toshiyuki Takezawa (ATR Spoken Language Translation Research Laboratories, Hikaridai, Soraku-gun, Kyoto 619-0288, Japan)

Satoshi Nakamura (ATR Spoken Language Translation Research Laboratories, Hikaridai, Soraku-gun, Kyoto 619-0288, Japan)

Kazuya Takeda (Center for Integrated Acoustic Information Research, Nagoya University, Furomachi, Chigusa-ku, Nagoya 464-8603, Japan)

Session

SO1: Large Projects-Initiatives For Speech Corpora

Abstract

This paper describes the present status of Japanese speech databases. Speech database projects in Japan started in the early 1980s. The first was undertaken by a committee of the Japan Electronic Industry Development Association (JEIDA), which aimed to create a speech database for the common evaluation of the performance of the speech input/output machines and systems existing at the time. Several database projects have been undertaken since then, including one initiated by the Advanced Telecommunication Research Institute (ATR), and an enormous amount of spontaneous speech data is now available. A survey was recently conducted on the usage of the existing speech databases among industry and university institutions in Japan where speech research is actively under way. It revealed that ATR’s continuous speech database is the most frequently used, followed by the equivalent database of the Acoustical Society of Japan.

Keywords

Development, Management, Application, Speech research

Full Paper

14.pdf