Automatic extraction of differences between spoken and written languages, and automatic translation from the written to the spoken language
Masaki Murata (Communications Research Laboratory, 2-2-2 Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-0289, Japan)
Hitoshi Isahara (Communications Research Laboratory, 2-2-2 Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-0289, Japan)
SP2: Speech Varieties And Multilingual ASR
We extracted the differences between spoken language and written language from a spoken-language corpus and a written-language corpus by using the UNIX command ``diff'' and examined the differences to determine the construction of the grammars of the two corpora. We also transformed written-language sentences into spoken-language sentences by using rules based on the extracted differences.
Differences between spoken and written languages, Automatic translation between the written and spoken languages, Diff