Towards a Thesaurus of Predicates
Satoshi Shirai (ATR Spoken Language Translation Research Laboratories 2-2-2, Hikaridai, Seika-cho, Kyoto, 619-0288, Japan)
Kazuhide Yamamoto (ATR Spoken Language Translation Research Laboratories 2-2-2, Hikaridai, Seika-cho, Kyoto, 619-0288, Japan)
Francis Bond (Communication Science Laboratories, Nippon Telegraph and Telephone Corporation 2-4, Hikaridai, Seika-cho, Kyoto, 619-0237, Japan)
Hozumi Tanaka (Graduate School of Information Science and Engineering, Tokyo Institute of Technology 2-12-1, Oookayama, Meguro-ku, Tokyo, 152-8552, Japan)
WO20: Machine Translation
We propose a thesaurus of predicates that can help to resolve pre-editing and/or post-editing problems in machine translation environments. It differs from earlier approaches such as conventional dictionaries in that we are aiming to link a wide range of near-synonyms and paraphrases. We are compiling such similar examples through both introspection and the use of translation data, giving us a large collection of monolingual and bilingual equivalences. This thesaurus enables the following machine translation techniques. (a) Unification of synonymous expressions in the source language (source language paraphrasing). (b) Conversion of homonymous expressions to more easily translated ones (source language rewriting). (c) Development of expressions appearing in the target language into various expressions (target language paraphrasing).
Thesaurus of predicates