LREC 2000 2nd International Conference on Language Resources & Evaluation

Previous Paper   Next Paper

Title Resources for Multilingual Text Generation in Three Slavic Languages
Authors Bateman John A. (Linguistics and Literature, University of Bremen,
Teich Elke (Institute for Applied Linguistics, Translation and Interpretation, University of the Saarland,
Kruijff Geert-Jan (Institute of Formal and Applied Linguistics, Charles University,
Kruijff-Korbayová Ivanna (Institute of Formal and Applied Linguistics, Charles University,
Sharoff Serge (Russian Research Institute for, Artificial Inteligence,
Skoumalová Hana (Institute of Theoretical and Computational Linguistics, Charles University,
Keywords Generation, Multilingual LR, Reusing LR, Text
Session Session WP9 - Applications using Written Language Resources
Full Paper, 241.pdf
Abstract The paper discusses the methods followed to re-use a large-scale, broad-coverage English grammar for constructing similar scale grammars for Bulgarian, Czech and Russian for the fast prototyping of a multilingual generation system. We present (1) the theoretical and methodological basis for resource sharing across languages, (2) the use of a corpus-based contrastive register analysis, in particular, contrastive analysis of mood and agency. Because the study concerns reuse of the grammar of a language that is typologically quite different from the languages treated, the issues addressed in this paper appear relevant to a wider range of researchers in need of large-scale grammars for less-researched languages.