
A Flexible XML-based Regular Compiler for Creation and Conversion of Linguistic Resources 


Jakub Piskorski (DFKI – German Research Center for Artificial Intelligence, Stuhlsatzenhausweg 3, 66123 Saarbrücken, Germany)

Witold Drozdzynsky (DFKI – German Research Center for Artificial Intelligence, Stuhlsatzenhausweg 3, 66123 Saarbrücken, Germany)

Oliver Scherf (DFKI – German Research Center for Artificial Intelligence, Stuhlsatzenhausweg 3, 66123 Saarbrücken, Germany)

Feiyu Xu (DFKI – German Research Center for Artificial Intelligence, Stuhlsatzenhausweg 3, 66123 Saarbrücken, Germany)


WP3: Tools & Components


Finite-state devices are widely used to model linguistic phenomena compactly, and regular expressions are regarded as the adequate level of abstraction for thinking about finite-state languages. In this paper we present a flexible, XML-based and Unicode-compatible regular compiler for creating linguistic resources and integrating existing ones. Our tool provides a user-friendly graphical interface that enables transparent control of the compilation process and allows generated finite-state grammars to be tested with several diagnostic tools. Through a direct database connection, existing linguistic resources can be converted into user-definable finite-state representations.


Conversion, Linguistic resources

Full Paper
