VIQTORYA -- A Visual Query Tool for Syntactically Annotated Corpora


Ilona Steiner (Seminar für Sprachwissenschaft, Universität Tübingen, Wilhelmstr. 113, D-72074 Tübingen, Germany)

Laura Kallmeyer (TALaNa-Lattice, Université Paris 7, 2, place Jussieu, F-75251 Paris cedex 05, France)


WP4: Corpus Annotation


This paper presents a query tool for syntactically annotated corpora. The tool was developed to search the Tübingen Treebanks, which are annotated at the University of Tübingen, but in principle it can also be adapted to other corpora. Its query language allows users to search for tokens, syntactic categories, and grammatical functions, and to state binary relations between nodes: dominance, immediate dominance, and linear precedence. The overall idea is to extract the relevant information from the corpus in an initialization phase and to store it compactly in a relational database. An incoming query is then translated into a corresponding SQL query, which is evaluated on the database. A graphical user interface allows users to specify queries in a user-friendly way.
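
The translation step can be illustrated with a minimal sketch. It is not the tool's actual implementation: the single-table schema, the column names, and the helper functions `find_dominating` and `find_preceding` are hypothetical and chosen only for illustration, and SQLite stands in for whatever relational database the tool uses. The sketch stores a toy tree and evaluates one immediate-dominance query and one linear-precedence query as SQL.

```python
import sqlite3

# Hypothetical schema (an assumption, not the tool's real layout):
# one row per tree node; `parent` encodes immediate dominance.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE node (
        id       INTEGER PRIMARY KEY,
        parent   INTEGER,   -- id of the immediately dominating node
        category TEXT,      -- syntactic category, NULL for tokens
        token    TEXT,      -- word form, NULL for inner nodes
        position INTEGER    -- linear order of tokens, NULL for inner nodes
    )
    """
)

# Toy tree for "the house": one NP node dominating two tokens.
conn.executemany(
    "INSERT INTO node VALUES (?, ?, ?, ?, ?)",
    [
        (1, None, "NP", None, None),
        (2, 1, None, "the", 0),
        (3, 1, None, "house", 1),
    ],
)


def find_dominating(category, token):
    """Ids of nodes with the given category that immediately dominate the token."""
    sql = (
        "SELECT p.id FROM node AS p "
        "JOIN node AS c ON c.parent = p.id "
        "WHERE p.category = ? AND c.token = ? "
        "ORDER BY p.id"
    )
    return [row[0] for row in conn.execute(sql, (category, token))]


def find_preceding(first, second):
    """Pairs of token ids where `first` linearly precedes `second`."""
    sql = (
        "SELECT a.id, b.id FROM node AS a "
        "JOIN node AS b ON a.position < b.position "
        "WHERE a.token = ? AND b.token = ? "
        "ORDER BY a.id, b.id"
    )
    return [tuple(row) for row in conn.execute(sql, (first, second))]


np_nodes = find_dominating("NP", "house")
ordered_pairs = find_preceding("the", "house")
```

In this sketch each binary relation in a query becomes a self-join on the node table, so a query with several relations would translate into a chain of joins. General (non-immediate) dominance is not covered here; it would need either a recursive query or a precomputed relation.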


Query tool, Query language, Treebank, Linguistic database, Syntactic annotation

