Converting a Corpus into a Hypertext: An Approach Using XML Topic Maps and XSLT
Eva Anna Lenz (Universit¨at Dortmund, Institut f¨ur deutsche Sprache und Literatur Emil-Figge-Str. 50, D-44227 Dortmund, Germany)
Angelika Storrer (Universit¨at Dortmund, Institut f¨ur deutsche Sprache und Literatur Emil-Figge-Str. 50, D-44227 Dortmund, Germany)
WP1: Corpora & Corpus Tools
In the context of the HyTex project, our goal is to convert a corpus into a hypertext, basing conversion strategies on annotations which explicitly mark up the text-grammatical structures and relations between text segments. Domain-specific knowledge is represented in the form of a knowledge net, using topic maps. We use XML as an interchange format. In this paper, we focus on a declarative rule language designed to express conversion strategies in terms of text-grammatical structures and hypertext results. The strategies can be formulated in a concise formal syntax which is independend of the markup, and which can be transformed automatically into executable program code.