Evaluation of parsed corpora: Experiments in user-transparent and user-visible evaluation
Diana Santos (SINTEF Tele og Data)
Caroline Gasperin (Faculdade de Informática, PPGCC, PUCRS)
In this paper, we describe and discuss the evaluation of parsed corpora, specifically those available on the Web for querying through the AC/DC project. The paper has two parts: the first suggests a set of evaluation parameters and measures that are much more illuminating than the commonly used simple precision measures, while the second evaluates the parsed corpus for a particular task, that of automatic thesaurus building. The two evaluations are thus complementary: in Gaizauskas's (1998) terminology, the first is a typical user-transparent evaluation, while the second is user-visible.
Parse evaluation, Parsed corpora, Portuguese, Semantic acquisition, Tagging