Title A Similarity Measure for Unsupervised Semantic Disambiguation
Author(s) Roberto Basili, Marco Cammisa, Fabio Massimo Zanzotto

Department of Computer Science, Systems and Production, University of Rome, Tor Vergata, Italy

Session O11-EW
Abstract In this paper we propose a similarity measure aimed to support an unsupervised approach to semantic tagging. This proposal represents a variant of the notion of "Conceptual Density" previously suggested as a tool for sense disambiguation. However, the major difference is the learning framework in which this measure applied to the Wordnet hierarchy enables a "natural" corpus-driven empirical estimation of lexical and contextual probabilities for probabilistic semantic tagging. Experimental results over an hand-annotated portion of the British National Corpus (about 5 M words) are also discussed. Although below the results obtained by a supervised method (Maximum Entropy trained over hand labelled data), the proposed unsupervised tagger confirms the effectiveness of the proposed metric as well as show a promising research direction.
Keyword(s) Similarity measure, Semantic Tagging, Lexical Acquisition, Wordnet
Language(s) English
Full Paper 732.pdf