Proper Names In A Semantic Database


Rita Marinelli (Istituto di Linguistica Computazionale del CNR - Pisa - Italy)

Adriana Roventini (Istituto di Linguistica Computazionale del CNR - Pisa - Italy)


WP2: Lexicons


Among the resources developed in SI-TAL (Integrated Systems for the Automatic Treatment of Language), ItalWordNet (IWN) were built as reference semantic database, enlarging the Italian WordNet developed in the framework of the European project EuroWordNet (EWN). The Italian lexical database was increased, by introducing and codifying, besides the new grammatical categories of the adjectives and adverbs, a subset of proper names. In the IWN context, the subset of proper names represents a quantitatively limited portion, about 3600 synsets, but it may become a qualitatively important extension. The ever growing amount of non-structured information, stored in natural language, requires the availability of computational instruments able to manage this kind of information where proper names show a remarkable incidence in any types of texts. The work here presented falls in this context, taking into account the proper names, and is focused on: i) encoding in the IWN database; ii) more typical uses in either proper or metaphorical and methonymic ways such as textual corpora evidence; iii) possibility of a well reasoned and structured enlarging of this data on the basis of the recent experience carried out in IWN.


Lexical resources, Semantic lexicons, Proper names

Full Paper