Summary of the paper

Title The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines
Authors Cvetana Krstev, Ranka Stanković, Duško Vitas and Ivan Obradović
Abstract In this paper we present how resources and tools developed within the Human Language Technology Group at the University of Belgrade can be used for tuning queries before submitting them to a web search engine. We argue that the selection of words for a query, which is of paramount importance for the quality of the results the query returns, can be substantially improved by using various lexical resources, such as morphological dictionaries and wordnets. These resources enable semantic and morphological expansion of the query, the latter being very important in highly inflected languages, such as Serbian. Wordnets can also be used for adding another language to a query, if appropriate, thus making the query bilingual. Problems encountered in retrieving documents of interest are discussed and illustrated by examples. A brief description of the resources is given, followed by an outline of the web tool which enables their integration. Finally, a set of examples is chosen to illustrate the use of the lexical resources and the tool in question. Results for these examples show that the number of documents retrieved by a query can double, and in some cases even quadruple, with our approach.
Language Single language
Topics LR web services, MultiWord Expressions & Collocations, Information Extraction, Information Retrieval
Bibtex @InProceedings{KRSTEV08.67,
  author = {Cvetana Krstev and Ranka Stanković and Duško Vitas and Ivan Obradović},
  title = {The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines},
  booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
  year = {2008},
  month = {may},
  date = {28-30},
  address = {Marrakech, Morocco},
  editor = {{Nicoletta Calzolari (Conference Chair)} and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-4-0},
  note = {http://www.lrec-conf.org/proceedings/lrec2008/},
  language = {english}
  }
