Summary of the paper

Title Neural Embedding Language Models in Semantic Clustering of Web Search Results
Authors Andrey Kutuzov and Elizaveta Kuzmenko
Abstract In this paper, a new approach towards semantic clustering of the results of ambiguous search queries is presented. We propose using distributed vector representations of words trained with the help of prediction-based neural embedding models to detect senses of search queries and to cluster search engine results page according to these senses. The words from titles and snippets together with semantic relationships between them form a graph, which is further partitioned into components related to different query senses. This approach to search engine results clustering is evaluated against a new manually annotated evaluation data set of Russian search queries. We show that in the task of semantically clustering search results, prediction-based models slightly but stably outperform traditional count-based ones, with the same training corpora.
Topics Semantics, Information Extraction, Information Retrieval, Word Sense Disambiguation
Full paper Neural Embedding Language Models in Semantic Clustering of Web Search Results
Bibtex @InProceedings{KUTUZOV16.1223,
  author = {Andrey Kutuzov and Elizaveta Kuzmenko},
  title = {Neural Embedding Language Models in Semantic Clustering of Web Search Results},
  booktitle = {Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)},
  year = {2016},
  month = {may},
  date = {23-28},
  location = {Portorož, Slovenia},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Sara Goggi and Marko Grobelnik and Bente Maegaard and Joseph Mariani and Helene Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  address = {Paris, France},
  isbn = {978-2-9517408-9-1},
  language = {english}
 }
Powered by ELDA © 2016 ELDA/ELRA