Conference paper. Sources: dblp, Scopus

Clique Analysis of Query Log Graphs

String Processing and Information Retrieval. SPIRE

Francisco A.P.; Baeza-Yates R.; Oliveira A.L. (2008)

Key information

Authors:

Francisco A.P. (A P Francisco); Baeza-Yates R.; Oliveira A.L. (Arlindo Manuel Limede de Oliveira)

Published on

01/01/2008

Abstract

In this paper we propose a method for the analysis of very large graphs obtained from query logs, using query coverage inspection. The goal is to extract semantic relations between queries and their terms. We take a new approach to successfully and efficiently cluster these large graphs by analyzing clique overlap and a priori induced cliques. The clustering quality is evaluated with an extension of the modularity score. Results obtained with real data show that the identified clusters can be used to infer properties of the queries and interesting semantic relations between them and their terms. The quality of the semantic relations is evaluated both using a tf-idf based score and data from the Open Directory Project. The proposed approach is also able to identify and filter out multitopical URLs, a feature that is interesting in itself.

Publication details

Publication container title

String Processing and Information Retrieval. SPIRE

First page or article number

188

Last page

199

Volume

5280 LNCS

Scientific domain (FOS)

electrical-engineering-electronic-engineering-information-engineering - Electrical Engineering, Electronic Engineering, Information Engineering

Publication language (ISO code)

eng - English

Access to publication:

Metadata-only access