An initial investigation of query expansion bias

Wilkie, Colin and Azzopardi, Leif; (2017) An initial investigation of query expansion bias. In: ICTIR 2017 - Proceedings of the 2017 ACM SIGIR International Conference on the Theory of Information Retrieval. ACM, NLD, pp. 285-288. ISBN 9781450344906 (https://doi.org/10.1145/3121050.3121097)

[thumbnail of Wilkie-Azzopardi-SIGIR-2017-An-initial-investigation-of-query-expansion-bias]
Preview
Text. Filename: Wilkie_Azzopardi_SIGIR_2017_An_initial_investigation_of_query_expansion_bias.pdf
Accepted Author Manuscript

Download (522kB)| Preview

Abstract

Query expansion is a useful retrieval mechanism for creating more verbose queries from the users initial keyword search. Query expansion generally have multiple parameters that allowthe user to define how many terms and where those terms come from are introduced to the expanded query. However, the idea that query expansion may be introducing biases into the system by selecting terms from overly retrievable documents has never been formally evaluated. In this work, the relationship between performance and retrievability bias is explored when various query expansion methods are employed to aide retrieval. Several parameters are altered, independently, to identify those that have an impact on bias. Parameters altered include; Rocchio's beta, length normalisation parameters, the number of terms added and the number of documents those terms are extracted from. The evaluation performed here identifies a strong correlation between performance and retrievability bias, suggesting that performance is increased by making the system more biased thus more likely to pick terms from a set of overly retrievable documents.