
Open Access research with a European policy impact...

The Strathprints institutional repository is a digital archive of the University of Strathclyde's Open Access research outputs. Strathprints provides access to thousands of Open Access research papers by Strathclyde researchers, including papers by researchers from the European Policies Research Centre (EPRC).

EPRC is a leading institute in Europe for comparative research on public policy, with a particular focus on regional development policies. Spanning 30 European countries, EPRC research programmes have a strong emphasis on applied research and knowledge exchange, including the provision of policy advice to EU institutions and national and sub-national government authorities throughout Europe.

Explore research outputs by the European Policies Research Centre...

Validating simulated interaction for retrieval evaluation

Pääkkönen, Teemu and Kekäläinen, Jaana and Keskustalo, Heikki and Azzopardi, Leif and Maxwell, David and Järvelin, Kalervo (2017) Validating simulated interaction for retrieval evaluation. Information Retrieval Journal.

Accepted Author Manuscript: Paakkonen_etal_IRJ_2017_Validating_simulated_interaction_for_retrieval_evaluation.pdf (PDF, 1MB). Restricted to repository staff only until 6 May 2018; a copy may be requested from the Strathclyde author.

Abstract

A searcher’s interaction with a retrieval system consists of actions such as query formulation, search result list interaction and document interaction. The simulation of searcher interaction has recently gained momentum in the analysis and evaluation of interactive information retrieval (IIR). However, a key issue that has not yet been adequately addressed is the validity of such IIR simulations and whether they reliably predict the performance a searcher obtains across a session. The aim of this paper is to determine the validity of the common interaction model (CIM) typically used for simulating multi-query sessions. We focus on search result interactions, i.e., inspecting snippets, examining documents and deciding when to stop examining the results of a single query, or when to stop the whole session. To this end, we run a series of simulations grounded in real-world behavioral data to show how accurate and responsive the model is to the various experimental conditions under which the data were produced. We then validate the model on a second real-world data set derived under similar experimental conditions. We seek to predict cumulated gain across the session. We find that the interaction model with a query-level stopping strategy based on consecutive non-relevant snippets leads to the highest prediction accuracy and the lowest deviation from the ground truth, around 9 to 15% depending on the experimental conditions. To our knowledge, the present study is the first validation effort of the CIM showing that the model's acceptance and use within IIR evaluations are justified. We also identify and discuss ways to further improve the CIM and its behavioral parameters for more accurate simulations.
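To make the kind of simulation described in the abstract concrete, the sketch below shows one plausible way to simulate result-list interaction over a multi-query session with a query-level stopping rule based on consecutive non-relevant snippets, accumulating gain as documents are examined. All function and parameter names, probabilities and the toy data are illustrative assumptions for this page, not the authors' implementation or parameter values.

import random

def simulate_session(queries, snippet_relevance, doc_gain,
                     p_click_rel=0.8, p_click_nonrel=0.2,
                     nonrel_stop=3, seed=42):
    """Simulate search result interaction across a multi-query session.

    queries           -- list of query identifiers for the session
    snippet_relevance -- dict: query -> list of 0/1 snippet relevance labels
    doc_gain          -- dict: query -> list of per-document gain values
    nonrel_stop       -- query-level stopping rule: abandon the query after
                         this many consecutive non-relevant snippets
    Returns the cumulated gain over the whole session.
    """
    rng = random.Random(seed)
    cumulated_gain = 0.0
    for q in queries:
        consecutive_nonrel = 0
        for rank, rel in enumerate(snippet_relevance[q]):
            # Decide whether the simulated searcher clicks through from the
            # snippet to the document (more likely for relevant snippets).
            p_click = p_click_rel if rel else p_click_nonrel
            if rng.random() < p_click:
                cumulated_gain += doc_gain[q][rank]
            # Track consecutive non-relevant snippets and apply the
            # query-level stopping rule.
            consecutive_nonrel = 0 if rel else consecutive_nonrel + 1
            if consecutive_nonrel >= nonrel_stop:
                break
    return cumulated_gain

# Toy usage with fabricated relevance labels, purely to show the interface.
queries = ["q1", "q2"]
snippet_relevance = {"q1": [1, 0, 0, 0, 1], "q2": [0, 1, 1, 0, 0]}
doc_gain = {"q1": [2.0, 0, 0, 0, 1.0], "q2": [0, 1.0, 1.0, 0, 0]}
print(simulate_session(queries, snippet_relevance, doc_gain))

In a validation study such as the one described above, the simulated cumulated gain would then be compared against the gain observed in real session logs collected under the same experimental conditions.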