A flexible framework for offline effectiveness metrics
Moffat, Alistair and Mackenzie, Joel and Thomas, Paul and Azzopardi, Leif; (2022) A flexible framework for offline effectiveness metrics. In: SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval . ACM, New York, pp. 578-587. ISBN 9781450387323 (https://doi.org/10.1145/3477495.3531924)
Preview |
Text.
Filename: Moffat_etal_SIGIR_2022_A_flexible_framework_for_offline_effectiveness_metrics.pdf
Accepted Author Manuscript License: Strathprints license 1.0 Download (446kB)| Preview |
Abstract
The use of offline effectiveness metrics is one of the cornerstones of evaluation in information retrieval. Static resources that include test collections and sets of topics, the corresponding relevance judgments connecting them, and metrics that map document rankings from a retrieval system to numeric scores have been used for multiple decades as an important way of comparing systems. The basis behind this experimental structure is that the metric score for a system can serve as a surrogate measurement for user satisfaction. Here we introduce a user behavior framework that extends the C/W/L family. The essence of the new framework - which we call C/W/L/A - is that the user actions that are undertaken while reading the ranking can be considered separately from the benefit that each user will have derived as they exit the ranking. This split structure allows the great majority of current effectiveness metrics to be systematically categorized, and thus their relative properties and relationships to be better understood; and at the same time permits a wide range of novel combinations to be considered. We then carry out experiments using relevance judgments, document rankings, and user satisfaction data from two distinct sources, comparing the patterns of metric scores generated, and showing that those metrics vary quite markedly in terms of their ability to predict user satisfaction.
-
-
Item type: Book Section ID code: 84879 Dates: DateEvent7 July 2022Published31 March 2022AcceptedSubjects: Bibliography. Library Science. Information Resources > Library Science. Information Science > Information storage and retrieval systems
Science > Mathematics > Electronic computers. Computer science > Other topics, A-Z > Human-computer interactionDepartment: Faculty of Science > Computer and Information Sciences Depositing user: Pure Administrator Date deposited: 24 Mar 2023 15:21 Last modified: 11 Nov 2024 15:30 URI: https://strathprints.strath.ac.uk/id/eprint/84879