Improving social bookmark search using personalised latent variable language models

Tools

Harvey, Morgan and Ruthven, Ian and Carman, Mark J. (2011) Improving social bookmark search using personalised latent variable language models. In: 4th ACM International Conference on Web Search and Data Mining, 2011-02-09 - 2011-02-12. (https://doi.org/10.1145/1935826.1935898)

[thumbnail of Harvey-etal-WSDM-2011-Improving-social-bookmark-search-using-personalised-latent-variable-language-models]

Preview

Text. Filename: Harvey_etal_WSDM_2011_Improving_social_bookmark_search_using_personalised_latent_variable_language_models.pdf
Accepted Author Manuscript
Download (433kB)| Preview

Abstract

Social tagging systems have recently become very popular as a method of categorising information online and have been used to annotate a wide range of different resources. In such systems users are free to choose whatever keywords or 'tags' they wish to annotate each resource, resulting in a highly personalised, unrestricted vocabulary. While this freedom of choice has several notable advantages, it does come at the cost of making searching of these systems more difficult as the vocabulary problem introduced is more pronounced than in a normal information retrieval setting. In this paper we propose to use hidden topic models as a principled way of reducing the dimensionality of this data to provide more accurate resource rankings with higher recall. We first describe Latent Dirichlet Allocation (LDA), a sim- ple topic model and then introduce 2 extended models which can be used to personalise the results by including informa- tion about the user who made each annotation. We test these 3 models and compare them with 3 non-topic model baselines on a large data sample obtained from the Delicious social bookmarking site. Our evaluations show that our methods significantly outperform all of the baselines with the personalised models also improving significantly upon unpersonalised LDA.

ORCID iDs

Harvey, Morgan, Ruthven, Ian

and Carman, Mark J.;

Share and Export

Item metadata

Item type:	Conference or Workshop Item(Paper)
ID code:	63044
Dates:	Date Event 12 February 2011 Published
Subjects:	Bibliography. Library Science. Information Resources > Library Science. Information Science
Department:	Faculty of Science > Computer and Information Sciences
Depositing user:	Pure Administrator
Date deposited:	25 Jan 2018 12:17
Last modified:	25 Apr 2025 17:31
URI:	https://strathprints.strath.ac.uk/id/eprint/63044

CORE (COnnecting REpositories)