Combination of Similarity Measures for Effective Spoken Document Retrieval

Crestani, F. (2003) Combination of Similarity Measures for Effective Spoken Document Retrieval. Journal of Information Science, 29 (2). pp. 87-96. ISSN 0165-5515

Abstract

Users of information retrieval systems and document authors often use different terms to refer to the same concept. For this simple reason, information retrieval is affected by the 'term mismatch' problem. Term mismatch not only hinders the retrieval of relevant documents, it also produces poor rankings of relevant documents. A similar problem arises in spoken document retrieval, where terms misrecognized by the speech recognition process can hinder the retrieval of potentially relevant spoken documents. We call this problem 'term misrecognition', by analogy with the term mismatch problem. This paper presents two classes of retrieval models that attempt to tackle both the term mismatch and the term misrecognition problems at retrieval time using term similarity information. The models use either complete or partial knowledge of semantic and phonetic term similarity, estimated from the corpus using statistical methods.
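
As an illustration of how term similarity can be used at retrieval time, the sketch below scores a document against a query by letting every query term match every document term, weighted by a similarity value. It is not the paper's model: the function names, the linear mixing weight `alpha`, and the toy similarity tables are all assumptions made for this example, and the paper estimates similarities from the corpus rather than taking them from a hand-written table.

```python
from typing import Dict, Tuple

# Hypothetical similarity table: maps a pair of terms to a value in [0, 1].
SimTable = Dict[Tuple[str, str], float]


def lookup(a: str, b: str, table: SimTable) -> float:
    """Symmetric lookup; identical terms are fully similar, unknown pairs are 0."""
    if a == b:
        return 1.0
    return table.get((a, b), table.get((b, a), 0.0))


def combined_similarity(a: str, b: str, semantic: SimTable,
                        phonetic: SimTable, alpha: float = 0.5) -> float:
    """Linear mix of semantic and phonetic similarity (an assumed combination)."""
    return alpha * lookup(a, b, semantic) + (1.0 - alpha) * lookup(a, b, phonetic)


def score(query: Dict[str, float], doc: Dict[str, float],
          semantic: SimTable, phonetic: SimTable, alpha: float = 0.5) -> float:
    """Similarity-weighted dot product between query and document term weights."""
    total = 0.0
    for q_term, q_weight in query.items():
        for d_term, d_weight in doc.items():
            s = combined_similarity(q_term, d_term, semantic, phonetic, alpha)
            total += q_weight * d_weight * s
    return total


# Toy data, invented for illustration only.
semantic = {("car", "automobile"): 0.8}   # different word, same concept
phonetic = {("car", "cart"): 0.6}         # plausible recognition error

query = {"car": 1.0}
exact_doc = {"car": 1.0}
mismatch_doc = {"automobile": 1.0}        # term mismatch
misrecognized_doc = {"cart": 1.0}         # term misrecognition
unrelated_doc = {"boat": 1.0}
```

With plain term matching, only `exact_doc` would receive a non-zero score. Under this sketch, `mismatch_doc` and `misrecognized_doc` also receive partial credit through the semantic and phonetic tables respectively, while `unrelated_doc` still scores zero.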