Skip navigation

Examining the information retrieval process from an inductive perspective

Examining the information retrieval process from an inductive perspective

Cummins, Ronan, Lalmas, Mounia and O’Riordan, Colm (2010) Examining the information retrieval process from an inductive perspective. In: Proceedings of the 19th ACM international conference on Information and knowledge management. ACM, New York, NY, USA, pp. 89-98. ISBN 978-1450300995 (doi:10.1145/1871437.1871453)

Full text not available from this repository. (Request a copy)

Abstract

Term-weighting functions derived from various models of retrieval aim to model human notions of relevance more accurately. However, there is a lack of analysis of the sources of evidence from which important features of these term weighting schemes originate. In general, features pertaining to these term-weighting schemes can be collected from (1) the document, (2) the entire collection and (3) the query. In this work, we perform an empirical analysis to determine the increase in effectiveness as information from these three different sources becomes more accurate.

First, we determine the number of documents to be indexed to accurately estimate collection-wide features to obtain near optimal effectiveness for a range of a term-weighting functions. Similarly, we determine the amount of a document and query that must be sampled to achieve near-peak effectiveness. This analysis also allows us to determine the factors that contribute most to the performance of a term-weighting function (i.e. the document, the collection or the query).

We use our framework to construct a new model of weighting where we discard the 'bag of words' model and aim to retrieve documents based on the initial physical representation of a document using some basic axioms of retrieval. We show that this is a good first step towards incorporating some more interesting features into a term-weighting function

Item Type: Conference Proceedings
Title of Proceedings: Proceedings of the 19th ACM international conference on Information and knowledge management
Additional Information: [1] First published: 2010. [2] Published as: Cummins, Ronan, Lalmas, Mounia and O’Riordan, Colm (2010) Examining the information retrieval process from an inductive perspective. In: Proceedings of the 19th ACM international conference on Information and knowledge management. ACM, New York, NY, USA, pp. 89-98. [3] This paper was first presented at CIKM '10, the International Conference on Information and Knowledge Management, held from 26-30 October 2010 in Toronto, Ontario, Canada. It was given in Session 1B on 27 October 2010 within the IR Track: Ranking and Retrieval Model.
Uncontrolled Keywords: experimentation, performance, information retrieval, models, term-weighting, permission
Subjects: Q Science > QA Mathematics > QA76 Computer software
Pre-2014 Departments: School of Computing & Mathematical Sciences
Related URLs:
Last Modified: 26 Sep 2019 10:34
URI: http://gala.gre.ac.uk/id/eprint/10094

Actions (login required)

View Item View Item