Across the bridge: CLEF 2001 - Non-English monolingual retrieval. The French task.
Matayo, Eugenia and Valsamidis, Tony (2002) Across the bridge: CLEF 2001 - Non-English monolingual retrieval. The French task. In: Evaluation of Cross-Language Information Retrieval Systems. Second Workshop of the Cross-Language Evaluation Forum, CLEF 2001, Darmstadt, Germany, September 3-4, 2001. Revised Papers. Lecture Notes in Computer Science (2406). Springer, Berlin, Germany, pp. 291-299. ISBN 9783540456919 ISSN 0302-9743
Full text not available from this repository.Abstract
This paper presents work on document retrieval based on first time participation in the CLEF 2001 monolingual retrieval task using French. The experiment findings indicated that Okapi, the text retrieval system in use, can successfully be used for non-English text retrieval. A lot of internal pre-processing is required in the basic search system for conversion into Okapi access formats. Various shell scripts were written to achieve the conversion in a UNIX environment, failure of which would significantly have impeded the overall performance. Based on the experiment findings using Okapi - originally designed for English - it was clear that, although most European languages share conventional word boundaries and variant word morphemes formed by the additon of suffixes, there is significant difference between French and English retrieval depending on the adaptation of indexing and search strategies in use. No sophisticated method for higher recall and precision such as stemming techniques, phrase translation or de-compounding was employed for the experiment and our results were suggestively poor. Future participation would include more refined query translation tools.
Item Type: | Conference Proceedings |
---|---|
Title of Proceedings: | Evaluation of Cross-Language Information Retrieval Systems. Second Workshop of the Cross-Language Evaluation Forum, CLEF 2001, Darmstadt, Germany, September 3-4, 2001. Revised Papers |
Additional Information: | [1] This paper was first presented at the Second Workshop of the Cross-Language Evaluation Forum, (CLEF 2001), held from 3-4 September 2001 in Darmstadt, Germany. |
Uncontrolled Keywords: | cross-language information retrieval, document retrieval, machine translation, English language, French language |
Subjects: | P Language and Literature > P Philology. Linguistics P Language and Literature > PC Romance languages Q Science > QA Mathematics > QA76 Computer software |
Pre-2014 Departments: | School of Computing & Mathematical Sciences School of Computing & Mathematical Sciences > Department of Information Systems & Digital Media School of Computing & Mathematical Sciences > eCentre |
Related URLs: | |
Last Modified: | 14 Oct 2016 09:01 |
URI: | http://gala.gre.ac.uk/id/eprint/587 |
Actions (login required)
View Item |