creators_name: Nadeau, David creators_name: Jarmasz, Mario creators_name: Barrière, Caroline creators_name: Foster, George creators_name: St-Jacques, Claude type: confpaper datestamp: 2005-06-19 lastmod: 2011-03-11 08:56:05 metadata_visibility: show title: Using COTS Search Engines and Custom Query Strategies at CLEF ispublished: pub subjects: comp-sci-art-intel full_text_status: public keywords: cross-language information retrieval abstract: This paper presents a system for bilingual information retrieval using commercial off-the-shelf search engines (COTS). Several custom query construction, expansion and translation strategies are compared. We present the experiments and the corresponding results for the CLEF 2004 event. date: 2004 date_type: published pagerange: 73-80 refereed: TRUE referencetext: Brown, P. F., Della Pietra, S. A., Della Pietra, V. J., and Mercer, R. L. (1993). The Mathematics of Statistical Machine Translation: Parameter Estimation. Computational Linguistics, 19(2):263-311. Buckley, C. and Salton, G. (1995), Optimization of relevance feedback weights, Proceedings of the 18th annual international ACM SIGIR conference on research and development in information retrieval, 351-357. Chen, A. (2002), Cross-Language Retrieval Experiments at CLEF 2002, CLEF 2002, Cross-Language Evaluation Forum. Cöster, R., Sahlgren, M. and Karlgren, J. (2003), Selective compound splitting of Swedish queries for Boolean combinations of truncated terms, CLEF 2003, Cross-Language Evaluation Forum. Craswell, N., Hawking, D., Wilkinson R. and Wu M. (2003), Overview of the TREC 2003 Web Track, The Twelfth Text Retrieval Conference, TREC-2003, Washington, D. C. Jarmasz, M. and Barrière, C. (2004), A Terminological Resource and a Terabyte-Sized Corpus for Automatic Keyphrase in Context Translation, Technical Report, National Research Council of Canada. Klir, G. J. and Yuan B. (1995), Fuzzy Sets and Fuzzy Logic, Prentice Hall: Upper Saddle River, NJ. Lam-Adesina, A.M., Jones, G.J.H. (2002), Exeter at CLEF 2001: Experiments with Machine Translation for bilingual retrieval, CLEF 2001, LNCS 2406, Peters, C., Braschler, M., Gonzalo, J. and Kluck, M. (Eds.), Springer, Germany. Miyamoto, S. (1990). Fuzzy Sets in Information Retrieval and Cluster Analysis. Dordrecht, Netherlands: Kluwer Academic Publishers. Porter, M. F. (1980). An Algorithm for Suffix Stripping. Program, 14(3): 130-127. Salton, G. and Buckley, C. (1988), Term-weighting approaches in automatic text retrieval, Information Processing and Management: an International Journal, 24 (5): 513-523 Terra, E. and Clarke, C.L.A.. (2003), Frequency estimates for statistical word similarity measures. In Proceedings of the Human Language Technology and North American Chapter of Association of Computational Linguistics Conference 2003 (HLT/NAACL 2003), Edmonton, Canada, 244 – 251. Turney, P.D. (2000), Learning Algorithms for Keyphrase Extraction, Information Retrieval, 2(4): 303-336. Verlinde, S., Selva, T. & GRELEP (Groupe de Recherche en Lexicographie Pédagogique) (2003). Dafles citation: Nadeau, David and Jarmasz, Mario and Barrière, Caroline and Foster, George and St-Jacques, Claude (2004) Using COTS Search Engines and Custom Query Strategies at CLEF. [Conference Paper] document_url: http://cogprints.org/4398/1/NRC-48082.pdf