Assessing automatically extracted bilingual lexicons for CLIR in vertical domains

Jean-Michel Renders, Hervé Dejean, Eric Gaussier
In this paper, we describe the methods we used for the Cross-Language Evaluation Forum (CLEF) 2002, and more specifically for the GIRT task. The methods are based on (1) the extraction of two bilingual lexicons, one from parallel corpora and the other from comparable corpora, (2) the optimal combination of these bilingual lexicons in Cross-Language Information Retrieval (CLIR), and (3) the combination with monolingual IR on parallel corpora. While our original submission to CLEF 2002 was restricted to short queries (using only the title field), we present here the results extended to complete queries.
To appear in "Lecture Notes in Computer Science"