Top 82 papers published in the topic of Clef in 2008

Showing papers on "Clef published in 2008"

Book•

Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers

[...]

Carol Peters, Valentin Jijkoun, Thomas Mandl, Henning Müller, Douglas W. Oard, Anselmo Peñas, Vivien Petras, Diana Santos - Show less +4 more

1 May 2008

TL;DR: The proceedings of the 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, held in Budapest, Hungary, September 2007, were thoroughly refereed proceedings as mentioned in this paper.

...read moreread less

Abstract: This book constitutes the thoroughly refereed proceedings of the 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, held in Budapest, Hungary, September 2007. The revised and extended papers were carefully reviewed and selected for inclusion in the book. There are 115 contributions in total and an introduction. The seven distinct evaluation tracks in CLEF 2007, are designed to test the performance of a wide range of multilingual information access systems or system components. The papers are organized in topical sections on Multilingual Textual Document Retrieval (Ad Hoc), Domain-Specific Information Retrieval (Domain-Specific), Multiple Language Question Answering (QA@CLEF), cross-language retrieval in image collections (Image CLEF), cross-language speech retrieval (CL-SR), multilingual Web retrieval (WebCLEF), cross-language geographical retrieval (GeoCLEF), and CLEF in other evaluations.

...read moreread less

147 citations

Proceedings Article•

Working Notes for the CLEF 2008 Workshop

[...]

Pamela Forner¹, Anselmo Peñas, Iñaki Alegria², Corina Forascu³, Nicolas Moreau, Petya Osenova⁴, Prokopis Prokopidis, Paulo Rocha, Bogdan Sacaleanu, Richard F. E. Sutcliffe, E.F. Tjong Kim Sang - Show less +7 more•Institutions (4)

Information Sciences Institute¹, University of the Basque Country², Romanian Academy³, Bulgarian Academy of Sciences⁴

1 Jan 2008

102 citations

Book Chapter•10.1007/978-3-540-85760-0_86•

Overview of the CLEF-2007 Cross-Language Speech Retrieval Track

[...]

Pavel Pecina¹, Petra Hoffmannová¹, Gareth J. F. Jones², Ying Zhang², Douglas W. Oard³ - Show less +1 more•Institutions (3)

Charles University in Prague¹, Dublin City University², University of Maryland, College Park³

1 May 2008

TL;DR: The CLEF-2007 Cross-Language Speech Retrieval (CL-SR) track included two tasks: to identify topically coherent segments of English interviews in a known-boundary condition and to identify time stamps marking the beginning of topically relevant passages in Czech interviews in an unknown- boundary condition.

...read moreread less

Abstract: The CLEF-2007 Cross-Language Speech Retrieval (CL-SR) track included two tasks: to identify topically coherent segments of English interviews in a known-boundary condition, and to identify time stamps marking the beginning of topically relevant passages in Czech interviews in an unknown-boundary condition. Six teams participated in the English evaluation, performing both monolingual and cross-language searches of ASR transcripts, automatically generated metadata, and manually generated metadata. Four teams participated in the Czech evaluation, performing monolingual searches of automatic speech recognition transcripts.

...read moreread less

74 citations

Book Chapter•10.1007/978-3-540-85760-0_10•

Cross-Lingual Information Retrieval System for Indian Languages

[...]

Jagadeesh Jagarlamudi¹, Alaganandam Kumaran¹•Institutions (1)

Microsoft¹

1 May 2008

TL;DR: This paper describes the attempt to build a Cross-Lingual Information Retrieval (CLIR) system as a part of the Indian language sub-task of the main Adhoc monolingual and bilingual track in CLEF competition.

...read moreread less

Abstract: This paper describes our attempt to build a Cross-Lingual Information Retrieval (CLIR) system as a part of the Indian language sub-task of the main Adhoc monolingual and bilingual track in CLEF competition. In this track, the task required retrieval of relevant documents from an English corpus in response to a query expressed in different Indian languages including Hindi, Tamil, Telugu, Bengali and Marathi. Groups participating in this track were required to submit a English to English monolingual run and a Hindi to English bilingual run with optional runs in rest of the languages. Our submission consisted of a monolingual English run and a Hindi to English cross-lingual run. We used a word alignment table that was learnt by a Statistical Machine Translation (SMT) system trained on aligned parallel sentences, to map a query in the source language into an equivalent query in the language of the document collection. The relevant documents are then retrieved using a Language Modeling based retrieval algorithm. On the CLEF 2007 data set, our official cross-lingual performance was 54.4% of the monolingual performance and in the post submission experiments we found that it can be significantly improved up to 76.3%.

...read moreread less

67 citations

Book Chapter•10.1007/978-3-540-85760-0_2•

CLEF 2007: Ad Hoc Track Overview

[...]

Giorgio Maria Di Nunzio¹, Nicola Ferro¹, Thomas Mandl², Carol Peters³•Institutions (3)

University of Padua¹, University of Hildesheim², Istituto di Scienza e Tecnologie dell'Informazione³

1 May 2008

TL;DR: The objectives and organization of the CLEF 2007 Ad Hoc track are described and the main characteristics of the tasks offered to test monolingual and cross-language textual document retrieval systems are discussed.

...read moreread less

Abstract: We describe the objectives and organization of the CLEF 2007 Ad Hoc track and discuss the main characteristics of the tasks offered to test monolingual and cross-language textual document retrieval systems. The track was divided into two streams. The main stream offered mono- and bilingual tasks on target collections for central European languages (Bulgarian, Czech and Hungarian). Similarly to last year, a bilingual task that encouraged system testing with non-European languages against English documents was also offered; this year, particular attention was given to Indian languages. The second stream, designed for more experienced participants, offered mono- and bilingual "robust" tasks with the objective of privileging experiments which achieve good stable performance over all queries rather than high average performance. These experiments re-used CLEF test collections from previous years in three languages (English, French, and Portuguese). The performance achieved for each task is presented and discussed.

...read moreread less

49 citations

Book Chapter•10.1007/978-3-540-85760-0_22•

The Domain-Specific Track at CLEF 2007

[...]

Vivien Petras, Stefan Baerisch, Maximillian Stempfhuber

1 May 2008

TL;DR: An overview of the CLEF domain-specific track is presented including a description of the tasks, collections, topic preparation, and relevance assessments as well as contributions to the track.

...read moreread less

Abstract: The domain-specific track uses test collections from the social science domain to test monolingual and cross-language retrieval in structured bibliographic databases. Special attention is given to the existence of controlled vocabularies for content description and their potential usefulness in retrieval. Test collections and topics are provided in German, English and Russian. This year, a new English test collection (from the CSA Sociological Abstracts database) was added. We present an overview of the CLEF domain-specific track including a description of the tasks, collections, topic preparation, and relevance assessments as well as contributions to the track. The track participants experimented with different retrieval models ranging from classic vector-space to probabilistic to language models. The controlled vocabularies were used for query expansion or as bilingual dictionaries for query translation.

...read moreread less

29 citations

AliQAn, Spanish QA System at CLEF-2008 ⁄

[...]

Sandra Roger, Katia Vila, Antonio Ferrández, María Pardiño, José Manuel Gómez Soriano, Marcel Puchol-Blasco, Jesús Peral - Show less +3 more

1 Jan 2008

TL;DR: In the CLEF-2008 Spanish monolingual QA evaluation task, this article described the participation of the system AliQAn, which was developed in the Department of Language Processing and Information System at the University of Alicante.

...read moreread less

Abstract: This paper describes the participation of the system AliQAn, a monolingual opendomain Question Answering (QA) System developed in the Department of Language Processing and Information System at the University of Alicante, in the CLEF-2008 Spanish monolingual QA evaluation task. Here, we focus on explaining a couple of strong points of the current version of AliQAn: (i) our algorithm for dealing with topic-related questions, and (ii) our approach for decreasing the number of inexact answers. We have also explored the use of the Wikipedia corpora, which have proposed some new challenges for the QA task. Besides, the achieved results (overall accuracy of 19.50%) are shown and discussed in this paper.

...read moreread less

28 citations

Proceedings Article•

FIRE: Forum for Information Retrieval Evaluation

[...]

Mandar Mitra¹, Prosenjit Majumdar•Institutions (1)

Indian Statistical Institute¹

1 Jan 2008

TL;DR: The basic experimental framework and metrics as represented by the Cranfield paradigm will be reviewed, and an overview of the main evaluation fora for IR -TREC, CLEF and NTCIR - will be given.

...read moreread less

Abstract: This talk will present our plans for organizing FIRE, a forum for Information Retrieval evaluation, focused on Indian languages. We will start by reviewing the basic experimental framework and metrics as represented by the Cranfield paradigm. An overview of the main evaluation fora for IR -TREC, CLEF and NTCIR, which are all based on this paradigm, and from which FIRE draws inspiration -will be given. The components of the evaluation framework, viz. corpora, search topics, and relevance judgments will also be discussed.

...read moreread less

24 citations

Book Chapter•10.1007/978-3-540-85760-0_46•

Priberam's Question Answering System in QA@CLEF 2007

[...]

Carlos Amaral, Adán Cassan, Helena Figueira, André F. T. Martins, Afonso Mendes, Pedro Mendes, Cláudia Pinto, Daniel Vidal - Show less +4 more

1 May 2008

TL;DR: This paper accounts for Priberam's participation in the monolingual question answering (QA) track of CLEF 2007, which endowed the system with syntactical processing, in order to capture the syntactic structure of the question.

...read moreread less

Abstract: This paper accounts for Priberam's participation in the monolingual question answering (QA) track of CLEF 2007. In previous participations, Priberam's QA system obtained encouraging results both in monolingual and cross-language tasks. This year we endowed the system with syntactical processing, in order to capture the syntactic structure of the question. The main goal was to obtain a more tuned question categorisation and consequently a more precise answer extraction. Besides this, we provided our system with the ability to handle topic-related questions and to use encyclopaedic sources like Wikipedia. The paper provides a description of the improvements made in the system, followed by the discussion of the results obtained in Portuguese and Spanish monolingual runs.

...read moreread less

22 citations

Book Chapter•10.1007/978-3-540-85760-0_12•

Bengali and Hindi to English CLIR Evaluation

[...]

Debasis Mandal¹, Mayank Gupta¹, Sandipan Dandapat¹, Pratyush Banerjee¹, Sudeshna Sarkar¹ - Show less +1 more•Institutions (1)

Indian Institute of Technology Kharagpur¹

1 May 2008

TL;DR: The CLEF results point to the need for a rich bilingual lexicon, a translation disambiguator, Named Entity Recognizer and a better transliterator for CLIR involving Indian languages, as part of the participation in CLEF 2007 Ad-hoc bilingual track.

...read moreread less

Abstract: This paper presents a cross-language retrieval system for the retrieval of English documents in response to queries in Bengali and Hindi, as part of our participation in CLEF 2007 Ad-hoc bilingual track. We followed the dictionary-based Machine Translation approach to generate the equivalent English query out of Indian language topics. Our main challenge was to work with a limited coverage dictionary (of coverage ~ 20%) that was available for Hindi-English, and virtually non-existent dictionary for Bengali-English. So we depended mostly on a phonetic transliteration system to overcome this. The CLEF results point to the need for a rich bilingual lexicon, a translation disambiguator, Named Entity Recognizer and a better transliterator for CLIR involving Indian languages. The best MAP values for Bengali and Hindi CLIR for our experiment were 7.26% and 4.77%, which are 20% and 13% of our best monolingual retrieval, respectively.

...read moreread less

22 citations

Book Chapter•10.1007/978-3-540-85760-0_23•

The XTRIEVAL Framework at CLEF 2007: Domain-Specific Track

[...]

Jens Kürsten¹, Thomas Wilhelm¹, Maximilian Eibl¹•Institutions (1)

Chemnitz University of Technology¹

1 May 2008

TL;DR: The architecture and configuration of the XTRIEVAL (eXtensible reTRIeval and EVALuation) framework is described and the performance of the submitted runs was on average compared to other participating groups.

...read moreread less

Abstract: This article describes the architecture and configuration of the XTRIEVAL (eXtensible reTRIeval and EVALuation) framework. A first prototype is described in [1]. For CLEF 2007 a second prototype was implemented which was focused on the cross-language aspect. Runs for all subtasks of the Domain-Specific track were submitted. The performance of our submitted runs was on average compared to other participating groups. Additional experiments on the Multilingual task demonstrated substantial improvement.

...read moreread less

Question Answering with Joost at CLEF 2008

[...]

Gosse Bouma¹, Jori Mur, Gertjan van Noord, Lonneke van der Plas, Jörg Tiedemann - Show less +1 more•Institutions (1)

University of Groningen¹

1 Jan 2008

Book Chapter•10.1007/978-3-540-85760-0_6•

Bulgarian, Hungarian and Czech Stemming Using YASS

[...]

Prasenjit Majumder¹, Mandar Mitra¹, Dipasree Pal¹•Institutions (1)

Indian Statistical Institute¹

1 May 2008

TL;DR: It is found that, for all languages, YASS produces significant improvements over the baseline (unstemmed) runs and is comparable to that of other available stemmers for all the three east European Languages.

...read moreread less

Abstract: This is the second year in a row we are participating in CLEF. Our aim is to test the performance of a statistical stemmer on various languages. For CLEF 2006, we tried the stemmer on French [1]; while for CLEF 2007, we did experiments for the Hungarian, Bulgarian and Czech monolingual tasks. We find that, for all languages, YASS produces significant improvements over the baseline (unstemmed) runs. The performance of YASS is also found to be comparable to that of other available stemmers for all the three east European Languages.

...read moreread less

Book Chapter•10.1007/978-3-540-85760-0_29•

Overview of QAST 2007

[...]

Jordi Turmo¹, Pere R. Comas¹, Christelle Ayache, Djamel Mostefa, Sophie Rosset, Lori Lamel - Show less +2 more•Institutions (1)

Polytechnic University of Catalonia¹

1 May 2008

TL;DR: QAST, a pilot track of CLEF 2007 aimed at evaluating the task of Question Answering in Speech Transcripts, shows that question answering technology can be useful for manually transcribed speech as for automatically recognized speech.

...read moreread less

Abstract: This paper describes QAST, a pilot track of CLEF 2007 aimed at evaluating the task of Question Answering in Speech Transcripts. The paper summarizes the evaluation framework, the systems that participated and the results achieved. These results have shown that question answering technology can be useful to deal with spontaneous speech transcripts, so for manually transcribed speech as for automatically recognized speech. The loss in accuracy from dealing with manual transcripts to dealing with automatic ones implies that there is room for future reseach in this area.

...read moreread less

What happened in CLEF 2008 Introduction to the Working Notes

[...]

Carol Peters¹•Institutions (1)

National Research Council¹

1 Jan 2008

TL;DR: These Working Notes contain descriptions of the experiments conducted within CLEF 2004 – the fifth in a series of annual system evaluation campaigns organised by the Cross-Language Evaluation Forum, to provide the necessary background to the experiments reported in this volume.

...read moreread less

Abstract: These Working Notes contain descriptions of the experiments conducted within CLEF 2004 – the fifth in a series of annual system evaluation campaigns organised by the Cross-Language Evaluation Forum 1 . The results of the experiments will be presented and discussed in the CLEF 2004 Workshop, 15-17 September, Bath, UK. The final papers - revised and extended as a result of the discussions at the Workshop - together with a comparative analysis of the results will appear in the CLEF 2004 Proceedings, to be published by Springer in their Lecture Notes for Computer Science series. CLEF organises a series of evaluation tracks designed to test different aspects of mono- and cross-language information retrieval system development with the main focus on European languages. The objective is to provide an infrastructure that facilitates experimentation with all kinds of multilingual information access – from the development of procedures for monolingual retrieval operating on many languages to the implementation of complete multilingual multimedia search services. In a ddition, CLEF aims at encouraging contacts between the R&D and the application communities and promoting the industrial take-up of research results. The main features of the 2004 campaign are briefly outlined here below in order to provide the necessary background to the experiments reported in this volume. 1. Tracks and Tasks in CLEF 2004 In recent years, CLEF has distinguished between core tracks, which were those offered regularly in each campaign (the monolingual, bilingual, multilingual and domain-specific tracks), and additional tracks, which were organised on an experimental basis with the objective of identifying new requirements and appropriate methodologies for their testing in a cross-language context. This distinctio n no longer held in 2004. The great success of the so-called additional tracks in CLEF 2003, and in particular of th e tracks that tested systems for question answering and image retrieval, has led to their inclusion as regular tracks this year. This has meant that CLEF 2004 marks a breaking point with respect to the previous campaigns. The focus is no longer on multilingual document retrieval but has diversified to include different kinds of text retrieval across languages (from documents to exact answers) and retrieval on different kinds of media (not just text but image and speech as well). CLEF 2004 thus offered six tracks designed to evaluate the performance of systems for: • mono-, bi- and multilingual document retrieval on news collections (Ad-hoc) • mono- and cross-language domain-specific retrieval (GIRT) • interactive cross-language retrieval (iCLEF) • multiple language question answering (QA@CLEF) • cross-language retrieval on image collections (ImageCLEF) • cross-language spoken document retrieval (CL-SDR)

...read moreread less

Proceedings Article•

Question Answering on Speech Transcriptions: the QAST evaluation in CLEF

[...]

Lori Lamel, Sophie Rosset, Christelle Ayache, Djamel Mostefa, Jordi Turmo¹, Pere R. Comas¹ - Show less +2 more•Institutions (1)

Polytechnic University of Catalonia¹

1 May 2008

TL;DR: The framework and results of the pilot QAst evaluation held as part of CLEF 2007 is described, illustrating some of the additional challenges posed by QA in spoken documents relative to written ones.

...read moreread less

Abstract: This paper reports on the QAST track of CLEF aiming to evaluate Question Answering on Speech Transcriptions. Accessing information in spoken documents provides additional challenges to those of text-based QA, needing to address the characteristics of spoken language, as well as errors in the case of automatic transcriptions of spontaneous speech. The framework and results of the pilot QAst evaluation held as part of CLEF 2007 is described, illustrating some of the additional challenges posed by QA in spoken documents relative to written ones. The current plans for future multiple-language and multiple-task QAst evaluations are described.

...read moreread less

Book Chapter•10.1007/978-3-540-85760-0_11•

Bengali, Hindi and Telugu to English Ad-Hoc Bilingual Task at CLEF 2007

[...]

Sivaji Bandyopadhyay¹, Tapabrata Mondal¹, Sudip Kumar Naskar¹, Asif Ekbal¹, Rejwanul Haque¹, Srinivasa Rao Godhavarthy¹ - Show less +2 more•Institutions (1)

Jadavpur University¹

1 May 2008

TL;DR: This paper presents the experiments carried out at Jadavpur University as part of the participation in the CLEF 2007 ad-hoc bilingual task, which considered Bengali, Hindi and Telugu as query languages for the retrieval from English document collection.

...read moreread less

Abstract: This paper presents the experiments carried out at Jadavpur University as part of the participation in the CLEF 2007 ad-hoc bilingual task. This is our first participation in the CLEF evaluation task and we have considered Bengali, Hindi and Telugu as query languages for the retrieval from English document collection. We have discussed our Bengali, Hindi and Telugu to English CLIR system as part of the ad-hoc bilingual task, the English IR system for the ad-hoc monolingual task and the associated experiments at CLEF. Query construction was manual for Telugu-English ad-hoc bilingual task, while it was automatic for all other tasks.

...read moreread less

Book Chapter•10.1007/978-3-540-85760-0_33•

Multilingual Question Answering through Intermediate Translation: LCC's PowerAnswer at QA@CLEF 2007

[...]

Mitchell Bowden¹, Marian Olteanu¹, Pasin Suriyentrakorn¹, Thomas William D'Silva¹, Dan Moldovan¹ - Show less +1 more•Institutions (1)

Language Computer Corporation¹

1 May 2008

TL;DR: This paper reports on Language Computer Corporation's QA@CLEF 2007 preparation, participation and results, and describes the improved system and methodology and updates from QA @CLEF 2006.

...read moreread less

Abstract: This paper reports on Language Computer Corporation's QA@CLEF 2007 preparation, participation and results. For this exercise, LCC integrated its open-domain PowerAnswer Question Answering system with its statistical Machine Translation engine. For 2007, LCC participated in the English-to-French and English-to-Portuguese cross-language tasks. The approach is that of intermediate translation, only processing English within the QA system regardless of the input or source languages. The output snippets were then mapped back into the source language documents for the final output of the system and submission. What follows is a description of the improved system and methodology and updates from QA@CLEF 2006.

...read moreread less

Book Chapter•10.1007/978-3-642-04447-2_125•

Overview of CLEF 2008 INFILE pilot track

[...]

Romaric Besançon, Stéphane Chaudiron¹, Djamel Mostefa, Olivier Hamon², Ismaïl Timimi¹, Khalid Choukri - Show less +2 more•Institutions (2)

Lille University of Science and Technology¹, University of Paris²

17 Sep 2008

TL;DR: The INFILE campaign was run for the first time as a pilot track in CLEF 2008 and only had 3 submissions (from one participant) which are presented in this article.

...read moreread less

Abstract: The INFILE campaign was run for the first time as a pilot track in CLEF 2008 Its purpose was the evaluation of cross-language adaptive filtering systems It used a corpus of 300,000 newswires from Agence France Presse (AFP) in three languages: Arabic, English and French, and a set of 50 topics in general and specific domain (scientific and technological information) Due to delays in the organization of the task, the campaign only had 3 submissions (from one participant) which are presented in this article

...read moreread less

Book Chapter•10.1007/978-3-642-04447-2_69•

Some results using different approaches to merge visual and text-based features in CLEF'08 photo collection

[...]

Ana García-Serrano¹, Xaro Benavent², Ruben Granados³, José Miguel Goñi-Menoyo³•Institutions (3)

National University of Distance Education¹, University of Valencia², Technical University of Madrid³

17 Sep 2008

TL;DR: This paper describes the participation of the MIRACLE team at the ImageCLEF Photographic Retrieval task of CLEF 2008 and improves the text-based baseline when applying one of the three merging algorithms, although visual results are lower than textual ones.

...read moreread less

Abstract: This paper describes the participation of the MIRACLE team at the ImageCLEF Photographic Retrieval task of CLEF 2008. We succeeded in submitting 41 runs. Obtained results from text-based retrieval are better than content-based as previous experiments in the MIRACLE team campaigns [5, 6] using different software. Our main aim was to experiment with several merging approaches to fuse text-based retrieval and content-based retrieval results, and it happened that we improve the text-based baseline when applying one of the three merging algorithms, although visual results are lower than textual ones.

...read moreread less

Journal Article•

8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007

[...]

Valentin Jijkoun, Katja Hofmann, David Ahn, Khalid, J. van Rantwijk, M. de Rijke, E.F. Tjong Kim Sang - Show less +3 more

01 Jan 2008-Springer US

Proceedings Article•

Bengali, Hindi and Telugu to English Ad-hoc Bilingual Task.

[...]

Sivaji Bandyopadhyay, Tapabrata Mondal¹, Sudip Kumar Naskar, Asif Ekbal¹, Rejwanul Haque¹, Srinivasa Rao Godhavarthy¹ - Show less +2 more•Institutions (1)

Jadavpur University¹

1 Jan 2008

TL;DR: The experiments carried out at Jadavpur University as part of participation in the CLEF 2007 ad-hoc bilingual task as mentioned in this paper showed that Bengali, Hindi and Telugu were considered as query languages for the retrieval from English document collection.

...read moreread less

Abstract: This paper presents the experiments carried out at Jadavpur University as part of participation in the CLEF 2007 ad-hoc bilingual task. This is our first participation in the CLEF evaluation task and we have considered Bengali, Hindi and Telugu as query languages for the retrieval from English document collection. We have discussed our Bengali, Hindi and Telugu to English CLIR system as part of the ad-hoc bilingual task, English IR system for the ad-hoc monolingual task and the associated experiments at CLEF. Query construction was manual for Telugu-English ad-hoc bilingual task, while it was automatic for all other tasks.

...read moreread less

Book Chapter•10.1007/978-3-642-04447-2_14•

IXA at CLEF 2008 robust-WSD task: using word sense disambiguation for (cross lingual) information retrieval

[...]

Eneko Agirre¹, Arantxa Otegi¹, German Rigau¹•Institutions (1)

University of the Basque Country¹

17 Sep 2008

TL;DR: The main goal was to improve (Cross Lingual) Information Retrieval results using WSD information, and it is attained improvements in both mono and bilingual subtasks, with statistically significant differences on the second.

...read moreread less

Abstract: This paper describes experiments for the CLEF 2008 Robust-WSD task, both for the monolingual (English) and the bilingual (Spanish to English) subtasks. We tried several query and document expansion and translation strategies, with and without the use of the word sense disambiguation results provided by the organizers. All expansions and translations were done using the English and Spanish wordnets as provided by the organizers and no other resource was used. We used Indri as the search engine, which we tuned in the training part. Our main goal was to improve (Cross Lingual) Information Retrieval results using WSD information, and we attained improvements in both mono and bilingual subtasks, with statistically significant differences on the second. Our best systems ranked 4th overall and 3rd overall in the monolingual and bilingual subtasks, respectively.

...read moreread less

Book Chapter•10.1007/978-3-540-85760-0_45•

QA@L2F, First Steps at QA@CLEF

[...]

Ana Cristina Mendes¹, Luísa Coheur¹, Nuno J. Mamede¹, Ricardo Ribeiro¹, Fernando Batista¹, David Martins de Matos¹ - Show less +2 more•Institutions (1)

INESC-ID¹

1 May 2008

TL;DR: [email protected]2F, the question-answering system developed at L2f, INESC-ID follows different strategies according with the question type, and relies strongly on named entity recognition and on the pre-detection of linguistic patterns.

...read moreread less

Abstract: This paper presents [email protected]2F, the question-answering system developed at L2F, INESC-ID. [email protected]2F follows different strategies according with the question type, and relies strongly on named entity recognition and on the pre-detection of linguistic patterns. Each question type is mapped into a single strategy; however, if no answer is found, the system proceeds and tries to find an answer using one of the other strategies.

...read moreread less

Book Chapter•10.1007/978-3-642-04447-2_38•

Using AliQAn in monolingual QA@CLEF 2008

[...]

Sandra Roger¹, Katia Vila¹, Antonio Ferrández¹, María Pardiño¹, José Manuel Gómez¹, Marcel Puchol-Blasco¹, Jesús Pearl¹ - Show less +3 more•Institutions (1)

University of Alicante¹

17 Sep 2008

TL;DR: The main goals of the current version of AliQAn were to deal with topic-related questions and to decrease the number of inexact answers and the use of the Wikipedia corpora.

...read moreread less

Abstract: This paper describes the participation of the system AliQAn in the CLEF 2008 Spanish monolingual QA task This time, the main goals of the current version of AliQAn were to deal with topic-related questions and to decrease the number of inexact answers We have also explored the use of the Wikipedia corpora, which have posed some new challenges for the QA task

...read moreread less

Book Chapter•10.1007/978-3-642-04447-2_61•

Robust question answering for speech transcripts: UPC experience in QAst 2008

[...]

Pere R. Comas¹, Jordi Turmo¹•Institutions (1)

Polytechnic University of Catalonia¹

17 Sep 2008

TL;DR: This paper describes the participation of the Technical University of Catalonia in the CLEF 2008 Question Answering on Speech Transcripts track and performs a detailed analysis of the results and draws conclusions relating QA performance to word error rate in transcripts.

...read moreread less

Abstract: This paper describes the participation of the Technical University of Catalonia in the CLEF 2008 Question Answering on Speech Transcripts track. We have participated in the English and Spanish scenarios of QAst. For the processing of manual transcripts we have deployed a robust factoid Question Answering that uses minimal syntactic information. For the handling of automatic transcripts we modify the QA system with a Passage Retrieval and Answer Extraction engine based on a sequence alignment algorithm that searches for "sounds like" sequences. We perform a detailed analysis of our results and draw conclusions relating QA performance to word error rate in transcripts.

...read moreread less

Book Chapter•10.1007/978-3-642-04447-2_43•

Ihardetsi: a Basque question answering system at QA@CLEF 2008

[...]

Olatz Ansa¹, Xabier Arregi¹, Arantxa Otegi¹, Ander Soraluze¹•Institutions (1)

University of the Basque Country¹

17 Sep 2008

TL;DR: Ihardetsi, a question answering system for Basque, is described, a machine translation system that first processes a question in the source language, then translates it into the target language and sends the obtained Basque question as input to the monolingual module.

...read moreread less

Abstract: This paper describes Ihardetsi, a question answering system for Basque. We present the results of our first participation in the [email protected] 2008 evaluation task. We participated in three subtasks using Basque, English and Spanish as source languages, and Basque as target language. We approached the Spanish-Basque and English-Basque cross-lingual tasks with a machine translation system that first processes a question in the source language (i.e. Spanish, English), then translates it into the target language (i.e. Basque) and, finally, sends the obtained Basque question as input to the monolingual module.

...read moreread less

Book Chapter•10.1007/978-3-540-85760-0_90•

What Can and Cannot Be Found in Czech Spontaneous Speech Using Document-Oriented IR Methods -- UWB at CLEF 2007 CL-SR Track

[...]

Pavel Ircing¹, Josef Psutka¹, Jan Vavruška¹•Institutions (1)

University of West Bohemia¹

1 May 2008

TL;DR: An overview of the system build and experiments performed for the CLEF 2007 CL-SR track by the University of West Bohemia is presented and a more detailed analysis of the results is provided.

...read moreread less

Abstract: The paper presents an overview of the system build and experiments performed for the CLEF 2007 CL-SR track by the University of West Bohemia. We have concentrated on the monolingual experiments using the Czech collection only. The approach that was successfully employed by our team in the last year's campaign (simple tf.idf model with blind relevance feedback, accompanied with solid linguistic preprocessing) was used again but the set of performed experiments was broadened and a more detailed analysis of the results is provided.

...read moreread less

Book Chapter•10.1007/978-3-642-04447-2_12•

Cross language experiments at Persian@CLEF 2008

[...]

Abolfazl AleAhmad¹, Ehsan Kamalloo¹, Arash Zareh¹, Masoud Rahgozar¹, Farhad Oroumchian² - Show less +1 more•Institutions (2)

University of Tehran¹, University of Wollongong in Dubai²

17 Sep 2008

TL;DR: This study discusses two teams from University of Tehran involved in cross language text retrieval part of the track using two different CLIR approaches that are query translation and document translation and creates a Hybrid CLIR system by score-based merging of the two retrieval system results.

...read moreread less

Abstract: In this study we will discuss our cross language text retrieval experiments of Persian ad hoc track at CLEF 2008. Two teams from University of Tehran were involved in cross language text retrieval part of the track using two different CLIR approaches that are query translation and document translation. For query translation we use a method named Combinatorial Translation Probability (CTP) calculation for estimation of translation probabilities. In the document translation part, we use the Shiraz machine translation system for translation of documents into English. Then we create a Hybrid CLIR system by score-based merging of the two retrieval system results. In addition, we investigated N-grams and a light stemmer in our monolingual experiments.

...read moreread less

From CLEF to TrebleCLEF: the Evolution of the Cross-Language Evaluation Forum.

[...]

Nicola Ferro, Carol Peters

1 Jan 2008

TL;DR: A critical assessment of the results achieved so far of the Cross-Language Evaluation Forum and the main ideas for the future of CLEF are outlined.

...read moreread less

Abstract: TheCross-LanguageEvaluation Forum (CLEF) has been running for nearly ten years now; the aim of this paper is to provide a critical assessment of the results achieved so far. In the first part of the paper, we provide a brief overview of the entire activity and summarise the main achievements; in the second part, we focus our attention on the Ad Hoc track with the aim of showing how the results of evaluation can be exploited to increase understanding of the many issues involved in multilingual retrieval system development. In the final part, we outline our main ideas for the future of CLEF.

...read moreread less