TL;DR: The x-ray task was the only fully novel task this year, although the other three tasks introduced modifications to keep up relevancy of the proposed challenges.
Abstract: This paper presents an overview of the ImageCLEF 2015 evaluation campaign, an event that was organized as part of the CLEF labs 2015 ImageCLEF is an ongoing initiative that promotes the evaluation of technologies for annotation, indexing and retrieval for providing information access to databases of images in various usage scenarios and domains In 2015, the 13th edition of ImageCLEF, four main tasks were proposed: 1 automatic concept annotation, localization and sentence description generation for general images; 2 identification, multi-label classification and separation of compound figures from biomedical literature; 3 clustering of x-rays from all over the body; and 4 prediction of missing radiological annotations in reports of liver CT images The x-ray task was the only fully novel task this year, although the other three tasks introduced modifications to keep up relevancy of the proposed challenges The participation was considerably positive in this edition of the lab, receiving almost twice the number of submitted working notes papers as compared to previous years
TL;DR: In this article, a user interface for a virtual musical instrument presents a number of chord touch regions, each corresponding to a chord of a diatonic key, including treble clef and bass clef touch zones.
Abstract: A user interface for a virtual musical instrument presents a number of chord touch regions, each corresponding to a chord of a diatonic key. Within each chord region a number of touch zones are provided, including treble clef zones and bass clef zones. Each treble clef touch zone within a region will sound a different chord voicing. Each bass clef touch zone will sound a bass note of the chord. Other user interactions can modify or mute the chords, and vary the bass notes being played together with the chords. A set of related chords and/or a set of rhythmic patterns can be generated based on a selected instrument and a selected style of music.
TL;DR: This paper reports on Task 1b of the 2015 CLEF eHealth evaluation lab which extended the previous information extraction tasks of ShARe/CLEF e health evaluation labs by considering ten types of en- tities including disorders, that were to be extracted from biomedical text in French.
Abstract: This paper reports on Task 1b of the 2015 CLEF eHealth evaluation lab which extended the previous information extraction tasks of ShARe/CLEF eHealth evaluation labs by considering ten types of en- tities including disorders, that were to be extracted from biomedical text in French. The task consisted of two phases: entity recognition (phase 1), in which participants could supply plain or normalized entities, and entity normalization (phase 2). The entities to be extracted were dened according to Semantic Groups in the Unied
TL;DR: The Living Labs for Information Retrieval Evaluation LL4IR CLEF Lab as mentioned in this paper provides a benchmarking platform for researchers to evaluate their ranking systems in a live setting with real users in their natural task environments.
Abstract: In this paper we report on the first Living Labs for Information Retrieval Evaluation LL4IR CLEF Lab. Our main goal with the lab is to provide a benchmarking platform for researchers to evaluate their ranking systems in a live setting with real users in their natural task environments. For this first edition of the challenge we focused on two specific use-cases: product search and web search. Ranking systems submitted by participants were experimentally compared using interleaved comparisons to the production system from the corresponding use-case. In this paper we describe how these experiments were performed, what the resulting outcomes are, and conclude with some lessons learned.
TL;DR: An unsupervised method for the semantic analysis of questions, that generates queries, based on graph transformations, in two steps, that makes use of very general constraints on the query structure that allows to maintain semantic ambiguities in different graphs.
Abstract: For our participation to QALD-5, we developed a system for answering questions on a knowledge base. We proposed an unsupervised method for the semantic analysis of questions, that generates queries, based on graph transformations, in two steps. First step is independent of the knowledge base schema and makes use of very general constraints on the query structure that allows us to maintain semantic ambiguities in different graphs. Ambiguities are then solved globally at the final step when querying the knowledge base.
TL;DR: A novel way of computing the type/token ratio of an author is introduced and it is shown that, although strong correlations have been observed between high extroversion and low type/ token ratios in the past, this ratio is not necessarily a strong indicator of extrovertedness.
Abstract: In this paper we go through our approach at solving the PAN Author Profiling task. We introduce a novel way of computing the type/token ratio of an author and show that, although strong correlations have been observed between high extroversion and low type/token ratios in the past, this ratio is not necessarily a strong indicator of extroversion. Since the text of a person is influenced by all 7 features (gender, age, and big five personality traits) that are required to be automatically identified in this task, we used this ratio, along with Term frequency-Inverse document frequency (tf-idf ) matrices, in all 7 subtasks and all 4 corpora and obtained good results.
TL;DR: The Entrance Exams task at the CLEF QA Track 2014 is described, requiring not only a high degree of textual inference, but also the development of strate- gies for selecting the correct answer.
Abstract: This paper describes the Entrance Exams task at the CLEF QA Track 2015. Following the last two editions, the data set has been extracted from actu- al university entrance examinations including a variety of topics and question types. Systems receive a set of Multiple-Choice Reading Comprehension tests where the task is to select the correct answer among a finite set of candidates, according to the given text. Questions are designed originally for testing human examinees, rather than evaluating computer systems. Therefore, the data set challenges human ability to show their understanding of texts. Thus, questions and answers are lexically distant from their supporting excerpts in text, requir- ing not only a high degree of textual inference, but also the development of strategies for selecting the correct answer.
TL;DR: The approach for construction of a monolingual Persian plagia- rism corpus that can be used to evaluate the performance of Persian plagiarism detection systems is described.
Abstract: The task of text alignment corpus construction at PAN 2015 competi- tion consists of preparing a plagiarism corpus so that it can provide various ob- fuscation types and versatile obfuscation degrees. Meanwhile, its format and metadata structure should follow previous PAN plagiarism corpora. In this pa- per, we describe our approach for construction of a monolingual Persian plagia- rism corpus that can be used to evaluate the performance of Persian plagiarism detection systems.
TL;DR: This paper describes and evaluates an unsupervised authorship verification model called SPATIUM-L1, which can be adapted without any problem to different languages (such as Dutch, English, Greek, and Spanish).
Abstract: This paper describes and evaluates an unsupervised authorship verification model called SPATIUM-L1. The suggested strategy can be adapted without any problem to different languages (such as Dutch, English, Greek, and Spanish) with their genre and topic differ significantly. As features, we suggest using the k most frequent terms of the disputed text (isolated words and punctuation symbols with k may vary from 200 to 300). Applying a simple distance measure and a set of impostors, we determine whether or not the disputed text was written by the proposed author. Moreover, based on a simple rule, we can define when there is enough evidence to propose an answer with a high degree of confidence or when the attribution scheme is given without certainty. The evaluations are based on four test collections (PAN AUTHOR IDENTIFICATION task at CLEF 2015).
TL;DR: This study presents a historical overview of 15 years of QA evaluation tracks using the method of systematic review, and examined the different tasks or specific labs created in each QA track, the types of evaluation question used, as well as the evaluation measures used in the different competitions analyzed.
Abstract: Question Answering (QA) Systems are put forward as a real alternative to Information Retrieval systems as they provide the user with a fast and comprehensible answer to his or her information need. It has been 15 years since TREC introduced the first QA track. The principal campaigns in the evaluation of Information Retrieval have been specific tracks focusing on the development and evaluation of this type of system. This study is a brief review of the TREC, CLEF and NTCIR Conferences from the QA perspective. We present a historical overview of 15 years of QA evaluation tracks using the method of systematic review. We have examined identified the different tasks or specific labs created in each QA track, the types of evaluation question used, as well as the evaluation measures used in the different competitions analyzed. Of the conferences, it is CLEF that has applied the greater variety of types of test question (factoid, definition, list, causal, yes/no, amongst others). NTCIR, held on 13 occasions, is the conference which has made use of a greater number of different evaluation measures. Accuracy, precision and recall have been the three most used evaluation measures in the three campaigns.
TL;DR: A novel method to recognize clinical entities based on conditional random fields (CRF) based on WI-ENRE system, which is effective in the named entity recognition of biomedical texts.
Abstract: Named entity recognition of biomedical text is the shared task 1b of the 2015 CLEF eHealth evaluation lab, which focuses on making biomedical text easier to understand for patients and clinical workers. In this paper, we propose a novel method to recognize clinical entities based on conditional random fields (CRF). The biomedical texts are split into sections and paragraphs. Then the NLP tools are used for POS tagging and parsing, and four groups of features are ex- tracted to train the entity recognition model. In the subsequent phase for entity normalization, the MetaMap of Unified Medical Language System (UMLS) tool is used to search for concept unique identifiers (CUIs) category. In addition, CRF++ package is adopted to recognize clinical entities in another phase for en- tity recognition. The experiments show that our system named as WI-ENRE, is effective in the named entity recognition of biomedical texts. The Fmeasure of EMEA and MEDLINE reach to 0.56 and 0.45 respectively in exact match.
TL;DR: This article used word-based and style-based features to identify the differences between the known and unknown problems of one given set and label the unknown ones accordingly using a Random Forest based classifier.
Abstract: Authorship attribution, being an important problem in many areas including information retrieval, computational linguistics, law and journalism etc., has been identified as a subject of increasingly research interest in the recent years. In case of Author Identification task in PAN at CLEF 2015, the main focus was given on cross-genre and cross-topic author verification tasks. We have used several word-based and style-based features to identify the differences between the known and unknown problems of one given set and label the unknown ones accordingly using a Random Forest based classifier.
TL;DR: The proposed method is based on named entity recognition, part-of-speech weighting and sentence quality measuring, and an algorithm from smoothing from the local context for sentence reordering for tweet contextualization.
Abstract: This paper presents the approach we developed for automatic multi-document summarization applied to short message contextualization, in particular to tweet contextualization. The proposed method is based on named entity recognition, part-of-speech weighting and sentence quality measuring. In contrast to previous research, we introduced an algorithm from smoothing from the local context. Our approach exploits topic-comment structure of a text. Moreover, we developed a graph-based algorithm for sentence reordering. The method has been evaluated at INEX/CLEF tweet contextualization track. We provide the evaluation results over the 4 years of the track. The method was also adapted to snippet retrieval and query expansion. The evaluation results indicate good performance of the approach.
TL;DR: A Web-based query expansion model and a learning-to-rank algorithm to better understand and satisfy the CLEF eHealth Task 2.0 is proposed.
Abstract: This paper presents our work on the 2015 CLEF eHealth Task 2. In particular, we propose a Web-based query expansion model and a learning-to-rank algorithm to better understand and satisfy the task.
TL;DR: The paper describes the approach for the Authorship Identification task at the PAN CLEF 2015, which uses a predefined threshold in order to decide if the unknown document is written by the known author or not.
Abstract: The paper describes our approach for the Authorship Identification task at the PAN CLEF 2015. We extract textual patterns based on features ob- tained from shortest path walks over Integrated Syntactic Graphs (ISG). Then we calculate a similarity between the unknown document and the known document with these patterns. The approach uses a predefined threshold in order to decide if the unknown document is written by the known author or not.
TL;DR: This paper reports on the participation in the clinical named entity recognition task of the CLEF eHealth 2015 evaluation initiative i.e. to fully automatically identify clinically relevant entities in medical text in French using several biomedical knowledge organization systems containing terms and their variations already in French.
Abstract: In this paper, we report on our participation in the clinical named entity recognition task of the CLEF eHealth 2015 evaluation initiative i.e. to fully automatically identify clinically relevant entities in medical text in French. We address the task by using several biomedical knowledge organization systems (KOS) containing terms and their variations already in French or that we have partially translated in the context of existing projects. The extraction method is available online in the form a web-based service that requests the KOS to extract clinical concepts from Electronic Health Records. It is also available via a user-friendly interface developed for clinicians. Our system has not obtained good results in inexact matching against the gold standard. However, this first participation allowed us to analyze our system and method and will allow us to
TL;DR: This paper describes the CLEF QA Track 2015, which was divided into four tasks: i QALD: focused on translating natural language questions into SPARQL; ii Entrance Exams:focused on answering questions to assess machine reading capabilities; iii BioASQ1 focused on large-scale semantic indexing and iv BioASZ2 for Question Answering in the biomedical domain.
Abstract: This paper describes the CLEF QA Track 2015. Following the scenario stated last year for the CLEF QA Track, the starting point for accessing information is always a Natural Language question. However, answering some questions may need to query Linked Data especially if aggregations or logical inferences are required, some questions may need textual inferences and querying free-text, and finally, answering some queries may require both sources of information. In this edition, the Track was divided into four tasks: i QALD: focused on translating natural language questions into SPARQL; ii Entrance Exams: focused on answering questions to assess machine reading capabilities; iii BioASQ1 focused on large-scale semantic indexing and iv BioASQ2 for Question Answering in the biomedical domain.
TL;DR: This paper presents the experience implementing a homotopy-based classification (HBC) system for the ‘PAN 2015 Author Identification’ and embedded into the General Impostor Method resulting in an ensemble of the SBC model.
Abstract: This paper presents our experience implementing a homotopy-based classification (HBC) system for the ‘PAN 2015 Author Identification’ [20]. Known documents from a specific author and randomly selected impostor documents are used as a dictionary to generate a contested document. Given the contribution of the known documents to the contested document we can verify the authorship of the document. This classification is embedded into the General Impostor Method resulting in an ensemble of the SBC model.
TL;DR: This article used syntactic features such as syntactic based n-grams of various types in order to predict the age, gender and personality traits that has the author of a given text.
Abstract: This paper describes our approach to tackle the Author Profiling task at PAN 2015. Our method relies on syntactic features, such as syntactic based n-grams of various types in order to predict the age, gender and personality traits that has the author of a given text. In this paper, we describe the used features, the employed classification algorithm, and other general ideas concerning the experiments we conducted.
TL;DR: A binary linear classifier is trained both on the features describing known and unknown documents individually and on the joint features comparing these two types of documents, resulting in competitive results that outperform the baseline and position the system among the top PAN shared task participants.
Abstract: We present a simple and effective approach to authorship verifica- tion for Dutch, English, Spanish and Greek, which can be easily ported to yet other languages We train a binary linear classifier both on the features describing known and unknown documents individually, and on the joint features comparing these two types of documents The list of feature types includes, among others, character n-grams, the lexical overlap, visual text properties and a compression measure We obtain competitive results that outperform the baseline and position our system among the top PAN shared task participants
TL;DR: The Living Labs for Information Retrieval (LL4IR) lab was held for the first time at CLEF and GESIS participated in this pilot evaluation and the system that is based on the Solr search engine and includes a re-reranking based on historical click data was described.
Abstract: The Living Labs for Information Retrieval (LL4IR) lab was held for the first time at CLEF and GESIS participated in this pilot evaluation. We took part in the product search task and describe our system that is based on the Solr search engine and includes a re-reranking based on historical click data. This brief workshop note also includes some preliminary results, discussion and some lessons learned.
TL;DR: This paper describes the participants' participation at the BioASQ Task 3b of CLEF 2015 Question Answering track and the ideal answer generation subtask in Phase B, where relevant passages are selected and combined to produce answer text.
Abstract: This paper describes our participation at the BioASQ Task 3b of CLEF 2015 Question Answering track. We participated at the document retrieval sub- task in Phase A and the ideal answer generation subtask in Phase B. As of previ- ous year, in the document retrieval task, we mostly experimented with semantic concept-enriched dependence model and sequential dependence model. In the ideal answer generation task, relevant passages are selected and combined to au- tomatically produce answer text.
TL;DR: The proposal rely on a biased fusion of content-only retrieval, using BM25F and LGD retrieval models, user non-social profile based on the catalog of the requester, and social profiles using user/user links generated from their catalogs and ratings on books.
Abstract: This paper describes the work achieved by the MRIM research group of Grenoble, using some data from the LaHC of Saint-´ Etienne, in a way to test personalized retrieval of books for the Social Book Search Lab of CLEF 2015. Our proposal rely on a biased fusion of content-only retrieval, using BM25F and LGD retrieval models, user non-social profile based on the catalog of the requester, and social profiles using user/user links generated from their catalogs and ratings on books. The official results obtained show a clear positive impact of user profile, and a small positive impact of the social elements we used. Post official results that present non biased fusion scores are also presented.
TL;DR: The authors' system retrieves passages relevant to the question, through lexical expansion involving WordNet and word vectors, then a tree edit model is used on graph representations of the passages and answer choices to extract edit se- quences.
Abstract: This paper describes our participation to the Entrance Ex- ams Task of CLEF 2015's Question Answering Track. The goal is to an- swer multiple-choice questions on short texts. Our system rst retrieves passages relevant to the question, through lexical expansion involving WordNet and word vectors. Then a tree edit model is used on graph representations of the passages and answer choices to extract edit se- quences. Finally, features are computed from those edit sequences and used in various machine-learned models to take the nal decision. We submitted several runs in the task, one of which yielding a c@1 of 0.36, which makes our team the second best on the task.
TL;DR: The properties of the multilingual software submitted for PAN2015 which recognizes the age, gender and personality traits of Twitter users in four languages, namely, English, Spanish, Dutch and Italian are described.
Abstract: User generated text on social media sites is a rich source of information that can be used to identify different aspects of their authors. Proper mining of this content provides an automatic way of identifying users which is very valuable for applications that rely on personalisation. In this work, we describe the properties of our multilingual software submitted for PAN2015 which recognizes the age, gender and personality traits of Twitter users in four languages, namely, English, Spanish, Dutch and Italian.
TL;DR: This book constitutes the refereed proceedings of the 6th International Conference of the CLEF Initiative, CLEF 2015, held in Toulouse, France, in September 2015, and covers a broad range of issues in the fields of multilingual and multimodal information access evaluation.
Abstract: This book constitutes the refereed proceedings of the 6th International Conference of the CLEF Initiative, CLEF 2015, held in Toulouse, France, in September 2015. The 31 full papers and 20 short papers presented were carefully reviewed and selected from 68 submissions. They cover a broad range of issues in the fields of multilingual and multimodal information access evaluation, also included are a set of labs and workshops designed to test different aspects of mono and cross-language information retrieval systems.
TL;DR: This article used personality traits alongside age and gender in a corpus of tweets in four languages (English, Spanish, Italian and Dutch) for the PAN 2015 Author Profiling Challenge by the team from Xerox Research Centre Europe (XRCE).
Abstract: This technical notebook describes the methodology used – and results achieved – for the PAN 2015 Author Profiling Challenge by the team from Xerox Research Centre Europe (XRCE). This year, personality traits are introduced alongside age and gender in a corpus of tweets in four languages – English, Spanish, Italian and Dutch. We describe a largely language agnostic methodology for classification which uses language specific linguistic processing to generate features. We also report on experiments in which we use machine translation to accommodate for languages in which there is less training data. Native language results are successful, but socio-demographic signals in language seem to be lost under MT conditions.
TL;DR: This paper describes and evaluates an effective author profiling model called SPATIUM-L1, which can be adapted without any problem to different languages (such as Dutch, English, Italian, and Spanish) in Twitter tweets.
Abstract: This paper describes and evaluates an effective author profiling model called SPATIUM-L1. The suggested strategy can be adapted without any problem to different languages (such as Dutch, English, Italian, and Spanish) in Twitter tweets. As features, we suggest using the 200 most frequent terms of the query text (isolated words and punctuation symbols). Applying a simple distance measure and looking at the three nearest neighbors, we can determine the gender (with the nominal values male and female), the age group (with the ordinal measurement 18-24|25-34|35-49|>50), and the Big Five personality traits (extraversion, neuroticism, agreeableness, conscientiousness, and openness on an interval scale containing eleven items). Evaluations are based on four test collections (PAN AUTHOR PROFILING task at CLEF 2015).
TL;DR: A social feature re-ranking system is built based on a full-text search engine and a set of rules to filtering out unnecessary books from the recommendation list are defined.
Abstract: The Social Book Search (SBS) Lab is part of CLEF 2015 lab series. This is the third time that the CYUT CSIE team attends the SBS track. Based on a full-text search engine, we build a social feature re-ranking system and introduce more knowledge on understanding the queries. We defined a set of rules to filtering out unnecessary books from the recommendation list. The official run results show that the system performance is improved from our previous system.
TL;DR: This paper proposes a probabilistic model for product search based on the intuition that descriptive fields and the category might fit with the query and addresses head queries that are frequently submitted on e-commerce Web sites.
Abstract: We describe in this paper our participation in the product search task of LL4IR CLEF 2015 Lab. This task aims to evaluate, with living labs protective point of view, the retrieval effectiveness over e-commerce search engines. During the online shopping process, users would search for interesting products and quickly access those that fit with their needs among a long tail of similar or closely related products. Our contribution addresses head queries that are frequently submitted on e-commerce Web sites. Head queries usually target featured products with several variations, accessories, and complementary products. We propose a probabilistic model for product search based on the intuition that descriptive fields and the category might fit with the query. Fi-naly, we present results obtained during the second round of the product search task.