Australasian Document Computing Symposium

About: Australasian Document Computing Symposium is an academic conference. The conference publishes majorly in the area(s): Relevance (information retrieval) & Ranking (information retrieval). Over the lifetime, 292 publications have been published by the conference receiving 2696 citations.

...read moreread less

Topics: Relevance (information retrieval), Ranking (information retrieval), Computer science, Web search query, Query expansion ...read more

Conference Tools

Create Scientific Poster

Create Conference poster

Create Presentation with AI

Papers published on a yearly basis

2022
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1997
1993

Papers

Proceedings Article•10.1145/2682862.2682863•

Improvements to BM25 and Language Models Examined

[...]

Andrew Trotman¹, Antti Puurula², Blake Burgess¹•Institutions (2)

University of Otago¹, University of Waikato²

26 Nov 2014

TL;DR: This investigation finds that once trained (using particle swarm optimization) there is very little difference in performance between these functions, that relevance feedback is effective, that stemming is effective and that it remains unclear which function is best over-all.

...read moreread less

Abstract: Recent work on search engine ranking functions report improvements on BM25 and Language Models with Dirichlet Smoothing. In this investigation 9 recent ranking functions (BM25, BM25+, BM25T, BM25-adpt, BM25L, TF1°δ°p×ID, LM-DS, LM-PYP, and LM-PYP-TFIDF) are compared by training on the INEX 2009 Wikipedia collection and testing on INEX 2010 and 9 TREC collections. We find that once trained (using particle swarm optimization) there is very little difference in performance between these functions, that relevance feedback is effective, that stemming is effective, and that it remains unclear which function is best over-all.

...read moreread less

214 citations

Book Chapter•10.1142/9789814350976_0002•

The Limit Cycle Instability in Dwarf Nova Accretion Disks

[...]

John K. Cannizzo

1 Dec 1993

205 citations

Proceedings Article•

On Being Here to Stay: Treaties and Aboriginal Rights in Canada

[...]

Vanessa Sloan Morgan

24 Nov 2015

178 citations

Proceedings Article•10.1145/2838931.2838934•

CQADupStack: A Benchmark Data Set for Community Question-Answering Research

[...]

Doris Hoogeveen¹, Karin Verspoor¹, Timothy Baldwin¹•Institutions (1)

University of Melbourne¹

8 Dec 2015

TL;DR: This paper presents a benchmark dataset, CQADupStack, for use in community question-answering (cQA) research, which contains threads from twelve StackExchange subforums, annotated with duplicate question information.

...read moreread less

Abstract: This paper presents a benchmark dataset, CQADupStack, for use in community question-answering (cQA) research. It contains threads from twelve StackExchange subforums, annotated with duplicate question information. We provide pre-defined training and test splits, both for retrieval and classification experiments, to ensure maximum comparability between different studies using the set. Furthermore, it comes with a script to manipulate the data in various ways. We give an analysis of the data in the set, and report benchmark results on a duplicate question retrieval task using well established retrieval models.

...read moreread less

114 citations

Proceedings Article•

External evaluation of topic models

[...]

David Newman¹, Sarvnaz Karimi¹, Lawrence Cavedon¹•Institutions (1)

NICTA¹

7 Dec 2009

TL;DR: The authors' PMI score, computed using word-pair co-occurrence statistics from external data sources, has relatively good agreement with human scoring and it is shown that the ability to identify less useful topics can improve the results of a topic-based document similarity metric.

...read moreread less

Abstract: Topic models can learn topics that are highly interpretable, semantically-coherent and can be used similarly to subject headings. But sometimes learned topics are lists of words that do not convey much useful information. We propose models that score the usefulness of topics, including a model that computes a score based on pointwise mutual information (PMI) of pairs of words in a topic. Our PMI score, computed using word-pair co-occurrence statistics from external data sources, has relatively good agreement with human scoring. We also show that the ability to identify less useful topics can improve the results of a topic-based document similarity metric.

...read moreread less

108 citations