TRECVID

Topic Tools

Papers published on a yearly basis

Papers

Proceedings Article•10.1145/1282280.1282340•

Representing shape with a spatial pyramid kernel

Anna Bosch¹, Andrew Zisserman², X. Munoz¹•Institutions (2)

University of Girona¹, University of Oxford²

9 Jul 2007

TL;DR: This work introduces a descriptor that represents local image shape and its spatial layout, together with a spatial pyramid kernel that is designed so that the shape correspondence between two images can be measured by the distance between their descriptors using the kernel.

Abstract: The objective of this paper is classifying images by the object categories they contain, for example motorbikes or dolphins. There are three areas of novelty. First, we introduce a descriptor that represents local image shape and its spatial layout, together with a spatial pyramid kernel. These are designed so that the shape correspondence between two images can be measured by the distance between their descriptors using the kernel. Second, we generalize the spatial pyramid kernel, and learn its level weighting parameters (on a validation set). This significantly improves classification performance. Third, we show that shape and appearance kernels may be combined (again by learning parameters on a validation set). Results are reported for classification on Caltech-101 and retrieval on the TRECVID 2006 data sets. For Caltech-101 it is shown that the class specific optimization that we introduce exceeds the state of the art performance by more than 10%.

1,635 citations
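
As a rough illustration of the idea in the abstract above (histograms computed over a spatial pyramid and compared with per-level weights), here is a minimal Python sketch. It is not the authors' implementation: the chi-squared distance, the exponential kernel form, the `beta` parameter, and all function names are assumptions made for this example, and the level weights are simply passed in rather than learned on a validation set.

```python
import math

def pyramid_histograms(points, n_bins, levels):
    """Build one normalized histogram per pyramid level.

    points: list of (x, y, bin_index), with x and y in [0, 1) and
    bin_index in [0, n_bins). Level l splits the image into
    2**l x 2**l cells; each cell has its own n_bins histogram and
    the cells of a level are concatenated into one vector.
    """
    pyramid = []
    for level in range(levels + 1):
        cells = 2 ** level
        hist = [0.0] * (cells * cells * n_bins)
        for x, y, b in points:
            cx = min(int(x * cells), cells - 1)
            cy = min(int(y * cells), cells - 1)
            hist[(cy * cells + cx) * n_bins + b] += 1.0
        total = sum(hist)
        if total > 0:
            hist = [v / total for v in hist]
        pyramid.append(hist)
    return pyramid

def chi2(h1, h2):
    """Chi-squared distance between two histograms (assumed distance)."""
    return 0.5 * sum((a - b) ** 2 / (a + b) for a, b in zip(h1, h2) if a + b > 0)

def pyramid_kernel(p1, p2, level_weights, beta=1.0):
    """Weighted sum of per-level distances, mapped to a similarity in (0, 1]."""
    d = sum(w * chi2(a, b) for w, a, b in zip(level_weights, p1, p2))
    return math.exp(-d / beta)

# Two toy "images" with the same global histogram but a different layout.
image_a = pyramid_histograms([(0.1, 0.1, 0), (0.9, 0.9, 1)], n_bins=2, levels=1)
image_b = pyramid_histograms([(0.9, 0.1, 0), (0.1, 0.9, 1)], n_bins=2, levels=1)
similarity = pyramid_kernel(image_a, image_b, level_weights=[0.5, 0.5])
```

In this toy case the level-0 histograms are identical, so only the finer level separates the two images; raising the weight of that level makes the kernel more sensitive to spatial layout.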

Proceedings Article•10.1145/1178677.1178722•

Evaluation campaigns and TRECVid

Alan F. Smeaton¹, Paul Over², Wessel Kraaij•Institutions (2)

Dublin City University¹, National Institute of Standards and Technology²

26 Oct 2006

TL;DR: An introduction to information retrieval (IR) evaluation from both a user and a system perspective is given, highlighting that system evaluation is by far the most prevalent type of evaluation carried out.

Abstract: The TREC Video Retrieval Evaluation (TRECVid) is an international benchmarking activity to encourage research in video information retrieval by providing a large test collection, uniform scoring procedures, and a forum for organizations interested in comparing their results. TRECVid completed its fifth annual cycle at the end of 2005 and in 2006 TRECVid will involve almost 70 research organizations, universities and other consortia. Throughout its existence, TRECVid has benchmarked both interactive and automatic/manual searching for shots from within a video corpus, automatic detection of a variety of semantic and low-level video features, shot boundary detection and the detection of story boundaries in broadcast TV news. This paper will give an introduction to information retrieval (IR) evaluation from both a user and a system perspective, highlighting that system evaluation is by far the most prevalent type of evaluation carried out. We also include a summary of TRECVid as an example of a system evaluation benchmarking campaign and this allows us to discuss whether such campaigns are a good thing or a bad thing. There are arguments for and against these campaigns and we present some of them in the paper, concluding that on balance they have had a very positive impact on research progress.

1,473 citations

Proceedings Article•10.1145/1290082.1290111•

Evaluating bag-of-visual-words representations in scene classification

Jun Yang¹, Yu-Gang Jiang², Alexander G. Hauptmann¹, Chong-Wah Ngo²•Institutions (2)

Carnegie Mellon University¹, City University of Hong Kong²

24 Sep 2007

TL;DR: This study provides an empirical basis for designing visual-word representations that are likely to produce superior classification performance and applies techniques used in text categorization to generate image representations that differ in the dimension, selection, and weighting of visual words.

Abstract: Based on keypoints extracted as salient image patches, an image can be described as a "bag of visual words" and this representation has been used in scene classification. The choice of dimension, selection, and weighting of visual words in this representation is crucial to the classification performance but has not been thoroughly studied in previous work. Given the analogy between this representation and the bag-of-words representation of text documents, we apply techniques used in text categorization, including term weighting, stop word removal, feature selection, to generate image representations that differ in the dimension, selection, and weighting of visual words. The impact of these representation choices to scene classification is studied through extensive experiments on the TRECVID and PASCAL collection. This study provides an empirical basis for designing visual-word representations that are likely to produce superior classification performance.

980 citations
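
To make the text analogy in the abstract above concrete, the following Python sketch quantizes keypoint descriptors against a codebook of visual words, builds a count vector per image, and applies tf-idf term weighting. It is only an illustration under stated assumptions: the tiny two-word codebook, the squared Euclidean assignment, the particular tf-idf formula, and all function names are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter

def nearest_word(descriptor, codebook):
    """Index of the codebook centroid closest to the descriptor (squared Euclidean)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((d - c) ** 2 for d, c in zip(descriptor, codebook[i])),
    )

def bag_of_words(descriptors, codebook):
    """Count how many descriptors of one image fall on each visual word."""
    counts = Counter(nearest_word(d, codebook) for d in descriptors)
    return [counts.get(i, 0) for i in range(len(codebook))]

def tf_idf(count_vectors):
    """Weight counts by term frequency times inverse document frequency."""
    n_images = len(count_vectors)
    n_words = len(count_vectors[0])
    # Document frequency: number of images in which each word occurs.
    df = [sum(1 for v in count_vectors if v[w] > 0) for w in range(n_words)]
    weighted = []
    for v in count_vectors:
        total = sum(v)
        row = []
        for w in range(n_words):
            tf = v[w] / total if total else 0.0
            idf = math.log(n_images / df[w]) if df[w] else 0.0
            row.append(tf * idf)
        weighted.append(row)
    return weighted

# Hypothetical 2-D descriptors and a two-word codebook, for illustration only.
codebook = [(0.0, 0.0), (10.0, 10.0)]
image_1 = bag_of_words([(1.0, 0.0), (0.0, 1.0), (9.0, 9.0)], codebook)
image_2 = bag_of_words([(1.0, 1.0), (0.0, 0.0)], codebook)
weights = tf_idf([image_1, image_2])
```

With this weighting, a visual word that occurs in every image receives weight zero, which is loosely comparable to the effect of stop word removal mentioned in the abstract.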

Proceedings Article•10.1145/1180639.1180727•

The challenge problem for automated detection of 101 semantic concepts in multimedia

Cees G. M. Snoek¹, Marcel Worring¹, Jan C. van Gemert¹, Jan-Mark Geusebroek¹, Arnold W. M. Smeulders¹•Institutions (1)

University of Amsterdam¹

23 Oct 2006

TL;DR: The challenge problem for generic video indexing is introduced to gain insight in intermediate steps that affect performance of multimedia analysis methods, while at the same time fostering repeatability of experiments.

Abstract: We introduce the challenge problem for generic video indexing to gain insight in intermediate steps that affect performance of multimedia analysis methods, while at the same time fostering repeatability of experiments. To arrive at a challenge problem, we provide a general scheme for the systematic examination of automated concept detection methods, by decomposing the generic video indexing problem into 2 unimodal analysis experiments, 2 multimodal analysis experiments, and 1 combined analysis experiment. For each experiment, we evaluate generic video indexing performance on 85 hours of international broadcast news data, from the TRECVID 2005/2006 benchmark, using a lexicon of 101 semantic concepts. By establishing a minimum performance on each experiment, the challenge problem allows for component-based optimization of the generic indexing issue, while simultaneously offering other researchers a reference for comparison during indexing methodology development. To stimulate further investigations in intermediate analysis steps that influence video indexing performance, the challenge offers to the research community a manually annotated concept lexicon, pre-computed low-level multimedia features, trained classifier models, and five experiments together with baseline performance, which are all available at http://www.mediamill.nl/challenge/.

723 citations

TRECVID 2012 - An overview of the goals, tasks, data, evaluation mechanisms, and metrics

Paul Over, Jonathan G. Fiscus, G. Sanders, B. Shaw, Martial Michel, George Awad, Alan F. Smeaton, Wessel Kraaij, Georges Quénot

1 Jan 2013

TL;DR: The TREC Video Retrieval Evaluation (TRECVID) 2012 was a TREC-style video analysis and retrieval evaluation, the goal of which remains to promote progress in content-based exploitation of digital video via open, metrics-based evaluation.

Abstract: The TREC Video Retrieval Evaluation (TRECVID) 2012 was a TREC-style video analysis and retrieval evaluation, the goal of which remains to promote progress in content-based exploitation of digital video via open, metrics-based evaluation. Over the last ten years this effort has yielded a better understanding of how systems can effectively accomplish such processing and how one can reliably benchmark their performance. TRECVID is funded by the NIST and other US government agencies. Many organizations and individuals worldwide contribute significant time and effort.

598 citations

Year	Papers
2021	16
2020	45
2019	39
2018	56
2017	71
2016	100

Related Topics (5)

Performance Metrics