Proceedings Article, DOI: 10.1109/ICME.2002.1035908
Baseball scene classification using multimedia features
Wei Hua,Mei Han,Yihong Gong +2 more
- 07 Nov 2002
- Vol. 1, pp 821-824
50
TL;DR: This paper proposes a maximum entropy based method for baseball scene classification in TV broadcast videos, chosen because it can automatically select and fuse multimedia features from temporal contexts.
Abstract: In this paper, we address the problem of classifying video scenes, which is essential in video indexing, archiving, and summarization. Compared with previous methods, we emphasize the integration of multimedia features, including image, audio, and speech cues. With current state-of-the-art image and audio analysis techniques, most image and audio features that can be extracted from videos are very low level; therefore, classifying scenes based on features from a single medium yields poor performance. We propose a maximum entropy based method for baseball scene classification in TV broadcast videos. The maximum entropy scheme is chosen because it can automatically select and fuse multimedia features from temporal contexts.
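The idea of fusing cues from temporal contexts with a maximum entropy model can be illustrated with a minimal sketch. This is not the authors' implementation: the cue names (`field_view`, `closeup`, `crowd_cheer`), the scene labels, the window radius, and the plain gradient-ascent training with L2 regularization are all assumptions made for illustration. Each shot is described by a set of binary cues; cues from neighboring shots are tagged with their relative offset and stacked into one feature set, and a conditional maximum entropy model (equivalent to multinomial logistic regression) learns one weight per (class, feature) pair.

```python
import math

def window_features(shots, t, radius=1):
    """Stack binary cues from shots t-radius..t+radius, tagged by relative offset."""
    feats = set()
    for off in range(-radius, radius + 1):
        i = t + off
        if 0 <= i < len(shots):
            for name in shots[i]:
                feats.add((off, name))
    return feats

def predict_proba(w, feats, classes):
    """Conditional maxent: p(y|x) is proportional to exp(sum of active weights)."""
    scores = {c: sum(w.get((c, f), 0.0) for f in feats) for c in classes}
    m = max(scores.values())  # subtract the max for numerical stability
    exps = {c: math.exp(s - m) for c, s in scores.items()}
    z = sum(exps.values())
    return {c: e / z for c, e in exps.items()}

def train_maxent(samples, labels, classes, epochs=200, lr=0.5, l2=0.01):
    """Fit weights by batch gradient ascent on the L2-regularized log-likelihood."""
    w = {}  # (class, feature) -> weight
    n = len(samples)
    for _ in range(epochs):
        grad = {}
        for feats, y in zip(samples, labels):
            p = predict_proba(w, feats, classes)
            for c in classes:
                # observed count minus expected count for each active feature
                delta = (1.0 if c == y else 0.0) - p[c]
                for f in feats:
                    grad[(c, f)] = grad.get((c, f), 0.0) + delta
        for k, g in grad.items():
            old = w.get(k, 0.0)
            w[k] = old + lr * (g / n - l2 * old)
    return w

# Hypothetical toy data: one set of binary cues per shot, in broadcast order.
shots = [
    {"field_view"}, {"closeup"}, {"crowd_cheer"},
    {"field_view"}, {"closeup"}, {"field_view"},
]
# Hypothetical labels: the two "closeup" shots differ only in what follows them.
labels = ["pitch", "hit", "celebration", "pitch", "other", "pitch"]
classes = sorted(set(labels))

samples = [window_features(shots, t, radius=1) for t in range(len(shots))]
model = train_maxent(samples, labels, classes)

for t, feats in enumerate(samples):
    probs = predict_proba(model, feats, classes)
    print(t, max(probs, key=probs.get))
```

In this toy sequence the two `closeup` shots carry identical cues of their own, so only the offset-tagged cue from the following shot lets the model separate them, which is the role temporal context plays in the sketch.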
Citations
Fusion of audio and motion information on HMM-based highlight extraction for baseball games
Chih-Chieh Cheng,Chiou-Ting Hsu +1 more
TL;DR: This paper proposes a novel representation method based on likelihood models to extract baseball game highlights based on audio-motion integrated cues and employs a hidden Markov model to model and detect the transition of the integrated representation for highlight segments.
86
•Book
Content-Based Analysis of Digital Video
Alan Hanjalic
- 01 Jan 2004
TL;DR: System developers, researchers and students working in the area of content-based analysis and retrieval of video and multimedia in general will find this book invaluable.
76
Audiovisual integration for tennis broadcast structuring
TL;DR: This paper focuses on the integration of multimodal features for sport video structure analysis using a statistical model which takes into account both the shot content and the interleaving of shots in the global framework of Hidden Markov Models.
HMM based structuring of tennis videos using visual and audio cues
Ewa Kijak,Guillaume Gravier,Patrick Gros,Lionel Oisel,Frédéric Bimbot +4 more
- 06 Jul 2003
TL;DR: This paper focuses on the use of hidden Markov models for structure analysis of videos, and demonstrates how they can be efficiently applied to merge audio and visual cues, and validated in the particular domain of tennis videos.
Event-Importance Based Customized and Automatic Cricket Highlight Generation
Maheshkumar H. Kolekar,Somnath Sengupta +1 more
- 09 Jul 2006
TL;DR: A novel approach to customized, automated generation of sports highlights from extracted events and semantic concepts is presented; it successfully extracted highlights from recorded video of a cricket match.
References
A maximum entropy approach to natural language processing
TL;DR: A maximum-likelihood approach for automatically constructing maximum entropy models is presented and how to implement this approach efficiently is described, using as examples several problems in natural language processing.
Statistical Models for Text Segmentation
TL;DR: Assessment of the approach on quantitative and qualitative grounds demonstrates its effectiveness in two very different domains, Wall Street Journal news articles and television broadcast news story transcripts, using a new probabilistically motivated error metric.
791
Automatically extracting highlights for TV Baseball programs
Yong Rui,Anoop Gupta,Alex Acero +2 more
- 30 Oct 2000
TL;DR: This paper explores how to provide for the ability to extract highlights automatically, so that viewing time can be reduced, and presents results comparing output of algorithms against human-selected highlights for a diverse collection of baseball games with very encouraging results.
Translating collocations for bilingual lexicons: a statistical approach
TL;DR: A program named Champollion is described which, given a pair of parallel corpora in two different languages and a list of collocations in one of them, automatically produces their translations, to provide a tool for compiling bilingual lexical information above the word level in multiple languages, for different domains.
571
Algorithms and system for segmentation and structure analysis in soccer video
Peng Xu,Lexing Xie,Shih-Fu Chang,Ajay Divakaran,Anthony Vetro,Huifang Sun +5 more
- 01 Aug 2001
TL;DR: It is shown that low-level features and mid-level view classes can be combined to extract more information about the game, via the example of detecting grass orientation in the field, and the best result in segmentation is 86.5%.
264