1. What are the contributions in "News video story segmentation using fusion of multi-level multi-modal features in trecvid 2003" ?
In this paper, the authors present their new results in news video story segmentation and classification in the context of TRECVID video retrieval benchmarking event 2003.. Using the large news video set from the TRECVID 2003 benchmark, the authors demonstrate satisfactory performance ( F1 measure up to 0. 76 ) and more importantly observe an interesting opportunity for further improvement.
read more
2. What are the future works mentioned in the paper "News video story segmentation using fusion of multi-level multi-modal features in trecvid 2003" ?
According to their observation, a ME model extended with temporal states would be a promising solution since the statistical behaviors of features in relation to the story transition dynamics may change over time in the course of a news program.
read more
3. How many binary features are generated at a candidate point?
All the binary features generated at a candidate point are sequently collected into {gj} and are further fed into the ME model; e.g., for pitch jump raw feature with 4 threshold levels, it would generate 3 · 4 = 12 binary features since the authors have to check if the feature is ”on” in the 3 observation windows and each is binarized with 4 different levels.
read more
4. What is the relevant feature in the induced binary feature?
As for the induced binary features, the anchor face feature in a certain observation window is the most relevant; the next induced is the significant pause within the noncommercial section.
read more


