Journal Article10.48550/arxiv.2408.00538
High-Quality, ROS Compatible Video Encoding and Decoding for High-Definition Datasets
Jinli Zhang,Bowen Xu,Sören Schwertfeger +2 more
- 01 Aug 2024
TL;DR: This paper investigates high-quality video encoding and decoding for robotic datasets, evaluating modern encoders and their settings to optimize storage size, quality, and encoding time within ROS 1 and 2 frameworks for efficient dataset sharing and benchmarking.
read more
Abstract: Robotic datasets are important for scientific benchmarking and developing algorithms, for example for Simultaneous Localization and Mapping (SLAM). Modern robotic datasets feature video data of high resolution and high framerates. Storing and sharing those datasets becomes thus very costly, especially if more than one camera is used for the datasets. It is thus essential to store this video data in a compressed format. This paper investigates the use of modern video encoders for robotic datasets. We provide a software that can replay mp4 videos within ROS 1 and ROS 2 frameworks, supporting the synchronized playback in simulated time. Furthermore, the paper evaluates different encoders and their settings to find optimal configurations in terms of resulting size, quality and encoding time. Through this work we show that it is possible to store and share even highest quality video datasets within reasonable storage constraints.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Figures

TABLE VI COMPRESSION RESULTS IN GB OF 26MIN SHANGHAITECH MAPPING ROBOT OUTDOOR DATASET 
Fig. 1. One frame from our test clip 
TABLE I VISUAL COMPARISON OF DIFFERENT VMAF SCORED OF DETAIL OF I-FRAME 1284 OF THE TEST VIDEO. 
TABLE III 60 SECOND TEST VIDEO EXPERIMENT: SVTAV1 WITH VMAF SCORES GREATER THAN 99.45 AND COMPRESSION TIMES LESS THAN 1000 SECONDS. ULTIMATELY WE CHOSE THE SETTINGS IN BOLD. 
TABLE IV X265 TEST DATA 
Fig. 2. Playback tool options.
References
Are we ready for autonomous driving? The KITTI vision benchmark suite
Andreas Geiger,Philip Lenz,Raquel Urtasun +2 more
- 16 Jun 2012
TL;DR: The autonomous driving platform is used to develop novel challenging benchmarks for the tasks of stereo, optical flow, visual odometry/SLAM and 3D object detection, revealing that methods ranking high on established datasets such as Middlebury perform below average when being moved outside the laboratory to the real world.
Overview of the High Efficiency Video Coding (HEVC) Standard
TL;DR: The main goal of the HEVC standardization effort is to enable significantly improved compression performance relative to existing standards-in the range of 50% bit-rate reduction for equal perceptual video quality.
Past, Present, and Future of Simultaneous Localization And Mapping: Towards the Robust-Perception Age
Cesar Cadena,Luca Carlone,Henry Carrillo,Yasir Latif,Davide Scaramuzza,José L. Neira,Ian Reid,John J. Leonard +7 more
TL;DR: What is now the de-facto standard formulation for SLAM is presented, covering a broad set of topics including robustness and scalability in long-term mapping, metric and semantic representations for mapping, theoretical performance guarantees, active SLAM and exploration, and other new frontiers.
Block Partitioning Structure in the HEVC Standard
TL;DR: Technical details of the block partitioning structure of HEVC are introduced with an emphasis on the method of designing a consistent framework by combining the three different units together and experimental results are provided to justify the role of each component.
489