About: Video coding format is a research topic. Over the lifetime, 36 publications have been published within this topic receiving 182 citations. The topic is also known as: video coding format & video bitstream format.
TL;DR: A technical overview of key coding techniques in AV1 is provided and the coding performance gains are validated by video compression tests performed with the libaom AV1 encoder against the libvpx VP9 encoder.
Abstract: In 2018, the Alliance for Open Media (AOMedia) finalized its first video compression format AV1, which is jointly developed by the industry consortium of leading video technology companies. The main goal of AV1 is to provide an open source and royalty-free video coding format that substantially outperforms state-of-the-art codecs available on the market in compression efficiency while remaining practical decoding complexity as well as being optimized for hardware feasibility and scalability on modern devices. To give detailed insights into how the targeted performance and feasibility is realized, this paper provides a technical overview of key coding techniques in AV1. Besides, the coding performance gains are validated by video compression tests performed with the libaom AV1 encoder against the libvpx VP9 encoder. Preliminary comparison with two leading HEVC encoders, x265 and HM, and the reference software of VVC is also conducted on AOM's common test set and an open 4k set.
TL;DR: This experiment adopts the PSNR and SSIM on WebP and JPG, PNG for comparison test, then implements data compression and reduction test, showing that WebP has high value in the life, it will develop well in the future.
Abstract: WebP is a new image compression technology proposed by Google. It is derived from the VP8 video coding format; it also supports lossless and lossy compressions. Compared with the present popular image format, WebP has smaller volume than JPG and PNG in the condition of the same picture quality conditions. By coding with MATLAB, this experiment adopts the PSNR and SSIM on WebP and JPG, PNG for comparison test, then implement data compression and reduction test. Since MATLAB does not identify the WebP format, this experiment uses canvas to process WebP images, thus measuring the PSNR and WebP of SSIM. The results of experiment show that WebP has high value in the life, it will develop well in the future.
TL;DR: A view selection method inspired by plenoptic sampling followed by transform-based view coding and view synthesis prediction to code residual views is introduced, which has an improved rate-distortion performance and preserves the structure of the perceived light fields better.
Abstract: Full parallax light field displays require high pixel density and huge amounts of data. Compression is a necessary tool used by 3D display systems to cope with the high bandwidth requirements. One of the formats adopted by MPEG for 3D video coding standards is the use of multiple views with associated depth maps. Depth maps enable the coding of a reduced number of views, and are used by compression and synthesis software to reconstruct the light field. However, most of the developed coding and synthesis tools target linearly arranged cameras with small baselines. Here we propose to use the 3D video coding format for full parallax light field coding. We introduce a view selection method inspired by plenoptic sampling followed by transform-based view coding and view synthesis prediction to code residual views. We determine the minimal requirements for view sub-sampling and present the rate-distortion performance of our proposal. We also compare our method with established video compression techniques, such as H.264/AVC, H.264/MVC, and the new 3D video coding algorithm, 3DV-ATM. Our results show that our method not only has an improved rate-distortion performance, it also preserves the structure of the perceived light fields better.
TL;DR: In this paper, a method of generating data representing a degree of importance of data blocks in a coded digital image of a digital video sequence coded in a hybrid predictive video coding format was proposed.
Abstract: A method of generating data representing a degree of importance of data blocks in a coded digital image of a digital video sequence coded in a hybrid predictive video coding format, said image being divided into a plurality of data blocks comprises, for the data blocks of the image, a step of determination of data representing the degree of importance of each of those data blocks on the basis of the coding or absence of coding of said data block, the determination also depending on the possible use of the data block considered for the coding of at least one data block of at least one other image of the video sequence. The invention also relates to a method of transmitting a coded video sequence.
TL;DR: A high-throughput hardware architecture for ten non-directional AV1 intra prediction modes: DC, Paeth, Smooth, Smooth Vertical, Smooth Horizontal and Recursive-filtering-based modes 0 to 4, which is able to process every single combination of these blocks according to the 10-way partition tree.
Abstract: The AOMedia Video 1 (AV1), developed by the Alliance for Open Media (AOM) industry consortium and released in 2018, is an open-source and royalty-free video coding format. The goal of AV1 is to deliver substantial compression gains over state-of-the-art codecs such as VP9 and HEVC, while keeping a practical decoding complexity, hardware feasibility and an open and free status. This paper presents a high-throughput hardware architecture for ten non-directional AV1 intra prediction modes: DC, Paeth, Smooth, Smooth Vertical, Smooth Horizontal and Recursive-filtering-based modes 0 to 4. The proposed design supports all 19 block sizes specified by AV1 and, in the worst-case scenario, is able to process every single combination of these blocks according to the 10-way partition tree, with a throughput of UHD 4K videos at up to 30 frames per second. When synthesized to the TSMC 40nm cell library targeting a frequency of 648MHz, the proposed design used 128.5K gates and showed a power dissipation and energy consumption per predicted sample of 65.5mW and 0.176pJ/sample, respectively. No other works were found in the literature describing hardware designs for AV1 intra prediction encoder.