Publication | Closed Access
Adaptive CU Split Decision with Pooling-variable CNN for VVC Intra Encoding
90
Citations
8
References
2019
Year
Unknown Venue
Convolutional Neural NetworkEngineeringMachine LearningVideo ProcessingVideo InterpretationCu SplitImage AnalysisPattern RecognitionSparse Neural NetworkQuadtree StructureVideo TransformerMachine VisionPooling-variable CnnVvc Intra EncodingFeature LearningVersatile Video CodingComputer EngineeringComputer ScienceVideo UnderstandingDeep LearningFeature FusionComputer Vision
In the versatile video coding (VVC) proposed by the Joint Video Exploration Team (JVET), the quad-tree with the nested multi-type tree (QTMT) partition scheme has been adopted based on the quadtree structure in the high efficiency video coding (HEVC). The video coding quality of VVC is better than the HEVC, but the algorithm complexity has also increased greatly. In this work, we present an adaptive CU split decision for intra frame with the pooling-variable convolutional neural network (CNN), targeting at various coding unit (CU) shape. The shape-adaptive CNN is realized by the variable pooling layer size where we can make the most of the pooling layer in CNN and retain the original information. Based on the proposed CNN, the CU split or not will be decided by only one trained network, same architecture and parameters for the CUs with multiple sizes. Moreover, with the proposed shape-based CNN training scheme, the various training sample size can be processed successfully. The CUbased network can avoid the full rate-distortion optimization for the CU split and the CU-level rate control can also be enabled. The experiment results show that the proposed method can save 33% coding time with only 0.99% Bjontegaard Delta bitrate (BD-rate) increase.
| Year | Citations | |
|---|---|---|
Page 1
Page 1