Reducing Complexity of HEVC: A Deep Learning Approach

Xu, Mai; Li, Tianyi; Wang, Zulin; Deng, Xin; Yang, Ren; Guan, Zhenyu

doi:10.1109/TIP.2018.2847035

Computer Science > Computer Vision and Pattern Recognition

arXiv:1710.01218 (cs)

[Submitted on 19 Sep 2017 (v1), last revised 22 Mar 2018 (this version, v3)]

Title:Reducing Complexity of HEVC: A Deep Learning Approach

Authors:Mai Xu, Tianyi Li, Zulin Wang, Xin Deng, Ren Yang, Zhenyu Guan

View PDF

Abstract:High Efficiency Video Coding (HEVC) significantly reduces bit-rates over the proceeding H.264 standard but at the expense of extremely high encoding complexity. In HEVC, the quad-tree partition of coding unit (CU) consumes a large proportion of the HEVC encoding complexity, due to the bruteforce search for rate-distortion optimization (RDO). Therefore, this paper proposes a deep learning approach to predict the CU partition for reducing the HEVC complexity at both intra- and inter-modes, which is based on convolutional neural network (CNN) and long- and short-term memory (LSTM) network. First, we establish a large-scale database including substantial CU partition data for HEVC intra- and inter-modes. This enables deep learning on the CU partition. Second, we represent the CU partition of an entire coding tree unit (CTU) in the form of a hierarchical CU partition map (HCPM). Then, we propose an early-terminated hierarchical CNN (ETH-CNN) for learning to predict the HCPM. Consequently, the encoding complexity of intra-mode HEVC can be drastically reduced by replacing the brute-force search with ETH-CNN to decide the CU partition. Third, an early-terminated hierarchical LSTM (ETH-LSTM) is proposed to learn the temporal correlation of the CU partition. Then, we combine ETH-LSTM and ETH-CNN to predict the CU partition for reducing the HEVC complexity for inter-mode. Finally, experimental results show that our approach outperforms other state-of-the-art approaches in reducing the HEVC complexity at both intra- and inter-modes.

Comments:	17 pages, with 12 figures and 7 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1710.01218 [cs.CV]
	(or arXiv:1710.01218v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1710.01218
Journal reference:	Published in IEEE Transactions on Image Processing, Oct. 2018
Related DOI:	https://doi.org/10.1109/TIP.2018.2847035

Submission history

From: Tianyi Li [view email]
[v1] Tue, 19 Sep 2017 02:02:00 UTC (2,274 KB)
[v2] Thu, 18 Jan 2018 07:48:00 UTC (3,307 KB)
[v3] Thu, 22 Mar 2018 11:13:05 UTC (3,307 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Reducing Complexity of HEVC: A Deep Learning Approach

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Reducing Complexity of HEVC: A Deep Learning Approach

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators