Reinforcement learning for video encoder control in HEVC

Philipp Helle, Heiko Schwarz, Thomas Wiegand, Klaus Muller

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Citations (Scopus)

Abstract

In todays video compression systems, the encoder typically follows an optimization procedure to find a compressed representation of the video signal. While primary optimization criteria are bit rate and image distortion, low complexity of this procedure may also be of importance in some applications, making complexity a third objective. We approach this problem by treating the encoding procedure as a decision process in time and make it amenable to reinforcement learning. Our learning algorithm computes a strategy in a compact functional representation, which is then employed in the video encoder to control its search. By including measured execution time into the reinforcement signal with a lagrangian weight, we realize a trade-off between RD-performance and computational complexity controlled by a single parameter. Using the reference software test model (HM) of the HEVC video coding standard, we show that over half the encoding time can be saved at the same RD-performance.

Original languageEnglish
Title of host publicationProceedings - 2017 International Conference on Systems, Signals and Image Processing, IWSSIP 2017
PublisherIEEE Computer Society
ISBN (Electronic)9781509063444
DOIs
Publication statusPublished - 2017 Jun 30
Externally publishedYes
Event24th International Conference on Systems, Signals and Image Processing, IWSSIP 2017 - Poznan, Poland
Duration: 2017 May 222017 May 24

Other

Other24th International Conference on Systems, Signals and Image Processing, IWSSIP 2017
Country/TerritoryPoland
CityPoznan
Period17/5/2217/5/24

Keywords

  • Computational Complexity
  • Cost Sensitive Classification
  • HEVC
  • Machine Learning
  • Optimization
  • Rate-Distortion Optimization
  • Reinforcement Learning
  • Video Coding

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Signal Processing
  • Software

Fingerprint

Dive into the research topics of 'Reinforcement learning for video encoder control in HEVC'. Together they form a unique fingerprint.

Cite this