Multi-view video coding with view interpolation prediction for 2D camera arrays

Tae Young Chung, Il Lyong Jung, Kwanwoong Song, Chang-Su Kim

Research output: Contribution to journalArticle

11 Citations (Scopus)

Abstract

An efficient compression algorithm for multi-view video sequences, which are captured by two-dimensional (2D) camera arrays, is proposed in this work. First, we propose a novel prediction structure, called three-dimensional hierarchical B prediction (3DHBP), which can efficiently reduce horizontal inter-view redundancies, vertical inter-view redundancies, and temporal redundancies in multi-view videos. Second, we develop a view interpolation scheme based on the bilateral disparity estimation. The interpolation scheme yields high quality view frames by adapting disparity estimation and compensation procedures using the information in neighboring frames. Simulation results demonstrate that the proposed multi-view video coding algorithm provides significantly better rate-distortion (R-D) performance than the conventional algorithm, by employing the 3DHBP structure and using interpolated view frames as additional reference frames.

Original languageEnglish
Pages (from-to)474-486
Number of pages13
JournalJournal of Visual Communication and Image Representation
Volume21
Issue number5-6
DOIs
Publication statusPublished - 2010 Jul 1

Fingerprint

Image coding
Redundancy
Interpolation
Cameras

Keywords

  • 2D camera array
  • 3D-TV
  • Bilateral criterion
  • H.264/AVC
  • Hierarchical B prediction
  • Multi-view video coding
  • View interpolation

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Media Technology
  • Computer Vision and Pattern Recognition
  • Signal Processing

Cite this

Multi-view video coding with view interpolation prediction for 2D camera arrays. / Chung, Tae Young; Jung, Il Lyong; Song, Kwanwoong; Kim, Chang-Su.

In: Journal of Visual Communication and Image Representation, Vol. 21, No. 5-6, 01.07.2010, p. 474-486.

Research output: Contribution to journalArticle

@article{0311ba273b9e43d2ba6c6a43cda6b7b7,
title = "Multi-view video coding with view interpolation prediction for 2D camera arrays",
abstract = "An efficient compression algorithm for multi-view video sequences, which are captured by two-dimensional (2D) camera arrays, is proposed in this work. First, we propose a novel prediction structure, called three-dimensional hierarchical B prediction (3DHBP), which can efficiently reduce horizontal inter-view redundancies, vertical inter-view redundancies, and temporal redundancies in multi-view videos. Second, we develop a view interpolation scheme based on the bilateral disparity estimation. The interpolation scheme yields high quality view frames by adapting disparity estimation and compensation procedures using the information in neighboring frames. Simulation results demonstrate that the proposed multi-view video coding algorithm provides significantly better rate-distortion (R-D) performance than the conventional algorithm, by employing the 3DHBP structure and using interpolated view frames as additional reference frames.",
keywords = "2D camera array, 3D-TV, Bilateral criterion, H.264/AVC, Hierarchical B prediction, Multi-view video coding, View interpolation",
author = "Chung, {Tae Young} and Jung, {Il Lyong} and Kwanwoong Song and Chang-Su Kim",
year = "2010",
month = "7",
day = "1",
doi = "10.1016/j.jvcir.2009.10.001",
language = "English",
volume = "21",
pages = "474--486",
journal = "Journal of Visual Communication and Image Representation",
issn = "1047-3203",
publisher = "Academic Press Inc.",
number = "5-6",

}

TY - JOUR

T1 - Multi-view video coding with view interpolation prediction for 2D camera arrays

AU - Chung, Tae Young

AU - Jung, Il Lyong

AU - Song, Kwanwoong

AU - Kim, Chang-Su

PY - 2010/7/1

Y1 - 2010/7/1

N2 - An efficient compression algorithm for multi-view video sequences, which are captured by two-dimensional (2D) camera arrays, is proposed in this work. First, we propose a novel prediction structure, called three-dimensional hierarchical B prediction (3DHBP), which can efficiently reduce horizontal inter-view redundancies, vertical inter-view redundancies, and temporal redundancies in multi-view videos. Second, we develop a view interpolation scheme based on the bilateral disparity estimation. The interpolation scheme yields high quality view frames by adapting disparity estimation and compensation procedures using the information in neighboring frames. Simulation results demonstrate that the proposed multi-view video coding algorithm provides significantly better rate-distortion (R-D) performance than the conventional algorithm, by employing the 3DHBP structure and using interpolated view frames as additional reference frames.

AB - An efficient compression algorithm for multi-view video sequences, which are captured by two-dimensional (2D) camera arrays, is proposed in this work. First, we propose a novel prediction structure, called three-dimensional hierarchical B prediction (3DHBP), which can efficiently reduce horizontal inter-view redundancies, vertical inter-view redundancies, and temporal redundancies in multi-view videos. Second, we develop a view interpolation scheme based on the bilateral disparity estimation. The interpolation scheme yields high quality view frames by adapting disparity estimation and compensation procedures using the information in neighboring frames. Simulation results demonstrate that the proposed multi-view video coding algorithm provides significantly better rate-distortion (R-D) performance than the conventional algorithm, by employing the 3DHBP structure and using interpolated view frames as additional reference frames.

KW - 2D camera array

KW - 3D-TV

KW - Bilateral criterion

KW - H.264/AVC

KW - Hierarchical B prediction

KW - Multi-view video coding

KW - View interpolation

UR - http://www.scopus.com/inward/record.url?scp=77957876005&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77957876005&partnerID=8YFLogxK

U2 - 10.1016/j.jvcir.2009.10.001

DO - 10.1016/j.jvcir.2009.10.001

M3 - Article

VL - 21

SP - 474

EP - 486

JO - Journal of Visual Communication and Image Representation

JF - Journal of Visual Communication and Image Representation

SN - 1047-3203

IS - 5-6

ER -