Perceptually scalable extension of H.264

Hojin Ha, Jincheol Park, Sanghoon Lee, Alan Conrad Bovik

Research output: Contribution to journalArticle

8 Citations (Scopus)

Abstract

We propose a novel visual scalable video coding (VSVC) framework, named VSVC H.264/AVC. In this approach, the non-uniform sampling characteristic of the human eye is used to modify scalable video coding (SVC) H.264/AVC. We exploit the visibility of video content and the scalability of the video codec to achieve optimal subjective visual quality given limited system resources. To achieve the largest coding gain with controlled perceptual quality degradation, a perceptual weighting scheme is deployed wherein the compressed video is weighted as a function of visual saliency and of the non-uniform distribution of retinal photoreceptors. We develop a resource allocation algorithm emphasizing both efficiency and fairness by controlling the size of the salient region in each quality layer. Efficiency is emphasized on the low quality layer of the SVC. The bits saved by eliminating perceptual redundancy in regions of low interest are allocated to lower block-level distortions in salient regions. Fairness is enforced on the higher quality layers by enlarging the size of the salient regions. The simulation results show that the proposed VSVC framework significantly improves the subjective visual quality of compressed videos.

Original languageEnglish
Article number5739513
Pages (from-to)1667-1678
Number of pages12
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume21
Issue number11
DOIs
Publication statusPublished - 2011 Nov 1

Fingerprint

Scalable video coding
Visibility
Resource allocation
Redundancy
Scalability
Sampling
Degradation

All Science Journal Classification (ASJC) codes

  • Electrical and Electronic Engineering
  • Media Technology

Cite this

Ha, Hojin ; Park, Jincheol ; Lee, Sanghoon ; Bovik, Alan Conrad. / Perceptually scalable extension of H.264. In: IEEE Transactions on Circuits and Systems for Video Technology. 2011 ; Vol. 21, No. 11. pp. 1667-1678.
@article{4ca44f94cc934e57b40fbd740d08d572,
title = "Perceptually scalable extension of H.264",
abstract = "We propose a novel visual scalable video coding (VSVC) framework, named VSVC H.264/AVC. In this approach, the non-uniform sampling characteristic of the human eye is used to modify scalable video coding (SVC) H.264/AVC. We exploit the visibility of video content and the scalability of the video codec to achieve optimal subjective visual quality given limited system resources. To achieve the largest coding gain with controlled perceptual quality degradation, a perceptual weighting scheme is deployed wherein the compressed video is weighted as a function of visual saliency and of the non-uniform distribution of retinal photoreceptors. We develop a resource allocation algorithm emphasizing both efficiency and fairness by controlling the size of the salient region in each quality layer. Efficiency is emphasized on the low quality layer of the SVC. The bits saved by eliminating perceptual redundancy in regions of low interest are allocated to lower block-level distortions in salient regions. Fairness is enforced on the higher quality layers by enlarging the size of the salient regions. The simulation results show that the proposed VSVC framework significantly improves the subjective visual quality of compressed videos.",
author = "Hojin Ha and Jincheol Park and Sanghoon Lee and Bovik, {Alan Conrad}",
year = "2011",
month = "11",
day = "1",
doi = "10.1109/TCSVT.2011.2133370",
language = "English",
volume = "21",
pages = "1667--1678",
journal = "IEEE Transactions on Circuits and Systems for Video Technology",
issn = "1051-8215",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "11",

}

Perceptually scalable extension of H.264. / Ha, Hojin; Park, Jincheol; Lee, Sanghoon; Bovik, Alan Conrad.

In: IEEE Transactions on Circuits and Systems for Video Technology, Vol. 21, No. 11, 5739513, 01.11.2011, p. 1667-1678.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Perceptually scalable extension of H.264

AU - Ha, Hojin

AU - Park, Jincheol

AU - Lee, Sanghoon

AU - Bovik, Alan Conrad

PY - 2011/11/1

Y1 - 2011/11/1

N2 - We propose a novel visual scalable video coding (VSVC) framework, named VSVC H.264/AVC. In this approach, the non-uniform sampling characteristic of the human eye is used to modify scalable video coding (SVC) H.264/AVC. We exploit the visibility of video content and the scalability of the video codec to achieve optimal subjective visual quality given limited system resources. To achieve the largest coding gain with controlled perceptual quality degradation, a perceptual weighting scheme is deployed wherein the compressed video is weighted as a function of visual saliency and of the non-uniform distribution of retinal photoreceptors. We develop a resource allocation algorithm emphasizing both efficiency and fairness by controlling the size of the salient region in each quality layer. Efficiency is emphasized on the low quality layer of the SVC. The bits saved by eliminating perceptual redundancy in regions of low interest are allocated to lower block-level distortions in salient regions. Fairness is enforced on the higher quality layers by enlarging the size of the salient regions. The simulation results show that the proposed VSVC framework significantly improves the subjective visual quality of compressed videos.

AB - We propose a novel visual scalable video coding (VSVC) framework, named VSVC H.264/AVC. In this approach, the non-uniform sampling characteristic of the human eye is used to modify scalable video coding (SVC) H.264/AVC. We exploit the visibility of video content and the scalability of the video codec to achieve optimal subjective visual quality given limited system resources. To achieve the largest coding gain with controlled perceptual quality degradation, a perceptual weighting scheme is deployed wherein the compressed video is weighted as a function of visual saliency and of the non-uniform distribution of retinal photoreceptors. We develop a resource allocation algorithm emphasizing both efficiency and fairness by controlling the size of the salient region in each quality layer. Efficiency is emphasized on the low quality layer of the SVC. The bits saved by eliminating perceptual redundancy in regions of low interest are allocated to lower block-level distortions in salient regions. Fairness is enforced on the higher quality layers by enlarging the size of the salient regions. The simulation results show that the proposed VSVC framework significantly improves the subjective visual quality of compressed videos.

UR - http://www.scopus.com/inward/record.url?scp=80455145024&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=80455145024&partnerID=8YFLogxK

U2 - 10.1109/TCSVT.2011.2133370

DO - 10.1109/TCSVT.2011.2133370

M3 - Article

AN - SCOPUS:80455145024

VL - 21

SP - 1667

EP - 1678

JO - IEEE Transactions on Circuits and Systems for Video Technology

JF - IEEE Transactions on Circuits and Systems for Video Technology

SN - 1051-8215

IS - 11

M1 - 5739513

ER -