Perceptually scalable extension of H.264

Hojin Ha, Jincheol Park, Sanghoon Lee, Alan Conrad Bovik

Research output: Contribution to journalArticlepeer-review

8 Citations (Scopus)

Abstract

We propose a novel visual scalable video coding (VSVC) framework, named VSVC H.264/AVC. In this approach, the non-uniform sampling characteristic of the human eye is used to modify scalable video coding (SVC) H.264/AVC. We exploit the visibility of video content and the scalability of the video codec to achieve optimal subjective visual quality given limited system resources. To achieve the largest coding gain with controlled perceptual quality degradation, a perceptual weighting scheme is deployed wherein the compressed video is weighted as a function of visual saliency and of the non-uniform distribution of retinal photoreceptors. We develop a resource allocation algorithm emphasizing both efficiency and fairness by controlling the size of the salient region in each quality layer. Efficiency is emphasized on the low quality layer of the SVC. The bits saved by eliminating perceptual redundancy in regions of low interest are allocated to lower block-level distortions in salient regions. Fairness is enforced on the higher quality layers by enlarging the size of the salient regions. The simulation results show that the proposed VSVC framework significantly improves the subjective visual quality of compressed videos.

Original languageEnglish
Article number5739513
Pages (from-to)1667-1678
Number of pages12
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume21
Issue number11
DOIs
Publication statusPublished - 2011 Nov

Bibliographical note

Funding Information:
Manuscript received October 27, 2008; revised April 12, 2010 and December 28, 2010; accepted February 13, 2011. Date of publication March 28, 2011; date of current version November 2, 2011. This research was supported in part by the Ministry of Knowledge Economy, Korea, under the National HRD support program for convergence information technology supervised by the National IT Industry Promotion Agency (NIPA), under Grant NIPA-2010-C6150-1001-0013, and by the Basic Science Research Program through the National Research Foundation of Korea funded by the Ministry of Education, Science and Technology, under Grant 2010-0011995. This paper was recommended by Associate Editor H. Gharavi.

All Science Journal Classification (ASJC) codes

  • Media Technology
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Perceptually scalable extension of H.264'. Together they form a unique fingerprint.

Cite this