Harmonic enhancement in low bitrate audio coding using an efficient long-term predictor

Jeongook Song, Chang Heon Lee, Hyen O. Oh, Hong-Goo Kang

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

This paper proposes audio coding using an efficient long-term prediction method to enhance the perceptual quality of audio codecs to speech input signals at low bit-rates. The MPEG-4 AAC-LTP exploited a similar concept, but its improvement was not significant because of small prediction gain due to long prediction lags and aliased components caused by the transformation with a time-domain aliasing cancelation (TDAC) technique. The proposed algorithm increases the prediction gain by employing a deharmonizing predictor and a long-term compensation filter. The look-back memory elements are first constructed by applying the de-harmonizing predictor to the input signal, then the prediction residual is encoded and decoded by transform audio coding. Finally, the long-term compensation filter is applied to the updated look-back memory of the decoded prediction residual to obtain synthesized signals. Experimental results show that the proposed algorithm has much lower spectral distortion and higher perceptual quality than conventional approaches especially for harmonic signals, such as voiced speech.

Original languageEnglish
Article number939542
JournalEurasip Journal on Advances in Signal Processing
Volume2010
DOIs
Publication statusPublished - 2010 Oct 18

Fingerprint

Data storage equipment
Compensation and Redress

All Science Journal Classification (ASJC) codes

  • Signal Processing
  • Information Systems
  • Hardware and Architecture
  • Electrical and Electronic Engineering

Cite this

@article{bdbcfbeb12d642b1bc352ce2db59d343,
title = "Harmonic enhancement in low bitrate audio coding using an efficient long-term predictor",
abstract = "This paper proposes audio coding using an efficient long-term prediction method to enhance the perceptual quality of audio codecs to speech input signals at low bit-rates. The MPEG-4 AAC-LTP exploited a similar concept, but its improvement was not significant because of small prediction gain due to long prediction lags and aliased components caused by the transformation with a time-domain aliasing cancelation (TDAC) technique. The proposed algorithm increases the prediction gain by employing a deharmonizing predictor and a long-term compensation filter. The look-back memory elements are first constructed by applying the de-harmonizing predictor to the input signal, then the prediction residual is encoded and decoded by transform audio coding. Finally, the long-term compensation filter is applied to the updated look-back memory of the decoded prediction residual to obtain synthesized signals. Experimental results show that the proposed algorithm has much lower spectral distortion and higher perceptual quality than conventional approaches especially for harmonic signals, such as voiced speech.",
author = "Jeongook Song and Lee, {Chang Heon} and Oh, {Hyen O.} and Hong-Goo Kang",
year = "2010",
month = "10",
day = "18",
doi = "10.1155/2010/939542",
language = "English",
volume = "2010",
journal = "Eurasip Journal on Advances in Signal Processing",
issn = "1687-6172",
publisher = "Springer Publishing Company",

}

Harmonic enhancement in low bitrate audio coding using an efficient long-term predictor. / Song, Jeongook; Lee, Chang Heon; Oh, Hyen O.; Kang, Hong-Goo.

In: Eurasip Journal on Advances in Signal Processing, Vol. 2010, 939542, 18.10.2010.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Harmonic enhancement in low bitrate audio coding using an efficient long-term predictor

AU - Song, Jeongook

AU - Lee, Chang Heon

AU - Oh, Hyen O.

AU - Kang, Hong-Goo

PY - 2010/10/18

Y1 - 2010/10/18

N2 - This paper proposes audio coding using an efficient long-term prediction method to enhance the perceptual quality of audio codecs to speech input signals at low bit-rates. The MPEG-4 AAC-LTP exploited a similar concept, but its improvement was not significant because of small prediction gain due to long prediction lags and aliased components caused by the transformation with a time-domain aliasing cancelation (TDAC) technique. The proposed algorithm increases the prediction gain by employing a deharmonizing predictor and a long-term compensation filter. The look-back memory elements are first constructed by applying the de-harmonizing predictor to the input signal, then the prediction residual is encoded and decoded by transform audio coding. Finally, the long-term compensation filter is applied to the updated look-back memory of the decoded prediction residual to obtain synthesized signals. Experimental results show that the proposed algorithm has much lower spectral distortion and higher perceptual quality than conventional approaches especially for harmonic signals, such as voiced speech.

AB - This paper proposes audio coding using an efficient long-term prediction method to enhance the perceptual quality of audio codecs to speech input signals at low bit-rates. The MPEG-4 AAC-LTP exploited a similar concept, but its improvement was not significant because of small prediction gain due to long prediction lags and aliased components caused by the transformation with a time-domain aliasing cancelation (TDAC) technique. The proposed algorithm increases the prediction gain by employing a deharmonizing predictor and a long-term compensation filter. The look-back memory elements are first constructed by applying the de-harmonizing predictor to the input signal, then the prediction residual is encoded and decoded by transform audio coding. Finally, the long-term compensation filter is applied to the updated look-back memory of the decoded prediction residual to obtain synthesized signals. Experimental results show that the proposed algorithm has much lower spectral distortion and higher perceptual quality than conventional approaches especially for harmonic signals, such as voiced speech.

UR - http://www.scopus.com/inward/record.url?scp=77957829366&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77957829366&partnerID=8YFLogxK

U2 - 10.1155/2010/939542

DO - 10.1155/2010/939542

M3 - Article

VL - 2010

JO - Eurasip Journal on Advances in Signal Processing

JF - Eurasip Journal on Advances in Signal Processing

SN - 1687-6172

M1 - 939542

ER -