A bit-rate/bandwidth scalable speech coder based on ITU-T G.723.1 standard

Sung Kyo Jung, Kyung Tae Kim, Hong Goo Kang

Research output: Contribution to journalConference article

7 Citations (Scopus)

Abstract

This paper presents a new scalable coder based on the ITU-T G.723.1 standard which is one of the most famous speech coders for VoIP applications. In order to support both bit-rate scalability and bandwidth scalability, the proposed coder adopts a split-band approach, where the input signal, sampled at 16 kHz, is decomposed into two equal frequency bands. The lower-band speech is coded with the standard coder such as the G.723.1 standard. In addition, the low-band enhancement layer for the lower-band speech improves the perceptual quality of decoded speech by employing additional coding units based on a cascaded codebook approach. The higher-band signal is encoded using an MDCT-based transform coding scheme. The proposed coder at a bit-rate of 19.4 kbit/s provides speech quality comparable to the ITU-T 24 kbit/s G.722.1 coder, while it also has interoperability with G.723.1.

Original languageEnglish
Pages (from-to)I285-I288
JournalICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume1
Publication statusPublished - 2004 Sep 28
EventProceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing - Montreal, Que, Canada
Duration: 2004 May 172004 May 21

Fingerprint

Bandwidth
Scalability
Interoperability
Frequency bands

All Science Journal Classification (ASJC) codes

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Cite this

@article{dfb2bfbbbcc14081888d4133b118127b,
title = "A bit-rate/bandwidth scalable speech coder based on ITU-T G.723.1 standard",
abstract = "This paper presents a new scalable coder based on the ITU-T G.723.1 standard which is one of the most famous speech coders for VoIP applications. In order to support both bit-rate scalability and bandwidth scalability, the proposed coder adopts a split-band approach, where the input signal, sampled at 16 kHz, is decomposed into two equal frequency bands. The lower-band speech is coded with the standard coder such as the G.723.1 standard. In addition, the low-band enhancement layer for the lower-band speech improves the perceptual quality of decoded speech by employing additional coding units based on a cascaded codebook approach. The higher-band signal is encoded using an MDCT-based transform coding scheme. The proposed coder at a bit-rate of 19.4 kbit/s provides speech quality comparable to the ITU-T 24 kbit/s G.722.1 coder, while it also has interoperability with G.723.1.",
author = "Jung, {Sung Kyo} and Kim, {Kyung Tae} and Kang, {Hong Goo}",
year = "2004",
month = "9",
day = "28",
language = "English",
volume = "1",
pages = "I285--I288",
journal = "Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing",
issn = "0736-7791",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

A bit-rate/bandwidth scalable speech coder based on ITU-T G.723.1 standard. / Jung, Sung Kyo; Kim, Kyung Tae; Kang, Hong Goo.

In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 1, 28.09.2004, p. I285-I288.

Research output: Contribution to journalConference article

TY - JOUR

T1 - A bit-rate/bandwidth scalable speech coder based on ITU-T G.723.1 standard

AU - Jung, Sung Kyo

AU - Kim, Kyung Tae

AU - Kang, Hong Goo

PY - 2004/9/28

Y1 - 2004/9/28

N2 - This paper presents a new scalable coder based on the ITU-T G.723.1 standard which is one of the most famous speech coders for VoIP applications. In order to support both bit-rate scalability and bandwidth scalability, the proposed coder adopts a split-band approach, where the input signal, sampled at 16 kHz, is decomposed into two equal frequency bands. The lower-band speech is coded with the standard coder such as the G.723.1 standard. In addition, the low-band enhancement layer for the lower-band speech improves the perceptual quality of decoded speech by employing additional coding units based on a cascaded codebook approach. The higher-band signal is encoded using an MDCT-based transform coding scheme. The proposed coder at a bit-rate of 19.4 kbit/s provides speech quality comparable to the ITU-T 24 kbit/s G.722.1 coder, while it also has interoperability with G.723.1.

AB - This paper presents a new scalable coder based on the ITU-T G.723.1 standard which is one of the most famous speech coders for VoIP applications. In order to support both bit-rate scalability and bandwidth scalability, the proposed coder adopts a split-band approach, where the input signal, sampled at 16 kHz, is decomposed into two equal frequency bands. The lower-band speech is coded with the standard coder such as the G.723.1 standard. In addition, the low-band enhancement layer for the lower-band speech improves the perceptual quality of decoded speech by employing additional coding units based on a cascaded codebook approach. The higher-band signal is encoded using an MDCT-based transform coding scheme. The proposed coder at a bit-rate of 19.4 kbit/s provides speech quality comparable to the ITU-T 24 kbit/s G.722.1 coder, while it also has interoperability with G.723.1.

UR - http://www.scopus.com/inward/record.url?scp=4544367819&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=4544367819&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:4544367819

VL - 1

SP - I285-I288

JO - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing

JF - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing

SN - 0736-7791

ER -