Temporal normalization techniques for transform-type speech coding and application to split-band wideband coders

Kyung Tae Kim, Sung Kyo Jung, Mi Suk Lee, Hong Goo Kang, Dae Hee Youn

Research output: Contribution to conference (Paper)

Abstract

In this paper we present an efficient coding method for the upper band (4-7 kHz) of wideband (0.5-7 kHz) speech coding based on a band-split approach. Because the upper-band signal has impulse-like characteristics, it is very difficult to quantize efficiently at low bit rates with transform coding techniques. We propose two temporal normalization techniques, direct temporal energy normalization and frequency domain linear prediction, to reduce the resulting artifacts, which are extremely noticeable. Simulation results show that the proposed algorithm successfully encodes the upper-band signal, and that a new split-band wideband coder adopting the proposed technology provides better quality at 20 kbit/s than ITU-T G.722 at 56 kbit/s.
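The abstract names direct temporal energy normalization but does not spell out the procedure. The following is a minimal Python sketch of the general idea only, not the authors' algorithm: a frame is split into sub-blocks, each sub-block is divided by its RMS energy so the temporal envelope is flattened before transform coding, and the gains are kept as side information so a decoder can restore the envelope. The function names, the number of sub-blocks, and the energy floor are all assumptions made for illustration.

```python
import math


def temporal_energy_normalize(frame, num_blocks=4, floor=1e-9):
    """Flatten the temporal envelope of one frame (illustrative sketch).

    Splits `frame` into `num_blocks` equal sub-blocks, measures the RMS of
    each sub-block, and divides its samples by that RMS. Returns the
    normalized samples and the per-block gains (hypothetical side info).
    """
    if len(frame) % num_blocks != 0:
        raise ValueError("frame length must be a multiple of num_blocks")
    size = len(frame) // num_blocks
    normalized, gains = [], []
    for b in range(num_blocks):
        block = frame[b * size:(b + 1) * size]
        rms = math.sqrt(sum(x * x for x in block) / size)
        gain = max(rms, floor)  # floor avoids division by zero on silence
        gains.append(gain)
        normalized.extend(x / gain for x in block)
    return normalized, gains


def temporal_energy_denormalize(normalized, gains):
    """Restore the envelope by multiplying each sub-block by its gain."""
    size = len(normalized) // len(gains)
    return [x * gains[i // size] for i, x in enumerate(normalized)]


# An impulse-like frame: one loud sub-block surrounded by quiet ones.
frame = [0.01, -0.01, 0.01, -0.01,
         0.9, -0.8, 0.7, -0.9,
         0.01, 0.01, -0.01, -0.01,
         0.02, -0.02, 0.02, -0.02]
flat, gains = temporal_energy_normalize(frame)
restored = temporal_energy_denormalize(flat, gains)
```

After normalization every sub-block has roughly unit RMS, so a transform quantizer no longer has to spend its bits on a single energy burst. In a real coder the gains would themselves be quantized and transmitted, which this sketch leaves out.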

Original language: English
Pages: 2661-2664
Number of pages: 4
Publication status: Published - 2004 Jan 1
Event: 8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, Korea, Republic of
Duration: 2004 Oct 4 to 2004 Oct 8

Other

Other: 8th International Conference on Spoken Language Processing, ICSLP 2004
Country: Korea, Republic of
City: Jeju, Jeju Island
Period: 2004 Oct 4 to 2004 Oct 8

Fingerprint

normalization
coding
artifact
energy
simulation
Split

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Linguistics and Language

Cite this

Kim, K. T., Jung, S. K., Lee, M. S., Kang, H. G., & Youn, D. H. (2004). Temporal normalization techniques for transform-type speech coding and application to split-band wideband coders. 2661-2664. Paper presented at 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of.
Kim, Kyung Tae; Jung, Sung Kyo; Lee, Mi Suk; Kang, Hong Goo; Youn, Dae Hee. / Temporal normalization techniques for transform-type speech coding and application to split-band wideband coders. Paper presented at 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of. 4 p.
@conference{f0859dbe277541ddb7d593cd6acaadb6,
title = "Temporal normalization techniques for transform-type speech coding and application to split-band wideband coders",
abstract = "In this paper we present an efficient coding method for the upper band (4-7 kHz) of wideband (0.5-7 kHz) speech coding based on a band-split approach. Because the upper-band signal has impulse-like characteristics, it is very difficult to quantize efficiently at low bit rates with transform coding techniques. We propose two temporal normalization techniques, direct temporal energy normalization and frequency domain linear prediction, to reduce the resulting artifacts, which are extremely noticeable. Simulation results show that the proposed algorithm successfully encodes the upper-band signal, and that a new split-band wideband coder adopting the proposed technology provides better quality at 20 kbit/s than ITU-T G.722 at 56 kbit/s.",
author = "Kim, {Kyung Tae} and Jung, {Sung Kyo} and Lee, {Mi Suk} and Kang, {Hong Goo} and Youn, {Dae Hee}",
year = "2004",
month = "1",
day = "1",
language = "English",
pages = "2661--2664",
note = "8th International Conference on Spoken Language Processing, ICSLP 2004 ; Conference date: 04-10-2004 Through 08-10-2004",

}

Kim, KT, Jung, SK, Lee, MS, Kang, HG & Youn, DH 2004, 'Temporal normalization techniques for transform-type speech coding and application to split-band wideband coders', Paper presented at 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of, 04/10/4 - 04/10/8, pp. 2661-2664.

TY - CONF

T1 - Temporal normalization techniques for transform-type speech coding and application to split-band wideband coders

AU - Kim, Kyung Tae

AU - Jung, Sung Kyo

AU - Lee, Mi Suk

AU - Kang, Hong Goo

AU - Youn, Dae Hee

PY - 2004/1/1

Y1 - 2004/1/1

N2 - In this paper we present an efficient coding method for the upper band (4-7 kHz) of wideband (0.5-7 kHz) speech coding based on a band-split approach. Because the upper-band signal has impulse-like characteristics, it is very difficult to quantize efficiently at low bit rates with transform coding techniques. We propose two temporal normalization techniques, direct temporal energy normalization and frequency domain linear prediction, to reduce the resulting artifacts, which are extremely noticeable. Simulation results show that the proposed algorithm successfully encodes the upper-band signal, and that a new split-band wideband coder adopting the proposed technology provides better quality at 20 kbit/s than ITU-T G.722 at 56 kbit/s.

UR - http://www.scopus.com/inward/record.url?scp=85009069230&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85009069230&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:85009069230

SP - 2661

EP - 2664

ER -

Kim KT, Jung SK, Lee MS, Kang HG, Youn DH. Temporal normalization techniques for transform-type speech coding and application to split-band wideband coders. 2004. Paper presented at 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of.