Perceptual relevance of the temporal envelope to the speech signal in the 4-7 kHz band

Kyung Tae Kim, Jeung Yoon Choi, Hong Goo Kang

Research output: Contribution to journal (Article)

5 Citations (Scopus)

Abstract

This letter describes the perceptual relevance of using the temporal envelope to model the 4-7 kHz band (high band) of wideband speech signals. Drawing on theoretical work in psychoacoustics, we find that the temporal envelope can indeed serve as a perceptual cue for the high-band signal, i.e., a noiseless sound can be obtained if the temporal envelope is roughly preserved. Subjective listening tests verify that transparent quality can be obtained when the model is used for the 4.5-7 kHz band. The proposed model offers flexible scalability and reduces the cost of quantization in coding applications.
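
The idea of roughly preserving the temporal envelope can be illustrated with a minimal sketch. It is not the authors' model: the abstract does not specify how the envelope is computed or what carrier is used, so the segment-wise RMS envelope, the 2.5 ms segment length, the white-noise carrier, and the names `segment_rms` and `envelope_shaped_noise` are all assumptions made for illustration. The input is assumed to be already band-limited to the high band, and the noise carrier here is not band-limited.

```python
import math
import random


def segment_rms(signal, seg_len):
    """Coarse temporal envelope: one RMS value per non-overlapping segment."""
    env = []
    for start in range(0, len(signal), seg_len):
        seg = signal[start:start + seg_len]
        env.append(math.sqrt(sum(x * x for x in seg) / len(seg)))
    return env


def envelope_shaped_noise(envelope, seg_len, total_len, seed=0):
    """Scale white noise so that each segment's RMS matches the envelope."""
    rng = random.Random(seed)
    out = []
    for i, target in enumerate(envelope):
        # The last segment may be shorter than seg_len.
        n = min(seg_len, total_len - i * seg_len)
        noise = [rng.gauss(0.0, 1.0) for _ in range(n)]
        rms = math.sqrt(sum(x * x for x in noise) / n)
        gain = target / rms if rms > 0.0 else 0.0
        out.extend(x * gain for x in noise)
    return out


# Toy stand-in for a high-band signal (assumption): a decaying 5.5 kHz tone
# sampled at 16 kHz, 0.1 s long.
fs = 16000
seg_len = 40  # 2.5 ms at 16 kHz (illustrative choice, not from the paper)
highband = [
    math.exp(-20.0 * n / fs) * math.sin(2.0 * math.pi * 5500.0 * n / fs)
    for n in range(1600)
]

env = segment_rms(highband, seg_len)
synth = envelope_shaped_noise(env, seg_len, len(highband))
```

The synthesized signal discards the fine structure of the input and keeps only the per-segment energy contour, which is the kind of coarse description that would be cheaper to quantize than the waveform itself.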

Original language: English
Pages (from-to): EL88-EL94
Journal: Journal of the Acoustical Society of America
Volume: 122
Issue number: 3
DOIs: 10.1121/1.2761927
Publication status: Published - 2007 Oct 17

Fingerprint

envelopes
psychoacoustics
cues
coding
broadband
costs
acoustics
Perceptual Cues
Costs
Quantization
Sound
Psychoacoustics

All Science Journal Classification (ASJC) codes

  • Arts and Humanities (miscellaneous)
  • Acoustics and Ultrasonics

Cite this

@article{d87c0e51b46a40fbae36dd9329fce35f,
title = "Perceptual relevance of the temporal envelope to the speech signal in the 4-7 kHz band",
abstract = "The perceptual relevance of adopting the temporal envelope to model the frequency band of 4-7 kHz (highband) in wideband speech signal is described in this letter. Based on theoretical work in psychoacoustics, we find out that the temporal envelope can indeed be a perceptual cue for the high-band signal, i.e., a noiseless sound can be obtained if the temporal envelope is roughly preserved. Subjective listening tests verify that transparent quality can be obtained if the model is used for the 4.5-7 kHz band. The proposed model has the benefits of offering flexible scalability and reducing the cost for quantization in coding applications.",
author = "Kim, {Kyung Tae} and Choi, {Jeung Yoon} and Kang, {Hong Goo}",
year = "2007",
month = "10",
day = "17",
doi = "10.1121/1.2761927",
language = "English",
volume = "122",
pages = "EL88--EL94",
journal = "Journal of the Acoustical Society of America",
issn = "0001-4966",
publisher = "Acoustical Society of America",
number = "3",

}

Perceptual relevance of the temporal envelope to the speech signal in the 4-7 kHz band. / Kim, Kyung Tae; Choi, Jeung Yoon; Kang, Hong Goo.

In: Journal of the Acoustical Society of America, Vol. 122, No. 3, 17.10.2007, p. EL88-EL94.

TY - JOUR

T1 - Perceptual relevance of the temporal envelope to the speech signal in the 4-7 kHz band

AU - Kim, Kyung Tae

AU - Choi, Jeung Yoon

AU - Kang, Hong Goo

PY - 2007/10/17

Y1 - 2007/10/17

N2 - The perceptual relevance of adopting the temporal envelope to model the frequency band of 4-7 kHz (highband) in wideband speech signal is described in this letter. Based on theoretical work in psychoacoustics, we find out that the temporal envelope can indeed be a perceptual cue for the high-band signal, i.e., a noiseless sound can be obtained if the temporal envelope is roughly preserved. Subjective listening tests verify that transparent quality can be obtained if the model is used for the 4.5-7 kHz band. The proposed model has the benefits of offering flexible scalability and reducing the cost for quantization in coding applications.

UR - http://www.scopus.com/inward/record.url?scp=35248812994&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=35248812994&partnerID=8YFLogxK

U2 - 10.1121/1.2761927

DO - 10.1121/1.2761927

M3 - Article

C2 - 17927313

AN - SCOPUS:35248812994

VL - 122

SP - EL88-EL94

JO - Journal of the Acoustical Society of America

JF - Journal of the Acoustical Society of America

SN - 0001-4966

IS - 3

ER -