A source-filter based adaptive harmonic model and its application to speech prosody modification

Jee Sok Lee, Frank K. Soong, Hong Goo Kang

Research output: Contribution to journalConference article

1 Citation (Scopus)

Abstract

This paper presents a source-filter based adaptive harmonic model (aHM) that can modify prosody of given speech signals. Although the conventional aHM generates a homogeneous replication of the input speech, it is not suitable for prosody modification since temporal and spectral information are interweaved. The proposed method overcomes such limitation by further decomposing the harmonic parameter extracted from aHM into source and filter related components. By applying source-filter structure to aHM, the proposed algorithm can modify pitch of the synthesized speech with introducing only minor degradation. Both objective and subjective test results show that the proposed algorithm can naturally manipulate pitch contour, of which performance is much better than conventional algorithms such as pitch synchronous overlap add (PSOLA) and speech transformation and representation using adaptive interpolation of weighted spectrum (STRAIGHT).

Original languageEnglish
Pages (from-to)39-43
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publication statusPublished - 2013 Jan 1
Event14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013 - Lyon, France
Duration: 2013 Aug 252013 Aug 29

Fingerprint

Prosody
Harmonic
Filter
Model
Speech Signal
Interpolation
Replication
Overlap
Minor
Degradation
Interpolate
Speech
Harmonics

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Modelling and Simulation

Cite this

@article{0f4e628e673d4e60a4265f5da9466181,
title = "A source-filter based adaptive harmonic model and its application to speech prosody modification",
abstract = "This paper presents a source-filter based adaptive harmonic model (aHM) that can modify prosody of given speech signals. Although the conventional aHM generates a homogeneous replication of the input speech, it is not suitable for prosody modification since temporal and spectral information are interweaved. The proposed method overcomes such limitation by further decomposing the harmonic parameter extracted from aHM into source and filter related components. By applying source-filter structure to aHM, the proposed algorithm can modify pitch of the synthesized speech with introducing only minor degradation. Both objective and subjective test results show that the proposed algorithm can naturally manipulate pitch contour, of which performance is much better than conventional algorithms such as pitch synchronous overlap add (PSOLA) and speech transformation and representation using adaptive interpolation of weighted spectrum (STRAIGHT).",
author = "Lee, {Jee Sok} and Soong, {Frank K.} and Kang, {Hong Goo}",
year = "2013",
month = "1",
day = "1",
language = "English",
pages = "39--43",
journal = "Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH",
issn = "2308-457X",

}

TY - JOUR

T1 - A source-filter based adaptive harmonic model and its application to speech prosody modification

AU - Lee, Jee Sok

AU - Soong, Frank K.

AU - Kang, Hong Goo

PY - 2013/1/1

Y1 - 2013/1/1

N2 - This paper presents a source-filter based adaptive harmonic model (aHM) that can modify prosody of given speech signals. Although the conventional aHM generates a homogeneous replication of the input speech, it is not suitable for prosody modification since temporal and spectral information are interweaved. The proposed method overcomes such limitation by further decomposing the harmonic parameter extracted from aHM into source and filter related components. By applying source-filter structure to aHM, the proposed algorithm can modify pitch of the synthesized speech with introducing only minor degradation. Both objective and subjective test results show that the proposed algorithm can naturally manipulate pitch contour, of which performance is much better than conventional algorithms such as pitch synchronous overlap add (PSOLA) and speech transformation and representation using adaptive interpolation of weighted spectrum (STRAIGHT).

AB - This paper presents a source-filter based adaptive harmonic model (aHM) that can modify prosody of given speech signals. Although the conventional aHM generates a homogeneous replication of the input speech, it is not suitable for prosody modification since temporal and spectral information are interweaved. The proposed method overcomes such limitation by further decomposing the harmonic parameter extracted from aHM into source and filter related components. By applying source-filter structure to aHM, the proposed algorithm can modify pitch of the synthesized speech with introducing only minor degradation. Both objective and subjective test results show that the proposed algorithm can naturally manipulate pitch contour, of which performance is much better than conventional algorithms such as pitch synchronous overlap add (PSOLA) and speech transformation and representation using adaptive interpolation of weighted spectrum (STRAIGHT).

UR - http://www.scopus.com/inward/record.url?scp=84906259724&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84906259724&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:84906259724

SP - 39

EP - 43

JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

SN - 2308-457X

ER -