TY - GEN
T1 - Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system
AU - Song, Eunwoo
AU - Joo, Young Sun
AU - Kang, Hong Goo
N1 - Publisher Copyright:
© 2015 IEEE.
Copyright:
Copyright 2015 Elsevier B.V., All rights reserved.
PY - 2015/8/4
Y1 - 2015/8/4
N2 - This paper proposes an improved time-frequency trajectory excitation (TFTE) modeling method for a statistical parametric speech synthesis system. The proposed approach overcomes the dimensional variation problem of the training process caused by the inherent nature of the pitch-dependent analysis paradigm. By reducing the redundancies of the parameters using predicted average block coefficients (PABC), the proposed algorithm efficiently models excitation, even if its dimension is varied. Objective and subjective test results verify that the proposed algorithm provides not only robustness to the training process but also naturalness to the synthesized speech.
AB - This paper proposes an improved time-frequency trajectory excitation (TFTE) modeling method for a statistical parametric speech synthesis system. The proposed approach overcomes the dimensional variation problem of the training process caused by the inherent nature of the pitch-dependent analysis paradigm. By reducing the redundancies of the parameters using predicted average block coefficients (PABC), the proposed algorithm efficiently models excitation, even if its dimension is varied. Objective and subjective test results verify that the proposed algorithm provides not only robustness to the training process but also naturalness to the synthesized speech.
UR - http://www.scopus.com/inward/record.url?scp=84946032846&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84946032846&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2015.7178912
DO - 10.1109/ICASSP.2015.7178912
M3 - Conference contribution
AN - SCOPUS:84946032846
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 4949
EP - 4953
BT - 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015
Y2 - 19 April 2014 through 24 April 2014
ER -