Duration modeling using cumulative duration probability

Tae Young Yang, Chungyong Lee, Dae Hee Youn

Research output: Contribution to journalArticle

Abstract

A duration modeling technique is proposed for the HMM based connected digit recognizer. The proposed duration modeling technique uses a cumulative duration probability. The cumulative duration probability is defined as the partial sum of the duration probabilities which can be estimated from the training speech data. Two approaches of using it are presented. First, the cumulative duration probability is used as a weighting factor to the state transition probability of HMM. Second, it replaces the conventional state transition probability. In both approaches, the cumulative duration probability is combined directly to the Viterbi decoding procedure. A modified Viterbi decoding procedure is also presented. One of the advantages of the proposed duration modeling technique is that the cumulative duration probability rules the transitions of states and words at each frame. Therefore, an additional post-procedure is not required. The proposed technique was examined by recognition experiments on Korean connected digit. Experimental results showed that two approach achieved almost same performances and that the average recognition accuracy was enhanced from 83.60% to 93.12%.

Original languageEnglish
Pages (from-to)1452-1454
Number of pages3
JournalIEICE Transactions on Information and Systems
VolumeE85-D
Issue number9
Publication statusPublished - 2002 Jan 1

Fingerprint

Decoding
Experiments

All Science Journal Classification (ASJC) codes

  • Software
  • Hardware and Architecture
  • Computer Vision and Pattern Recognition
  • Electrical and Electronic Engineering
  • Artificial Intelligence

Cite this

Yang, Tae Young ; Lee, Chungyong ; Youn, Dae Hee. / Duration modeling using cumulative duration probability. In: IEICE Transactions on Information and Systems. 2002 ; Vol. E85-D, No. 9. pp. 1452-1454.
@article{24f97fde79b441e4ad070fc483104430,
title = "Duration modeling using cumulative duration probability",
abstract = "A duration modeling technique is proposed for the HMM based connected digit recognizer. The proposed duration modeling technique uses a cumulative duration probability. The cumulative duration probability is defined as the partial sum of the duration probabilities which can be estimated from the training speech data. Two approaches of using it are presented. First, the cumulative duration probability is used as a weighting factor to the state transition probability of HMM. Second, it replaces the conventional state transition probability. In both approaches, the cumulative duration probability is combined directly to the Viterbi decoding procedure. A modified Viterbi decoding procedure is also presented. One of the advantages of the proposed duration modeling technique is that the cumulative duration probability rules the transitions of states and words at each frame. Therefore, an additional post-procedure is not required. The proposed technique was examined by recognition experiments on Korean connected digit. Experimental results showed that two approach achieved almost same performances and that the average recognition accuracy was enhanced from 83.60{\%} to 93.12{\%}.",
author = "Yang, {Tae Young} and Chungyong Lee and Youn, {Dae Hee}",
year = "2002",
month = "1",
day = "1",
language = "English",
volume = "E85-D",
pages = "1452--1454",
journal = "IEICE Transactions on Information and Systems",
issn = "0916-8532",
publisher = "Maruzen Co., Ltd/Maruzen Kabushikikaisha",
number = "9",

}

Duration modeling using cumulative duration probability. / Yang, Tae Young; Lee, Chungyong; Youn, Dae Hee.

In: IEICE Transactions on Information and Systems, Vol. E85-D, No. 9, 01.01.2002, p. 1452-1454.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Duration modeling using cumulative duration probability

AU - Yang, Tae Young

AU - Lee, Chungyong

AU - Youn, Dae Hee

PY - 2002/1/1

Y1 - 2002/1/1

N2 - A duration modeling technique is proposed for the HMM based connected digit recognizer. The proposed duration modeling technique uses a cumulative duration probability. The cumulative duration probability is defined as the partial sum of the duration probabilities which can be estimated from the training speech data. Two approaches of using it are presented. First, the cumulative duration probability is used as a weighting factor to the state transition probability of HMM. Second, it replaces the conventional state transition probability. In both approaches, the cumulative duration probability is combined directly to the Viterbi decoding procedure. A modified Viterbi decoding procedure is also presented. One of the advantages of the proposed duration modeling technique is that the cumulative duration probability rules the transitions of states and words at each frame. Therefore, an additional post-procedure is not required. The proposed technique was examined by recognition experiments on Korean connected digit. Experimental results showed that two approach achieved almost same performances and that the average recognition accuracy was enhanced from 83.60% to 93.12%.

AB - A duration modeling technique is proposed for the HMM based connected digit recognizer. The proposed duration modeling technique uses a cumulative duration probability. The cumulative duration probability is defined as the partial sum of the duration probabilities which can be estimated from the training speech data. Two approaches of using it are presented. First, the cumulative duration probability is used as a weighting factor to the state transition probability of HMM. Second, it replaces the conventional state transition probability. In both approaches, the cumulative duration probability is combined directly to the Viterbi decoding procedure. A modified Viterbi decoding procedure is also presented. One of the advantages of the proposed duration modeling technique is that the cumulative duration probability rules the transitions of states and words at each frame. Therefore, an additional post-procedure is not required. The proposed technique was examined by recognition experiments on Korean connected digit. Experimental results showed that two approach achieved almost same performances and that the average recognition accuracy was enhanced from 83.60% to 93.12%.

UR - http://www.scopus.com/inward/record.url?scp=0036737839&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0036737839&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:0036737839

VL - E85-D

SP - 1452

EP - 1454

JO - IEICE Transactions on Information and Systems

JF - IEICE Transactions on Information and Systems

SN - 0916-8532

IS - 9

ER -