Knowledge-based manner class segmentation based on the acoustic event and landmark detection algorithm

Jung In Lee, Jeung Yoon Choi, Hong Goo Kang

Research output: Contribution to journalArticle

Abstract

There have been steady demands for a speech segmentation method to handle various speech applications. Conventional segmentation algorithms show reliable performance but they require a sufficient training database. This letter proposes a manner class segmentation method based on the acoustic event and landmark detection used in the knowledge-based speech recognition system. Measurements of sub-band abruptness and additional parameters are used to detect the acoustic events. Candidates of manner classes are segmented from the acoustic events and determined based on the knowledge of acoustic phonetics and acoustic parameters. Manners of vowel/glide, nasal, fricative, stop burst, stop closure, and silence are segmented in this system. In total, 71% of manner classes are correctly segmented with 20-ms error boundaries.

Original languageEnglish
Pages (from-to)1682-1685
Number of pages4
JournalIEICE Transactions on Information and Systems
VolumeE97-D
Issue number6
DOIs
Publication statusPublished - 2014 Jun

Fingerprint

Acoustics
Speech analysis
Speech recognition

All Science Journal Classification (ASJC) codes

  • Software
  • Hardware and Architecture
  • Computer Vision and Pattern Recognition
  • Electrical and Electronic Engineering
  • Artificial Intelligence

Cite this

@article{9d49f72321724268a8118bf601a34c65,
title = "Knowledge-based manner class segmentation based on the acoustic event and landmark detection algorithm",
abstract = "There have been steady demands for a speech segmentation method to handle various speech applications. Conventional segmentation algorithms show reliable performance but they require a sufficient training database. This letter proposes a manner class segmentation method based on the acoustic event and landmark detection used in the knowledge-based speech recognition system. Measurements of sub-band abruptness and additional parameters are used to detect the acoustic events. Candidates of manner classes are segmented from the acoustic events and determined based on the knowledge of acoustic phonetics and acoustic parameters. Manners of vowel/glide, nasal, fricative, stop burst, stop closure, and silence are segmented in this system. In total, 71{\%} of manner classes are correctly segmented with 20-ms error boundaries.",
author = "Lee, {Jung In} and Choi, {Jeung Yoon} and Kang, {Hong Goo}",
year = "2014",
month = "6",
doi = "10.1587/transinf.E97.D.1682",
language = "English",
volume = "E97-D",
pages = "1682--1685",
journal = "IEICE Transactions on Information and Systems",
issn = "0916-8532",
publisher = "Maruzen Co., Ltd/Maruzen Kabushikikaisha",
number = "6",

}

Knowledge-based manner class segmentation based on the acoustic event and landmark detection algorithm. / Lee, Jung In; Choi, Jeung Yoon; Kang, Hong Goo.

In: IEICE Transactions on Information and Systems, Vol. E97-D, No. 6, 06.2014, p. 1682-1685.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Knowledge-based manner class segmentation based on the acoustic event and landmark detection algorithm

AU - Lee, Jung In

AU - Choi, Jeung Yoon

AU - Kang, Hong Goo

PY - 2014/6

Y1 - 2014/6

N2 - There have been steady demands for a speech segmentation method to handle various speech applications. Conventional segmentation algorithms show reliable performance but they require a sufficient training database. This letter proposes a manner class segmentation method based on the acoustic event and landmark detection used in the knowledge-based speech recognition system. Measurements of sub-band abruptness and additional parameters are used to detect the acoustic events. Candidates of manner classes are segmented from the acoustic events and determined based on the knowledge of acoustic phonetics and acoustic parameters. Manners of vowel/glide, nasal, fricative, stop burst, stop closure, and silence are segmented in this system. In total, 71% of manner classes are correctly segmented with 20-ms error boundaries.

AB - There have been steady demands for a speech segmentation method to handle various speech applications. Conventional segmentation algorithms show reliable performance but they require a sufficient training database. This letter proposes a manner class segmentation method based on the acoustic event and landmark detection used in the knowledge-based speech recognition system. Measurements of sub-band abruptness and additional parameters are used to detect the acoustic events. Candidates of manner classes are segmented from the acoustic events and determined based on the knowledge of acoustic phonetics and acoustic parameters. Manners of vowel/glide, nasal, fricative, stop burst, stop closure, and silence are segmented in this system. In total, 71% of manner classes are correctly segmented with 20-ms error boundaries.

UR - http://www.scopus.com/inward/record.url?scp=84901766723&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84901766723&partnerID=8YFLogxK

U2 - 10.1587/transinf.E97.D.1682

DO - 10.1587/transinf.E97.D.1682

M3 - Article

AN - SCOPUS:84901766723

VL - E97-D

SP - 1682

EP - 1685

JO - IEICE Transactions on Information and Systems

JF - IEICE Transactions on Information and Systems

SN - 0916-8532

IS - 6

ER -