On the time variability of vocal tract for speaker recognition

Samuel Kim, Thomas Eriksson, Hong Goo Kang

Research output: Contribution to conferencePaper

2 Citations (Scopus)

Abstract

A novel scheme to analyze the effects of time variability of vocal tract for speaker recognition is proposed. We adopt a pitch synchronous feature extraction method to describe even more detailed characteristics of vocal tract, and decompose it into rapidly varying and slowly varying components with a specified linear filter along with time axis. Speaker identification tasks are performed with weighted combination of two decomposed feature sets and their corresponding models to show the efficiency of each decomposed feature set. Simulation results show that slowly varying components contain more speaker discriminative information than rapidly varying components do.

Original languageEnglish
Pages2377-2380
Number of pages4
Publication statusPublished - 2004 Jan 1
Event8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, Korea, Republic of
Duration: 2004 Oct 42004 Oct 8

Other

Other8th International Conference on Spoken Language Processing, ICSLP 2004
CountryKorea, Republic of
CityJeju, Jeju Island
Period04/10/404/10/8

Fingerprint

efficiency
simulation
time
Vocal Tract
Filter
Simulation
Feature Extraction

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Linguistics and Language

Cite this

Kim, S., Eriksson, T., & Kang, H. G. (2004). On the time variability of vocal tract for speaker recognition. 2377-2380. Paper presented at 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of.
Kim, Samuel ; Eriksson, Thomas ; Kang, Hong Goo. / On the time variability of vocal tract for speaker recognition. Paper presented at 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of.4 p.
@conference{393299c340e144d2985d17a8d7e68765,
title = "On the time variability of vocal tract for speaker recognition",
abstract = "A novel scheme to analyze the effects of time variability of vocal tract for speaker recognition is proposed. We adopt a pitch synchronous feature extraction method to describe even more detailed characteristics of vocal tract, and decompose it into rapidly varying and slowly varying components with a specified linear filter along with time axis. Speaker identification tasks are performed with weighted combination of two decomposed feature sets and their corresponding models to show the efficiency of each decomposed feature set. Simulation results show that slowly varying components contain more speaker discriminative information than rapidly varying components do.",
author = "Samuel Kim and Thomas Eriksson and Kang, {Hong Goo}",
year = "2004",
month = "1",
day = "1",
language = "English",
pages = "2377--2380",
note = "8th International Conference on Spoken Language Processing, ICSLP 2004 ; Conference date: 04-10-2004 Through 08-10-2004",

}

Kim, S, Eriksson, T & Kang, HG 2004, 'On the time variability of vocal tract for speaker recognition' Paper presented at 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of, 04/10/4 - 04/10/8, pp. 2377-2380.

On the time variability of vocal tract for speaker recognition. / Kim, Samuel; Eriksson, Thomas; Kang, Hong Goo.

2004. 2377-2380 Paper presented at 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of.

Research output: Contribution to conferencePaper

TY - CONF

T1 - On the time variability of vocal tract for speaker recognition

AU - Kim, Samuel

AU - Eriksson, Thomas

AU - Kang, Hong Goo

PY - 2004/1/1

Y1 - 2004/1/1

N2 - A novel scheme to analyze the effects of time variability of vocal tract for speaker recognition is proposed. We adopt a pitch synchronous feature extraction method to describe even more detailed characteristics of vocal tract, and decompose it into rapidly varying and slowly varying components with a specified linear filter along with time axis. Speaker identification tasks are performed with weighted combination of two decomposed feature sets and their corresponding models to show the efficiency of each decomposed feature set. Simulation results show that slowly varying components contain more speaker discriminative information than rapidly varying components do.

AB - A novel scheme to analyze the effects of time variability of vocal tract for speaker recognition is proposed. We adopt a pitch synchronous feature extraction method to describe even more detailed characteristics of vocal tract, and decompose it into rapidly varying and slowly varying components with a specified linear filter along with time axis. Speaker identification tasks are performed with weighted combination of two decomposed feature sets and their corresponding models to show the efficiency of each decomposed feature set. Simulation results show that slowly varying components contain more speaker discriminative information than rapidly varying components do.

UR - http://www.scopus.com/inward/record.url?scp=85009065118&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85009065118&partnerID=8YFLogxK

M3 - Paper

SP - 2377

EP - 2380

ER -

Kim S, Eriksson T, Kang HG. On the time variability of vocal tract for speaker recognition. 2004. Paper presented at 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of.