A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification

Chi Sang Jung, Kyu J. Han, Hyunson Seo, Shrikanth S. Narayanan, Hong Goo Kang

Research output: Contribution to conferencePaper

6 Citations (Scopus)

Abstract

In this paper, we propose a spectral kurtosis based approach to extract features with a variable frame length and rate for speaker verification. Since the speaker-specific information of features in each frame changes depending upon the characteristics of speech, it is important to determine the appropriate frame length and rate to extract the salient feature frames. In order to distinctively represent the characteristics of vowels and consonants both in time and frequency domains, we introduce a variable frame length and rate (VFLR) method based on spectral kurtosis, which provides a local measure of time-frequency concentration. Experimental results verify that the proposed VFLR method improves the performance of the speaker verification system on the NIST SRE-06 database by 9.725% (relative) compared to the feature extraction method with the fixed length and rate.

Original languageEnglish
Pages2754-2757
Number of pages4
Publication statusPublished - 2010 Dec 1
Event11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010 - Makuhari, Chiba, Japan
Duration: 2010 Sep 262010 Sep 30

Other

Other11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010
CountryJapan
CityMakuhari, Chiba
Period10/9/2610/9/30

Fingerprint

Databases
Spectrality
Length
Data Base
Feature Extraction
Consonant
Salient

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Speech and Hearing

Cite this

Jung, C. S., Han, K. J., Seo, H., Narayanan, S. S., & Kang, H. G. (2010). A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification. 2754-2757. Paper presented at 11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010, Makuhari, Chiba, Japan.
Jung, Chi Sang ; Han, Kyu J. ; Seo, Hyunson ; Narayanan, Shrikanth S. ; Kang, Hong Goo. / A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification. Paper presented at 11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010, Makuhari, Chiba, Japan.4 p.
@conference{cd5d5073ca4541848e97deff5490bc9e,
title = "A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification",
abstract = "In this paper, we propose a spectral kurtosis based approach to extract features with a variable frame length and rate for speaker verification. Since the speaker-specific information of features in each frame changes depending upon the characteristics of speech, it is important to determine the appropriate frame length and rate to extract the salient feature frames. In order to distinctively represent the characteristics of vowels and consonants both in time and frequency domains, we introduce a variable frame length and rate (VFLR) method based on spectral kurtosis, which provides a local measure of time-frequency concentration. Experimental results verify that the proposed VFLR method improves the performance of the speaker verification system on the NIST SRE-06 database by 9.725{\%} (relative) compared to the feature extraction method with the fixed length and rate.",
author = "Jung, {Chi Sang} and Han, {Kyu J.} and Hyunson Seo and Narayanan, {Shrikanth S.} and Kang, {Hong Goo}",
year = "2010",
month = "12",
day = "1",
language = "English",
pages = "2754--2757",
note = "11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010 ; Conference date: 26-09-2010 Through 30-09-2010",

}

Jung, CS, Han, KJ, Seo, H, Narayanan, SS & Kang, HG 2010, 'A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification' Paper presented at 11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010, Makuhari, Chiba, Japan, 10/9/26 - 10/9/30, pp. 2754-2757.

A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification. / Jung, Chi Sang; Han, Kyu J.; Seo, Hyunson; Narayanan, Shrikanth S.; Kang, Hong Goo.

2010. 2754-2757 Paper presented at 11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010, Makuhari, Chiba, Japan.

Research output: Contribution to conferencePaper

TY - CONF

T1 - A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification

AU - Jung, Chi Sang

AU - Han, Kyu J.

AU - Seo, Hyunson

AU - Narayanan, Shrikanth S.

AU - Kang, Hong Goo

PY - 2010/12/1

Y1 - 2010/12/1

N2 - In this paper, we propose a spectral kurtosis based approach to extract features with a variable frame length and rate for speaker verification. Since the speaker-specific information of features in each frame changes depending upon the characteristics of speech, it is important to determine the appropriate frame length and rate to extract the salient feature frames. In order to distinctively represent the characteristics of vowels and consonants both in time and frequency domains, we introduce a variable frame length and rate (VFLR) method based on spectral kurtosis, which provides a local measure of time-frequency concentration. Experimental results verify that the proposed VFLR method improves the performance of the speaker verification system on the NIST SRE-06 database by 9.725% (relative) compared to the feature extraction method with the fixed length and rate.

AB - In this paper, we propose a spectral kurtosis based approach to extract features with a variable frame length and rate for speaker verification. Since the speaker-specific information of features in each frame changes depending upon the characteristics of speech, it is important to determine the appropriate frame length and rate to extract the salient feature frames. In order to distinctively represent the characteristics of vowels and consonants both in time and frequency domains, we introduce a variable frame length and rate (VFLR) method based on spectral kurtosis, which provides a local measure of time-frequency concentration. Experimental results verify that the proposed VFLR method improves the performance of the speaker verification system on the NIST SRE-06 database by 9.725% (relative) compared to the feature extraction method with the fixed length and rate.

UR - http://www.scopus.com/inward/record.url?scp=79959823356&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79959823356&partnerID=8YFLogxK

M3 - Paper

SP - 2754

EP - 2757

ER -

Jung CS, Han KJ, Seo H, Narayanan SS, Kang HG. A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification. 2010. Paper presented at 11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010, Makuhari, Chiba, Japan.