Speaker recognition based on transformed line spectral frequencies

Bong Jin Lee, Samuel Kim, Hong-Goo Kang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Citations (Scopus)

Abstract

In this paper, line spectral frequencies (LSF) and five types of transformed LSF are studied for robust text-independent speaker identification. Transformations are constructed by considering the physical aspect of vocal tract. They are location of formants/nulls, bandwidth of formants/nulls, bandwidth and location of formants, bandwidth and location of nulls, and interval of adjacent formant and null location. Identification tests using TIMIT database verify that all features are useful for speaker recognition, especially the bandwidth and location of formants show the best performance. Simulation results also show that LSF and some of transformed LSF give better performance than MFCC.

Original languageEnglish
Title of host publicationProceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004
EditorsS.J. Ko
Pages177-180
Number of pages4
Publication statusPublished - 2004 Dec 1
EventProceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004 - Seoul, Korea, Republic of
Duration: 2004 Nov 182004 Nov 19

Publication series

NameProceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004

Other

OtherProceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004
CountryKorea, Republic of
CitySeoul
Period04/11/1804/11/19

Fingerprint

Bandwidth

All Science Journal Classification (ASJC) codes

  • Engineering(all)

Cite this

Lee, B. J., Kim, S., & Kang, H-G. (2004). Speaker recognition based on transformed line spectral frequencies. In S. J. Ko (Ed.), Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004 (pp. 177-180). (Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004).
Lee, Bong Jin ; Kim, Samuel ; Kang, Hong-Goo. / Speaker recognition based on transformed line spectral frequencies. Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004. editor / S.J. Ko. 2004. pp. 177-180 (Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004).
@inproceedings{4c16a7de03bf4284b9f20829e80294ed,
title = "Speaker recognition based on transformed line spectral frequencies",
abstract = "In this paper, line spectral frequencies (LSF) and five types of transformed LSF are studied for robust text-independent speaker identification. Transformations are constructed by considering the physical aspect of vocal tract. They are location of formants/nulls, bandwidth of formants/nulls, bandwidth and location of formants, bandwidth and location of nulls, and interval of adjacent formant and null location. Identification tests using TIMIT database verify that all features are useful for speaker recognition, especially the bandwidth and location of formants show the best performance. Simulation results also show that LSF and some of transformed LSF give better performance than MFCC.",
author = "Lee, {Bong Jin} and Samuel Kim and Hong-Goo Kang",
year = "2004",
month = "12",
day = "1",
language = "English",
isbn = "0780386396",
series = "Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004",
pages = "177--180",
editor = "S.J. Ko",
booktitle = "Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004",

}

Lee, BJ, Kim, S & Kang, H-G 2004, Speaker recognition based on transformed line spectral frequencies. in SJ Ko (ed.), Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004. Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004, pp. 177-180, Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004, Seoul, Korea, Republic of, 04/11/18.

Speaker recognition based on transformed line spectral frequencies. / Lee, Bong Jin; Kim, Samuel; Kang, Hong-Goo.

Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004. ed. / S.J. Ko. 2004. p. 177-180 (Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Speaker recognition based on transformed line spectral frequencies

AU - Lee, Bong Jin

AU - Kim, Samuel

AU - Kang, Hong-Goo

PY - 2004/12/1

Y1 - 2004/12/1

N2 - In this paper, line spectral frequencies (LSF) and five types of transformed LSF are studied for robust text-independent speaker identification. Transformations are constructed by considering the physical aspect of vocal tract. They are location of formants/nulls, bandwidth of formants/nulls, bandwidth and location of formants, bandwidth and location of nulls, and interval of adjacent formant and null location. Identification tests using TIMIT database verify that all features are useful for speaker recognition, especially the bandwidth and location of formants show the best performance. Simulation results also show that LSF and some of transformed LSF give better performance than MFCC.

AB - In this paper, line spectral frequencies (LSF) and five types of transformed LSF are studied for robust text-independent speaker identification. Transformations are constructed by considering the physical aspect of vocal tract. They are location of formants/nulls, bandwidth of formants/nulls, bandwidth and location of formants, bandwidth and location of nulls, and interval of adjacent formant and null location. Identification tests using TIMIT database verify that all features are useful for speaker recognition, especially the bandwidth and location of formants show the best performance. Simulation results also show that LSF and some of transformed LSF give better performance than MFCC.

UR - http://www.scopus.com/inward/record.url?scp=21544467018&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=21544467018&partnerID=8YFLogxK

M3 - Conference contribution

SN - 0780386396

T3 - Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004

SP - 177

EP - 180

BT - Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004

A2 - Ko, S.J.

ER -

Lee BJ, Kim S, Kang H-G. Speaker recognition based on transformed line spectral frequencies. In Ko SJ, editor, Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004. 2004. p. 177-180. (Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2004).