FEATURE VECTOR EXTRACTION METHOD FOR SPEAKER RECOGNITION SYSTEMS

Yadigar N.İmamverdiyev, Lyudmila V.Suxostat

About the Journal Editorial Board For authors Publication ethics Archive Abstracting & Indexing Contact us

№2, 2012

FEATURE VECTOR EXTRACTION METHOD FOR SPEAKER RECOGNITION SYSTEMS

Yadigar N.İmamverdiyev, Lyudmila V.Suxostat

yadigar@lan.ab.az, lsuhostat@hotmail.com

Development of methods increasing the accuracy of speaker recognition system is the main objective of the research. Feature vector extraction is one of the most important stages of speaker recognition system. In this paper we propose a feature extraction method based on Hilbert-Huang transform considering instability and non-linearity of human speech. (pp. 14-19)

Keywords: speaker recognition, speech feature extraction, Hilbert-Huang transform, Intrinsic Mode Function

View article(7562)

References

Benesty J., Sondhi M., Huang Y. Springer handbook of speech processing. Springer, 2007, 1176 p.
Johnson K., Speaker Normalization in speech perception, In Pisoni, D.B. & Remez, R. (eds) The Handbook of Speech Perception. Oxford: Blackwell Publishers, 2005, pp. 363‒389.
Mason J.S., Thompson J. Gender effects in speaker recognition // Proc. ICSP-93, Beijing, pp. 733‒736, 1993.
Doddington G. Speaker recognition based on idiolectal differences between speakers // Proc. of Eurospeech, vol. 4, pp. 2521‒2524, 2001.
Маркел Дж., Грей А.Х. Линейное предсказание речи. М.: Связь, 1980, 308 с.
Furui S. Cepstral analysis techniques for automatic speaker verification // IEEE tran. acoust., speech, signal processing, vol. 27, pp. 254‒272, 1981.
Kinnunen T. Spectral features for automatic text-independent speaker recognition, Licentiate thesis, Department of Computer Science, University of Joensuu, Joensuu, Finland, 2003.
Kinnunen T., Li H. An overview of text-independent speaker recognition: from features to supervectors // Speech Communication, 2010, vol. 52, no. 1, pp. 12‒40.
Huang N.E. Hilbert-Huang Transform and its applications, World Scientific Publishing, 2005.
Li X., Li D., Liang Z., Voss L.J., Sleigh J.W. Analysis of depth of anesthesia with Hilbert-Huang spectral entropy // Clin Neurophysiol, 2008, no. 119, pp. 2465‒2475.
Ward N.C., Dersch D.R. Text-independent speaker identification and verification using the TIMIT database // Proc. of ICSLP, 1998, v. 2, pp. 233‒237.
Przybocki M.A., Martin A.F. NIST Speaker Recognition Evaluation chronicles // Proc. Odyssey 2004: The Speaker and Language Recognition Workshop, Toledo, Spain, June 2004.