Analysis of Speaker Verification System Using Support Vector Machine

Authors

  • P Shanmugapriya Saranathan College of Engineering, Trichy, Tamilnadu, India
  • Y. Venkataramani

DOI:

https://doi.org/10.24297/jac.v13i10.5839

Abstract

The integration of GMM- super vector and Support Vector Machine (SVM) has become one of most popular strategy in text-independent speaker verification system.  This paper describes the application of Fuzzy Support Vector Machine (FSVM) for classification of speakers using GMM-super vectors. Super vectors are formed by stacking the mean vectors of adapted GMMs from UBM using maximum a posteriori (MAP). GMM super vectors characterize speaker’s acoustic characteristics which are used for developing a speaker dependent fuzzy SVM model. Introducing fuzzy theory in support vector machine yields better classification accuracy and requires less number of support vectors. Experiments were conducted on 2001 NIST speaker recognition evaluation corpus. Performance of GMM-FSVM based speaker verification system is compared with the conventional GMM-UBM and GMM-SVM based systems.  Experimental results indicate that the fuzzy SVM based speaker verification system with GMM super vector achieves better performance to GMM-UBM system.  

Downloads

Download data is not yet available.

Author Biography

P Shanmugapriya, Saranathan College of Engineering, Trichy, Tamilnadu, India

Electronics and Communication Engineering

References

[1] Vapnik.V.N., “Statistical Learning Theory”, New York, USA: Wiley, 1998. www.mit.edu/~6.454/www_spring_2001/emin/slt.pdf
[2] Cortes.C and Vapnik.V, “Support vector networks”, Machine Learning, 20, pp.273–297, 1995. Burges.C.J.C., “A tutorial on support vector machines for pattern recognition”, Data Min. Knowl. Disc. 2 (2), 121–167, 1998. research.microsoft.com/en-us/um/people/cburges/papers/SVMTutorial.pdf
[3] Blanz.V, Scholkopf.B, Bulthoff.H, Burges.C, Vapnik.V and Vetter.T, “Comparison of view–based object recognition algorithms using realistic 3D models”, Artificial Neural Networks—ICANN’96, pp. 251- 256, Berlin, 1996. http://dl.acm.org/citation.cfm?id=684894
[4] Schmidt.M, “Identifying speaker with support vector networks”, In Interface ’96 Proceedings, Sydney, 1996.
[5] Campbell.W.M, Sturim.D.E, Reynolds.D.A, “Support Vector Machines using GMM Supervectors for Speaker Verification”, IEEE Signal Processing Letters, 2006, 13, pp.308-311. https://www.researchgate.net/publication/3343440_Support_vector_machines_using_GMM_supervectors_for_speaker_verification
[6] Campbell.W.M, Sturim.D.E, Reynolds.D.A and Solomonoff. A, “SVM based Speaker Verification using a GMM Super vector Kernel and NAP Variability Compensation”, in Proceedings International Conference Acoustics, Speech, and Signal Processing, France, 2006, pp.97-100. https://www.ll.mit.edu/mission/cybersec/publications/.../060514_CampbellW.pdf
[7] Shi-Huang Chen, Yu-Ren Luo, “Speaker verification using MFCC and Support vector Machine”, Proceedings of the International Multiconference of Engineers and computer scientists, Hong Kong, vol. 1, March 18-20, 2009. www.iaeng.org/publication/IMECS2009/IMECS2009_pp532-535.pdf
[8] Kong Aik Lee,Chang Huai You, Haizhou Li and Tomi Kinnunen and Khe Chai Sim, “Using discrete probabilities with Bhattacharyya measure for SVM- based speaker verification”, IEEE Transaction on audio, speech and language processing, vol. 19, no. 4, May 2011. cs.uef.fi/sipu/pub/Bhattacharyya_TASL.pdf
[9] Pedro.J, Moreno, Purdy.P, Ho, “Verification Using Probabilistic Distance Kernels”, Eurospeech, 1-4 September, 2003. www.hpl.hp.com/techreports/2004/HPL-2004-7.html
[10] Man Wai Mak, Wei Rao, “Utterance Partitioning with acoustic vector resampling for GMM-SVM speaker verification”, Speech Communication, 53, pp. 119-130, 2011.
http://citeseerx.ist.psu.edu/viewdoc/citations;jsessionid=524637905DED778C54349B0B2BB0FEC7?doi=10.1.1.170.4398
[11] Zhao Jian, Dong Yuan, Zhao Xianyu, “Score Normalization and Language Robustness in Text Independent Multi-language Speaker Verification”, Lecture notes in computer science, 2007, pp.4681:1121-1130. ieeexplore.ieee.org/iel5/5971803/6074142/06074159.pdf
[12] Yao, Y., Frasconi, P., & Pontil, M. (2001). Fingerprint classification with combinations of support vector machines, AVBPA 2001, LNCS 2091, pp. 253–258. link.springer.com/chapter/10.1007/3-540-45344-X_37
[13] Campbell, W. M., Singer, E., Torres-Carrasquillo, P. A., & Reynolds, D. A. (2004). Language recognition with support vector machines, Odyssey 2004, May 31st– June 4th, Toledo, Spain. https://www.ll.mit.edu/mission/cybersec/...files/.../040531_CampbellW_SingerE.pdf
[14] S.Zribi Boujelbene, D.Ben Ayed Mezghani, and N. Ellouze, “Improved Feature data for Robust Speaker Identification using hybrid Gaussian Mixture Models - Sequential Minimal Optimization System”, The International Review on Computers and Software, Vol. 4.3, ISSN: 1828-6003, May 2009, pp.344-350.
[15] W. Zunjingand, and C. Zhigang, “Improved MFCC-Based Feature for Robust Speaker Identification”, Tsinghua Science and Technology, Vol. 10.2, 2005, pp. 158-161.
http://whale.cse.nsysu.edu.tw/~khwu/Improved%20MFCC-Based%20Feature%20for%20Robust%20Speaker%20Identification.pdf
[16] Bing Xiang, Toby Berger, “Efficient Text-Independent Speaker Verification With Structural Gaussian Mixture Models And Neural Network”, IEEE Transactions On Speech And Audio Processing, Vol. 11, No. 5, pp. 447-456, September 2003.http://www.dl.edi-info.ir/Efficient%20Text-Independent%20Speaker%20Verification%20with%20Structural%20Gaussian%20Mixture%20Models.pdf
[17] D.A.Reynolds, T.F.Quatieri, R.B.Dunn, “Speaker verification using adapted Gaussian mixture models”, Digital Signal Process. 10, 19–41, 2000. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.556.4188&rep=rep1&type=pdf
[18] Fuqian Shi, Jiang Xu, “Emotional Cellular-Based Multi- Class Fuzzy Support Vector Machines on Product’s KANSEI Extraction”, International Journal of Applied Mathematics & Information Sciences, Vol.6, No. 1, pp. 41-49, 2012. https://pdfs.semanticscholar.org/a184/1cd6b389bb36340524a63481a4ddab564c84.pdf
[19] C.F.Lin and S.D.Wang, “Fuzzy support vector machines,” IEEE Transactions on Neural Networks, vol. 13, no. 2, March 2002.
[20] Chun-fu Lin, Sheng-de Wang, “Training algorithms for fuzzy support vector machines with noisy data”, Pattern Recognition Letters, 25, pp. 1647–1656, 2004. www.cs.pdx.edu/~mm/aml2010/JoshHoakPresentation.pdf
[21] C.J.C.Burges and B.Schölkopf, “Improving the Accuracy and Speed of Support Vector Learning Machines”, Advances in Neural Information Processing Systems 9, Cambridge, MIT Press, 1997, pp. 375–381. https://people.eecs.berkeley.edu/~malik/cs294/decoste-scholkopf.pdf
[22] W.M.Campbell, J.P.Campbell, D.A.Reynolds, D.A.Jones and T.R.Leek, “High Level Speaker Verification with Support Vector Machines”, in Proceedings of the International Conference on Acoustics Speech and Signal Processing,2004, pp.73-76.
[23] S.Furui, “Cepstral analysis technique for automatic speaker verification”, IEEE Trans. Acoust. Speech Signal Processing, vol.ASSP-29, pp.254-272, Apr.1981.
https://www.researchgate.net/publication/3176892_Cepstral_analysis_technique_for_automatic_speaker_verification
[24] H.Hermansky and N.Morgan, “RASTA processing of speech”, IEEE Trans. Speech and Audio Processing, vol.2, no.4, pp.578-589, 1994. ieeexplore.ieee.org/document/326616/
[25] S.Chen and R.A.Gopinath, “Gaussianization”, Proc. NIPS 2000, Denver Colorado.

Additional Files

Published

2017-02-25

How to Cite

Shanmugapriya, P., & Venkataramani, Y. (2017). Analysis of Speaker Verification System Using Support Vector Machine. JOURNAL OF ADVANCES IN CHEMISTRY, 13(10), 6531–6542. https://doi.org/10.24297/jac.v13i10.5839

Issue

Section

Articles