πŸ“ž +91-7667918914 | βœ‰οΈ ijarcce@gmail.com
International Journal of Advanced Research in Computer and Communication Engineering
International Journal of Advanced Research in Computer and Communication Engineering A monthly Peer-reviewed & Refereed journal
ISSN Online 2278-1021ISSN Print 2319-5940Since 2012
IJARCCE adheres to the suggestive parameters outlined by the University Grants Commission (UGC) for peer-reviewed journals, upholding high standards of research quality, ethical publishing, and academic excellence.
← Back to VOLUME 3, ISSUE 6, JUNE 2014

Comparative Analysis of Speech Features for Speech Emotion Recognition

PRASHANT AHER, ALICE CHEERAN Electrical Engineering Department, Veermata Jijabai Technological Institute (VJTI), Mumbai, India

πŸ‘ 43 viewsπŸ“₯ 0 downloads
Share: 𝕏 f in ✈ βœ‰
Abstract: In this paper, comparative analysis of speech cepstral features is performed to recognise emotion. We identify two effective feature namely, Mel Frequency Cepstral Coefficient (MFCC) and Cochlear Filterbank cepstral coefficients extracted from speech signal. MFCC as a baseline approach is compared to the feature extracted from cochlear filterbank with zero crossing at the output of each channel. Extracted features are fed to Support Vector Machine (SVM) classifier. As shown in our results, cochlear feature provide highest recognition accuracy provided using linear kernel. It gives 89.67% classification accuracy for Berlin Emotional Speech Database. A study on noise robustness of above mentioned feature was also carried out. MFCC and cochlear feature have recognition accuracy of 81.9% and 86% respectively in clean testing conditions with RBF kernel function but when mismatch between training and testing set increases as in real time situations, recognition accuracy of MFCC feature is 11% while cochlear feature gives accuracy of 25%, which shows that cochlear feature is more robust to noise.

Keywords: Cochlea, cochlear filterbank, mel filterbank, mel scale, noise robustness, linear polynomial and RBF kernel, speech emotion recognition, Zero Crossing Peak Amplitude, SVM.

How to Cite:

[1] PRASHANT AHER, ALICE CHEERAN Electrical Engineering Department, Veermata Jijabai Technological Institute (VJTI), Mumbai, India, β€œComparative Analysis of Speech Features for Speech Emotion Recognition,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE)

Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 International License.