Feature Extraction of Assamese Speech Based One Motion Analysis

Parismita Gogoi; Debashree Sharma; Rosy Bordoloi; Snigdha Sarma; Ananya Goswami

doi:10.17485/IJST/v16iSP2.3252

Article

Feature Extraction of Assamese Speech Based One Motion Analysis

VIEWS 420
PDF 100

Indian Journal of Science and Technology

DOI: 10.17485/IJST/v16iSP2.3252

Year: 2023, Volume: 16, Issue: Special Issue 2, Pages: 6-14

Original Article

Feature Extraction of Assamese Speech Based One Motion Analysis

Parismita Gogoi^1*, Debashree Sharma¹, Rosy Bordoloi¹, Snigdha Sarma¹, Ananya Goswami¹

¹Department of Electronics and Communication Engineering, DUIET, Dibrugarh University, Assam, India

*Corresponding Author
Email: [email protected]

Received Date:23 March 2023, Accepted Date:26 June 2023, Published Date:20 October 2023

This work is licensed under a Creative Commons Attribution 4.0 International License.

Abstract

Objectives: The present work aims to investigate the recognition of emotion from Assamese speech. Methods: This work presents a method based on the Gaussian Mixture Model (GMM) classifier and Mel-frequency cepstral coefficients (MFCC) as feature extraction technique for emotion recognition from Assamese speeches. Findings: We have conducted experiments considering different emotions: Angry, Happy, Neutral and Sad. The speech emotion recognition system database is the emotional speech samples collected manually from 20 speakers and some standard samples available on the internet. The speakers are from different districts of Assam and use different dialects of the Assamese language, such as Western (Kamrupi), Central, and Eastern. They fall under the age group of 18-26 years. The field survey consists of recordings done at Dibrugarh University and outside the campus. After the GMM training and testing process, the accuracy we obtained is 51.25%. The experiments confirmed that angry and happy emotions have high energy in the higher frequency region. In contrast, neutral and sad emotions have low energy in the higher frequency region. Novelty: This work will help predict the attitudes and actions of different speakers based on their manner of speaking. In addition, the present work will also help in other aspects of human-machine interaction in our daily life. The Assamese emotional speech database used in the work is self-collected from different dialect groups to understand the variability of emotions in dialectal perspective.

Keywords: Assamese, GMM, emotion, speech, MFCC

References

Sekkate S, Khalil M, Adib A. A statistical feature extraction for deep speech emotion recognition in a bilingual scenario. Multimedia Tools and Applications. 2023;82:11443–11460. Available from: https://doi.org/10.1007/s11042-022-14051-z
Monisha STA, Sultana S. A Review of the Advancement in Speech Emotion Recognition for Indo-Aryan and Dravidian Languages. Advances in Human-Computer Interaction. 2022;2022:1–11. Available from: https://doi.org/10.1155/2022/9602429
Wani TM, Gunawan TS, Qadri SAA, Kartiwi M, Ambikairajah E. A Comprehensive Review of Speech Emotion Recognition Systems. IEEE Access. 2021;9:47795–47814. Available from: https://doi.org/10.1109/ACCESS.2021.3068045
Horii D, Ito A, Nose T. Analysis of Feature Extraction by Convolutional Neural Network for Speech Emotion Recognition. In: 2021 IEEE 10th Global Conference on Consumer Electronics (GCCE). Kyoto, Japan, 12-15 October 2021. IEEE. .
Kolita S, Acharjee PB. Analysis on Syllable-Based Intonational Features of Assamese Speech Signals. In: Mathematical and Computational Intelligence to Socio-scientific Analytics and Applications , Lecture Notes in Networks and Systems. (Vol. 518, pp. 231-242) Springer Nature Singapore. 2022.
Singh J, Saheer LB, Faust O. Speech Emotion Recognition Using Attention Model. International Journal of Environmental Research and Public Health. 2023;20(6):1–21. Available from: https://doi.org/10.3390/ijerph20065140
Ayadi ME, MSK, FK. Survey on speech emotion recog-nition: Features, classification schemes, and databases. Pattern recognition. 2011;44(3):572–587. Available from: https://doi.org/10.1016/j.patcog.2010.09.020
Sudhakar RS, Anil MC. Analysis of speech features foremotion detection: a review. In: 2015 International Conference on Computing Communication Control and Automation. Pune, India, 26-27 February 2015. IEEE. .
Mansour A, Lachiri Z. SVM based Emotional Speaker Recognition using MFCC-SDC Features. International Journal of Advanced Computer Science and Applications. 2017;8(4):538–544. Available from: https://pdfs.semanticscholar.org/fd5d/b3ca0d157866259af2c9cfed8a77a6e9bb88.pdf
Kandali AB, AR, Basu TK. Emotion recognition fromAssamese speeches using MFCC features and GMM classifier. In: TENCON 2008 - 2008 IEEE Region 10 Conference. Hyderabad, India, 19-21 November 2008. IEEE. .
Ververidis D, Kotropoulos C. A review of emotional speech databases. In: Proceedings of the Panhellenic Conference on Informatics (PCI). p. 560–574.
Hu H, Xu MX, Wu W. GMM Supervector Based SVM with Spectral Features for Speech Emotion Recognition. In: 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07. Honolulu, HI, USA, 15-20 April 2007. IEEE. .
Kaushik R, Sharma M, Sarma KK, Kaplun DI. I-vector based emotion recognition in Assamese speech. International Journal of Engineering and Future Technology. 2016;1(1):111–124. Available from: http://www.ceser.in/ceserp/index.php/IJEFT/article/view/4423

Copyright

© 2023 Gogoi et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Published By Indian Society for Education and Environment (iSee)