P-ISSN 0974-6846, E-ISSN 0974-5645

Indian Journal of Science and Technology

Year: 2023, Volume: 16, Issue: Special Issue 2, Pages: 6-14

Original Article

Feature Extraction of Assamese Speech Based on Emotion Analysis

Received Date: 23 March 2023, Accepted Date: 26 June 2023, Published Date: 20 October 2023

Abstract

Objectives: This work investigates the recognition of emotion from Assamese speech.

Methods: The method uses Mel-frequency cepstral coefficients (MFCC) for feature extraction and a Gaussian Mixture Model (GMM) classifier to recognize emotion in Assamese speech.

Findings: The experiments covered four emotions: Angry, Happy, Neutral, and Sad. The database for the speech emotion recognition system consists of emotional speech samples collected manually from 20 speakers, together with some standard samples available on the internet. The speakers are aged 18-26 years, come from different districts of Assam, and use different dialects of the Assamese language, such as Western (Kamrupi), Central, and Eastern. The field recordings were made at Dibrugarh University and outside the campus. After GMM training and testing, the accuracy obtained was 51.25%. The experiments confirmed that angry and happy emotions have high energy in the higher frequency region, whereas neutral and sad emotions have low energy in that region.

Novelty: This work can help predict the attitudes and actions of different speakers from their manner of speaking, and it can support other aspects of everyday human-machine interaction. The Assamese emotional speech database used in the work was self-collected from different dialect groups in order to capture the variability of emotions from a dialectal perspective.

Keywords: Assamese, GMM, emotion, speech, MFCC
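
The classification step named in the Methods can be illustrated with a minimal sketch: one GMM is trained per emotion, and an utterance is assigned to the emotion whose model gives its frames the highest average log-likelihood. This is not the authors' implementation. The diagonal covariances, the two components per model, the four-dimensional synthetic frames standing in for MFCC vectors, and every function name below are assumptions made for illustration only.

```python
import math
import random


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at vector x."""
    return sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))


def fit_gmm(frames, k=2, iters=25, seed=0, floor=1e-3):
    """Fit a k-component diagonal GMM to feature frames with plain EM."""
    rng = random.Random(seed)
    dim = len(frames[0])
    means = [list(f) for f in rng.sample(frames, k)]  # init from data points
    varis = [[1.0] * dim for _ in range(k)]
    weights = [1.0 / k] * k
    for _ in range(iters):
        # E-step: responsibility of each component for each frame.
        resp = []
        for x in frames:
            logs = [math.log(weights[j]) + log_gauss_diag(x, means[j], varis[j])
                    for j in range(k)]
            norm = logsumexp(logs)
            resp.append([math.exp(l - norm) for l in logs])
        # M-step: re-estimate weights, means, and floored variances.
        for j in range(k):
            nj = sum(r[j] for r in resp) + 1e-10
            weights[j] = nj / len(frames)
            means[j] = [sum(r[j] * x[d] for r, x in zip(resp, frames)) / nj
                        for d in range(dim)]
            varis[j] = [max(floor,
                            sum(r[j] * (x[d] - means[j][d]) ** 2
                                for r, x in zip(resp, frames)) / nj)
                        for d in range(dim)]
    return weights, means, varis


def avg_loglik(frames, gmm):
    """Average per-frame log-likelihood of the frames under one GMM."""
    weights, means, varis = gmm
    total = 0.0
    for x in frames:
        total += logsumexp([math.log(w) + log_gauss_diag(x, m, v)
                            for w, m, v in zip(weights, means, varis)])
    return total / len(frames)


def classify(frames, models):
    """Return the emotion label whose GMM scores the frames highest."""
    return max(models, key=lambda label: avg_loglik(frames, models[label]))


def synthetic_frames(center, n, rng, dim=4):
    """Hypothetical stand-in for MFCC frames: Gaussian noise around a center."""
    return [[rng.gauss(center, 1.0) for _ in range(dim)] for _ in range(n)]


rng = random.Random(42)
# Assumed, well-separated class centers; real MFCC data overlaps far more.
centers = {"angry": 3.0, "happy": 1.0, "neutral": -1.0, "sad": -3.0}
models = {label: fit_gmm(synthetic_frames(c, 120, rng))
          for label, c in centers.items()}
test_utterance = synthetic_frames(3.0, 40, rng)
predicted = classify(test_utterance, models)
print(predicted)
```

In practice the synthetic frames would be replaced by MFCC vectors extracted from each recording. The synthetic classes here are deliberately easy to separate, so the sketch says nothing about the 51.25% accuracy reported above.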


Copyright

© 2023 Gogoi et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Published By Indian Society for Education and Environment (iSee)
