Performance Analysis of SOFM based Reduced Complexity Feature Extraction Methods with back Propagation Neural Network for Multilingual Digit Recognition

John Sahaya Rani Alex lowast; Ajinkya Sunil Mukhedkar and Nithya Venkatesan

doi:10.17485/ijst/2015/v8i19/76217

Article

Performance Analysis of SOFM based Reduced Complexity Feature Extraction Methods with back Propagation Neural Network for Multilingual Digit Recognition

VIEWS 703
PDF 200

Abstract
Full-Text HTML
Full-Text PDF
How to Cite

Indian Journal of Science and Technology

DOI: 10.17485/ijst/2015/v8i19/76217

Year: 2015, Volume: 8, Issue: 19, Pages: 1-8

Original Article

Performance Analysis of SOFM based Reduced Complexity Feature Extraction Methods with back Propagation Neural Network for Multilingual Digit Recognition

John Sahaya Rani Alex^∗ , Ajinkya Sunil Mukhedkar and Nithya Venkatesan

School of Electronics Engineering Department, VIT University, Chennai - 600 127, Tamil Nadu, India.
[email protected], [email protected], [email protected]

This work is licensed under a Creative Commons Attribution 4.0 International License.

Abstract

Background: Speech recognition is an active area of research, used to transliterate words vocalized by individuals in order to make them machine recognizable. For real time speech recognition applications the response time, size of training data and recognition accuracy are the important aspects. Methods: A Hybrid speech recognition system is proposed on the basis on Artificial Neural Network (ANN) in this research. The Self Organising Feature Map (SOFM) is used to reduce the feature vector dimensions which are extracted using the Mel-Frequency Cepstrum Coefficients (MFCC), Perceptual Linear Predictive (PLP) and Discrete Wavelet Transform (DWT) methods. The Back Propagation Network (BPN) algorithm is used for training the Artificial Neural Network for pattern classification. Findings: The proposed method is tested with TIDIGITS data. Results indicate that despite ofthe large reduction in the feature vector dimensions the recognition accuracy obtained using SOFM technique is same as that of the recognition accuracy of the conventional methods. The response time is also fast and the data size of the input data is reduced considerably. The proposed hybrid system is further tested using multilingual isolated digit data.
Keywords: Artificial Neural Network, Discrete Wavelet Transform, Feature Extraction, Mel Frequency Cepstrum Coefficients, Perceptual Linear Predictive, Self-organising Feature Map, Speech Recognition