Indian Journal of Science and Technology
Year: 2015, Volume: 8, Issue: 19, Pages: 1-8
John Sahaya Rani Alex∗ , Ajinkya Sunil Mukhedkar and Nithya Venkatesan
Background: Speech recognition is an active area of research, used to transliterate words vocalized by individuals in order to make them machine recognizable. For real time speech recognition applications the response time, size of training data and recognition accuracy are the important aspects. Methods: A Hybrid speech recognition system is proposed on the basis on Artificial Neural Network (ANN) in this research. The Self Organising Feature Map (SOFM) is used to reduce the feature vector dimensions which are extracted using the Mel-Frequency Cepstrum Coefficients (MFCC), Perceptual Linear Predictive (PLP) and Discrete Wavelet Transform (DWT) methods. The Back Propagation Network (BPN) algorithm is used for training the Artificial Neural Network for pattern classification. Findings: The proposed method is tested with TIDIGITS data. Results indicate that despite ofthe large reduction in the feature vector dimensions the recognition accuracy obtained using SOFM technique is same as that of the recognition accuracy of the conventional methods. The response time is also fast and the data size of the input data is reduced considerably. The proposed hybrid system is further tested using multilingual isolated digit data.
Keywords: Artificial Neural Network, Discrete Wavelet Transform, Feature Extraction, Mel Frequency Cepstrum Coefficients, Perceptual Linear Predictive, Self-organising Feature Map, Speech Recognition
Subscribe now for latest articles and news.