Segmentation of Continuous Tamil Speech into Syllable like Units

V  Anantha Natarajan  and S  Jothilakshmi

doi:10.17485/ijst/2015/v8i17/61362

Article

Segmentation of Continuous Tamil Speech into Syllable like Units

VIEWS 1044
PDF 304

Abstract
Full-Text HTML
Full-Text PDF
How to Cite

Indian Journal of Science and Technology

DOI: 10.17485/ijst/2015/v8i17/61362

Year: 2015, Volume: 8, Issue: 17, Pages: 1-5

Original Article

Segmentation of Continuous Tamil Speech into Syllable like Units

V. Anantha Natarajan^*and S. Jothilakshmi

Department of Computer Science and Engineering, Annamalai University, Annamalai Nagar - 608 002, Tamil Nadu, India; [email protected]

This work is licensed under a Creative Commons Attribution 4.0 International License.

Abstract

The present growth in the field of information and communication technologies has diverted the focus of many researchers towards the speech technologies. Speech technology comprises of many subfields like speech synthesis, speech recognition, speaker recognition, speech compression, speaker verification and Multimodal interaction. The basic units of the speech synthesis andspeechrecognitionsystemare syllable,phoneme andword.This studymainly focusesonsyllable segmentation or syllabification with the aim to further develop a speech synthesis tool in Tamil language for Human Computer Interaction [HCI]. The syllable boundaries are identified using the formantfrequency, F1. The proposed syllable segmentation algorithm is applied and tested on a set of recorded continuous speech corpus. Initially, the continuous speech signal is divided into segments by removing the silence regions. The silence removal method used in this work depends on features such as signal energy and spectral centroid. After removing silence portion from the speech signals, the speech segments are further processed using Linear Predictive Coding (LPC) to extract the formant frequencies. Then the peaks in the formant frequencies are used as clue to mark the syllable boundaries in the speech. The proposed algorithm is producing an average accuracy of 89% in identifying syllable boundaries when it is compared with the hand labeled syllable boundaries.
Keywords: Linear Predictive Coding, Speech Recognition, Speech Synthesis, Syllabification and Formants