• P-ISSN 0974-6846 E-ISSN 0974-5645

Indian Journal of Science and Technology

Article

Indian Journal of Science and Technology

Year: 2023, Volume: 16, Issue: 5, Pages: 309-317

Original Article

Comparative Analysis of Kannada Formant Synthesized Utterances and their Quality

Received Date:27 October 2022, Accepted Date:16 December 2022, Published Date:04 February 2023

Abstract

Objectives: The goal of this work is to synthesize Kannada utterances using a modified Klatt type formant synthesizer to evaluate its performance by comparing against eSpeak synthesizer in terms of intelligibility and quality of the utterances generated. Methods: Kannada utterances viz., vowels, diphthongs, Consonant-Vowel (CV) coarticulations and simple words are generated using a modified Klatt type formant synthesizer and eSpeak. The vowels and diphthongs generated by both the synthesizers are compared with natural recorded utterances using F1-F2 formants and the CV co-articulations are compared using spectrograms. The synthesized word utterances are compared with natural recorded utterances using Log Spectral Distance to find out which synthesizer outputs the frequency spectrum that is closest to the frequency spectrum of the natural utterances. Also, the synthesized word utterances are evaluated for their intelligibility and quality using Mean Opinion Score (MOS) obtained from 10 native Kannada language speakers. Findings: The word utterances synthesized by the modified Klatt type formant synthesizer scored a MOS of 86% and 4.46 out of 5 for the parameters of intelligibility and quality whereas for the same two parameters eSpeak scored 70% and 4.14 out of 5 respectively. Novelty: Klatt type formant synthesizer that uses pitch synchronous parameter update method synthesizes good quality Kannada sound utterances and storing the control parameters of the synthesizer using polynomials reduces the database footprint.

Keywords: Kannada Formant Synthesizer; Klatt type Synthesizer; eSpeak; Kannada TTS; Formant synthesis quality

References

  1. Trivedi A, Pant N, Shah P, Sonik S, Agrawal S. Speech to text and text to speech recognition systems-A review. IOSR Journal of Computer Engineering. 2018;20(2):36–43. Available from: https://www.iosrjournals.org/iosr-jce/papers/Vol20-issue2/Version-1/E2002013643.pdf
  2. Dutonde SK, Mapari GS, Wagh SJ, Kapse A. Review on Text to Speech Synthesizer. International Journal of Advance Research and Innovative Ideas in Education. 2022;8(3):592–596. Available from: https://ijariie.com/AdminUploadPdf/Review_on_Text_to_Speech_Synthesizer_ijariie16614.pdf
  3. Tan X, Qin T, Soong F, Liu TY. A survey on neural speech synthesis. 2021. Available from: https://arxiv.org/pdf/2106.15561.pdf
  4. Kuligowska K, Kisielewicz P, Włodarz A. Speech synthesis systems: disadvantages and limitations. International Journal of Engineering & Technology. 2018;7(2.28):234. Available from: https://doi.org/10.14419/ijet.v7i2.28.12933
  5. Panda SP, Nayak AK, Rai SC. A survey on speech synthesis techniques in Indian languages. Multimedia Systems. 2020;26(4):453–478. Available from: https://doi.org/10.1007/s00530-020-00659-4
  6. Lukose S, Upadhya SS. Text to speech synthesizer-formant synthesis. 2017 International Conference on Nascent Technologies in Engineering (ICNTE). 2017;p. 1–4. Available from: https://doi.org/10.1109/ICNTE.2017.7947945
  7. D’souza AV, Ravi DJ. An Approach for Formant Synthesis of Kannada. Journal of Signal Processing. 2022;8(2):31–38. Available from: https://doi.org/10.46610/JOSP.2022.v08i02.006

Copyright

© 2023 D’Souza & Ravi. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Published By Indian Society for Education and Environment (iSee)

DON'T MISS OUT!

Subscribe now for latest articles and news.