Investigations of the Approximate Percentage of Noise Required to Perceive Hindi Phonemes using HNM

Padmini Rajput  and Parveen Lehana

doi:10.17485/ijst/2019/v12i15/116614

Article

Investigations of the Approximate Percentage of Noise Required to Perceive Hindi Phonemes using HNM

VIEWS 722
PDF 297

Abstract
Full-Text HTML
Full-Text PDF
How to Cite

Indian Journal of Science and Technology

DOI: 10.17485/ijst/2019/v12i15/116614

Year: 2019, Volume: 12, Issue: 15, Pages: 1-16

Original Article

Investigations of the Approximate Percentage of Noise Required to Perceive Hindi Phonemes using HNM

Padmini Rajput^* and Parveen Lehana

Department of Electronics, University of Jammu, Jammu – 180006, Jammu and Kashmir, India;
[email protected], [email protected]

*Author for correspondence
Padmini Rajput
Department of Electronics, University of Jammu, Jammu – 180006, Jammu and Kashmir, India. Email: [email protected]

This work is licensed under a Creative Commons Attribution 4.0 International License.

Abstract

Objectives: Harmonic plus Noise Model (HNM) analysis model has been found to be one of the best methods of speech production in terms of important characteristics like naturalness, intelligibility, and pleasantness which are of pre-requisite in any speech synthesiser. Present study explores the approximate percentage of noise required to perceive some phonemes of Hindi language. Method / Analysis: HNM assumes speech as a combination of both periodic and aperiodic signals, so the effect of each part may be individually measured on the quality and intelligibility of different phonemes using HNM. HNM has been employed as the analysis-synthesis platform and the quality of the synthesized speech is tested with the ITU-T standard PESQ measure (perceptual evaluation of speech quality and MOS (mean opinion score). Findings: Objective results suggest that the percentage of the noise serves as a significant constituent in the quality of synthesized speech. Novelty: Investigations suggest that each individual phoneme requires different noise and voice percentage for clear perception. Further, the optimum percentage of the noise part for good speech quality has been found speaker and phoneme dependent.

Keywords: Analysis-synthesis Models, Speech Processing, HNM, PESQ