TTS System for Punjabi using Festival Framework

Sukhpreet Kaur Gill

Abstract


Text to speech Generation is the process of converting the raw text into the speech output. This paper represents the development of Punjabi voice using Festvox tool and the Festival is used as engine to run that developed voice. The statistical parametric method of waveform generation is used. The phoneme is chosen as basic unit for speech synthesis. The corpus for Punjabi language is collected from newspaper and news channels. The recording is done in noise free environment

Keywords


Festvox, Statistical Parametric Method, Festival.

Full Text:

PDF

References


A.W. Black and K. Lenzo, “Building voices in the Festival speech synthesis system”, for FestVox 2.7 Edition, Cambridge, pp. 23-93, 2014.

A.W. Black, “CLUSTERGEN: A Statistical Parametric Synthesizer using Trajectory Modeling”, in Proceedings of Interspeech, Pittsburgh, A, USA, pp. 1762-1765, 2006.

A.W. Black and K. Lenzo,” Multilingual text – to-speech synthesis”, in Proceedings of the ICASSP, Montreal, Canada, 2004.

B. Kumar and B. Chettri, “Currents Trends, Frameworks and Techniques used in Speech

synthesis-A Survey”, International journals of soft computing and Engineering, Vol.2, pp. 2231-2307, 2012.

B.K. Rajan,V. Rijoy, D.P. Gopinath and N. George, “Duration Modeling for Text to Speech Synthesis System using Festival Speech Engine Developed for Malayalam Language”, in Proceedings of International Conference Circuit, Power and Computing Technologies, pp. 1-5, 2015.

V. Goyal and G.S. Lehal, “Evaluation of Hindi

to Punjabi Machine Translation System”, International Journal of Computer Science Issues, Vol.4, No. 1, pp. 36-39, 2009.

S. Luthra and P. Singh, “Punjabi Speech Generation System Based on Phonemes”, International Journal of Computer Applications, Vol. 49, No. 13, pp. 40-44, 2012.

N.P. Narendra, S.K. Rao, K. Ghosh , R.R. Vempada and S. Maity, “Development of syllable-based text to speech synthesis system in Bengali”, International Journal of Speech Technology, Vol.14, pp. 167-181, 2011.

K. Prahallad, N. Kumar, S. Rajendran and V. Keri, “The-IIT-H Indic Speech Databases”, in Proceedings of Interspeech, Portland, Oregon, USA, 2012.

K. Richmond and S. King,”Multisyn: Open-domain unit selection for the Festival speech synthesis system”, Science Direct, Vol. 49, No. 4, pp.317-330, 2007.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.