Speech synthesis in Serbian based on artificial neural networks

Tijana Delic; Milan Secujski

doi:10.1109/TELFOR.2016.7818807

Speech synthesis in Serbian based on artificial neural networks

Source

2016 24th Telecommunications Forum (TELFOR) > 1 - 4

Abstract

There are two basic groups of methods for text to speech synthesis — concatenative and parametric. Expansion of speech technology applications, made parametric methods more attractive owing to their flexibility of changing speaker and speech style. The paper describes the development of the first speech synthisizer based on deep neural networks in Serbian language by using open source Merlin toolkit. The method gave extraordinary results, as proven by subjective mark of more than 4,5 out of 5.