There are two basic groups of methods for text-to-speech synthesis: concatenative and parametric. The expansion of speech technology applications has made parametric methods more attractive, owing to the flexibility they offer in changing the speaker and the speech style. This paper describes the development of the first speech synthesizer for the Serbian language based on deep neural networks, built with the open-source Merlin toolkit. The method gave excellent results, as reflected in a subjective score of more than 4.5 out of 5.