AUTOMATIC PROSODY GENERATION IN A TEXT-TO-SPEECH SYSTEM FOR HEBREW

Branislav Popović, Dragan Knežević, Milan Sečujski, Darko Pekar

DOI Number
-
First page
467
Last page
477

Abstract


The paper presents the module for automatic prosody generation within a system for automatic synthesis of high-quality speech based on arbitrary text in Hebrew. The high quality of synthesis is due to the high accuracy of automatic prosody generation, enabling the introduction of elements of natural sentence prosody of Hebrew. Automatic morphological annotation of text is based on the application of an expert algorithm relying on transformational rules. Syntactic-prosodic parsing is also rule based, while the generation of the acoustic representation of prosodic features is based on classification and regression trees. A tree structure generated during the training phase enables accurate prediction of the acoustic representatives of prosody, namely, durations of phonetic segments as well as temporal evolution of fundamental frequency and energy. Such an approach to automatic prosody generation has lead to an improvement in the quality of synthesized speech, as confirmed by listening tests.

Full Text:

PDF

References


J.P.H. van Santen, "Contextual Effects on Vowel Duration", Speech Commun., 1992, vol. 11, no. 6, pp. 513-546.

M. Sečujski, N. Jakovljević and D. Pekar, "Automatic Prosody Generation for Serbo-Croatian Speech Synthesis Based on Regression Trees", In Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011, Florence, Italy, pp. 3157-3160.

Ö. Öztürk and T. Çiloğlu, "Segmental Duration Modelling in Turkish", In Proceedings of the 9th International Conference on Text, Speech and Dialogue, Brno, Czech Republic, Lect. Notes Comput. Sc., Springer, 2006, vol. 4188, pp. 669-676.

A. Lazaridis, P. Zervas, N. Fakotakis and G. Kokkinakis, "A CART Approach for Duration Modeling of Greek Phonemes", In Proceedings of the 12th International Conference on Speech and Computer, 2007, Moscow, Russia, pp. 287-292.

N. Chomsky, Morphophonemics in Modern Hebrew. Routledge, 2012.

J. Fellman, "Concerning the "Revival" of the Hebrew Language", Anthropol. Linguist., May 1973, vol. 15, no. 5, pp. 250-257.

B. Popović, M. Sečujski, V. Delić, M. Janev and I. Stanković, "Automatic Morphological Annotation in a Text-to-Speech System for Hebrew", in Proceedings of the 15th International Conference on Speech and Computer, Pilsen, Czech Republic, Lect. Notes Comput. Sc., Springer, 2013, vol. 8113, pp. 319-326.

L. Breiman, J.H. Friedman, C.J. Stone and R.A. Olsen, Classification and Regression Trees. Chapman & Hall/CRC, Boca Raton, London, New York, Washington D.C., 1984.

A. Black and N. Campbell, "Optimising Selection of Units from Speech Databases for Concatenative Synthesis", In Proceedings of the 4th European Conference on Speech Communication and Technology, 1995, Madrid, Spain, pp. 581-584.

V. Delić, M. Sečujski, N. Jakovljević, M. Janev, R. Obradović and D. Pekar, "Speech Technologies for Serbian and Kindred South Slavic Languages", Adv. Speech Recognition, Chapter 9, 2010.


Refbacks

  • There are currently no refbacks.


ISSN: 0353-3670 (Print)

ISSN: 2217-5997 (Online)

COBISS.SR-ID 12826626