Experiments with Signal-Driven Symbolic Prosody for Statistical Parametric Speech Synthesis (Conference proceedings contribution)

Type
Label
  • Experiments with Signal-Driven Symbolic Prosody for Statistical Parametric Speech Synthesis (Conference proceedings contribution) (literal)
Year
  • 2013-01-01T00:00:00+01:00 (literal)
Alternative label
  • Tesser F., Sommavilla G., Paci G., Cosi P. (2013)
    Experiments with Signal-Driven Symbolic Prosody for Statistical Parametric Speech Synthesis
    in 8th ISCA Speech Synthesis Workshop, Barcelona, Spain, September 2nd, 2013
    (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autori
  • Tesser F., Sommavilla G., Paci G., Cosi P. (literal)
Start page
  • 183 (literal)
End page
  • 187 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#altreInformazioni
  • Online proceedings of the 8th ISCA Speech Synthesis Workshop, Barcelona, Spain, August 31st - September 2nd, 2013 (url: http://ssw8.talp.cat/papers/ssw8_PS2-7_Tesser.pdf), pp. 183-187 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#url
  • http://ssw8.talp.cat/papers/ssw8_PS2-7_Tesser.pdf (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#titoloVolume
  • 8th ISCA Speech Synthesis Workshop (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#numeroVolume
  • 8th (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#volumeInCollana
  • 8th (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#pagineTotali
  • 5 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#affiliazioni
  • ISTC CNR, UOS Padova (literal)
Title
  • Experiments with Signal-Driven Symbolic Prosody for Statistical Parametric Speech Synthesis (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#curatoriVolume
  • Antonio Bonafonte (literal)
Abstract
  • This paper presents a preliminary study on the use of symbolic prosody extracted from the speech signal to improve parameter prediction in HMM-based speech synthesis. The relationship between the prosodic labelling and the actual prosody of the training data is usually ignored in the building phase of corpus-based TTS voices. In this work, different systems have been trained using prosodic labels predicted from speech and compared with the conventional system that predicts those labels solely from text. Experiments were carried out using data from two speakers (one male and one female). Objective evaluation performed on a test set of the corpora shows that the proposed systems improve the prediction accuracy of phoneme durations and F0 trajectories. The advantages of using signal-driven symbolic prosody in place of the conventional text-driven symbolic prosody, and future work on the effective use of this information in the synthesis stage of a Text-To-Speech system, are also described. (literal)
