Statistical Spectral Envelope Transformation applied to Emotional Speech (Contributo in atti di convegno)

Type
Label
  • Statistical Spectral Envelope Transformation applied to Emotional Speech (Contributo in atti di convegno) (literal)
Anno
  • 2010-01-01T00:00:00+01:00 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#doi
  • 10.1002/9781119991298 (literal)
Alternative label
  • Fabio Tesser, Enrico Zovato, Piero Cosi Institute of Cognitive Sciences and Technologies, Italian National Research Council Padova, Italy (2010)
    Statistical Spectral Envelope Transformation applied to Emotional Speech
    in 13th Int. Conference on Digital Audio Effects (DAFx-10), Graz, Austria, September 6-10, 2010
    (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autori
  • Fabio Tesser, Enrico Zovato, Piero Cosi Institute of Cognitive Sciences and Technologies, Italian National Research Council Padova, Italy (literal)
Pagina inizio
  • 479 (literal)
Pagina fine
  • 482 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#altreInformazioni
  • Udo Zölzer Published Online: 10 MAR 2011 DOI: 10.1002/9781119991298.fmatter Copyright © 2011 John Wiley & Sons, Ltd Book Title Helmut Schmidt University - University of the Federal Armed Forces, Hamburg, Germany Published Online: 10 MAR 2011 Published Print: 11 MAR 2011 Print ISBN: 9780470665992 Online ISBN: 9781119991298 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#citta
  • Graz, Austria (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#url
  • http://dafx10.iem.at/proceedings/papers/TesserZovatoCosi_DAFx10_P81.pdf (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#titoloVolume
  • Proceedings of DAFx-10 13th International Conference on Digital Audio Effects (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autoriCuratela
  • Tesser F, Zovato E, Cosi P (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#pagineTotali
  • 4 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#descrizioneSinteticaDelProdotto
  • Transformation of sound by statistical techniques is a promising method for a new range of digital audio effects. In this paper a data driven voice transformation algorithm is used to alter the timbre of a neutral (non-emotional) voice in order to reproduce a particular emotional vocal timbre. Perceptually based Mel-Cepstral analysis and Mel Log Spectral Approximation digital filter are used to represent the speech timbre and to synthesize speech with modified spectral envelope. The transformation function adopts a GMM (Gaussian Mixture Model) based parametrization in order convert the spectral envelopes. Experiments with the first and second order derivatives of the mel-cepstral coefficients have been undertaken to prove the benefit of including dynamic information in the model. The proposed algorithm has been evaluated by means of objective measures in the neutral-to-happy and neutral-to-sad tasks. (literal)
Note
  • R (literal)
  • Wiley OnLine L (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#affiliazioni
  • Fabio Tesser, Piero Cosi Institute of Cognitive Sciences and Technologies, Italian National Research Council Padova, Italy ISTC-CNR, UOS Padova, Padova, Italy Enrico Zovato Loquendo S.p.A. Torino, Italy (literal)
Titolo
  • Statistical Spectral Envelope Transformation applied to Emotional Speech (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#inCollana
  • Hannes Pomberger, Franz Zotter And Alois Sontacchi (Ed.) Proceedings of DAFx-10 13th International Conference on Digital Audio Effects (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#isbn
  • 978-3-200-01940-9 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autoriVolume
  • Hannes Pomberger, Franz Zotter And Alois Sontacchi (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#curatoriVolume
  • Hannes Pomberger, Franz Zotter And Alois Sontacchi (literal)
Abstract
  • Transformation of sound by statistical techniques is a promising method for a new range of digital audio effects. In this paper a data driven voice transformation algorithm is used to alter the timbre of a neutral (non-emotional) voice in order to reproduce a particular emotional vocal timbre. Perceptually based Mel-Cepstral analysis and Mel Log Spectral Approximation digital filter are used to represent the speech timbre and to synthesize speech with modified spectral envelope. The transformation function adopts a GMM (Gaussian Mixture Model) based parametrization in order convert the spectral envelopes. Experiments with the first and second order derivatives of the mel-cepstral coefficients have been undertaken to prove the benefit of including dynamic information in the model. The proposed algorithm has been evaluated by means of objective measures in the neutral-to-happy and neutral-to-sad tasks. (literal)
Editore
Prodotto di
Autore CNR
Insieme di parole chiave

Incoming links:


Autore CNR di
Prodotto
Editore di
Insieme di parole chiave di
data.CNR.it