http://www.cnr.it/ontology/cnr/individuo/prodotto/ID140185
Statistical Spectral Envelope Transformation applied to Emotional Speech (Contributo in atti di convegno)
- Type
- Label
- Statistical Spectral Envelope Transformation applied to Emotional Speech (Contributo in atti di convegno) (literal)
- Anno
- 2010-01-01T00:00:00+01:00 (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#doi
- 10.1002/9781119991298 (literal)
- Alternative label
Fabio Tesser, Enrico Zovato, Piero Cosi
Institute of Cognitive Sciences and Technologies,
Italian National Research Council
Padova, Italy (2010)
Statistical Spectral Envelope Transformation applied to Emotional Speech
in 13th Int. Conference on Digital Audio Effects (DAFx-10), Graz, Austria, September 6-10, 2010
(literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autori
- Fabio Tesser, Enrico Zovato, Piero Cosi
Institute of Cognitive Sciences and Technologies,
Italian National Research Council
Padova, Italy (literal)
- Pagina inizio
- Pagina fine
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#altreInformazioni
- Udo Zölzer
Published Online: 10 MAR 2011
DOI: 10.1002/9781119991298.fmatter
Copyright © 2011 John Wiley & Sons, Ltd
Book Title
Helmut Schmidt University - University of the Federal Armed Forces, Hamburg, Germany
Published Online: 10 MAR 2011
Published Print: 11 MAR 2011
Print ISBN: 9780470665992
Online ISBN: 9781119991298 (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#citta
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#url
- http://dafx10.iem.at/proceedings/papers/TesserZovatoCosi_DAFx10_P81.pdf (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#titoloVolume
- Proceedings of DAFx-10 13th International Conference on Digital Audio Effects (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autoriCuratela
- Tesser F, Zovato E, Cosi P (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#pagineTotali
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#descrizioneSinteticaDelProdotto
- Transformation of sound by statistical techniques is a promising method for a new range of digital audio effects. In this paper a data driven voice transformation algorithm is used to alter the timbre of a neutral (non-emotional) voice in order to reproduce a particular emotional vocal timbre.
Perceptually based Mel-Cepstral analysis and Mel Log Spectral Approximation digital filter are used to represent the speech timbre and to synthesize speech with modified spectral envelope.
The transformation function adopts a GMM (Gaussian Mixture Model) based parametrization in order convert the spectral envelopes. Experiments with the first and second order derivatives of the mel-cepstral coefficients have been undertaken to prove the benefit of including dynamic information in the model.
The proposed algorithm has been evaluated by means of objective measures in the neutral-to-happy and neutral-to-sad tasks. (literal)
- Note
- R (literal)
- Wiley OnLine L (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#affiliazioni
- Fabio Tesser, Piero Cosi
Institute of Cognitive Sciences and Technologies,
Italian National Research Council
Padova, Italy
ISTC-CNR, UOS Padova, Padova, Italy
Enrico Zovato
Loquendo S.p.A.
Torino, Italy (literal)
- Titolo
- Statistical Spectral Envelope Transformation applied to Emotional Speech (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#inCollana
- Hannes Pomberger, Franz Zotter And Alois Sontacchi (Ed.) Proceedings of DAFx-10 13th International Conference on Digital Audio Effects (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#isbn
- 978-3-200-01940-9 (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autoriVolume
- Hannes Pomberger, Franz Zotter And Alois Sontacchi (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#curatoriVolume
- Hannes Pomberger, Franz Zotter And Alois Sontacchi (literal)
- Abstract
- Transformation of sound by statistical techniques is a promising method for a new range of digital audio effects. In this paper a data driven voice transformation algorithm is used to alter the timbre of a neutral (non-emotional) voice in order to reproduce a particular emotional vocal timbre. Perceptually based Mel-Cepstral analysis and Mel Log Spectral Approximation digital filter are used to represent the speech timbre and to synthesize speech with modified spectral envelope. The transformation function adopts a GMM (Gaussian Mixture Model) based parametrization in order convert the spectral envelopes. Experiments with the first and second order derivatives of the mel-cepstral coefficients have been undertaken to prove the benefit of including dynamic information in the model. The proposed algorithm has been evaluated by means of objective measures in the neutral-to-happy and neutral-to-sad tasks. (literal)
- Editore
- Prodotto di
- Autore CNR
- Insieme di parole chiave
Incoming links:
- Autore CNR di
- Prodotto
- Editore di
- Insieme di parole chiave di