A Comparison between Mel-Scale Cepstrum and Auditory Model Representation for Noisy Speech Recognition (Contributo in atti di convegno)

Type
Label
  • A Comparison between Mel-Scale Cepstrum and Auditory Model Representation for Noisy Speech Recognition (Contributo in atti di convegno) (literal)
Anno
  • 1990-01-01T00:00:00+01:00 (literal)
Alternative label
  • Cosi P., Falavigna D., Mian G.A., Omologo M. (1990)
    A Comparison between Mel-Scale Cepstrum and Auditory Model Representation for Noisy Speech Recognition
    in Proceedings EUSIPCO-90, Barcellona, Spain, 18-21 September, 1990
    (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autori
  • Cosi P., Falavigna D., Mian G.A., Omologo M. (literal)
Pagina inizio
  • 1199 (literal)
Pagina fine
  • 1202 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#altreInformazioni
  • Cosi P., Falavigna D., Mian G.A., Omologo M. A Comparison between Mel-Scale Cepstrum and Auditory Model Representation for Noisy Speech Recognition Proceedings EUSIPCO-90 - Signal Processing: Theory and Applications. Fifth European Signal Processing Conference Barcellona, Spain 18-21 September, 1990 pp. 1199-1202 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#titoloVolume
  • Proceedings EUSIPCO-90 - Signal Processing: Theory and Applications. Fifth European Signal Processing Conference (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#pagineTotali
  • 2034 (literal)
Note
  • B (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#affiliazioni
  • ISTC CNR, UOS Padova Cosi P. ITC-IRST Istituto per la Ricerca Scientifica e Tecnologica, Pante' di Povo, 38050 Trento Italia Falavigna D., Omologo M. Dipartimento di Elettronica e Informatica Via Gradenigo 6, 35100 Padova, Italia Mian G.A. (literal)
Titolo
  • A Comparison between Mel-Scale Cepstrum and Auditory Model Representation for Noisy Speech Recognition (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#isbn
  • 0444886362 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#curatoriVolume
  • Luis Torres, Enrique Masgrau, Miguel A. Lagunas (literal)
Abstract
  • A joint synchrony/mean-rate auditory model, recently proposed by Seneff[6], is embedded into a classical DTW-based system for the recognition of Italian digits. Its performances are evaluĀ¬ated in both clean and noisy speech and compared with those of a system based on the MelĀ¬cepstrum representation. Experimental results show that the Mel representation outperforms the auditory model. Problems encountered by the auditory model in noisy speech are outlined and suggestions for noise compensation techniques both inside and outside the model are given. Simple image processing techniques aiming to clean up the synchrony spectrogram in noisy speech are suggested and some promising preliminary results are presented. (literal)
Editore
Prodotto di
Autore CNR
Insieme di parole chiave

Incoming links:


Autore CNR di
Prodotto
Editore di
Insieme di parole chiave di
data.CNR.it