Analysis and recognition of highly degraded printed characters (Articolo in rivista)

Type
Label
  • Analysis and recognition of highly degraded printed characters (Articolo in rivista) (literal)
Anno
  • 2004-01-01T00:00:00+01:00 (literal)
Alternative label
  • Tonazzini A., Vezzosi S., Bedini L. (2004)
    Analysis and recognition of highly degraded printed characters
    in International journal on document analysis and recognition (Internet)
    (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autori
  • Tonazzini A., Vezzosi S., Bedini L. (literal)
Pagina inizio
  • 236 (literal)
Pagina fine
  • 247 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#numeroVolume
  • 6 (literal)
Rivista
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#note
  • 17 november 2003 - Pubblicazione online (A0-24) (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#descrizioneSinteticaDelProdotto
  • This paper proposes an integrated system for the processing and the analysis of highly degraded printed documents, with the aim at recognizing the text characters. As a case study, ancient printed texts are considered. The system is constituted of various blocks operating sequentially. Starting from a single page of the document, the background noise is reduced by wavelet-based decomposition and filtering, the text lines are detected, extracted, and segmented into blobs corresponding to characters, by a simple and fast adaptive thresholding, and the various blobs are analyzed by a feed-forward multilayer neural network, trained with a back-propagation algorithm. For each character, the probability associated to the recognition is then used as a discriminating parameter that determines the automatic activation of a feedback process, leading back the system to a block for refining segmentation. This block acts only on the small portions of the text where the recognition cannot be relied on, and makes use of blind deconvolution and MRF-based segmentation techniques, whose high complexity is greatly reduced when applied to a few sub-images of small size. The experimental results highlight that the proposed system performs a very precise segmentation of the characters and then a highly effective recognition of even strongly degraded texts. (literal)
Note
  • PuMa (literal)
  • S (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#affiliazioni
  • ISTI-CNR (literal)
Titolo
  • Analysis and recognition of highly degraded printed characters (literal)
Abstract
  • This paper proposes an integrated system for the processing and analysis of highly degraded printed documents for the purpose of recognizing text characters. As a case study, ancient printed texts are considered. The system is comprised of various blocks operating sequentially. Starting with a single page of the document, the background noise is reduced by wavelet-based decomposition and filtering, the text lines are detected, extracted, and segmented by a simple and fast adaptive thresholding into blobs corresponding to characters, and the various blobs are analyzed by a feedforward multilayer neural network trained with a back-propagation algorithm. For each character, the probability associated with the recognition is then used as a discriminating parameter that determines the automatic activation of a feedback process, leading the system back to a block for refining segmentation. This block acts only on the small portions of the text where the recognition cannot be relied on and makes use of blind deconvolution and MRF-based segmentation techniques whose high complexity is greatly reduced when applied to a few subimages of small size. The experimental results highlight that the proposed system performs a very precise segmentation of the characters and then a highly effective recognition of even strongly degraded texts. (literal)
Prodotto di
Autore CNR
Insieme di parole chiave

Incoming links:


Prodotto
Autore CNR di
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#rivistaDi
Insieme di parole chiave di
data.CNR.it