A Tree-Based Approach to Clustering XML Documents by Structure (Articolo in rivista)

Type
Label
  • A Tree-Based Approach to Clustering XML Documents by Structure (Articolo in rivista) (literal)
Anno
  • 2004-01-01T00:00:00+01:00 (literal)
Alternative label
  • Costa Gianni, Manco Giuseppe, Ortale Riccardo, Tagarelli Andrea (2004)
    A Tree-Based Approach to Clustering XML Documents by Structure
    in Lecture notes in computer science
    (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autori
  • Costa Gianni, Manco Giuseppe, Ortale Riccardo, Tagarelli Andrea (literal)
Pagina inizio
  • 137 (literal)
Pagina fine
  • 148 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#numeroVolume
  • 3202- (literal)
Rivista
Note
  • ISI Web of Science (WOS) (literal)
Titolo
  • A Tree-Based Approach to Clustering XML Documents by Structure (literal)
Abstract
  • We propose a novel methodology for clustering XML documents on the basis of their structural similarities. The basic idea is to equip each cluster with an \emph{XML cluster representative}, i.e. an XML document subsuming the most typical structural specifics of a set of XML documents. Clustering is essentially accomplished by comparing cluster representatives, and updating the representatives as soon as new clusters are detected. We propose an algorithm for computing an XML representative through three phases. Suitable techniques for identifying significant node matchings and for reliably merging and pruning XML trees are investigated. Also, experimental evaluation performed on both synthetic and real data shows the effectiveness of our approach. (literal)
Prodotto di

Incoming links:


Prodotto
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#rivistaDi
data.CNR.it