A Multi-Domain Architecture for Mining Frequent Items and Itemsets from Distributed Data Streams (Articolo in rivista)

Type
Label
  • A Multi-Domain Architecture for Mining Frequent Items and Itemsets from Distributed Data Streams (Articolo in rivista) (literal)
Anno
  • 2014-01-01T00:00:00+01:00 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#doi
  • 10.1007/s10723-013-9277-0 (literal)
Alternative label
  • Cesario, Eugenio; Mastroianni, Carlo; Talia, Domenico (2014)
    A Multi-Domain Architecture for Mining Frequent Items and Itemsets from Distributed Data Streams
    in Journal of grid computing
    (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autori
  • Cesario, Eugenio; Mastroianni, Carlo; Talia, Domenico (literal)
Pagina inizio
  • 153 (literal)
Pagina fine
  • 168 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#url
  • http://www.scopus.com/record/display.url?eid=2-s2.0-84899436704&origin=inward (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#numeroVolume
  • 12 (literal)
Rivista
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#numeroFascicolo
  • 1 (literal)
Note
  • Scopu (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#affiliazioni
  • ICAR-CNR; ICAR-CNR; Università della Calabria (literal)
Titolo
  • A Multi-Domain Architecture for Mining Frequent Items and Itemsets from Distributed Data Streams (literal)
Abstract
  • Real-time analysis of distributed data streams is a challenging task since it requires scalable solutions to handle streams of data that are generated very rapidly by multiple sources. This paper presents the design and the implementation of an architecture for the analysis of data streams in distributed environments. In particular, data stream analysis has been carried out for the computation of items and itemsets that exceed a frequency threshold. The mining approach is hybrid, that is, frequent items are calculated with a single pass, using a sketch algorithm, while frequent itemsets are calculated by a further multi-pass analysis. The architecture combines parallel and distributed processing to keep the pace with the rate of distributed data streams. In order to keep computation close to data, miners are distributed among the domains where data streams are generated. The paper reports the experimental results obtained with a prototype of the architecture, tested on a Grid composed of three domains each one handling a data stream. © 2013 Springer Science+Business Media Dordrecht. (literal)
Prodotto di
Autore CNR
Insieme di parole chiave

Incoming links:


Prodotto
Autore CNR di
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#rivistaDi
Insieme di parole chiave di
data.CNR.it