http://www.cnr.it/ontology/cnr/individuo/prodotto/ID206289
Direct local pattern sampling by efficient two-step random procedures (Contributo in atti di convegno)
- Type
- Label
- Direct local pattern sampling by efficient two-step random procedures (Contributo in atti di convegno) (literal)
- Anno
- 2011-01-01T00:00:00+01:00 (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#doi
- 10.1145/2020408.2020500 (literal)
- Alternative label
Boley M., Lucchese C., Paurat D., Gartner, T. (2011)
Direct local pattern sampling by efficient two-step random procedures
in ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD'11, San Diego, USA, 21-24 August 2011
(literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autori
- Boley M., Lucchese C., Paurat D., Gartner, T. (literal)
- Pagina inizio
- Pagina fine
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#altreInformazioni
- Area di valutazione 01 - Scienze matematiche e informatiche.
Boley, Mario; Lucchese, Claudio; Paurat, Daniel; Gartner, Thomas (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#url
- http://dl.acm.org/citation.cfm?id=2020500&CFID=61806564&CFTOKEN=64940966 (literal)
- Note
- PuMa (literal)
- Scopu (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#affiliazioni
- CNR-ISTI, Pisa, Italy; University of Bonn, Germany (literal)
- Titolo
- Direct local pattern sampling by efficient two-step random procedures (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#isbn
- 978-1-4503-0813-7 (literal)
- Abstract
- We present several exact and highly scalable local pattern sampling algorithms. They can be used as an alternative to exhaustive local pattern discovery methods (e.g, frequent set mining or optimistic-estimator-based subgroup discovery) and can substantially improve efficiency as well as con- trollability of pattern discovery processes. While previous sampling approaches mainly rely on the Markov chain Monte Carlo method, our procedures are direct, i.e., non process- simulating, sampling algorithms. The advantages of these direct methods are an almost optimal time complexity per pattern as well as an exactly controlled distribution of the produced patterns. Namely, the proposed algorithms can sample (item-)sets according to frequency, area, squared fre- quency, and a class discriminativity measure. Experiments demonstrate that these procedures can improve the accuracy of pattern-based models similar to frequent sets and often also lead to substantial gains in terms of scalability. (literal)
- Editore
- Prodotto di
- Autore CNR
- Insieme di parole chiave
Incoming links:
- Prodotto
- Autore CNR di
- Editore di
- Insieme di parole chiave di