Entry pairing in inverted file (Contributo in atti di convegno)

Type
Label
  • Entry pairing in inverted file (Contributo in atti di convegno) (literal)
Anno
  • 2009-01-01T00:00:00+01:00 (literal)
Alternative label
  • Lam H. T.; Perego R.; Silvestri F.; Quan N. T. M. (2009)
    Entry pairing in inverted file
    in WISE 2009 - Web Information Systems Engineering. 10th International Conference
    (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autori
  • Lam H. T.; Perego R.; Silvestri F.; Quan N. T. M. (literal)
Pagina inizio
  • 511 (literal)
Pagina fine
  • 522 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#titoloVolume
  • WISE 2009 - Web Information Systems Engineering. 10th International Conference (Poznan, Polonia, October 5-7 2009). Proceedings, pp. 511 - 522. (Lecture Notes in Computer Science, vol. 5802). Springer US, (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#numeroVolume
  • 5802 (literal)
Rivista
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#note
  • In: WISE 2009 - Web Information Systems Engineering. 10th International Conference (Poznan, Polonia, October 5-7 2009). Proceedings, pp. 511 - 522. (Lecture Notes in Computer Science, vol. 5802). Springer US, 2009. (literal)
Note
  • ISI Web of Science (WOS) (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#affiliazioni
  • CNR-ISTI, Pisa, Lomonosov Moscow State University, Russia (literal)
Titolo
  • Entry pairing in inverted file (literal)
Abstract
  • This paper proposes to exploit content and usage informa- tion to rearrange an inverted index for a full-text IR system. The idea is to merge the entries of two frequently co-occurring terms, either in the collection or in the answered queries, to form a single, paired, entry. Since postings common to paired terms are not replicated, the resulting index is more compact. In addition, queries containing terms that have been paired are answered faster since we can exploit the pre-computed posting intersection. In order to choose which terms have to be paired, we formulate the term pairing problem as a Maximum-Weight Matching Graph problem, and we evaluate in our scenario efficiency and efficacy of both an exact and a heuristic solution. We apply our technique: (i) to compact a compressed inverted file built on an actual Web collection of documents, and (ii) to increase capacity of an in-memory posting list. Experiments showed that in the first case our approach can improve the compression ratio of up to 7.7%, while we measured a saving from 12% up to 18% in the size of the posting cache. (literal)
Prodotto di
Autore CNR
Insieme di parole chiave

Incoming links:


Autore CNR di
Prodotto
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#rivistaDi
Insieme di parole chiave di
data.CNR.it