Data handling strategies for high throughput pyrosequencers. (Articolo in rivista)

Type
Label
  • Data handling strategies for high throughput pyrosequencers. (Articolo in rivista) (literal)
Anno
  • 2007-01-01T00:00:00+01:00 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#doi
  • 10.1186/1471-2105-8-S1-S22 (literal)
Alternative label
  • Trombetti GA, Bonnal RJ, Rizzi E, De Bellis G, Milanesi L. (2007)
    Data handling strategies for high throughput pyrosequencers.
    in BMC bioinformatics
    (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autori
  • Trombetti GA, Bonnal RJ, Rizzi E, De Bellis G, Milanesi L. (literal)
Pagina inizio
  • s1 (literal)
Pagina fine
  • s22 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#numeroVolume
  • 8 (literal)
Rivista
Note
  • ISI Web of Science (WOS) (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#affiliazioni
  • CNR-ITB (literal)
Titolo
  • Data handling strategies for high throughput pyrosequencers. (literal)
Abstract
  • BACKGROUND: New high throughput pyrosequencers such as the 454 Life Sciences GS 20 are capable of massively parallelizing DNA sequencing providing an unprecedented rate of output data as well as potentially reducing costs. However, these new pyrosequencers bear a different error profile and provide shorter reads than those of a more traditional Sanger sequencer. These facts pose new challenges regarding how the data are handled and analyzed, in addition, the steep increase in the sequencers throughput calls for much computation power at a low cost. RESULTS: To address these challenges, we created an automated multi-step computation pipeline integrated with a database storage system. This allowed us to store, handle, index and search (1) the output data from the GS20 sequencer (2) analysis projects, possibly multiple on every dataset (3) final results of analysis computations (4) intermediate results of computations (these allow hand-made comparisons and hence further searches by the biologists). Repeatability of computations was also a requirement. In order to access the needed computation power, we ported the pipeline to the European Grid: a large community of clusters, load balan (literal)
Prodotto di
Autore CNR

Incoming links:


Autore CNR di
Prodotto
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#rivistaDi
data.CNR.it