Compute intensive algorithm on heterogeneous system: A case study about fourier transform (Contributo in atti di convegno)

Type
Label
  • Compute intensive algorithm on heterogeneous system: A case study about fourier transform (Contributo in atti di convegno) (literal)
Anno
  • 2014-01-01T00:00:00+01:00 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#doi
  • 10.1109/PDP.2014.55 (literal)
Alternative label
  • Galizia A.; Danovaro E.; Ripepi G.; Clematis A. (2014)
    Compute intensive algorithm on heterogeneous system: A case study about fourier transform
    in 2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, Torino, Italia, 12-14 Febbraio 2014
    (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autori
  • Galizia A.; Danovaro E.; Ripepi G.; Clematis A. (literal)
Pagina inizio
  • 223 (literal)
Pagina fine
  • 227 (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#url
  • http://www.scopus.com/inward/record.url?eid=2-s2.0-84899445044&partnerID=q2rCbXpz (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#titoloVolume
  • 2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (literal)
Note
  • Google Scholar (literal)
  • Scopu (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#affiliazioni
  • Institute for Applied Mathematics and Information Technologies, National Research Council, Genoa, Italy (literal)
Titolo
  • Compute intensive algorithm on heterogeneous system: A case study about fourier transform (literal)
Abstract
  • Current workstations can offer really amazing raw computational power: up to 10 TFlops on a single machine equipped with multiple CPUs and accelerators as the Intel Xeon Phi or GPU devices. Such results can only be achieved with a massive parallelism of computational devices, thus the actual barrier posed by the exploitation of modern heterogeneous HPC resources is the difficulty in development and/or (performance) efficient porting of software on such architectures. In this paper, we present an experimental study about achievable performance of a widely used, computational intensive application the Fourier Transform, i.e. Discrete Fourier Transform (DFT) and Fast Fourier Transform. We propose an evaluation of the benefits obtained exploiting such resources in terms of performance and programming efforts in the development of the code with a emphasis on the programming approach adopted for code parallelization. With the exception of the interesting performance achieved exploiting GPU for the DFT algorithm, the use state-ofthe- art software libraries provide the best solution since they represent a good compromise to balance programming efforts and performance achievements. © 2014 IEEE. (literal)
Editore
Prodotto di
Autore CNR
Insieme di parole chiave

Incoming links:


Autore CNR di
Prodotto
Editore di
Insieme di parole chiave di
data.CNR.it