TESLA GPUs versus MPI with OpenMP for the forward modeling of gravity and gravity gradient of large prisms ensemble

Carlos Couder-Castañeda; Carlos Ortiz-Alemán; Mauricio Gabriel Orozco-Del-Castillo; Mauricio Nava-Flores

doi:10.1155/2013/437357

TESLA GPUs versus MPI with OpenMP for the forward modeling of gravity and gravity gradient of large prisms ensemble

Carlos Couder-Castañeda, Carlos Ortiz-Alemán, Mauricio Gabriel Orozco-Del-Castillo, Mauricio Nava-Flores

Producción científica: Contribución a una revista › Artículo › revisión exhaustiva

9 Citas (Scopus)

Resumen

An implementation with the CUDA technology in a single and in several graphics processing units (GPUs) is presented for the calculation of the forward modeling of gravitational fields from a tridimensional volumetric ensemble composed by unitary prisms of constant density. We compared the performance results obtained with the GPUs against a previous version coded in OpenMP with MPI, and we analyzed the results on both platforms. Today, the use of GPUs represents a breakthrough in parallel computing, which has led to the development of several applications with various applications. Nevertheless, in some applications the decomposition of the tasks is not trivial, as can be appreciated in this paper. Unlike a trivial decomposition of the domain, we proposed to decompose the problem by sets of prisms and use different memory spaces per processing CUDA core, avoiding the performance decay as a result of the constant calls to kernels functions which would be needed in a parallelization by observations points. The design and implementation created are the main contributions of this work, because the parallelization scheme implemented is not trivial. The performance results obtained are comparable to those of a small processing cluster.

Idioma original	Inglés
Número de artículo	437357
Publicación	Journal of Applied Mathematics
Volumen	2013
DOI	https://doi.org/10.1155/2013/437357
Estado	Publicada - 2013
Publicado de forma externa	Sí

Acceder al documento

10.1155/2013/437357

Otros archivos y enlaces

Enlace a la publicación en Scopus

Citar esto

@article{667a4aa7040442baa009d46ebd4207f6,

title = "TESLA GPUs versus MPI with OpenMP for the forward modeling of gravity and gravity gradient of large prisms ensemble",

abstract = "An implementation with the CUDA technology in a single and in several graphics processing units (GPUs) is presented for the calculation of the forward modeling of gravitational fields from a tridimensional volumetric ensemble composed by unitary prisms of constant density. We compared the performance results obtained with the GPUs against a previous version coded in OpenMP with MPI, and we analyzed the results on both platforms. Today, the use of GPUs represents a breakthrough in parallel computing, which has led to the development of several applications with various applications. Nevertheless, in some applications the decomposition of the tasks is not trivial, as can be appreciated in this paper. Unlike a trivial decomposition of the domain, we proposed to decompose the problem by sets of prisms and use different memory spaces per processing CUDA core, avoiding the performance decay as a result of the constant calls to kernels functions which would be needed in a parallelization by observations points. The design and implementation created are the main contributions of this work, because the parallelization scheme implemented is not trivial. The performance results obtained are comparable to those of a small processing cluster.",

author = "Carlos Couder-Casta{\~n}eda and Carlos Ortiz-Alem{\'a}n and Orozco-Del-Castillo, {Mauricio Gabriel} and Mauricio Nava-Flores",

year = "2013",

doi = "10.1155/2013/437357",

language = "Ingl{\'e}s",

volume = "2013",

journal = "Journal of Applied Mathematics",

issn = "1110-757X",

}

TY - JOUR

T1 - TESLA GPUs versus MPI with OpenMP for the forward modeling of gravity and gravity gradient of large prisms ensemble

AU - Couder-Castañeda, Carlos

AU - Ortiz-Alemán, Carlos

AU - Orozco-Del-Castillo, Mauricio Gabriel

AU - Nava-Flores, Mauricio

PY - 2013

Y1 - 2013

N2 - An implementation with the CUDA technology in a single and in several graphics processing units (GPUs) is presented for the calculation of the forward modeling of gravitational fields from a tridimensional volumetric ensemble composed by unitary prisms of constant density. We compared the performance results obtained with the GPUs against a previous version coded in OpenMP with MPI, and we analyzed the results on both platforms. Today, the use of GPUs represents a breakthrough in parallel computing, which has led to the development of several applications with various applications. Nevertheless, in some applications the decomposition of the tasks is not trivial, as can be appreciated in this paper. Unlike a trivial decomposition of the domain, we proposed to decompose the problem by sets of prisms and use different memory spaces per processing CUDA core, avoiding the performance decay as a result of the constant calls to kernels functions which would be needed in a parallelization by observations points. The design and implementation created are the main contributions of this work, because the parallelization scheme implemented is not trivial. The performance results obtained are comparable to those of a small processing cluster.

AB - An implementation with the CUDA technology in a single and in several graphics processing units (GPUs) is presented for the calculation of the forward modeling of gravitational fields from a tridimensional volumetric ensemble composed by unitary prisms of constant density. We compared the performance results obtained with the GPUs against a previous version coded in OpenMP with MPI, and we analyzed the results on both platforms. Today, the use of GPUs represents a breakthrough in parallel computing, which has led to the development of several applications with various applications. Nevertheless, in some applications the decomposition of the tasks is not trivial, as can be appreciated in this paper. Unlike a trivial decomposition of the domain, we proposed to decompose the problem by sets of prisms and use different memory spaces per processing CUDA core, avoiding the performance decay as a result of the constant calls to kernels functions which would be needed in a parallelization by observations points. The design and implementation created are the main contributions of this work, because the parallelization scheme implemented is not trivial. The performance results obtained are comparable to those of a small processing cluster.

UR - http://www.scopus.com/inward/record.url?scp=84890028131&partnerID=8YFLogxK

U2 - 10.1155/2013/437357

DO - 10.1155/2013/437357

M3 - Artículo

SN - 1110-757X

VL - 2013

JO - Journal of Applied Mathematics

JF - Journal of Applied Mathematics

M1 - 437357

ER -

TESLA GPUs versus MPI with OpenMP for the forward modeling of gravity and gravity gradient of large prisms ensemble

Resumen

Acceder al documento

Otros archivos y enlaces

Huella

Citar esto