Performance of parallel FDTD method for shared- and distributed-memory architectures: Application tobioelectromagnetics
Metadatos
Afficher la notice complèteEditorial
Rashid Mehmood
Date
2020-09-11Referencia bibliográfica
Ruiz-Cabello N. M, Abal¸enkovs M, Diaz Angulo LM, Cobos Sanchez C, Moglie F, Garcia SG (2020) Performance of parallel FDTD method for shared- and distributed-memory architectures: Application tobioelectromagnetics. PLoS ONE 15 (9): e0238115. [https://doi.org/10.1371/journal. pone.0238115]
Résumé
This work provides an in-depth computational performance study of the parallel finite-difference
time-domain (FDTD) method. The parallelization is done at various levels including:
shared- (OpenMP) and distributed- (MPI) memory paradigms and vectorization on three different
architectures: Intel’s Knights Landing, Skylake and ARM’s Cavium ThunderX2. This
study contributes to prove, in a systematic manner, the well-established claim within the
Computational Electromagnetic community, that the main factor limiting FDTD performance,
in realistic problems, is the memory bandwidth. Consequently a memory bandwidth
threshold can be assessed depending on the problem size in order to attain optimal performance.
Finally, the results of this study have been used to optimize the workload balancing
of simulation of a bioelectromagnetic problem consisting in the exposure of a human model
to a reverberation chamber-like environment.