Distributed memory, GPU accelerated Fock construction for hybrid, Gaussian basis density functional theory.

J Chem Phys

Applied Mathematics and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA.

Published: June 2023

AI Article Synopsis

  • Modern supercomputers increasingly depend on GPUs, leading to a focus on optimizing electronic structure methods to leverage these parallel computing resources.
  • While previous work has mainly centered on shared memory systems, this study introduces distributed memory algorithms for calculating Coulomb and exact exchange matrices in hybrid Kohn-Sham density functional theory (DFT) with Gaussian basis sets.
  • The new algorithms, tested on systems with hundreds to over a thousand atoms on the Perlmutter supercomputer, show excellent performance and scalability using up to 128 NVIDIA A100 GPUs.

Article Abstract

With the growing reliance of modern supercomputers on accelerator-based architecture such a graphics processing units (GPUs), the development and optimization of electronic structure methods to exploit these massively parallel resources has become a recent priority. While significant strides have been made in the development GPU accelerated, distributed memory algorithms for many modern electronic structure methods, the primary focus of GPU development for Gaussian basis atomic orbital methods has been for shared memory systems with only a handful of examples pursing massive parallelism. In the present work, we present a set of distributed memory algorithms for the evaluation of the Coulomb and exact exchange matrices for hybrid Kohn-Sham DFT with Gaussian basis sets via direct density-fitted (DF-J-Engine) and seminumerical (sn-K) methods, respectively. The absolute performance and strong scalability of the developed methods are demonstrated on systems ranging from a few hundred to over one thousand atoms using up to 128 NVIDIA A100 GPUs on the Perlmutter supercomputer.

Download full-text PDF

Source
http://dx.doi.org/10.1063/5.0151070DOI Listing

Publication Analysis

Top Keywords

distributed memory
12
gaussian basis
12
gpu accelerated
8
electronic structure
8
structure methods
8
memory algorithms
8
methods
5
memory gpu
4
accelerated fock
4
fock construction
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!