A portable and performant graphics processing unit (GPU)-accelerated library for electron repulsion integral (ERI) evaluation, named LibERI, has been developed and implemented via directive-based (e.g., OpenMP and OpenACC) and standard language parallelism (e.
View Article and Find Full Text PDFThe effective fragment molecular orbital (EFMO) method has been developed to predict the total energy of a very large molecular system accurately (with respect to the underlying quantum mechanical method) and efficiently by taking advantage of the locality of strong chemical interactions and employing a two-level hierarchical parallelism. The accuracy of the EFMO method is partly attributed to the accurate and robust intermolecular interaction prediction between distant fragments, in particular, the many-body polarization and dispersion effects, which require the generation of static and dynamic polarizability tensors by solving the coupled perturbed Hartree-Fock (CPHF) and time-dependent HF (TDHF) equations, respectively. Solving the CPHF and TDHF equations is the main EFMO computational bottleneck due to the inefficient (serial) and I/O-intensive implementation of the CPHF and TDHF solvers.
View Article and Find Full Text PDFThe primary focus of GAMESS over the last 5 years has been the development of new high-performance codes that are able to take effective and efficient advantage of the most advanced computer architectures, both CPU and accelerators. These efforts include employing density fitting and fragmentation methods to reduce the high scaling of well-correlated (e.g.
View Article and Find Full Text PDFThe performance of Fortran 2008 DO CONCURRENT (DC) relative to OpenACC and OpenMP target offloading (OTO) with different compilers is studied for the GAMESS quantum chemistry application. Specifically, DC and OTO are used to offload the Fock build, which is a computational bottleneck in most quantum chemistry codes, to GPUs. The DC Fock build performance is studied on NVIDIA A100 and V100 accelerators and compared with the OTO versions compiled by the NVIDIA HPC, IBM XL, and Cray Fortran compilers.
View Article and Find Full Text PDFMultiple ERI (Electron Repulsion Integral) tensor contractions (METC) with several matrices are ubiquitous in quantum chemistry. In response theories, the contraction operation, rather than ERI computations, can be the major bottleneck, as its computational demands are proportional to the multiplicatively combined contributions of the number of excited states and the kernel pre-factors. This paper presents several high-performance strategies for METC.
View Article and Find Full Text PDFUsing an OpenMP Application Programming Interface, the resolution-of-the-identity second-order Møller-Plesset perturbation (RI-MP2) method has been off-loaded onto graphical processing units (GPUs), both as a standalone method in the GAMESS electronic structure program and as an electron correlation energy component in the effective fragment molecular orbital (EFMO) framework. First, a new scheme has been proposed to maximize data digestion on GPUs that subsequently linearizes data transfer from central processing units (CPUs) to GPUs. Second, the GAMESS Fortran code has been interfaced with GPU numerical libraries (e.
View Article and Find Full Text PDFStrategies for multiple-level parallelizations of quantum-mechanical calculations are discussed, with an emphasis on using groups of workers for performing parallel tasks. These parallel programming models can be used for a variety ab initio quantum chemistry approaches, including the fragment molecular orbital method and replica-exchange molecular dynamics. Strategies for efficient load balancing on problems of increasing granularity are introduced and discussed.
View Article and Find Full Text PDFA framework to offload four-index two-electron repulsion integrals to graphical processing units (GPUs) using OpenMP is discussed. The method has been applied to the Fock build for low angular momentum s and p functions in both the restricted Hartree-Fock (RHF) and in the effective fragment molecular orbital (EFMO) framework. Benchmark calculations for the GPU code for the pure RHF method show an increasing speedup relative to the existing OpenMP CPU code in GAMESS from 1.
View Article and Find Full Text PDFAn quantum chemical approach for the modeling of propellant degradation is presented. Using state-of-the-art bonding analysis techniques and composite methods, a series of potential degradation reactions are devised for a sample hydroxyl-terminated-polybutadiene (HTPB) type solid fuel. By applying thermochemical procedures and isodesmic reactions, accurate thermochemical quantities are obtained using a modified G3 composite method based on the resolution of the identity.
View Article and Find Full Text PDFThe Gaussian-3 (G3) composite approach for thermochemical properties is revisited in light of the enhanced computational efficiency and reduced memory costs by applying the resolution-of-the-identity (RI) approximation for two-electron repulsion integrals (ERIs) to the computationally demanding component methods in the G3 model: the energy and gradient computations via the second-order Møller-Plesset perturbation theory (MP2) and the energy computations using the coupled-cluster singles-doubles method augmented with noniterative triples corrections [CCSD(T)]. Efficient implementation of the RI-based methods is achieved by employing a hybrid distributed/shared memory model based on MPI and OpenMP. The new variant of the G3 composite approach based on the RI approximation is termed the RI-G3 scheme, or alternatively the PDG method.
View Article and Find Full Text PDFA discussion of many of the recently implemented features of GAMESS (General Atomic and Molecular Electronic Structure System) and LibCChem (the C++ CPU/GPU library associated with GAMESS) is presented. These features include fragmentation methods such as the fragment molecular orbital, effective fragment potential and effective fragment molecular orbital methods, hybrid MPI/OpenMP approaches to Hartree-Fock, and resolution of the identity second order perturbation theory. Many new coupled cluster theory methods have been implemented in GAMESS, as have multiple levels of density functional/tight binding theory.
View Article and Find Full Text PDFJ Chem Theory Comput
February 2020
The fully analytic gradient of the second-order Møller-Plesset perturbation theory (MP2) with the resolution-of-the-identity (RI) approximation in the fragment molecular orbital (FMO) framework is derived and implemented using a hybrid multilevel parallel programming model, a combination of the general distributed data interface (GDDI) and the OpenMP API. The FMO/MP2 analytic gradient contains three parts, i.e.
View Article and Find Full Text PDFJ Chem Theory Comput
October 2019
The general distributed data interface (GDDI) that was developed for the fragment molecular orbital (FMO) method is combined with the shared memory OpenMP parallel middleware to support a threading multilevel parallelism. First, GDDI partitions [logical] compute nodes into groups, which are statically or dynamically assigned to different fragments. A small number of processes are created on each compute node.
View Article and Find Full Text PDFThe four-index two-electron repulsion integral (4-2ERI) matrix is compressed using the resolution-of-the-identity (RI) approximation combined with the rank factorization approximation (RFA). The 4-2ERI is first approximated by the RI product. Then, the singular value decomposition (SVD) approximation is used to eliminate low-weighted singular vectors.
View Article and Find Full Text PDFPhys Chem Chem Phys
December 2016
The thermodynamic and kinetic controls of graphene chemistry are studied computationally using a graphene hydrogenation reaction and polyaromatic hydrocarbons to represent the graphene surface. Hydrogen atoms are concertedly chemisorped onto the surface of graphene models of different shapes (i.e.
View Article and Find Full Text PDFA comprehensive picture on the mechanism of the epoxy-phenol curing reactions is presented using the density functional theory B3LYP/ 6-31G(d,p) and simplified physical molecular models to examine all possible reaction pathways. Phenol can act as its own promoter by using an addition phenol molecule to stabilize the transition states, and thus lower the rate-limiting barriers by 27.0-48.
View Article and Find Full Text PDFWe present a detailed analysis of the factors influencing the formation of epoxide and ether groups in graphene nanoflakes using conventional density functional theory (DFT), the density-functional tight-binding (DFTB) method, π-Hückel theory, and graph theoretical invariants. The relative thermodynamic stability associated with the chemisorption of oxygen atoms at various positions on hexagonal graphene flakes (HGFs) of D(6h)-symmetry is determined by two factors - viz. the disruption of the π-conjugation of the HGF and the geometrical deformation of the HGF structure.
View Article and Find Full Text PDF