Integral-Direct and Parallel Implementation of the CCSD(T) Method: Algorithmic Developments and Large-Scale Applications.

J Chem Theory Comput

Department of Physical Chemistry and Materials Science , Budapest University of Technology and Economics, P.O. Box 91, H-1521 Budapest , Hungary.

Published: January 2020

A completely integral-direct, disk I/O, and network traffic economic coupled-cluster singles, doubles, and perturbative triples [CCSD(T)] implementation has been developed relying on the density-fitting approximation. By fully exploiting the permutational symmetry, the presented algorithm is highly operation count and memory-efficient. Our measurements demonstrate excellent strong scaling achieved via hybrid MPI/OpenMP parallelization and a highly competitive, 60-70% utilization of the theoretical peak performance on up to hundreds of cores. The terms whose evaluation time becomes significant only for small- to medium-sized examples have also been extensively optimized. Consequently, high performance is also expected for systems appearing in extensive data sets used, e.g., for density functional or machine learning parametrizations, and in calculations required for certain reduced-cost or local approximations of CCSD(T), such as in our local natural orbital scheme [LNO-CCSD(T)]. The efficiency of this implementation allowed us to perform some of the largest CCSD(T) calculations ever presented for systems of 31-43 atoms and 1037-1569 orbitals using only four to eight many-core CPUs and 1-3 days of wall time. The resulting 13 correlation energies and the 12 corresponding reaction energies and barrier heights are added to our previous benchmark set collecting reference CCSD(T) results of molecules at the applicability limit of current implementations.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.jctc.9b00957DOI Listing

Publication Analysis

Top Keywords

integral-direct parallel
4
parallel implementation
4
ccsdt
4
implementation ccsdt
4
ccsdt method
4
method algorithmic
4
algorithmic developments
4
developments large-scale
4
large-scale applications
4
applications completely
4

Similar Publications

Linear-Scaling Local Natural Orbital CCSD(T) Approach for Open-Shell Systems: Algorithms, Benchmarks, and Large-Scale Applications.

J Chem Theory Comput

November 2023

Department of Physical Chemistry and Materials Science, Faculty of Chemical Technology and Biotechnology, Budapest University of Technology and Economics, Műegyetem rkp. 3, H-1111 Budapest, Hungary.

The extension of the highly optimized local natural orbital (LNO) coupled cluster (CC) with single-, double-, and perturbative triple excitations [LNO-CCSD(T)] method is presented for high-spin open-shell molecules based on restricted open-shell references. The techniques enabling the outstanding efficiency of the closed-shell LNO-CCSD(T) variant are adopted, including the iteration- and redundancy-free second-order Møller-Plesset and (T) formulations as well as the integral-direct, memory- and disk use-economic, and OpenMP-parallel algorithms. For large molecules, the efficiency of our open-shell LNO-CCSD(T) method approaches that of its closed-shell parent method due to the application of restricted orbital sets for demanding integral transformations and a novel approximation for higher-order long-range spin-polarization effects.

View Article and Find Full Text PDF

Hyperfine Coupling Constants in Local Exact Two-Component Theory.

J Chem Theory Comput

January 2022

Department of Chemistry, University of California, Irvine, 1102 Natural Sciences II, Irvine, California 92697-2025, United States.

We present a highly efficient implementation of the electron-nucleus hyperfine coupling matrix within the one-electron exact two-component (X2C) theory. The complete derivative of the X2C Hamiltonian is formed, that is, the derivatives of the unitary decoupling transformation are considered. This requires the solution of the response and Sylvester equations, consequently increasing the computational costs.

View Article and Find Full Text PDF

A Massively Parallel Implementation of the CCSD(T) Method Using the Resolution-of-the-Identity Approximation and a Hybrid Distributed/Shared Memory Parallelization Model.

J Chem Theory Comput

August 2021

Department of Chemistry and Ames Laboratory, Iowa State University, 2416 Pammel Drive, Ames 50011-2416, Iowa United States of America.

A parallel algorithm is described for the coupled-cluster singles and doubles method augmented with a perturbative correction for triple excitations [CCSD(T)] using the resolution-of-the-identity (RI) approximation for two-electron repulsion integrals (ERIs). The algorithm bypasses the storage of four-center ERIs by adopting an integral-direct strategy. The CCSD amplitude equations are given in a compact quasi-linear form by factorizing them in terms of amplitude-dressed three-center intermediates.

View Article and Find Full Text PDF

A linear-scaling local second-order Møller-Plesset (MP2) method is presented for high-spin open-shell molecules based on restricted open-shell (RO) reference functions. The open-shell local MP2 (LMP2) approach inherits the iteration- and redundancy-free formulation and the completely integral-direct, OpenMP-parallel, and memory and disk use economic algorithms of our closed-shell LMP2 implementation. By utilizing restricted local molecular orbitals for the demanding integral transformation step and by introducing a novel long-range spin-polarization approximation, the computational cost of RO-LMP2 approaches that of closed-shell LMP2.

View Article and Find Full Text PDF

Accurate Reduced-Cost CCSD(T) Energies: Parallel Implementation, Benchmarks, and Large-Scale Applications.

J Chem Theory Comput

February 2021

Department of Physical Chemistry and Materials Science, Budapest University of Technology and Economics, P.O. Box 91, H-1521 Budapest, Hungary.

The accurate and systematically improvable frozen natural orbital (FNO) and natural auxiliary function (NAF) cost-reducing approaches are combined with our recent coupled-cluster singles, doubles, and perturbative triples [CCSD(T)] implementations. Both of the closed- and open-shell FNO-CCSD(T) codes benefit from OpenMP parallelism, completely or partially integral-direct density-fitting algorithms, checkpointing, and hand-optimized, memory- and operation count effective implementations exploiting all permutational symmetries. The closed-shell CCSD(T) code requires negligible disk I/O and network bandwidth, is MPI/OpenMP parallel, and exhibits outstanding peak performance utilization of 50-70% up to hundreds of cores.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!