The purpose of this work is to evaluate the efficacy of oversubscription, at the 1n, 2n, and 3n levels for n physical cores, on semi-direct MP2 methods within NWChem when using two and three Intel nodes. Semi-direct MP2 energy and gradient calculations were performed on chemical systems ranging from 824 to 1626 basis functions using the cc-pVDZ basis set. Wall times for semi-direct MP2 energies were reduced by as much as 36% using two nodes and 44% using three nodes compared to no oversubscription. Total energy consumed by the CPU and DRAM was also reduced by as much as 12% using two nodes and as much as 20% using three nodes when oversubscribing. MP2 gradient wall times improved by as much as 16% using two nodes and 18% using three nodes compared to execution at the 1n level; however, energy savings were insignificant. Intel performance-counter data show a strong correlation between total wall time saved and less time spent in the idle state, indicating a more efficient use of the processors when oversubscribing. © 2019 Wiley Periodicals, Inc.

Download full-text PDF

Source
http://dx.doi.org/10.1002/jcc.25866DOI Listing

Publication Analysis

Top Keywords

semi-direct mp2
12
three nodes
12
nodes
8
wall times
8
nodes compared
8
improving efficiency
4
semi-direct
4
efficiency semi-direct
4
semi-direct møller-plesset
4
møller-plesset second-order
4

Similar Publications

The purpose of this work is to evaluate the efficacy of oversubscription, at the 1n, 2n, and 3n levels for n physical cores, on semi-direct MP2 methods within NWChem when using two and three Intel nodes. Semi-direct MP2 energy and gradient calculations were performed on chemical systems ranging from 824 to 1626 basis functions using the cc-pVDZ basis set. Wall times for semi-direct MP2 energies were reduced by as much as 36% using two nodes and 44% using three nodes compared to no oversubscription.

View Article and Find Full Text PDF

In this work, the effect of oversubscription is evaluated, via calling 2n, 3n, or 4n processes for n physical cores, on semi-direct MP2 energy and gradient calculations and RI-MP2 energy calculations with the cc-pVTZ basis using NWChem. Results indicate that on both Intel and AMD platforms, oversubscription reduces total time to solution on average for semi-direct MP2 energy calculations by 25-45% and reduces total energy consumed by the CPU and DRAM on average by 10-15% on the Intel platform. Semi-direct gradient time to solution is shortened on average by 8-15% and energy consumption is decreased by 5-10%.

View Article and Find Full Text PDF

We present a new algorithm for analytical gradient evaluation in resolution-of-the-identity second-order Møller-Plesset perturbation theory (RI-MP2) and thoroughly assess its computational performance and chemical accuracy. This algorithm addresses the potential I/O bottlenecks associated with disk-based storage and access of the RI-MP2 t-amplitudes by utilizing a semi-direct batching approach and yields computational speed-ups of approximately 2-3 over the best conventional MP2 analytical gradient algorithms. In addition, we attempt to provide a straightforward guide to performing reliable and cost-efficient geometry optimizations at the RI-MP2 level of theory.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!