Graph Convolutional Neural Network (GCNN) is a popular class of deep learning (DL) models in material science to predict material properties from the graph representation of molecular structures. Training an accurate and comprehensive GCNN surrogate for molecular design requires large-scale graph datasets and is usually a time-consuming process. Recent advances in GPUs and distributed computing open a path to reduce the computational cost for GCNN training effectively. However, efficient utilization of high performance computing (HPC) resources for training requires simultaneously optimizing large-scale data management and scalable stochastic batched optimization techniques. In this work, we focus on building GCNN models on HPC systems to predict material properties of millions of molecules. We use HydraGNN, our in-house library for large-scale GCNN training, leveraging distributed data parallelism in PyTorch. We use ADIOS, a high-performance data management framework for efficient storage and reading of large molecular graph data. We perform parallel training on two open-source large-scale graph datasets to build a GCNN predictor for an important quantum property known as the HOMO-LUMO gap. We measure the scalability, accuracy, and convergence of our approach on two DOE supercomputers: the Summit supercomputer at the Oak Ridge Leadership Computing Facility (OLCF) and the Perlmutter system at the National Energy Research Scientific Computing Center (NERSC). We present our experimental results with HydraGNN showing (i) reduction of data loading time up to 4.2 times compared with a conventional method and (ii) linear scaling performance for training up to 1024 GPUs on both Summit and Perlmutter.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575242 | PMC |
http://dx.doi.org/10.1186/s13321-022-00652-1 | DOI Listing |
Sci Rep
December 2024
Department of Civil Engineering, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland.
Deep learning models are widely used for traffic forecasting on freeways due to their ability to learn complex temporal and spatial relationships. In particular, graph neural networks, which integrate graph theory into deep learning, have become popular for modeling traffic sensor networks. However, traditional graph convolutional networks (GCNs) face limitations in capturing long-range spatial correlations, which can hinder accurate long-term predictions.
View Article and Find Full Text PDFSci Rep
December 2024
Department of Computing and Information Systems, Sunway University, 47500, Petaling Jaya, Selangor Darul Ehsan, Malaysia.
Urban mobility prediction is crucial for optimizing resource allocation, managing transportation systems, and planning urban development. We propose a novel framework, GeoTemporal LSTM (GT-LSTM), designed to address the intricate spatiotemporal dynamics of urban environments. GT-LSTM integrates temporal dependencies with geographic information through a multi-modal approach that combines attention mechanisms and Recurrent Neural Networks (RNNs).
View Article and Find Full Text PDFFront Microbiol
December 2024
College of Life Sciences and Oceanography, Shenzhen University, Shenzhen, China.
In the contemporary field of life sciences, researchers have gradually recognized the critical role of microbes in maintaining human health. However, traditional biological experimental methods for validating the association between microbes and diseases are both time-consuming and costly. Therefore, developing effective computational methods to predict potential associations between microbes and diseases is an important and urgent task.
View Article and Find Full Text PDFSci Rep
December 2024
School of Marxism, China University of Political Science and Law (CUPL), Beijing, 100091, China.
To improve students' understanding of physical education teaching concepts and help teachers analyze students' cognitive patterns, the study proposes an association learning-based method for understanding physical education teaching concepts using deep learning algorithms, which extracts image features related to teaching concepts using convolutional neural networks. Moreover, a neurocognitive diagnostic model based on hypergraph convolution is constructed to mine the data of students' long-term learning sequences and identify students' cognitive outcomes. The findings revealed that the highest accuracy of the association graph convolutional neural network was 0.
View Article and Find Full Text PDFSci Rep
December 2024
Computer Science Department, Faculty of Computers and Information, South Valley University, Qena, 83523, Egypt.
Enhanced technologies of the future are gradually improving the digital landscape. Internet of Things (IoT) technology is an advanced technique that is quickly increasing owing to the development of a network of organized online devices. In today's digital era, the IoT is considered one of the most robust technologies.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!