Accelerating Machine Learning Inference with GPUs in ProtoDUNE Data Processing.

Comput Softw Big Sci

Fermi National Accelerator Laboratory, Kirk Road and Pine Streets, Batavia, 60510 IL USA.

Published: October 2023

We study the performance of a cloud-based GPU-accelerated inference server to speed up event reconstruction in neutrino data batch jobs. Using detector data from the ProtoDUNE experiment and employing the standard DUNE grid job submission tools, we attempt to reprocess the data by running several thousand concurrent grid jobs, a rate we expect to be typical of current and future neutrino physics experiments. We process most of the dataset with the GPU version of our processing algorithm and the remainder with the CPU version for timing comparisons. We find that a 100-GPU cloud-based server easily meets the processing demand, and that the GPU version of the event processing algorithm is twice as fast as the CPU version when compared against the newest CPUs in our sample. The amount of data transferred to the inference server during the GPU runs can, however, overwhelm even the highest-bandwidth network switches unless care is taken to observe network facility limits or to distribute the jobs across multiple sites. We discuss the lessons learned from this processing campaign and several avenues for future improvements.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611601
DOI: http://dx.doi.org/10.1007/s41781-023-00101-0
