State-of-the-art distributed-memory computer clusters contain multicore CPUs with 16 and more cores. The second generation of the Intel Xeon Phi many-core processor has more than 60 cores with 16 GB of high-performance on-chip memory. We contrast the performance of the second-generation Intel Xeon Phi, code-named Knights Landing (KNL), with 68 computational cores to the latest multicore CPU Intel Skylake with 18 cores. A special-purpose code solving a system of nonlinear reaction-diffusion partial differential equations with several thousands of point sources modeled mathematically by Dirac delta distributions serves as realistic test bed. The system is discretized in space by the finite volume method and advanced by fully implicit time-stepping, with a matrix-free implementation that allows the complex model to have an extremely small memory footprint. The sample application is a seven variable model of calcium-induced calcium release (CICR) that models the interplay between electrical excitation, calcium signaling, and mechanical contraction in a heart cell. The results demonstrate that excellent parallel scalability is possible on both hardware platforms, but that modern multicore CPUs outperform the specialized many-core Intel Xeon Phi KNL architecture for a large class of problems such as systems of parabolic partial differential equations.

Download full-text PDF

Source
http://dx.doi.org/10.1002/cnm.3244DOI Listing

Publication Analysis

Top Keywords

intel xeon
12
xeon phi
12
multicore cpus
8
partial differential
8
differential equations
8
challenges opportunities
4
opportunities simulation
4
simulation calcium
4
calcium waves
4
waves modern
4

Similar Publications

Processing large collections of earth observation (EO) time-series, often petabyte-sized, such as NASA's Landsat and ESA's Sentinel missions, can be computationally prohibitive and costly. Despite their name, even the Analysis Ready Data (ARD) versions of such collections can rarely be used as direct input for modeling because of cloud presence and/or prohibitive storage size. Existing solutions for readily using these data are not openly available, are poor in performance, or lack flexibility.

View Article and Find Full Text PDF

Superconductivity can be considered among the most exciting discoveries in material science of the 20th century. However, the hard conditions for the synthesis and the difficult characterization, make the statement of new high critical temperature ( ) complex from the experimental viewpoint and have recently led to several hot controversies in the literature. In this panorama, theory has become a trustworthy diagnosis.

View Article and Find Full Text PDF

Crystal Polymorph Search in the Ensemble via a Deposition/Sublimation Alchemical Path.

Cryst Growth Des

April 2024

Department of Biomedical Engineering, University of Iowa, 103 South Capitol Street, 5601 Seamans Center for the Engineering Arts and Sciences, Iowa City, Iowa 52242, United States.

The formulation of active pharmaceutical ingredients involves discovering stable crystal packing arrangements or polymorphs, each of which has distinct pharmaceutically relevant properties. Traditional experimental screening techniques utilizing various conditions are commonly supplemented with in silico crystal structure prediction (CSP) to inform the crystallization process and mitigate risk. Predictions are often based on advanced classical force fields or quantum mechanical calculations that model the crystal potential energy landscape but do not fully incorporate temperature, pressure, or solution conditions during the search procedure.

View Article and Find Full Text PDF

The cooperative, connected, and automated mobility (CCAM) infrastructure plays a key role in understanding and enhancing the environmental perception of autonomous vehicles (AVs) driving in complex urban settings. However, the deployment of CCAM infrastructure necessitates the efficient selection of the computational processing layer and deployment of machine learning (ML) and deep learning (DL) models to achieve greater performance of AVs in complex urban environments. In this paper, we propose a computational framework and analyze the effectiveness of a custom-trained DL model (YOLOv8) when deployed in diverse devices and settings at the vehicle-edge-cloud-layered architecture.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!