AI Article Synopsis

  • This study introduces a compact deep learning architecture and specialized hardware to enhance the reconstruction of blood flow index (BFi) in diffuse correlation spectroscopy (DCS) using autocorrelation functions (ACFs) for training.
  • The proposed lightweight deep learning model displays significant improvements in mean squared error (MSE) compared to traditional convolutional neural networks (CNN), while also simplifying computations through feature extraction, which is optimized for hardware implementation.
  • The developed system allows for real-time, parallel processing of autocorrelation functions on FPGA, offering an integrated on-chip solution for converting photon data into BFi and coherence factor β, surpassing conventional post-processing methods.

Article Abstract

This study proposes a compact deep learning (DL) architecture and a highly parallelized computing hardware platform to reconstruct the blood flow index (BFi) in diffuse correlation spectroscopy (DCS). We leveraged a rigorous analytical model to generate autocorrelation functions (ACFs) to train the DL network. We assessed the accuracy of the proposed DL using simulated and milk phantom data. Compared to convolutional neural networks (CNN), our lightweight DL architecture achieves 66.7% and 18.5% improvement in MSE for BFi and the coherence factor β, using synthetic data evaluation. The accuracy of rBFi over different algorithms was also investigated. We further simplified the DL computing primitives using subtraction for feature extraction, considering further hardware implementation. We extensively explored computing parallelism and fixed-point quantization within the DL architecture. With the DL model's compact size, we employed unrolling and pipelining optimizations for computation-intensive for-loops in the DL model while storing all learned parameters in on-chip BRAMs. We also achieved pixel-wise parallelism, enabling simultaneous, real-time processing of 10 and 15 autocorrelation functions on Zynq-7000 and Zynq-UltraScale+ field programmable gate array (FPGA), respectively. Unlike existing FPGA accelerators that produce BFi and the β from autocorrelation functions on standalone hardware, our approach is an encapsulated, end-to-end on-chip conversion process from intensity photon data to the temporal intensity ACF and subsequently reconstructing BFi and β. This hardware platform achieves an on-chip solution to replace post-processing and miniaturize modern DCS systems that use single-photon cameras. We also comprehensively compared the computational efficiency of our FPGA accelerator to CPU and GPU solutions.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.cmpb.2024.108471DOI Listing

Publication Analysis

Top Keywords

autocorrelation functions
12
deep learning
8
learning architecture
8
diffuse correlation
8
correlation spectroscopy
8
hardware platform
8
hardware
5
high-performance deep
4
architecture
4
architecture hardware
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!