CHEF: A Framework for Deploying Heterogeneous Models on Clusters with Heterogeneous FPGAs.

IEEE Trans Comput Aided Des Integr Circuits Syst

Department of Electrical and Computer Engineering, University of Pittsburgh, Pittsburgh, PA 15261, USA.

Published: November 2024

AI Article Synopsis

  • DNNs are transitioning from simple models (single-modality, single-task) to more complex ones (multi-modality, multi-task), which require advanced hardware solutions to handle their varying layers and complex dependencies.
  • Heterogeneous systems are being developed, integrating different accelerators to reduce latency, with FPGAs being a key component due to their high density and configurability for machine-learning tasks.
  • The authors introduce CHEF, a framework that efficiently implements these complex models on heterogeneous FPGA clusters, featuring two main approaches (CHEF-A2F and CHEF-M2A) that significantly reduce latency and search times compared to previous methods.

Article Abstract

DNNs are rapidly evolving from streamlined single-modality single-task (SMST) models to multi-modality multi-task (MMMT) models, which have large variations across layers and complex data dependencies among layers. To support such models, hardware systems have also evolved to be heterogeneous. This heterogeneity stems from the prevailing trend of integrating diverse accelerators into one system for lower latency. FPGAs offer high computation density and communication bandwidth, can be configured with different accelerator designs, and are widely used for various machine-learning applications. However, scaling from SMST to MMMT on heterogeneous FPGAs is challenging, since MMMT models have much larger layer variations, a massive number of layers, and complex data dependencies among different backbones. Previous mapping algorithms are either inefficient or over-simplified, which makes them impractical in general scenarios. In this work, we propose CHEF to enable efficient implementation of MMMT models in realistic heterogeneous FPGA clusters, i.e., deploying heterogeneous accelerators on heterogeneous FPGAs (A2F) and mapping the heterogeneous DNNs onto the deployed heterogeneous accelerators (M2A). We propose CHEF-A2F, a two-stage accelerators-to-FPGAs deployment approach that co-optimizes hardware deployment and accelerator mapping. In addition, we propose CHEF-M2A, which supports more general and practical cases than previous mapping algorithms. To the best of our knowledge, this is the first attempt to implement MMMT models in real heterogeneous FPGA clusters. Experimental results show that the latency obtained with CHEF is near-optimal, while the search time is 10,000× shorter than that of an exhaustive search for the optimal solution.
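
To make the mapping problem concrete, the sketch below assigns the layers of a small multi-branch model to two accelerators and compares a greedy earliest-finish heuristic against exhaustive search. It is not the CHEF-M2A algorithm, which the abstract does not describe in detail. The layer names, accelerator names, latency figures, and the fixed transfer cost are all hypothetical values chosen for illustration.

```python
from itertools import product

# Hypothetical toy model: layer -> predecessors, listed in topological order.
LAYERS = {
    "conv_a": [],
    "conv_b": [],
    "fuse": ["conv_a", "conv_b"],
    "head": ["fuse"],
}

# Hypothetical per-layer latency (ms) on each accelerator.
LATENCY = {
    "conv_a": {"acc0": 4.0, "acc1": 6.0},
    "conv_b": {"acc0": 5.0, "acc1": 3.0},
    "fuse": {"acc0": 2.0, "acc1": 2.5},
    "head": {"acc0": 1.0, "acc1": 1.5},
}

# Assumed fixed cost (ms) when producer and consumer sit on different accelerators.
TRANSFER = 0.5


def ready_time(layer, acc, mapping, finish):
    """Earliest time all inputs of `layer` are available on accelerator `acc`."""
    return max(
        (finish[p] + (TRANSFER if mapping[p] != acc else 0.0) for p in LAYERS[layer]),
        default=0.0,
    )


def makespan(mapping):
    """End-to-end latency of a mapping; each accelerator runs its layers in listed order."""
    finish, free_at = {}, {}
    for layer in LAYERS:
        acc = mapping[layer]
        start = max(ready_time(layer, acc, mapping, finish), free_at.get(acc, 0.0))
        finish[layer] = start + LATENCY[layer][acc]
        free_at[acc] = finish[layer]
    return max(finish.values())


def greedy_mapping():
    """Place each layer on the accelerator that would finish it earliest."""
    mapping, finish, free_at = {}, {}, {}
    for layer in LAYERS:
        best_end, best_acc = None, None
        for acc, latency in LATENCY[layer].items():
            start = max(ready_time(layer, acc, mapping, finish), free_at.get(acc, 0.0))
            end = start + latency
            if best_end is None or end < best_end:
                best_end, best_acc = end, acc
        mapping[layer], finish[layer] = best_acc, best_end
        free_at[best_acc] = best_end
    return mapping


def exhaustive_mapping():
    """Try every layer-to-accelerator assignment; cost grows as accelerators ** layers."""
    names = list(LAYERS)
    accs = ["acc0", "acc1"]
    candidates = (dict(zip(names, combo)) for combo in product(accs, repeat=len(names)))
    return min(candidates, key=makespan)


if __name__ == "__main__":
    print("greedy:", makespan(greedy_mapping()))
    print("exhaustive:", makespan(exhaustive_mapping()))
```

With four layers and two accelerators the exhaustive search evaluates only 16 candidates, but the count grows exponentially with the number of layers, which is why heuristic search matters for models with a massive number of layers.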

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11654640
DOI: http://dx.doi.org/10.1109/tcad.2024.3438994

Publication Analysis

Top Keywords

heterogeneous fpgas: 12
heterogeneous: 11
deploying heterogeneous: 8
layers complex: 8
complex data: 8
previous mapping: 8
mapping algorithms: 8
mmmt models: 8
heterogeneous fpga: 8
fpga clusters: 8
