Modern language models such as Bidirectional Encoder Representations from Transformers (BERT) have revolutionized natural language processing (NLP) tasks but are computationally intensive, which limits their deployment on edge devices. This paper presents an energy-efficient accelerator design tailored for encoder-based language models, enabling their integration into mobile and edge computing environments. The design, AxLaM, is a data-flow-aware hardware accelerator for language models inspired by Simba. It uses approximate fixed-point POSIT-based multipliers and high bandwidth memory (HBM) to achieve significant improvements in computational efficiency, power consumption, area and latency compared to the hardware-realized scalable accelerator Simba. Compared to Simba, AxLaM achieves a ninefold energy reduction, a 58% area reduction and a 1.2 times improvement in latency, making it suitable for deployment in edge devices. The energy efficiency of AxLaM is 1.8 TOPS/W, 65% higher than that of FACT, which requires pre-processing of the language model before implementing it on the hardware. This article is part of the theme issue 'Emerging technologies for future secure computing platforms'.
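To make the relative figures in the abstract easier to compare, the short sketch below converts them into plain ratios. It is only an illustration under stated assumptions: "65% higher" is read as a ratio of 1.65, "ninefold reduction" as one ninth of the baseline, and "1.2 times improved latency" as the Simba latency divided by 1.2. The helper names are hypothetical, and the implied FACT efficiency is derived arithmetic, not a number reported in the abstract.

```python
def implied_baseline(value: float, percent_higher: float) -> float:
    """Return b such that value = b * (1 + percent_higher / 100)."""
    return value / (1.0 + percent_higher / 100.0)


def remaining_after_fold_reduction(fold: float) -> float:
    """Fraction of the baseline left after an N-fold reduction."""
    return 1.0 / fold


def remaining_after_percent_reduction(percent: float) -> float:
    """Fraction of the baseline left after a P% reduction."""
    return 1.0 - percent / 100.0


# Figures quoted in the abstract.
axlam_tops_per_w = 1.8

# Assumption: "65% higher than FACT" means AxLaM = 1.65 x FACT.
fact_tops_per_w = implied_baseline(axlam_tops_per_w, 65.0)

# Quantities relative to Simba (Simba = 1.0), under the readings stated above.
energy_vs_simba = remaining_after_fold_reduction(9.0)
area_vs_simba = remaining_after_percent_reduction(58.0)
latency_vs_simba = remaining_after_fold_reduction(1.2)

print(f"Implied FACT efficiency: {fact_tops_per_w:.2f} TOPS/W")
print(f"Energy relative to Simba: {energy_vs_simba:.3f}")
print(f"Area relative to Simba: {area_vs_simba:.2f}")
print(f"Latency relative to Simba: {latency_vs_simba:.3f}")
```

Under these readings, AxLaM uses roughly one ninth of Simba's energy and 42% of its area, and the quoted efficiency gap would place FACT at a little over 1 TOPS/W.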


Source
http://dx.doi.org/10.1098/rsta.2023.0395


