TransCL: Transformer Makes Strong and Flexible Compressive Learning.

IEEE Trans Pattern Anal Mach Intell

Published: April 2023

Compressive learning (CL) is an emerging framework that integrates signal acquisition via compressed sensing (CS) with machine learning, so that inference tasks run directly on a small number of measurements. It is a promising alternative to classical image-domain methods and offers substantial savings in memory and computation. However, previous CL methods are limited to a fixed CS ratio, which lacks flexibility, and to MNIST/CIFAR-like datasets; they do not scale to complex real-world high-resolution (HR) data or vision tasks. In this article, a novel transformer-based compressive learning framework for large-scale images with arbitrary CS ratios, dubbed TransCL, is proposed. Specifically, TransCL first adopts learnable block-based compressed sensing and proposes a flexible linear projection strategy, which together allow CL to be performed on large-scale images efficiently, block by block, at arbitrary CS ratios. Then, treating the CS measurements from all blocks as a sequence, a pure transformer-based backbone is deployed to perform vision tasks with various task-oriented heads. Our analysis shows that TransCL exhibits strong resistance to interference and robust adaptability to arbitrary CS ratios. Extensive experiments on complex HR data demonstrate that TransCL achieves state-of-the-art performance in image classification and semantic segmentation tasks. In particular, TransCL with a CS ratio of 10% obtains almost the same performance as when operating directly on the original data, and it still obtains satisfactory performance even at an extremely low CS ratio of 1%. The source code of TransCL is available at https://github.com/MC-E/TransCL/.
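The block-by-block measurement step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not say how the flexible linear projection is realized, so the sketch assumes one simple option, namely keeping the first m rows of a single shared sampling matrix, with m set by the CS ratio. The names `image_to_blocks`, `block_cs_measure`, and `phi_full` are hypothetical, and a random matrix stands in for the learned one.

```python
import numpy as np


def image_to_blocks(img, B):
    """Split an (H, W) image into non-overlapping B x B blocks, one flattened block per row."""
    H, W = img.shape
    assert H % B == 0 and W % B == 0, "image size must be divisible by the block size"
    blocks = img.reshape(H // B, B, W // B, B).transpose(0, 2, 1, 3)
    return blocks.reshape(-1, B * B)  # shape: (num_blocks, B*B)


def block_cs_measure(img, phi_full, ratio, B):
    """Measure every block with the first m rows of phi_full, where m ~ ratio * B*B.

    Assumption for illustration only: row truncation of one shared matrix is
    used to support arbitrary CS ratios.
    """
    n = B * B
    m = max(1, int(round(ratio * n)))  # number of measurements per block
    phi = phi_full[:m]                 # (m, n) sampling sub-matrix for this ratio
    x = image_to_blocks(img, B)        # (num_blocks, n)
    return x @ phi.T                   # (num_blocks, m): a sequence of measurement vectors


rng = np.random.default_rng(0)
B = 16
# Random stand-in for a learned sampling matrix (hypothetical).
phi_full = rng.standard_normal((B * B, B * B)) / B
img = rng.random((64, 64))

for ratio in (0.01, 0.10, 0.25):
    y = block_cs_measure(img, phi_full, ratio, B)
    print(f"CS ratio {ratio:.2f}: {y.shape[0]} blocks, {y.shape[1]} measurements each")
```

Each row of the result is the measurement vector of one block, so the rows form the sequence that a transformer backbone would consume. Because the vector length changes with the CS ratio, some mapping to a fixed token width is needed before the backbone; how TransCL does this is not stated in the abstract.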


Source
http://dx.doi.org/10.1109/TPAMI.2022.3194001


