Parameter efficient transfer learning (PETL) methods provide an efficient alternative for fine-tuning. However, typical PETL methods inject the same structures to all Pre-trained Language Model (PLM) layers and only use the final hidden states for downstream tasks, regardless of the knowledge diversity across PLM layers. Additionally, the backpropagation path of existing PETL methods still passes through the frozen PLM during training, which is computational and memory inefficient. In this paper, we propose FLAT, a generic PETL method that explicitly and individually combines knowledge across all PLM layers based on the tokens to perform a better transferring. FLAT considers the backbone PLM as a feature extractor and combines the features in a side-network, hence the backpropagation does not involve the PLM, which results in much less memory requirement than previous methods. The results on the GLUE benchmark show that FLAT outperforms other tuning techniques in the low-resource scenarios and achieves on-par performance in the high-resource scenarios with only 0.53% trainable parameters per task and 3.2× less GPU memory usagewith BERT. Besides, further ablation study is conducted to reveal that the proposed fusion layer effectively combines knowledge from PLM and helps the classifier to exploit the PLM knowledge to downstream tasks. We will release our code for better reproducibility.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.neunet.2024.106631 | DOI Listing |
Genes (Basel)
December 2024
College of Plant Protection, Henan Agricultural University, Zhengzhou 450002, China.
: Xi Junecry (), a perennial herb of the Araceae family, is indigenous to Xinxian County, Henan Province, China, and is regarded as a premium variety among similar medicinal materials. However, the lack of comprehensive genetic information on Xi Junecry germplasm resources has constrained the cultivation and identification of high-quality varieties. : In this study, six chloroplast genomes of Xi Junecry were assembled and annotated using high-throughput sequencing.
View Article and Find Full Text PDFSci Rep
November 2024
Jiangsu Academy of Forestry, Nanjing, China.
Int J Mol Sci
August 2024
Marine Biology Institute, Shantou University, Shantou 515063, China.
var. , a prevalent seaweed along the Chinese coast, has economic and ecological significance. However, systematic positions within and among the three orders of Phaeophyceae, Fucales, Ectocarpales, and Laminariales are in debate.
View Article and Find Full Text PDFNeural Netw
November 2024
School of Computer Science and Technology, Nanjing University of Aeronautic and Astronautics, Nanjing, 211106, China. Electronic address:
Parameter efficient transfer learning (PETL) methods provide an efficient alternative for fine-tuning. However, typical PETL methods inject the same structures to all Pre-trained Language Model (PLM) layers and only use the final hidden states for downstream tasks, regardless of the knowledge diversity across PLM layers. Additionally, the backpropagation path of existing PETL methods still passes through the frozen PLM during training, which is computational and memory inefficient.
View Article and Find Full Text PDFInt J Mol Sci
July 2024
College of Life Sciences and Medicine, Zhejiang Sci-Tech University, Hangzhou 310018, China.
(Vitaceae) is known for its ornamental, medicinal, and ecological significance. However, the structural and variational characteristics of the chloroplast genome and their impact on phylogenetic relationships remain underexplored. This study utilized bioinformatics methods to assemble and annotate the chloroplast genomes of 10 species and compare them with five previously sequenced species.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!