Existing cross-lingual summarization (CLS) datasets consist of inconsistent sample quality and low scale. To address these problems, we propose a method that jointly supervises quality and scale to build CLS datasets. In terms of quality supervision, the method adopts a multi-strategy filtering algorithm to remove low-quality samples of monolingual summarization (MS) from the perspectives of character and semantics, thereby improving the quality of the MS dataset. In terms of scale supervision, the method adopts a text augmentation algorithm based on the pretrained model to increase the size of CLS datasets with quality assurance. This method was used to build an English-Chinese CLS dataset and evaluate it with a reasonable data quality evaluation framework. The evaluation results show that the dataset is of good quality and large size. These outcomes show that the proposed method may comprehensively improve quality and scale, thereby resulting in a high-quality and large-scale CLS dataset at a lower cost.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280405 | PMC |
http://dx.doi.org/10.7717/peerj-cs.1299 | DOI Listing |
Materials (Basel)
November 2024
Department of Computer Science and Information Technologies, University of Kasdi Merbah, Ouargla 30000, Algeria.
The accurate and efficient classification of steel surface defects is critical for ensuring product quality and minimizing production costs. This paper proposes a novel method based on wavelet transform and texture descriptors for the robust and precise classification of steel surface defects. By leveraging the multiscale analysis capabilities of wavelet transforms, our method extracts both broad and fine-grained textural features.
View Article and Find Full Text PDFInt Breastfeed J
October 2024
Department of Infectious Diseases, Copenhagen University Hospital, Kettegaard Alle 30, Hvidovre, Denmark.
We propose wake-sleep consolidated learning (WSCL), a learning strategy leveraging complementary learning system (CLS) theory and the wake-sleep phases of the human brain to improve the performance of deep neural networks (DNNs) for visual classification tasks in continual learning (CL) settings. Our method learns continually via the synchronization between distinct wake and sleep phases. During the wake phase, the model is exposed to sensory input and adapts its representations, ensuring stability through a dynamic parameter freezing mechanism and storing episodic memories in a short-term temporary memory (similar to what happens in the hippocampus).
View Article and Find Full Text PDFSensors (Basel)
August 2024
Faculty of Artificial Intelligence in Education, Central China Normal University, Wuhan 430079, China.
In recent years, significant progress has been made in facial expression recognition methods. However, tasks related to facial expression recognition in real environments still require further research. This paper proposes a tri-cross-attention transformer with a multi-feature fusion network (TriCAFFNet) to improve facial expression recognition performance under challenging conditions.
View Article and Find Full Text PDFJ Neural Eng
August 2024
College of electronic and optical engineering & college of flexible electronics (future technology), Nanjing University of Posts and Telecommunications, Jiangsu 210023, People's Republic of China.
Electroencephalogram (EEG) signals offer invaluable insights into the complexities of emotion generation within the brain. Yet, the variability in EEG signals across individuals presents a formidable obstacle for empirical implementations. Our research addresses these challenges innovatively, focusing on the commonalities within distinct subjects' EEG data.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!