A major bottleneck preventing the extension of deep learning systems to new domains is the prohibitive cost of acquiring sufficient training labels. Alternatives such as weak supervision, active learning, and fine-tuning of pretrained models reduce this burden but require substantial human input to select a highly informative subset of instances or to curate labeling functions. REGAL (Rule-Enhanced Generative Active Learning) is an improved framework for weakly supervised text classification that performs active learning over labeling functions rather than individual instances. REGAL interactively creates high-quality labeling patterns from raw text, enabling a single annotator to accurately label an entire dataset after initialization with three keywords for each class. Experiments demonstrate that REGAL extracts up to 3 times as many high-accuracy labeling functions from text as current state-of-the-art methods for interactive weak supervision, enabling REGAL to dramatically reduce the annotation burden of writing labeling functions for weak supervision. Statistical analysis reveals REGAL performs equal or significantly better than interactive weak supervision for five of six commonly used natural language processing (NLP) baseline datasets.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9281613PMC
http://dx.doi.org/10.3390/ai3010013DOI Listing

Publication Analysis

Top Keywords

weak supervision
20
active learning
16
labeling functions
16
interactive weak
8
learning
5
weak
5
supervision
5
labeling
5
regal
5
rule-enhanced active
4

Similar Publications

This study discusses disseminated intravascular coagulation (DIC) associated with solid cancers and various vascular abnormalities, both of which generally exhibit chronic DIC patterns. Solid cancers are among the most significant underlying diseases that induce DIC. However, the severity, bleeding tendency, and progression of DIC vary considerably depending on the type and stage of the cancer, making generalization difficult.

View Article and Find Full Text PDF

Background: Health system and environmental factors play a significant role in achieving the World Health Organization (WHO) End Tuberculosis (TB) targets. However, quantitative measures are scarce or non-existent at a global level. We aimed to measure the progress made towards meeting the global End TB milestones from 2015 to 2020 and identify health system and environmental factors contributing to the success.

View Article and Find Full Text PDF

Digital transformation has significantly impacted public procurement, improving operational efficiency, transparency, and competition. This transformation has allowed the automation of data analysis and oversight in public administration. Public procurement involves various stages and generates a multitude of documents.

View Article and Find Full Text PDF

Background: Transcatheter Aortic Valve Implantation (TAVI) procedures are rapidly expanding, necessitating a more extensive stratification of patients with aortic stenosis. Especially in the high-risk group, some patients fail to derive optimal or any benefits from TAVI, leading to the risk of futile interventions. Despite consensus among several experts regarding the importance of recognizing and anticipating such interventions, the definition, and predictive criteria for futility in TAVI remain ambiguous.

View Article and Find Full Text PDF

Cross-programmatic inefficiencies are duplications or misalignments that arise from undue fragmentation of health systems by vertical health programs. Identifying and addressing the root causes of cross-programmatic inefficiencies in a health system can ensure more efficient use of resources to make progress toward Universal Health Coverage. This paper examines the root causes of cross-programmatic inefficiencies related to governance and financing in the state health system of Anambra in southeast Nigeria.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!