Supervised classifiers are highly dependent on abundant labeled training data. Alternatives for addressing the lack of labeled data include: labeling data (but this is costly and time consuming); training classifiers with abundant data from another domain (however, the classification accuracy usually decreases as the distance between domains increases); or complementing the limited labeled data with abundant unlabeled data from the same domain and learning semi-supervised classifiers (but the unlabeled data can mislead the classifier). A better alternative is to use both the abundant labeled data from a source domain, the limited labeled data and optionally the unlabeled data from the target domain to train classifiers in a domain adaptation setting. We propose two such classifiers, based on logistic regression, and evaluate them for the task of splice site prediction-a difficult and essential step in gene prediction. Our classifiers achieved high accuracy, with highest areas under the precision-recall curve between 50.83% and 82.61%.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4894847PMC
http://dx.doi.org/10.1109/TNB.2016.2522400DOI Listing

Publication Analysis

Top Keywords

labeled data
16
unlabeled data
12
data
10
domain adaptation
8
logistic regression
8
task splice
8
splice site
8
abundant labeled
8
data domain
8
limited labeled
8

Similar Publications

A U-Net based partial convolutional time-domain separation model to identify motor units from surface electromyographic signals in real time.

J Electromyogr Kinesiol

December 2024

School of Information Science and Technology, Dalian Maritime University, Linghai Road 1, Dalian, Liaoning Province 116026, China. Electronic address:

This study proposed a U-Net based partial convolutional time-domain model for a real-time high-density surface electromyography (HD-sEMG) decomposition. The model combines U-Net and a separation block containing partial convolution, aiming to efficiently identify motor units (MUs) without preprocessing. The proposed U-Net based network was trained by the HD-sEMG signals with innervation pulse trains (IPTs) labels, and the results are compared between different step sizes, noises, and model structures under the sliding time window with 120 sampling points.

View Article and Find Full Text PDF

Background: Exposure of critically ill patients to antibiotics lead to intestinal dysbiosis, which often manifests as antibiotic-associated diarrhoea. Faecal microbiota transplantation restores gut microbiota and may lead to faster resolution of diarrhoea.

Methods: Into this prospective, multi-centre, randomized controlled trial we will enrol 36 critically ill patients with antibiotic-associated diarrhoea.

View Article and Find Full Text PDF

Echocardiographic findings of patients with transthyretin amyloid cardiomyopathy.

J Echocardiogr

December 2024

Department of Cardiovascular Medicine, Graduate School of Medical Sciences, Kumamoto University, Kumamoto, Japan.

Transthyretin amyloid cardiomyopathy (ATTR-CM) is becoming increasingly recognized with the aging population, advancements in understanding of disease pathobiology and the potential benefits of emerging therapies. Bone scintigraphy, including Tc-labeled pyrophosphate scintigraphy, is currently considered the first-line modality for identifying ATTR-CM. Therefore, it is important to increase the preset probability using inexpensive and simple tests including echocardiography.

View Article and Find Full Text PDF

Purpose: Nanoparticles are highly efficient vectors for ferrying contrast agents across cell membranes, enabling ultra-sensitive in vivo tracking of single cells with positron emission tomography (PET). However, this approach must be fully characterized and understood before it can be reliably implemented for routine applications.

Methods: We developed a Langmuir adsorption model that accurately describes the process of labeling mesoporous silica nanoparticles (MSNP) with Ga.

View Article and Find Full Text PDF

Evaluating Medical Image Segmentation Models Using Augmentation.

Tomography

December 2024

Clinic for Radiology and Nuclear Medicine, University Hospital, Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany.

Background: Medical imagesegmentation is an essential step in both clinical and research applications, and automated segmentation models-such as TotalSegmentator-have become ubiquitous. However, robust methods for validating the accuracy of these models remain limited, and manual inspection is often necessary before the segmentation masks produced by these models can be used.

Methods: To address this gap, we have developed a novel validation framework for segmentation models, leveraging data augmentation to assess model consistency.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!