Data filtering based on removing non-informative features, with unchanged signal between compared experimental conditions, can significantly increase sensitivity of methods used to detect differentially expressed genes or other molecular components measured in high-throughput biological experiments. Criteria for data filtering can be stated on the basis of averages or variances of signal levels across samples. The crucial parts of feature filtering are selection of filter type and cut-off threshold, which are specific to the particular dataset. In this paper, we present an algorithm and a stand-alone application, GaMRed, for adaptive filtering insignificant features in high-throughput data, based on Gaussian mixture decomposition. We have tested the performance of our algorithm using datasets from three different high-throughput biological experiments. We estimated the number of differentially expressed features after applying multiple testing correction and performed functional analysis of obtained features using Gene Ontology terms. Also, we checked if the control of false discovery rate and family-wise error rate after applying feature filtering remains at appropriate level. GaMRed is fast, automatic, and does not require expert knowledge in parameter tuning. The algorithm increases sensitivity of methods used to find differentially expressed features and biological validity of the findings. The program can be downloaded from: http://zaed.aei.polsl.pl/index.php/pl/oprogramowanie-zaed.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TCBB.2018.2858825 | DOI Listing |
Mol Diagn Ther
January 2025
Istituto Europeo di Oncologia, IRCCS, Via Adamello 16, 20139, Milan, Italy.
Background: Predicting response to targeted cancer therapies increasingly relies on both simple and complex genetic biomarkers. Comprehensive genomic profiling using high-throughput assays must be evaluated for reproducibility and accuracy compared with existing methods.
Methods: This study is a multicenter evaluation of the Oncomine™ Comprehensive Assay Plus (OCA Plus) Pan-Cancer Research Panel for comprehensive genomic profiling of solid tumors.
Theor Appl Genet
January 2025
Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190, Gif-sur-Yvette, France.
Phenomic selection based on parental spectra can be used to predict GCA and SCA in a sparse factorial design. Prediction approaches such as genomic selection can be game changers in hybrid breeding. They allow predicting the genetic values of hybrids without the need for their physical production.
View Article and Find Full Text PDFNucleic Acids Res
January 2025
Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, 10257, Lithuania.
The expansion of single-cell analytical techniques has empowered the exploration of diverse biological questions at the individual cells. Droplet-based single-cell RNA sequencing (scRNA-seq) methods have been particularly widely used due to their high-throughput capabilities and small reaction volumes. While commercial systems have contributed to the widespread adoption of droplet-based scRNA-seq, their relatively high cost limits the ability to profile large numbers of cells and samples.
View Article and Find Full Text PDFInt J Mol Sci
January 2025
Institute of Pathogenic Microorganism, Jiangxi Agricultural University, Nanchang 330000, China.
Monkeypox (MPOX) is a zoonotic viral disease caused by the Monkeypox virus (MPXV), which has become the most significant public health threat within the genus since the eradication of the Variola virus (VARV). Despite the extensive attention MPXV has garnered, little is known about its clinical manifestations in humans. In this study, a high-throughput RNA sequencing (RNA-seq) and liquid chromatography-tandem mass spectrometry (LC-MS/MS) approach was employed to investigate the transcriptional and metabolic responses of HEK293T cells to the MPXV A5L protein.
View Article and Find Full Text PDFInt J Mol Sci
January 2025
Yunnan Provincial Key Laboratory of Animal Nutrition and Feed, Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China.
With the rapid advancement of high-throughput sequencing technologies, whole genome sequencing (WGS) has emerged as a crucial tool for studying genetic variation and population structure. Utilizing population genomics tools to analyze resequencing data allows for the effective integration of selection signals with population history, precise estimation of effective population size, historical population trends, and structural insights, along with the identification of specific genetic loci and variations. This paper reviews current whole genome sequencing technologies, detailing primary research methods, relevant software, and their advantages and limitations within population genomics.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!