Gene expression is the process by which the information encoded in a gene is used to produce proteins, which in turn determine the physical characteristics of living beings. It takes place in two steps, transcription and translation: information flows from DNA to RNA with the help of enzymes, and the end products are proteins and other biochemical molecules. Many technologies can capture gene expression from DNA or RNA; one such technique is the DNA microarray. Besides being expensive, the main issue with DNA microarrays is that they generate high-dimensional data from a very small number of samples, so learning models trained on such data tend to overfit. This problem should be addressed by substantially reducing the dimensionality of the data. In recent years, machine learning has gained popularity in genomic studies, and many machine learning-based gene selection approaches have been proposed in the literature to improve the precision of dimensionality reduction. This paper extensively reviews recent work on machine learning-based gene selection, along with an analysis of its performance. The study categorizes feature selection algorithms as supervised, unsupervised, or semi-supervised. Recent work on reducing the number of features for tumor diagnosis is discussed in detail, and the performance of several methods from the literature is analyzed. The study also lists and briefly discusses open issues in handling high-dimensional, small-sample-size data.
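
To make the dimensionality problem concrete, the following is a minimal sketch of one supervised filter-style gene selection step. It is not a method taken from the paper: the signal-to-noise score, the function name `select_genes`, the synthetic data, and all sizes (20 samples, 500 genes, 5 informative genes, k = 10) are assumptions chosen purely for illustration.

```python
import random
import statistics


def select_genes(X, y, k):
    """Rank genes by a two-class signal-to-noise score and keep the top k.

    X: list of samples, each a list of expression values (one per gene).
    y: list of 0/1 class labels, one per sample.
    Returns the sorted indices of the k highest-scoring genes.
    """
    n_genes = len(X[0])
    scores = []
    for g in range(n_genes):
        a = [row[g] for row, label in zip(X, y) if label == 0]
        b = [row[g] for row, label in zip(X, y) if label == 1]
        spread = statistics.stdev(a) + statistics.stdev(b)
        # Constant genes carry no class information, so score them 0.
        diff = abs(statistics.mean(a) - statistics.mean(b))
        scores.append((diff / spread if spread > 0 else 0.0, g))
    scores.sort(reverse=True)
    return sorted(g for _, g in scores[:k])


# Hypothetical synthetic data: far more genes than samples,
# with only genes 0-4 differing between the two classes.
rng = random.Random(0)
n_samples, n_genes, informative = 20, 500, 5
y = [i % 2 for i in range(n_samples)]
X = [
    [rng.gauss(6.0 * label if g < informative else 0.0, 1.0) for g in range(n_genes)]
    for label in y
]

selected = select_genes(X, y, k=10)
X_reduced = [[row[g] for g in selected] for row in X]
print(len(X[0]), "->", len(X_reduced[0]), "genes per sample")
```

A filter like this scores each gene independently of any classifier, which keeps it cheap but ignores interactions between genes; wrapper and embedded approaches trade extra computation for that information.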

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7758324
DOI: http://dx.doi.org/10.3389/fgene.2020.603808

Publication Analysis

Top Keywords

gene selection: 12
gene expression: 12
machine learning: 8
open issues: 8
dna rna: 8
microarray dna: 8
sample size: 8
machine learning-based: 8
learning-based gene: 8
gene: 6

Similar Publications

Photosynthetic microalgae are promising green cell factories for the sustainable production of high-value chemicals and biopharmaceuticals. The chloroplast organelle is being developed as a chassis for synthetic biology as it contains its own genome (the plastome) and some interesting advantages, such as high recombinant protein titers and a diverse and dynamic metabolism. However, chloroplast engineering is currently hampered by the lack of standardized cloning tools and Design-Build-Test-Learn workflows to ease genomic and metabolic engineering.

Upon exposure to salt stress, calcium signaling in plants activates various stress-responsive genes and proteins along with enhancement in antioxidant defense to eventually regulate the cellular homeostasis for reducing cytosolic sodium levels. The coordination among the calcium signaling molecules and transporters plays a crucial role in salinity tolerance. In the present study, twenty-one diverse indigenous rice genotypes were evaluated for salt tolerance during the early seedling stage, and out of that nine genotypes were further selected for physio-biochemical study.

We aimed to assess the impact of splicing variants reported in our laboratory to gain insight into their clinical relevance. A total of 108 consecutive individuals, for whom 113 splicing variants had been reported, were selected for RNA-sequencing (RNA-seq), considering the gene expression in blood. A protocol was developed to perform RNA extraction and sequencing using the same sample (dried blood spots, DBS) provided for the DNA analysis, including library preparation and bioinformatic pipeline analysis.

Crohn's disease (CD) is a chronic inflammatory bowel disease with an unknown etiology. Ubiquitination plays a significant role in the pathogenesis of CD. This study aimed to explore the functional roles of ubiquitination-related genes in CD.

Publicly available trial matching tools can improve access to therapeutic innovations, but errors may expose patients to over-solicitation and disappointment. We performed a pragmatic, non-interventional, prospective evaluation on sequential patients at the Molecular Tumor Board of Centre Leon Berard. During 10 weeks in 2024, we analysed 157 patients with four clinical trial matching tools selected from the 19 screened: Klineo, ScreenAct, Trialing and DigitalECMT.
