The discovery of microRNAs (miRNAs) remains an important problem, particularly given the growth of high-throughput sequencing, cell sorting and single cell biology. While a large number of miRNAs have already been annotated, there may well be large numbers of miRNAs that are expressed in very particular cell types and remain elusive. Sequencing allows us to quickly and accurately identify the expression of known miRNAs from small RNA-Seq data. The biogenesis of miRNAs leads to very specific characteristics observed in their sequences. In brief, miRNAs usually have a well-defined 5' end and a more flexible 3' end with the possibility of 3' tailing events, such as uridylation. Previous approaches to the prediction of novel miRNAs usually involve the analysis of structural features of miRNA precursor hairpin sequences obtained from genome sequence. We surmised that it may be possible to identify miRNAs by using these biogenesis features observed directly from sequenced reads, solely or in addition to structural analysis from genome data. To this end, we have developed mirnovo, a machine learning based algorithm, which is able to identify known and novel miRNAs in animals and plants directly from small RNA-Seq data, with or without a reference genome. This method performs comparably to existing tools, however is simpler to use with reduced run time. Its performance and accuracy has been tested on multiple datasets, including species with poorly assembled genomes, RNaseIII (Drosha and/or Dicer) deficient samples and single cells (at both embryonic and adult stage).

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5716205PMC
http://dx.doi.org/10.1093/nar/gkx836DOI Listing

Publication Analysis

Top Keywords

mirnas
9
small rna-seq
8
rna-seq data
8
novel mirnas
8
mirnovo genome-free
4
genome-free prediction
4
prediction micrornas
4
micrornas small
4
small rna
4
rna sequencing
4

Similar Publications

Crosstalk between non-coding RNAs and programmed cell death in colorectal cancer: implications for targeted therapy.

Epigenetics Chromatin

January 2025

Research Center for Biochemistry and Nutrition in Metabolic Diseases, Institute for Basic Sciences, Kashan University of Medical Sciences, Kashan, Iran.

Background: Colorectal cancer (CRC) remains one of the most common causes of cancer-related mortality worldwide. Its progression is influenced by complex interactions involving genetic, epigenetic, and environmental factors. Non-coding RNAs (ncRNAs), including microRNAs (miRNAs), long non-coding RNAs (lncRNAs), and circular RNAs (circRNAs), have been identified as key regulators of gene expression, affecting diverse biological processes, notably programmed cell death (PCD).

View Article and Find Full Text PDF

Background: Head and neck squamous cell carcinoma (HNSCC), a highly invasive malignancy with a poor prognosis, is one of the most common cancers globally. Circular RNAs (circRNAs) have become key regulators of human malignancies, but further studies are necessary to fully understand their functions and possible causes in HNSCC.

Methods: CircCCT2 expression levels in HNSCC tissues and cells were measured via qPCR.

View Article and Find Full Text PDF

Regulatory role of lnc-MAP3K13-3:1 on miR-6894-3p and SHROOM2 in modulating cellular dynamics in hepatocellular carcinoma.

BMC Cancer

January 2025

Jiangxi Provincial Key Laboratory of Child Development and Genetics, Jiangxi Provincial Children's Hospital, No. 122 of YangMing Road, DongHu District, NanChang, 330006, China.

Background: Hepatocellular carcinoma (HCC) is a prevalent primary liver malignancy and a leading cause of cancer-related mortality worldwide. Despite advancements in therapeutic strategies, the 5-year survival rate for individuals undergoing curative resection remains between 10% and 15%. Consequently, identifying molecular targets that specifically inhibit the proliferation and metastasis of HCC cells is critical for improving treatment outcomes.

View Article and Find Full Text PDF

Background: miRNAs (microRNAs) are endogenous RNAs with lengths of 18 to 24 nucleotides and play critical roles in gene regulation and disease progression. Although traditional wet-lab experiments provide direct evidence for miRNA-disease associations, they are often time-consuming and complicated to analyze by current bioinformatics tools. In recent years, machine learning (ML) and deep learning (DL) techniques are powerful tools to analyze large-scale biological data.

View Article and Find Full Text PDF

Background: Muscle and adipose tissue are the most critical indicators of beef quality, and their development and function are regulated by noncoding RNAs (ncRNAs). However, the differential regulatory mechanisms of ncRNAs in muscle and adipose tissue remain unclear.

Results: In this study, 2,343 differentially expressed mRNAs (DEMs), 235 differentially expressed lncRNAs (DELs), 95 differentially expressed circRNAs (DECs) and 54 differentially expressed miRNAs (DEmiRs) were identified in longissimus dorsi muscle (LD), subcutaneous fat (SF) and perirenal fat (VF) in Qinchuan beef cattle.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!