Large intergenic noncoding RNAs (lincRNAs) are emerging as key regulators of diverse cellular processes. Determining the function of individual lincRNAs remains a challenge. Recent advances in RNA sequencing (RNA-seq) and computational methods allow for an unprecedented analysis of such transcripts. Here, we present an integrative approach to define a reference catalog of >8000 human lincRNAs. Our catalog unifies previously existing annotation sources with transcripts we assembled from RNA-seq data collected from ∼4 billion RNA-seq reads across 24 tissues and cell types. We characterize each lincRNA by a panorama of >30 properties, including sequence, structural, transcriptional, and orthology features. We found that lincRNA expression is strikingly tissue-specific compared with coding genes, and that lincRNAs are typically coexpressed with their neighboring genes, albeit to an extent similar to that of pairs of neighboring protein-coding genes. We distinguish an additional subset of transcripts that have high evolutionary conservation but may include short ORFs and may serve as either lincRNAs or small peptides. Our integrated, comprehensive, yet conservative reference catalog of human lincRNAs reveals the global properties of lincRNAs and will facilitate experimental studies and further functional classification of these genes.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3185964PMC
http://dx.doi.org/10.1101/gad.17446611DOI Listing

Publication Analysis

Top Keywords

large intergenic
8
intergenic noncoding
8
noncoding rnas
8
reveals global
8
global properties
8
reference catalog
8
human lincrnas
8
lincrnas
7
integrative annotation
4
annotation human
4

Similar Publications

Poly(ethylene terephthalate) (PET) is one of the most ubiquitous plastics and can be depolymerized through biological and chemo-catalytic routes to its constituent monomers, terephthalic acid (TPA) and ethylene glycol (EG). TPA and EG can be re-synthesized into PET for closed-loop recycling or microbially converted into higher-value products for open-loop recycling. Here, we expand on our previous efforts engineering and applying Pseudomonas putida KT2440 for PET conversion by employing adaptive laboratory evolution (ALE) to improve TPA catabolism.

View Article and Find Full Text PDF

Novel replication-competent reporter-expressing Rift Valley fever viruses for molecular studies.

J Virol

December 2024

Centro de Investigación en Sanidad Animal (CISA), Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria, Consejo Superior de Investigaciones Científicas (INIA-CSIC), Madrid, Spain.

Unlabelled: Rift Valley fever virus (RVFV) is a mosquito-borne zoonotic disease that causes severe disease in both domestic and wild ungulates and humans, making it a significant threat to livestock and public health. The RVFV genome consists of three single-stranded, negative-sense RNA segments differing in size: small (S), medium (M), and large (L). Segment S encodes the virus nucleoprotein N and the virulence-associated factor non-structural (NSs) protein in opposite orientations, separated by an intergenic region (IGR).

View Article and Find Full Text PDF

Background: Genetic colocalization analysis is a statistical method that evaluates whether two traits (e.g., osteoarthritis [OA] risk and microRNA [miRNA] expression levels) share the same or distinct genetic association signals in a locus typically identified in genome-wide association studies (GWAS).

View Article and Find Full Text PDF

DRGAT: Predicting Drug Responses Via Diffusion-Based Graph Attention Network.

J Comput Biol

December 2024

Artificial Intelligence and Data Engineering Department, Ozyegin University, Istanbul, Turkey.

Accurately predicting drug response depending on a patient's genomic profile is critical for advancing personalized medicine. Deep learning approaches rise and especially the rise of graph neural networks leveraging large-scale omics datasets have been a key driver of research in this area. However, these biological datasets, which are typically high dimensional but have small sample sizes, present challenges such as overfitting and poor generalization in predictive models.

View Article and Find Full Text PDF
Article Synopsis
  • * Various diagnostic methods were used, including phenotypic testing, MALDI-TOF MS, and molecular techniques that analyze specific genes, confirming the isolate as T. abortisuis.
  • * The results showed a high sequence identity with reference strains, demonstrating the importance of integrating different diagnostic techniques for accurate identification of bacterial pathogens in veterinary medicine.
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!