The current deluge of newly identified RNA transcripts presents a singular opportunity for improved assessment of coding potential, a cornerstone of genome annotation, and for machine-driven discovery of biological knowledge. While traditional, feature-based methods for RNA classification are limited by current scientific knowledge, deep learning methods can independently discover complex biological rules in the data de novo. We trained a gated recurrent neural network (RNN) on human messenger RNA (mRNA) and long noncoding RNA (lncRNA) sequences.
The epidermal permeability barrier (EPB) prevents organisms from dehydration and infection. The transcriptional regulation of EPB development is poorly understood. We demonstrate here that transcription factor COUP-TF-interacting protein 1 (CTIP1/BCL11A; hereafter CTIP1) is highly expressed in the developing murine epidermis.
OBJECTIVE: The molecular mechanisms behind cerebral aneurysm formation and rupture remain poorly understood. In the past decade, microRNAs (miRNAs) have been shown to be key regulators in a host of biological processes. They are noncoding RNA molecules, approximately 21 nucleotides long, that posttranscriptionally inhibit mRNAs by attenuating protein translation and promoting mRNA degradation.
The fundamental question of how sequence defines conformation is explicitly answered if the structures of all possible sequences of a macromolecule are determined. We present here a crystallographic screen of all permutations of the inverted repeat DNA sequence d(CCnnnN6N7N8GG), where N6, N7, and N8 are any of the four naturally occurring nucleotides. At this point, 63 of the 64 possible permutations have been crystallized from a defined set of solutions.
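The count of 64 follows from three free positions with four possible nucleotides each (4 × 4 × 4 = 64). A minimal Python sketch of that enumeration is given here for illustration only. It assumes that "inverted repeat" means the ten-nucleotide strand is self-complementary, so that the lowercase nnn positions are fixed as the reverse complement of N6N7N8. That reading, and the helper names `reverse_complement` and `inverted_repeat`, are assumptions of this sketch and are not taken from the abstract.

```python
from itertools import product

# Watson-Crick pairing for the four naturally occurring DNA nucleotides.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def inverted_repeat(core):
    """Build CC-nnn-N6N7N8-GG for a three-nucleotide core (N6N7N8).

    Assumption: nnn is the reverse complement of the core, which makes
    the full ten-nucleotide strand self-complementary.
    """
    return "CC" + reverse_complement(core) + core + "GG"


# Enumerate every choice of N6, N7, N8 from the four nucleotides.
sequences = [inverted_repeat("".join(core)) for core in product("ACGT", repeat=3)]

print(len(sequences))  # number of distinct sequences in the screen
```

Under this assumption every generated strand equals its own reverse complement, which is one quick way to check that the inverted-repeat symmetry holds across the whole set.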