We present a technically simple approach for gene expression cytometry combining next-generation sequencing with stochastic barcoding of single cells. A combinatorial library of beads bearing cell- and molecular-barcoding capture probes is used to uniquely label transcripts and reconstruct the digital gene expression profile of thousands of individual cells in a single experiment without the need for robotics or automation. We applied the technology to dissect the human hematopoietic system and to characterize heterogeneous response to in vitro stimulation.
View Article and Find Full Text PDFWe present a new approach for the sensitive detection and accurate quantitation of messenger ribonucleic acid (mRNA) gene transcripts in single cells. First, the entire population of mRNAs is encoded with molecular barcodes during reverse transcription. After amplification of the gene targets of interest, molecular barcodes are counted by sequencing or scored on a simple hybridization detector to reveal the number of molecules in the starting sample.
View Article and Find Full Text PDFWe present a simple molecular indexing method for quantitative targeted RNA sequencing, in which mRNAs of interest are selectively captured from complex cDNA libraries and sequenced to determine their absolute concentrations. cDNA fragments are individually labeled so that each molecule can be tracked from the original sample through the library preparation and sequencing process. Multiple copies of cDNA fragments of identical sequence become distinct through labeling, and replicate clones created during PCR amplification steps can be identified and assigned to their distinct parent molecules.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
May 2011
We implement a unique strategy for single molecule counting termed stochastic labeling, where random attachment of a diverse set of labels converts a population of identical DNA molecules into a population of distinct DNA molecules suitable for threshold detection. The conceptual framework for stochastic labeling is developed and experimentally demonstrated by determining the absolute and relative number of selected genes after stochastically labeling approximately 360,000 different fragments of the human genome. The approach does not require the physical separation of molecules and takes advantage of highly parallel methods such as microarray and sequencing technologies to simultaneously count absolute numbers of multiple targets.
View Article and Find Full Text PDFBackground: Copy number variants (CNVs) account for a large proportion of genetic variation in the genome. The initial discoveries of long (> 100 kb) CNVs in normal healthy individuals were made on BAC arrays and low resolution oligonucleotide arrays. Subsequent studies that used higher resolution microarrays and SNP genotyping arrays detected the presence of large numbers of CNVs that are < 100 kb, with median lengths of approximately 10 kb.
View Article and Find Full Text PDFThe rapid amplification of cDNA ends (RACE) procedure is a widely used PCR-based method to clone the cDNA ends of mRNA transcripts. Current RACE methods often produce a high background of nonspecific PCR products, which can exclude the identification of the target cDNA of interest. We describe here an improved RACE procedure using circular cDNA templates and demonstrate the successful extension cloning of 4406 cDNAs.
View Article and Find Full Text PDFAlternative splicing of pre-mRNAs is an important mechanism for the generation of vertebrate protein diversity. Unfortunately, the contribution of alternative splicing to protein diversity is currently not well understood because many full-length mRNA sequences have yet to be identified. In this report, we describe the use of RT-PCR to identify and clone 279 novel alternatively spliced mRNAs from 114 well-known drug target genes.
View Article and Find Full Text PDFSerine 171 in the GABA(A) receptor gamma2 subunit is highly conserved in the ligand-gated ion channel superfamily. In this paper, we report that mutating serine 171 within gamma2 to glycine or cysteine prevents the interaction of gamma2 with alpha2 and beta1 when these subunits are co-expressed in human embryo kidney 293 cells, resulting in intracellular retention of gamma2. Structure analysis based on a three-dimensional homology model of gamma2 (Ernst, M.
View Article and Find Full Text PDFWe have cloned a novel gamma-aminobutyric acid type A (GABAA) receptor gamma2 subunit variant named gamma2XL. gamma2XL contains an alternatively spliced exon, resulting in the addition of 40 amino acids to the N-terminal extracellular domain between Ser171 and Tyr172. We show that gamma2XL failed to localize to the cell surface when it was coexpressed with the alpha2 and beta1 subunits in human embryonic kidney 293 cells.
View Article and Find Full Text PDFAlthough several signal peptide-trapping methods have been devised and used to detect signal sequences, none have relied on using E.coli to identify eukaryotic proteins with signal peptides. Here, we describe a system for selecting human secreted and membrane proteins in E.
View Article and Find Full Text PDFReverse transcription of mRNA often leads to the synthesis of partial, non-full-length cDNAs. Methods to facilitate reverse transcription across RNA regions of secondary structure, as well as enzyme modifications to eliminate RNase H activities inherent to reverse transcriptase enzymes, have been previously reported. However, because all reverse transcriptases have high error rates of polymerization, the misincorporation of nucleotides can also cause the reverse transcriptase to stumble.
View Article and Find Full Text PDFThe majority of expressed sequence tag (EST) sequences available today have been derived from the 5' ends of cDNA clones. Obtaining high-quality DNA sequences from the 3' ends of oligo(dT)-primed cDNA on a large scale has been difficult because of slippage of the DNA polymerase enzyme used in direct PCR and cycle sequencing. With the completion of whole genome sequencing for more and more organisms, mRNA 3'-UTR sequences can be particularly useful for clustering large numbers of ESTs for the effective discrimination of individual genes and gene families.
View Article and Find Full Text PDF