Affordable sequencing and genotyping methods are essential for large scale genome-wide association studies. While genotyping microarrays and reference panels for imputation are available for human subjects, non-human model systems often lack such options. Our lab previously demonstrated an efficient and cost-effective method to genotype heterogeneous stock rats using double-digest genotyping-by-sequencing.
View Article and Find Full Text PDFGenome-wide association studies typically evaluate the autosomes and sometimes the X Chromosome, but seldom consider the Y or mitochondrial (MT) Chromosomes. We genotyped the Y and MT Chromosomes in heterogeneous stock (HS) rats (Rattus norvegicus), an outbred population created from 8 inbred strains. We identified 8 distinct Y and 4 distinct MT Chromosomes among the 8 founders.
View Article and Find Full Text PDFAffordable sequencing and genotyping methods are essential for large scale genome-wide association studies. While genotyping microarrays and reference panels for imputation are available for human subjects, non-human model systems often lack such options. Our lab previously demonstrated an efficient and cost-effective method to genotype heterogeneous stock rats using double-digest genotyping-by-sequencing.
View Article and Find Full Text PDFThe use of single cell/nucleus RNA sequencing (scRNA-seq) technologies that quantitively describe cell transcriptional phenotypes is revolutionizing our understanding of cell biology, leading to new insights in cell type identification, disease mechanisms, and drug development. The tremendous growth in scRNA-seq data has posed new challenges in efficiently characterizing data-driven cell types and identifying quantifiable marker genes for cell type classification. The use of machine learning and explainable artificial intelligence has emerged as an effective approach to study large-scale scRNA-seq data.
View Article and Find Full Text PDFWith the advent of single cell/nucleus RNA sequencing (sc/snRNA-seq), the field of cell phenotyping is now a data-driven exercise providing statistical evidence to support cell type/state categorization. However, the task of classifying cells into specific, well-defined categories with the empirical data provided by sc/snRNA-seq remains nontrivial due to the difficulty in determining specific differences between related cell types with close transcriptional similarities, resulting in challenges with matching cell types identified in separate experiments. To investigate possible approaches to overcome these obstacles, we explored the use of supervised machine learning methods-logistic regression, support vector machines, random forests, neural networks, and light gradient boosting machine (LightGBM)-as approaches to classify cell types using snRNA-seq datasets from human brain middle temporal gyrus (MTG) and human kidney.
View Article and Find Full Text PDF