63 results match your criteria: "Institute of Statistical Sciences[Affiliation]"
J Biopharm Stat
November 2024
School of Statistics, Dongbei University of Finance and Economics, Dalian, China.
Taking into account the local dependence structure in large-scale multiple testing is expected to improve both the efficiency of the testing procedure and the interpretability of scientific findings. The hidden Markov model (HMM), as an effective model to describe the sequential dependence, has been successfully applied to large-scale multiple testing with local correlations. However, in many applications, the first-order Markov chain is not flexible enough to capture the complexity of local correlations.
View Article and Find Full Text PDFForensic Sci Int Genet
January 2025
Institute of Statistical Sciences, School of Mathematics, Woodland Road, University of Bristol, Bristol BS8 1UG, UK; MRC Integrative Epidemiology Unit, School of Medicine, Oakfield Grove, University of Bristol, Bristol BS8 2BN, UK. Electronic address:
Microhaplotypes (MHs) describe physically close genetic markers that are inherited together and are gaining prominence due to their efficiency in forensic, clinical, and population studies. They excel in kinship analysis, DNA mixture detection, and ancestry inference, offering advantages in precision over individual SNPs and STRs. In this study, a pipeline was developed to efficiently select highly informative MHs from large-scale genomic datasets.
View Article and Find Full Text PDFAm J Hum Genet
August 2024
Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong, China; Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China; Big Data Bio-Intelligence Lab, The Hong Kong University of Science and Technology, Hong Kong SAR, China. Electronic address:
Mendelian randomization (MR), which utilizes genetic variants as instrumental variables (IVs), has gained popularity as a method for causal inference between phenotypes using genetic data. While efforts have been made to relax IV assumptions and develop new methods for causal inference in the presence of invalid IVs due to confounding, the reliability of MR methods in real-world applications remains uncertain. Instead of using simulated datasets, we conducted a benchmark study evaluating 16 two-sample summary-level MR methods using real-world genetic datasets to provide guidelines for the best practices.
View Article and Find Full Text PDFMol Ecol
April 2024
RZSS WildGenes Laboratory, Conservation Department, Royal Zoological Society of Scotland, Edinburgh, UK.
This paper asks the question: can genomic information be used to recover a species that is already on the pathway to extinction due to genetic swamping from a related and more numerous population? We show that a breeding strategy in a captive breeding program can use whole genome sequencing to identify and remove segments of DNA introgressed through hybridisation. The proposed policy uses a generalized measure of kinship or heterozygosity accounting for local ancestry, that is, whether a specific genetic location was inherited from the target of conservation. We then show that optimizing these measures would minimize undesired ancestry while also controlling kinship and/or heterozygosity, in a simulated breeding population.
View Article and Find Full Text PDFNature
February 2024
Lundbeck Foundation GeoGenetics Centre, Globe Institute, University of Copenhagen, Copenhagen, Denmark.
Nature
January 2024
Lundbeck Foundation GeoGenetics Centre, Globe Institute, University of Copenhagen, Copenhagen, Denmark.
Nature
January 2024
Lundbeck Foundation GeoGenetics Centre, Globe Institute, University of Copenhagen, Copenhagen, Denmark.
Major migration events in Holocene Eurasia have been characterized genetically at broad regional scales. However, insights into the population dynamics in the contact zones are hampered by a lack of ancient genomic data sampled at high spatiotemporal resolution. Here, to address this, we analysed shotgun-sequenced genomes from 100 skeletons spanning 7,300 years of the Mesolithic period, Neolithic period and Early Bronze Age in Denmark and integrated these with proxies for diet (C and N content), mobility (Sr/Sr ratio) and vegetation cover (pollen).
View Article and Find Full Text PDFNature
January 2024
Lundbeck Foundation GeoGenetics Centre, Globe Institute, University of Copenhagen, Copenhagen, Denmark.
The Holocene (beginning around 12,000 years ago) encompassed some of the most significant changes in human evolution, with far-reaching consequences for the dietary, physical and mental health of present-day populations. Using a dataset of more than 1,600 imputed ancient genomes, we modelled the selection landscape during the transition from hunting and gathering, to farming and pastoralism across West Eurasia. We identify key selection signals related to metabolism, including that selection at the FADS cluster began earlier than previously reported and that selection near the LCT locus predates the emergence of the lactase persistence allele by thousands of years.
View Article and Find Full Text PDFBioinformatics
January 2024
Department of Biostatistics, City University of Hong Kong, Tat Chee Avenue, Hong Kong, China.
Motivation: The utilization of single-cell bisulfite sequencing (scBS-seq) methods allows for precise analysis of DNA methylation patterns at the individual cell level, enabling the identification of rare populations, revealing cell-specific epigenetic changes, and improving differential methylation analysis. Nonetheless, the presence of sparse data and an overabundance of zeros and ones, attributed to limited sequencing depth and coverage, frequently results in reduced precision accuracy during the process of differential methylation detection using scBS-seq. Consequently, there is a pressing demand for an innovative differential methylation analysis approach that effectively tackles these data characteristics and enhances recognition accuracy.
View Article and Find Full Text PDFBMJ Open
December 2023
Research Institute of Statistical Sciences, National Bureau of Statistics of China, Beijing, China.
Front Public Health
March 2023
Department of Rehabilitation Medicine, West China Hospital, Sichuan University, Chengdu, Sichuan, China.
Introduction: Sarcopenia and low hemoglobin level are common in older adults. Few studies have evaluated the association between hemoglobin level and sarcopenia and with inconsistent findings. The multifaceted effects of sarcopenia on the human body and the high prevalence of anemia in the Chinese population make it necessary to explore the association between the two.
View Article and Find Full Text PDFDiabetes Res Clin Pract
March 2023
Rehabilitation Medicine Center, West China Hospital, Sichuan University, Chengdu, Sichuan, China.
Aims: To investigate the association between alanine transaminase (ALT) and in-hospital death in patients admitted to the intensive care unit for diabetic ketoacidosis (DKA).
Methods: A cohort of 2,684 patients was constructed from the eICU Collaborative Research Database. Baseline demographic and clinical characteristics were summarized.
BMC Genomics
July 2022
Yunnan Key Laboratory of Statistical Modeling and Data Analysis, Yunnan University, Kunming, China.
Background: Using single-cell RNA sequencing (scRNA-seq) data to diagnose disease is an effective technique in medical research. Several statistical methods have been developed for the classification of RNA sequencing (RNA-seq) data, including, for example, Poisson linear discriminant analysis (PLDA), negative binomial linear discriminant analysis (NBLDA), and zero-inflated Poisson logistic discriminant analysis (ZIPLDA). Nevertheless, few existing methods perform well for large sample scRNA-seq data, in particular when the distribution assumption is also violated.
View Article and Find Full Text PDFR Soc Open Sci
December 2021
Unité Eco-Anthropologie (EA), Muséum National d'Histoire Naturelle, 17 place du Trocadero, Paris 75016, France.
Integrating datasets from different disciplines is hard because the data are often qualitatively different in meaning, scale and reliability. When two datasets describe the same entities, many scientific questions can be phrased around whether the (dis)similarities between entities are conserved across such different data. Our method, CLARITY, quantifies consistency across datasets, identifies where inconsistencies arise and aids in their interpretation.
View Article and Find Full Text PDFFront Immunol
December 2021
Division of Pulmonology, Department of Internal Medicine, Far Eastern Memorial Hospital, New Taipei City, Taiwan.
Background: The incidence of nontuberculous mycobacterial lung disease (NTM-LD) is increasing worldwide. Immune exhaustion has been reported in NTM-LD, but T-cell immunoglobulin and mucin domain-containing protein 3 (TIM3), a co-inhibitory receptor on T cells, has been scarcely studied.
Methods: Patients with NTM-LD and healthy controls were prospectively recruited from July 2014 to August 2019 at three tertiary referral centers in Taiwan.
Biomedicines
October 2021
College of Medicine, National Taiwan University, Taipei 100, Taiwan.
Controlling latent tuberculosis infection (LTBI) is important for preventing tuberculosis (TB). However, the immune regulation of LTBI remains uncertain. Immune checkpoints and CD14+ monocytes are pivotal for immune defense but have been scarcely studied in LTBI.
View Article and Find Full Text PDFLifetime Data Anal
October 2021
School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia.
In medical studies, the collected covariates contain underlying outliers. For clustered/longitudinal data with censored observations, the traditional Gehan-type estimator is robust to outliers in response but sensitive to outliers in the covariate domain, and it also ignores the within-cluster correlations. To take account of within-cluster correlations, varying cluster sizes, and outliers in covariates, we propose weighted Gehan-type estimating functions for parameter estimation in the accelerated failure time model for clustered data.
View Article and Find Full Text PDFBMC Genomics
June 2021
Department of Statistics and Data Science, Southern University of Science and Technology, Shenzhen, China.
Background: Identifying differentially expressed genes between the same or different species is an urgent demand for biological and medical research. For RNA-seq data, systematic technical effects and different sequencing depths are usually encountered when conducting experiments. Normalization is regarded as an essential step in the discovery of biologically important changes in expression.
View Article and Find Full Text PDFJCO Precis Oncol
March 2022
Institute of Statistical Sciences, Academia Sinica, Taipei, Taiwan.
Stat Med
August 2021
Zhongtai Securities Institute for Financial Studies, Shandong University, Jinan, China.
Bulk and single-cell RNA-seq (scRNA-seq) data are being used as alternatives to traditional technology in biology and medicine research. These data are used, for example, for the detection of differentially expressed (DE) genes. Several statistical methods have been developed for the classification of bulk and single-cell RNA-seq data.
View Article and Find Full Text PDFJ Appl Stat
March 2021
College of Mathematics and Statistics, Institute of Statistical Sciences, Shenzhen Key Laboratory of Advanced Machine Learning and Applications, Shenzhen University, Shenzhen, People's Republic of China.
In this paper, we consider the estimation and model selection for longitudinal partial linear varying coefficient errors-in-variables (EV) models when the covariates are measured with some additive errors. Bias-corrected penalized quadratic inference functions method is proposed based on quadratic inference functions with two penalty function terms. The proposed method can not only handle the measurement errors of covariates and within-subject correlations but also estimate and select significant non-zero parametric and nonparametric components simultaneously.
View Article and Find Full Text PDFFront Genet
March 2021
Shenzhen Key Laboratory of Advanced Machine Learning and Applications, College of Mathematics and Statistics, Institute of Statistical Sciences, Shenzhen University, Shenzhen, China.
Next-generation sequencing has emerged as an essential technology for the quantitative analysis of gene expression. In medical research, RNA sequencing (RNA-seq) data are commonly used to identify which type of disease a patient has. Because of the discrete nature of RNA-seq data, the existing statistical methods that have been developed for microarray data cannot be directly applied to RNA-seq data.
View Article and Find Full Text PDFMol Cancer
February 2021
Institute of Biomedical Big Data, Wenzhou Medical University, Wenzhou, 325027, China.
Early detection is crucial to improve breast cancer (BC) patients' outcomes and survival. Mammogram and ultrasound adopting the Breast Imaging Reporting and Data System (BI-RADS) categorization are widely used for BC early detection, while suffering high false-positive rate leading to unnecessary biopsy, especially in BI-RADS category-4 patients. Plasma cell-free DNA (cfDNA) carrying on DNA methylation information has emerged as a non-invasive approach for cancer detection.
View Article and Find Full Text PDFRecent Results Cancer Res
January 2021
Genomics Research Center, Academia Sinica, 128 Academia Road, Sect. 2, Taipei, 115, Taiwan.
Cancer Immunol Immunother
May 2021
Graduate Institute of Toxicology, College of Medicine, National Taiwan University, No.1, Section 4, Ren-Ai Rd, Taipei, 100, Taiwan.