Genome-wide association studies present computational challenges for missing data imputation, while the advances of genotype technologies are generating datasets of large sample sizes with sample sets genotyped on multiple SNP chips. We present a new framework SparRec (Sparse Recovery) for imputation, with the following properties: (1) The optimization models of SparRec, based on low-rank and low number of co-clusters of matrices, are different from current statistics methods. While our low-rank matrix completion (LRMC) model is similar to Mendel-Impute, our matrix co-clustering factorization (MCCF) model is completely new. (2) SparRec, as other matrix completion methods, is flexible to be applied to missing data imputation for large meta-analysis with different cohorts genotyped on different sets of SNPs, even when there is no reference panel. This kind of meta-analysis is very challenging for current statistics based methods. (3) SparRec has consistent performance and achieves high recovery accuracy even when the missing data rate is as high as 90%. Compared with Mendel-Impute, our low-rank based method achieves similar accuracy and efficiency, while the co-clustering based method has advantages in running time. The testing results show that SparRec has significant advantages and competitive performance over other state-of-the-art existing statistics methods including Beagle and fastPhase.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5071878PMC
http://dx.doi.org/10.1038/srep35534DOI Listing

Publication Analysis

Top Keywords

missing data
16
matrix completion
12
data imputation
12
current statistics
8
statistics methods
8
based method
8
sparrec
6
sparrec effective
4
matrix
4
effective matrix
4

Similar Publications

Background: Hepatitis C virus (HCV) and hepatitis B virus (HBV) infections pose significant global health concerns, contributing to chronic liver diseases. Blood transfusion is identified as a potential route for the transmission of these viruses, necessitating effective screening strategies for blood donors. The aim of this study was to assess the significance of nucleic acid testing (NAT) in detecting HBV and HCV infections among blood donors who initially tested negative in serological tests.

View Article and Find Full Text PDF

Home Health Care Research for Children With Disability and Medical Complexity.

Pediatrics

January 2025

Complex Care, Division of General Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts.

Pediatric home health care represents a vital system of care for children with disability and medical complexity, encompassing services provided by family caregivers and nonfamily home health care providers and the use of durable medical equipment and supplies. Home health care is medically necessary for the physiologic health of children with disability and medical complexity and for their participation and function within home, school, and community settings. While the study of pediatric home health care in the United States has increased in the last decade, its research remains primarily methodologically limited to observational studies.

View Article and Find Full Text PDF

Background/purpose: Many designs of static computer-assisted implant surgery (sCAIS) are available for clinician to achieve proper implant position. However, there were not any studies that approached the design alone to evaluate whether sleeve-in-sleeve or sleeve-on-drill design provided most accuracy implant position. The purpose of this study was to investigate the precision of implant placement with sleeve-in-sleeve and sleeve-on-drill static computer assisted implant surgery (sCAIS) designs.

View Article and Find Full Text PDF

Background: Despite extensive research on OHCA in urban centres worldwide, there is a significant gap in knowledge regarding these events in less urbanized regions, especially in Low-Middle-Income Countries (LMICs).

Aim: To determine the characteristics and outcomes of adult out-of-hospital cardiac arrest (OHCA) in rural and suburban districts of Sindh, Pakistan.

Methods: Data of OHCA patients (>18 years) was collected retrospectively from January 2020 to December 2022, from the medical records of district and tehsil hospitals of the province of Sindh Data analysis was performed using the Statistical Package Software for the Social Sciences (SPSS) Statistics 29.

View Article and Find Full Text PDF

Objectives: The ideal model of care for individuals with Differences of Sex Development (DSD) continues to evolve, with multiple models proposed. This study aimed to explore current care models for individuals with DSD in Australia and New Zealand (NZ) and to identify clinician perceptions of gaps and barriers in current practice.

Methods: Cross-sectional anonymous online questionnaire, conducted via Research Electronic Data Capture (REDCap) software.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!