Estimating optimal window size for analysis of low-coverage next-generation sequence data.

Bioinformatics

Department of Statistics, University of Leeds, Leeds LS2 9JT, United Kingdom, Department of Statistics, Faculty of Science, King Saud University, Riyadh, Saudi Arabia, Leeds Institute Cancer and Pathology, University of Leeds, Leeds LS9 7TF, UK and Illumina UK Ltd., Chesterford Research Park, Saffron Walden, CB10 1XL, UK.

Published: July 2014

Motivation: Current high-throughput sequencing has greatly transformed genome sequence analysis. In the context of very low-coverage sequencing (<0.1×), performing 'binning' or 'windowing' on mapped short sequences ('reads') is critical to extract genomic information of interest for further evaluation, such as copy-number alteration analysis. If the window size is too small, many windows will exhibit zero counts and almost no pattern can be observed. In contrast, if the window size is too wide, the patterns or genomic features will be 'smoothed out'. Our objective is to identify an optimal window size in between the two extremes.

Results: We assume the reads density to be a step function. Given this model, we propose a data-based estimation of optimal window size based on Akaike's information criterion (AIC) and cross-validation (CV) log-likelihood. By plotting the AIC and CV log-likelihood curve as a function of window size, we are able to estimate the optimal window size that minimizes AIC or maximizes CV log-likelihood. The proposed methods are of general purpose and we illustrate their application using low-coverage next-generation sequence datasets from real tumour samples and simulated datasets.

Availability And Implementation: An R package to estimate optimal window size is available at http://www1.maths.leeds.ac.uk/∼arief/R/win/.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btu123DOI Listing

Publication Analysis

Top Keywords

estimating optimal
4
optimal window
4
window size
4
size analysis
4
analysis low-coverage
4
low-coverage next-generation
4
next-generation sequence
4
sequence data
4
data motivation
4
motivation current
4

Similar Publications

The objective of this retrospective observational study was to estimate the prevalence of actinic keratosis (AK) in individuals aged ≥ 40 years in France, to describe the characteristics of affected patients, and to describe treatments. A representative panel of 20,000 households with ≥ 1 member aged ≥ 40 years were invited to participate. Participants who reported AK lesions diagnosed by a physician were eligible.

View Article and Find Full Text PDF

What Does it Cost to Provide Free Maternal and Child Health Services in Primary Health Centres? A Case Study of Imo State, Southeast Nigeria.

West Afr J Med

September 2024

Health Policy Research Group, Department of Pharmacology and Therapeutics, College of Medicine, University of Nigeria Enugu-Campus, Enugu, Nigeria.

Background: This study estimated the cost of providing free maternal and child health (MCH) services at the primary health centre (PHC) level in southeast Nigeria. The costs of providing an essential benefit package of maternal and child health (MCH) services are unknown. Such information is required for optimal resource allocation decisions and for replicating similar programmes in different settings.

View Article and Find Full Text PDF

A vision model for automated frozen tuna processing.

Sci Rep

January 2025

School of Food and Pharmacy, Zhejiang Ocean University, Zhoushan, 316022, People's Republic of China.

Accurate and rapid segmentation of key parts of frozen tuna, along with precise pose estimation, is crucial for automated processing. However, challenges such as size differences and indistinct features of tuna parts, as well as the complexity of determining fish poses in multi-fish scenarios, hinder this process. To address these issues, this paper introduces TunaVision, a vision model based on YOLOv8 designed for automated tuna processing.

View Article and Find Full Text PDF

Association between intraoperative fluid management and postoperative outcomes in living kidney donors: a retrospective cohort study.

Sci Rep

January 2025

Department of Anesthesiology and Pain Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, 81, Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea.

Optimal fluid strategy for laparoscopic donor nephrectomy (LDN) remains unclear. LDN has been a domain for liberal fluid management to ensure graft perfusion, but this can result in adverse outcomes due to fluid overload. We compared postoperative outcome of living kidney donors according to the intraoperative fluid management.

View Article and Find Full Text PDF

Epidemiological status, development trends, and risk factors of disability-adjusted life years due to diabetic kidney disease: A systematic analysis of Global Burden of Disease Study 2021.

Chin Med J (Engl)

January 2025

Department of Metabolism and Endocrinology, National Clinical Research Center for Metabolic Diseases, Key Laboratory of Diabetes Immunology (Central South University), Ministry of Education, The Second Xiangya Hospital of Central South University, Changsha, Hunan 410011, China.

Background: Approximately 40% of individuals with diabetes worldwide are at risk of developing diabetic kidney disease (DKD), which is not only the leading cause of kidney failure, but also significantly increases the risk of cardiovascular disease, causing significant societal health and financial burdens. This study aimed to describe the burden of DKD and explore its cross-country epidemiological status, predict development trends, and assess its risk factors and sociodemographic transitions.

Methods: Based on the Global Burden of Diseases (GBD) Study 2021, data on DKD due to type 1 diabetes (DKD-T1DM) and type 2 diabetes (DKD-T2DM) were analyzed by sex, age, year, and location.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!