Systematic analyses of AISNPs screening and classification algorithms based on genome-wide data for forensic biogeographic ancestry inference.

Forensic Sci Int

Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, Guangdong, China. Electronic address:

Published: April 2024

AI Article Synopsis

Article Abstract

Identifying the biogeographic ancestral origin of biological sample left at a crime scene can provide important evidence for judicial case, as well as clue for narrowing down suspect. Ancestry informative single nucleotide polymorphism (AISNP) has become one of the most important genetic markers in recent years for screening ancestry information loci and analyzing the population genetic background and structure due to their high number and wide distributions in the human genome. In this study, based on data from 26 populations in the 1000 Genomes Project Phase 3, a Random Forest classification model was constructed with one-vs-rest classification strategy for embedded feature selection in order to obtain a panel with a small number of efficient AISNPs. The research aim was to clarify differentiations of population genetic structures among continents and subregions of East Asia. ADMIXTURE results showed that based on the 58 AISNPs selected by the machine learning algorithm, the 26 populations involved in the study could be categorized into six intercontinental ancestry components: North East Asia, South East Asia, Africa, Europe, South Asia, and America. The 24 continental-specific AISNPs and 34 East Asian-specific AISNPs were finally obtained, and used to construct the ancestry prediction model using XGBoost algorithm, resulting in the Matthews correlation coefficients of 0.94 and 0.89, and accuracies of 0.94 and 0.92, respectively. The machine learning models that we constructed using population-specific AISNPs were able to accurately predict the ancestral origins of continental and intra-East Asian populations. To summarize, screening a set of high-perform AISNPs to infer biogeographical ancestral information using embedded feature selection has potential application in creating a layered inference system that accurately differentiates from intercontinental populations to local subpopulations.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.forsciint.2024.111975DOI Listing

Publication Analysis

Top Keywords

east asia
12
population genetic
8
embedded feature
8
feature selection
8
machine learning
8
aisnps
7
ancestry
5
systematic analyses
4
analyses aisnps
4
aisnps screening
4

Similar Publications

In the context of Chinese clinical texts, this paper aims to propose a deep learning algorithm based on Bidirectional Encoder Representation from Transformers (BERT) to identify privacy information and to verify the feasibility of our method for privacy protection in the Chinese clinical context. We collected and double-annotated 33,017 discharge summaries from 151 medical institutions on a municipal regional health information platform, developed a BERT-based Bidirectional Long Short-Term Memory Model (BiLSTM) and Conditional Random Field (CRF) model, and tested the performance of privacy identification on the dataset. To explore the performance of different substructures of the neural network, we created five additional baseline models and evaluated the impact of different models on performance.

View Article and Find Full Text PDF

Lobar pneumonia is an acute inflammation with increasing incidence globally. Delayed treatment can lead to severe complications, posing life-threatening risks. Thus, it is crucial to determine effective treatment methods to improve the prognosis of children with lobar pneumonia.

View Article and Find Full Text PDF

Porcine reproductive and respiratory syndrome virus (PRRSV), an important pathogen affecting the pig industry, is an RNA virus with high genetic diversity. In this study, 12,299 clinical samples were collected from northern China during 2021-2023 to investigate the molecular epidemiological characteristics and genetic evolution of PRRSV. All samples were screened using qRT-PCR and further analyzed through gene and whole-genome sequencing.

View Article and Find Full Text PDF

Current Situation of Goose Astrovirus in China: A Review.

Viruses

January 2025

Center of Disease Immunity and Intervention, College of Medicine, Lishui University, Lishui 323000, China.

Gosling gout disease is an infectious disease caused by goose astrovirus (GAstV), which can result in urate deposition in the internal organs and joints of goslings. Since 2015, outbreaks of gosling gout disease have occurred in several goose-producing areas in China. Subsequently, the disease spread to the vast majority of eastern China, becoming a major threat to goose farms and causing huge economic losses to the goose industry.

View Article and Find Full Text PDF

Serosurvey of Bovine Viral Diarrhea Virus in Cattle in Southern Japan and Estimation of Its Transmissibility by Transient Infection in Nonvaccinated Cattle.

Viruses

January 2025

Laboratory of Microbiology, Department of Disease Control, Faculty of Veterinary Medicine, Hokkaido University, Kita 18, Nishi 9, Kita-Ku, Sapporo 060-0818, Hokkaido, Japan.

Bovine viral diarrhea (BVD) is caused by the BVD virus (BVDV) and has been reported worldwide in cattle. To estimate BVDV circulation among cattle where few BVD cases were reported in southern Japan, 1910 serum samples collected from 35 cattle farms without a BVD outbreak were investigated to detect antibodies against BVDV-1 and BVDV-2 using an indicator virus with a cytopathogenic effect and the luciferase gene, respectively. Neutralizing antibodies against BVDV-1 and BVDV-2 were detected more frequently in 18 vaccinated farms than in 17 nonvaccinated farms.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!