Most organisms are more closely related to nearby than distant members of their species, creating spatial autocorrelations in genetic data. This allows us to predict the location of origin of a genetic sample by comparing it to a set of samples of known geographic origin. Here, we describe a deep learning method, which we call Locator, to accomplish this task faster and more accurately than existing approaches. In simulations, Locator infers sample location to within 4.1 generations of dispersal and runs at least an order of magnitude faster than a recent model-based approach. We leverage Locator's computational efficiency to predict locations separately in windows across the genome, which allows us to both quantify uncertainty and describe the mosaic ancestry and patterns of geographic mixing that characterize many populations. Applied to whole-genome sequence data from parasites, mosquitoes, and global human populations, this approach yields median test errors of 16.9km, 5.7km, and 85km, respectively.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324158PMC
http://dx.doi.org/10.7554/eLife.54507DOI Listing

Publication Analysis

Top Keywords

predicting geographic
4
geographic location
4
location genetic
4
genetic variation
4
variation deep
4
deep neural
4
neural networks
4
networks organisms
4
organisms closely
4
closely nearby
4

Similar Publications

The Circumsporozoite Protein (PfCSP) has been used in developing the RTS,S, and R21 malaria vaccines. However, genetic polymorphisms within compromise the effectiveness of the vaccine. Thus, it is essential to continuously assess the genetic diversity of , especially when deploying it across different geographical regions.

View Article and Find Full Text PDF

The species, valued for their pharmaceutical, ornamental, and economic importance, exhibit notable rarity and endemism in the Karst areas of the Yunnan-Kweichow Plateau in China. These species face significant threats from habitat loss and fragmentation, leading to a decline in biodiversity. To mitigate these threats, the Maxent algorithm was employed to analyze current and future distribution patterns, with a particular focus on the influence of climate variables in predicting potential distribution shifts and assessing extinction risks under the optimistic SSP1-2.

View Article and Find Full Text PDF

Pervasive glacier retreats across Svalbard from 1985 to 2023.

Nat Commun

January 2025

Chair of Data Science in Earth Observation, Department of Aerospace and Geodesy, Technical University of Munich, Munich, Germany.

A major uncertainty in predicting the behaviour of marine-terminating glaciers is ice dynamics driven by non-linear calving front retreat, which is poorly understood and modelled. Using 124919 calving front positions for 149 marine-terminating glaciers in Svalbard from 1985 to 2023, generated with deep learning, we identify pervasive calving front retreats for non-surging glaciers over the past 38 years. We observe widespread seasonal cycles in calving front position for over half of the glaciers.

View Article and Find Full Text PDF

High performance COVID-19 screening using machine learning.

Tunis Med

January 2025

Laboratory of viruses, vectors and hosts: LR20IPT10, Institut Pasteur de Tunis, University of Tunis El Manar, 13, Place Pasteur, 1002 Tunis Belvédère, Tunisia.

Since the World Health Organization declared the Coronavirus Disease 2019 (COVID-19) pandemic as an international concern of public health emergency in the early 2020, several strategies have been initiated in many countries to prevent healthcare services breakdown and collapse of healthcare structures. The most important strategy was the increased testing, diagnosis, isolation, contact tracing to identify, quarantine and test close contacts. In this context, finding a rapid, reliable and affordable tool for COVID-19 screening was the main challenge to address the pandemic.

View Article and Find Full Text PDF

Small mammals and associated infections in China: a systematic review and spatial modelling analysis.

Lancet Reg Health West Pac

January 2025

State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Science, Beijing, PR China.

Background: As natural reservoirs of diverse pathogens, small mammals are considered a key interface for guarding public health due to their wide geographic distribution, high density and frequent interaction with humans.

Methods: All formally recorded natural occurrences of small mammals (Order: Rodentia, Eulipotyphla, Lagomorpha, and Scandentia) and their associated microbial infections in China were searched in the English and Chinese literature spanning from 1950 to 2021 and geolocated. Machine learning models were applied to determine ecological drivers for the distributions of 45 major small mammal species and two common rodent-borne diseases (RBDs), and model-predicted potential risk locations were mapped.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!