A large-scale dataset of single and mixed-source short tandem repeat profiles to inform human identification strategies: PROVEDIt.

Forensic Sci Int Genet

Biomedical Forensic Sciences Program, Boston University School of Medicine, United States; Center for Computational and Integrative Biology, Rutgers University, United States; Department of Chemistry, Rutgers University, Camden, United States.

Published: January 2018

DNA-based human identity testing is conducted by comparing PCR-amplified polymorphic short tandem repeat (STR) motifs from a known source with the STR profiles obtained from uncertain sources. Samples such as those found at crime scenes often produce a signal that is a composite of incomplete STR profiles from an unknown number of unknown contributors, which makes interpretation an arduous task. To support progress on these STR interpretation challenges, we provide over 25,000 multiplex STR profiles produced from one to five known individuals at target levels ranging from one to 160 copies of DNA. The data, generated under 144 laboratory conditions, are classified by total copy number and contributor proportions. For the 70% of samples that were synthetically compromised, we report the level of DNA damage using quantitative and end-point PCR. In addition, we characterize the complexity of the signal by exploring the number of detected alleles in each profile.
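As a rough illustration of what "number of detected alleles in each profile" can mean in practice, the following minimal sketch counts peaks above a detection threshold and derives a lower bound on the number of contributors. It is not the authors' pipeline: the profile layout, the locus values, the peak heights, and the 50 RFU threshold are all hypothetical, and the contributor bound uses the standard maximum allele count rule (each person contributes at most two alleles per autosomal locus), which the abstract does not itself describe.

```python
from math import ceil

# Hypothetical profile: locus -> list of (allele label, peak height in RFU).
# The values are made up for illustration and are not taken from PROVEDIt.
profile = {
    "D3S1358": [("15", 820), ("16", 790), ("17", 45)],
    "vWA": [("14", 610), ("17", 580), ("18", 300), ("19", 20)],
    "FGA": [("21", 400), ("24", 390)],
}


def detected_alleles(profile, threshold_rfu):
    """Return, per locus, the allele labels whose peak height meets the threshold."""
    return {
        locus: [allele for allele, height in peaks if height >= threshold_rfu]
        for locus, peaks in profile.items()
    }


def total_allele_count(detected):
    """Total number of detected alleles summed over all loci."""
    return sum(len(alleles) for alleles in detected.values())


def min_contributors(detected):
    """Lower bound on contributors from the maximum allele count at any locus."""
    max_per_locus = max((len(a) for a in detected.values()), default=0)
    return ceil(max_per_locus / 2)


# Assumed detection threshold of 50 RFU (an illustrative choice only).
detected = detected_alleles(profile, threshold_rfu=50)
print(total_allele_count(detected))
print(min_contributors(detected))
```

The bound is only a minimum: allele sharing between contributors and allele dropout at low copy numbers can both hide contributors, which is part of why interpretation of such mixtures is difficult.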

Source
http://dx.doi.org/10.1016/j.fsigen.2017.10.006
