Increasing the discrimination power of ancestry- and identity-informative SNP loci within the ForenSeq™ DNA Signature Prep Kit

Jonathan L. King, Jennifer Churchill Cihlar, Nicole M.M. Novroski, Xiangpei Zeng, David H. Warshauer, Lay Hong Seah, Bruce Budowle

Research output: Contribution to journal › Article

7 Citations (Scopus)

Abstract

The use of single nucleotide polymorphisms (SNPs) in forensic genetics has been limited to challenged samples with low template and/or degraded DNA. The recent introduction of massively parallel sequencing (MPS) technologies has expanded the potential applications of these markers and increased the discrimination power of well-established loci by considering variation in the flanking regions of target loci. The ForenSeq Signature Preparation Kit contains 165 SNP amplicons for ancestry- (aiSNPs), identity- (iiSNPs), and phenotype-inference (piSNPs). In this study, 714 individuals from four major populations (African American, AFA; East Asian, ASN; US Caucasian, CAU; and Southwest US Hispanic, HIS) previously reported by Churchill et al. [Forensic Sci Int Genet. 30 (2017) 81–92; DOI: https://doi.org/10.1016/j.fsigen.2017.06.004] were assessed using STRait Razor v2s to determine the level of diversity in the flanking regions of these amplicons. The results show that nearly 70% of loci showed some level of flanking region variation with 22 iiSNPs and 8 aiSNPs categorized as microhaplotypes in this study. The heterozygosities of these microhaplotypes approached, and in one instance surpassed, those of some core STR loci. Also, the impact of the flanking region on other forensic parameters (e.g., power of exclusion and power of discrimination) was examined. Sixteen of the 94 iiSNPs had an effective allele number greater than 2.00 across the four populations. To assess what effect the flanking region information had on the ancestry inference, genotype probabilities and likelihood ratios were determined. Additionally, concordance with the ForenSeq UAS and Nextera Rapid Capture was evaluated, and patterns of heterozygote imbalance were identified. Pairwise comparison of the iiSNP diplotypes determined the probability of detecting a mixture (i.e., observing ≥ 3 haplotypes) using these loci alone was 0.9952. 
The improvement in random match probabilities for the full regions over the target iiSNPs was found to be significant. When combining the iiSNPs with the autosomal STRs, the combined match probabilities ranged from 6.40 × 10⁻⁷³ (ASN) to 1.02 × 10⁻⁷⁹ (AFA).
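
The abstract reports effective allele numbers and heterozygosities without stating how they are computed. The sketch below uses the standard textbook definitions, expected heterozygosity He = 1 − Σpᵢ² and effective allele number Ae = 1 / Σpᵢ², which may differ in detail (for example, sample-size corrections) from the estimators the authors used. The function names and the four-haplotype frequencies are hypothetical and are not taken from the study. The sketch shows why a target SNP alone cannot exceed Ae = 2.00, whereas a locus with additional flanking-region haplotypes can.

```python
import math
from typing import Sequence


def _sum_of_squares(freqs: Sequence[float]) -> float:
    """Return sum(p_i^2) after checking that the frequencies sum to 1."""
    if not math.isclose(sum(freqs), 1.0, abs_tol=1e-9):
        raise ValueError("allele/haplotype frequencies must sum to 1")
    return sum(p * p for p in freqs)


def expected_heterozygosity(freqs: Sequence[float]) -> float:
    """Expected heterozygosity under Hardy-Weinberg: He = 1 - sum(p_i^2)."""
    return 1.0 - _sum_of_squares(freqs)


def effective_allele_number(freqs: Sequence[float]) -> float:
    """Effective number of alleles: Ae = 1 / sum(p_i^2)."""
    return 1.0 / _sum_of_squares(freqs)


# A biallelic SNP reaches its maximum at equal frequencies: Ae = 2.0.
target_snp = [0.5, 0.5]

# Hypothetical microhaplotype: the same locus resolved into four haplotypes
# once flanking-region variants are included (illustrative values only).
microhaplotype = [0.4, 0.3, 0.2, 0.1]

for label, freqs in [("target SNP", target_snp), ("microhaplotype", microhaplotype)]:
    print(
        f"{label}: He = {expected_heterozygosity(freqs):.2f}, "
        f"Ae = {effective_allele_number(freqs):.2f}"
    )
```

With these illustrative inputs the biallelic SNP gives Ae = 2.00 and the four-haplotype locus gives Ae of about 3.33, so any locus with an effective allele number above 2.00 must carry more than two distinguishable haplotypes.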

Original language: English
Pages (from-to): 60-76
Number of pages: 17
Journal: Forensic Science International: Genetics
Volume: 36
DOI: 10.1016/j.fsigen.2018.06.005
State: Published - 1 Sep 2018

Fingerprint

Single Nucleotide Polymorphism
DNA
Forensic Genetics
Southwestern United States
Viverridae
High-Throughput Nucleotide Sequencing
Heterozygote
Hispanic Americans
African Americans
Haplotypes
Population
Alleles
Genotype
Technology
Phenotype

Keywords

  • Bioinformatics
  • FGx
  • ForenSeq
  • Massively parallel sequencing
  • Microhaplotypes
  • SNPs

Cite this

@article{2b9904e01b1d4581b2379888e71481c9,
title = "Increasing the discrimination power of ancestry- and identity-informative SNP loci within the ForenSeq™ DNA Signature Prep Kit",
abstract = "The use of single nucleotide polymorphisms (SNPs) in forensic genetics has been limited to challenged samples with low template and/or degraded DNA. The recent introduction of massively parallel sequencing (MPS) technologies has expanded the potential applications of these markers and increased the discrimination power of well-established loci by considering variation in the flanking regions of target loci. The ForenSeq Signature Preparation Kit contains 165 SNP amplicons for ancestry- (aiSNPs), identity- (iiSNPs), and phenotype-inference (piSNPs). In this study, 714 individuals from four major populations (African American, AFA; East Asian, ASN; US Caucasian, CAU; and Southwest US Hispanic, HIS) previously reported by Churchill et al. [Forensic Sci Int Genet. 30 (2017) 81–92; DOI: https://doi.org/10.1016/j.fsigen.2017.06.004] were assessed using STRait Razor v2s to determine the level of diversity in the flanking regions of these amplicons. The results show that nearly 70{\%} of loci showed some level of flanking region variation with 22 iiSNPs and 8 aiSNPs categorized as microhaplotypes in this study. The heterozygosities of these microhaplotypes approached, and in one instance surpassed, those of some core STR loci. Also, the impact of the flanking region on other forensic parameters (e.g., power of exclusion and power of discrimination) was examined. Sixteen of the 94 iiSNPs had an effective allele number greater than 2.00 across the four populations. To assess what effect the flanking region information had on the ancestry inference, genotype probabilities and likelihood ratios were determined. Additionally, concordance with the ForenSeq UAS and Nextera Rapid Capture was evaluated, and patterns of heterozygote imbalance were identified. Pairwise comparison of the iiSNP diplotypes determined the probability of detecting a mixture (i.e., observing ≥ 3 haplotypes) using these loci alone was 0.9952. 
The improvement in random match probabilities for the full regions over the target iiSNPs was found to be significant. When combining the iiSNPs with the autosomal STRs, the combined match probabilities ranged from 6.40 × 10$^{-73}$ (ASN) to 1.02 × 10$^{-79}$ (AFA).",
keywords = "Bioinformatics, FGx, ForenSeq, Massively parallel sequencing, Microhaplotypes, SNPs",
author = "King, {Jonathan L.} and Cihlar, {Jennifer Churchill} and Novroski, {Nicole M.M.} and Zeng, Xiangpei and Warshauer, {David H.} and Seah, {Lay Hong} and Budowle, Bruce",
year = "2018",
month = "9",
day = "1",
doi = "10.1016/j.fsigen.2018.06.005",
language = "English",
volume = "36",
pages = "60--76",
journal = "Forensic Science International: Genetics",
issn = "1872-4973",
publisher = "Elsevier Ireland Ltd",
}

Increasing the discrimination power of ancestry- and identity-informative SNP loci within the ForenSeq™ DNA Signature Prep Kit. / King, Jonathan L.; Cihlar, Jennifer Churchill; Novroski, Nicole M.M.; Zeng, Xiangpei; Warshauer, David H.; Seah, Lay Hong; Budowle, Bruce.

In: Forensic Science International: Genetics, Vol. 36, 01.09.2018, p. 60-76.

TY - JOUR

T1 - Increasing the discrimination power of ancestry- and identity-informative SNP loci within the ForenSeq™ DNA Signature Prep Kit

AU - King, Jonathan L.

AU - Cihlar, Jennifer Churchill

AU - Novroski, Nicole M.M.

AU - Zeng, Xiangpei

AU - Warshauer, David H.

AU - Seah, Lay Hong

AU - Budowle, Bruce

PY - 2018/9/1

Y1 - 2018/9/1

AB - The use of single nucleotide polymorphisms (SNPs) in forensic genetics has been limited to challenged samples with low template and/or degraded DNA. The recent introduction of massively parallel sequencing (MPS) technologies has expanded the potential applications of these markers and increased the discrimination power of well-established loci by considering variation in the flanking regions of target loci. The ForenSeq Signature Preparation Kit contains 165 SNP amplicons for ancestry- (aiSNPs), identity- (iiSNPs), and phenotype-inference (piSNPs). In this study, 714 individuals from four major populations (African American, AFA; East Asian, ASN; US Caucasian, CAU; and Southwest US Hispanic, HIS) previously reported by Churchill et al. [Forensic Sci Int Genet. 30 (2017) 81–92; DOI: https://doi.org/10.1016/j.fsigen.2017.06.004] were assessed using STRait Razor v2s to determine the level of diversity in the flanking regions of these amplicons. The results show that nearly 70% of loci showed some level of flanking region variation with 22 iiSNPs and 8 aiSNPs categorized as microhaplotypes in this study. The heterozygosities of these microhaplotypes approached, and in one instance surpassed, those of some core STR loci. Also, the impact of the flanking region on other forensic parameters (e.g., power of exclusion and power of discrimination) was examined. Sixteen of the 94 iiSNPs had an effective allele number greater than 2.00 across the four populations. To assess what effect the flanking region information had on the ancestry inference, genotype probabilities and likelihood ratios were determined. Additionally, concordance with the ForenSeq UAS and Nextera Rapid Capture was evaluated, and patterns of heterozygote imbalance were identified. Pairwise comparison of the iiSNP diplotypes determined the probability of detecting a mixture (i.e., observing ≥ 3 haplotypes) using these loci alone was 0.9952. 
The improvement in random match probabilities for the full regions over the target iiSNPs was found to be significant. When combining the iiSNPs with the autosomal STRs, the combined match probabilities ranged from 6.40 × 10⁻⁷³ (ASN) to 1.02 × 10⁻⁷⁹ (AFA).

KW - Bioinformatics

KW - FGx

KW - ForenSeq

KW - Massively parallel sequencing

KW - Microhaplotypes

KW - SNPs

UR - http://www.scopus.com/inward/record.url?scp=85048802168&partnerID=8YFLogxK

U2 - 10.1016/j.fsigen.2018.06.005

DO - 10.1016/j.fsigen.2018.06.005

M3 - Article

C2 - 29935396

AN - SCOPUS:85048802168

VL - 36

SP - 60

EP - 76

JO - Forensic Science International: Genetics

JF - Forensic Science International: Genetics

SN - 1872-4973

ER -