Massively parallel sequencing (MPS) offers advantages over current capillary electrophoresis-based analysis of short tandem repeat (STR) loci for human identification testing. In particular STR repeat motif sequence information can be obtained, thereby increasing the discrimination power of some loci. While sequence variation within the repeat region is observed relatively frequently in some of the commonly used STRs, there is an additional degree of variation found in the flanking regions adjacent to the repeat motif. Repeat motif and flanking region sequence variation have been described for major population groups, however, not for more isolated populations. Flanking region sequence variation in STR and single nucleotide polymorphism (SNP) loci in the Yavapai population was analyzed using the ForenSeq™ DNA Signature Prep Kit and STRait Razor v2s. Seven and 14 autosomal STRs and identity-informative single nucleotide polymorphisms (iiSNPs), respectively, had some degree of flanking region variation. Three and four of these identity-informative loci, respectively, showed ≥5% increase in expected heterozygosity. The combined length- and sequence-based random match probabilities (RMPs) for 27 autosomal STRs were 6.11 × 10−26 and 2.79 × 10−29, respectively. When combined with 94 iiSNPs (a subset of which became microhaplotypes) the combined RMP was 5.49 × 10−63. Analysis of length-based and sequence-based autosomal STRs in STRUCTURE indicated that the Yavapai are most similar to the Hispanic population. While producing minimal increase in X- and Y-STR discrimination potential, access to flanking region data enabled identification of one novel X-STR and three Y-STR alleles relative to previous reports. Five ancestry-informative SNPs (aiSNPs) and two phenotype-informative SNPs (piSNPs) exhibited notable flanking region variation.
- Flanking region
- ForenSeq™ DNA Signature Prep Kit
- Massively parallel sequencing
- STRait Razor v2s
- Sequence variation