Compound stutter in D2S1338 and D12S391

Research output: Contribution to journalArticleResearchpeer-review

Abstract

Compound repeats provide a testing ground for evaluating hypotheses on the causes and consequences of stutter. As compound repeats have two repeating motifs, each motif may produce stutter variants; thus as the repeat within the locus is compound, so is the resultant stutter. Further, the rates of stutter formation between these motifs may not be independent. This lack of independence may complicate modeling strategies, thus contributing to the challenges of mixture interpretation that rely on nucleotide sequence. This study evaluates compound stutter in two STR loci: D2S1338 and D12S391. The effects of flanking variation, as well as possible interactions between the two different uninterrupted stretches (US) and their respective stutter variants are assessed. Multivariate multiple linear regression (MMLR) was used to show that, as with simple repeats, the rate of stutter product formation of a particular repeating motif is not solely a function of the US of that repeat. The nucleotides adjacent to the repeating motif also appear to influence the rate of stutter formation of that motif, with those nucleotides sometimes including the other motif. MMLR was used to estimate the size of these effects and to construct an example of a two-dimensional (thus, a compound) stutter prediction. This example may merit further investigation in the application of massively parallel sequencing data to mixture interpretation and probabilistic genotyping.

Original languageEnglish
Pages (from-to)50-56
Number of pages7
JournalForensic Science International: Genetics
Volume39
DOIs
StatePublished - 1 Mar 2019

Fingerprint

Linear Models
Nucleotides
High-Throughput Nucleotide Sequencing

Keywords

  • Flanking variation
  • Massively parallel sequencing
  • Probabilistic genotyping
  • Stutter

Cite this

@article{d8e2f18a787e4178b199a9b0d89e86f5,
title = "Compound stutter in D2S1338 and D12S391",
abstract = "Compound repeats provide a testing ground for evaluating hypotheses on the causes and consequences of stutter. As compound repeats have two repeating motifs, each motif may produce stutter variants; thus as the repeat within the locus is compound, so is the resultant stutter. Further, the rates of stutter formation between these motifs may not be independent. This lack of independence may complicate modeling strategies, thus contributing to the challenges of mixture interpretation that rely on nucleotide sequence. This study evaluates compound stutter in two STR loci: D2S1338 and D12S391. The effects of flanking variation, as well as possible interactions between the two different uninterrupted stretches (US) and their respective stutter variants are assessed. Multivariate multiple linear regression (MMLR) was used to show that, as with simple repeats, the rate of stutter product formation of a particular repeating motif is not solely a function of the US of that repeat. The nucleotides adjacent to the repeating motif also appear to influence the rate of stutter formation of that motif, with those nucleotides sometimes including the other motif. MMLR was used to estimate the size of these effects and to construct an example of a two-dimensional (thus, a compound) stutter prediction. This example may merit further investigation in the application of massively parallel sequencing data to mixture interpretation and probabilistic genotyping.",
keywords = "Flanking variation, Massively parallel sequencing, Probabilistic genotyping, Stutter",
author = "Woerner, {August Eric} and King, {Jonathan L.} and Bruce Budowle",
year = "2019",
month = "3",
day = "1",
doi = "10.1016/j.fsigen.2018.12.001",
language = "English",
volume = "39",
pages = "50--56",
journal = "Forensic Science International: Genetics",
issn = "1872-4973",
publisher = "Elsevier Ireland Ltd",

}

Compound stutter in D2S1338 and D12S391. / Woerner, August Eric; King, Jonathan L.; Budowle, Bruce.

In: Forensic Science International: Genetics, Vol. 39, 01.03.2019, p. 50-56.

Research output: Contribution to journalArticleResearchpeer-review

TY - JOUR

T1 - Compound stutter in D2S1338 and D12S391

AU - Woerner, August Eric

AU - King, Jonathan L.

AU - Budowle, Bruce

PY - 2019/3/1

Y1 - 2019/3/1

N2 - Compound repeats provide a testing ground for evaluating hypotheses on the causes and consequences of stutter. As compound repeats have two repeating motifs, each motif may produce stutter variants; thus as the repeat within the locus is compound, so is the resultant stutter. Further, the rates of stutter formation between these motifs may not be independent. This lack of independence may complicate modeling strategies, thus contributing to the challenges of mixture interpretation that rely on nucleotide sequence. This study evaluates compound stutter in two STR loci: D2S1338 and D12S391. The effects of flanking variation, as well as possible interactions between the two different uninterrupted stretches (US) and their respective stutter variants are assessed. Multivariate multiple linear regression (MMLR) was used to show that, as with simple repeats, the rate of stutter product formation of a particular repeating motif is not solely a function of the US of that repeat. The nucleotides adjacent to the repeating motif also appear to influence the rate of stutter formation of that motif, with those nucleotides sometimes including the other motif. MMLR was used to estimate the size of these effects and to construct an example of a two-dimensional (thus, a compound) stutter prediction. This example may merit further investigation in the application of massively parallel sequencing data to mixture interpretation and probabilistic genotyping.

AB - Compound repeats provide a testing ground for evaluating hypotheses on the causes and consequences of stutter. As compound repeats have two repeating motifs, each motif may produce stutter variants; thus as the repeat within the locus is compound, so is the resultant stutter. Further, the rates of stutter formation between these motifs may not be independent. This lack of independence may complicate modeling strategies, thus contributing to the challenges of mixture interpretation that rely on nucleotide sequence. This study evaluates compound stutter in two STR loci: D2S1338 and D12S391. The effects of flanking variation, as well as possible interactions between the two different uninterrupted stretches (US) and their respective stutter variants are assessed. Multivariate multiple linear regression (MMLR) was used to show that, as with simple repeats, the rate of stutter product formation of a particular repeating motif is not solely a function of the US of that repeat. The nucleotides adjacent to the repeating motif also appear to influence the rate of stutter formation of that motif, with those nucleotides sometimes including the other motif. MMLR was used to estimate the size of these effects and to construct an example of a two-dimensional (thus, a compound) stutter prediction. This example may merit further investigation in the application of massively parallel sequencing data to mixture interpretation and probabilistic genotyping.

KW - Flanking variation

KW - Massively parallel sequencing

KW - Probabilistic genotyping

KW - Stutter

UR - http://www.scopus.com/inward/record.url?scp=85058222913&partnerID=8YFLogxK

U2 - 10.1016/j.fsigen.2018.12.001

DO - 10.1016/j.fsigen.2018.12.001

M3 - Article

VL - 39

SP - 50

EP - 56

JO - Forensic Science International: Genetics

JF - Forensic Science International: Genetics

SN - 1872-4973

ER -