Rare-variant extensions of the transmission disequilibrium test: Application to autism exome sequence data

Zongxiao He, Brian O'Roak, Joshua D. Smith, Gao Wang, Stanley Hooker, Regie Lyn P Santos-Cortez, Biao Li, Mengyuan Kan, Nik Krumm, Deborah A. Nickerson, Jay Shendure, Evan E. Eichler, Suzanne M. Leal

Research output: Contribution to journalArticle

37 Citations (Scopus)

Abstract

Many population-based rare-variant (RV) association tests, which aggregate variants across a region, have been developed to analyze sequence data. A drawback of analyzing population-based data is that it is difficult to adequately control for population substructure and admixture, and spurious associations can occur. For RVs, this problem can be substantial, because the spectrum of rare variation can differ greatly between populations. A solution is to analyze parent-child trio data, by using the transmission disequilibrium test (TDT), which is robust to population substructure and admixture. We extended the TDT to test for RV associations using four commonly used methods. We demonstrate that for all RV-TDT methods, using proper analysis strategies, type I error is well-controlled even when there are high levels of population substructure or admixture. For trio data, unlike for population-based data, RV allele-counting association methods will lead to inflated type I errors. However type I errors can be properly controlled by obtaining p values empirically through haplotype permutation. The power of the RV-TDT methods was evaluated and compared to the analysis of case-control data with a number of genetic and disease models. The RV-TDT was also used to analyze exome data from 199 Simons Simplex Collection autism trios and an association was observed with variants in ABCA7. Given the problem of adequately controlling for population substructure and admixture in RV association studies and the growing number of sequence-based trio studies, the RV-TDT is extremely beneficial to elucidate the involvement of RVs in the etiology of complex traits.

Original languageEnglish (US)
Pages (from-to)33-46
Number of pages14
JournalAmerican Journal of Human Genetics
Volume94
Issue number1
DOIs
StatePublished - Jan 2 2014

Fingerprint

Exome
Autistic Disorder
Population
Inborn Genetic Diseases
Genetic Models
Haplotypes
Sequence Analysis
Alleles

ASJC Scopus subject areas

  • Genetics
  • Genetics(clinical)

Cite this

Rare-variant extensions of the transmission disequilibrium test : Application to autism exome sequence data. / He, Zongxiao; O'Roak, Brian; Smith, Joshua D.; Wang, Gao; Hooker, Stanley; Santos-Cortez, Regie Lyn P; Li, Biao; Kan, Mengyuan; Krumm, Nik; Nickerson, Deborah A.; Shendure, Jay; Eichler, Evan E.; Leal, Suzanne M.

In: American Journal of Human Genetics, Vol. 94, No. 1, 02.01.2014, p. 33-46.

Research output: Contribution to journalArticle

He, Z, O'Roak, B, Smith, JD, Wang, G, Hooker, S, Santos-Cortez, RLP, Li, B, Kan, M, Krumm, N, Nickerson, DA, Shendure, J, Eichler, EE & Leal, SM 2014, 'Rare-variant extensions of the transmission disequilibrium test: Application to autism exome sequence data', American Journal of Human Genetics, vol. 94, no. 1, pp. 33-46. https://doi.org/10.1016/j.ajhg.2013.11.021
He, Zongxiao ; O'Roak, Brian ; Smith, Joshua D. ; Wang, Gao ; Hooker, Stanley ; Santos-Cortez, Regie Lyn P ; Li, Biao ; Kan, Mengyuan ; Krumm, Nik ; Nickerson, Deborah A. ; Shendure, Jay ; Eichler, Evan E. ; Leal, Suzanne M. / Rare-variant extensions of the transmission disequilibrium test : Application to autism exome sequence data. In: American Journal of Human Genetics. 2014 ; Vol. 94, No. 1. pp. 33-46.
@article{072b9d6bfbd64b5b9162428e0b3c1596,
title = "Rare-variant extensions of the transmission disequilibrium test: Application to autism exome sequence data",
abstract = "Many population-based rare-variant (RV) association tests, which aggregate variants across a region, have been developed to analyze sequence data. A drawback of analyzing population-based data is that it is difficult to adequately control for population substructure and admixture, and spurious associations can occur. For RVs, this problem can be substantial, because the spectrum of rare variation can differ greatly between populations. A solution is to analyze parent-child trio data, by using the transmission disequilibrium test (TDT), which is robust to population substructure and admixture. We extended the TDT to test for RV associations using four commonly used methods. We demonstrate that for all RV-TDT methods, using proper analysis strategies, type I error is well-controlled even when there are high levels of population substructure or admixture. For trio data, unlike for population-based data, RV allele-counting association methods will lead to inflated type I errors. However type I errors can be properly controlled by obtaining p values empirically through haplotype permutation. The power of the RV-TDT methods was evaluated and compared to the analysis of case-control data with a number of genetic and disease models. The RV-TDT was also used to analyze exome data from 199 Simons Simplex Collection autism trios and an association was observed with variants in ABCA7. Given the problem of adequately controlling for population substructure and admixture in RV association studies and the growing number of sequence-based trio studies, the RV-TDT is extremely beneficial to elucidate the involvement of RVs in the etiology of complex traits.",
author = "Zongxiao He and Brian O'Roak and Smith, {Joshua D.} and Gao Wang and Stanley Hooker and Santos-Cortez, {Regie Lyn P} and Biao Li and Mengyuan Kan and Nik Krumm and Nickerson, {Deborah A.} and Jay Shendure and Eichler, {Evan E.} and Leal, {Suzanne M.}",
year = "2014",
month = "1",
day = "2",
doi = "10.1016/j.ajhg.2013.11.021",
language = "English (US)",
volume = "94",
pages = "33--46",
journal = "American Journal of Human Genetics",
issn = "0002-9297",
publisher = "Cell Press",
number = "1",

}

TY - JOUR

T1 - Rare-variant extensions of the transmission disequilibrium test

T2 - Application to autism exome sequence data

AU - He, Zongxiao

AU - O'Roak, Brian

AU - Smith, Joshua D.

AU - Wang, Gao

AU - Hooker, Stanley

AU - Santos-Cortez, Regie Lyn P

AU - Li, Biao

AU - Kan, Mengyuan

AU - Krumm, Nik

AU - Nickerson, Deborah A.

AU - Shendure, Jay

AU - Eichler, Evan E.

AU - Leal, Suzanne M.

PY - 2014/1/2

Y1 - 2014/1/2

N2 - Many population-based rare-variant (RV) association tests, which aggregate variants across a region, have been developed to analyze sequence data. A drawback of analyzing population-based data is that it is difficult to adequately control for population substructure and admixture, and spurious associations can occur. For RVs, this problem can be substantial, because the spectrum of rare variation can differ greatly between populations. A solution is to analyze parent-child trio data, by using the transmission disequilibrium test (TDT), which is robust to population substructure and admixture. We extended the TDT to test for RV associations using four commonly used methods. We demonstrate that for all RV-TDT methods, using proper analysis strategies, type I error is well-controlled even when there are high levels of population substructure or admixture. For trio data, unlike for population-based data, RV allele-counting association methods will lead to inflated type I errors. However type I errors can be properly controlled by obtaining p values empirically through haplotype permutation. The power of the RV-TDT methods was evaluated and compared to the analysis of case-control data with a number of genetic and disease models. The RV-TDT was also used to analyze exome data from 199 Simons Simplex Collection autism trios and an association was observed with variants in ABCA7. Given the problem of adequately controlling for population substructure and admixture in RV association studies and the growing number of sequence-based trio studies, the RV-TDT is extremely beneficial to elucidate the involvement of RVs in the etiology of complex traits.

AB - Many population-based rare-variant (RV) association tests, which aggregate variants across a region, have been developed to analyze sequence data. A drawback of analyzing population-based data is that it is difficult to adequately control for population substructure and admixture, and spurious associations can occur. For RVs, this problem can be substantial, because the spectrum of rare variation can differ greatly between populations. A solution is to analyze parent-child trio data, by using the transmission disequilibrium test (TDT), which is robust to population substructure and admixture. We extended the TDT to test for RV associations using four commonly used methods. We demonstrate that for all RV-TDT methods, using proper analysis strategies, type I error is well-controlled even when there are high levels of population substructure or admixture. For trio data, unlike for population-based data, RV allele-counting association methods will lead to inflated type I errors. However type I errors can be properly controlled by obtaining p values empirically through haplotype permutation. The power of the RV-TDT methods was evaluated and compared to the analysis of case-control data with a number of genetic and disease models. The RV-TDT was also used to analyze exome data from 199 Simons Simplex Collection autism trios and an association was observed with variants in ABCA7. Given the problem of adequately controlling for population substructure and admixture in RV association studies and the growing number of sequence-based trio studies, the RV-TDT is extremely beneficial to elucidate the involvement of RVs in the etiology of complex traits.

UR - http://www.scopus.com/inward/record.url?scp=84891831893&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84891831893&partnerID=8YFLogxK

U2 - 10.1016/j.ajhg.2013.11.021

DO - 10.1016/j.ajhg.2013.11.021

M3 - Article

C2 - 24360806

AN - SCOPUS:84891831893

VL - 94

SP - 33

EP - 46

JO - American Journal of Human Genetics

JF - American Journal of Human Genetics

SN - 0002-9297

IS - 1

ER -