The rhesus macaque is three times as diverse but more closely equivalent in damaging coding variation as compared to the human

Qiaoping Yuan, Zhifeng Zhou, Stephen G. Lindell, J. D. Higley, Betsy Ferguson, Robert C. Thompson, Juan F. Lopez, Stephen J. Suomi, Basel Baghal, Maggie Baker, Deborah C. Mash, Christina S. Barr, David Goldman

    Research output: Contribution to journalArticle

    26 Scopus citations

    Abstract

    Background: As a model organism in biomedicine, the rhesus macaque (Macaca mulatta) is the most widely used nonhuman primate. Although a draft genome sequence was completed in 2007, there has been no systematic genome-wide comparison of genetic variation of this species to humans. Comparative analysis of functional and nonfunctional diversity in this highly abundant and adaptable non-human primate could inform its use as a model for human biology, and could reveal how variation in population history and size alters patterns and levels of sequence variation in primates.Results: We sequenced the mRNA transcriptome and H3K4me3-marked DNA regions in hippocampus from 14 humans and 14 rhesus macaques. Using equivalent methodology and sampling spaces, we identified 462,802 macaque SNPs, most of which were novel and disproportionately located in the functionally important genomic regions we had targeted in the sequencing. At least one SNP was identified in each of 16,797 annotated macaque genes. Accuracy of macaque SNP identification was conservatively estimated to be >90%. Comparative analyses using SNPs equivalently identified in the two species revealed that rhesus macaque has approximately three times higher SNP density and average nucleotide diversity as compared to the human. Based on this level of diversity, the effective population size of the rhesus macaque is approximately 80,000 which contrasts with an effective population size of less than 10,000 for humans. Across five categories of genomic regions, intergenic regions had the highest SNP density and average nucleotide diversity and CDS (coding sequences) the lowest, in both humans and macaques. Although there are more coding SNPs (cSNPs) per individual in macaques than in humans, the ratio of dN/dS is significantly lower in the macaque. Furthermore, the number of damaging nonsynonymous cSNPs (have damaging effects on protein functions from PolyPhen-2 prediction) in the macaque is more closely equivalent to that of the human.Conclusions: This large panel of newly identified macaque SNPs enriched for functionally significant regions considerably expands our knowledge of genetic variation in the rhesus macaque. Comparative analysis reveals that this widespread, highly adaptable species is approximately three times as diverse as the human but more closely equivalent in damaging variation.

    Original languageEnglish (US)
    Article number52
    JournalBMC Genetics
    Volume13
    DOIs
    StatePublished - Jun 29 2012

    Keywords

    • Comparative genomics
    • Diversity
    • Human
    • Rhesus macaque
    • Single nucleotide polymorphism

    ASJC Scopus subject areas

    • Genetics
    • Genetics(clinical)

    Fingerprint Dive into the research topics of 'The rhesus macaque is three times as diverse but more closely equivalent in damaging coding variation as compared to the human'. Together they form a unique fingerprint.

  • Cite this

    Yuan, Q., Zhou, Z., Lindell, S. G., Higley, J. D., Ferguson, B., Thompson, R. C., Lopez, J. F., Suomi, S. J., Baghal, B., Baker, M., Mash, D. C., Barr, C. S., & Goldman, D. (2012). The rhesus macaque is three times as diverse but more closely equivalent in damaging coding variation as compared to the human. BMC Genetics, 13, [52]. https://doi.org/10.1186/1471-2156-13-52