Cloud computing for genomic data analysis and collaboration

Ben Langmead, Abhinav Nellore

Research output: Contribution to journalReview article

35 Scopus citations

Abstract

Next-generation sequencing has made major strides in the past decade. Studies based on large sequencing data sets are growing in number, and public archives for raw sequencing data have been doubling in size every 18 months. Leveraging these data requires researchers to use large-scale computational resources. Cloud computing, a model whereby users rent computers and storage from large data centres, is a solution that is gaining traction in genomics research. Here, we describe how cloud computing is used in genomics for research and large-scale collaborations, and argue that its elasticity, reproducibility and privacy features make it ideally suited for the large-scale reanalysis of publicly available archived data, including privacy-protected data.

Original languageEnglish (US)
Pages (from-to)208-219
Number of pages12
JournalNature Reviews Genetics
Volume19
Issue number4
DOIs
StatePublished - Apr 1 2018

    Fingerprint

ASJC Scopus subject areas

  • Molecular Biology
  • Genetics
  • Genetics(clinical)

Cite this