A black-box re-weighting analysis can correct flawed simulation data

F. Marty Ytreberg, Daniel Zuckerman

Research output: Contribution to journalArticle

9 Citations (Scopus)

Abstract

There is a great need for improved statistical sampling in a range of physical, chemical, and biological systems. Even simulations based on correct algorithms suffer from statistical error, which can be substantial or even dominant when slow processes are involved. Further, in key biomolecular applications, such as the determination of protein structures from NMR data, non-Boltzmann-distributed ensembles are generated. We therefore have developed the "black-box" strategy for reweighting a set of configurations generated by arbitrary means to produce an ensemble distributed according to any target distribution. In contrast to previous algorithmic efforts, the black-box approach exploits the configuration-space density observed in a simulation, rather than assuming a desired distribution has been generated. Successful implementations of the strategy, which reduce both statistical error and bias, are developed for a one-dimensional system, and a 50-atom peptide, for which the correct 250-to-1 population ratio is recovered from a heavily biased ensemble.

Original languageEnglish (US)
Pages (from-to)7982-7987
Number of pages6
JournalProceedings of the National Academy of Sciences of the United States of America
Volume105
Issue number23
DOIs
StatePublished - Jun 10 2008
Externally publishedYes

Fingerprint

Peptides
Population
Proteins

Keywords

  • Canonical sampling
  • Free energy
  • Molecular simulation
  • Non-Boltzmann

ASJC Scopus subject areas

  • General

Cite this

A black-box re-weighting analysis can correct flawed simulation data. / Ytreberg, F. Marty; Zuckerman, Daniel.

In: Proceedings of the National Academy of Sciences of the United States of America, Vol. 105, No. 23, 10.06.2008, p. 7982-7987.

Research output: Contribution to journalArticle

@article{f01b320cbc4347c8a127f0ba28f66cfd,
title = "A black-box re-weighting analysis can correct flawed simulation data",
abstract = "There is a great need for improved statistical sampling in a range of physical, chemical, and biological systems. Even simulations based on correct algorithms suffer from statistical error, which can be substantial or even dominant when slow processes are involved. Further, in key biomolecular applications, such as the determination of protein structures from NMR data, non-Boltzmann-distributed ensembles are generated. We therefore have developed the {"}black-box{"} strategy for reweighting a set of configurations generated by arbitrary means to produce an ensemble distributed according to any target distribution. In contrast to previous algorithmic efforts, the black-box approach exploits the configuration-space density observed in a simulation, rather than assuming a desired distribution has been generated. Successful implementations of the strategy, which reduce both statistical error and bias, are developed for a one-dimensional system, and a 50-atom peptide, for which the correct 250-to-1 population ratio is recovered from a heavily biased ensemble.",
keywords = "Canonical sampling, Free energy, Molecular simulation, Non-Boltzmann",
author = "Ytreberg, {F. Marty} and Daniel Zuckerman",
year = "2008",
month = "6",
day = "10",
doi = "10.1073/pnas.0706063105",
language = "English (US)",
volume = "105",
pages = "7982--7987",
journal = "Proceedings of the National Academy of Sciences of the United States of America",
issn = "0027-8424",
number = "23",

}

TY - JOUR

T1 - A black-box re-weighting analysis can correct flawed simulation data

AU - Ytreberg, F. Marty

AU - Zuckerman, Daniel

PY - 2008/6/10

Y1 - 2008/6/10

N2 - There is a great need for improved statistical sampling in a range of physical, chemical, and biological systems. Even simulations based on correct algorithms suffer from statistical error, which can be substantial or even dominant when slow processes are involved. Further, in key biomolecular applications, such as the determination of protein structures from NMR data, non-Boltzmann-distributed ensembles are generated. We therefore have developed the "black-box" strategy for reweighting a set of configurations generated by arbitrary means to produce an ensemble distributed according to any target distribution. In contrast to previous algorithmic efforts, the black-box approach exploits the configuration-space density observed in a simulation, rather than assuming a desired distribution has been generated. Successful implementations of the strategy, which reduce both statistical error and bias, are developed for a one-dimensional system, and a 50-atom peptide, for which the correct 250-to-1 population ratio is recovered from a heavily biased ensemble.

AB - There is a great need for improved statistical sampling in a range of physical, chemical, and biological systems. Even simulations based on correct algorithms suffer from statistical error, which can be substantial or even dominant when slow processes are involved. Further, in key biomolecular applications, such as the determination of protein structures from NMR data, non-Boltzmann-distributed ensembles are generated. We therefore have developed the "black-box" strategy for reweighting a set of configurations generated by arbitrary means to produce an ensemble distributed according to any target distribution. In contrast to previous algorithmic efforts, the black-box approach exploits the configuration-space density observed in a simulation, rather than assuming a desired distribution has been generated. Successful implementations of the strategy, which reduce both statistical error and bias, are developed for a one-dimensional system, and a 50-atom peptide, for which the correct 250-to-1 population ratio is recovered from a heavily biased ensemble.

KW - Canonical sampling

KW - Free energy

KW - Molecular simulation

KW - Non-Boltzmann

UR - http://www.scopus.com/inward/record.url?scp=45849133760&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=45849133760&partnerID=8YFLogxK

U2 - 10.1073/pnas.0706063105

DO - 10.1073/pnas.0706063105

M3 - Article

C2 - 18544653

AN - SCOPUS:45849133760

VL - 105

SP - 7982

EP - 7987

JO - Proceedings of the National Academy of Sciences of the United States of America

JF - Proceedings of the National Academy of Sciences of the United States of America

SN - 0027-8424

IS - 23

ER -