PROSPECT-PSPP: An automatic computational pipeline for protein structure prediction

Jun Tao Guo; Kyle Ellrott; Won Jae Chung; Dong Xu; Serguei Passovets; Ying Xu

doi:10.1093/nar/gkh414

PROSPECT-PSPP: An automatic computational pipeline for protein structure prediction

Jun Tao Guo, Kyle Ellrott, Won Jae Chung, Dong Xu, Serguei Passovets, Ying Xu

Research output: Contribution to journal › Article › peer-review

15 Scopus citations

Abstract

Knowledge of the detailed structure of a protein is crucial to our understanding of the biological functions of that protein. The gap between the number of solved protein structures and the number of protein sequences continues to widen rapidly in the postgenomics era due to long and expensive processes for solving structures experimentally. Computational prediction of structures from amino acid sequence has come to play a key role in narrowing the gap and has been successful in providing useful information for the biological research community. We have developed a prediction pipeline, PROSPECT-PSPP, an integration of multiple computational tools, for fully automated protein structure prediction. The pipeline consists of tools for (i) preprocessing of protein sequences, which includes signal peptide prediction, protein type prediction (membrane or soluble) and protein domain partition, (ii) secondary structure prediction, (iii) fold recognition and (iv) atomic structural model generation. The centerpiece of the pipeline is our threading-based program PROSPECT. The pipeline is implemented using SOAP (Simple Object Access Protocol), which makes it easier to share our tools and resources. The pipeline has an easy-to-use user interface and is implemented on a 64-node dual processor Linux cluster. It can be used for genome-scale protein structure prediction. The pipeline is accessible at http://csbl.bmb.uga.edu/protein_pipeline.

Original language	English (US)
Pages (from-to)	W522-W525
Journal	Nucleic acids research
Volume	32
Issue number	WEB SERVER ISS.
DOIs	https://doi.org/10.1093/nar/gkh414
State	Published - Jul 1 2004
Externally published	Yes

ASJC Scopus subject areas

Genetics

Access to Document

10.1093/nar/gkh414

Cite this

@article{ac8cca727d5c4554bef10eedb426b32d,

title = "PROSPECT-PSPP: An automatic computational pipeline for protein structure prediction",

abstract = "Knowledge of the detailed structure of a protein is crucial to our understanding of the biological functions of that protein. The gap between the number of solved protein structures and the number of protein sequences continues to widen rapidly in the postgenomics era due to long and expensive processes for solving structures experimentally. Computational prediction of structures from amino acid sequence has come to play a key role in narrowing the gap and has been successful in providing useful information for the biological research community. We have developed a prediction pipeline, PROSPECT-PSPP, an integration of multiple computational tools, for fully automated protein structure prediction. The pipeline consists of tools for (i) preprocessing of protein sequences, which includes signal peptide prediction, protein type prediction (membrane or soluble) and protein domain partition, (ii) secondary structure prediction, (iii) fold recognition and (iv) atomic structural model generation. The centerpiece of the pipeline is our threading-based program PROSPECT. The pipeline is implemented using SOAP (Simple Object Access Protocol), which makes it easier to share our tools and resources. The pipeline has an easy-to-use user interface and is implemented on a 64-node dual processor Linux cluster. It can be used for genome-scale protein structure prediction. The pipeline is accessible at http://csbl.bmb.uga.edu/protein_pipeline.",

author = "Guo, {Jun Tao} and Kyle Ellrott and Chung, {Won Jae} and Dong Xu and Serguei Passovets and Ying Xu",

note = "Funding Information: The authors would like to thank Ms Shiming Dong and Mr Abhishek Chugh for their technical help. This work is in part supported by the Office of Biological and Environmental Research, US Department of Energy, under Contract DE/FG-2-04ER63714. It is also funded in part by the US Department of Energy{\textquoteright}s Genomes to Life program (www.doegenomestolife. org) under the project {\textquoteleft}Carbon Sequestration in Synecococcus sp.: from Molecular Machines to Hierarchical Modeling{\textquoteright} (www.genomes-to-life.org).",

year = "2004",

month = jul,

day = "1",

doi = "10.1093/nar/gkh414",

language = "English (US)",

volume = "32",

pages = "W522--W525",

journal = "Nucleic acids research",

issn = "0305-1048",

publisher = "Oxford University Press",

number = "WEB SERVER ISS.",

}

TY - JOUR

T1 - PROSPECT-PSPP

T2 - An automatic computational pipeline for protein structure prediction

AU - Guo, Jun Tao

AU - Ellrott, Kyle

AU - Chung, Won Jae

AU - Xu, Dong

AU - Passovets, Serguei

AU - Xu, Ying

N1 - Funding Information: The authors would like to thank Ms Shiming Dong and Mr Abhishek Chugh for their technical help. This work is in part supported by the Office of Biological and Environmental Research, US Department of Energy, under Contract DE/FG-2-04ER63714. It is also funded in part by the US Department of Energy’s Genomes to Life program (www.doegenomestolife. org) under the project ‘Carbon Sequestration in Synecococcus sp.: from Molecular Machines to Hierarchical Modeling’ (www.genomes-to-life.org).

PY - 2004/7/1

Y1 - 2004/7/1

N2 - Knowledge of the detailed structure of a protein is crucial to our understanding of the biological functions of that protein. The gap between the number of solved protein structures and the number of protein sequences continues to widen rapidly in the postgenomics era due to long and expensive processes for solving structures experimentally. Computational prediction of structures from amino acid sequence has come to play a key role in narrowing the gap and has been successful in providing useful information for the biological research community. We have developed a prediction pipeline, PROSPECT-PSPP, an integration of multiple computational tools, for fully automated protein structure prediction. The pipeline consists of tools for (i) preprocessing of protein sequences, which includes signal peptide prediction, protein type prediction (membrane or soluble) and protein domain partition, (ii) secondary structure prediction, (iii) fold recognition and (iv) atomic structural model generation. The centerpiece of the pipeline is our threading-based program PROSPECT. The pipeline is implemented using SOAP (Simple Object Access Protocol), which makes it easier to share our tools and resources. The pipeline has an easy-to-use user interface and is implemented on a 64-node dual processor Linux cluster. It can be used for genome-scale protein structure prediction. The pipeline is accessible at http://csbl.bmb.uga.edu/protein_pipeline.

AB - Knowledge of the detailed structure of a protein is crucial to our understanding of the biological functions of that protein. The gap between the number of solved protein structures and the number of protein sequences continues to widen rapidly in the postgenomics era due to long and expensive processes for solving structures experimentally. Computational prediction of structures from amino acid sequence has come to play a key role in narrowing the gap and has been successful in providing useful information for the biological research community. We have developed a prediction pipeline, PROSPECT-PSPP, an integration of multiple computational tools, for fully automated protein structure prediction. The pipeline consists of tools for (i) preprocessing of protein sequences, which includes signal peptide prediction, protein type prediction (membrane or soluble) and protein domain partition, (ii) secondary structure prediction, (iii) fold recognition and (iv) atomic structural model generation. The centerpiece of the pipeline is our threading-based program PROSPECT. The pipeline is implemented using SOAP (Simple Object Access Protocol), which makes it easier to share our tools and resources. The pipeline has an easy-to-use user interface and is implemented on a 64-node dual processor Linux cluster. It can be used for genome-scale protein structure prediction. The pipeline is accessible at http://csbl.bmb.uga.edu/protein_pipeline.

UR - http://www.scopus.com/inward/record.url?scp=3242887524&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=3242887524&partnerID=8YFLogxK

U2 - 10.1093/nar/gkh414

DO - 10.1093/nar/gkh414

M3 - Article

C2 - 15215441

AN - SCOPUS:3242887524

SN - 0305-1048

VL - 32

SP - W522-W525

JO - Nucleic acids research

JF - Nucleic acids research

IS - WEB SERVER ISS.

ER -

PROSPECT-PSPP: An automatic computational pipeline for protein structure prediction

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this