A computational pipeline for protein structure prediction and analysis at genome scale

Manesh Shah, Sergei Passovets, Dongsup Kim, Kyle Ellrott, Li Wang, Inna Vokler, Philip LoCascio, Dong Xu, Ying Xu

Research output: Contribution to journalArticle

15 Citations (Scopus)

Abstract

Motivation: Experimental techniques alone cannot keep up with the production rate of protein sequences, while computational techniques for protein structure predictions have matured to such a level to provide reliable structural characterization of proteins at large scale. Integration of multiple computational tools for protein structure prediction can complement experimental techniques. Results: We present an automated pipeline for protein structure prediction. The centerpiece of the pipeline is our threading-based protein structure prediction system PROSPECT. The pipeline consists of a dozen tools for identification of protein domains and signal peptide, protein triage to determine the protein type (membrane or globular), protein fold recognition, generation of atomic structural models, prediction result validation, etc. Different processing and prediction branches are determined automatically by a prediction pipeline manager based on identified characteristics of the protein. The pipeline has been implemented to run in a heterogeneous computational environment as a client/server system with a web interface. Genome-scale applications on Caenorhabditis elegans, Pyrococcus furiosus and three cyanobacterial genomes are presented.

Original languageEnglish (US)
Pages (from-to)1985-1996
Number of pages12
JournalBioinformatics
Volume19
Issue number15
DOIs
StatePublished - Oct 12 2003
Externally publishedYes

Fingerprint

Protein Structure Prediction
Genome
Pipelines
Genes
Proteins
Protein
Branch Prediction
Prediction
Computational Techniques
Client/server
Structural Model
Protein Sequence
Pyrococcus furiosus
Peptides
Fold
Membrane
Complement
Triage
Structural Models
Caenorhabditis elegans

ASJC Scopus subject areas

  • Clinical Biochemistry
  • Computer Science Applications
  • Computational Theory and Mathematics

Cite this

A computational pipeline for protein structure prediction and analysis at genome scale. / Shah, Manesh; Passovets, Sergei; Kim, Dongsup; Ellrott, Kyle; Wang, Li; Vokler, Inna; LoCascio, Philip; Xu, Dong; Xu, Ying.

In: Bioinformatics, Vol. 19, No. 15, 12.10.2003, p. 1985-1996.

Research output: Contribution to journalArticle

Shah, M, Passovets, S, Kim, D, Ellrott, K, Wang, L, Vokler, I, LoCascio, P, Xu, D & Xu, Y 2003, 'A computational pipeline for protein structure prediction and analysis at genome scale', Bioinformatics, vol. 19, no. 15, pp. 1985-1996. https://doi.org/10.1093/bioinformatics/btg262
Shah, Manesh ; Passovets, Sergei ; Kim, Dongsup ; Ellrott, Kyle ; Wang, Li ; Vokler, Inna ; LoCascio, Philip ; Xu, Dong ; Xu, Ying. / A computational pipeline for protein structure prediction and analysis at genome scale. In: Bioinformatics. 2003 ; Vol. 19, No. 15. pp. 1985-1996.
@article{8439ac8aab5740ec8cb48b8f2379f554,
title = "A computational pipeline for protein structure prediction and analysis at genome scale",
abstract = "Motivation: Experimental techniques alone cannot keep up with the production rate of protein sequences, while computational techniques for protein structure predictions have matured to such a level to provide reliable structural characterization of proteins at large scale. Integration of multiple computational tools for protein structure prediction can complement experimental techniques. Results: We present an automated pipeline for protein structure prediction. The centerpiece of the pipeline is our threading-based protein structure prediction system PROSPECT. The pipeline consists of a dozen tools for identification of protein domains and signal peptide, protein triage to determine the protein type (membrane or globular), protein fold recognition, generation of atomic structural models, prediction result validation, etc. Different processing and prediction branches are determined automatically by a prediction pipeline manager based on identified characteristics of the protein. The pipeline has been implemented to run in a heterogeneous computational environment as a client/server system with a web interface. Genome-scale applications on Caenorhabditis elegans, Pyrococcus furiosus and three cyanobacterial genomes are presented.",
author = "Manesh Shah and Sergei Passovets and Dongsup Kim and Kyle Ellrott and Li Wang and Inna Vokler and Philip LoCascio and Dong Xu and Ying Xu",
year = "2003",
month = "10",
day = "12",
doi = "10.1093/bioinformatics/btg262",
language = "English (US)",
volume = "19",
pages = "1985--1996",
journal = "Bioinformatics",
issn = "1367-4803",
publisher = "Oxford University Press",
number = "15",

}

TY - JOUR

T1 - A computational pipeline for protein structure prediction and analysis at genome scale

AU - Shah, Manesh

AU - Passovets, Sergei

AU - Kim, Dongsup

AU - Ellrott, Kyle

AU - Wang, Li

AU - Vokler, Inna

AU - LoCascio, Philip

AU - Xu, Dong

AU - Xu, Ying

PY - 2003/10/12

Y1 - 2003/10/12

N2 - Motivation: Experimental techniques alone cannot keep up with the production rate of protein sequences, while computational techniques for protein structure predictions have matured to such a level to provide reliable structural characterization of proteins at large scale. Integration of multiple computational tools for protein structure prediction can complement experimental techniques. Results: We present an automated pipeline for protein structure prediction. The centerpiece of the pipeline is our threading-based protein structure prediction system PROSPECT. The pipeline consists of a dozen tools for identification of protein domains and signal peptide, protein triage to determine the protein type (membrane or globular), protein fold recognition, generation of atomic structural models, prediction result validation, etc. Different processing and prediction branches are determined automatically by a prediction pipeline manager based on identified characteristics of the protein. The pipeline has been implemented to run in a heterogeneous computational environment as a client/server system with a web interface. Genome-scale applications on Caenorhabditis elegans, Pyrococcus furiosus and three cyanobacterial genomes are presented.

AB - Motivation: Experimental techniques alone cannot keep up with the production rate of protein sequences, while computational techniques for protein structure predictions have matured to such a level to provide reliable structural characterization of proteins at large scale. Integration of multiple computational tools for protein structure prediction can complement experimental techniques. Results: We present an automated pipeline for protein structure prediction. The centerpiece of the pipeline is our threading-based protein structure prediction system PROSPECT. The pipeline consists of a dozen tools for identification of protein domains and signal peptide, protein triage to determine the protein type (membrane or globular), protein fold recognition, generation of atomic structural models, prediction result validation, etc. Different processing and prediction branches are determined automatically by a prediction pipeline manager based on identified characteristics of the protein. The pipeline has been implemented to run in a heterogeneous computational environment as a client/server system with a web interface. Genome-scale applications on Caenorhabditis elegans, Pyrococcus furiosus and three cyanobacterial genomes are presented.

UR - http://www.scopus.com/inward/record.url?scp=0142210248&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0142210248&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/btg262

DO - 10.1093/bioinformatics/btg262

M3 - Article

VL - 19

SP - 1985

EP - 1996

JO - Bioinformatics

JF - Bioinformatics

SN - 1367-4803

IS - 15

ER -