Exploring protein fold space by secondary structure prediction using data distribution method on Grid platform

Soojin Lee, Min Kyu Cho, Jin Won Jung, Jai Hoon Kim, Weontae Lee

Research output: Contribution to journalArticle

11 Citations (Scopus)

Abstract

Motivation: Since the newly developed Grid platform has been considered as a powerful tool to share resources in the Internet environment, it is of interest to demonstrate an efficient methodology to process massive biological data on the Grid environments at a low cost. This paper presents an efficient and economical method based on a Grid platform to predict secondary structures of all proteins in a given organism, which normally requires a long computation time through sequential execution, by means of processing a large amount of protein sequence data simultaneously. From the prediction results, a genome scale protein fold space can be pursued. Results: Using the improved Grid platform, the secondary structure prediction on genomic scale and protein topology derived from the new scoring scheme for four different model proteomes was presented. This protein fold space was compared with structures from the Protein Data Bank, database and it showed similarly aligned distribution. Therefore, the fold space approach based on this new scoring scheme could be a guideline for predicting a folding family in a given organism.

Original languageEnglish
Pages (from-to)3500-3507
Number of pages8
JournalBioinformatics
Volume20
Issue number18
DOIs
Publication statusPublished - 2004 Dec 12

Fingerprint

Structure Prediction
Data Distribution
Secondary Structure
Fold
Grid
Proteins
Protein
Scoring
Databases
Secondary Protein Structure
Biological Phenomena
Approach Space
Proteome
Protein Sequence
Folding
Internet
Genomics
Genome
Guidelines
Topology

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Cite this

Lee, Soojin ; Cho, Min Kyu ; Jung, Jin Won ; Kim, Jai Hoon ; Lee, Weontae. / Exploring protein fold space by secondary structure prediction using data distribution method on Grid platform. In: Bioinformatics. 2004 ; Vol. 20, No. 18. pp. 3500-3507.
@article{45513b33e58841a08806b633478b8d42,
title = "Exploring protein fold space by secondary structure prediction using data distribution method on Grid platform",
abstract = "Motivation: Since the newly developed Grid platform has been considered as a powerful tool to share resources in the Internet environment, it is of interest to demonstrate an efficient methodology to process massive biological data on the Grid environments at a low cost. This paper presents an efficient and economical method based on a Grid platform to predict secondary structures of all proteins in a given organism, which normally requires a long computation time through sequential execution, by means of processing a large amount of protein sequence data simultaneously. From the prediction results, a genome scale protein fold space can be pursued. Results: Using the improved Grid platform, the secondary structure prediction on genomic scale and protein topology derived from the new scoring scheme for four different model proteomes was presented. This protein fold space was compared with structures from the Protein Data Bank, database and it showed similarly aligned distribution. Therefore, the fold space approach based on this new scoring scheme could be a guideline for predicting a folding family in a given organism.",
author = "Soojin Lee and Cho, {Min Kyu} and Jung, {Jin Won} and Kim, {Jai Hoon} and Weontae Lee",
year = "2004",
month = "12",
day = "12",
doi = "10.1093/bioinformatics/bth435",
language = "English",
volume = "20",
pages = "3500--3507",
journal = "Bioinformatics",
issn = "1367-4803",
publisher = "Oxford University Press",
number = "18",

}

Exploring protein fold space by secondary structure prediction using data distribution method on Grid platform. / Lee, Soojin; Cho, Min Kyu; Jung, Jin Won; Kim, Jai Hoon; Lee, Weontae.

In: Bioinformatics, Vol. 20, No. 18, 12.12.2004, p. 3500-3507.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Exploring protein fold space by secondary structure prediction using data distribution method on Grid platform

AU - Lee, Soojin

AU - Cho, Min Kyu

AU - Jung, Jin Won

AU - Kim, Jai Hoon

AU - Lee, Weontae

PY - 2004/12/12

Y1 - 2004/12/12

N2 - Motivation: Since the newly developed Grid platform has been considered as a powerful tool to share resources in the Internet environment, it is of interest to demonstrate an efficient methodology to process massive biological data on the Grid environments at a low cost. This paper presents an efficient and economical method based on a Grid platform to predict secondary structures of all proteins in a given organism, which normally requires a long computation time through sequential execution, by means of processing a large amount of protein sequence data simultaneously. From the prediction results, a genome scale protein fold space can be pursued. Results: Using the improved Grid platform, the secondary structure prediction on genomic scale and protein topology derived from the new scoring scheme for four different model proteomes was presented. This protein fold space was compared with structures from the Protein Data Bank, database and it showed similarly aligned distribution. Therefore, the fold space approach based on this new scoring scheme could be a guideline for predicting a folding family in a given organism.

AB - Motivation: Since the newly developed Grid platform has been considered as a powerful tool to share resources in the Internet environment, it is of interest to demonstrate an efficient methodology to process massive biological data on the Grid environments at a low cost. This paper presents an efficient and economical method based on a Grid platform to predict secondary structures of all proteins in a given organism, which normally requires a long computation time through sequential execution, by means of processing a large amount of protein sequence data simultaneously. From the prediction results, a genome scale protein fold space can be pursued. Results: Using the improved Grid platform, the secondary structure prediction on genomic scale and protein topology derived from the new scoring scheme for four different model proteomes was presented. This protein fold space was compared with structures from the Protein Data Bank, database and it showed similarly aligned distribution. Therefore, the fold space approach based on this new scoring scheme could be a guideline for predicting a folding family in a given organism.

UR - http://www.scopus.com/inward/record.url?scp=12344272773&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=12344272773&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/bth435

DO - 10.1093/bioinformatics/bth435

M3 - Article

C2 - 15284104

AN - SCOPUS:12344272773

VL - 20

SP - 3500

EP - 3507

JO - Bioinformatics

JF - Bioinformatics

SN - 1367-4803

IS - 18

ER -