A protein map and its application

Stephen S.T. Yau, Chenglong Yu, Rong He

Research output: Contribution to journalArticlepeer-review

63 Citations (Scopus)


Graphical representation of gene sequences provides a simple way of viewing, sorting, and comparing various gene structures. Here we first report a two-dimensional graphical representation for protein sequences. With this method, we constructed the moment vectors for protein sequences, and mathematically proved that the correspondence between moment vectors and protein sequences is one-to-one. Therefore, each protein sequence can be represented as a point in a map, which we call protein map, and cluster analysis can be used for comparison between the points. Sixty-six proteins from five protein families were analyzed using this method. Our data showed that for proteins in the same family, their corresponding points in the map are close to each other. We also illustrate the efficiency of this approach by performing an extensive cluster analysis of the protein kinase C family. These results indicate that this protein map could be used to mathematically specify the similarity of two proteins and predict properties of an unknown protein based on its amino acid sequence.

Original languageEnglish
Pages (from-to)241-250
Number of pages10
JournalDNA and Cell Biology
Issue number5
Publication statusPublished or Issued - 1 May 2008
Externally publishedYes

ASJC Scopus subject areas

  • Molecular Biology
  • Genetics
  • Cell Biology

Cite this