Advances in Computing
p-ISSN: 2163-2944 e-ISSN: 2163-2979
2012; 2(2): 23-28
doi: 10.5923/j.ac.20120202.04
A. Gandhimathi , Anu G. Nair , R. Sowdhamini
National Centre for Biological Sciences (TIFR), UAS-GKVK Campus, Bellary Road, Bangalore 560065, India
Correspondence to: R. Sowdhamini , National Centre for Biological Sciences (TIFR), UAS-GKVK Campus, Bellary Road, Bangalore 560065, India.
Email: |
Copyright © 2012 Scientific & Academic Publishing. All Rights Reserved.
Structure comparison is used to reveal the similarity between protein structures. Every method has its own strength and weakness and the assessment parameters need to be appropriate to the original question on performance of the method. Here, we have assessed three multiple structure-based sequence alignment programs and compared their results. The results suggest that superfamily members which have low sequence identity (<40%) can be aligned using flexible structure alignment methods followed by methods which consider multiple structural features like COMPARER. This kind of structural analysis protocol appears to produce more relevant results, due to consideration of large number of structural features, rather than pure geometric features.
Keywords: Structure Alignment, Outliers, Domain swapping, protein evolution, distant relationships
Figure 1. MUSTANG and MATT comparison. Protein domains that belong to a superfamily are shown in the best fit form after superposition.(a) Superfamily Phou-like (SCOP code: 109755) (b) Superfamily Heat shock protein 70kD (HSP70), C-terminal subdomain (SCOP code: 100934). Result of superposition obtained by MUSTANG is shown to the left and that obtained by MATT is shown to the right. In both these examples, results obtained by MATT are better |
Figure 2. shows the structure alignment of L27 domain (SCOP ID 101288). Fig 2(a) shows MATT derived alignment and the structural superposition (left) and COMPARER refined alignment and the structural superposition (right). More number of gaps are introduced in the MATT alignment shown in highlighted box. Fig 2(b) shows the same as Fig. 2(a) but for MUSTANG-derived alignment. Alignment is improved, as shown in highlighted circle, with less number of gaps |
|
Figure 3. Identifying structurally deviant members in Glutathione synthetase ATP-binding domain-like superfamily. (a) & (b) shows the representative structure of this superfamily and alignment of 20 protein domains. (c) &(d) shows the two outliers belonging to Succinyl-CoA synthetase family |
Figure 4. (a) shows the alignment of TorD superfamily by MUSTANG. MUSTANG is able to align the linker conserved motif (E(Q)PxDH) of swapped and non-swapped domains. Figure 4(b) shows COMPARER- refined alignment where initial equivalences are from MATT. MATT fails to align the conserved motif although MATT followed by COMPARER gives rise to an alignment where the conserved motif is equivalent |
[1] | Sierk ML, Kleywegt GJ, “Deja vu all over again: finding and analyzing protein structure similarities”, Structure, vol.12, no.12, pp.2103-2111, 2004. |
[2] | Carugo O, “Recent progress in measuring structural similarity between proteins”, Current Protein & Peptide Science, vol.8, no.3, pp.219–241, 2007. |
[3] | Godzik A, “The structural alignment between two proteins: is there a unique answer?” Protein Science, vol.5, no.7, pp.1325 –1338, 1996. |
[4] | Christoph B, Christine S. S, Peter L, “Accuracy analysis of multiple structure alignments”, Protein Science, Vol.18, PP.2027-2035, 2009. |
[5] | Liu W, Srivastava A, Zhang J, “A Mathematical Framework for Protein Structure Comparison.” PLoS Computational Biology, vol. 7, no.2, e1001075, 2011. |
[6] | Aysam G, Knapp E, “GIS: a comprehensive source for protein structure similarities”, Nucleic Acids Research, Vol. 38, pp. W46–W52, 2010. |
[7] | Joseph A.P, Srinivasan N, Alexandre G, “Improvement of protein structure comparison using a structural alphabet” Biochimie vol.93,pp. 1434-1445, 2011. |
[8] | Kleywegt G, “Use of non-crystallographic symmetry in protein structure refinement”, Acta Crystallographica D, Biological Crystallography, vol.52, no.4, pp.842–57, 1996. |
[9] | Ye Y and Godzik A, “Flexible structure alignment by chaining aligned fragment pairs allowing twists”, Bioinformatics, vol.19, no.2, pp.ii246-ii255, 2003. |
[10] | Jewett AI, Huang CC, Ferrin TE, “MINRMS: an efficient algorithm for determining protein structure similarity using root-mean-squared-distance”, Bioinformatics, vol.19, no.5, pp.625-634, 2003. |
[11] | Holm L, Sander C, “Protein structure comparison by alignment of distance matrices”, Journal of Molecular Biology, vol.233, no.1, pp.123-138, 1993. |
[12] | Dror O, Benyamini H, Nussinov R, Wolfson H, “MASS: multiple structural alignment by secondary structures”, Bioinformatics vol.19, no.1, pp.i95–i104, 2003. |
[13] | Menke M, Berger B, Cowen L, “Matt: local flexibility aids protein multiple structure alignment”, PLoS Computational Biology, vol.4, no.1, pp.88-99, 2008. |
[14] | Shatsky M, Nussinov R, Wolfson HJ, “A method for simultaneous alignment of multiple protein structures”, Proteins, vol.56, no.1, pp.143–156, 2004. |
[15] | Konagurthu AS, Whisstock JC, Stuckey PJ, Lesk AM, “MUSTANG: a multiple structural alignment algorithm”, Proteins, vol.64, no.3, pp.559–574, 2006. |
[16] | Ye Y, Godzik A, “Multiple flexible structure alignment using partial order graphs”, Bioinformatics, vol.21, no.10, pp.2362– 2369, 2005. |
[17] | Madhusudhan M,S. et al. Alignment of multiple protein structures based on sequence and structure features. Protein Engineering Design &. Selection, vol. 22, pp. 569–574, 2009. |
[18] | Sheng W, Jian P,Jinbo X, “Alignment of distantly related protein structures: algorithm,bound and implications to homology modeling”, vol. 27, no.18,pp. 2537–2545, 2011. |
[19] | Sali A, Blundell TL, “Definition of general topological equivalence in protein structures: A procedure involving comparison of properties and relationships through simulated annealing and dynamic programming”, Journal of Molecular Biology, vol.212, no.2, pp.403-428, 1990. |
[20] | S. Kalaimathy, R. Sowdhamini and K. Kanagarajadurai, “Critical Assessment of Structure-based Sequence Alignment Methods at Distant Relationships”, Briefings in Bioinformatics, vol.12, no.2, pp.163-175, 2010. |
[21] | Gandhimathi A, Nair A and Sowdhamini R, “PASS2.4: An update of database of structure-based sequence alignments of structural domain superfamilies”, Nucleic Acids Research, vol.40, no.D1, pp.D531-D534, 2012. |
[22] | Mizuguchi K, Deane C. M, Blundell T.L, Johnson M. S and Overington J. P, “JOY: protein sequence-structure representation and analysis”, Bioinformatics, vol.14, no.7, pp.617–623, 1998. |
[23] | Sutcliffe M. J, Haneef I, Carney D and Blundell T. L, “Knowledge based modeling of homologous proteins, Part I: Three-dimensional frameworks derived from the simultaneous superposition of multiple structures”, Protein Engineering, vol.1, no.5, pp.377–384, 1987. |
[24] | Dinescu A, Cundari TR, Bhansali VS, Luo JL and Anderson ME, “Function of conserved residues of human glutathione synthetase: implications for the ATP-grasp enzymes”, The Journal of Biological Chemistry, vol.279, no.21, pp.22412-22421, 2004. |
[25] | Bennett M. J, Schlunegger MP and Eisenberg D, “3D domain swapping: a mechanism for oligomer assembly”, Protein Science, vol.4, no.12, pp.2455–2468, 1995. |
[26] | Chia-Han C,et.al, “Detection and Alignment of 3D Domain Swapping Proteins Using Angle-Distance Image-Based Secondary Structural Matching Techniques”, PLoS ONE, vol.5, no.10, e13361,2010. |
[27] | Samuel T, Chantal Nivol,Catherine B, Marianne I,Isabelle M, Vincent M and Jean-Pierre S, “A Novel Protein Fold and Extreme Domain Swapping in the Dimeric TorD Chaperone from Shewanella massilia”, Structure, vol.11, no.2, pp.165–174, 2003. |
[28] | Olivier G, Vincent M, Chantal I, “Multiple roles of TorD-like chaperones in the biogenesis of molybdoenzymes”, FEMS Microbiology Letters, vol.297, no.1, pp.1–9, 2009. |