Newsletter contents... UP


Sc: measuring shape complementarity at protein-protein interfaces

Michael C. Lawrence

Biomolecular Research Institute, 343 Royal Parade, Parkville, Victoria 3052,
AUSTRALIA

Current address: CSIRO Health Sciences and Nutrition, 343 Royal Parade, Parkville, Victoria 3052, Australia

email: mike.lawrence@hsn.csiro.au


Background

The statistic Sc (Lawrence & Colman, 1993) measures the geometric surface complementarity of protein-protein interfaces. Sc depends both on the relative shape of the surfaces with respect to each other and on the extent to which the interaction brings individual elements of the opposing surfaces into proximity. The first dependence is via the use of normal products, and the second dependence is via an inverse exponential dependence on the distance of separation. The original SC software was distributed as a developmental version. The author's current version has now been released in CCP4 and is completely revised. It is considerably faster and easier to use than the developmental version.

In the original article the authors computed Sc for a number of different types of protein / protein interfaces and concluded that protein antibody / protein antigen interfaces appeared on average less complementary than both protein subunit / protein subunit interfaces and protein / protein inhibitor interfaces. This result was rationalized on the basis that the evolution of an antibody-antigen interface occurs in a different fashion to that of a protein subunit / protein subunit interface or a protein / protein inhibitor interface. It also appeared consistent with the observation that antigen binding sites on antibodies contain on average a higher percentage of aromatic residues (Padlan, 1990) than other interfaces - these residues having fewer conformational degrees of freedom than smaller hydrophobic or polar residues and their prevalence may well be expected to lead to less intimate packing. Sc has also been used to examine the shape complementarity of a T cell receptor in complex with a self peptide bound to a class I MHC molecule (Garcia et al., 1998; Ysern et al., 1998). The shape complementarity Sc was yet lower here than the average for protein antibody / protein antigen interfaces, presumably consistent with the need for a relatively low affinity interaction between the T-cell receptor and the peptide-MHC complex. More recently, higher Sc values have been reported for a T-cell receptor in complex with a foreign peptide bound to a Class II MHC molecule (Reinherz et al., 1999).

Definition of Sc

Consider two interacting molecules A and B and their molecular surfaces (Figure 1).

Figure 1: molecular surfaces of two interacting molecules

For each molecule we compute that portion of the molecular surface that is buried from the solvent via the interaction with the other molecule - these portions are termed PA and PB respectively. A peripheral band is then removed from each of these buried surfaces PA and PB by excluding that area of each that is within a distance d of the solvent-exposed portion of the respective molecular surface. The resultant subset of each buried surface is termed PiA and PiB respectively, where the subscript i denotes "interior". In the above Figure, accessible surface is shown dotted, peripheral surface is shown dashed and interior surface as unbroken line.

For each point xA in PA we find its nearest neighbour x'A on PiB. Let nA be the outwardly-oriented surface normal at xA and n'A be the inwardly-oriented surface normal at x'A. Define the scalar function

SA->B(xA) = (nAn'A) exp[-w (|xA-x'A|)2]

on the surface PiA, where w is a scalar weight.

Likewise by considering all points xB on surface PiB we may define

SB->A(xB) = (nBn'B) exp[-w (|xB-x'B|)2]

where x'B is the nearest point to xB on PA, nB the outwardly-oriented normal at xB and n'B the inwardly-oriented normal at x'B.

Sc is then defined as

Sc = ({SA->B}+{SB->A}) / 2

where braces denote the median of the distribution of SA->B(xA) and SB->A(xB) values over PiA and PiB respectively. Use of the median reduces the dependence of Sc on points that are outliers in the respective distributions.

Numeric calculation of Sc can be achieved via approximating the buried surface surfaces as uniformly distributed sets of points ("dots") sampled in the fashion outlined by (Connolly, 1983). In the standard calculation of Sc (Lawrence & Colman, 1993), d = 1.5 Å, w = 0.5 Å-2 and the surface sampling density is 15 dots / Å-2.

Cross comparison of Sc values

As is clear from the definition above, Sc depends not only upon the atomic coordinates, but also upon a set of parameters, and hence any published value for Sc should state the values used for these parameters. The impact of these various values upon Sc are as follows:

  1. Atomic coordinates

    The atomic coordinates underlie the definition of the protein interface. These coordinates have an error associated with them and the reliability of Sc will consequently be lower for less well-determined structures. However, (Lawrence & Colman, 1993) argue that coordinate error, given its relatively random nature, may not impact greatly on the value of Sc.

    The inclusion of solvent molecules within the interface needs special consideration. (Lawrence & Colman, 1993) suggest performing two calculations of Sc – first with the solvent associated with one molecule and then with the other, and simply taking the average. In other circumstances it may be more appropriate to omit the solvent altogether.

  2. Probe and atomic radii

    These radii define the molecular surface and altering them will alter Sc. However, a change in atomic radius at a given site on the interface would not be anticipated to alter substantially the normal product of juxtaposed surface elements, its effect would be through the distance exponential.

  3. Width d of excluded interface periphery

    The periphery of the buried interface is excluded from consideration in Sc for the reasons outlined above. Decreasing the width of the excluded band will decrease Sc via the inclusion of intrinsically non-complementary surface.

  4. Distance weighting factor w

    The exponential distance weighting factor w acts as a scale factor for the "fit" of the surfaces. Altering this parameter should not affect Sc for highly complementarity surfaces, but will have a significant effect on Sc for more poorly fitting surfaces.

  5. Surface point density

    Under-sampling the molecular surfaces should be avoided as this will have a marked impact on Sc. Test calculations show that the computation is stable at a dot density of around 15 / Å2.

The CCP4 version SC is distributed with the same set of radii and default parameters that accompanied the original software obtained from the author and which were used in the calculations cited in (Lawrence & Colman, 1993). These values should be used for all calculations if cross-comparison is to be made with already-published literature values of Sc.

Major improvements contained in the CCP4 version of SC

A number of major improvements have been made to the SC software:-

  1. Surface calculation is now carried out within the program, rather than piping intermediate results to Connolly's MS program in a stand-alone fashion. The program also now incorporates Connolly's new mds subroutine (obtainable from http://www.biohedron.com), which has a faster surface generation algorithm than the original MS.

  2. Selection of interface atoms is now done via a distance metric, avoiding the need for the initial low density surface computation.

  3. Molecule definition is now handled via a chain/residue/atom name parser.

  4. The interface to GRASP (Nicholls, 1993) is now incorporated directly in the program.

  5. Graphical output of SA->B, SB->A and distance histograms is now provided in CCP4 xloggraph-compatible form.

Conclusions

Sc is established as a useful tool quantifying the geometrical packing of protein interfaces and the new CCP4 version SC should greatly facilitate its use.

Given the large increase in the number of structures available since the original work of (Lawrence & Colman, 1993), it would be appropriate to re-assess the original conclusions regarding the complementarity of antibody-antigen interfaces compared to other forms of protein-protein interfaces, and also to consider extending these calculations to assess the broader range protein protein-receptor complexes that are now available. Such work is in progress in our laboratory (see for example (Epa and Colman, 2001)).

Acknowledgments

I thank my colleague Brian Smith for assistance with coding the new version of SC, Michael Connolly for making available the mds subroutine and Peter Briggs for incorporating CCP4 compatibility into SC.


References

Connolly, M. L. (1983). Analytical molecular surface calculation. J. Appl. Crystallogr. 16, 548-558.

Epa, V.C. & Colman, P.M. (2001). Shape and Electrostatic Complementarity at Viral Antigen-Antibody Complexes. Curr. Topics. Microbiol. Immunol. (in press).

Garcia, K. C., Degano, M., Pease, L. R., Huang, M., Peterson, P. A., Teyton, L. & Wilson, I. A. (1998). Structural basis of plasticity in T cell receptor recognition of a self peptide-MHC antigen. Science 279, 1166-1172.

Lawrence, M. C. & Colman, P. M. (1993). Shape complementarity at protein/protein interfaces. J. Mol. Biol. 234, 946-950.

Nicholls, A. J. (1993). GRASP: graphical representation and analysis of surface properties. Biophys. J. 64, A116.

Padlan, E. A. (1990). On the nature of antibody combining sites: unusual structural features that may confer on these sites an enhanced capacity for binding ligands. Proteins 7, 112-124.

Reinherz, E. L., Tan, K., Tang, L., Kern, P., Liu, J., Xiong, Y., Hussey, R. E., Smolyar, A., Hare, B., Zhang, R., Joachimiak, A., Chang, H. C., Wagner, G. & Wang, J. (1999). The crystal structure of a T cell receptor in complex with peptide and MHC class II. Science 286, 1913-21.

Ysern, X., Li, H. & Mariuzza, R. A. (1998). Imperfect interfaces. Nature Struct. Biol. 5, 412-414.


Newsletter contents... UP