Newsletter contents... UP


CCP4 Logo

ANISOANL - analysing anisotropic displacement parameters

Martyn Winn

Daresbury Laboratory,
Daresbury,
Warrington
WA4 4AD, U.K.
m.d.winn@dl.ac.uk


Introduction

With improving data collection techniques, there are an increasing number of structures being solved to atomic resolution (1.2A or better). At such resolutions, anisotropic displacement parameters (ADPs) can be determined, providing detailed information on the averaged atomic displacements (see e.g. Merritt (1999) for a recent review). However, with six parameters per atom from the symmetric U tensors which represent the ADPs, there is perhaps an overabundance of information, and there is a need for simple methods with which to interpret the data.

ANISOANL is a new CCP4 program (available with version 4.1) which provides some simple tools for analysing ADPs. In this article, I will give an overview of these tools. Examples are taken from a 1.5A structure of barnase (PDB id 1a2p) and a 1.15A structure of myoglobin (PDB id 1a6g). For information on running the program, please see the program documentation. For information on obtaining the program, please see the CCP4 web pages.

Plots derived from ADPs

The PLOT option provides a series of plots in a similar style to the program BAVERAGE for the isotropic case. Several plots are given, reflecting the more complex information given by ADPs. The equivalent isotropic displacement parameter (Uiso = trace(U)/3), gives a measure of the size of the overall displacement. The anisotropy A, defined as the ratio of the smallest to the largest eigenvalue of U, gives a measure of how non-spherical the thermal ellipsoid is (1 implies complete isotropy, while 0 is extreme anisotropy). An anisotropic ADP can be either prolate (cigar-like) or oblate (disc-like). These two cases are discriminated by the value of the ratio of the middle to the largest eigenvalue of U, which is 1 for oblate and equal to A for prolate.

barnase: 1a2p

This graph shows some plots for chain A of barnase. ANISOMEAN1 is the anisotropy A, averaged over main chain atoms for each residue, while PROLMEAN1 is the prolate/oblate discriminator, similarly averaged. Most residues have an average anisotropy in the range 0.2 to 0.6 which is fairly typical. The value of PROLMEAN generally lies closer to A than to unity, implying a tendency towards prolate ellipsoids. R2FROMORIG shows the square of the distance from the centre of mass of the molecule.

Rosenfield's rigid-body postulate

ADPs are useful for discerning detailed motions at an atomic level, for example at catalytic sites (with the usual provisos that ADPs bundle all kinds of displacements and errors, and there is no information on interatomic correlations). But it is also often useful to identify displacements operating at a larger scale, for example displacements of secondary structure elements. The simplest example of this is the case of a group of atoms moving as a quasi-rigid body.

Rosenfield et al. (1978) proposed a `rigid-body postulate' based on refined ADPs. Since interatomic distances within a rigid body are fixed, the difference in the projections of the ADPs of 2 atoms in a rigid body (the `Delta' value) must be zero. Note that this applies to all pairs of atoms in the rigid body, and not just bonded pairs. This is a necessary but not sufficient condition for rigid-body displacements. In any case, since proteins are never completely rigid, we can only identify possible quasi-rigid groups from low values of Delta for a set of atoms. Schneider (1996) has applied this approach to the protein SP445.

ANISOANL will produce a postscript figure giving Delta values between pairs of atoms, averaged over a number of bins. For example, using the main chain atoms of barnase, one gets the following figure:

barnase: 1a2p

Light shading corresponds to low values of Delta, while dark shading corresponds to high values of Delta. Possible rigid-body behaviour is indicated by blocks of light shading. The blocks may or may not be contiguous along the protein chain. These plots are usually very noisy, but it is usually possible to discern some structure (perhaps with the help of the `Geophys' team!). In this example, it is possible to identify the three molecules of barnase that occur in the asymmetric unit (labelled as residues 1-108, 109-216 and 217-324). Thus it appears that the 3 molecules each move as a quasi-rigid body.

To analyse these results further, it is necessary to explicitly identify the three molecules as three rigid bodies. This is done via the TLSIN file, which in this case is:


TLS                                                                       
RANGE  'A   3.' 'A 110.' MNCH

TLS                                                                       
RANGE  'B   3.' 'B 110.' MNCH

TLS                                                                       
RANGE  'C   3.' 'C 110.' MNCH

Each "TLS" record begins a new rigid group, consisting of the atoms specified in one or more "RANGE" records.

ANISOANL plots the distribution of Delta values for each rigid group, and for all pairs of atoms belonging to different rigid groups (the OFFDIAG plot):

barnase: 1a2p

The distribution is similar for each putative rigid group, while the OFFDIAG plot is skewed towards higher values of Delta. This implies that pairs of atoms within a molecule satisfy the rigid body postulate better than pairs of atoms from different molecules.

A second example, which looks at the internal structure of a single molecule rather than several whole molcules, is given by the 1.15A structure of a myoglobin-CO complex (PDB id 1a6g). The plot of the Delta matrix looks like:

myoglobin-CO: 1a6g

Again there is a lot of noise, but it is possible to pick out possible pseudo-rigid regions at residues 1-21, 21-41, 51-81, 81-101 and 101-151. In fact, these regions correspond closely to helices A (residues 3-18), B and C (20-35,36-42), D and E (51-57,58-77), F (86-95) and G and H (100-118,124-149) respectively. Looking at the inter-helix Delta values, helix F stands out in particular as having large Delta values, and thus not being part of any larger pseudo-rigid group. It appears that helices, or pairs of helices, form the relevant units for describing large scale displacements in myoglobin.

Fitting TLS parameters

Given refined ADPs for a structure, one can try to fit TLS parameters describing rigid body motion to these ADPs (see Schomaker and Trueblood (1968)). Note that this is distinct from refining TLS parameters directly against Xray data. Also, the fitting will maximise the contribution of TLS, and therefore overestimate rigid-body motion. ANISOANL will fit TLS parameters with the FITTLS option, using TLS groups specified in the TLSIN file.

I have done this for the barnase structure treating each molecule as a separate TLS group (i.e. using the TLSIN file given in the previous section). The success of the fitting can be assessed by comparing the equivalent isotropic displacement parameter, the anisotropy and the prolate/oblate discriminator derived from the TLS parameters (labelled UISOMEAN2, ANISOMEAN2 and PROLMEAN2) to those derived from the refined ADPs. This is shown for chain C of barnase (this shows the best fit, though chains A and B are similar):

barnase: 1a2p

barnase: 1a2p

barnase: 1a2p

The fit is fairly good, and thus TLS provides a good first-order description of the refined ADPs. The discrepancies show, however, that there is some detail unaccounted for by the pseudo-rigid body description.

I have also fitted TLS parameters to the myoglobin structure. Following the results of the Delta-matrix analysis given above, I have used 6 TLS groups based on the alpha helices, with helices D and E being treated as a single group, and likewise helices G and H:


TLS                                                                                 
RANGE  'A   3.' 'A  18.'  FIT  MNCH                                    

TLS                                                                                 
RANGE  'A  20.' 'A  35.'  FIT  MNCH                                    

TLS                                                                                 
RANGE  'A  36.' 'A  42.'  FIT  MNCH                                    

TLS                                                                                 
RANGE  'A  51.' 'A  57.'  FIT  MNCH                                    
RANGE  'A  58.' 'A  77.'  FIT  MNCH                                    

TLS                                                                                 
RANGE  'A  86.' 'A  95.'  FIT  MNCH                                    

TLS                                                                                 
RANGE  'A 100.' 'A 118.'  FIT  MNCH                                    
RANGE  'A 124.' 'A 149.'  FIT  MNCH                                    

The resultant TLS parameters are used to calculate ADPs which are compared to the refined ADPs as follows:

myoglobin-CO: 1a6g

myoglobin-CO: 1a6g

myoglobin-CO: 1a6g

The fit is clearly very good for both Uiso and the anisotropic components (residues with UISOMEAN2 = 0 and ANISOMEAN2 = PROLMEAN2 = 1 are those not included in any TLS group).

A more detailed rigid-body analysis of 1a6g was given by Vojtechovsky et al. (1999) who fitted TLS parameters to helices E and F, as well as to the heme group. They concluded on the basis of this fit that helix F and the heme group had ADPs consistent with rigid-body displacements, while helix E is less well described by a rigid body model.

These authors gave TLS parameters for helix F (and the heme group) which can be compared with the present results (analysis of the TLS tensors was done using the CCP4 program TLSANL):


                  T axes and eigenvalues (A**2)    L axes and eigenvalues (deg**2)

                     [-0.20, -0.25,  0.95] 0.18      [ 0.07, -0.16,  0.98]  27.2
Vojtechovsky et al.  [ 0.06, -0.97, -0.24] 0.09      [ 0.19, -0.97, -0.17]   4.3
                     [ 0.98,  0.01,  0.21] 0.08      [ 0.98,  0.20, -0.04]   0.6

                     [-0.26, -0.30,  0.92] 0.21      [ 0.15, -0.11,  0.98]  34.1
This study:          [ 0.00, -0.95, -0.31] 0.09      [ 0.35, -0.92, -0.15]   6.7
                     [ 0.97, -0.08,  0.25] 0.15      [ 0.93,  0.37, -0.10]  -2.0

There are some discrepancies due to differences in the atom selection used for the TLS groups, but the overall picture is the same. (The negative value for the 3rd eigenvalue of L is unphysical, but allowed by the fitting procedure, and illustrates that there is some over-fitting.) In particular, there is a dominating libration parallel to the Z axis, and approximately parallel to the helix axis. These axes may be displayed using the AXES option of TLSANL.

Program usage

The program takes two input files with logical names XYZIN and TLSIN. XYZIN is a standard PDB file, with ADPs recorded on ANISOU lines. The TLSIN file is used for atom selection: often it is used to define TLS groups, but it can be used more generally. If the RIGIDBODY option is used, then a postscript file PSOUT is written containing the Delta-matrix plot. If the FITTLS option is used for fitting TLS groups to the input ADPs, then two more files are written out: TLSOUT contains the fitted TLS parameters, XYZOUT contains residual ADPs (the difference in the input ADPs and those derived from the TLS parameters).

In the version released with 4.1, ANISOANL can only be run from the command line. However, a task interface for ccp4i is in preparation, and will be made available soon. For full details of the program, see the distributed documentation.

References

  1. E.A.Merritt Acta Cryst., D55, 1109 (1999)
  2. R.E.Rosenfield, K.N.Trueblood and J.D.Dunitz, Acta Cryst, A34, 828 (1978)
  3. T.R.Schneider, Proc. CCP4 Study Weekend, 133 (1996).
  4. V.Schomaker and K.N.Trueblood, Acta Cryst., B24, 63 (1968)
  5. J.Vojtechovsky, K.Chu, J.Berendzen, R.M.Sweet and I.Schlichting, Biophysical Journal, 77, 2153 (1999).


Newsletter contents... UP