Posts Tagged ‘high energy species’

Why are α-helices in proteins mostly right handed?

Saturday, April 9th, 2011

Understanding why and how proteins fold continues to be a grand challenge in science. I have described how Wrinch in 1936 made a bold proposal for the mechanism, which however flew in the face of much of then known chemistry. Linus Pauling took most of the credit (and a Nobel prize) when in a famous paper in 1951 he suggested a mechanism that involved (inter alia) the formation of what he termed α-helices. Jack Dunitz in 2001 wrote a must-read article on the topic of “Pauling’s Left-handed α-helix” (it is now known to be right handed).  I thought I would revisit this famous example with a calculation of my own and here I have used the ωB97XD/6-311G(d,p) DFT procedure to calculate some of the energy components of a small helix comprising (ala)6 in both left and right handed form.

Firstly, it is important to note that Pauling was apparently not aware of the absolute handedness of amino acids (which are (S) in CIP terminology). This had in fact only been established a few months before Pauling’s publication by Bijvoet, and news of this might not have reached Pauling. So Pauling guessed (or perhaps, he had already built his models, and did not have time to reconstruct them) and his famous α-helix diagram turned out to be the enantiomer of the real McCoy. As with DNA itself, the helix bears a diastereomeric relationship to the chirality of the amino acids; both have to be inverted to get the proper enantiomer (which is what Pauling did). The secret that Pauling had discovered was hydrogen bonding, and particular, weak N-H…O=C interactions (Wrinch had thought it was strong covalent N-C-OH bonding instead). Of course, there are other effects at work, which include van der Waals or dispersion interactions, electrostatic effects resulting from the large dipoles in peptides (not least due to the zwitterionic character), the planarity of the peptide bond itself, the potential for other types of hydrogen bond (e.g. C-H…O) and entropic effects. I have split some of these down for left and right handed forms of DNA in another post.

It turns out calculating most of these effects on an even-handed basis is not that easy. Only the recent advent of dispersion-corrected DFT procedures, together with solvation algorithms that allow for accurate geometry optimisation and subsequent evaluation of free energies allows such a calculation to be performed. Hitherto, it has been mostly molecular mechanics that has been used (which itself relies on many parameters from quantum mechanics, such as atom charges, and explicitly identifying interactions for hydrogen bonding). By returning to a quantum-mechanical model, some of these assumptions inherent in the mechanics method need not be made.

We showed in 1991 that an effective solvation treatment required for the zwitterionic form of amino acids in aqueous solutions would ideally comprise not only a self-consistent-reaction-field, but also explicit water molecules as solvent. Here only the former solvation term is included, but expanding the model to include water is certainly possible. Both the zwitterionic and the neutral forms of (ala)6 are included below, so that the effect of a large dipole on the structure and relative helical stability can be estimated. One notes that (even in a dielectric cavity corresponding to water), the extended zwitterions are high energy species.  In a protein, they of course would be stabilized by the immediate environment of the ions. The right-handed helix clearly comes out as more stable (by about 1 kcal/mol per residue, see also DOI: 10.1021/ja960665u),  but this is not really due to either dispersion effects or entropy and must therefore arise largely from the hydrogen-bond like interactions. Ionizing the termini to form a zwitterion increases the propensity for a right handed helix slightly.

Relative thermodynamic energies (kcal mol-1) of (ala)6 α-helices
System Total energy Dispersion ΔΔH298 Δ(T.ΔS298) ΔΔG298
Left, neutral 0.0
0.0 0.0 0.0 0.0
Right, neutral -4.0
+0.2 -4.0 0.9 -4.9
Left, zwitterion 0.0 0.0 0.0 0.0 0.0
Right, zwitterion -7.1 0.1 -6.3 1.7 -8.0

Shown below are the calculated structures. The chains have (inter alia) unusual bifurcated hydrogen-bonding interactions, between one carbonyl group and two N-H groups (show as atom with halo). These are not quite the models that Linus Pauling built!

Left handed. Click for 3D

Right handed. Click for 3D

Left handed zwitterion. Click for 3D

Right handed zwitterion. Click for 3D

For a more objective analysis of the interactions within the system, a QTAIM analysis is shown below.

Left helix. Bond critical points in green. Click for 3D.

Right helix. Click for 3D

Whilst the overall conclusion is that theory agrees well with the experimental observation that peptide sequences tend to coil into right rather than left handed helices,  the reasons they do so is a little more subtle than simple model building alone can reveal.  As the AIM shows, a plethora of unusual and weaker interactions occur within these helices, a full analysis of which must await presentation elsewhere.

An NCI analysis reveals strong hydrogen bonds as blue-shaded surfaces.

NCI surface. Click for 3D.

The Cyclol Hypothesis for protein structure: castles in the air.

Monday, April 4th, 2011

Most scientific theories emerge slowly, over decades, but others emerge fully formed virtually overnight as it were (think  Einstein in 1905). A third category is the supernova type, burning brightly for a short while, but then vanishing (almost) without trace shortly thereafter. The structure of DNA (of which I have blogged elsewhere) belongs to the second class, whilst one the brightest (and now entirely forgotten) examples of the supernova type concerns the structure of proteins. In 1936, it must have seemed a sure bet that the first person to come up with a successful theory of the origins of the (non-random) relatively rigid structure of proteins would inevitably win a Nobel prize (and of course this did happen for that other biologically important system, DNA, some 17 years later). Compelling structures for larger molecules providing reliable atom-atom distances based on crystallography were still in the future in 1936, and so structural theories contained a fair element of speculation and hopefully inspired guesswork (much as cosmological theories appear to have nowadays!).

Dorothy Wrinch was a mathematician who came up with just such a hypothesis for rigid protein structure, based in effect on elegance and symmetry, coupled with some knowledge of chemistry and crystallography[1]. She had noticed that the repeating polypeptide motif might be folded such that a cyclisation could occur to give what she termed a cyclol (an organic chemist would call this an aminol, and we would also now recognize it as a three-fold tetrahedral intermediate of the type involved in the hydrolysis of peptides). Wrinch proposed that this cyclisation could be repeated on a large scale to produce rigid scaffolds for proteins. The three-fold symmetric elegance of such motifs clearly appealed to this mathematician (the interesting symmetrical and conformational properties of the central cyclohexane-like ring were still to be fully recognised by anyone. Since Wrinch built many 3D models of her cyclols, one can but wonder how that central ring was represented, and whether its chair conformation was at all recognised. Another Nobel prize awaited the discoverer of this, Derek Barton).

The Cyclol structure. Click for 3D.

An immense controversy immediately broke out (not least because little direct spectroscopic evidence for the OH groups could be found). The story is rivetingly told by Patrick Coffey in his book Cathedrals of Science (ISBN 978-0-19-532134-0). Linus Pauling entered the fray in 1939[2], and one of the arguments he deployed was not so much symmetric elegance but thermodynamics (he also suggested hydrogen bonding and  S-S linkages for rigidifying proteins). The proposed cyclisation, he suggested, led to a very high energy species. Whilst Wrinch attempted to refute this[3], Pauling’s arguments won almost everyone over. Although Wrinch forlornly continued to promote her idea, last reviewing the topic as late as in 1963[4], crystallography was now producing cast iron data for protein structures. None have ever emerged with a cyclol motif, and this hypothesis is now firmly consigned to untaught history[5]. To this day, no examples of the tris(aminol) cyclol ring are to be found in the Cambridge small molecule crystal structure database either (although some related tetrahedral intermediates are known as crystalline species, see for example here, and they can be quite easily characterised in solution, see for example[6].

When  I read the story, it struck me that modern theory could easily verify how valid Pauling’s thermodynamic argument was. I have picked (ala)6 as my model, and have calculated the relative free energy (ΔG298) of the following three isomers.

  1. An acyclic zwitterionic form of this hexapeptide, calculated with a SCRF reaction field for water to allow for the ionic nature (ωB97XD/6-31G(d,p)), reveals a proton transfer to a neutral system, with an energy of +7.3 kcal/mol

    Acyclic (ala)6, in zwitterionic form

  2. A cyclic neutral peptide, which results from elimination of water from 1, again calculated with a water reaction field (DOI: 10042/to-8219), revealing a relative free energy of +0.0 kcal/mol

    Cyclic (ala)6

  3. The cyclic isomer 3 resulting from further cyclisation of 2 (DOI: 10042/to-8222) with a relative free energy of +69.0 kcal/mol

    Cyclol model for (ala)6.

From this, it appears that model 3 is ~69 kcal/mol less stable than the cyclic peptide 2, or 11.6 kcal/mol per amino acid residue. Pauling’s thermodynamic arguments suggested a value of ~28 kcal/mol (a value which Wrinch disputed as unreliable). So, in one sense, the above calculation is closer to Wrinch than to Pauling! In another, it still means Wrinch was wrong!! It is worth speculating why Pauling’s estimate is out. The cyclol 3 exhibits anomeric stabilizations, which of course were unknown in Pauling’s time. Both 2 and 3 exhibit attractive, but different, van der Waals attractions which contribute to their stabilities. And Pauling took no account of any entropy differences between 2 and 3. In retrospect,  3 was simply too rigid to allow most enzyme catalysis models to function, as we recognise them nowadays.

You might ask why I have revived a long forgotten theory as the topic of this post. Well, I think it is always worth revisiting the past, and re-examining old assumptions. When we do so, we find that Wrinch did not miss by as much as her detractors perhaps implied. With a little more luck, she might have gotten it right. Science is a bit like that, you need a dose of luck sometimes!

References

  1. "The cyclol hypothesis and the “globular” proteins", Proceedings of the Royal Society of London. Series A - Mathematical and Physical Sciences, vol. 161, pp. 505-524, 1937. https://doi.org/10.1098/rspa.1937.0159
  2. L. Pauling, and C. Niemann, "The Structure of Proteins", Journal of the American Chemical Society, vol. 61, pp. 1860-1867, 1939. https://doi.org/10.1021/ja01876a065
  3. D.M. Wrinch, "The Geometrical Attack on Protein Structure", Journal of the American Chemical Society, vol. 63, pp. 330-333, 1941. https://doi.org/10.1021/ja01847a004
  4. D. WRINCH, "Recent Advances in Cyclol Chemistry", Nature, vol. 199, pp. 564-566, 1963. https://doi.org/10.1038/199564a0
  5. C. Tanford, "How protein chemists learned about the hydrophobic factor", Protein Science, vol. 6, pp. 1358-1366, 1997. https://doi.org/10.1002/pro.5560060627
  6. H.S. Rzepa, A.M. Lobo, M.M. Marques, and S. Prabhakar, "Characterizing a tetrahedral intermediate in an acyl transfer reaction: An undergraduate 1H NMR demonstration", Journal of Chemical Education, vol. 64, pp. 725, 1987. https://doi.org/10.1021/ed064p725