Posts Tagged ‘London’

A nice example of open data (in London).

Sunday, March 5th, 2017

Living in London, travelling using public transport is often the best way to get around. Before setting out on a journey one checks the status of the network. Doing so today I came across this page: our open data from Transport for London. 

  1. I learnt that by making TFL travel data openly available, some 11,000 developers (sic!) have registered for access, out of which some 600 travel apps have emerged.
  2. The data is in XML, which makes it readily inter-operable.[1]
  3. This encourages crowd-sourced innovation.
  4. They have taken the trouble to produce an API (application programmable interface) which allows rich access to the data and information about e.g. AccidentStats, AirQuality, BikePoint, Journey, Line, Mode, Occupancy, Place, Road, Search, StopPointVehicle.

Chemists could learn some lessons here! Of course, there are quite a few chemical databases with APIs that are examples of open data, but the “ESI” (electronic supporting information) sources which almost all published articles rely upon to disseminate data are clearly struggling to cope. Take for example this recent article[2], where much of the data has been dropped into the inevitable PDF “coffin” and which is a breathtaking 907 pages long. To give the authors their due, they also provide 20 CIF files which ARE good sources of data. Rarely commented on, but clearly missing from the information associated with this (indeed most) articles is the metadata about the data. Thus the metadata for these CIF files amounts to just e.g. 229. To find out the context, one has to scour the article (or the 907 pages of the ESI) to identify compound 229 (I strongly suspect it’s a molecule because of the implied semantics of the term, not because its been explicitly declared). You will not find the metadata at e.g. data.datacite.org which is one open aggregator and global search engine based on deposited metadata.

I have commented elsewhere on this blog that other types of data could also be enhanced in the manner that CIF crystallographic files represent. For example the Mpublish NMR project, examples of which are shown here, and for which typical data AND its metadata can be seen at DOI: 10.14469/hpc/1053. I fancy that if this method had been adopted,[2] those 907 pages might have shrunk somewhat, although of course not entirely. But my hope is that gradually the innovative chemistry community will find ways of exhuming more and more data from the PDF coffin and in the process reducing the paginated lengths of the PDF-based ESI further, perchance eventually even to zero?

If you are yourself preparing an article and sweating over the ESI at this very moment, do please take a look at the Mpublish method and how perhaps it can help make your NMR data at least more useful to others.


I understand an article describing this project is in preparation. If you cannot wait, this recent application of the Mpublish project has some details.[3]

References

  1. P. Murray-Rust, and H.S. Rzepa, "Chemical Markup, XML, and the Worldwide Web. 1. Basic Principles", Journal of Chemical Information and Computer Sciences, vol. 39, pp. 928-942, 1999. https://doi.org/10.1021/ci990052b
  2. J.M. Lopchuk, K. Fjelbye, Y. Kawamata, L.R. Malins, C. Pan, R. Gianatassio, J. Wang, L. Prieto, J. Bradow, T.A. Brandt, M.R. Collins, J. Elleraas, J. Ewanicki, W. Farrell, O.O. Fadeyi, G.M. Gallego, J.J. Mousseau, R. Oliver, N.W. Sach, J.K. Smith, J.E. Spangler, H. Zhu, J. Zhu, and P.S. Baran, "Strain-Release Heteroatom Functionalization: Development, Scope, and Stereospecificity", Journal of the American Chemical Society, vol. 139, pp. 3209-3226, 2017. https://doi.org/10.1021/jacs.6b13229
  3. M.J. Harvey, A. McLean, and H.S. Rzepa, "A metadata-driven approach to data repository design", Journal of Cheminformatics, vol. 9, 2017. https://doi.org/10.1186/s13321-017-0190-6

The atom and the molecule: A one-day symposium on 23 March, 2016 celebrating Gilbert N. Lewis.

Friday, December 11th, 2015

You might have noticed the occasional reference here to the upcoming centenary of the publication of Gilbert N. Lewis’ famous article entitled “The atom and the molecule“.[1] A symposium exploring his scientific impact and legacy will be held in London on March 23, 2016, exactly 70 years to the day since his death. A list of the speakers and their titles is shown below; there is no attendance fee, but you must register as per the instructions below.


Royal Society of Chemistry Historical Group Meeting on 23th March 2016, Burlington House, Piccadilly, London: The atom and the molecule: A symposium celebrating Gilbert N. Lewis.

  • Dr Patrick Coffey (Berkeley, USA): Does Personality Influence Scientific Credit? Simultaneous Priority Disputes: Lewis vs. Langmuir and Langmuir vs. Harkins
  • Professor Robin Hendry (Durham, UK): Lewis on Structure and the Chemical Bond
  • Professor Alan Dronsfield (UK): An organic chemist reflects on the Lewis two-electron bond
  • Dr Julia Contreras-García (UPMC, France): Do bonds need a name?
  • Professor Nick Greeves (Liverpool, UK): The influence of Lewis on organic chemistry teaching, textbooks and beyond
  • Professor Clark Landis (UWM, USA): Lewis and Lewis-like Structures in the Quantum Era
  • Professor Michael Mingos (Oxford, UK): The Inorganic dimension to Lewis and Kossel’s landmark contributions
  • Dr Patrick Coffey (Berkeley, USA): Lewis’ Life, Death, and Missing Nobel Prize

Prior registration is essential. Please email your name and address to Professor John Nicholson,  jwnicholson01 @ gmail.com


References

  1. G.N. Lewis, "THE ATOM AND THE MOLECULE.", Journal of the American Chemical Society, vol. 38, pp. 762-785, 1916. https://doi.org/10.1021/ja02261a002

Chemistry in the early 1960s: a reminiscence.

Monday, December 22nd, 2014

I started chemistry with a boxed set in 1962. In those days they contained serious amounts of chemicals, but I very soon ran out of most of them. Two discoveries turned what might have been a typical discarded christmas present into a lifelong career and hobby.

The first was 60 Stoke Newington High Street in north London, the home of Albert N. Beck, Chemist (or his son; my information comes from a historical listing of the shops present on the high street in 1921). I would set out from our home in London SW6 on the #73 bus route (top deck) and it would take about an hour to arrive. On entering the shop, I ventured down a set of stairs into the basement to replenish the chemicals with sensible stocks, and purchase the odd glassware, filter paper, etc. And then venture back across London carrying the proceeds of many weeks, possibly months worth of hoarded pocket-money (apart that is from 1 shilling every two weeks which I reserved for football at Craven Cottage). At some stage, health and safety legislated against 12-year-old boys (and certainly also girls) purchasing chemicals in this manner! However, I can assure you all that I never came to any harm with anything I purchased at A. N. Beck and Sons. Apart that is from giving my parents a good fright.

The second was coming across this book by A. J. Mee. I had thought it was well and truly lost; imagine my delight when I recently found it at home, complete with chemical stains, and dated as from a reprint in 1959.

IFOn the inside cover, I found one shopping list from my expeditions to A. N. Beck and Sons. The price 1/6 is the representation of one shilling and six pence (more than the price of a football match, or perhaps £50 in today’s money? I think football was much cheaper then! Oh, 1/6 is 7.5p in the decimal currency of today, or £0.075). Note that iodine was one of the items purchased. And note the wish list at the bottom! I was clearly starting to do organic chemistry.

shopping-list

The pages of this book list 289 experiments, and I assiduously recorded a tick against all the ones I actually did. This is a typical page (click to expand).

IFThus expt 205 is the preparation of 1,3,5-tribromobenzene from 1,3,5-tribromoaniline (ticked), followed by that of o-cresol from o-toluidine (ticked). You can see how all the aromatic rings are still represented by what now looks like cyclohexane. This book gave me many hours of delightful recreation (I have not counted the ticks, but I think I attempted around half the experiments). Note in particular the huge scale these experiments were done at; 18g of product (I suspect I must have scaled them down a fair bit in order to preserve pocket money). Expt 198 was that of benzidine, of which I do recollect preparing  ~2g. No warnings then about the extremely carcinogenic nature of this substance! Chemistry has certainly changed since then.

Lost unfortunately is the laboratory book where I recorded my results, but one or two samples still exist!

 

 

A computed mechanistic pathway for the formation of an amide from an acid and an amine in non-polar solution.

Wednesday, November 12th, 2014

In London, one has the pleasures of attending occasional one day meetings at the Burlington House, home of the Royal Society of Chemistry. On November 5th this year, there was an excellent meeting on the topic of Challenges in Catalysisand you can see the speakers and (some of) their slides here. One talk on the topic of Direct amide formation – the issues, the art, the industrial application by Dave Jackson caught my interest. He asked whether an amide could be formed directly from a carboxylic acid and an amine without the intervention of an explicit catalyst. The answer involved noting that the carboxylic acid was itself a catalyst in the process, and a full mechanistic exploration of this aspect can be found in an article published in collaboration with Andy Whiting’s group at Durham.[1] My after-thoughts in the pub centered around the recollection that I had written some blog posts about the reaction between hydroxylamine and propanone. Might there be any similarity between the two mechanisms?

amide

That mechanism can be represented as above, which (as per the hydroxylamine mechanism) comprises three transition states and two intermediates. The original study[1] reported just the one TS1. Editing out the starting coordinates from the PDF-based supporting information (the process is not always easy) enabled an IRC (intrinsic reaction coordinate) for TS1 to be easily computed.[2]

origa

origa
This reveals that TS1 is not the complete story, there is still much of the reaction left to complete. The energy profile is charted (using the ωB97XD/6-311G(d,p/SCRF=p-xylene method) according to the scheme above as reactants TS1Intermediate 1TS2Tetrahedral intermediateTS3products. Computed properties for this more detailed pathway are transcluded here from the digital repository[3] and appear at the end of this post.

  1. TS1 yields what might be called a zwitterionic intermediate. However, this has a relatively small dipole moment (5.7D). Thus, against accepted wisdom, such apparently ionic intermediates CAN be involved in reactions occurring in non-polar solvents!
  2. TS2 is rather unexpected, involving synchronous proton transfer coupled to anomerically related C-OH bond rotation. This rotation changes the anomeric interactions with the adjacent substituents; in my experience I have never before seen a reaction mode quite like this one!
  3. TS3 collapses the tetrahedral intermediate by synchronous proton transfer and C-O bond cleavage, and is (in this model) the rate determining step.  The free energy barrier corresponds to a half-life at 298K of about half an hour.
  4. The product is calculated as exoenergic with respect to reactants,; the reaction does drive to form an amide (and any catalysis of course will not influence that final outcome, only its kinetics).

If you read the original article[1] you will realise the above only scratches the surface of the many fascinating properties of this apparently very simple reaction. Thus, not addressed above is why amides are only formed in certain solvents (xylene for example) but not others. The solvent may have a specific role to play which is not modelled simply by its continuum dielectric or its boiling point. There is much else that could be said.


References

  1. H. Charville, D.A. Jackson, G. Hodges, A. Whiting, and M.R. Wilson, "The Uncatalyzed Direct Amide Formation Reaction – Mechanism Studies and the Key Role of Carboxylic Acid H‐Bonding", European Journal of Organic Chemistry, vol. 2011, pp. 5981-5990, 2011. https://doi.org/10.1002/ejoc.201100714
  2. H.S. Rzepa, "C21H21NO4", 2014. https://doi.org/10.14469/ch/74636
  3. H.S. Rzepa, "A computed mechanistic pathway for the formation of an amide from an acid and an amine in non-polar solution.", 2014. https://doi.org/10.6084/m9.figshare.1235300

Computers 1967-2013: a personal perspective. Part 5. Network bandwidth.

Wednesday, June 5th, 2013

In a time of change, we often do not notice that Δ = ∫δ. Here I am thinking of network bandwidth, and my personal experience of it over a 46 year period.

I first encountered bandwidth in 1967 (although it was not called that then). I was writing Algol code to compute the value of π, using paper tape to send the code to the computer. Unfortunately, the paper tape punch was about 10 km from that computer. The round trip (by van) took about a week, the outcome being often merely to discover that the first line of the code contained a compilation error. I think I got to computing π after about six weeks. That is a bandwidth of about 18 characters (108 bits) in 3628800 seconds, or 0.00003 bits per second.

I did my undergraduate work in 1969, when the distance between the card punch and the computer had reduced to about 50m, and instant turnaround involved circulating in a loop between the punch and the line printer, hoping that neither suffered a paper-wreck. The bandwidth had certainly gone up. On a good day, you could make 20 or so circuits, which did leave one feeling faintly dizzy. 

The next improvement came in 1972, when I was solving non-linear equations for kinetic rate constants, using a 110 bits per second (baud) or ~ 18 characters per second using the 6-bit computers of that era) teletypewriter. This was about 50m from the lab where the kinetic measurements were made (using, if you are interested a scintillation counter. Yes, I was mildly radioactive for most of my PhD, but I do not believe I glowed in the dark). This bandwidth was in fact fine for uploading kinetic data, and receiving the computed rate constant and its standard error. You might note however that this teletypewriter was the only one in the building I occupied, and yet demand for it was small (I was pretty much its only user). 

The next increment occurred in Texas 1974-1977, where I was now doing quantum chemical calculations. Back in time to the card punch and the lineprinter (Texas is big, and so now the distance between them was a 10 minute walk). But in my last year there, a state-of-the-art 300 baud teletypewriter was installed! This was now fast enough to play a computer game (something to do with Dragons and Dungeons I think), and so now there was competition to use it. Particularly from one of my friends, who shall be called George, and who on one occasion spent about 48 virtually contiguous hours trying to get to the last level. The rest of us returned to the card punch to submit the calculations. It was also during this period that the first emails started to be exchanged, but only really as a curiosity: “it would never catch on” was the opinion of most.

Back in the UK by 1977, I was overwhelmed by the speed of the 9.6 kbaud graphics terminal I now had access to, 32 times faster. And the rate continued to multiply, by a further 1000 to attain 10 Mbaud in 1987. But another change occurred during this period. The previous eras had involved transmitting the data no more than ~200m, from one point in the campus to another. But by 1986, if one tried hard enough, one could reach ARPANET. And that was 5000 km away! My first use of such distances was to reach California and download Apple’s system 5.0 for the Macs in the department (I have described elsewhere the role the Mac’s printer port played in this). From then on, we always did have the latest operating system installed on most of the machines (although not always did this subterfuge address the intended issue, which was to stop the computer crashing as often).

These speeds however did not reach beyond the university. Back home, around 1983, I was back to using a 300 baud modem, with an acoustic coupler to the land line. Our young daughter, aged 3 at the time, joined in the data transmission with gusto. Her joyful shrieks were invariably picked up by the acoustic coupler, and translated into a jumble of characters, which were then interleaved into the numbers coming back from quantum calculations. It was sometimes difficult to tell them apart! These domestic modems gradually got faster, probably attaining 9.6 kbaud by about 1993 (during the course of which the acoustic component was replaced by electronics, and oddly, our daughter stopped shrieking in quite the same way). 

Back in the university in 1993, the first 100 megabits per second (100Mbps ≅100 Mbaud) ethernet lines and switches were being installed, but the national and international backbones were still a lot slower. It was in this year that I was approached to be part of a SuperJanet project. We were going to do a molecular videoconference from London to Cambridge and Leeds; a three-way connection, and this needed ~ 20Mbps to transmit the signal from the video camera as well as the 3D images of molecules in real-time (compression techniques were not so advanced in those days). Because BT was sponsoring the project, they naturally wanted some publicity, and so we even got to appear on the national television news that night. But we came within about 1 minute of a disaster. Our 20Mbps connection went through the SuperJanet national backbone, the capacity of which was, you guessed, ~ 20 Mbps. The network operators (located at the Rutherford-Appleton laboratories), who we had not had the foresight to pre-warn, came within 1 minute of isolating Imperial College from the national network because of our bandwidth hogging. I met them a month or so later, and they told me this. I feel I was lucky to escape with my life and body intact from that meeting (or to put it another way, they were not happy bunnies). 

By about 2000, I had achieved 1 Gbps to my desktop computer (and there it has stayed for the past 13 years). What about home? Well, to cut the story short, I recently benchmarked the domestic WiFi connection between a laptop and “the world” at about 65 Mbps (download) and 18 Mbps (upload), a little less than 1 million times greater than 30 years earlier and a 12 orders of magnitude greater than in 1967. I gather however that some lucky inhabitants of Austin Texas (the scene of my 1974-1977 experiments), courtesy of Google, can get 1 Gbps!

I will end by quoting Samuel Butler, writing in 1863I venture to suggest that … the general development of the human race to be well and effectually completed when all men, in all places, without any loss of time, at a low rate of charge, are cognizant through their senses, of all that they desire to be cognizant of in all other places. … This is the grand annihilation of time and place which we are all striving for, and which in one small part we have been permitted to see actually realised” (Quoted in George Dyson, “Darwin amongst the Machines, The Evolution of Global Intelligence”, Addison-Wesley, N.Y., 1997. ISBN 0-201-400649-7).


I just benchmarked my office computer (using only solid-state memory and that 1Gbps connection) and got 58Mbps (download)/75Mbps (upload).

The standard program was NCSA Telnet if  I remember. You made a connection from the computer (using its printer port) to the ARPANET node at University College London (not a widely advertised service), and thence to an Apple FTP site where one could initiate an anonymous file transfer back to one’s computer.  System 5 was about half a Mbyte then, and this took about 1-2 hours to retrieve (unless the connection went down, in which case one started again).

More joining up of pieces. Stereocontrol in the ring opening of cyclopropenes.

Thursday, July 12th, 2012

Years ago, I was travelling from Cambridge to London on a train. I found myself sitting next to a chemist, and (as chemists do), he scribbled the following on a piece of paper. When I got to work the next day Vera (my student) was unleashed on the problem, and our thoughts were published[1]. That was then.

This is now. I have just finished a post on ring-opening reactions of oxirene, a 4n electron anti-aromatic ring. I was casting around for an example of a calculation done just before the modern Internet era, and happened upon the above. Although this was a mere 20 years ago, much of the detail had faded; I had not thought much about it in the intervening years, but I did recollect that although we had attributed the high stereoselectivity shown above to a stereoelectronic orbital alignment, I was not entirely happy with the interpretation. Put simply, we had relied on a molecular orbital diagram, and this diagram (in resplendent colour in the journal, one of the few being so published at that time, and for no colour charge to boot) was just too complicated. Arguably it was the fixated complexity (I remember at the time that it looked complicated no matter what the viewing angle was) that set me on the road to the use of the Web, and ultimately here to this blog. So I thought a re-analysis using modern algorithms and presentation might help simplify. The newly recalculated transition state (ωB97XD/6-311G(d,p) looks like:

Transition state for ring opening of a cyclopropene. Click for 3D.

  1. The reaction is a 4n (n=1) electron electrocyclic ring opening, and so according to the rules, should proceed with the formation/cleavage of an antarafacial bond. You might think that there are not quite enough substituents to reveal this stereochemistry, but there are if the carbene lone pair is included. So how to add the lone pair?
  2. Well, its coordinates can be computed using the ELF (electron localisation function). The relevant lone pair is ringed in red below. Using (old technology, i.e. a static figure) you may choose to believe me when I argue that this lone pair is above the plane of the forming ring from the perspective shown, whilst the terminus of the bond it forms is to the bottom. This defines an antarafacial component. Well, I might have carefully manipulated the viewing angle to show this. Now, in 2012 rather than 1992, you can load the 3D coordinates by clicking below, and check for yourself!

    Lone pair centroid for the transition state. Click for 3D

  3. What about the stereo-control? Take a look at the angle between the axis of the C-Cl bond (atoms ringed in blue) and the centroid of the carbene lone pair (red). It is about 162°, or almost anti-periplanar. A magic orientation in organic chemistry. Time to attack the orbitals again. Our published diagram looked as below. It shows the HOMO aligning with the LUMO+2 (if your eyes are not distracted by all the other detail).
    But we can now simplify such a complex molecular orbital by using instead a localized version, an NBO. A little explanation is needed. The NBO orbital shown with red/blue phases is antibonding for the C-Cl bond. That with orange/purple is the carbene lone pair. Where orange overlaps with red, we have a positive overlap that stabilises the system. The NBO E2 perturbation energy is around 4.6 kcal/mol. Although this may seem small, it is actually quite large for a through-space interaction of this type. It is this stabilisation (amounting to ~ 1.6 kcal/mol in free energy) that accounts for the high selectivity for the stereoisomer shown above.

    NBO for transition state. Click for 3D.

Well, I think that the passage of 20 years has enabled us to tidy up the origins of the stereoelectronic effect responsible for controlling this reaction, and to produce clearer diagrams which the reader can interactively explore for themselves. It did take 20 years to join things up though!

References

  1. M.S. Baird, J.R. Al Dulayymi, H.S. Rzepa, and V. Thoss, "An unusual example of stereoelectronic control in the ring opening of 3,3-disubstituted 1,2-dichlorocyclopropenes", Journal of the Chemical Society, Chemical Communications, pp. 1323, 1992. https://doi.org/10.1039/c39920001323