Posts Tagged ‘opendata’

(re)Use of data from chemical journals.

Wednesday, December 22nd, 2010

If you visit this blog you will see a scientific discourse in action. One of the commentators there notes how they would like to access some data made available in a journal article via the (still quite rare) format of an interactive table, but they are not familiar with how to handle that kind of data (file). The topic in question deals with various kinds of (chemical) data, including crystallographic information, computational modelling, and spectroscopic parameters. It could potentially deal with much more. It is indeed difficult for any one chemist to be familiar with how data is handled in such diverse areas. So I thought I would put up a short tutorial/illustration in this post of how one might go about extracting and re-using data from this one particular source.

Interactive Journal table

The above is a snapshot of part of the table in question, with a box in the middle set aside for a Jmol applet to appear. What may be less obvious, even to those who have seen such a display before, is the very rich environment available for manipulating the data. To expose some of this, proceed as follows:

  1. Firstly, load a molecule into the Jmol window by clicking on e.g. the hyperlink shown below.

    Loading a molecule

  2. The display shown below will appear, in this case a set of coordinates used to present a 3D model of a molecule, which can be rotated, zoomed, etc. It also has been labelled with various selected bond lengths etc.

    Interactive table with molecule loaded

  3. To extract data, right-click anywhere in the molecule area. Navigate through the menus which appear as shown below. In this case, the data is present in the form of a Gaussian log file. This can contain the history of the particular calculation performed (e.g. a geometry optimisation) or as in this case, all 3N-6 calculated normal vibrational modes. The one of interest here is number 318, being an O=C=O stretching mode.

    An Interactive table in a chemistry journal.

  4. This mode can now be manipulated visually by selecting various parameters:

    Manipulating a vibrational mode

  5. Jmol has a scintillating display of other options, and more are being added all the time, so the above display is by no means the limit of what one can do.
  6. Now to the most important bit. Invoke the menu as shown below, whereupon a copy of the relevant file (gzipped in this case to reduce its size) will be downloaded to your local system. You will now need to use a program on your own computer capable of reading and processing such a file (after unzipping).

    Downloading a data file.

  7. There may be a bewildering variety of programs and toolkits which can perform the operation you wish on such a file. Some are commercial, some are open source. To help people get going, I link to one of the latter type here. You might also want to visit the Quixote project for ideas.
  8. We are not quite finished yet. Perhaps a Gaussian log file does not suit your purpose. Well, now try clicking on this link.

    Link to a digital repository

  9. This produces a page such as the one below, which contains more files. In this example, several molecular identifiers are present (InChI and InChI key) to help establish the identity of the system. The molecular coordinates are available as a .cml file, which can itself be processed by a variety of software tools. The original file used to run the calculation can be inspected as input.gjf (if you want to, e.g., repeat it). Also present are the log file we have seen above and a checkpoint file, which is most useful when using either the Gaussian program system or a visualiser (GaussView, ChemBio3D etc., both commercial programs). A SMILES string is also offered, and sometimes (not in this example) a so-called wavefunction file, which can be used by some programs to analyse the wavefunction and perform e.g. QTAIM, ELF or NCI analyses.

    A digital repository page.

    It is now up to the user to identify suitable processing programs on their computer which fit their purpose.

  10. There is one other file present which I have not yet explained, the mets.xml manifest. This is a metadata file, containing (along with much else) an RDF declaration of some of the properties of the molecule. In theory at least, this file could be automatically harvested for the RDF, which could be injected into a triple store and queried semantically using e.g. SPARQL. That is part of the semantic web.
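Steps 6 and 7 above can be sketched in code. What follows is a minimal Python illustration, not the method of any particular toolkit. It assumes the downloaded file is a gzipped Gaussian log in which the calculated vibrational modes are printed on lines beginning `Frequencies --`; the function names and the sample fragment are invented for the example.

```python
import gzip
import io


def read_frequencies(stream):
    """Collect vibrational frequencies (cm^-1) from the text of a Gaussian log."""
    freqs = []
    for line in stream:
        # Assumption: modes are printed a few per line after "Frequencies --".
        if line.strip().startswith("Frequencies --"):
            freqs.extend(float(v) for v in line.split("--", 1)[1].split())
    return freqs


def frequencies_from_gz(path):
    """Open a gzipped log file (hypothetical path) and return its frequencies."""
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as handle:
        return read_frequencies(handle)


# Tiny made-up fragment standing in for a real log file.
sample = io.StringIO(
    " Frequencies --    667.3000   667.3000  1372.5000\n"
    " Red. masses --     12.8774    12.8774    15.9949\n"
    " Frequencies --   2436.1000\n"
)
modes = read_frequencies(sample)
print(len(modes), modes[-1])
```

With a real file one would call `frequencies_from_gz` on the downloaded archive; since the list is zero-indexed, mode number 318 would be `modes[317]`.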

I hope some of the screenshots here make the process of extracting data from an interactive table article a little more obvious. I must declare that this way of doing it is just one of the ways being explored and also (much to my regret) is not yet particularly common. But hopefully you might capture a little of what some of us believe to be the future of scientific journals.

Data-round-tripping: wherein the future?

Tuesday, December 7th, 2010

Moving (chemical) data around in a manner which allows its (automated) use in whichever context it finds itself must be a holy grail for all scientists and chemists. I posted earlier on the fragile nature of molecular diagrams making the journey between the editing program used to create them (say ChemDraw) and the word processor used to place them into a context (say Microsoft Office), via an intermediate storage area known as the clipboard. The round trip between the Macintosh (OS X) versions of these programs had been broken for a little while, but it is now fixed! A small victory. This blog reports what happens when such a Mac-created Word document is sent to someone using Microsoft Windows as an OS (or vice versa).

As you might have guessed, the molecular diagram arrives largely dead, and not re-usable. Opening the .docx archive (it is nothing more than a zip file) reveals only a JPEG file residing inside. Nothing that can be chemically repurposed. If the reverse process is undertaken, creating a ChemDraw diagram and pasting it into Word on Windows, one finds two components in the .docx: a bit-mapped image linked to an active object containing the data. Only the first of these is recognised if the file makes its way to a Macintosh; i.e. the same story, the data is again lost. So the bottom line is that Mac users and Windows users cannot, after all, exchange repurposable molecular diagrams via Word documents with this combination of programs. This is not good.
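Anyone wishing to repeat this inspection can do so with a few lines of Python, since a .docx is an ordinary zip archive. The sketch below is only an illustration: the function names are invented, and it assumes the folder names `word/media/` (pictures) and `word/embeddings/` (embedded objects) that Word conventionally uses inside the archive. The in-memory archive is a stand-in for a real document.

```python
import io
import zipfile


def list_docx_parts(docx):
    """Return the names of all parts stored inside a .docx (zip) archive."""
    with zipfile.ZipFile(docx) as archive:
        return archive.namelist()


def split_parts(names):
    """Separate picture files from embedded objects by their folder."""
    images = [n for n in names if n.startswith("word/media/")]
    objects = [n for n in names if n.startswith("word/embeddings/")]
    return images, objects


# Build a tiny stand-in archive in memory rather than relying on a real file.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr("word/document.xml", "<w:document/>")
    archive.writestr("word/media/image1.jpeg", b"\xff\xd8")
buffer.seek(0)

images, objects = split_parts(list_docx_parts(buffer))
print("images:", images)
print("embedded objects:", objects)
```

An archive that lists only images and no embedded objects corresponds to the "largely dead" diagram described above.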

But let me remind you what happened around 1993. The word processor was joined by a program called the Web browser. In 1996, the underlying content carrier, HTML, became XHTML (an instance of XML). Almost from day 1, such XHTML could be, and frequently was, repurposed. A memorable example is that search engines could use it to index the Web. The XHTML easily survived trips to and from clipboards. In 1996, CML joined HTML as a way of carrying chemical information capable of round-tripping without loss (if need be). There are other chemical XML languages in use nowadays, including CDXML used by the ChemDraw program. Word itself now uses XML (the x in .docx). So, after 14 years, why am I still describing the difficulties above? I am frankly at a loss to explain why there is still a need to write this post.

All is not entirely lost. The CML4Word approach is designed to enable (chemical) data round-tripping from the outset, although I do not yet know whether the CML created and stored in the Word document using this mechanism is recognised anywhere outside of Word 2007 on Windows. If anyone can let me know of examples where such a CML-enabled Word document can be used in other environments, I would be very grateful (but not on OS X, as I know already).

And as I might have mentioned in the previous post on this topic, things may not however be getting better in that other carrier of information and data, the mobile phone/iPad, as exemplified by operating systems such as iOS or Android. Watch this space, as they say.

Data-round-tripping: moving chemical data around.

Saturday, November 20th, 2010

For those of us who were around in 1985, an important chemical IT innovation occurred. We could acquire a computer (it was called a Macintosh) which could be used to draw chemical structures in one application and, via a mysterious and mostly invisible entity called the clipboard, paste them into a word processor. Perchance even print the result on a laser printer. Most students of the present age have no idea what we used to do before this innovation! Perhaps not in 1985, but at some stage shortly thereafter, and in effect without most people noticing, the return journey also started working, the so-called round trip. It seemed natural that a chemical structure diagram subjected to this treatment could still be chemically edited, and that it could make the round trip repeatedly. Little did we realise how fragile this round trip might be. Years later, the computer and its clipboard, the chemistry software, and the word processor had all moved on many generations (it is important to flag that three different vendors were involved, all using proprietary formats to weave their magic). And (on a Mac at least) the round-tripping no longer worked. Upon its return to the drawing program (ChemDraw in this instance), the diagram had been rendered inert, un-editable, and devoid of semantic meaning unless a human intervened. By the way, this process of data-loss is easily demonstrated even on this blog. The chemical diagrams you see here are similarly devoid of data, being merely bit-mapped JPG images. Which is why, on many of these posts, I put in the caption Click for 3D, which gives you access to the chemical data proper (in CML or other formats). And I throw in a digital repository identifier for good measure should you want a full dataset.

It is only now that we (more specifically, this user) understand what had happened under-the-hood to break this round-tripping. In 1984, when Apple produced the Mac, they also produced a most interesting data format called PICT. A human saw the PICT as a PICTure, but the computer saw more. It (could) see additional data embedded in the PICT. The clipboard supported the PICT format, which meant that both picture and data could be transferred between programs. And ChemDraw and Word also understood this. Hence the ability to round-trip noted above (it has to be said between specifically these programs).

Times moved on and the limitations of PICT set in. Apple refocussed on the PDF format (related, notice, to the PostScript format that Adobe had introduced in order to allow high quality laser printing). PICT support was abandoned, and the various components no longer carried recognisable data (specifically the clipboard or the ability of Word to recognise the data). Round-tripping broke. Does this matter? Well, one colleague where I work had accumulated more than 1000 chemical diagrams, which he decided to store in PowerPoint (and yes, he threw the original ChemDraw files away). The day came when he wanted to round trip one of them. And of course he could not. He was rather upset, I have to say!

PDF was not really a format designed to carry data (see DOI: 10.1021/ci9003688). But, bless their hearts, the three vendors involved in this story all agreed to support data embedded in the PDF hamburger (and Adobe to tolerate it), and now once again a structure diagram can move into an Office program (on Mac) and out again and retain its chemical integrity. What lessons can be learnt?

  1. Firstly, out of sight, out of mind. The clipboard is truly mostly out of sight, and it was not really designed from the outset to preserve data properly. Nowadays I wonder whether clipboards in general recognise XML (and hence CML) and preserve it. I truly do not know. But they should.
  2. Secondly, any system which relies on three or four commercial vendors who, at least in the past, devised proprietary formats which they could change without warning, is bound to be fragile.
  3. We have learnt that data is valuable. More so than the representation of it (i.e. a 2D or 3D structure diagram). But when it's lost, the users should care! And tell the vendors.
  4. Peter Murray-Rust and his team have produced CML4Word (or as Microsoft call it, Chemistry add-in for Word). At its heart is data integrity. Fantastic! But I wonder if it survives on Microsoft’s clipboard (I know it does not on Apple’s, since CML4Word is not available on that OS. And is unlikely to ever become so).
  5. And I can see history about to repeat itself. The same seems about to happen on new devices such as the Apple iPad. It too has copy/paste via a clipboard. I bet this will not round trip chemistry (or much other) data! Want to bet that the lessons of this story have not yet been learnt?

Oh, for those who wish to round-trip chemistry on a Mac, you will have to acquire ChemDraw 12.0.2 and Word 2011 (version 14.01), as well as OS X 10.6 for it to work.

A Digital chemical repository – is it being used?

Tuesday, May 4th, 2010

In this previous blog post I wrote about one way in which we have enhanced the journal article. Associated with that enhancement, and also sprinkled liberally throughout this blog, are links to a Digital Repository (if you want to read all about it, see DOI: 10.1021/ci7004737). It is a fairly specific repository for chemistry, with about 5000 entries. These are mostly the results of quantum mechanical calculations on molecules (together with a much smaller number of spectra, crystal structure and general document depositions). Today, with some help (thanks Matt!), I decided to take a look at how much use the repository was receiving.

  1. The first entry in the log dates from 2008-02-05.
  2. The repository is now receiving about 1200 accesses via handle resolutions each day, which comprises
  3. ~150 unique client IPs, and
  4. ~900 unique handles accessed daily
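For anyone curious how such daily figures might be derived, here is a minimal Python sketch. The log layout (date, client IP and handle separated by spaces), the sample records and the function name are all assumptions made for illustration; the real repository log will certainly look different.

```python
from collections import defaultdict


def daily_summary(lines):
    """Summarise accesses per day from 'date ip handle' records (assumed layout)."""
    stats = defaultdict(lambda: {"accesses": 0, "ips": set(), "handles": set()})
    for line in lines:
        parts = line.split()
        if len(parts) != 3:
            continue  # skip records that do not match the assumed layout
        day, ip, handle = parts
        stats[day]["accesses"] += 1
        stats[day]["ips"].add(ip)
        stats[day]["handles"].add(handle)
    # Report (total accesses, unique client IPs, unique handles) for each day.
    return {
        day: (s["accesses"], len(s["ips"]), len(s["handles"]))
        for day, s in stats.items()
    }


# Invented records; the handles and IP addresses are placeholders.
sample = [
    "2010-02-10 192.0.2.1 10042/to-0001",
    "2010-02-10 192.0.2.1 10042/to-0002",
    "2010-02-10 192.0.2.7 10042/to-0001",
]
summary = daily_summary(sample)
print(summary)
```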

Whilst most of the hits are coming from web spiders by auto-discovery, a fair number (perhaps ~300) of the 5000 entries have also been linked to via journal articles, and of course this blog, and some hits may be presumed to be the result of non-random ping-backs. A breakdown of a typical day (2010-02-10) when 839 unique handles were accessed shows access by, amongst others, five universities, Google/Yahoo, several other information corporations and Microsoft. I had no idea Microsoft was interested in calculations on molecules! You saw that here first!!

Other anecdotal feedback regarding the repository: I often use it to exchange calculations with collaborators, sending them the handle instead of a vast checkpoint or log file. Some collaborators, it has to be said, are baffled by the interface presented to them (which was designed in large measure by DSpace, not by us).

It is early days in many ways, and being pretty much the only standards-compliant digital repository operating in chemistry in this manner means that awareness is still low. If anyone reading this blog knows of significant others, please comment.

(Hyper)activating the chemistry journal.

Monday, September 7th, 2009

The science journal is generally acknowledged as first appearing around 1665 with the Philosophical Transactions of the Royal Society in London and (simultaneously) the French Academy of Sciences in Paris. By the turn of the millennium, around 10,000 science and medical journals were estimated to exist. By then, the Web had been around for a decade, and most journals had responded to this new medium by re-inventing themselves for it. For the most part, they adopted a format which emulated paper (Acrobat), with a few embellishments (such as making the text fully searchable) and then used the Web to deliver this new reformulation of the journal. Otherwise, Robert Hooke would have easily recognized the medium he helped found in the 17th century.

In 1994, a small group of us thought that one could, and indeed should go further than emulated paper. We argued [1] that journals should be activated by delivering not merely the logic of a scientific argument, but also the data on which it might have been based. Of course, we encountered the usual problem; doing this might cost publishers more in production resources, and in the absence of a market prepared to pay the extra, the business model did not make sense (to the publishers). Well, 15 years later, and most publishers are indeed now thinking about how their journals can be enhanced. A number of interesting projects (the RSC’s Project Prospect is one which strives to bring science alive) have emerged. Another is the topic of this blog; the activation of the journal with molecular coordinates and data using the Jmol applet.

Initially (~2005), this project met with resistance from publishers, and the issue really amounted to what the definitive version of a scientific article should be. Should that definitive version be printable? That model, after all had served the community well for more than 300 years! And journals from the very beginning are still as readable now as when first published. In other words, print lasts! But print is pretty limiting after all. For a start, it is limited to 2D static representations. Molecules, by and large, do their magic in a dynamic three dimensions (4D in an Einsteinian sense). But print is also expensive; not merely to produce, but to transport paper around the world.

From the turn of the millennium, a number of publishers, amongst them the American Chemical Society, started to evolve the scientific article such that the pre-eminent version would now be considered to be the HTML form (perhaps as a prelude to phasing out print entirely? See an interesting commentary by a journal editor) and perhaps a digital Acrobat form which would be deemed to lose some of its functionality once printed (again see here for how Acrobat can be used to enhance things). Again however, a chicken-and-egg scenario resulted. To enhance the articles with extra functionality (such as data), they would need to find authors prepared to put the extra work into preparing the material. In fact, most authors already do that, but they call it supporting information. This is often highly data rich, covering materials such as spectra, coordinates and other information nowadays provided to researchers for analysis. Unfortunately, what has been missing is the education of authors to provide this information in a proper digital form which can be easily re-used by others and, on a Web page, converted automatically to nice interactive models. Most spectra which form part of the supporting information are in fact still scanned versions of printed spectra!

Enter computational chemists. Nowadays, they live in a world that truly does not need printing! Almost all of their data is already suitably digital. So perhaps it is no surprise to find that when enhanced journal articles started appearing around 2005, many were produced by this group of chemists. By now perhaps you are wondering what such an article might look like. Well, the remainder of this blog will be devoted to listing some examples. You will also notice that they come exclusively from our own publications. Perhaps someone will find the time to collect a far more representative set to better illustrate the diversity of this form, and how it is evolving. Meanwhile, you might wish to take a look at the following.

Part 1: The early days: 1994 onwards

These examples all relied on a browser plugin called Chime, which is no longer with us! Hence the pages designed to invoke it no longer display properly. But the data associated with the articles is still there!

  1. An early 1994 example of (hyper)activating a journal article can be seen here as the preliminary communication and
  2. in 1995 here as the final full article. I am told that this was the article that actually inspired the developers of Chime to enhance (Netscape) with a chemical plugin.
  3. This one from 1998 illustrates how articles can decay in functionality when Chime is no longer available.
  4. An ab initio and MNDO-d SCF-MO Computational Study of Stereoelectronic Control in Extrusion Reactions of R2I-F Iodine (III) Intermediates, M. A. Carroll, S. Martin-Santamaria, V. W. Pike, H. S. Rzepa and D. A. Widdowson, Perkin Trans. 2, 1999, 2707-2714 with the supporting information here.
  5. Hückel and Möbius Aromaticity and Trimerous transition state behaviour in the Pericyclic Reactions of [10], [14], [16] and [18] Annulenes. Sonsoles Martín-Santamaría, Balasundaram Lavan and H. S. Rzepa, J. Chem. Soc., Perkin Trans 2, 2000, 1415, with the supporting information here.
  6. Peter Murray-Rust, H. S. Rzepa and Michael Wright, “Development of Chemical Markup Language (CML) as a System for Handling Complex Chemical Content”, New J. Chem., 2001, 618-634. DOI: 10.1039/b008780g. This article broke new ground in that the supporting information was something of a misnomer. It was expressed entirely in XML, including all the chemistry data, and used XSLT transforms on the fly to regenerate the article. In that sense, it was actually a superset of the published article. It would be fair to say that this article was rather ahead of its time (although it does seem appropriate to publish it in a new journal!).
  7. M. Jakt, L. Johannissen, H. S. Rzepa, D. A. Widdowson and R. Wilhelm, “A Computational Study of the Mechanism of Palladium Insertion into Alkynyl and Aryl Carbon-Fluorine bonds”, Perkin Trans. 2, 2002, 576-581 and supporting information.
  8. P. Murray-Rust and H. S. Rzepa, chapter in “Handbook of Chemoinformatics. Part 2. Advanced Topics.”, ed. J. Gasteiger and T. Engel, 2003, Vol 1, was not enhanced per se, but did lay out the principles of how it might/should be done.
  9. K. P. Tellmann, M. J. Humphries, H. S. Rzepa and V. C. Gibson, “An experimental and computational study of β-H transfer between organocobalt complexes and 1-alkenes”, Organometallics, 2004, 23, 5503-5513. DOI: 10.1021/om049581h and supporting information.

Part 2: 2005.

These four examples all now invoke Jmol, which downloads upon request and hence does not rely on the presence of any browser plugin. The four articles were submitted with supporting information in the form of HTML. These were associated with the main article, but were not a formal part of that article. In that sense, they represent an incarnation of the traditional model, with all the data firmly resident in the supporting information.

  1. Gibson, Vernon C.; Marshall, Edward L.; Rzepa, H. S. “A computational study on the ring-opening polymerization of lactide initiated by β-diketiminate metal alkoxides: The origin of heterotactic stereocontrol”, J. Am. Chem. Soc., 2005, 127, 6048-6051. DOI: 10.1021/ja043819b and supporting information.
  2. H. S. Rzepa, “Möbius aromaticity and delocalization”, Chem. Rev., 2005, 105, 3697-3715. DOI: 10.1021/cr030092l and supporting information.
  3. H. S. Rzepa, “Double-twist Möbius Aromaticity in a 4n+2 Electron Electrocyclic Reaction”, Chem Comm, 2005, 5220-5222. DOI: 10.1039/b510508k The supporting information is also available directly.
  4. H. S. Rzepa, “A Double-twist Möbius-aromatic conformation of [14]annulene”, Org. Lett., 2005, 7, 4637-4639. DOI: 10.1021/ol0518333 and supporting information.

Part 3: 2006 onwards

The supporting information has now been assimilated into the main body of the article proper, and within these confines contributes components such as enhanced figures or tables (i.e. enhanced with data).

  1. A. P. Dove, V. C. Gibson, E. L. Marshall, H. S. Rzepa, A. J. P. White and D. J. Williams, “Synthetic, Structural, Mechanistic and Computational Studies on Single-Site β-Diketiminate Tin(II) Initiators for the Polymerization of rac-Lactide”, J. Am. Chem. Soc., 2006, 128, 9834-9843. DOI: 10.1021/ja061400a The enhancement can be seen in Figure 11.
  2. O. Casher and H. S. Rzepa, “SemanticEye: A Semantic Web Application to Rationalise and Enhance Chemical Electronic Publishing”, J. Chem. Inf. Mod., 2006, 46, 2396-2411. DOI: 10.1021/ci060139e
  3. H. S. Rzepa and M. E. Cass, “A Computational Study of the Nondissociative Mechanisms that Interchange Apical and Equatorial Atoms in Square Pyramidal Molecules”, Inorg. Chem., 2006, 45, 3958–3963. DOI: 10.1021/ic0519988. Interactive table at 10.1021/ic0519988/ic0519988.html
  4. M. E. Cass and H. S. Rzepa, “In Search of The Bailar Twist and Ray-Dutt mechanisms that racemize chiral tris-chelates: A computational study of Sc(III), V(III), Co(III), Zn(II) and Ga(III) complexes of a ligand analog of acetylacetonate”, Inorg. Chem., 2007, 46, 8024-8031. DOI: 10.1021/ic062473y The enhancement can be seen in Figure 2.
  5. H. S. Rzepa, “Lemniscular Hexaphyrins as examples of aromatic and antiaromatic Double-Twist Möbius Molecules”, Org. Lett., 2008, 10, 949-952. DOI: 10.1021/ol703129z The enhancement can be seen in Web Table 1.
  6. D. C. Braddock and H. S. Rzepa, “Structural Reassignment of Obtusallenes V, VI and VII by GIAO-based Density functional prediction”, J. Nat. Prod., 2008, DOI: 10.1021/np0705918 and WEO1.
  7. S. M. Rappaport and H. S. Rzepa, “Intrinsically Chiral Aromaticity. Rules Incorporating Linking Number, Twist, and Writhe for Higher-Twist Möbius Annulenes”, J. Am. Chem. Soc., 2008, 130, 7613-7619. DOI: 10.1021/ja710438j and WEO1 to 4
  8. C. S. M. Allan and H. S. Rzepa, “AIM and ELF Critical point and NICS Magnetic analyses of Möbius-type Aromaticity and Homoaromaticity in Lemniscular Annulenes and Hexaphyrins”, J. Org. Chem., 2008, 73, 6615-6622. DOI: 10.1021/jo801022b and WEO1
  9. C. S. M. Allan and H. S. Rzepa, “Chiral aromaticities. Möbius Homoaromaticity”, J. Chem. Theory. Comp., 2008, 4, 1841-1848. DOI: 10.1021/ct8001915 and WEO1
  10. C. S. M Allan and H. S. Rzepa, “The structure of Polythiocyanogen: A Computational investigation”, Dalton Trans., 2008, 6925 – 6932. DOI: 10.1039/b810147g and enhanced Table
  11. H. S. Rzepa, “Wormholes in Chemical Space connecting Torus Knot and Torus Link π-electron density topologies”, Phys. Chem. Chem. Phys., 2009, 1340-1345. DOI: 10.1039/b810301a and enhanced Table.
  12. H. S. Rzepa, “The Chiro-optical properties of a Lemniscular Octaphyrin”, Org. Lett., 2009, 11, 3088-3091. DOI: 10.1021/ol901172g
  13. C. S. Wannere, H. S. Rzepa, B. C. Rinderspacher, A. Paul, H. F. Schaefer III, P. v. R. Schleyer and C. S. M. Allan, “The geometry and electronic topology of higher-order Möbius charged Annulenes”, J. Phys. Chem., 2009, DOI: 10.1021/jp902176a and enhanced table
  14. H. S. Rzepa, “The distortivity of π-electrons in conjugated Boron rings.”, Phys. Chem. Chem. Phys., 2009, DOI: 10.1039/B911817A and enhanced table.
  15. H. S. Rzepa, “The importance of being bonded”, Nature Chem., 2009, DOI: 10.1038/nchem.373 and the exploratorium.
  16. King Kuok Hii, J.L.Arbour, H.S.Rzepa, A.J.P.White, “Unusual Regiodivergence in Metal-Catalysed Intramolecular Cyclisation of γ-Allenols”, Chem. Commun, 2009, DOI: 10.1039/b913295c and enhanced table.
  17. L. F. V. Pinto, P. M. C. Glória, M. J. S. Gomes, H. S. Rzepa, S. Prabhakar, A. M. Lobo. “A Dramatic Effect of Double Bond Configuration in N-Oxy-3-aza Cope Rearrangements – A simple synthesis of functionalised allenes”, Tet. Lett., 2009, 50, 3446-3449. DOI: 10.1016/j.tetlet.2009.02.228 and interactive table.
  18. H. S. Rzepa and C. S. M. Allan, “Racemization of isobornyl chloride via carbocations: a non-classical look at a classic mechanism”, J. Chem. Educ., 2010, DOI: 10.1021/ed800058c and interactive table.
  19. K. Abersfelder, A. J. P. White, H. S. Rzepa, and D. Scheschkewitz “A Tricyclic Aromatic Isomer of Hexasilabenzene”, Science, 2010, DOI: 10.1126/science.1181771 and interactive table.
  20. A. C. Spivey, L. Laraia, A. R. Bayly, H. S. Rzepa and A. J. P. White “Stereoselective Synthesis of cis- and trans-2,3-Disubstituted Tetrahydrofurans via Oxonium−Prins Cyclization: Access to the Cordigol Ring System”, Org. Lett., 2010, DOI 10.1021/ol9024259 and interactive table.
  21. J. Kong, P. v. R. Schleyer and H. S. Rzepa, “Successful Computational Modeling of Iso-bornyl Chloride Ion-Pair Mechanisms”, J. Org. Chem., 2010, DOI: 10.1021/jo100920e and interactive table.
  22. A. Smith, H. S. Rzepa, A. White, D. Billen, K. K. Hii, “Delineating Origins of Stereocontrol in Asymmetric Pd-Catalyzed α-Hydroxylation of 1,3-Ketoesters”, J. Org. Chem., 2010, 75, 3085-3096. DOI: 10.1021/jo1002906 and interactive table.
  23. H. S. Rzepa, “The rational design of helium bonds”, Nature Chem., 2010, 2, 390-393. DOI: 10.1038/NCHEM.596 and web enhanced table.
  24. P. Rivera-Fuentes, J. Lorenzo Alonso-Gómez, A. G. Petrovic, P. Seiler, F. Santoro, N. Harada, N. Berova, H. S. Rzepa, and F. Diederich, “Enantiomerically Pure Alleno–Acetylenic Macrocycles: Synthesis, Solid-State Structures, Chiroptical Properties, and Electron Localization Function Analysis”, Chem. Eur. J., 2010, DOI: 10.1002/chem.201001087 and interactive figure
  25. H. S. Rzepa, “The Nature of the Carbon-Sulfur bond in the species H-CS-OH”, J. Chem. Theory. Comput., 2010, 49, DOI: 10.1021/ct100470g and interactive table.
  26. H. S. Rzepa, “Can 1,3-dimethylcyclobutadiene and carbon dioxide co-exist inside a supramolecular cavity?”, Chem. Commun., 2010, DOI: 10.1039/C0CC04023A and interactive table
  27. M. R. Crittall, H. S. Rzepa, and D. R. Carbery, “Design, Synthesis, and Evaluation of a Helicenoidal DMAP Lewis Base Catalyst”, Org. Lett., 2011, DOI: 10.1021/ol2001705 and interactive table
  28. H. S. Rzepa, “The past, present and future of Scientific discourse”, J. Cheminformatics, 2011, 3, 46. DOI: 10.1186/1758-2946-3-46 and interactive figure 3, figure 4 and figure 5.
  29. H. S. Rzepa, “A computational evaluation of the evidence for the synthesis of 1,3-dimethylcyclobutadiene in the solid state and aqueous solution”, Chem. Euro. J., 2012, in press.
  30. J. L. Arbour, H. S. Rzepa, L. A. Adrio, E. M. Barreiro, P. G. Pringle and K. K. (Mimi) Hii, “Silver-catalysed enantioselective additions of O-H and N-H to C=C bonds: Non-covalent interactions in stereoselective processes”, Chem. Euro. J., 2012, in press, Web table 1 and Web table 2.
  31. H. S. Rzepa, “Chemical datuments as scientific enablers”, J. Cheminformatics, submitted.
  32. A. P. Buchard, F. Jutz, F. M. R. Kember, H. S. Rzepa, C. K. Williams, “Experimental and Computational Investigation of the Mechanism of Carbon Dioxide/Cyclohexene Oxide Copolymerization Using A Dizinc Catalyst”, in press. Interactivity box
  33. D. C. Braddock, D. Roy, D. Lenoir, E. Moore, H. S. Rzepa, J. I-Chia Wu and P. von R. Schleyer, “Verification of Stereospecific Dyotropic Racemisation of Enantiopure d and l-1,2-Dibromo-1,2-diphenylethane in Non-polar Media”, Chem. Comm., 2012, just published. DOI: 10.1039/C2CC33676F and interactivity box.
  34. K. Leszczyńska, K. Abersfelder, M. Majumdar, B. Neumann, H.-G. Stammler, H. S. Rzepa, P. Jutzi and D. Scheschkewitz, “The Cp*Si+ Cation as a Stoichiometric Source of Silicon”, Chem. Comm., 2012, 48, 7820-7822. DOI: 10.1039/c2cc33911k. Cites links to 10042/to-13974, 10042/to-13982, 10042/to-13969, 10042/20028, 10042/to-13973, 10042/to-13985
  35. H. S. Rzepa, “A computational evaluation of the evidence for the synthesis of 1,3-dimethylcyclobutadiene in the solid state and aqueous solution”, Chem. Euro. J., 2013, 4932-4937. DOI: 10.1002/chem.201102942 and WebTable
  36. H. S. Rzepa, “Chemical datuments as scientific enablers”, J. Cheminformatics, 2013, 4, DOI: 10.1186/1758-2946-5-6. The interactivity box is integrated into the body of the article.
  37. M. J. Cowley, V. Huch, H. S. Rzepa, D. Scheschkewitz, “A Silicon Version of the Vinylcarbene – Cyclopropene Equilibrium: Isolation of a Base-Stabilized Disilenyl Silylene”, 2013, Nature Chem., in press and Webtable.
  38. M. J. S. Gomes, L. F. V. Pinto, H. S. Rzepa, S. Prabhakar, A. M. Lobo, “N-Heteroatom Substitution Effects in 3-Aza-Cope Rearrangements”, Chemistry Central, 2013, 7:94. doi:10.1186/1752-153X-7-94 and Table.
  39. H. S. Rzepa and C. Wentrup, “Mechanistic Diversity in Thermal Fragmentation Reactions: a Computational Exploration of CO and CO2 Extrusions from Five-Membered Rings”, J. Org. Chem., DOI: 10.1021/jo401146k and Table.
  40. D. C. Braddock, J. Clarke and H. S. Rzepa “Epoxidation of Bromoallenes Connects Red Algae Metabolites by an Intersecting Bromoallene Oxide – Favorskii Manifold”, Chem. Comm., 2013, DOI: 10.1039/C3CC46720A and Table.
  41. M. J. Fuchter, Ya-Pei Lo and H. S. Rzepa, “Mechanistic and chiroptical studies on the desulfurization of epidithiodioxopiperazines reveal universal retention of configuration at the bridgehead carbon atoms”, J. Org. Chem., 2013, in press. doi: 10.1021/jo401316a and table.

References

  1. H.S. Rzepa, B.J. Whitaker, and M.J. Winter, "Chemical applications of the World-Wide-Web system", Journal of the Chemical Society, Chemical Communications, pp. 1907, 1994. https://doi.org/10.1039/c39940001907

On the importance of Digital repositories in Chemistry

Friday, April 3rd, 2009

The preceding blog entries contain stories about chemical behaviour. If you have clicked on the diagrams, you may even have gotten a Jmol view of the relevant molecules popping up. But if you are truly curious, you may even have the urge to acquire the relevant 3D information about the molecule, and play with it yourself. Even after 15 years of the (chemical) Web, this can be distressingly difficult to achieve (or can it be that it is only myself who wishes to view molecules in their native mode?). Thus the standard mechanism is to seek out on journal pages that disarming little entry entitled supporting information and to hope that you might find something useful embedded there. Embedded is the correct description, since the information is often found within the confines of an Acrobat file, and has to be extracted from there. Indeed, that is what I had to resort to in order to write one of the blog entries below. I ground my teeth whilst doing so.

So is there a better way? We think so! The digital repository. If you click on this you should see the entry directly. What can you do there? Well, if you have suitable programs, you can download e.g. a checkpoint file of the calculation that created the molecule model and re-activate it there. Or you can download just the CML file for viewing in any CML-compliant program (such as Jmol). Or you can check up on the InChI string or the InChI key of the molecule.
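A downloaded CML file need not only be viewed; being XML, it can also be read by a short script. The Python sketch below is an illustration under stated assumptions: that the file uses the CML namespace `http://www.xml-cml.org/schema` and stores 3D coordinates in `x3`, `y3` and `z3` attributes on each `atom` element. The function name and the three-atom sample are invented, and files from a real repository entry may be laid out differently.

```python
import xml.etree.ElementTree as ET

# Assumed namespace for CML documents.
CML_NS = "{http://www.xml-cml.org/schema}"


def read_atoms(cml_text):
    """Return (element, x, y, z) tuples for every atom in a CML document."""
    root = ET.fromstring(cml_text)
    atoms = []
    for atom in root.iter(CML_NS + "atom"):
        atoms.append((
            atom.get("elementType"),
            float(atom.get("x3")),
            float(atom.get("y3")),
            float(atom.get("z3")),
        ))
    return atoms


# Made-up three-atom molecule standing in for a downloaded .cml file.
sample = """<molecule xmlns="http://www.xml-cml.org/schema" id="m1">
  <atomArray>
    <atom id="a1" elementType="C" x3="0.0" y3="0.0" z3="0.0"/>
    <atom id="a2" elementType="O" x3="0.0" y3="0.0" z3="1.16"/>
    <atom id="a3" elementType="O" x3="0.0" y3="0.0" z3="-1.16"/>
  </atomArray>
</molecule>"""

atoms = read_atoms(sample)
for element, x, y, z in atoms:
    print(element, x, y, z)
```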

What about the specific entry above? Well, it corresponds to the calculation for the π4 + π2 cycloaddition described in the blog entry below. You can now verify for yourself the assertions made in that entry, i.e. that the rotation mode is disrotatory, or that the bond is forming antarafacially. You do not need to take my word for it! If the digital repository is too much trouble for you, click on the graphic instead to get a similar result.

We now regularly put such links into journal articles, in the form of Web-enhanced tables and figures, so that it is literally just one click away from such an article to having a vibrant molecule dancing in front of you. All (chemistry) journals should do this. If they do not yet, then contact their editor in chief when you next submit an article and ask them why not!

See also the blog by Peter Murray-Rust.