Posts Tagged ‘Chemical substance’

Global initiatives in research data management and discovery: searching metadata.

Monday, March 7th, 2016

The upcoming ACS national meeting in San Diego has a CINF (chemical information division) session entitled "Global initiatives in research data management and discovery". I have highlighted here just one slide from my contribution to this session, which addresses the discovery aspect of the session.

Data, if you think about it, is rarely discoverable other than by intimate association with a narrative or journal article. Even then, the standard procedure is to identify the article itself as being of interest, and then digging out the "supporting information", which normally takes the form of a single paginated PDF document. If you are truly lucky, you might also get a CIF file (for crystal structures). But such data has little life of its own outside of its parent, the article. Put another way, it has no metadata it can call its own (metadata is data about an object, in this case research data). An alternative is to try to find the data by searching conventional databases such as CAS,  Beilstein/Reaxys or CSD, and there of course the searches can be very precise. But (someone) has to pay the bills for such accessibility.

We are now starting to see quite different solutions to finding data (the F in FAIR data, the other letters representing accessibility, interoperability and re-usability). These solutions depend on metadata being a part of the solution from the outset, rather than any afterthought produced as a commercial solution. The collection of metadata is part of the overall process called RDM, or research data management, perhaps even the most important part of it. In exchange for identifying metadata about one's data, one gets back a "receipt" in the form of a persistent identifier for the data, more commonly known as a DOI. The agency that issues the DOI also undertakes to look after the donated metadata, and to make it searchable. The table below shows eight searches of such metadata, one example of how to acquire statistics relating to the usage of the data and one search of how to find repositories containing the data.

Search queries enabled by the use of metadata in data publication
# Search query* Instances retrieved:
1 http://search.datacite.org/ui?q=alternateIdentifier:InChIKey:*  InChI identifier
2 http://search.datacite.org/ui?q=alternateIdentifier:InChI:*  InChI key 
3 http://search.datacite.org/ui?q=alternateIdentifier:InChIKey:CULPUXIDFLIQBT-UHFFFAOYSA-N InChI key CULPUXIDFLIQBT-UHFFFAOYSA-N 
4 http://search.datacite.org/ui?q=ORCID:0000-0002-8635-8390+alternateIdentifier:InChIKey:* ORCID 0000-0002-8635-8390 AND (boolean) InChI key.
5 http://search.datacite.org/ui?q=ORCID:0000-0002-8635-8390+alternateIdentifier:InChI:InChI=1S/C9H11N5O3* ORCID 0000-0002-8635-8390 AND (boolean) + InChI string 1S/C9H11N5O3 with the * wild.
6 http://search.datacite.org/ui?q=has_media:true&fq=prefix:10.14469 Has content media for Publisher 10.14469 (Imperial College)
7 http://search.datacite.org/ui?q=format:chemical/x-* Data format type chemical/x-* 
8 http://search.datacite.org/api?&q=prefix:10.14469& fq=alternateIdentifier:InChIKey:*& fl=doi,title,alternateIdentifier& wt=json&rows=15
http://api.labs.datacite.org/works?q=prefix:10.14469+AND+alternateIdentifier:InChIKey:*
First 15 hits in JSON format, batch query mode
9 http://stats.datacite.org/?fq=datacentre_facet:"BL.IMPERIAL – Imperial College London" resolution statistics for publisher 10.14469 (Imperial College) per month
10 http://service.re3data.org/search?query=&subjects[]=31 Chemistry Research data repository search for Chemistry (135 hits)

In this instance the three MIME media types are chemical/x-wavefunction, chemical/x-gaussian-checkpoint and chemical/x-gaussian-log. See[1] for chemical MIME (multipurpose internet media extensions).


Anyone familiar with the standard ways of finding data (CAS, CSD, Reaxys) will appreciate that the above does not yet have the finesse to find eg sub-structures of chemical structures, synthetic procedures or molecular properties. My including it here is primarily to show some of the potential such systems have, and to remark particularly that the batch query capability of this infrastructure could indeed be used in the future to construct much more sophisticated systems.  Oh, and to the end-user at least, the searches shown above do not require institutional licenses to use. Both the data and its metadata is free, mostly with a CC0 or CC BY 3.0 license for re-use (the R of FAIR).

If more of interest related to this topic emerges at the ACS session,  I will report back here.

References

  1. H.S. Rzepa, P. Murray-Rust, and B.J. Whitaker, "The Application of Chemical Multipurpose Internet Mail Extensions (Chemical MIME) Internet Standards to Electronic Mail and World Wide Web Information Exchange", Journal of Chemical Information and Computer Sciences, vol. 38, pp. 976-982, 1998. https://doi.org/10.1021/ci9803233

Bond stretch isomerism. Did this idea first surface 100 years ago?

Tuesday, February 9th, 2016

The phenomenon of bond stretch isomerism, two isomers of a compound differing predominantly in just one bond length, is one of those chemical concepts that wax and occasionally wane.[1] Here I explore such isomerism for the elements Ge, Sn and Pb.

In one earlier post, I noted a form of bond stretch isomerism that can arise from a Jahn-Teller distortion ending in two different geometries in which one or more pairs of bonds swap short/long lengths. Examples include substituted cyclo-octatetraenes[2] and octahedral d9-Cu(II) complexes.[3] A more interesting seminal possibility was implied by G. N. Lewis a century ago when discussing the arrangement of electrons in a (carbon-carbon) triple bond.[4]

lewis1
*It took ~50 years to prove this assertion wrong.[5]

In a commentary, I reported the results of a search of the crystal structure database for the geometries associated with RX≡XR systems (X= C, Si, Ge, Sn, Pb). Here I focus the search[6] specifically for X=Sn,Ge; this version of bond stretch isomerism also allows angles to change (= rehybridisation at atoms) in order to provide a mechanism for a barrier separating the two forms.

For X=Sn, note the presence of up to three clusters, although the relatively low number of hits makes the statistics less certain.

  1. The hotspot cluster centered around angles of 125° and a Sn-Sn distance of ~2.6Å.
  2. Another with angles of <100° and Sn-Sn distances of ~3.3Å.
  3. A third with angles of <100° and Sn-Sn distances of 2.8Å, which may or may not be a genuine unique form of bonding.

This pattern was commented on in 2010 by Power[7], whose group synthesized most of the examples in the hits above. A plot of compounds with Ge-Ge bonds reveals both similarity with (two, possibly three clusters) and difference from (the clusters are closely spaced in terms of the Ge-Ge bond length, but separated in terms of angle) Sn.

GeGe

Time for some computations (which at least will remove random errors in the geometry). I selected the only known example of an RPb-PbR compound[8] as a seed and put it through a B3LYP+D3/Def2-TZVPP calculation (with 172 atoms and 2920 basis functions, this is a relatively large calculation!), which reproduces the known structure pretty well (table).

QIMQUY

So what about another bond stretch isomers? The Pb=Pb variation is indeed a stable minimum around 28.0 kcal/mol above the known structure, which seems to put this form out of experimental reach (with this ligand/aryl group at least). With Sn for the same aryl ligand, the energy difference is smaller (~15.8 kcal/mol for this ligand; Powers reports other systems where the energy difference may be only ~5 kcal/mol). Judging by the distribution of the 13 hits recovered from the CSD search, both bond stretch isomers may be accessible experimentally. The calculations show that the GeGe bond isomers are much closer in energy than SnSn (for this ligand). For all three metals however, the calculated difference in the metal-metal length for the two isomers is ~0.45 – 0.52Å. This strongly suggests that whereas the SnSn plot above is demonstrating bond length isomerism, the GeGe plot may not be; at least not of the same type that the calculations here are revealing (via the Wiberg bond orders).

System Relative energy XX distance RXX angle Wiberg bond order DataDOI
Pb=Pb +28.0 2.767 118.7 1.666 [9]
Pb-Pb 0.0 3.215 (3.188)[8] 93.7 (94.3)[8] 0.889 [10]
Sn=Sn +15.8 2.640 123.1 1.911 [11]
Sn-Sn 0.0 3.126 95.5 0.892 [12]
Ge=Ge +0.5 2.263 125.2 2.138 [13]
Ge-Ge 0.0   2.777 99.7 0.866 [14]

No doubt the particular bond length form is being facilitated by the nature of the ligand and the steric interactions therein imparted, both repulsive AND attractive. These interactions can be visualised via NCI (non-covalent-interaction) plots (click on the image to obtain a rotatable 3D model). First Pb-Pb followed by Pb=Pb. Note how in both cases, the PbPb region is enclosed in regions of weak attractive dispersion interactions, which however avoid the "hemidirected" inert Pb lone pairs.[15]

Pb-Pb Pb=Pb

So in the end we have something of a mystery. There is evidence from crystal structures that at least two bond-stretch isomers of RSnSnR compounds can form, but the calculations indicate that the Sn=Sn form is significantly higher in energy (although not impossibly so for thermal accessibility). Conversely, the Ge=Ge equivalent is very similar in energy to a Ge-Ge form with a significantly longer bond length, but there seems no crystallographic evidence for such a big difference in bond lengths. Perhaps the answer lies with the ligands?

It seems particularly appropriate on the centenary of G. N. Lewis' famous paper in which he clearly notes the possibility of three isomeric forms for the triple bond, to pay tribute to the impact his suggestions continue to make to chemistry.


The individual entries can be inspected via the following dois: [16],[17],[18],[19],[20],[21],[22],[23],[24],[25]

You can view individual entries via the following DOIs: [26],[27],[28],[29],[30],[31],[32],[33],[34],[35]

References

  1. J.A. Labinger, "Bond-stretch isomerism: a case study of a quiet controversy", Comptes Rendus. Chimie, vol. 5, pp. 235-244, 2002. https://doi.org/10.1016/s1631-0748(02)01380-2
  2. J.E. Anderson, and P.A. Kirsch, "Structural equilibria determined by attractive steric interactions. 1,6-Dialkylcyclooctatetraenes and their bond-shift and ring inversion investigated by dynamic NMR spectroscopy and molecular mechanics calculations", Journal of the Chemical Society, Perkin Transactions 2, pp. 1951, 1992. https://doi.org/10.1039/p29920001951
  3. W. Zhang, L. Chen, R. Xiong, T. Nakamura, and S.D. Huang, "New Ferroelectrics Based on Divalent Metal Ion Alum", Journal of the American Chemical Society, vol. 131, pp. 12544-12545, 2009. https://doi.org/10.1021/ja905399x
  4. G.N. Lewis, "THE ATOM AND THE MOLECULE.", Journal of the American Chemical Society, vol. 38, pp. 762-785, 1916. https://doi.org/10.1021/ja02261a002
  5. F.A. Cotton, "Metal-Metal Bonding in [Re<sub>2</sub>X<sub>8</sub>]<sup>2-</sup> Ions and Other Metal Atom Clusters", Inorganic Chemistry, vol. 4, pp. 334-336, 1965. https://doi.org/10.1021/ic50025a016
  6. H. Rzepa, "Crystal structures containing Sn...Sn bonds", 2016. https://doi.org/10.14469/hpc/249
  7. Y. Peng, R.C. Fischer, W.A. Merrill, J. Fischer, L. Pu, B.D. Ellis, J.C. Fettinger, R.H. Herber, and P.P. Power, "Substituent effects in ditetrel alkyne analogues: multiple vs. single bonded isomers", Chemical Science, vol. 1, pp. 461, 2010. https://doi.org/10.1039/c0sc00240b
  8. L. Pu, B. Twamley, and P.P. Power, "Synthesis and Characterization of 2,6-Trip<sub>2</sub>H<sub>3</sub>C<sub>6</sub>PbPbC<sub>6</sub>H<sub>3</sub>-2,6-Trip<sub>2</sub> (Trip = C<sub>6</sub>H<sub>2</sub>-2,4,6-<i>i</i>-Pr<sub>3</sub>):  A Stable Heavier Group 14 Element Analogue of an Alkyne", Journal of the American Chemical Society, vol. 122, pp. 3524-3525, 2000. https://doi.org/10.1021/ja993346m
  9. H.S. Rzepa, "C 72 H 98 Pb 2", 2016. https://doi.org/10.14469/ch/191856
  10. H.S. Rzepa, "C 72 H 98 Pb 2", 2016. https://doi.org/10.14469/ch/191873
  11. https://doi.org/
  12. H.S. Rzepa, "C 72 H 98 Sn 2", 2016. https://doi.org/10.14469/ch/191881
  13. H.S. Rzepa, "C 72 H 98 Ge 2", 2016. https://doi.org/10.14469/ch/191882
  14. H.S. Rzepa, "C 72 H 98 Ge 2", 2016. https://doi.org/10.14469/ch/191883
  15. M. Imran, A. Mix, B. Neumann, H. Stammler, U. Monkowius, P. Gründlinger, and N.W. Mitzel, "Hemi- and holo-directed lead(<scp>ii</scp>) complexes in a soft ligand environment", Dalton Transactions, vol. 44, pp. 924-937, 2015. https://doi.org/10.1039/c4dt01406e
  16. Jones, C.., Sidiropoulos, A.., Holzmann, N.., Frenking, G.., and Stasch, A.., "CCDC 892557: Experimental Crystal Structure Determination", 2012. https://doi.org/10.5517/ccyys5t
  17. Phillips, A.D.., Wright, R.J.., Olmstead, M.M.., and Power, P.P.., "CCDC 187521: Experimental Crystal Structure Determination", 2002. https://doi.org/10.5517/cc6942p
  18. Peng, Yang., Fischer, R.C.., Merrill, W.A.., Fischer, J.., Pu, Lihung., Ellis, B.D.., Fettinger, J.C.., Herber, R.H.., and Power, P.P.., "CCDC 771267: Experimental Crystal Structure Determination", 2010. https://doi.org/10.5517/cctwklt
  19. Peng, Yang., Fischer, R.C.., Merrill, W.A.., Fischer, J.., Pu, Lihung., Ellis, B.D.., Fettinger, J.C.., Herber, R.H.., and Power, P.P.., "CCDC 771268: Experimental Crystal Structure Determination", 2010. https://doi.org/10.5517/cctwkmv
  20. Peng, Yang., Fischer, R.C.., Merrill, W.A.., Fischer, J.., Pu, Lihung., Ellis, B.D.., Fettinger, J.C.., Herber, R.H.., and Power, P.P.., "CCDC 771270: Experimental Crystal Structure Determination", 2010. https://doi.org/10.5517/cctwkpx
  21. Peng, Yang., Fischer, R.C.., Merrill, W.A.., Fischer, J.., Pu, Lihung., Ellis, B.D.., Fettinger, J.C.., Herber, R.H.., and Power, P.P.., "CCDC 771271: Experimental Crystal Structure Determination", 2010. https://doi.org/10.5517/cctwkqy
  22. Peng, Yang., Fischer, R.C.., Merrill, W.A.., Fischer, J.., Pu, Lihung., Ellis, B.D.., Fettinger, J.C.., Herber, R.H.., and Power, P.P.., "CCDC 771272: Experimental Crystal Structure Determination", 2010. https://doi.org/10.5517/cctwkrz
  23. Peng, Yang., Fischer, R.C.., Merrill, W.A.., Fischer, J.., Pu, Lihung., Ellis, B.D.., Fettinger, J.C.., Herber, R.H.., and Power, P.P.., "CCDC 771274: Experimental Crystal Structure Determination", 2010. https://doi.org/10.5517/cctwkt1
  24. Fischer, R.C.., Pu, Lihung., Fettinger, J.C.., Brynda, M.A.., and Power, P.P.., "CCDC 624216: Experimental Crystal Structure Determination", 2007. https://doi.org/10.5517/ccnyk04
  25. Pu, Lihung., Phillips, A.D.., Richards, A.F.., Stender, M.., Simons, R.S.., Olmstead, M.M.., and Power, P.P.., "CCDC 221953: Experimental Crystal Structure Determination", 2004. https://doi.org/10.5517/cc7fysc
  26. Sasamori, Takahiro., Sugahara, Tomohiro., Agou, Tomohiro., Guo, Jing-Dong., Nagase, Shigeru., Streubel, Rainer., and Tokitoh, Norihiro., "CCDC 1035078: Experimental Crystal Structure Determination", 2014. https://doi.org/10.5517/cc13r2mk
  27. Sidiropoulos, A.., Jones, C.., Stasch, A.., Klein, S.., and Frenking, G.., "CCDC 749451: Experimental Crystal Structure Determination", 2010. https://doi.org/10.5517/cct4vvm
  28. Shan, Yu-Liang., Yim, Wai-Leung., and So, Cheuk-Wai., "CCDC 1019495: Experimental Crystal Structure Determination", 2015. https://doi.org/10.5517/cc136vy3
  29. Sugiyama, Y.., Sasamori, T.., Hosoi, Y.., Furukawa, Y.., Takagi, N.., Nagase, S.., and Tokitoh, N.., "CCDC 297827: Experimental Crystal Structure Determination", 2006. https://doi.org/10.5517/cc9zxbh
  30. Stender, M.., Phillips, A.D.., Wright, R.J.., and Power, P.P.., "CCDC 180660: Experimental Crystal Structure Determination", 2002. https://doi.org/10.5517/cc61zry
  31. Peng, Yang., Fischer, R.C.., Merrill, W.A.., Fischer, J.., Pu, Lihung., Ellis, B.D.., Fettinger, J.C.., Herber, R.H.., and Power, P.P.., "CCDC 771273: Experimental Crystal Structure Determination", 2010. https://doi.org/10.5517/cctwks0
  32. Peng, Yang., Fischer, R.C.., Merrill, W.A.., Fischer, J.., Pu, Lihung., Ellis, B.D.., Fettinger, J.C.., Herber, R.H.., and Power, P.P.., "CCDC 771269: Experimental Crystal Structure Determination", 2010. https://doi.org/10.5517/cctwknw
  33. Peng, Yang., Fischer, R.C.., Merrill, W.A.., Fischer, J.., Pu, Lihung., Ellis, B.D.., Fettinger, J.C.., Herber, R.H.., and Power, P.P.., "CCDC 771266: Experimental Crystal Structure Determination", 2010. https://doi.org/10.5517/cctwkks
  34. Jones, C.., Sidiropoulos, A.., Holzmann, N.., Frenking, G.., and Stasch, A.., "CCDC 892556: Experimental Crystal Structure Determination", 2012. https://doi.org/10.5517/ccyys4s
  35. Jones, C.., Sidiropoulos, A.., Holzmann, N.., Frenking, G.., and Stasch, A.., "CCDC 892555: Experimental Crystal Structure Determination", 2012. https://doi.org/10.5517/ccyys3r