Posts Tagged ‘Search queries’

How does carbon dioxide coordinate to a metal?

Saturday, May 6th, 2017

Mention carbon dioxide (CO2) to most chemists and its properties as a metal ligand are not the first aspect that springs to mind. Here thought I might take a look at how it might act as such.

There are up to five binding modes with one metal that one might envisage:

  1. Bonded interaction with the metal via just one oxygen atom,
  2. Bonded interaction via just the central carbon atom,
  3. Bonded interaction via the π-face of one C=O double bond,
  4. A weaker non-bonded interaction via carbon, or
  5. via oxygen.

Search queries of the Cambridge structure database (CSD) for these five modes are illustrated below (dataDOI: 10.14469/hpc/2524), with the constraints being applied to how many bonds (of unspecified type) each atom carries, along with no disorder and no errors. Thus query 1 is constrained by 1-coordination on one oxygen, and two on the carbon and other oxygen. 

  1. This query yields four hits: 10.5517/ccvcdq9, 10.5517/cc12nq6n10.5517/cc12nq5m10.5517/cc12nq4l. The angle subtended at the central carbon of the CO2 ranges from 172-176°, a very modest bending of the linear CO2. There are no examples where the metal is bonded to both oxygens.
  2. The next category involves the metal binding just to the central carbon. Two examples are known, differentiated from O-coordination by a more acute angle at the central carbon of 121-132°.
  3. The π-coordinated type requires a slightly more complex search query, shown below. The π-complex is defined as adding one coordination to each of one oxygen and the carbon. 

    This reveals 16 examples:

    The sine of the angle subtended at the centroid of one C-O bond shows that for most of the examples, the metal is close to perpendicular to this bond. The angle subtended at the central carbon ranges from 128-138, rather larger than the examples where the metal is bound just to the carbon. I have picked these two for illustration. The first (dataDOI: 10.5517/cc86r17) contains both CO2 and CO coordinated to the metal.This one (dataDOI: 10.1021/ic101652e) contains a short metal-centroid distance of 1.78Å (as also does 10.5517/ccz34kr). 

    There are two examples where BOTH π-CO bonds are coordinated to a metal; 10.5517/ccqlv7c and 10.5517/ccqlv8d (Ni-centroid distance 1.9Å) but these are intriguing because the two π-complexes are co-planar and not orthogonal.

  4. The final two cases are defined in the CSD database by having not so much bonds between metal and either C or O, as close intermolecular contacts typical of e.g. hydrogen bonds. This one (dataDOI: 10.5517/cc12nq9r) is to Fe, with a metal-C distance of 2.87Å which is significantly shorter than the anticipated sum of the van der Waals radii of the two atoms. The next (dataDOI: 10.5517/cc12npn2) has a close approach of Co to O of 2.23Å. The angles subtended at the carbon range from 174-180°. There are no convincing examples of close non-bonded approaches of the metal to both oxygen atoms simultaneously.

It is striking that the searches (as defined above) reveal relatively few examples. This might simply be a result of how the compounds are indexed in the CSD, reflected in the coordination constraints applied in the searches. Nevertheless, we see three quite different types of ligand-metal coordination in which bonds can be said to form and a more diffuse spectrum of weaker interactions to carbon dioxide. As a metal ligand, it is certainly interesting! Several deserve their wavefunctions looked at and I might report back on this aspect.

Global initiatives in research data management and discovery: searching metadata.

Monday, March 7th, 2016

The upcoming ACS national meeting in San Diego has a CINF (chemical information division) session entitled "Global initiatives in research data management and discovery". I have highlighted here just one slide from my contribution to this session, which addresses the discovery aspect of the session.

Data, if you think about it, is rarely discoverable other than by intimate association with a narrative or journal article. Even then, the standard procedure is to identify the article itself as being of interest, and then digging out the "supporting information", which normally takes the form of a single paginated PDF document. If you are truly lucky, you might also get a CIF file (for crystal structures). But such data has little life of its own outside of its parent, the article. Put another way, it has no metadata it can call its own (metadata is data about an object, in this case research data). An alternative is to try to find the data by searching conventional databases such as CAS,  Beilstein/Reaxys or CSD, and there of course the searches can be very precise. But (someone) has to pay the bills for such accessibility.

We are now starting to see quite different solutions to finding data (the F in FAIR data, the other letters representing accessibility, interoperability and re-usability). These solutions depend on metadata being a part of the solution from the outset, rather than any afterthought produced as a commercial solution. The collection of metadata is part of the overall process called RDM, or research data management, perhaps even the most important part of it. In exchange for identifying metadata about one's data, one gets back a "receipt" in the form of a persistent identifier for the data, more commonly known as a DOI. The agency that issues the DOI also undertakes to look after the donated metadata, and to make it searchable. The table below shows eight searches of such metadata, one example of how to acquire statistics relating to the usage of the data and one search of how to find repositories containing the data.

Search queries enabled by the use of metadata in data publication
# Search query* Instances retrieved:
1 http://search.datacite.org/ui?q=alternateIdentifier:InChIKey:*  InChI identifier
2 http://search.datacite.org/ui?q=alternateIdentifier:InChI:*  InChI key 
3 http://search.datacite.org/ui?q=alternateIdentifier:InChIKey:CULPUXIDFLIQBT-UHFFFAOYSA-N InChI key CULPUXIDFLIQBT-UHFFFAOYSA-N 
4 http://search.datacite.org/ui?q=ORCID:0000-0002-8635-8390+alternateIdentifier:InChIKey:* ORCID 0000-0002-8635-8390 AND (boolean) InChI key.
5 http://search.datacite.org/ui?q=ORCID:0000-0002-8635-8390+alternateIdentifier:InChI:InChI=1S/C9H11N5O3* ORCID 0000-0002-8635-8390 AND (boolean) + InChI string 1S/C9H11N5O3 with the * wild.
6 http://search.datacite.org/ui?q=has_media:true&fq=prefix:10.14469 Has content media for Publisher 10.14469 (Imperial College)
7 http://search.datacite.org/ui?q=format:chemical/x-* Data format type chemical/x-* 
8 http://search.datacite.org/api?&q=prefix:10.14469& fq=alternateIdentifier:InChIKey:*& fl=doi,title,alternateIdentifier& wt=json&rows=15
http://api.labs.datacite.org/works?q=prefix:10.14469+AND+alternateIdentifier:InChIKey:*
First 15 hits in JSON format, batch query mode
9 http://stats.datacite.org/?fq=datacentre_facet:"BL.IMPERIAL – Imperial College London" resolution statistics for publisher 10.14469 (Imperial College) per month
10 http://service.re3data.org/search?query=&subjects[]=31 Chemistry Research data repository search for Chemistry (135 hits)

In this instance the three MIME media types are chemical/x-wavefunction, chemical/x-gaussian-checkpoint and chemical/x-gaussian-log. See[1] for chemical MIME (multipurpose internet media extensions).


Anyone familiar with the standard ways of finding data (CAS, CSD, Reaxys) will appreciate that the above does not yet have the finesse to find eg sub-structures of chemical structures, synthetic procedures or molecular properties. My including it here is primarily to show some of the potential such systems have, and to remark particularly that the batch query capability of this infrastructure could indeed be used in the future to construct much more sophisticated systems.  Oh, and to the end-user at least, the searches shown above do not require institutional licenses to use. Both the data and its metadata is free, mostly with a CC0 or CC BY 3.0 license for re-use (the R of FAIR).

If more of interest related to this topic emerges at the ACS session,  I will report back here.

References

  1. H.S. Rzepa, P. Murray-Rust, and B.J. Whitaker, "The Application of Chemical Multipurpose Internet Mail Extensions (Chemical MIME) Internet Standards to Electronic Mail and World Wide Web Information Exchange", Journal of Chemical Information and Computer Sciences, vol. 38, pp. 976-982, 1998. https://doi.org/10.1021/ci9803233