{"id":22636,"date":"2020-08-29T09:26:11","date_gmt":"2020-08-29T08:26:11","guid":{"rendered":"https:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=22636"},"modified":"2020-08-29T09:26:11","modified_gmt":"2020-08-29T08:26:11","slug":"exploiting-the-power-of-persistent-identifiers-pids-for-locating-all-kinds-of-research-object","status":"publish","type":"post","link":"https:\/\/www.rzepa.net\/blog\/?p=22636","title":{"rendered":"Exploiting the power of persistent identifiers (PIDs) for locating all kinds of research object."},"content":{"rendered":"<div class=\"kcite-section\" kcite-section-id=\"22636\">\n<p>The folks at DataCite have <a href=\"https:\/\/blog.datacite.org\/power-of-pids\/\">announced<\/a> a new research object discovery service which aims to give users a &#8220;<i>comprehensive overview of connections between entities in the research landscape&#8221;<\/i>.\u00a0The portal <a href=\"https:\/\/commons.datacite.org\">https:\/\/commons.datacite.org<\/a>\u00a0acts as the entry point for three basic types of persistent identifiers (PIDs);<\/p>\n<ol>\n<li>Research works, using the DOI (digital object identifier) as a PID. This includes both research articles and research data as &#8220;works&#8221; or research objects and can be invoked using the prefix <strong>https:\/\/commons.datacite.org\/doi.org?query=<\/strong> to the search query.<\/li>\n<li>People, using the ORCID as a PID <em>via<\/em> the prefix <strong>https:\/\/commons.datacite.org\/orcid.org?query=<\/strong><\/li>\n<li>Organisations, using ROR as a PID using the prefix <strong>https:\/\/commons.datacite.org\/ror.org?query=<\/strong><\/li>\n<li>If one wants to construct a search which combines any two, or all three of the above categories, then the search prefix is simply <strong>https:\/\/commons.datacite.org\/?query=<\/strong><\/li>\n<\/ol>\n<p>To use this very modern type of discovery portal, one currently has to be familiar with how to construct a valid search query to be appended to any of the above prefixes. This is now well documented at <a href=\"https:\/\/support.datacite.org\/docs\/datacite-commons\">https:\/\/support.datacite.org\/docs\/datacite-commons<\/a>, although it still requires some work and patience to construct a precise search query. This in turn requires knowledge of the so-called &#8220;<a href=\"https:\/\/schema.datacite.org\">metadata schema<\/a>&#8220;, on which the indexing is based.<\/p>\n<p>This sort of activity is best illustrated using examples. As it happens I have already collected a decent set at <a href=\"https:\/\/doi.org\/drrm\">https:\/\/doi.org\/drrm<\/a>,\u00a0nicely illustrating that a search query, or a collection of search queries, can themselves be considered as a valid research object! That collection used the prefix <span style=\"color: #ff0000;\"><strong>https:\/\/search.datacite.org\/works?query=<\/strong><\/span> which might usefully be considered as now obsoleted by <span style=\"color: #008000;\"><strong>https:\/\/commons.datacite.org\/?query=<\/strong><\/span>. You can take any of the original queries and try them out here. I will show just two:<\/p>\n<ol>\n<li><a href=\"https:\/\/commons.datacite.org\/?query=titles.title:*amidation*\">https:\/\/commons.datacite.org\/?query=titles.title:*amidation*<\/a> The orignal search gives 170 hits, since it is based largely on DOIs for datasets only. The new version of the search yields 1016 hits, since it includes authors and organisations as well. The results look like this, indicating 846 hits come from the CrossRef registration agency (mostly journals) and the rest from DataCite (mostly data).\n<\/li>\n<\/ol>\n<p><a href=\"https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/commons-results.jpg?ssl=1\"><img data-recalc-dims=\"1\" decoding=\"async\" class=\"aligncenter size-large wp-image-22643\" src=\"https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/commons-results.jpg?w=450&#038;ssl=1\" alt=\"\"  \/><\/a><\/p>\n<ol start=\"2\">\n<li><small><a href=\"https:\/\/commons.datacite.org\/?query=media.media_type:chemical\/x-mnpub*+AND+(subjects.subjectScheme:inchikey+AND+subjects.subject:*BHYQUOWHUMNGMD-UHFFFAOYSA-N*)+AND+(subjects.subjectScheme:NMR_Nucleus+AND+subjects.subject:11B)+AND+(subjects.subjectScheme:NMR_Solvent+AND+subjects.subject:CDCl3)\">https:\/\/commons.datacite.org\/?query=media.media_type:chemical\/x-mnpub*+AND+(subjects.subjectScheme:inchikey+AND+subjects.subject:*BHYQUOWHUMNGMD-UHFFFAOYSA-N*)+AND+(subjects.subjectScheme:NMR_Nucleus+AND+subjects.subject:11B)+AND+(subjects.subjectScheme:NMR_Solvent+AND+subjects.subject:CDCl3)<\/a><\/small> is a the other end of the spectrum for specificity, constraining the search to some very specific chemical properties, the nature of which should be reasonably obvious from the syntax of the query. This specificity is why it continues to give just one hit.<\/li>\n<\/ol>\n<p> <a href=\"https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/Screenshot-209.jpg?ssl=1\"><img data-recalc-dims=\"1\" decoding=\"async\" data-attachment-id=\"22646\" data-permalink=\"https:\/\/www.rzepa.net\/blog\/?attachment_id=22646\" data-orig-file=\"https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/Screenshot-209.jpg?fit=2480%2C984&amp;ssl=1\" data-orig-size=\"2480,984\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Screenshot 209\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/Screenshot-209.jpg?fit=300%2C119&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/Screenshot-209.jpg?fit=450%2C178&amp;ssl=1\" class=\"aligncenter size-large wp-image-22646\" src=\"https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/Screenshot-209.jpg?w=450&#038;ssl=1\" alt=\"\"  srcset=\"https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/Screenshot-209.jpg?resize=1024%2C406&amp;ssl=1 1024w, https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/Screenshot-209.jpg?resize=300%2C119&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/Screenshot-209.jpg?resize=768%2C305&amp;ssl=1 768w, https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/Screenshot-209.jpg?resize=1536%2C609&amp;ssl=1 1536w, https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/Screenshot-209.jpg?resize=2048%2C813&amp;ssl=1 2048w, https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/Screenshot-209.jpg?w=900&amp;ssl=1 900w, https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/Screenshot-209.jpg?w=1350&amp;ssl=1 1350w\" sizes=\"(max-width: 450px) 100vw, 450px\" \/><\/a><\/p>\n<p>The evolution of these search facilities gives an interesting pointer to what the future might hold. New registration agencies can be easily added to the above lists for including other kinds of research object. For example, instruments and their properties. One can combine these diverse properties into a single search, thus revealing scientific information or connections that may not be apparent from historical (chemical) abstracting agencies such as <em>e.g.<\/em> CAS or Reaxys. Importantly, all the metadata on which the indexing is based is fully open and not proprietary and currently at least searches such as the above are free at point of use (unlike the chemical registration agencies noted for which commercial licenses have to be purchased by organisations). The concept of searching for relationships across different types of PID is summarised by the term &#8220;<strong>PID Graph<\/strong>&#8220;. This in turn can reveal other properties of the objects, such as <em>e.g.<\/em> usage statistics and citations;<a href=\"https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/bildschirmfoto-2020-08-25-um-06.33.07.png?ssl=1\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"22647\" data-permalink=\"https:\/\/www.rzepa.net\/blog\/?attachment_id=22647\" data-orig-file=\"https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/bildschirmfoto-2020-08-25-um-06.33.07.png?fit=1458%2C1190&amp;ssl=1\" data-orig-size=\"1458,1190\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"bildschirmfoto-2020-08-25-um-06.33.07\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/bildschirmfoto-2020-08-25-um-06.33.07.png?fit=300%2C245&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/bildschirmfoto-2020-08-25-um-06.33.07.png?fit=450%2C367&amp;ssl=1\" class=\"aligncenter size-large wp-image-22647\" src=\"https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/bildschirmfoto-2020-08-25-um-06.33.07.png?resize=450%2C367&#038;ssl=1\" alt=\"\" width=\"450\" height=\"367\" srcset=\"https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/bildschirmfoto-2020-08-25-um-06.33.07.png?resize=1024%2C836&amp;ssl=1 1024w, https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/bildschirmfoto-2020-08-25-um-06.33.07.png?resize=300%2C245&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/bildschirmfoto-2020-08-25-um-06.33.07.png?resize=768%2C627&amp;ssl=1 768w, https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/bildschirmfoto-2020-08-25-um-06.33.07.png?w=1458&amp;ssl=1 1458w, https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/bildschirmfoto-2020-08-25-um-06.33.07.png?w=900&amp;ssl=1 900w, https:\/\/i0.wp.com\/www.rzepa.net\/blog\/wp-content\/uploads\/2020\/08\/bildschirmfoto-2020-08-25-um-06.33.07.png?w=1350&amp;ssl=1 1350w\" sizes=\"auto, (max-width: 450px) 100vw, 450px\" \/><\/a><\/p>\n<p>It is good to see this evolution of new ways of finding scientific information and I rather think that we have only just began to see the potential of this approach; there is much more to come. Exciting times ahead I fancy!<\/p>\n<hr \/>\n<p>This post has a PID: <a href=\"https:\/\/doi.org\/10.14469\/hpc\/7366\">10.14469\/hpc\/7366<\/a>.<\/p>\n<hr \/>\n<!-- kcite active, but no citations found -->\n<\/div> <!-- kcite-section 22636 -->","protected":false},"excerpt":{"rendered":"<p>The folks at DataCite have announced a new research object discovery service which aims to give users a &#8220;comprehensive overview of connections between entities in the research landscape&#8221;.\u00a0The portal https:\/\/commons.datacite.org\u00a0acts as the entry point for three basic types of persistent identifiers (PIDs); Research works, using the DOI (digital object identifier) as a PID. This includes [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[3],"tags":[1523],"class_list":["post-22636","post","type-post","status-publish","format-standard","hentry","category-chemical-it","tag-chemical-it"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p1gPyz-5T6","jetpack_likes_enabled":true,"_links":{"self":[{"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=\/wp\/v2\/posts\/22636","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=22636"}],"version-history":[{"count":0,"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=\/wp\/v2\/posts\/22636\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=22636"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=22636"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=22636"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}