{"id":22363,"date":"2019-04-18T09:50:22","date_gmt":"2019-04-18T08:50:22","guid":{"rendered":"https:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=20675"},"modified":"2019-04-18T09:50:22","modified_gmt":"2019-04-18T08:50:22","slug":"the-acessible-in-fair-data-2","status":"publish","type":"post","link":"https:\/\/www.rzepa.net\/blog\/?p=22363","title":{"rendered":"The &quot;Accessible&quot; in FAIR (data)."},"content":{"rendered":"<div class=\"kcite-section\" kcite-section-id=\"22363\">\n<p>In a <a href=\"https:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=20669\">previous post<\/a>, I looked at the <span style=\"color: #ff0000;\">F<\/span>indability of FAIR data in common chemistry journals. Here I move on to the next letter, the <span style=\"color: #ff0000;\">A<\/span> = <span style=\"color: #ff0000;\">A<\/span>ccessible.<\/p>\n<p>The attributes of <span style=\"color: #ff0000;\">A<\/span><span id=\"cite_ITEM-22363-0\" name=\"citation\"><a href=\"#ITEM-22363-0\">[1]<\/a><\/span> include:<\/p>\n<ol>\n<li>(meta)data are retrievable by their identifier using a standardized communication protocol.<\/li>\n<li>the protocol is open, free and universally implementable.<\/li>\n<li>the protocol allows for an authentication and authorization procedure.<\/li>\n<li>metadata are accessible, even when the data are no longer available.<\/li>\n<li>The metadata should include access information that enables automatic processing by a machine as well as a person.<\/li>\n<\/ol>\n<p>Items 1-2 are covered by associating a DOI (digital object identifier) with the metadata. Item 3 relates to data which is not necessarily also OPEN (FAIR and OPEN are complementary, but do not mean the same thing).<\/p>\n<p>Item 4 mandates that a copy of the metadata be held separately from the data itself; currently the favoured repository is DataCite (and this metadata may well be duplicated at CrossRef, thus providing a measure of redundancy). 
It also bears on an interesting debate about whether a container for data, such as a ZIP or other compressed archive, should itself contain the full metadata descriptors. Doing so would not on its own satisfy item 4, but could if a copy of the metadata were also registered externally, <em>e.g.<\/em> with DataCite.<\/p>\n<p>Item 4 also implies some measure of separation between the data and its metadata, which raises an interesting and separate issue (introduced <a href=\"https:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=20634\">with this post<\/a>): the metadata can be considered a living object, with some attributes being updated after deposition of the data itself. Such metadata could thus include an identifier for the journal article relating to the data, information that only appears after the FAIR data itself is published, or pointers to other datasets published at a later date. Updating metadata held inside an archive along with the data would be problematic, since the data itself should not be a living object.<\/p>\n<p>Item 5 is the need for Accessibility to relate both to a human acquiring FAIR data and to a machine. The latter needs direct information on exactly how to access the data. 
To illustrate this, I will use data deposited in support of the <a href=\"https:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=20679\" target=\"_blank\" rel=\"noopener noreferrer\">previous post<\/a>, for which a representative example of the metadata can be found at a separate location (item 4):<br \/>\n<small><tt><a href=\"https:\/\/data.datacite.org\/application\/vnd.datacite.datacite+xml\/10.14469\/hpc\/5496\" target=\"_blank\" rel=\"noopener noreferrer\">data.datacite.org\/application\/vnd.datacite.datacite+xml\/10.14469\/hpc\/5496<\/a><\/tt><\/small><\/p>\n<p>This contains the components:<\/p>\n<ol start=\"6\">\n<li><small><tt>&lt;relatedIdentifier relatedIdentifierType=\"URL\" relationType=\"HasMetadata\" relatedMetadataScheme=\"ORE\" schemeURI=\"http:\/\/www.openarchives.org\/ore\/<br \/>\n\"&gt;https:\/\/data.hpc.imperial.ac.uk\/resolve\/?ore=5496&lt;\/relatedIdentifier&gt;<\/tt><\/small><\/li>\n<li><small><tt>&lt;relatedIdentifier relatedIdentifierType=\"URL\" relationType=\"HasPart\" relatedMetadataScheme=\"Filename\" schemeURI=\"filename:\/\/aW5wdXQuZ2pm\"&gt;https:\/\/data.hpc.imperial.ac.uk\/resolve\/?doi=5496&amp;file=1&lt;\/relatedIdentifier&gt;<\/tt><\/small><\/li>\n<\/ol>\n<p>Item 6 is a machine-suitable RDF declaration of the <a href=\"https:\/\/data.hpc.imperial.ac.uk\/resolve\/?ore=5496\" target=\"metadata\" rel=\"noopener noreferrer\">full metadata record<\/a>. Item 7 allows direct access to the datafile. This in turn allows programmed interfaces to the data to be constructed, including <em>e.g.<\/em> components for immediate visualisation and\/or analysis. It also allows access on a large scale (mining), something a human is unlikely to try.<\/p>\n<p>It would be fair to say that the A of FAIR is still evolving. Moreover, searches of the DataCite metadata database are not yet at the point where one can automatically identify metadata records that have these attributes. 
When such searches do become available, I will show some examples here.<\/p>\n<hr \/>\n<p><b>Added:<\/b> This search: <a href=\"https:\/\/search.test.datacite.org\/works?query=relatedIdentifiers.relatedMetadataScheme:ORE\" target=\"_blank\" rel=\"noopener noreferrer\">https:\/\/search.test.datacite.org\/works?<br \/>\nquery=relatedIdentifiers.relatedMetadataScheme:ORE<\/a> shows how such a search might operate.<\/p>\n<h2>References<\/h2>\n    <ol class=\"kcite-bibliography csl-bib-body\"><li id=\"ITEM-22363-0\">M.D. Wilkinson, M. Dumontier, I.J. Aalbersberg, G. Appleton, M. Axton, A. Baak, N. Blomberg, J. Boiten, L.B. da Silva Santos, P.E. Bourne, J. Bouwman, A.J. Brookes, T. Clark, M. Crosas, I. Dillo, O. Dumon, S. Edmunds, C.T. Evelo, R. Finkers, A. Gonzalez-Beltran, A.J. Gray, P. Groth, C. Goble, J.S. Grethe, J. Heringa, P.A. \u2019t Hoen, R. Hooft, T. Kuhn, R. Kok, J. Kok, S.J. Lusher, M.E. Martone, A. Mons, A.L. Packer, B. Persson, P. Rocca-Serra, M. Roos, R. van Schaik, S. Sansone, E. Schultes, T. Sengstag, T. Slater, G. Strawn, M.A. Swertz, M. Thompson, J. van der Lei, E. van Mulligen, J. Velterop, A. Waagmeester, P. Wittenburg, K. Wolstencroft, J. Zhao, and B. Mons, \"The FAIR Guiding Principles for scientific data management and stewardship\", <i>Scientific Data<\/i>, vol. 3, 2016. <a href=\"https:\/\/doi.org\/10.1038\/sdata.2016.18\">https:\/\/doi.org\/10.1038\/sdata.2016.18<\/a>\n\n<\/li>\n<\/ol>\n\n<\/div> <!-- kcite-section 22363 -->","protected":false},"excerpt":{"rendered":"<p>In a previous post, I looked at the Findability of FAIR data in common chemistry journals. Here I move on to the next letter, the A = Accessible. The attributes of A include: (meta)data are retrievable by their identifier using a standardized communication protocol. 
the protocol is open, free and universally implementable.\u00a0 the protocol allows [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[3],"tags":[1397,2525,1428,811,2562,2570,2440,1721,2318,2444,2319,1473,1474,1074,2590,2599,314,2616,317,2630,1745,1510,2470,2652,390],"class_list":["post-22363","post","type-post","status-publish","format-standard","hentry","category-chemical-it","tag-academic-publishing","tag-automatic-processing","tag-data-management","tag-digital-object-identifier","tag-eidr","tag-fair-data","tag-findability","tag-identifiers","tag-information","tag-information-architecture","tag-information-science","tag-knowledge","tag-knowledge-representation","tag-metadata","tag-mining","tag-open-archives-initiative","tag-rdf","tag-records-management","tag-representative","tag-standardized-communication-protocol","tag-technical-communication","tag-technologyinternet","tag-web-design","tag-written-communication","tag-xml"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p1gPyz-5OH","jetpack_likes_enabled":true,"_links":{"self":[{"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=\/wp\/v2\/posts\/22363","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=\/wp\/v2\/posts"}],
"about":[{"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=22363"}],"version-history":[{"count":0,"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=\/wp\/v2\/posts\/22363\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=22363"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=22363"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.rzepa.net\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=22363"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}