References

Bioinformatics

DATS: the data tag suite to enable discoverability of datasets

Today's science increasingly requires effective ways to find and access existing datasets that are distributed across a range of repositories. For researchers in the life sciences, discoverability of datasets may soon become as essential as identifying the latest publications via PubMed. Through an international collaborative effort funded by the National Institutes of Health (NIH)'s Big Data to Knowledge (BD2K) initiative, we have designed and implemented the DAta Tag Suite (DATS) model to support the DataMed data discovery index. DataMed's goal is to be for data what PubMed has been for the scientific literature. Akin to the Journal Article Tag Suite (JATS) used in PubMed, the DATS model enables submission of metadata on datasets to DataMed. DATS has a core set of elements, which are generic and applicable to any type of datasets, and an extended set that can accommodate more specialized data types. DATS is a platform-independent model also available as a Schema.org annotated serialization to be used beyond DataMed, for example, in projects like DataCite.

DataCite Metadata Schema Documentation for the Publication and Citation of Research Data v4.0

Published
Authors DataCite Metadata Working Group, Joan Starr, Madeleine de Smaele, Jan Ashton, Amy Barton, Tina Bradford, Anne Ciolek-Figiel, Stefanie Dietiker, Jannean Elliot, Berrit Genat, Karoline Harzenetter, Barbara Hirschmann, Stefan Jakobsson, Jean-Yves Mailloux, Elizabeth Newbold, Lars Holm Nielsen, Mohamed Yahia, Frauke Ziedorn

1 Introduction 1.1 The DataCite Consortium 1.2 DataCite Community Participation 1.3 The Metadata Schema 1.4 Version 4.0 Update 2 DataCite Metadata Properties 2.1 Overview 2.2 Citation 2.3 DataCite Properties 3 XML Example 4 XML Schema 5 Other DataCite Services Appendices Appendix 1: Controlled List Definitions Appendix 2: Earlier Version Update Notes

Technical and Human Infrastructure for Open Research (THOR)

Five years ago, a global infrastructure to uniquely attribute to researchers their scientific artefacts (articles, data, software…) appeared technically and socially infeasible. Since then, DataCite has minted over 3.5m unique identifiers for data. ORCID has deployed an open solution for identification of contributors with over 850,000 registrants in less than 2 years. THOR will leverage these emerging global infrastructures to support the H2020 goal to make every researcher digital and increase creativity and efficiency of research, while bridging the R&D divide between developed and less-developed regions. We will establish interoperability between existing resources, linking digital identifiers across platforms and propagating attribution information. We will integrate PID services across the research lifecycle and data publishing workflows in four advanced research communities, and then roll-out core services and service building blocks for the wider community. These open resources will foster an open and sustainable e-infrastructure across stakeholders to avoid duplications, give economies of scale, richness of services and the ability to respond rapidly to opportunities for innovation. THOR is not just relevant to the EINFRA-7-1024 Call, but will become a pervasive element of the EINFRA family of e-Infrastructure resources over the next 3 years. It will allow data-management and curation services to exploit knowledge of data location and attribution; provide robust and persistent mechanism for linking literature and data; enable search and resolving services and generate incentives for Open Science; deliver provenance and attribution mechanisms to underpin data exchange; and provide minting and resolving services for data citation workflows. Its impact will enable third-party services, no-profit and commercial, to leverage the scholarly record.

General Computer Science

Achieving human and machine accessibility of cited data in scholarly publications

Published in PeerJ Computer Science
Authors Joan Starr, Eleni Castro, Mercè Crosas, Michel Dumontier, Robert R. Downs, Ruth Duerr, Laurel L. Haak, Melissa Haendel, Ivan Herman, Simon Hodson, Joe Hourclé, John Ernest Kratz, Jennifer Lin, Lars Holm Nielsen, Amy Nurnberger, Stefan Proell, Andreas Rauber, Simone Sacchi, Arthur Smith, Mike Taylor, Tim Clark