Achieving human and machine accessibility of cited data in scholarly publications

Joan Starr, Eleni Castro, Mercè Crosas, Michel Dumontier, Robert R. Downs, Ruth Duerr, Laurel L. Haak, Melissa Haendel, Ivan Herman, Simon Hodson, Joe Hourclé, John Ernest Kratz, Jennifer Lin, Lars Holm Nielsen, Amy Nurnberger, Stefan Proell, Andreas Rauber, Simone Sacchi, Arthur Smith, Mike TaylorTim Clark

Research output: Contribution to journalArticle

44 Citations (Scopus)

Abstract

Reproducibility and reusability of research results is an important concern in scientific communication and science policy. A foundational element of reproducibility and reusability is the open and persistently available presentation of research data. However, many common approaches for primary data publication in use today do not achieve sufficient long-term robustness, openness, accessibility or uniformity. Nor do they permit comprehensive exploitation by modern Web technologies. This has led to several authoritative studies recommending uniform direct citation of data archived in persistent repositories. Data are to be considered as first-class scholarly objects, and treated similarly in many ways to cited and archived scientific and scholarly literature. Here we briefly review the most current and widely agreed set of principle-based recommendations for scholarly data citation, the Joint Declaration of Data Citation Principles (JDDCP). We then present a framework for operationalizing the JDDCP; and a set of initial recommendations on identifier schemes, identifier resolution behavior, required metadata elements, and best practices for realizing programmatic machine actionability of cited data. The main target audience for the common implementation guidelines in this article consists of publishers, scholarly organizations, and persistent data repositories, including technical staff members in these organizations. But ordinary researchers can also benefit fromthese recommendations. The guidance provided here is intended to help achieve widespread, uniform human and machine accessibility of deposited data, in support of significantly improved verification, validation, reproducibility and re-use of scholarly/scientific data.

Original languageEnglish (US)
Article numbere1
JournalPeerJ
Volume2015
Issue number1
DOIs
StatePublished - 2015

Fingerprint

Reusability
reproducibility
Publications
Joints
Organizations
Metadata
Practice Guidelines
Research
world wide web
Communication
Research Personnel
Guidelines
communication (human)
Technology
researchers

Keywords

  • Data accessibility
  • Data archiving
  • Data citation
  • Machine accessibility

ASJC Scopus subject areas

  • Neuroscience(all)
  • Medicine(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Agricultural and Biological Sciences(all)

Cite this

Starr, J., Castro, E., Crosas, M., Dumontier, M., Downs, R. R., Duerr, R., ... Clark, T. (2015). Achieving human and machine accessibility of cited data in scholarly publications. PeerJ, 2015(1), [e1]. https://doi.org/10.7717/peerj-cs.1

Achieving human and machine accessibility of cited data in scholarly publications. / Starr, Joan; Castro, Eleni; Crosas, Mercè; Dumontier, Michel; Downs, Robert R.; Duerr, Ruth; Haak, Laurel L.; Haendel, Melissa; Herman, Ivan; Hodson, Simon; Hourclé, Joe; Kratz, John Ernest; Lin, Jennifer; Nielsen, Lars Holm; Nurnberger, Amy; Proell, Stefan; Rauber, Andreas; Sacchi, Simone; Smith, Arthur; Taylor, Mike; Clark, Tim.

In: PeerJ, Vol. 2015, No. 1, e1, 2015.

Research output: Contribution to journalArticle

Starr, J, Castro, E, Crosas, M, Dumontier, M, Downs, RR, Duerr, R, Haak, LL, Haendel, M, Herman, I, Hodson, S, Hourclé, J, Kratz, JE, Lin, J, Nielsen, LH, Nurnberger, A, Proell, S, Rauber, A, Sacchi, S, Smith, A, Taylor, M & Clark, T 2015, 'Achieving human and machine accessibility of cited data in scholarly publications', PeerJ, vol. 2015, no. 1, e1. https://doi.org/10.7717/peerj-cs.1
Starr J, Castro E, Crosas M, Dumontier M, Downs RR, Duerr R et al. Achieving human and machine accessibility of cited data in scholarly publications. PeerJ. 2015;2015(1). e1. https://doi.org/10.7717/peerj-cs.1
Starr, Joan ; Castro, Eleni ; Crosas, Mercè ; Dumontier, Michel ; Downs, Robert R. ; Duerr, Ruth ; Haak, Laurel L. ; Haendel, Melissa ; Herman, Ivan ; Hodson, Simon ; Hourclé, Joe ; Kratz, John Ernest ; Lin, Jennifer ; Nielsen, Lars Holm ; Nurnberger, Amy ; Proell, Stefan ; Rauber, Andreas ; Sacchi, Simone ; Smith, Arthur ; Taylor, Mike ; Clark, Tim. / Achieving human and machine accessibility of cited data in scholarly publications. In: PeerJ. 2015 ; Vol. 2015, No. 1.
@article{c5d985c26464464b852bf59e7c503a79,
title = "Achieving human and machine accessibility of cited data in scholarly publications",
abstract = "Reproducibility and reusability of research results is an important concern in scientific communication and science policy. A foundational element of reproducibility and reusability is the open and persistently available presentation of research data. However, many common approaches for primary data publication in use today do not achieve sufficient long-term robustness, openness, accessibility or uniformity. Nor do they permit comprehensive exploitation by modern Web technologies. This has led to several authoritative studies recommending uniform direct citation of data archived in persistent repositories. Data are to be considered as first-class scholarly objects, and treated similarly in many ways to cited and archived scientific and scholarly literature. Here we briefly review the most current and widely agreed set of principle-based recommendations for scholarly data citation, the Joint Declaration of Data Citation Principles (JDDCP). We then present a framework for operationalizing the JDDCP; and a set of initial recommendations on identifier schemes, identifier resolution behavior, required metadata elements, and best practices for realizing programmatic machine actionability of cited data. The main target audience for the common implementation guidelines in this article consists of publishers, scholarly organizations, and persistent data repositories, including technical staff members in these organizations. But ordinary researchers can also benefit fromthese recommendations. The guidance provided here is intended to help achieve widespread, uniform human and machine accessibility of deposited data, in support of significantly improved verification, validation, reproducibility and re-use of scholarly/scientific data.",
keywords = "Data accessibility, Data archiving, Data citation, Machine accessibility",
author = "Joan Starr and Eleni Castro and Merc{\`e} Crosas and Michel Dumontier and Downs, {Robert R.} and Ruth Duerr and Haak, {Laurel L.} and Melissa Haendel and Ivan Herman and Simon Hodson and Joe Hourcl{\'e} and Kratz, {John Ernest} and Jennifer Lin and Nielsen, {Lars Holm} and Amy Nurnberger and Stefan Proell and Andreas Rauber and Simone Sacchi and Arthur Smith and Mike Taylor and Tim Clark",
year = "2015",
doi = "10.7717/peerj-cs.1",
language = "English (US)",
volume = "2015",
journal = "PeerJ",
issn = "2167-8359",
publisher = "PeerJ",
number = "1",

}

TY - JOUR

T1 - Achieving human and machine accessibility of cited data in scholarly publications

AU - Starr, Joan

AU - Castro, Eleni

AU - Crosas, Mercè

AU - Dumontier, Michel

AU - Downs, Robert R.

AU - Duerr, Ruth

AU - Haak, Laurel L.

AU - Haendel, Melissa

AU - Herman, Ivan

AU - Hodson, Simon

AU - Hourclé, Joe

AU - Kratz, John Ernest

AU - Lin, Jennifer

AU - Nielsen, Lars Holm

AU - Nurnberger, Amy

AU - Proell, Stefan

AU - Rauber, Andreas

AU - Sacchi, Simone

AU - Smith, Arthur

AU - Taylor, Mike

AU - Clark, Tim

PY - 2015

Y1 - 2015

N2 - Reproducibility and reusability of research results is an important concern in scientific communication and science policy. A foundational element of reproducibility and reusability is the open and persistently available presentation of research data. However, many common approaches for primary data publication in use today do not achieve sufficient long-term robustness, openness, accessibility or uniformity. Nor do they permit comprehensive exploitation by modern Web technologies. This has led to several authoritative studies recommending uniform direct citation of data archived in persistent repositories. Data are to be considered as first-class scholarly objects, and treated similarly in many ways to cited and archived scientific and scholarly literature. Here we briefly review the most current and widely agreed set of principle-based recommendations for scholarly data citation, the Joint Declaration of Data Citation Principles (JDDCP). We then present a framework for operationalizing the JDDCP; and a set of initial recommendations on identifier schemes, identifier resolution behavior, required metadata elements, and best practices for realizing programmatic machine actionability of cited data. The main target audience for the common implementation guidelines in this article consists of publishers, scholarly organizations, and persistent data repositories, including technical staff members in these organizations. But ordinary researchers can also benefit fromthese recommendations. The guidance provided here is intended to help achieve widespread, uniform human and machine accessibility of deposited data, in support of significantly improved verification, validation, reproducibility and re-use of scholarly/scientific data.

AB - Reproducibility and reusability of research results is an important concern in scientific communication and science policy. A foundational element of reproducibility and reusability is the open and persistently available presentation of research data. However, many common approaches for primary data publication in use today do not achieve sufficient long-term robustness, openness, accessibility or uniformity. Nor do they permit comprehensive exploitation by modern Web technologies. This has led to several authoritative studies recommending uniform direct citation of data archived in persistent repositories. Data are to be considered as first-class scholarly objects, and treated similarly in many ways to cited and archived scientific and scholarly literature. Here we briefly review the most current and widely agreed set of principle-based recommendations for scholarly data citation, the Joint Declaration of Data Citation Principles (JDDCP). We then present a framework for operationalizing the JDDCP; and a set of initial recommendations on identifier schemes, identifier resolution behavior, required metadata elements, and best practices for realizing programmatic machine actionability of cited data. The main target audience for the common implementation guidelines in this article consists of publishers, scholarly organizations, and persistent data repositories, including technical staff members in these organizations. But ordinary researchers can also benefit fromthese recommendations. The guidance provided here is intended to help achieve widespread, uniform human and machine accessibility of deposited data, in support of significantly improved verification, validation, reproducibility and re-use of scholarly/scientific data.

KW - Data accessibility

KW - Data archiving

KW - Data citation

KW - Machine accessibility

UR - http://www.scopus.com/inward/record.url?scp=85019316984&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85019316984&partnerID=8YFLogxK

U2 - 10.7717/peerj-cs.1

DO - 10.7717/peerj-cs.1

M3 - Article

AN - SCOPUS:85019316984

VL - 2015

JO - PeerJ

JF - PeerJ

SN - 2167-8359

IS - 1

M1 - e1

ER -