Persistent Identifiers (PIDs)
Persistent identifiers (PIDs) are unique, long-lasting references to identified sources, such as publications, datasets, institutions, persons or analytical instrumentation, and allow to reliably locate, identify, and verify the references.
The two main PIDs employed within the chemistry community are Digital Object Identifiers (DOIs) as PIDs for objects such as publications, datasets, or software as well as Open Research and Contributor iDs (ORCID iDs) for people. Additionally, the Research Organization Registry (ROR) is used to identify research institutions. Current development also includes Persistent Identification of Instruments (PIDINST) as PIDs for the instrumentation used to obtain research data. The Persistent Identifiers for eResearch (ePIC) is specifically designated for digital research data storage.
The main benefits of PIDs for chemistry researchers are increased findability, visibility, and ease of access of their scientific work. In the context of research data, PIDs are especially useful to interconnect publications and datasets in repositories.
Digital Object Identifiers are PIDs for objects such as publications and datasets, but also physical objects. DOIs are resolved based on the Handle Systemto lead to the corresponding landing page and are assigned by members of the International DOI Foundation. Most well-recognised DOI registration agencies include CrossRef for publications and DataCite focusing on datasets.
A DOI consists of a character string divided into two parts by a slash. The prefix always starts with a
10. and identifies the registrant, who retrieves a DOI from a DOI registration agency. Every registrant has its own unique prefix. The suffix identifies a specific object registered by a registrant via a registration agency.
DOIs are most frequently displayed as an URL such as
https://doi.org/10.1000/182 and include a resolver as HTTP proxy server to redirect to the linked object. DOIs are standardised by the International Organisation for Standardisation (ISO) in ISO 26324.
Most importantly, domain-independent metadata also including provenance information is attached to the DOI. This metadata follows the metadata schemes provided by registration agencies such as the CrossRef Metadata Schema and the DataCite Metadata Schema.
Registrants are liable to update the URL connected to the DOI if the object is moved.
ORCID iDs are open and non-proprietary PIDs for authors also described as unique author identifiers and can be retrieved and used by any author free of charge. They are provided by the non-profit organization ORCID Inc., formerly called Open Research and Contributors ID organisation, and sustained by fees of member organisations, providing extended application program interfaces (API) for their members to integrate ORCID services.
An ORCID iD is a 16-digit alphanumerical code to unambiguously identify authors, whille author names might not be unique, may change, or may have different ordering conventions depending on cultural differences. ORCID iDs can be depicted as an URL, e.g.
https://orcid.org/0000-0000-0000-0000 but are also typically depicted without the resolver as an URN, e.g.
ORCID:0000-0000-0000-0000. ORCID iDs are a subset of International Standards Name Identifiers (ISNI) provided by the International Organisation for Standardisation (ISO) as ISO 27729.
The ORCID record connected to ORCID iDs includes the bibliographic output and may also provide information on employment, education and qualification, invited positions and distinctions, memberships as well as service and funding, depending on the privacy setting made by the author.
In addition to ORCID iDs as non-proprietary PIDs for authors, there are also proprietary author identifiers such as Thomson Reuters’ Web of Science ResearcherID and Elsevier’s Scopus Author ID.
Resarch Organisation Registry
The Research Organisation Registry (ROR) provides persistent identifiers for research organisations, while names might not be unique, may change and organisations might be merged, split, shut down or re-emerge. ROR enables to connect research organisations to research outputs and their researchers by identifying their affiliations.
ROR should be provided in affiliations of researchers in scientific publications as well as for data publications.
ROR does support parent-child hierarchies as well as lateral relationships between organisations. An example might include a university having a child organisation such as a research institute [German: Fakultät] and related organisations such as a university hospital. However, ROR focuses on fundamental affiliations and is not focused on all subdivisions of an organisations such as departments [German: Institut] as high-level institutional information is more stable.
The Persistent Identifier for eResearch (ePICs) are based on the Handle.Net Registry (HNR) and are intended to be used for unpublished digital research objects. To this end, ePICs can be assigned at early stages of research to locate research data and may be used by various research data management or archiving systems. Upon publication of a dataset, the full research data can be linked via the ePIC, for example, in cases that include proprietary or sensitive information.
This type of PID will commonly lead to a landing page, the content of which is maintained by the PID owner and includes the dataset's metadata or a method of contacting those responsible for the dataset.
Sources and further information
- English: DOI Basics - DataCite
- German: Persistent Identifiers - Technische Informationsbibliothek (TIB), Hannover
- German: Digital Object Identifiers - Technische Informationsbibliothek (TIB), Hannover
- German: Open Research and Contributor ID (ORICD iD) - Technische Informationsbibliothek (TIB), Hannover
- English: ePIC FAQs - PID Consortium
Main author: ORCID:0000-0003-4480-8661