Paradata in domain-agnostic research-data-related standards
| Standard | Processes covered | Process scope | Elements and representations (sample) | Granularity | Focus | Reference(s) |
|---|---|---|---|---|---|---|
| Data Package | Creation; Revisions | Data management | Licenses; Version (number); Sources (e.g. people, literature, identifiable using URIs and email addresses); Contributors (with role: author, publisher, maintainer, wrangler and contributor; organisation[al affiliation]); created (datetime) | Resource (Dataset) | Object-Attribute | Open Knowledge Foundation (2023) |
| DataCite Metadata Schema | Creation; Revisions | Identification; Retrieval | Creator (“main researchers involved in producing the data, or the authors of the publication, in priority order” with optional given name, family name, name identifier and affiliation sub-properties); Publisher (“name of the entity that holds, archives, publishes prints, distributes, releases, issues or produces the resource”); PublicationYear (date); Contributor (with optional given name, family name, name identifier and affiliation sub-properties); ContributorType (ContactPerson, DataCollector, DataCurator, DataManager, Distributor, Editor, HostingInstitution, Producer, ProjectLeader, ProjectManager, ProjectMember, RegistrationAgency, RegistrationAuthority, RelatedPerson, Researcher, ResearchGroup, RightsHolder, Sponsor, Supervisor, WorkPackageLeader, Other); Date (with type sub-property); RelatedItem (relation to another resource); Version (number); Rights (free text, URI, identifier); GeoLocation (with point, box, place and polygon sub-properties); FundingReference (with name, identifier and award-related sub-properties) | Resource (Dataset) | Object-Attribute | DataCite Metadata Working Group (2021) |
| Dublin Core | Accrual; Creation; Issuance; Modification; Replacement | Search; Retrieval | accrualMethod (value from Collection Description Accrual Method Vocabulary); accrualPeriodicity (value from Collection Description Frequency Vocabulary); accrualPolicy (value from the Collection Description Accrual Policy Vocabulary); audience (non-literal values from a vocabulary of audience types); available (date); conformsTo (standard); contributor (property: agent); created (date); creator (property: agent); date (related to the lifecycle of the resource – date); dateAccepted (date); dateCopyrighted (date); dateSubmitted (date); educationLevel (of the intended audience – property: agent); extent (size/duration – property); hasVersion (property); isPartOf (property); isReferencedBy (property: resource); isReplacedBy (property: resource); isRequiredBy (property: resource); issued (date); isVersionOf (property: resource); license (property: document); mediator (access to resource – property: agent); modified (date); provenance (free text); publisher (property: agent); references (property: resource); replaces (property: resource); requires (property: resource); rights (free text); rightsHolder (URI); source (URI); temporal (period of time); valid (date range) | Resource (Resource) | Object-Attribute | DCMI Usage Board (2020) |
| OAI-ORE (Open Archives Initiative Object Reuse and Exchange) | (Data) lineage | Identify and describe the constituents of aggregations of digital resources | Lineage (relationships between two objects); proxyFor (aggregated resource); additionally uses Dublin Core Elements and Terms, Friend of a Friend Terms, RDF Terms and RDF Schema Terms | Aggregations of resources (Compound objects consisting of multiple distributed objects) | Object-Object | Open Archives Initiative Object Reuse and Exchange (2014) |
| PREMIS | Digital provenance (history of an object); Digital object lifecycle | Digital preservation | Comprehensive metadata model documenting objects, events, agents and rights, with an event entity that aggregates information related to one or more objects (description of events, dates, outcomes, agents involved), and metadata relating to objects documenting, e.g. preservationLevelRole (controlled vocabulary); preservationLevelValue (controlled vocabulary); preservationLevelRationale (free text); dates and agents for descriptions; versions; creatingApplication (name, version, date, extensions); inhibitors to access, use, migration; originalName; storage (where object is stored); environment used to render or execute an object; relationship (between objects, e.g. derivation; object sequences; to events; event sequences) | Resource (Digital object) | Entity-Relationship | PREMIS Editorial Committee (2015) |
| CERIF | Scientific research | Constituents of (incl. activities relating to) scientific research | Activity Types (incl. Project, Conference, Fellowship, Networking, Infrastructure, Studentship) and subtypes; Activity Structure (relationships); Activity Statuses; Events (conference, workshop); PersonEventInvolvement; Research Infrastructure Usage; Research data sets and databases (output) | Domain (Scientific research and its constituents) | Entity-Relationship | CERIF and CRIS Architectures Task Group (2012) |
| DDI | Research lifecycle in quantitative and qualitative social, behavioral, economic and health sciences research | Research lifecycle | Methodological objects (incl. sample selection, data capture, weighting, quality control, process management); Processing (incl. data capture, data processing, analysis, data management); Data management (incl. ownership, access, rights management, restrictions, quality standards, organization, agent management, relationship between products, versioning, provenance); Conceptual objects (incl. representation, universe); Quantitative and qualitative data objects (incl. universe, representation, usage, record relationships, storage, access) | Resource (Dataset) | Object-Attribute | DDI Alliance (2020) |
| PROV | Provenance | Exchange of provenance information in “heterogeneous environments such as the Web” (Groth and Moreau, 2013) | Entities; Activities; Agents | Resource (Provenance information relating to a resource) | Entity-Relationship | Groth and Moreau (2013) |
| NetCDF | History; Provenance | Modifications | History; Provenance attributes | Resource (Array-oriented scientific data) | Object-Attribute | Rew et al. (1989), Unidata (2019) |
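
To make the Data Package row more concrete, the following minimal sketch shows how the elements listed there (licenses, version, sources, contributors with roles, created) might appear in a descriptor and how the creation- and revision-related paradata could be pulled out of it. All values are invented for illustration, `summarise_paradata` is a hypothetical helper rather than part of any Data Package tooling, and the fallback to the `contributor` role for entries without a role is an assumption of this sketch.

```python
import json
from datetime import datetime

# Hypothetical descriptor: every value below is illustrative only.
descriptor = {
    "name": "example-dataset",
    "version": "1.1.0",
    "created": "2023-01-15T10:30:00+00:00",
    "licenses": [
        {"name": "CC-BY-4.0",
         "path": "https://creativecommons.org/licenses/by/4.0/"}
    ],
    "sources": [
        {"title": "Example field survey", "path": "https://example.org/survey"}
    ],
    "contributors": [
        {"title": "A. Researcher", "role": "author",
         "organization": "Example University"},
        {"title": "B. Curator", "role": "wrangler"},
    ],
}

# Roles named in the table row for Data Package.
ALLOWED_ROLES = {"author", "publisher", "maintainer", "wrangler", "contributor"}


def summarise_paradata(descriptor):
    """Collect the creation- and revision-related elements of a descriptor."""
    # Assumption: a contributor without an explicit role counts as "contributor".
    roles = [c.get("role", "contributor")
             for c in descriptor.get("contributors", [])]
    unknown = sorted(set(roles) - ALLOWED_ROLES)
    if unknown:
        raise ValueError(f"unexpected contributor roles: {unknown}")
    return {
        "created": datetime.fromisoformat(descriptor["created"]),
        "version": descriptor.get("version"),
        "roles": roles,
        "source_count": len(descriptor.get("sources", [])),
        "licenses": [lic["name"] for lic in descriptor.get("licenses", [])],
    }


summary = summarise_paradata(descriptor)
print(json.dumps(summary, default=str, indent=2))
```

The sketch illustrates the object-attribute focus noted in the table: the descriptor records who contributed and when the package was created as attributes of the dataset, but it does not describe the individual activities that produced it.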