The Yoda metadata form explained

This table explains some of the fields in the Yoda metadata form. The ‘Mandatory’ column states whether a field is mandatory when archiving a data package (Y: mandatory; N: not mandatory).

Field Mandatory Description Format 
Title Title of your data package. When you publish your data package, the title will be harvested by other catalogues.  Maximum length: 255 characters. 
Description A concise description (abstract) of your data package, containing for example subject, sample size, methodology, etc. Maximum length: 2700 characters. 
Discipline Choose the (sub)discipline of the study from the list. The list contains a combination of research disciplines and subdisciplines. The list uses the OECD FOS 2007 standard. 
Version Version number or date of the dataset. Yoda does not automatically assign version numbers to data packages. If you create multiple versions, you can register the version number yourself. Free format. For example: v1.0 (semantic version), v2024.09.20 (date version). 
Language of the data Choose the main language of the data in the dataset. The list uses the ISO 639-1 standard. 
Collection process: start and end date Start and end date of data collection. YYYY-MM-DD 
Location(s) covered Indication of the geographical entities (countries, regions, cities) covered within this data package. English naming convention preferred. We recommend using the preferred spelling from the Getty Thesaurus of Geographic Names. Add only one location per line: use the plus sign to add more values. Maximum length: 255 characters. 
Period covered: start and end date Indication of the start and end dates of the period covered by your dataset. This is not necessarily the same as the collection dates (for example: historical data may be collected in 2024 but cover 1900). YYYY-MM-DD 
Keywords Keywords/tags that describe your data package and help others find it more easily. Free format. Add only one keyword per line: use the plus sign to add more values. Maximum length: 255 characters. 
Related resource Any resources (articles, data packages, software, etc.) related to your data package, their identifier/link and how they are related to your data package. It is possible to add multiple related resources: use the plus sign to add more values. 
Related resource – Relation type Choose how the related resource is related to the current data package. The list uses the DataCite relationType vocabulary. 
Related resource – Title Title of the related resource. There is no automatic check whether the title matches the persistent identifier. Maximum length: 255 characters. 
Related resource – Persistent identifier The Identifier and Identifier type for the related resource. For example: type: DOI, identifier: http://doi.org/10.24416/UU01-729A2Y   
Retention period (years) The minimum number of years that the data package should be preserved in the Vault. Number (integer). Default: 10 years. 
Retention information Text field for remarks about the retention period. Use this field if you deviate from the default retention period. Free format. 
Embargo end date It is possible to set an embargo on the data package. The metadata is published immediately, but the data only becomes available once the embargo ends. Specify the date on which the embargo should end. YYYY-MM-DD 
Data type Choose what type of package it concerns. Datapackage (default), Software, Method, Other document 
Data classification Choose how sensitive the data is in terms of confidentiality, integrity and availability.  
Name of collection If the data package is part of a larger (conceptual) collection, you can enter the collection name here. The research group should ensure that all other data packages in the collection are archived with the same collection name. Maximum length: 255 characters. 
Funding reference The funding source(s) of your data package. This field can have multiple values: use the plus sign to add more values. 
Funding reference – Funder Name of the organization funding the research. For example: Dutch Research Council. Use names as specified in the Research Organization Registry (ROR). Maximum length: 255 characters. 
Funding reference – Award number The grant number issued by the funding organization. Free format. 
Remarks N Any remark from the data manager, for example feedback when a data package is rejected. As a researcher, leave this empty. Free format. 
Creator The author(s)/creator(s) of the data package. This field can have multiple values: use the plus sign to add more creators. 
Creator – Name The personal/first name (Given Name) and surname/last name (Family Name) of the creator. Maximum length: 255 characters. 
Creator – Affiliation Select the organizational or institutional affiliation of the creator. The Affiliation identifier (ROR) will automatically appear. This field can have multiple values: use the plus sign to add more affiliations for the creator. 
Creator – Person identifier The Identifier and Identifier type for the creator, such as AuthorID, ORCID, or ResearcherID. Each creator can have multiple persistent identifiers: use the plus sign to add more person identifiers. Maximum length: 255 characters. 
Contributor The person(s) who contributed to this data package. This field can have multiple values: use the plus sign to add more contributors. 
Contributor – Name The personal/first name (Given Name) and surname/last name (Family Name) of the contributor. Maximum length: 255 characters. 
Contributor – Contributor type Choose how the contributor primarily contributed to the data package. The list uses the DataCite contributorType vocabulary. 
Contributor – Affiliation Select the organizational or institutional affiliation of the contributor. The Affiliation identifier (ROR) will automatically appear. This field can have multiple values: use the plus sign to add more affiliations for the contributor. 
Contributor – Person identifier The Identifier and Identifier type for the contributor, such as AuthorID, ORCID, or ResearcherID. Each contributor can have multiple persistent identifiers: use the plus sign to add more person identifiers. Maximum length: 255 characters. 
Data package access Choose the access level under which the data package should be made available once published. Open – freely retrievable (publicly available), Restricted – available upon request (available only once specified conditions are met), Closed (not shared). 
License The terms that specify what others are allowed to do with the contents of the data package. If the Data package access is set to ‘Open – freely retrievable’, you can choose from a number of commonly used licenses (recommended: Creative Commons Attribution 4.0). If the data package is restricted or closed, select Custom and add a License.txt file to your data package. Contact your data manager for help with this.
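
The format constraints listed in the table (maximum lengths, YYYY-MM-DD dates, an integer retention period with a default of 10 years) can be checked before you fill in the form. The following is a minimal Python sketch and not part of Yoda: the record layout, the key names (such as collection_start and retention_years) and the functions is_valid_date and check_record are hypothetical and chosen only for illustration.

```python
import re
from datetime import datetime

# Limits taken from the table above.
MAX_TEXT_LENGTH = 255           # Title, Keywords, names, etc.
MAX_DESCRIPTION_LENGTH = 2700   # Description
DEFAULT_RETENTION_YEARS = 10    # Retention period default

DATE_PATTERN = re.compile(r"\d{4}-\d{2}-\d{2}")


def is_valid_date(value):
    """Return True if value is a real calendar date written as YYYY-MM-DD."""
    if not isinstance(value, str) or not DATE_PATTERN.fullmatch(value):
        return False
    try:
        datetime.strptime(value, "%Y-%m-%d")
    except ValueError:
        return False
    return True


def check_record(record):
    """Return a list of problems found in a hypothetical metadata record (a dict)."""
    problems = []

    if len(record.get("title", "")) > MAX_TEXT_LENGTH:
        problems.append("title: longer than 255 characters")
    if len(record.get("description", "")) > MAX_DESCRIPTION_LENGTH:
        problems.append("description: longer than 2700 characters")

    # One keyword per entry, each limited to 255 characters.
    for index, keyword in enumerate(record.get("keywords", [])):
        if len(keyword) > MAX_TEXT_LENGTH:
            problems.append(f"keywords[{index}]: longer than 255 characters")

    # Date fields are only checked when present.
    date_keys = ("collection_start", "collection_end",
                 "period_start", "period_end", "embargo_end")
    for key in date_keys:
        if key in record and not is_valid_date(record[key]):
            problems.append(f"{key}: expected YYYY-MM-DD")

    # The retention period is an integer number of years (bool is excluded).
    retention = record.get("retention_years", DEFAULT_RETENTION_YEARS)
    if isinstance(retention, bool) or not isinstance(retention, int):
        problems.append("retention_years: expected an integer")

    return problems


record = {
    "title": "Survey data 2024",
    "collection_start": "2024-01-15",
    "collection_end": "15-03-2024",   # wrong order of day, month and year
    "keywords": ["survey"],
}
print(check_record(record))  # ['collection_end: expected YYYY-MM-DD']
```

The sketch only covers constraints that the table states explicitly. Choices made from a list in the form (discipline, language, license and so on) are left out, because the lists themselves are not reproduced here.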