The Yoda metadata form explained
In this table, some fields in the Yoda metadata form are explained. The values in the ‘Mandatory’ column state whether the field is mandatory (Y: mandatory; N: not mandatory) when archiving a data package.
| Field | Mandatory | Description | Format |
| Title | Y | Title of your data package. When you publish your data package, the title will be harvested by other catalogues. | Maximum length: 255 characters. |
| Description | Y | A concise description (abstract) of your data package, containing for example subject, sample size, methodology, etc. | Maximum length: 2700 characters. |
| Discipline | Y | Choose the (sub)discipline of the study from the list. The list contains a combination of research disciplines and subdisciplines. | The list uses the OECD FOS 2007 standard. |
| Version | N | Version number or date of the dataset. Yoda does not automatically assign version numbers to data packages. If you create multiple versions, you can register the version number yourself. | Free format. For example: v1.0 (semantic version), v2024.09.20 (date version). |
| Language of the data | Y | Choose the main language of the data in the dataset. | The list uses the ISO 639/1 standard. |
| Collection process: start and end date | N | Start and end date of data collection. | YYYY-MM-DD |
| Location(s) covered | N | Indication of the geographical entities (countries, regions, cities) covered within this data package. | English naming convention preferred. We recommend using the preferred spelling from the Getty Thesaurus of Geographic Names. Add only one location per line: use the plus sign to add more values. Maximum length: 255 characters. |
| Period covered: start and end date | N | Indication of the start and end dates of the period covered by your dataset. This is not necessarily the same as the collection dates (for example: historical data may be collected in 2024 but cover 1900). | YYYY-MM-DD |
| Keywords | Y | Keywords/tags that describe your data package and may allow others to more easily find your data package. | Free format. Add only one keyword per line: use the plus sign to add more values. Maximum length: 255 characters |
| Related resource | N | Any resources (articles, data packages, software, etc.) related to your data package, their identifier/link and how they are related to your data package. | It is possible to add multiple related resources: use the plus sign to add more values. |
| Related resource – Relation type | Y | Choose how the related resource is related to the current data package. | The list uses the DataCite relationType vocabulary. |
| Related resource - Title | Y | Title of the related resource. | There is no automatic check whether the title matches with the persistent identifier. Maximum length: 255 characters. |
| Related resource – Persistent identifier | Y | The Identifier and Identifier type for the related resource. For example: type: DOI, identifier: http://doi.org/10.24416/UU01-729A2Y | |
| Retention period (years) | Y | The minimum number of years that the data package should be preserved in the Vault. | Number (integer) Default: 10 years |
| Retention information | N | Text field for remarks about the retention period. Use this field if you deviate from the default retention period. | Free format |
| Embargo end date | N | It is possible to set an embargo on the data package. The metadata will be published already, but the data will only become available after this period. Specify here the date on which the embargo should end. | YYYY-MM-DD |
| Data type | Y | Choose what type of package it concerns. | Datapackage (default), Software, Method, Other document |
| Data classification | Y | Choose how sensitive the data is in terms of confidentiality, integrity and availability. | |
| Name of collection | N | If the data package is part of a larger (conceptual) collection, you can enter the collection name here. | The research group should ensure that all other data packages in the collection are archived with the same collection name. Maximum length: 255 characters. |
| Funding reference | N | The funding source(s) of your data package. | This field can have multiple values: use the plus sign to add more values. |
| Funding reference - Funder | N | Name of the organization funding the research. For example: Dutch Research Council. | Use names as specified in the Research Organization Registry (ROR). Maximum length: 255 characters. |
| Funding reference – Award number | N | The grant number issued by the funding organization. | Free format. |
| Remarks | N | Any remark from the datamanager. For example: for feedback if a data package is rejected. As a researcher, leave this empty. | Free format. |
| Creator | Y | The author(s)/creator(s) of the data package | This field can have multiple values: use the plus sign to add more creators. |
| Creator – Name | Y | The personal/first name (Given Name) and surname/last name (Family Name) of the creator. | Maximum length: 255 characters. |
| Creator – Affiliation | Y | Select the organizational or institutional affiliation of the creator. The Affiliation identifier (ROR) will automatically appear. | This field can have multiple values: use the plus sign to add more affiliations for the creator. |
| Creator – Person identifier | N | The Identifier and Identifier type for the creator, such as AuthorID, ORCID, or ResearcherID. | Each creator can have multiple persistent identifiers: use the plus sign to add more person identifiers. Maximum length: 255 characters. |
| Contributor | N | The person(s) who contributed to this data package. | This field can have multiple values: use the plus sign to add more contributors. |
| Contributor - Name | Y | The personal/first name (Given Name) and surname/last name (Family Name) of the contributor. | Maximum length: 255 characters. |
| Contributor – Contributor type | Y | Choose how the contributor primarily contributed to the data package. | The list uses the DataCite contributorType vocabulary. |
| Contributor - Affiliation | Y | Select the organizational or institutional affiliation of the contributor. The Affiliation identifier (ROR) will automatically appear. | This field can have multiple values: use the plus sign to add more affiliations for the creator. |
| Contributor – Person identifier | N | The Identifier and Identifier type for the contributor, such as AuthorID, ORCID, or ResearcherID. | Each contributor can have multiple persistent identifiers: use the plus sign to add more person identifiers. Maximum length: 255 characters. |
| Data package access | Y | Choose the access level under which the data package should be made available once published. | Open – freely retrievable (publicly available), Restricted – available upon request (only available after specified conditions), Closed (not shared). |
| License | Y | The terms that specify what others are allowed to do with the contents of the data package. | If the Data package access is set to ‘Open – freely retrievable’, you can choose from a number of often-used licenses (recommended: Creative Commons Attribution 4.0). If the data package is restricted or closed, you select Custom and will have to add a License.txt file to your data package. Contact your data manager for help with this. |