RGS

A guide to Open Data

What data can be shared?

Most research data involving human participants can be shared openly so long as appropriate consent, anonymisation, rights management, and access control have been considered. Data protection laws (including GDPR) govern the processing of personal data, but do not apply to anonymised data. However, be aware that some seemingly anonymised data can still be used to identify participants, for example when several indirect details (such as age, location, and occupation) are combined. Data sharing should be considered from the outset of your research project, and appropriate ethical and legal issues given full consideration.
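As an illustration of how seemingly anonymised data can still single someone out, the sketch below counts how many records share each combination of indirect identifiers and flags any combination held by fewer than k records. It is a minimal check, not a full anonymisation assessment. The function name, the field names, the threshold, and the example records are all hypothetical.

```python
from collections import Counter


def small_groups(records, quasi_identifiers, k=5):
    """Return value combinations shared by fewer than k records.

    A combination held by very few records may identify a participant
    even when names and contact details have been removed.
    """
    counts = Counter(
        tuple(record[field] for field in quasi_identifiers)
        for record in records
    )
    return {combo: n for combo, n in counts.items() if n < k}


# Hypothetical example data: no names, yet one participant is unique.
participants = [
    {"age_band": "20-29", "postcode_area": "ST5", "occupation": "student"},
    {"age_band": "20-29", "postcode_area": "ST5", "occupation": "student"},
    {"age_band": "60-69", "postcode_area": "ST5", "occupation": "surgeon"},
]

risky = small_groups(
    participants, ["age_band", "postcode_area", "occupation"], k=2
)
print(risky)
```

Any combination this reports is a candidate for coarser banding, suppression, or controlled access rather than fully open sharing.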

What follows are some tips centring on data involving human participants, but parts of this section may also be relevant where commercial, security, or legal implications exist.

Making Data Open

Open data should abide by the FAIR principles: it should be Findable (i.e., easily discoverable for both humans and automated computer searches), Accessible (clear instructions provided on access and authentication), Interoperable (compatible with other data types), and Reusable (full descriptions provided of the data, as well as clear usage licences).

Keep the following recommendations in mind: 

  • Provide clear and detailed data descriptions in a "data dictionary" (a list of every variable with its meaning, units, and permitted values). This will help others understand your data, allowing better reuse of it.
  • Try to make the data accessible on their own terms (i.e., independent of the paper reporting the results). This can be achieved by posting the data in a dedicated public repository. 
  • Ensure missing data or data exclusions are fully annotated. 
  • Provide as much unprocessed data as possible so users can "rebuild" information. 
  • Include analysis code and processing scripts where feasible. 
  • Bonus points for not using proprietary formats / file types as the only way to access data. Where possible, convert data in proprietary formats to open or standard formats (e.g., CSV or plain text) before sharing.
  • Consider asking colleagues to review the data submission before sharing to ensure quality control and accessibility. 
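The data dictionary and missing-data recommendations above can be sketched in a few lines. This is one possible approach, assuming the data are already in an open format (CSV) and that missing values are coded as an empty cell or "NA". The function names, column names, descriptions, and example values are hypothetical.

```python
import csv
import io


def build_data_dictionary(rows, descriptions, missing_codes=("", "NA")):
    """Summarise each column: description, missing count, example value."""
    columns = list(rows[0].keys()) if rows else []
    dictionary = []
    for column in columns:
        values = [row[column] for row in rows]
        present = [v for v in values if v not in missing_codes]
        dictionary.append({
            "variable": column,
            # Flag undocumented variables instead of leaving them blank.
            "description": descriptions.get(column, "TODO: describe"),
            "n_missing": len(values) - len(present),
            "example": present[0] if present else "",
        })
    return dictionary


def to_csv(dictionary):
    """Write the data dictionary itself in an open format (CSV text)."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["variable", "description", "n_missing", "example"]
    )
    writer.writeheader()
    writer.writerows(dictionary)
    return buffer.getvalue()


# Hypothetical example data set with two missing values.
raw = "participant_id,rt_ms,condition\nP01,532,control\nP02,NA,treatment\nP03,611,\n"
rows = list(csv.DictReader(io.StringIO(raw)))

descriptions = {
    "participant_id": "Anonymised participant code",
    "rt_ms": "Reaction time in milliseconds",
}

dictionary = build_data_dictionary(rows, descriptions)
print(to_csv(dictionary))
```

Sharing the resulting file alongside the data lets a reader see at a glance what each variable means and how much of it is missing. Variables without a description are marked "TODO: describe" so that gaps are visible before submission.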

Steps to Open Data

Follow these steps to share your FAIR data with others: 

  1. Find a suitable repository and check that it meets your needs. Is a general or specific repository more suitable? What are the file size limits? How long will the data be available? Do you need to manage sharing permissions, or set up an embargo period? Will you get a DOI or other persistent identifier? You can find a data repository at re3data.org. Remember that Keele also has a data repository.
  2. Provide reuse guidance by deciding on an appropriate licence for your data. Be clear about how you would like the data cited. 
  3. Share the persistent URL of your data in any publications or conference talks that use the data.
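To make steps 2 and 3 concrete, the sketch below turns a DOI into its persistent https://doi.org/ link and builds a suggested citation line that could be placed in a README next to the data. The citation layout is only one plausible author-year pattern, so check what your repository or target journal expects. The function names and all example values (authors, title, repository name, and the DOI itself) are hypothetical placeholders.

```python
def doi_url(doi):
    """Turn a bare DOI into its resolvable https://doi.org/ link."""
    return "https://doi.org/" + doi.strip()


def format_citation(authors, year, title, repository, doi):
    """Build a suggested citation line for a shared data set."""
    return f"{authors} ({year}). {title} [Data set]. {repository}. {doi_url(doi)}"


# Hypothetical placeholder values; replace with your own details.
citation = format_citation(
    authors="Smith, J., & Jones, A.",
    year=2024,
    title="Reaction times in a visual search task",
    repository="Example Repository",
    doi="10.1234/example",
)
print(citation)
```

Stating the preferred citation explicitly, together with the licence, removes guesswork for anyone reusing the data.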