Why is this important?
With more and more digital data being generated in research, there is a growing need to ensure that data storage spaces are used efficiently. A regular clean up of the project storage will not only facilitate findability, reproducibility and sustainability of the data but will also help to keep data preservation costs more manageable.
The goal of a data storage cleanup is to free up storage space and enhance data organization by removing redundant or unnecessary files, archiving important data and ensuring compliant and well documented data are kept in storage. Indications below offer a general guideline to start organizing your project storage space. Review and appraise the data in your project directory. Identify data files in active use, candidates for long-term retention or deletion. Redundant or temporary files Format or usability issues If yes: consider improving the file quality or adding documentation where possible. (The support team can help you recover data from outdated file formats or help you set-up research documentation.) Where not possible you may want to consider removing these files from your storage. Legal aspects Copyright and usage rights Data protection Note that, even without obligations, data might still be relevant to preserve. Always check the scientific value or reuse potential of the data before considering deletion! Always ensure you are storing personal (or otherwise sensitive data) data in a secure environment and in a secure way. Scientific Value Reuse potential Other considerations Decision-making regarding which data is valuable to keep for the long-term can be challenging. Each discipline will have tailored criteria regarding the value of data files. You might need to develop specific criteria for data selection within your group/division. Archive finalized, published or reproducible datasets. Remove unnecessary , irrelevant or non-compliant data. Perform data triage and cleanup periodically, or (a) at the end of a project phase, (a) after publication submission, (c) before project closure or researcher departure.Quick Guide
Quick Guide Explained
Check the data retention period. Verify also whether your funder or journal have specific requirements for data preservation.
Check agreements with third parties (e.g. data sharing agreements, consortium or commercial collaborations, patent applications) for obligations regarding the data, as these might restrict the (long-term) storage of data.
(data obtained from or with 3rd parties )
Rights to store the data might be limited when you are an user of data mainly managed by others (e.g. data from public archives or agencies ).
Check always what are your rights to the data and whether (long-term) storage is allowed.
Check whether you have a valid legal basis to store the data. Consider data minimization criteria and keep an eye on how long data should be stored.
If storage/archiving of data is not allowed, you should consider removing these files from your storage or discussing further with the copyright holder/interested parties.
VAULT IT!
If you are using Yoda, make sure you transfer the data from the Research area to the Yoda Vaultt once the project is finished.
Destroy/delete safely