Skip to Main Content
Making sure your data remain accessible for the long term is a big challenge, especially since technology changes so quickly. Open file formats have a history of wide adoption and backward compatibility and are less likely to become obsolete. Proprietary software, such as Microsoft Office Suite, may become obsolete.
Choose open formats:
README.txt files are a recommended method to document your project and add context when clarity is needed.
You should use a README.text file at the top level of your research project folder to explain the purpose of the research, the relevant summary, name and contact information for project researchers, general organization of your files, and copyright and licensing information.
Credits: University of Cambridge Data Management Guide
Data documentation explains how data were created or digitised, what data mean, what their content and structure are and any data manipulations that may have taken place.
Document throughout your research process.
File Naming - Best practice is that the names are descriptive – they reflect the content of the file. Be consistent – use the same format for all of the files in a project, including data set files and zip or tar files. Some suggested attributes to include:
Use meaningful names that are consistent, descriptive, and short.
A great way to meaningfully name files is to include the project, instrument, and the year, month, and date in the file name.
Example:
Don't: File12935.xls
Do: Project_Instrument_location_YYYYMMDD.csv
You can improve your documentation by improving your note taking. Good notes are:
Storage ≠ Backup
Storage and backup are separate elements of data management that complement each other.
Storage is for your working files that you access regularly. If you lose storage, you'll lose the current versions of your data.
Backup is the regular process of copyright data. You don't need a backup until you lose your data, but it can save your research.
A good combination of storage and backup supports strong data management.
A good rule to follow when managing your data is the rule of 3. Keep three copies of your data:
Example
1. Laptop
2. External hard drive
3. Cloud storage