Skip to main content

Data Documentation

summary quote:

Simply making your data „open“ does not make your data usable. Usability is the result of other people being able to make sense of your data: the quality of your filenames, your folder structure and your documentation greatly contribute to the level of reusability of your data. It is neither difficult nor time-consuming to maintain your data in a way that invites other researchers to investigate and explore.

Andreas von der Dunk, Technische Universität Dresden, Service Center Research Data

General basics

data for future generations

A clean and comprehensible organisation of data and documents are an important part of good research practice and an important step to realise research data management according to the FAIR data principles.

An essential task is defining in advance the organisation and storage of data, documents, and their metadata, and documenting the relevant measures.

Central requirements are the definition of formal responsibilities, organisational conventions and technical implementations to organise the data and meta information produced. The information collected for this purpose is recorded in the Data Management Plan. Note that a good data organisation also estimates the costs (see costing tool and checklist for example) in the early application phase.

Formal responsibilities and organisational conventions

An essential part of the data documentation is the definition of the actors working with the data and the procedures. These are possible first steps:

  • Documenting the responsibilities of primary researchers and project staff.
  • Creating user roles: Defining detailed rights for users / groups / roles to access data and sensitive information.
  • Describing processes of quality assurance including protected storage, sharing, and accessibility in the short term and on the long run.
  • Data processing: How, where, how fast. Description of input and output data; decisions on naming and structuring conventions for files and folders.

The result should be a set of descriptive documents associated with the files used during the daily work routines, which unambiguously determine:

  • the status (for example original file, temporary work file; Draft, intermediate version, final version),
  • the location (workstation PC, central file server, database),
  • the availability time frame (short, project length, long-term),
  • the format in which they are saved.

small data handout

Technical implementations

Get an overview of the occurring data and document flows. A short description, easy to understand for every user, should be accessible on a low level and explain the main concepts. The data itself needs to have a bulletproof Data Organisation, including Metadata and suitable Data Formats. Find out more about the usual Best Practice at your institute or within your discipline. Plan how documents can be shared between project staff and what needs to be accessed for Data Publication with PIDs.

Data security affects all technical and organisational issues to protect the data from alteration, loss, and destruction. In this context, storage methods, backup procedures, necessary physical resources as well as automated and administrative routines must be planned and put in place. Ask local contacts or external experts about already established technologies for Data Storage and Archiving as well as suitable Repositories.

Synopsis

en_data_orgsteps

Good data documentation does not happen over night - take small steps first. The documentation of research data is primarily an organisational problem that is accompanied and supported by technological measures:

  • Record of status quo:
    • Which organisational processes have been used so far and which technologies support them?
    • What are the regulatory boundaries and technical limits?
    • Which personal roles are involved?
    • Which devices or file formats are or have been used?
    • Are there any special features?
  • Awareness: Who produces (meta)data, and who continues to use data and how?
  • Define internal rules and processes: What are the targets of RDM, and how can they be achieved?
  • Apply and evaluate rules iteratively: Learn, set, follow, repeat. Keep it simple and smart (KISS).
  • Develop suitable technology: Determine specific requirements in the first project phase and adapt them continuously to changing conditions.
  • Establish supporting technology: Evaluate and test software like ELN and Repositories; train your staff.
  • Obtain legal advice, considering local and higher-level policies and procedures: Contact the legal department at your institution or NFDI Querschnittssektion "Ethik und Recht"
  • Make rules and decisions accessible to everyone at an early stage, for example in the form of a short handout.
  • Check the concept regularly and update it if necessary.

Sources and further information