When we think of research data, we often only consider numerical data; however, this does not truly reflect the diversity of research data. Research data includes all information (physical and digital) collected, observed, generated, or created for the purpose of analysis to produce and validate original research results. Administrative documentation such as a key file, informed consent and interview guides should also be recognized as important elements of research data.
The term 'Data Asset' refers to an individual piece of data or documentation within a project. This can be a CSV file containing your raw data, the transcripts of an interview, or the results of an EEG scan. These data assets may transform throughout your project. For example, after you have cleaned a data set, you will then have a new data asset. This is considered a new data asset and should be treated as one. As you progress through your project, you will create and collect various data assets.
All of these assets together form a 'Data Package'. This is the final product of your research data. This package should include the data you collected, analysed, and processed, along with the contextual information that describes it (metadata). It should also contain the guidelines and protocols you followed, details about how the data was collected, its potential future used, and any other information needed to understand, verify, or reuse the dataset. Preparing a well-documented data package is especially important when it comes to archiving.


Throughout the lifetime of a project, many data assets will be collected and created. The organisation of these data assets is the key to good data management. You can categorize your data assets into four groups: