Data management plan is compulsory for almost all research projects either funded by the Research Council of Norway or European Commission. It is said that the effective data management plan could save large sum of repeat expenses and opens up possibility of new research collaboration based on secondary processing of data. Overall scope of the present document is data generated through research projects in the waterpower laboratory. The document describes data management plan and permanent repository of data generated in the waterpower laboratory through associated research projects and education. However, main purpose is data repository that allows building of next generation of data as well as upgradation. Main source of the data is research conducted by master students, PhDs and Postdocs in various research projects. Data management plan and the repository are extremely important to continue research in proper direction as well as building of blocks towards final technological output. This system also helps to avoid scratch start with each new project. The new students and researchers will have possibility to look into previous data sets and analyze the data and then continue new research without repeating, which is already carried out previously.


This document provides abstract information of research data management plan, storage type and scope. The majority data corresponds to the category of Engineering and physical science that means no personal/health data are included. During a project period, the data of multiple layers can be obtained, i.e., row data, main (processed) data and published data (research paper, dissertation, thesis, book, etc.). The published data are deposited at institutional repository of NTNU including Cristin, other data can be stored locally which allows continuous development of tools and techniques in the laboratory. The data are sorted into different categories such as, structured data, descriptive data, primary data (main data), secondary data, etc. Research publications and the data presented in the publications provide complete open access and the metadata search options to the international research community. However, permanent repository in the waterpower laboratory aims for storage of large data from the experiments, simulations, programs and numerical codes which are continuously developed. All these data shall be structured systematically and managed locally in the waterpower laboratory. The data are intended to store continuously as project work is in progress, both row and main data, and the lifetime of data is dependent on the project need as well as possibility of future development.



The data management plan

Background

A DMP is a document describing how data in a research project will be managed, from project start up, throughout the research process and in the time after completion of the project. A DMP describes what type of data will be collected. The plan states how the data will be stored, described with metadata, analyzed and, if possible, shared. The plan also addresses issues related to rights, privacy and costs. A DMP is a tool for planning and raising awareness and should be a "living document" which is updated during the research project depending on need. A good plan for how to organize and describe the data can make the project work more effective by making it easier to understand and work with the data, especially in larger research projects, such as FME centres. Good documentation and data management contributes to increased data quality, as well as verifiability and reuse.

The Research Council of Norway has a policy for Open Access to Research Data, where the standard is that the project they fund should have a DMP. Similarly, European Commission has certain requirements related to data management and open access for various project schemes. It is extremely important to have DMP in place. The waterpower laboratory aims to manage research data according to international standards, such as the FAIR principles, CARE principles as well as NTNU Open Data concept and thereby support the development of a global research community in which research data is widely shared. The outstanding example of open data is the series of Francis-99 open data, which was initiated in 2010, in fact much before the existing open data policies. Under the Francis-99, precious experimental data collected in the waterpower laboratory are openly available to the research community.

A data management plan:

Abstract detail from the national strategy on access to and sharing of research data

Research data should be shared and reused more widely

Better access to research data can boost innovation and value creation by enabling actors outside the research community to find new areas of application. Another benefit that is important in its own right is that greater transparency and insight into research can help to increase confidence in researchers and research findings. In order to make research data more available and increase reuse, researchers need the competence and tools to manage data in a sound, secure manner throughout all steps of the research process. They must have the infrastructure needed for collecting, analyzing, archiving and sharing data, as well as access to clear information about this infrastructure. The infrastructure in place must lay a foundation for cooperation and knowledge-sharing that extends across countries and sectors. It should be easy for international researchers to find Norwegian data sets.

This strategy does not cover research data from privately funded research and development activity. According to the basic principles, in cases where private actors are granted public funding for research or cooperate with public research institutes, universities, university colleges or hospitals on research and innovation projects that are publicly funded, it is possible to restrict access to data to protect trade secrets or when this is necessary in connection with commercialization of results. It will be up to private actors to assess this from case to case.

What does “publicly funded research data” mean?


label Research data types and collection.

Basic principles

  1. Research data must be as open as possible, as closed as necessary.
  2. Research data should be managed and curated to take full advantage of their potential.
  3. Decisions concerning archiving and management of ­research data must be taken within the research community.

Government expectations and measures

Abstract detail from the research council of Norway

Policy for research data management is driven from the national strategy. Projects that receive funding from the research council are to assess whether the need to draw up a data management plan. As a general rule, R&D-performing institutions themselves are responsible for determining which archiving solution to use. If the project owner decides that a data management plan is necessary, it should develop such a plan in line with the institution's own guidelines. This plan should be submitted in connection with the revision of the application. Whenever possible, data management plans should be available to the public and be openly published by the research institution so that the academic environment may be able to follow the practice of its colleagues. Under certain circumstances, the research council is entitled to stipulate storage of data and/or metadata in specific national or international archives. For example, in connection with certain relevant projects in the fields of social science, humanities, medicine and health, and environmental and development research, the research council asks to archive data at the Norwegian Centre for Research Data (NSD).

Recommendation and guideline

The data management plan is a living document that follows the research project and specifies the following: (1) the kind of data that will be generated (2) how the data will be described (3) where the data will be stored (4) whether and how the data can be shared. The purpose is to plan how to safeguard the research data, not just during the project period, but also for future reuse of the data. A data management plan is an effective means of identifying costs associated with data management and storage and can also help you to plan how to cover these costs. Data management plans are to be made public and openly accessible. This will promote greater openness and enable scientific groups to follow peer practice. The research data that are stored must be of quality that makes them possible to find and reuse. The Research Council recommends that you follow the international FAIR principles. In keeping with the FAIR principles, research data must be accessible, findable and reusable. The concept interoperable entails that both data and metadata must be machine-readable and that a consistent terminology is used.

Current open access requirements

At present, the Research Council stipulates the following requirements:

The Research Council Stimulation Scheme for Open Access Publication (STIM-OA) provides support to institutions for activities to make publications openly accessible.

How the STIM-OA scheme works:

Funding under the scheme will be available through 2022.

Abstract detail from the EU, H2020

Projects funded by H2020 are required to develop a data management plan within 6 months of received funding. In the plan you will be asked to specify: (1) what data will be open (2) what data the project will generate/use (3) how the data will be utilised or made available for verification and reuse (4) how it is organised and stored.

Core requirement for data management plans (borrowed from Science Europe):

  1. Data description and collection or re-use of existing data
    1. How will new data be collected or produced and/or how will existing data be re-used?
    2. What data (for example the kinds, formats, and volumes) will be collected or produced?
  2. Documentation and data quality
    1. What metadata and documentation (for example the methodology of data collection and way of organising data) will accompany data?
  3. Storage and backup during the research process
    1. How will data and metadata be stored and backed up during the research process?
    2. How will data security and protection of sensitive data be taken care of during the research?
  4. Legal and ethical requirements, codes of conduct
    1. If personal data are processed, how will compliance with legislation on personal data and on data security be ensured?
    2. How will other legal issues, such as intellectual property rights and ownership, be managed? What legislation is applicable?
    3. How will possible ethical issues be taken into account, and codes of conduct followed?
  5. Data sharing and long-term preservation
    1. How and when will data be shared? Are there possible restrictions to data sharing or embargo reasons?
    2. How will data for preservation be selected, and where will data be preserved long-term (for example a data repository or archive)?
    3. What methods or software tools will be needed to access and use the data?
    4. How will the application of a unique and persistent identifier (such as a Digital Object Identifier (DOI)) to each data set be ensured?
  6. Data management responsibilities and resources
    1. Who (for example role, position, and institution) will be responsible for data management (i.e. the data steward)?

What resources (for example financial and time) will be dedicated to data management and ensuring that data will be FAIR (Findable, Accessible, Interoperable, Re-usable)?

NTNU Open Data: NTNU’s policy for open research data 2018-2025

Making research data accessible and reusable contributes to increased reproducibility and transparency in science and may prevent the same data from being collected several times. Open data also create the basis for new and innovative digital services that have the potential to be of great societal value. It is reasonable to expect publicly funded research to be useful for society. Thus, there is considerable national and international awareness regarding open research data.

Expenses related to basic management, storage and publishing of research data should typically be covered by the individual research projects and will usually be considered a legitimate cost in applications for funding. Open access to research data should normally entail free use externally. Covering actual costs related to special preparation of data sets and similar should still be possible.

NTNU’s policy for open research data is based on the following principles:

NTNU, in addition to the institutional archive, recommends national resources such as easy.DMP (www.easydmp.sigma2.no), which is also part of UNINETT Sigma2 and allows storage of other data and are available openly. Currently a Digital Object Identifier (DOI) is not issued. EasyDMP follows (1) Science Europe (2) Horizon 2020 and (3) Institutional Templates. EasyDMP is a web form consisting of a series of questions grouped into a number of sections.  The questionnaire is dynamic, meaning that the type and the amount of questions you will be presented at every stage depends on the answer you have given at the previous stage.

Ownership of research data

As for scientific publications, the main principle for research data is that the institution retains the intellectual property rights and the author/researcher the copyright. Normally NTNU does not own research data produced by students or guest researchers, unless this has been agreed on, for example through externally funded projects.

As a rule, NTNU owns all research data collected and processed by personnel employed or contracted at the institution. This gives NTNU the right to openly publish the material, but does not preclude

Issues regarding rights and ownership must be clarified and secured in an agreement in cases where data are used for commercial and patenting purposes. Agreements must also be made in situations where NTNU’s own researchers utilize other’s data.

Data management plan for the waterpower laboratory




Reference