Identifiant du topic: HORIZON-HLTH-2023-TOOL-05-09

Developing a Data Quality and Utility Label for the European Health Data Space

Type d'action : HORIZON Coordination and Support Actions
Nombre d'étapes : Single stage
Date d'ouverture : 12 janvier 2023
Date de clôture : 13 avril 2023 17:00
Budget : €4 000 000
Call : Tools and technologies for a healthy society (Single stage - 2023)
Call Identifier : HORIZON-HLTH-2023-TOOL-05
Description :

ExpectedOutcome:

This topic aims at supporting activities that are enabling or contributing to one or several expected impacts of destination 5 “Unlocking the full potential of new tools, technologies and digital solutions for a healthy society”. To that end, proposals under this topic should aim to deliver results that are directed towards and contributing to all of the following expected outcomes:

  • Data Users (researchers, innovators, regulators, policymakers, clinicians) are able to identify the most relevant datasets that meet their specific needs through a label describing accurately and in a standard way the quality and utility dimensions of the datasets, as proposed in the legal provisions of the European Health Data Space (EHDS).
  • Data holders have clear specifications for dataset quality and utility labelling to comply with the requirements proposed in the EHDS legal provisions. In addition to that, data holders have access to a maturity model with the requirements a dataset needs to fulfil to achieve higher levels of data quality and utility.
  • European and National public funders ensure that the datasets, for which they provided funding for the creation and curation of, are more widely available, furthering their reuse for secondary uses as proposed in the EHDS legal provisions (research, innovation, regulatory work, policymaking, personalised medicine).
  • The European Commission has access to a set of specifications for the data quality and utility label supporting the implementation of the EHDS legal provisions.

Scope:

A vast quantity of health datasets exist across Europe, from multiple sources (individual care, medical registries, social, environmental behavioural, wellbeing, clinical trials, research, administrative, etc.), and of varying quality. This represents a tremendous opportunity for the reuse of this data for purposes other than for the one for which they were originally collected and spur the development of better prevention strategies, diagnoses, treatments and care plans.

The European Health Data Space (EHDS) will provide a common EU framework for secondary use of health data such as research, innovation, regulatory purposes, policymaking and personalised medicine. It will enable data users to have access to large amounts of health data through health data access bodies empowered with the EHDS legal provisions to overcome existing limitations regarding the processing of health data for secondary uses.

To support data users in the discovery and selection of datasets for their purposes, there is a growing need to develop a data quality and utility framework to articulate the characteristics and the potential usefulness of datasets. This framework will also support data holders in identifying and addressing areas of improvement which can, in turn, allow for wider and better use of these datasets.

Several initiatives have developed or are developing guidelines and recommendations for health data quality, however, these typically focus on specific data types (i.e. 1+ Million Genome Initiative[1]) or areas of applications (i.e. European Medicines Agency – EMA and Heads of Medicines Agencies’ Big Data Steering Group activities to support medicines regulation[2]). Similarly, previous studies and initiatives have addressed specific dimensions of ‘data quality’ for health data but none are offering a framework suitable for the breadth of data types and encompassing the quality and utility elements proposed in the EHDS legal provisions. The proposed framework should take into account the various needs of data users whilst at the same time avoid becoming an excessive burden on data holders which will need to produce the data quality and utility label.

Proposals should address all of the following activities:

  • Perform a mapping of existing data quality and utility principles/initiatives/frameworks (i.e. EMA/HMA Big Data Stakeholders Group Data quality efforts, TEHDAS Data Quality Working Group[3], EOSC-LIFE[4] Health Data Research UK’s data quality and utility framework[5], and relevant data principles, resources and tools (FAIR, FAIR Cookbook, etc.)[6];
  • Conduct various stakeholder consultations, integrating all relevant data users and data holders of health data, EHDS Health Data Access Bodies (HDABs) and other relevant actors to validate data user needs and adequately take into account relevant initiatives when developing the proposed framework;
  • Develop a framework (set of technical specifications) for the data quality and utility label that supports the implementation of the EHDS legal provisions and the roll out of the label by the data holders and EHDS Health Data Access Bodies;
  • Pilot and evaluate the use of the proposed framework (as a label and as a maturity model) on a datasets sample representing the wide-ranging data types (such as electronic health records, genomics datasets, medical registries, administrative data, etc.) and taking into account the needs of all data users identified.
  • Develop recommendations for the successful implementation and adoption of the data quality and utility label and maturity model across European Member States considering the maturity levels regarding secondary of health data.

The consortium should be composed of representatives from data users, data holders, health data access bodies, and other relevant stakeholders to the scope of secondary use of health data, adequately covering the diversity of heath data types and users’ needs across European Member States.

[1]https://b1mg-project.eu/work-packages/wp3

[2]https://www.ema.europa.eu/en/about-us/how-we-work/big-data

[3]https://tehdas.eu/packages/

[4]https://www.eosc-life.eu/

[5]Development of a data utility framework to support effective health data curation: https://informatics.bmj.com/content/28/1/e100303?utm_source=twitter&utm_medium=social&utm_term=hootsuite&utm_content=sme&utm_campaign=usage

[6]See definition of FAIR data in the introduction to this work programme part.