ciTIzen-centric DAta pLatform (TIDAL): Sharing Distributed Personal Data in a Privacy-Preserving Manner for Health Research

Tracking #: 3121-4335

This paper is currently under review
Authors: 
Chang Sun
Marc Gallofré Ocaña
Johan van Soest
Michel Dumontier1

Responsible editor: 
Guest Editors SW Meets Health Data Management 2022

Submission type: 
Full Paper
Abstract: 
Developing personal data sharing tools and standards in conformity with data protection regulations is essential to empower citizens to control and share their health data with authorized parties for any purpose they approve. This can be, among others, for primary use in healthcare, or secondary use for research to improve human health and well-being. Ensuring that citizens are able to make fine-grained decisions about how their personal health data can be used and shared will significantly encourage citizens to participate in more health-related research. In this paper, we propose a ciTIzen-centric DatA pLatform (TIDAL) to give individuals ownership of their own data and connect them with researchers to donate their personal data for research while being in control of the whole data life cycle including data access, storage, and analysis. We recognize that most existing technologies focus on one particular aspect such as personal data storage, suffer from executing data analysis over a large number of participants, and face challenges of low data quality and insufficient data interoperability. To address these challenges, the TIDAL platform integrates a set of components for requesting subsets of RDF (Resource Description Framework) data stored in personal data vaults based on SOcial LInked Data (SOLID) technology and analyzing them in a privacy-preserving manner. We demonstrate the feasibility and efficiency of the TIDAL platform by conducting a set of simulation experiments using three different pod providers (Inrupt.net, Solidcommunity.net, Self-hosted Server). On each pod provider, we evaluated the performance of TIDAL by querying and analyzing personal health data from an increasing number of participants and variables. The performance evaluation of TIDAL shows the execution time has a linear correlation between the number of pods on all pod providers. Platforms such as TIDAL can play an important role to connect citizens, researchers, and data organizations to increase the trust placed by citizens in the processing of their personal data.
Full PDF Version: 
Tags: 
Under Review