Anonymisation techniques – either encrypting or removing personally identifiable information - enable the wider use of personal data. However, using anonymised data comes with the risks of re-identifying patients, and thus involves a strict ethical approval process.
Using synthetic health data helps to overcome these privacy and confidentiality issues.
Synthetic health data is generated to represent real patient data, using publicly available open data sources, like NHS England and Public Health England statistics. Using synthetic health data also means:
Synthetic data can also be used in testing and creating different scenarios within a system, for example, illustrating the impact of a policy change, both at present-time and in the future.
Synthetic electronic health care records
A group of researchers in Massachusetts have developed Synthea: an approach, method, and software mechanism for generating synthetic patients and the synthetic electronic health care record. Based on publicly available information, the records can be simulated with pathways of disease progression, care plans, personal workflows (of the citizen and the healthcare professional), and the lifecycle of research projects.
Image: 'PADARSER as the conceptual framework for Synthea' <https://academic.oup.com/jamia/advance-article/doi/10.1093/jamia/ocx079/4098271>
As the above image shows, the result is a source of synthetic electronic health records that are readily available; suited to industrial, innovation, research, and educational uses; and free of legal, privacy, security, and intellectual property restrictions.
By Allie Short
We just sent you an email. Please click the link in the email to confirm your subscription!