Using synthetic or artificially generated data in training Artificial Intelligence (AI) algorithms is a burgeoning practice with significant potential to affect society directly. It can address data scarcity, privacy and bias issues but does raise concerns about data quality, security and ethical implications. While some systems use only synthetic data, most times synthetic data is used together with real-world data to train AI models.

Recommendations in this document are for any system where some synthetic data are used. The use of synthetic data has the potential to enhance existing data to allow for more efficient and inclusive practices and policies. However, we cannot assume synthetic data to be automatically better or even equivalent to data from the physical world. There are many risks to using synthetic data, including cybersecurity risks, bias propagation and increasing model error. This document sets out recommendations for the responsible use of synthetic data in AI training.

StandardPattern-INWEH

Download the policy guideline

Recommendations on the Use of Synthetic Data to Train AI Models
Download 

Suggested citation: Philippe de Wilde, Payal Arora, Fernando Buarque, Yik Chan Chin, Mamello Thinyane, Stinckwich Serge, Fournier-Tombs Eleonore and Marwala Tshilidzi. Recommendations on the Use of Synthetic Data to Train AI Models : UNU Centre, UNU-CPR, UNU Macau, 2024.

Related content

Seminar

Citizen Participation in Digital Health Initiatives in the Context of Public Sector Innovation Labs in Brazil

This research aims to enhance citizen engagement in Digital Health initiatives in Brazil, making it more inclusive.

-

Project

Strengthening Public and Private Actors’ Capacities to Operationalise a Laboratory on Innovation in E-Governance

Enhance Cape Verde's digital governance through an e-governance lab, fostering public and private sector collaboration for sustainable public services modernisation.

15 Feb 2024

Project

Portugal EGOV Index | Assessment of Portugal’s performance in the main international benchmarks on digital governance: analysis, recommendations, monitoring, and capacity development

Enhance Portugal's digital governance via strategic analysis and action plans, focusing on international benchmarks, continual improvements, and stakeholder engagement.

22 Feb 2024