Brief

The Use of Synthetic Data to Train AI Models: Opportunities and Risks for Sustainable Development

This technology brief explores the potential of synthetic data to accelerate the attainment of the SDGs through AI in the Global South.

Using synthetic or artificially generated data in training AI algorithms is a burgeoning practice with significant potential. It can address data scarcity, privacy, and bias issues and raise concerns about data quality, security, and ethical implications. This issue is heightened in the Global South, where data scarcity is much more severe than in the Global North. Synthetic data, therefore, addresses the problem of missing data, leading, in the best case, to better representation of populations in datasets and more equitable outcomes. However, we cannot consider synthetic data to be better or even equivalent to actual data from the physical world. In fact, there are many risks to using synthetic data, including cybersecurity risks, bias propagation, and simply an increase in model error. This technology brief proposes recommendations for the responsible use of synthetic data in AI training and the associated guidelines to regulate the use of synthetic data.

Yellow Pattern BG

Download the technology brief

The Use of Synthetic Data to Train AI Models: Opportunities and Risks for Sustainable Development (available in Chinese, English and Japanese)
Download 

Suggested citation: Marwala Tshilidzi, Fournier-Tombs Eleonore and Stinckwich Serge. The Use of Synthetic Data to Train AI Models: Opportunities and Risks for Sustainable Development : UNU Centre, UNU-CPR, UNU Macau, 2023.

Related content

Seminar

Citizen Participation in Digital Health Initiatives in the Context of Public Sector Innovation Labs in Brazil

This research aims to enhance citizen engagement in Digital Health initiatives in Brazil, making it more inclusive.

-

Project

Strengthening Public and Private Actors’ Capacities to Operationalise a Laboratory on Innovation in E-Governance

Enhance Cape Verde's digital governance through an e-governance lab, fostering public and private sector collaboration for sustainable public services modernisation.

15 Feb 2024

Project

Portugal EGOV Index | Assessment of Portugal’s performance in the main international benchmarks on digital governance: analysis, recommendations, monitoring, and capacity development

Enhance Portugal's digital governance via strategic analysis and action plans, focusing on international benchmarks, continual improvements, and stakeholder engagement.

22 Feb 2024