Policy Brief

Recommendations on the Use of Synthetic Data to Train AI Models

Publication Date
29 Feb 2024
Authors
Philippe de Wilde Payal Arora Fernando Buarque Yik Chan Chin Mamello Thinyane Serge Stinckwich Eleonore Fournier-Tombs Tshilidzi Marwala

Using synthetic or artificially generated data in training Artificial Intelligence (AI) algorithms is a burgeoning practice with significant potential to affect society directly. It can address data scarcity, privacy and bias issues but does raise concerns about data quality, security and ethical implications. While some systems use only synthetic data, most times synthetic data is used together with real-world data to train AI models.

Recommendations in this document are for any system where some synthetic data are used. The use of synthetic data has the potential to enhance existing data to allow for more efficient and inclusive practices and policies. However, we cannot assume synthetic data to be automatically better or even equivalent to data from the physical world. There are many risks to using synthetic data, including cybersecurity risks, bias propagation and increasing model error. This document sets out recommendations for the responsible use of synthetic data in AI training.

StandardPattern-INWEH

Download the policy guideline

Recommendations on the Use of Synthetic Data to Train AI Models
Download

Related content

Dr. Shashi Tharoor, Member of Parliament of India and former Under-Secretary-General of the United Nations; Mr. R. Venkatramani, the Attorney-General for India, and Prof. C. Raj Kumar, Founding Vice-Chancellor of O.P. Jindal Global University.

Conversation Series

International Order Under Challenge

-

News

Roundtable Discussion: How to Bridge the Gap between Policymakers and Academics in Africa and the Global South

Emmanuel Balogun and Thomas Tieku are holding a virtual roundtable hosted by the International Studies Association.

12 Jun 2026

Degree Defense

Public PhD Defense of Gaia Romeo, UNU-CRIS PhD Fellow

The PhD defense by Gaia Romeo takes place in Brussels on 10 June 2026.

-