Researchers to build first medical AI model with globally representative data

By News Desk
Published On: September 8, 2025
Last Updated: September 22, 2025

A research consortium of over 100 study groups in more than 65 countries has launched a collaborative effort to develop the first globally representative AI foundation model in medicine, using 100 million eye images.

The Global RETFound initiative is one of the largest medical AI collaborations ever undertaken, producing one of the most geographically and ethnically diverse medical datasets assembled for AI training purposes.

The data will span Africa, the Middle East, North and South America, the breadth of Asia, Oceania, Europe and the Caucasus region.

The consortium will develop an AI model using an unprecedented dataset of over 100 million colour fundus photographs of the retina at the back of the eye, sourced from more than 65 countries.

Dr Yih Chung Tham is assistant professor at the National University of Singapore (NUS) Medicine and one of the project leads.

Tham said: “Current foundational models are trained on data that is geographically and demographically ‘narrow’, which limits their effectiveness and can perpetuate existing health inequalities.

“The Global RETFound Consortium addresses this challenge through innovative approaches that enable broad international participation while maintaining strict privacy protections.”

The initiative builds on the success of RETFound, the first foundation model for retinal and systemic disease detection.

Published in Nature in 2023, RETFound was developed by researchers at Moorfields Eye Hospital and UCL Institute of Ophthalmology in London, using 1.6 million retinal images curated by the INSIGHT Health Data Research Hub at Moorfields.

While RETFound has already demonstrated significant potential for medical AI applications, the global model will expand the training data to encompass every continent except Antarctica.

A key innovation of the project is its flexible, two-pronged data sharing framework, designed to accommodate varying technical capacities and regulatory requirements across participating institutions.

The first approach involves local fine-tuning of generative AI models at individual institutions, with only model weights shared centrally — ensuring no patient data leaves the originating site.

The second pathway enables direct sharing of de-identified data through secure infrastructure for institutions that do not have local GPU resources or technical expertise.

Pearse Keane is professor of artificial medical intelligence at UCL and consultant ophthalmologist at Moorfields Eye Hospital.

Keane said: “This dual approach allows participation from research groups regardless of their resource levels.

“By combining real and synthetic data generation techniques, we can build a diverse, globally representative dataset without compromising security.”

The Global RETFound model will undergo comprehensive evaluation across multiple ophthalmic and systemic diseases, including diabetic retinopathy, glaucoma, age-related macular degeneration and cardiovascular disease.

The model will be released under a Creative Commons license, making it freely available for non-commercial research worldwide.

