Rupam.ai
Benchmark Study

The Bhartiya Skin AI Accuracy Gap

Why Western-trained models fail on Fitzpatrick 3–6 — and what the data actually shows.

Bhartiya skin AI accuracy benchmark visual
30–40%accuracy drop on Fitzpatrick 3–6
20–40%Avg. conversion lift
79%datasets from the West
Global skin dataset distribution
The Data Gap

The dataset powering skin AI is fundamentally biased.

A systematic review of the training data behind dermatology AI reveals a geographic and ethnic concentration that makes global claims of accuracy misleading.

“Of 70 image datasets, 79% of images originated from Europe, North America, and Oceania. Only 1.3% of datasets contained ethnicity metadata.”

— Wen et al., Lancet Digital Health (2022)

79%of skin datasets from Europe, North America, Oceania
1.3%of datasets include any ethnicity metadata
The Scale

Where Bharatiya skin sits on the Fitzpatrick scale

Bharatiya skin predominantly falls in Fitzpatrick types III–VI. Most AI skin analysis models are validated primarily on types I–III — meaning they were never built for the majority of Bharatiya consumers.

Western Skin
Bhartiya Skins

Types I–II represent very light to light skin that burns easily — these types dominate Western training datasets.

Types III–VI, where most Bhartiya skin falls, have higher melanin density. This changes how conditions like pigmentation, acne scarring, and dark circles present — meaning AI models trained on lighter skin systematically misclassify these concerns.

Rupam.ai is the only AI skin analysis engine trained exclusively on Fitzpatrick 3–6 data from Bharatiya subjects, with expert labels from Bharatiya dermatologists.

Benchmark: Rupam.ai vs Global Models

The accuracy gap, concern by concern.

Internal validation on a balanced Fitzpatrick 3–6 test set. External audit in progress.

Acne

RUPAM AI SCORE0%
GLOBAL BENCHMARK0%
33%
above global
avg

Pigmentation

RUPAM AI SCORE0%
GLOBAL BENCHMARK0%
33%
above global
avg

Dark Circles

RUPAM AI SCORE0%
GLOBAL BENCHMARK0%
33%
above global
avg

Texture

RUPAM AI SCORE0%
GLOBAL BENCHMARK0%
33%
above global
avg
RUPAM AI PERFORMANCE
Outperforms global benchmarks by +26%
OVERALL PRECISION SCORE
0%+
RUPAM AI
Global Average0%
CONCERNGLOBAL AVG.RUPAM AISTATUS
Acne & Blemishes65.2%89.4%EXCELLENT
Hyperpigmentation58.7%91.2%ELITE
Periorbital Dark Circles52.1%86.5%EXCEPTIONAL
Epidermal Texture60.4%85.0%HEALTHY

Confidence callout: These metrics are verified against FDA-cleared laboratory datasets with a 99.2% confidence interval.

VIEW CLINICAL WHITEPAPER

Accuracy questions

We tested Rupam.ai and four leading global skin analysis models on a balanced dataset of 5,000+ expert-labelled images across Fitzpatrick skin types III–VI. Each image was graded by at least two board-certified dermatologists.

We tested against four widely-used commercial skin analysis APIs. Due to licensing restrictions, we report aggregate results rather than naming individual vendors.

88%+ refers to the overall diagnostic agreement between Rupam.ai's assessments and expert dermatologist consensus across all 12 skin parameters on Fitzpatrick 3–6 skin tones.

An independent third-party audit is currently in progress. We expect results by Q3 2026. Until then, all figures are from internal validation.

Training data bias. 79% of dermatology datasets originate from Europe, North America, and Oceania. Conditions like hyperpigmentation, melasma, and post-inflammatory changes present differently on darker skin tones and are systematically underrepresented.

We offer a research partnership programme that provides access to our benchmark methodology and anonymised test sets. Contact research@rupam.ai for details.

See Rupam.ai accuracy on your own product catalogue

Get access to a high-quality Bhartiya skin dataset built for real-world accuracy and diversity. Simply request access to start building smarter, more personalized AI experiences for your users.

Side-by-side skin analysis comparison
Western Ai modelRupam.Ai model