Tutorials

Multi-Source Data Aggregation for Public Health Platforms: Reconciling Africa CDC, WHO, Johns Hopkins, and Our World in Data

Four credible sources, four different numbers for the same indicator on the same day. The discipline of multi-source aggregation is not picking the winner. It is publishing the harmonised series with the methodological caveats preserved, so users understand what they are looking at.

Written by

PANEOTECH Team

Published

July 12, 2021

Read time

8 min read

The four-sources problem

A continental health platform building on international data inherits a structural problem. Multiple credible sources publish epidemiological indicators for the same African countries on the same day, and the numbers do not always agree. The Africa Centres for Disease Control and Prevention publish official continental aggregations. The World Health Organisation Regional Office for Africa publishes its own series. Our World in Data harmonises further and applies its own methodology. Johns Hopkins University publishes a global series with its own collection cadence. Each source is methodologically defensible. None is wrong. They simply represent different snapshots of an evolving data flow.

The temptation is to pick a winner. Choose one source as canonical and ignore the others. The temptation is wrong. Each source has strengths the others do not. Africa CDC carries continental institutional weight. WHO Afro carries the global health authority frame. Our World in Data carries methodological transparency and revision discipline. Johns Hopkins carries the global comparative frame. Picking one and discarding the others reduces the platform's value to the analytical communities that need access to all four perspectives.

The harmonisation discipline

The architectural answer is harmonisation rather than selection. The platform ingests all credible sources, normalises country names to standardised codes at the ingestion boundary, aligns indicator definitions through documented mappings, and publishes both the harmonised series and the source-by-source breakdowns. Users see the consolidated continental view by default, and can drop down to the source-level series when they need to understand discrepancies, methodology differences, or revision patterns.
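As a concrete illustration of normalising at the ingestion boundary, the sketch below maps source-specific country names to ISO 3166-1 alpha-3 codes before any record enters the harmonised store. The alias table, column names, and source labels are illustrative assumptions, not the actual COVID Watch Africa schema; a production table would cover every spelling variant the upstream sources actually emit.

```python
# Minimal sketch: normalise source records to a shared schema at ingestion.
# Alias table and schema fields are hypothetical examples.
COUNTRY_ALIASES = {
    "tanzania": "TZA",
    "united republic of tanzania": "TZA",
    "cote d'ivoire": "CIV",
    "côte d'ivoire": "CIV",
    "ivory coast": "CIV",
    "democratic republic of the congo": "COD",
    "congo, dem. rep.": "COD",
}

def to_iso3(raw_name: str) -> str:
    """Map a source-specific country name to an ISO 3166-1 alpha-3 code."""
    key = raw_name.strip().lower()
    if key not in COUNTRY_ALIASES:
        # Fail loudly: an unmapped name is a pipeline defect, not a data point.
        raise ValueError(f"Unmapped country name: {raw_name!r}")
    return COUNTRY_ALIASES[key]

def normalise_record(source: str, record: dict) -> dict:
    """Rewrite one ingested row into the platform's harmonised schema,
    preserving the source attribution for the source-level breakdowns."""
    return {
        "source": source,
        "iso3": to_iso3(record["country"]),
        "date": record["date"],
        "indicator": record["indicator"],
        "value": record["value"],
    }
```

Rejecting unmapped names outright, rather than passing them through, is what keeps the country dimension trustworthy: every downstream join assumes the code space is closed.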

The discipline that makes harmonisation work is methodological transparency. Every harmonised indicator is documented with its source mappings, its conversion logic, its revision policy, and the date of its last upstream update. Discrepancies between sources are surfaced, not concealed. Where the sources disagree by more than a methodological margin, the platform shows both rather than averaging them into a single number that misrepresents the underlying flow. The user sees the data the sources actually publish, with the platform's harmonisation work visibly in service of comparability rather than concealment.
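The "show both rather than average" rule can be sketched as a consolidation step that publishes a single harmonised value only when the sources agree within a methodological tolerance, and otherwise flags the cell and exposes the source-by-source breakdown. The 5% relative-spread threshold and the field names here are illustrative assumptions, not the platform's actual policy.

```python
from statistics import median

def consolidate(values: dict[str, float], tolerance: float = 0.05) -> dict:
    """Consolidate per-source values for one (country, date, indicator) cell.

    If all sources agree within `tolerance` (relative spread around the
    median), publish the median as the harmonised value. Otherwise publish
    no single number: flag the discrepancy and surface the breakdown,
    rather than averaging sources into a figure none of them reported.
    """
    vals = list(values.values())
    mid = median(vals)
    spread = (max(vals) - min(vals)) / mid if mid else 0.0
    if spread <= tolerance:
        return {"harmonised": mid, "sources": values, "flag": None}
    return {
        "harmonised": None,
        "sources": values,
        "flag": f"sources disagree by {spread:.1%}; see source breakdown",
    }
```

The median (rather than the mean) keeps one outlier source from dragging the harmonised value, and the `None` in the disagreement case forces the presentation layer to render the breakdown instead of silently picking a number.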

What we built for COVID Watch Africa

PANEOTECH delivered the multi-source aggregation pipeline behind COVID Watch Africa for POLIWATCH AFRICA. Africa CDC, WHO Afro, Our World in Data, and Johns Hopkins University were ingested as separate streams, harmonised at the country and indicator level, and published as both consolidated continental series and source-attributed breakdowns. The methodology page on the platform documented the harmonisation logic, the source-by-source mappings, the revision policy, and the limitations users should account for in their interpretation.

The result was a platform that supported the analytical work of public health teams, journalists, and researchers across the continent, with the credibility that came from showing the data rather than improvising it. Decision-makers consulting the platform saw the harmonised continental view they needed at speed, with the source-level transparency they needed for the published record.

The institutional lesson

For public health platforms drawing on multiple international sources, the choice is not between picking a winner and presenting confused data. It is between honest harmonisation with methodological transparency and the false simplification that costs the platform its credibility the first time a user spot-checks a number against an upstream source. Aggregate honestly, document fully, and the platform earns the institutional trust that public health information demands.

About the author

PANEOTECH Team

Pan-African Digital Systems Engineering

PANEOTECH designs and delivers secure, scalable, and sustainable digital ecosystems for governments, multilateral institutions, and the private sector across Africa. Field notes, case studies, and analyses from our engagements appear in this publication.
