Data Science Unit

Transforming diverse datasets into meaningful insights that support research, outbreak response, and public health planning.

What the Unit Does

The Data Science Unit acts as the analytical backbone of CERI. By integrating data from genomics, lab systems, field studies, and health partners, the team delivers insights that power rapid and accurate responses to emerging health threats.

The team's work spans data engineering, statistical analysis, algorithm development, and the creation of dashboards and visualizations that communicate complex findings clearly, so that science translates directly into public health action.

Data Engineering

Building robust data pipelines that ingest, clean, and integrate information from genomics, lab systems, field studies, and health partners.

Statistical Analysis

Applying rigorous statistical frameworks to identify patterns and correlations in complex epidemiological and genomic datasets.

Machine Learning

Applying machine learning to detect trends, model disease dynamics, and guide evidence-based decision-making during outbreaks.

Genomics Integration

Linking genomic sequence data with epidemiological, clinical, and environmental sources for comprehensive surveillance.

Dashboard & Visualization

Creating interactive dashboards and visualizations that communicate complex findings with clarity to diverse stakeholders.

Algorithm Development

Designing novel computational algorithms tailored to the unique challenges of epidemic response and public health planning.

Data Science Team

Prof. Houriiyah Tegally

Associate Professor in Bioinformatics and Head: Data Science Unit

Haingo Andry

PhD Candidate: Applied Mathematics

Graeme Dor

PhD Candidate: Bioinformatics

Nikita Sitharam

PhD Candidate: Bioinformatics

Carlin Foka Takamgno

PhD Candidate: Applied Mathematics

Publications

2026
Human MERS-CoV cases are falling but pose an ongoing pandemic threat

Subissi L, Otieno J, Shah A, Abu-Raddad L, Agrawal A, Mehairi A,…

Nature Health

2026
Addressing pandemic-wide systematic errors in the SARS-CoV-2 phylogeny

Hunt M, Hinrichs A, Anderson D, Karim L, Dearlove B, Knaggs J,…

Nature Methods 23:653-662

2026
Chikungunya virus

Ramphal Y, de Oliveira T

Nature Ecology & Evolution 10:610

2026
Re-emergence of DENV-3 in Paraguay After Two Decades: A Genomic and Epidemiological Investigation

Cantero C, Vazquez C, Gonzalez S, Rojas A, Fleitas F, Barrios J,…

Emerging Microbes & Infections

2025
Tracing the spatial origins and spread of SARS-CoV-2 Omicron lineages in South Africa

Dor G, Wilkinson E, Martin DP, Moir M, Tshiabuila D, Kekana D,…

Nature Communications 16(1):4937. doi: 10.1038/s41467-025-60081-0

2025
Genomic Surveillance of Climate-Amplified Cholera Outbreak, Malawi, 2022–2023

Chabuka L, Choga W, Mavian C, Moir M, Morgenstern C, Tegally H,…

Emerging Infectious Diseases 31(6). doi: 10.3201/eid3106.240930.

2025
Artificial intelligence for modelling infectious disease epidemics

Kraemer M, Tsui J, Chang S, Lytras S, Khurana M, Vanderslott S,…

Nature

2025
Spatiotemporal disease suitability prediction for Oropouche virus and the role of vectors across the Americas

Poongavanan J, Dunaiski M, Dor G, Kraemer M, Giovanetti M, Lim A,…

medRxiv doi: 10.1101/2025.02.28.25323068.

2025
Characterization of SARS-CoV-2 intrahost genetic evolution in vaccinated and non-vaccinated patients from the Kenyan population

Lugano D, Mwangi K, Mware B, Kibet G, Osiany S, Kiritu E,…

medRxiv doi: 10.1101/2025.03.03.25323296.

2025
Unveiling novel features and phylogenomic assessment of indigenous Priestia megaterium AB-S79 using comparative genomics

Adeniji A, Chukwuneme C, Conceição E, Ayangbenro A, Wilkinson E, Maasdorp E,…

Microbiology Spectrum doi: 10.1128/spectrum.01466-24.

News