Back to browse
10k harmonized time-series datasets of African data

10k harmonized time-series datasets of African data

by kossisoroyce·Jun 17, 2026·2 points·0 comments

AI Analysis

●●SolidNiche GemBig Brain

7,900+ harmonized African datasets with BibTeX provenance and one-line dataset library loading.

Strengths
  • Consistent pipeline across 54 countries with unified missing value markers and snake_case normalization
  • BibTeX citation tracking with esa_source and esa_processed fields for every row
  • Snappy-compressed Parquet with fixed 80/20 splits loadable via Hugging Face datasets library
Weaknesses
  • Fundamentally curation work rather than technical innovation—similar pipelines exist for other regions
  • Long-term sustainability unclear for a 13-person nonprofit maintaining 7,900+ datasets
Category
Target Audience

ML researchers and data scientists working on African development projects

Similar To

Kaggle Datasets · Our World in Data · UCI Machine Learning Repository

Similar Projects

OtherMid

F1 Data – 100 resources across 11 racing series

Curated F1 resource list when FastF1 and Jolpica already cover APIs.

Niche Gem
subinium
302mo ago
OtherPass

LegalTech – A curated list of tools and software

Curated list with zero stars — feels like marketing for the sponsor's product.

pkhodiyar
303mo ago