Hausa News Corpus | سجل مجموعات البيانات

A curated collection of news articles in Hausa language, covering politics, sports, entertainment, and local news from Nigerian media outlets.

Back to Registry

Hausa News Corpus

real

hausa-news-corpus v2025-01 | Maintained by African NLP Initiative

Access Dataset

A curated collection of news articles in Hausa language, covering politics, sports, entertainment, and local news from Nigerian media outlets.

Examples
50,000
Languages
1
License
CC-BY-4.0
Availability
public download

Core Information

ID
hausa-news-corpus
Version
2025-01
Maintainer
African NLP Initiative
Contact
data@africanlp.org
DOI
10.5281/zenodo.example

Access Information

Availability
public download

Provenance

Source Types
web-scrape
Geography
nigeriaafrica
Collection Period
2020-01-01 - 2024-12-31
Notes
Articles collected from major Nigerian news websites with permission. Content was deduplicated and filtered for quality.

Intended Use

Intended Uses
  • Language model training
  • Text classification research
  • Low-resource language NLP
Out of Scope
  • Real-time news analysis
  • Fact verification without human review