← Back to Orinode

Nigerian English Speech Recognition

Orinode's Nigerian English ASR achieves 10.9% WER on naturalistic Nigerian-accented English (May 2026, 200 samples) — a substantial improvement over off-the-shelf US/UK-trained ASR systems on the same audio.

What "Nigerian English" actually is

Nigerian English (ISO 639-3 eng-NG) is the local variant of English spoken by an estimated 100+ million Nigerians as a second language. It has stable, systematic phonological and lexical features that diverge from General American or Received Pronunciation:

Why Whisper-large-v3 alone is not enough

OpenAI's Whisper-large-v3 was trained on ~680k hours of multilingual audio, dominated by US and European English. Its English language token (<|en|>) decodes Nigerian English passably but consistently:

Orinode's fine-tuned encoder corrects these systematic errors by training on Nigerian-accented English specifically.

Architecture

Performance (May 2026)

MetricValueN
WER (normalized)10.92%200
WER (raw, case-sensitive)12.03%200

Code-switching with Nigerian English

Real Nigerian English speech routinely embeds Hausa, Yorùbá, Igbo, or Pidgin tokens within an otherwise English utterance — e.g. "My boss said sannu, but he no fit even pronounce the name properly". Aria v1 handles this via a multilingual decoder that can emit any of the six trained languages mid-sentence. See our Naija customer-call code-switch dataset on Hugging Face for representative training data.

Get the model

Open weights on huggingface.co/Orinode. Production API access: [email protected].