The African Foundation Model (AFM) is a state-of-the-art language model specifically designed for African contexts, languages, and use cases. Built with the latest transformer optimizations from 2025 research, AFM combines power with efficiency.

tools

Models

46 models

afm:coder

15GB · 32K context window · Text · 2 months ago

afm:expert_01_multilingual_african

2.0GB · 128K context window · Text · 2 months ago

afm:expert_02_python_code

2.0GB · 128K context window · Text · 2 months ago

afm:expert_03_javascript_code

2.0GB · 128K context window · Text · 2 months ago

afm:expert_04_devops

2.0GB · 128K context window · Text · 2 months ago

afm:expert_05_database

2.0GB · 128K context window · Text · 2 months ago

afm:expert_06_machine_learning

2.0GB · 128K context window · Text · 2 months ago

afm:expert_07_computer_vision

2.0GB · 128K context window · Text · 2 months ago

afm:expert_08_nlp

2.0GB · 128K context window · Text · 2 months ago

afm:expert_09_cybersecurity

2.0GB · 128K context window · Text · 2 months ago

afm:expert_10_blockchain

2.0GB · 128K context window · Text · 2 months ago

afm:expert_11_finance

2.0GB · 128K context window · Text · 2 months ago

afm:expert_12_legal

2.0GB · 128K context window · Text · 2 months ago

afm:expert_13_medical

2.0GB · 128K context window · Text · 2 months ago

afm:expert_14_scientific_research

2.0GB · 128K context window · Text · 2 months ago

afm:expert_15_mathematics

2.0GB · 128K context window · Text · 2 months ago

afm:expert_16_physics

2.0GB · 128K context window · Text · 2 months ago

afm:expert_17_chemistry

2.0GB · 128K context window · Text · 2 months ago

afm:expert_18_biology

2.0GB · 128K context window · Text · 2 months ago

afm:expert_19_business_strategy

2.0GB · 128K context window · Text · 2 months ago

afm:expert_20_marketing

2.0GB · 128K context window · Text · 2 months ago

afm:expert_21_ui_ux

2.0GB · 128K context window · Text · 2 months ago

afm:expert_22_system_architecture

2.0GB · 128K context window · Text · 2 months ago

afm:expert_23_api_design

2.0GB · 128K context window · Text · 2 months ago

afm:expert_24_testing_qa

2.0GB · 128K context window · Text · 2 months ago

afm:expert_25_performance

2.0GB · 128K context window · Text · 2 months ago

afm:expert_26_mobile_dev

2.0GB · 128K context window · Text · 2 months ago

afm:expert_27_game_dev

2.0GB · 128K context window · Text · 2 months ago

afm:expert_28_iot

2.0GB · 128K context window · Text · 2 months ago

afm:expert_29_quantum_computing

2.0GB · 128K context window · Text · 2 months ago

afm:expert_30_data_science

2.0GB · 128K context window · Text · 2 months ago

afm:expert_31_cloud_architecture

2.0GB · 128K context window · Text · 2 months ago

afm:expert_32_ai_ethics

2.0GB · 128K context window · Text · 2 months ago

afm:vl

15GB · 32K context window · Text · 2 months ago

afm:q4_k_m

127MB · 256 context window · Text · 2 months ago

afm:q5_k_m

155MB · 256 context window · Text · 2 months ago

afm:q6_k

186MB · 256 context window · Text · 2 months ago

afm:q8_0

240MB · 256 context window · Text · 2 months ago

afm:f16

452MB · 256 context window · Text · 2 months ago

afm:afm-q4-k-m

2.0GB · 128K context window · Text · 2 months ago

afm:afm-q5-k-m

2.0GB · 128K context window · Text · 2 months ago

afm:afm-q6-k

2.0GB · 128K context window · Text · 2 months ago

afm:afm-q8-0

2.0GB · 128K context window · Text · 2 months ago

afm:afm-f16

2.0GB · 128K context window · Text · 2 months ago

afm:python-coder

15GB · 32K context window · Text · 2 months ago

afm:reasoning-v2

15GB · 32K context window · Text · 2 months ago
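The five small quantized variants in the listing above (`afm:q4_k_m` through `afm:f16`) trade file size for precision. As a rough sketch, the snippet below picks the largest of them that fits a given size budget. The sizes are copied from the listing; `pick_variant` is a hypothetical helper, not part of any AFM tooling, and using download size as a proxy for run-time memory is an assumption.

```python
from typing import Optional

# Sizes in MB, copied from the model listing above.
VARIANT_SIZES_MB = {
    "afm:q4_k_m": 127,
    "afm:q5_k_m": 155,
    "afm:q6_k": 186,
    "afm:q8_0": 240,
    "afm:f16": 452,
}


def pick_variant(budget_mb: float) -> Optional[str]:
    """Return the largest listed variant whose size fits the budget.

    Hypothetical helper: download size is only a rough proxy for the
    memory actually needed at run time.
    """
    fitting = {tag: mb for tag, mb in VARIANT_SIZES_MB.items() if mb <= budget_mb}
    if not fitting:
        return None  # nothing in the listing is small enough
    return max(fitting, key=fitting.get)


if __name__ == "__main__":
    for budget in (100, 200, 500):
        print(budget, "MB ->", pick_variant(budget))
```

With a 200 MB budget this selects `afm:q6_k`, the largest variant at or under that size.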

Readme

African Foundation Model (AFM)

Python 3.11+ · PyTorch 2.4+ · CUDA 13.0 · License: Apache 2.0

🌍 Overview

The African Foundation Model (AFM) is a state-of-the-art language model specifically designed for African contexts, languages, and use cases. Built with the latest transformer optimizations from 2025 research (DeepSeek V3, Llama 4), AFM combines power with efficiency.

Key Features

✨ Cutting-Edge 2025 Architecture

- MoE (Mixture of Experts): 8x capacity with minimal overhead
- iRoPE (Interleaved RoPE): 256K training → 10M+ inference context
- MLA (Multi-head Latent Attention): 75% KV cache reduction
- MTP (Multi-Token Prediction): 5-10% better reasoning
- Flash Attention 3: 3-5x faster training
- FP8 Training: 2x speedup, 50% less memory
- RMSNorm: faster and more stable than LayerNorm
- SwiGLU Activation: superior to GELU/ReLU
- GQA (Grouped Query Attention): 3x smaller KV cache

🚀 Performance
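The KV-cache figures quoted in this README for GQA (3x smaller) and MLA (75% reduction) can be illustrated with the usual cache-size formula: 2 (keys and values) × layers × KV heads × head dimension × sequence length × bytes per element. The layer count, head counts, head dimension, and latent width below are hypothetical values chosen only to make the arithmetic visible; they are not AFM's published configuration. The MLA line is a simplification that ignores the small decoupled RoPE key that DeepSeek-style MLA also caches.

```python
def kv_cache_bytes(seq_len, n_layers, n_kv_heads, head_dim, bytes_per_elem=2):
    """Bytes needed to cache keys and values for one sequence."""
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_elem


def latent_cache_bytes(seq_len, n_layers, latent_dim, bytes_per_elem=2):
    """Bytes needed when one compressed latent vector is cached per token per layer."""
    return n_layers * latent_dim * seq_len * bytes_per_elem


# Hypothetical configuration (assumed for illustration, not AFM's real numbers).
N_LAYERS, N_HEADS, HEAD_DIM, SEQ_LEN = 12, 12, 64, 32_768
LATENT_DIM = 384  # one quarter of the 2 * 12 * 64 = 1536 values MHA stores per token

mha = kv_cache_bytes(SEQ_LEN, N_LAYERS, N_HEADS, HEAD_DIM)       # one KV head per query head
gqa = kv_cache_bytes(SEQ_LEN, N_LAYERS, N_HEADS // 3, HEAD_DIM)  # 3 query heads share a KV head
mla = latent_cache_bytes(SEQ_LEN, N_LAYERS, LATENT_DIM)

if __name__ == "__main__":
    for name, size in (("MHA", mha), ("GQA", gqa), ("MLA", mla)):
        print(f"{name}: {size / 2**20:.0f} MiB ({size / mha:.0%} of MHA)")
```

Under these assumed numbers, sharing each KV head across three query heads gives the 3x figure, and a latent one quarter the width of the full per-token keys and values gives the 75% figure.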

- 246M base parameters → 1.2B total capacity (MoE)
- 30-50M active params per token
- 256K training context → 10M+ inference context
- 75% smaller KV cache (MLA)
- 3-5x faster training vs standard transformers
- 2-3x faster inference with speculative decoding

🌐 African Focus

✅ 60 Expert Models Configured

| Category | Count | Examples |
|---|---|---|
| Languages | 6 | African, Asian, European, Middle Eastern, Indigenous, Sign |
| Code | 4 | Python, Web, Systems, Mobile |
| Science | 4 | Physics, Chemistry, Biology, Mathematics |
| Medical & Legal | 4 | Diagnosis, Research, Contracts, Compliance |
| Finance & Business | 4 | Analysis, Accounting, Strategy, Marketing |
| Vision & Audio | 6 | Medical Vision, Autonomous, Industrial, Transcription, Synthesis, Analysis |
| Advanced Tech | 4 | Cybersecurity, Logical Reasoning, Cloud Architecture, AI Ethics |
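The gap between total capacity (1.2B) and active parameters per token (30-50M) quoted under Performance comes from sparse routing: a router scores every expert, but only the top-k experts run for a given token. The sketch below shows generic top-k softmax gating in plain Python. It is not AFM's router; the expert count, the value of k, and the toy experts are assumptions made for illustration.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_route(router_logits, k):
    """Return [(expert_index, weight)] for the k highest-scoring experts.

    Weights are a softmax over the selected logits only, so they sum to 1.
    """
    ranked = sorted(range(len(router_logits)),
                    key=lambda i: router_logits[i], reverse=True)
    chosen = ranked[:k]
    weights = softmax([router_logits[i] for i in chosen])
    return list(zip(chosen, weights))


def moe_forward(x, experts, router_logits, k=2):
    """Weighted sum of the k selected experts' outputs; the others never run."""
    return sum(w * experts[i](x) for i, w in top_k_route(router_logits, k))


if __name__ == "__main__":
    # Eight toy "experts": expert i simply scales its input by (i + 1).
    experts = [lambda x, s=i + 1: s * x for i in range(8)]
    logits = [0.1, 2.0, -1.0, 0.5, 2.0, 0.0, -0.3, 1.2]
    print(top_k_route(logits, k=2))
    print(moe_forward(1.0, experts, logits, k=2))
```

Only two of the eight toy experts are evaluated per call, which is the sense in which an MoE model's active parameter count is much smaller than its total.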