49 Downloads Updated 2 months ago
Updated 2 months ago
2 months ago
b438d61083f2 · 2.0GB ·
African Foundation Model (AFM) The African Foundation Model (AFM) is a state-of-the-art language model specifically designed for African contexts, languages, and use cases. Built with the latest transformer optimizations from 2025 research (DeepSeek V3, Llama 4), AFM combines power with efficiency.
Python 3.11+ PyTorch 2.4+ CUDA 13.0 License: Apache 2.0
🌍 Overview The African Foundation Model (AFM) is a state-of-the-art language model specifically designed for African contexts, languages, and use cases. Built with the latest transformer optimizations from 2025 research (DeepSeek V3, Llama 4), AFM combines power with efficiency.
Key Features ✨ Cutting-Edge 2025 Architecture
MoE (Mixture of Experts) - 8x capacity with minimal overhead iRoPE (Interleaved RoPE) - 256K training → 10M+ inference context MLA (Multi-head Latent Attention) - 75% KV cache reduction MTP (Multi-Token Prediction) - 5-10% better reasoning Flash Attention 3 - 3-5x faster training FP8 Training - 2x speedup, 50% less memory RMSNorm - Faster and more stable than LayerNorm SwiGLU Activation - Superior to GELU/ReLU GQA (Grouped Query Attention) - 3x smaller KV cache 🚀 Performance
246M base parameters → 1.2B total capacity (MoE) 30-50M active params per token 256K training context → 10M+ inference context 75% smaller KV cache (MLA) 3-5x faster training vs standard transformers 2-3x faster inference with speculative decoding 🌐 African Focus
✅ 60 Expert Models Configured
Category Count Examples
Languages 6 African, Asian, European, Middle Eastern, Indigenous, Sign
Code 4 Python, Web, Systems, Mobile
Science 4 Physics, Chemistry, Biology, Mathematics
Medical & Legal 4 Diagnosis, Research, Contracts, Compliance
Finance & Business 4 Analysis, Accounting, Strategy, Marketing
Vision & Audio 6 Medical Vision, Autonomous, Industrial, Transcription, Synthesis, Analysis
Advanced Tech 4 Cybersecurity, Logical Reasoning, Cloud Architecture, AI Ethics