31 5 months ago

Flollama is a general-purpose AI chatbot created by Pratyush Kumar, powered by Meta’s LLaMA 3.2. It is designed to be helpful, respectful, and free for everyone. Flollama answers questions about code, science, learning, and more.

tools

Models

View all →

Readme

flollama

flollama is a custom 3B LLaMA 3.2–based chat model designed for fast, intelligent conversations on local devices. Built by Pratyush kumar, it balances performance and personality — ideal for coding, learning, and general chat.


💡 Features

  • Based on Meta’s LLaMA 3.2 architecture (3B)
  • Fast responses, low memory usage
  • Tuned for helpfulness, code generation, and casual conversation
  • Works on most modern GPUs and even some CPUs

🚀 Getting Started

Run flollama locally with Ollama:

ollama pull nvmpratyush/flollama
ollama run nvmpratyush/flollama

📦 Model Info

  • Architecture: LLaMA 3.2
  • Params: 3B
  • Quantization: Supports Q4/Q8/FP16
  • Use case: Chat, coding help, question answering

Built by Pratyush kumar