1,120 Downloads Updated 4 months ago
This is a quantised version of canopylabs/orpheus-3b-0.1-ft.
Orpheus is a high-performance Text-to-Speech model fine-tuned for natural, emotional speech synthesis. This repository hosts the 8-bit quantised version of the 3B parameter model, optimised for efficiency while maintaining high-quality output.
Orpheus-3b-FT-Q8_0 is a 3 billion parameter Text-to-Speech model that converts text inputs into natural-sounding speech with support for multiple voices and emotional expressions. The model has been quantised to 8-bit (Q8_0) format for efficient inference, making it accessible on consumer hardware.
Key features: - 8 distinct voice options with different characteristics - Support for emotion tags like laughter, sighs, etc. - Optimised for CUDA acceleration on RTX GPUs - Produces high-quality 24kHz mono audio - Fine-tuned for conversational naturalness
This model is designed to be used with an LLM inference server that connects to the Orpheus-FastAPI frontend, which provides both a web UI and OpenAI-compatible API endpoints.
This quantised model can be loaded into any of these LLM inference servers:
Visit https://github.com/legraphista/LocalOrpheusTTS for a quick start project
Listen to the model in action with different voices and emotions:
https://lex-au.github.io/Orpheus-FastAPI/DefaultTest.mp3”
https://lex-au.github.io/Orpheus-FastAPI/LeahHappy.mp3
https://lex-au.github.io/Orpheus-FastAPI/TaraSad.mp3
https://lex-au.github.io/Orpheus-FastAPI/ZacContemplative.mp3
The model supports 8 different voices:
- tara
: Female, conversational, clear
- leah
: Female, warm, gentle
- jess
: Female, energetic, youthful
- leo
: Male, authoritative, deep
- dan
: Male, friendly, casual
- mia
: Female, professional, articulate
- zac
: Male, enthusiastic, dynamic
- zoe
: Female, calm, soothing
You can add expressiveness to speech by inserting tags:
- <laugh>
, <chuckle>
: For laughter sounds
- <sigh>
: For sighing sounds
- <cough>
, <sniffle>
: For subtle interruptions
- <groan>
, <yawn>
, <gasp>
: For additional emotional expression
This model is available under the Apache License 2.0.
The original Orpheus model was created by Canopy Labs. This repository contains a quantised version optimised for use with the Orpheus-FastAPI server.
If you use this quantised model in your research or applications, please cite:
@misc{orpheus-tts-2025,
author = {Canopy Labs},
title = {Orpheus-3b-0.1-ft: Text-to-Speech Model},
year = {2025},
publisher = {HuggingFace},
howpublished = {\url{https://huggingface.co/canopylabs/orpheus-3b-0.1-ft}}
}
@misc{orpheus-quantised-2025,
author = {Lex-au},
title = {Orpheus-3b-FT-Q8_0: Quantised TTS Model with FastAPI Server},
note = {GGUF quantisation of canopylabs/orpheus-3b-0.1-ft},
year = {2025},
publisher = {HuggingFace},
howpublished = {\url{https://huggingface.co/lex-au/Orpheus-3b-FT-Q8_0.gguf}}
}