
-
phi-3.5-mini-4k-instruct
Microsoft's updated Phi-3 Mini (June 2024 release), sometimes referred to as Phi-3.1 Mini.
2,936 Pulls 2 Tags Updated 9 months ago
-
autocoder-s-6.7b
This is the 6.7B version of AutoCoder.
759 Pulls 3 Tags Updated 9 months ago
-
phi-3-medium-4k-instruct-abliterated-v3
An uncensored (abliterated) build of microsoft/Phi-3-medium-4k-instruct.
695 Pulls 2 Tags Updated 9 months ago
-
replete-coder-qwen2-1.5b
Although Replete-Coder has strong coding capabilities, it is also trained on a vast amount of non-coding data, fully cleaned and uncensored.
484 Pulls 3 Tags Updated 9 months ago
-
qwen2-7b-instruct-deccp
This is a simple abliterated (refusal-orthogonalized) version of the Qwen2-7B-Instruct model; a sketch of the technique follows this entry.
430 Pulls 2 Tags Updated 9 months ago
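Abliteration works by estimating a "refusal direction" in the model's activation space and orthogonalizing weight matrices against it, so layers can no longer write to that direction. Below is a minimal sketch of the core projection step in PyTorch, assuming a precomputed unit-norm `refusal_dir`; the function name and the layer shown are illustrative, not this model's actual code.

```python
import torch

def orthogonalize(W: torch.Tensor, refusal_dir: torch.Tensor) -> torch.Tensor:
    """Remove the component of each output of W that lies along
    refusal_dir, so this layer can no longer write to that direction.

    W: (d_out, d_in) weight matrix; refusal_dir: (d_out,) direction vector.
    """
    r = refusal_dir / refusal_dir.norm()   # ensure unit length
    # (I - r r^T) W, computed without materializing the full projector
    return W - torch.outer(r, r @ W)

# Hypothetical usage on one attention output projection:
# W = model.layers[i].self_attn.o_proj.weight.data
# model.layers[i].self_attn.o_proj.weight.data = orthogonalize(W, refusal_dir)
```

The same projection is typically applied to every attention output and MLP down-projection in the network.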
-
wizardlm-2-7b-abliterated
This is the WizardLM-2-7B model with orthogonalized bfloat16 safetensor weights, based on the implementation by @failspy.
338 Pulls 2 Tags Updated 9 months ago
-
daybreak-kunoichi-2dpo-7b
An experimental model that applies a second round of DPO training on top of Kunoichi-DPO-v2-7b, i.e. double DPO; see the loss sketch after this entry.
227 Pulls 1 Tag Updated 10 months ago
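DPO optimizes directly on preference pairs: it pushes the policy's log-probability margin between chosen and rejected completions above that of a frozen reference model. Below is a minimal sketch of the standard DPO loss in PyTorch, assuming per-sequence summed token log-probabilities are already available; variable names are illustrative.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps: torch.Tensor,
             policy_rejected_logps: torch.Tensor,
             ref_chosen_logps: torch.Tensor,
             ref_rejected_logps: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    """Standard DPO loss over a batch of preference pairs.

    Each input is a (batch,) tensor of summed token log-probs for the
    chosen or rejected completion under the policy or reference model.
    """
    chosen_ratio = policy_chosen_logps - ref_chosen_logps
    rejected_ratio = policy_rejected_logps - ref_rejected_logps
    # -log sigmoid(beta * margin) rewards chosen >> rejected vs. the reference
    return -F.logsigmoid(beta * (chosen_ratio - rejected_ratio)).mean()
```

"Double DPO" simply means running a second pass of this same objective on a model that has already been DPO-tuned.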
-
qwen2-7b-instruct
190 Pulls 1 Tag Updated 9 months ago
-
yi-coder-9b-chat
Yi-Coder in q6 and q8 quantizations; a usage sketch follows this entry.
183 Pulls 2 Tags Updated 7 months ago
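The two tags select different quantization levels of the same weights: q8 is larger but closer to full precision, q6 trades some fidelity for memory. Below is a minimal sketch of pulling and querying a specific tag with the official `ollama` Python client, assuming a local Ollama server; the exact tag string is an assumption based on this listing.

```python
import ollama  # pip install ollama; assumes a local Ollama server is running

MODEL = "yi-coder-9b-chat:q8"  # tag name assumed from this listing

ollama.pull(MODEL)  # download the quantized weights if not already present

response = ollama.chat(
    model=MODEL,
    messages=[{"role": "user", "content": "Write a binary search in Python."}],
)
print(response["message"]["content"])
```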
-
llama-3-8b-instruct-mopeymule
Overview: Llama-MopeyMule-3 is an orthogonalized version of Llama-3, adjusted to introduce an unengaged, melancholic conversational style: brief, vague responses with a lack of enthusiasm and detail.
152 Pulls 3 Tags Updated 9 months ago
-
neuralstar_fusionwriter_4x7b
NeuralStar_FusionWriter_4x7b is a Mixture of Experts (MoE) model assembled from several base models using LazyMergekit; a sketch of MoE routing follows this entry.
107 Pulls 2 Tags Updated 8 months ago
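In a 4x7B MoE of this kind, the four expert models contribute parallel feed-forward stacks and a router decides, per token, which experts run. Below is a minimal sketch of top-2 routing in PyTorch; it illustrates the general MoE mechanism rather than mergekit's actual merge code.

```python
import torch
import torch.nn as nn

class Top2MoE(nn.Module):
    """Mix the outputs of each token's top-2 experts, weighted by the
    renormalized router probabilities."""

    def __init__(self, d_model: int = 4096, n_experts: int = 4, d_ff: int = 14336):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (tokens, d_model)
        gates = self.router(x).softmax(dim=-1)             # (tokens, n_experts)
        weights, idx = gates.topk(2, dim=-1)               # top-2 experts per token
        weights = weights / weights.sum(-1, keepdim=True)  # renormalize the pair
        out = torch.zeros_like(x)
        for k in range(2):                                 # each of the two slots
            for e, expert in enumerate(self.experts):
                mask = idx[:, k] == e                      # tokens routed to expert e
                if mask.any():
                    out[mask] += weights[mask, k:k+1] * expert(x[mask])
        return out
```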
-
llama-3-gutenberg-8b
nbeerbower/llama-3-bophades-v3-8B fine-tuned on jondurbin/gutenberg-dpo-v0.1. Based on Llama-3-8b and governed by the META LLAMA 3 COMMUNITY LICENSE AGREEMENT.
84 Pulls 2 Tags Updated 10 months ago
-
nanbeige2-16b-chat
Nanbeige2-16B-Chat is the latest 16B model developed by the Nanbeige Lab, trained on 4.5T tokens of high-quality data.
68 Pulls 1 Tag Updated 10 months ago
-
calme-7b-instruct
Calme-7B is a state-of-the-art language model with 7 billion parameters, fine-tuned on high-quality datasets on top of Mistral-7B. The Calme-7B models excel at generating clear, calm, and coherent text.
66 Pulls 2 Tags Updated 10 months ago
-
mistroll-7b-v2.2
This model was trained 2x faster with Unsloth and Hugging Face's TRL library, as an experiment in fixing models with incorrect behaviors.
37 Pulls 1 Tag Updated 10 months ago
-
ultramerge-7b
An experimental DPO fine-tune of automerger/YamShadow-7B on a set of preference datasets.
10 Pulls 1 Tag Updated 10 months ago