huihui_ai/deepscaler-abliterated

huihui_ai/ deepscaler-abliterated:latest

1,955 Downloads Updated 1 year ago

A fine-tuned version of Deepseek-R1-Distilled-Qwen-1.5B that surpasses the performance of OpenAI’s o1-preview with just 1.5B parameters on popular math evaluations.

1.5b

ollama run huihui_ai/deepscaler-abliterated

curl http://localhost:11434/api/chat \
  -d '{
    "model": "huihui_ai/deepscaler-abliterated",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='huihui_ai/deepscaler-abliterated',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'huihui_ai/deepscaler-abliterated',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 1 year ago

1 year ago

752ca9d83330 · 7.1GB ·

model

archqwen2

parameters1.78B

quantizationF32

7.1GB

license

1.1kB

params

{ "stop": [ "<｜begin▁of▁sentence｜>", "<｜end▁of▁sentence｜>",

179B

template

{{- if .System }}{{ .System }}{{ end }} {{- range $i, $_ := .Messages }} {{- $last := eq (len (slice

387B

Readme

This is an uncensored version of agentica-org/DeepScaleR-1.5B-Preview created with abliteration (see remove-refusals-with-transformers to know more about it).
This is a crude, proof-of-concept implementation to remove refusals from an LLM model without using TransformerLens.

References

HuggingFace

Donation

Your donation helps us continue our further development and improvement, a cup of coffee can do it.

bitcoin:

  bc1qqnkhuchxw0zqjh2ku3lu4hq45hc6gy84uk70ge