13 yesterday

tools thinking
ollama run charaf/qwen3-embedding-8b-mlx-mxfp8

Details

yesterday

3f0f25c15c90 · 7.8GB ·

{ "architectures": [ "Qwen3ForCausalLM" ], "attention_bias": false, "attention_dropout": 0.0, "bos_t
{ "bos_token_id": 151643, "eos_token_id": 151643, "max_new_tokens": 2048, "transformers_version": "4
{ "version": "1.0", "truncation": null, "padding": null, "added_tokens": [ { "id": 151643, "content"
{ "add_bos_token": false, "add_prefix_space": false, "added_tokens_decoder": { "151643": { "content"
{"!":0,"\"":1,"#":2,"$":3,"%":4,"&":5,"'":6,"(":7,")":8,"*":9,"+":10,",":11,"-":12,".":13,"/":14,"0"
{ "num_ctx": 32768 }
{{ .Prompt }}
398 tensors

Readme

No readme