TCYZ/cokertme2:6m

70 Downloads Updated 4 months ago

Çökertme 2 is a family of ultra-nano-scale language models optimized for Turkish natural language processing tasks.

Available sizes: 0.8m, 1.6m, 5m, 6m, 14m, 100m

model

40ce0395bf9f · 27MB

Metadata

Key                                       Value
general.architecture                      llama
llama.attention.head_count                8
llama.attention.head_count_kv             8
llama.attention.layer_norm_rms_epsilon    1e-05
llama.block_count                         2
llama.context_length                      128
llama.embedding_length                    64
llama.feed_forward_length                 170
tokenizer.ggml.model                      llama
tokenizer.ggml.scores                     [0, 0, 0, 0, 0, ...]
tokenizer.ggml.token_type                 [1, 1, 1, 1, 1, ...]
tokenizer.ggml.tokens                     [!, ", #, $, %, ...]
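The metadata values above determine a few derived quantities that are not listed on the page. The following is a minimal Python sketch, not part of the model's own tooling: the helper names (`head_dim`, `attention_kind`, `kv_cache_floats`) are hypothetical, and it assumes the usual llama-style convention that the embedding is split evenly across attention heads.

```python
# Values copied from the Metadata listing; the helpers below are illustrative only.
metadata = {
    "llama.attention.head_count": 8,
    "llama.attention.head_count_kv": 8,
    "llama.block_count": 2,
    "llama.context_length": 128,
    "llama.embedding_length": 64,
    "llama.feed_forward_length": 170,
}


def head_dim(meta):
    """Per-head width, assuming the embedding is divided evenly across heads."""
    n_embd = meta["llama.embedding_length"]
    n_head = meta["llama.attention.head_count"]
    if n_embd % n_head != 0:
        raise ValueError("embedding_length is not divisible by head_count")
    return n_embd // n_head


def attention_kind(meta):
    """Classify the attention layout from the query and key/value head counts."""
    n_head = meta["llama.attention.head_count"]
    n_kv = meta["llama.attention.head_count_kv"]
    if n_kv == n_head:
        return "multi-head"
    if n_kv == 1:
        return "multi-query"
    return "grouped-query"


def kv_cache_floats(meta):
    """Number of values in a full-context K and V cache:
    2 (K and V) * layers * context length * KV heads * head width."""
    return (
        2
        * meta["llama.block_count"]
        * meta["llama.context_length"]
        * meta["llama.attention.head_count_kv"]
        * head_dim(meta)
    )


print(head_dim(metadata))         # 64 / 8 = 8
print(attention_kind(metadata))   # head_count == head_count_kv
print(kv_cache_floats(metadata))  # 2 * 2 * 128 * 8 * 8
```

Because `head_count` and `head_count_kv` are equal, every query head has its own key/value head, so under this reading the model uses plain multi-head attention rather than a grouped-query layout.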

Tensor

Name                        Type    Shape
token_embd.weight           F32     [64, 50257]

blk.0
blk.0.attn_k.weight         F32     [64, 64]
blk.0.attn_norm.weight      F32     [64]
blk.0.attn_output.weight    F32     [64, 64]
blk.0.attn_q.weight         F32     [64, 64]
blk.0.attn_v.weight         F32     [64, 64]
blk.0.ffn_down.weight       F32     [170, 64]
blk.0.ffn_gate.weight       F32     [64, 170]
blk.0.ffn_norm.weight       F32     [64]
blk.0.ffn_up.weight         F32     [64, 170]

blk.1
blk.1.attn_k.weight         F32     [64, 64]
blk.1.attn_norm.weight      F32     [64]
blk.1.attn_output.weight    F32     [64, 64]
blk.1.attn_q.weight         F32     [64, 64]
blk.1.attn_v.weight         F32     [64, 64]
blk.1.ffn_down.weight       F32     [170, 64]
blk.1.ffn_gate.weight       F32     [64, 170]
blk.1.ffn_norm.weight       F32     [64]
blk.1.ffn_up.weight         F32     [64, 170]

output.weight               F32     [64, 50257]
output_norm.weight          F32     [64]
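The tensor shapes listed here are enough to check the "6m" tag and the 27MB file size by hand. The sketch below is illustrative Python, with hypothetical helper names (`build_tensor_shapes`, `count_parameters`); it hard-codes the shapes from the listing and assumes 4 bytes per F32 value.

```python
from math import prod

# Per-block tensor shapes as shown in the Tensor listing (identical for blk.0 and blk.1).
BLOCK_TENSORS = {
    "attn_k.weight": (64, 64),
    "attn_norm.weight": (64,),
    "attn_output.weight": (64, 64),
    "attn_q.weight": (64, 64),
    "attn_v.weight": (64, 64),
    "ffn_down.weight": (170, 64),
    "ffn_gate.weight": (64, 170),
    "ffn_norm.weight": (64,),
    "ffn_up.weight": (64, 170),
}

# Tensors that appear once, outside the repeated blocks.
GLOBAL_TENSORS = {
    "token_embd.weight": (64, 50257),
    "output.weight": (64, 50257),
    "output_norm.weight": (64,),
}


def build_tensor_shapes(n_blocks=2):
    """Return a name -> shape mapping for the whole model."""
    shapes = dict(GLOBAL_TENSORS)
    for i in range(n_blocks):
        for name, shape in BLOCK_TENSORS.items():
            shapes[f"blk.{i}.{name}"] = shape
    return shapes


def count_parameters(shapes):
    """Sum the element counts of all tensors."""
    return sum(prod(shape) for shape in shapes.values())


shapes = build_tensor_shapes()
total_params = count_parameters(shapes)
total_bytes = total_params * 4  # F32 stores each value in 4 bytes

print(total_params)               # 6531264
print(round(total_bytes / 1e6, 1))  # 26.1
```

The result, about 6.5 million parameters and roughly 26.1 MB of weights, is consistent with the "6m" tag and the listed 27MB. The remaining space is presumably taken by the tokenizer vocabulary and other metadata, though that is an assumption rather than something the page states. Note that the two [64, 50257] tensors (input embedding and output projection) account for almost all of the parameters; the two transformer blocks together contribute only 98,304.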