Models
GitHub
Discord
Docs
Cloud
Sign in
Download
Models
Download
GitHub
Discord
Docs
Cloud
Sign in
3logic
/
llama_dpo
:latest
3
Downloads
Updated
1 year ago
a finetuned Llama 3.1 8b model with supplementary DPO training
a finetuned Llama 3.1 8b model with supplementary DPO training
Cancel
tools
Updated 1 year ago
1 year ago
158aab3789a2 · 16GB ·
model
arch
llama
·
parameters
8.03B
·
quantization
F16
16GB
template
{{- if or .System .Tools }}<|start_header_id|>system<|end_header_id|> {{- if .System }} {{ .System }
1.5kB
license
LLAMA 3.1 COMMUNITY LICENSE AGREEMENT Llama 3.1 Version Release Date: July 23, 2024 “Agreement”
12kB
adapter
arch
llama
·
parameters
168M
·
quantization
F16
336MB
params
{ "stop": [ "<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>"
96B
Readme
Model Specifications
Base Model
Architecture
: Llama 3.1
Size
: 8B parameters
Type
: Instruct model
Precision
: FP16
Finetuned model parameters
Method
: Full Precision LoRA
Epoch
: 20
Rank
: 64
Alpha
: 16
Training Dataset
: train_05_prompted_v2
DPO Parameters
Epoch
: 3
Rank
: 64
Alpha
: 16
Training Dataset
: dpo_en_demo
Performance Metrics
Metric
DPO Score
Fine Tuned Model Score
Base Model Score
Agentic Similarity
83.67
86
84.67
CoT Contextual Accuracy
55
⁄
4
56 / 3
54 / 5
Medical GPT Score
58.17
65
51.75
Loss Function:
Write
Preview
# Model Specifications ## Base Model - **Architecture**: Llama 3.1 - **Size**: 8B parameters - **Type**: Instruct model - **Precision**: FP16 ## Finetuned model parameters - **Method**: Full Precision LoRA - **Epoch**: 20 - **Rank**: 64 - **Alpha**: 16 - **Training Dataset**: train_05_prompted_v2 ## DPO Parameters - **Epoch**: 3 - **Rank**: 64 - **Alpha**: 16 - **Training Dataset**: dpo_en_demo ## Performance Metrics | Metric | DPO Score | Fine Tuned Model Score | Base Model Score | |--------|--------|--------|--------| | Agentic Similarity | 83.67 | 86 | 84.67 | | CoT Contextual Accuracy | 55/4 | 56 / 3 | 54 / 5 | | Medical GPT Score | 58.17 | 65 | 51.75 | ### Loss Function: 
Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)