Meta Llama-3-8b with Self-Play Preference Optimization for Language Model Alignment at iteration 3

186 4 months ago

21 Tags
379b5e986817 • 4.7GB • 4 months ago
da9defb0d385 • 2.8GB • 4 months ago
ccadd7fcd444 • 2.6GB • 4 months ago
2a7f26fe46c1 • 3.7GB • 4 months ago
82887c324edf • 3.3GB • 4 months ago
deb49f988d37 • 4.7GB • 4 months ago
569035230314 • 4.4GB • 4 months ago
3c5baed8baf1 • 3.2GB • 4 months ago
9d89820e7b63 • 4.3GB • 4 months ago
99627e2857dd • 4.0GB • 4 months ago
1f3635d3f96e • 3.7GB • 4 months ago
379b5e986817 • 4.7GB • 4 months ago
82e35a0bdb00 • 5.1GB • 4 months ago
9e303f62e401 • 4.9GB • 4 months ago
f3e3f7c1c201 • 4.7GB • 4 months ago
da0dc2fa0a46 • 5.6GB • 4 months ago
146dcb9df8ab • 6.1GB • 4 months ago
161e2a16f8aa • 5.7GB • 4 months ago
0d5792850346 • 5.6GB • 4 months ago
b19229efd5a9 • 6.6GB • 4 months ago
4b2f19a50676 • 8.5GB • 4 months ago