Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
Haokai Zhao
jz666
Follow
AI & ML interests
None yet
Recent Activity
updated
a dataset
6 days ago
jz666/llama3-ultrafeedback-leven-4
published
a dataset
6 days ago
jz666/llama3-ultrafeedback-leven-4
updated
a dataset
6 days ago
jz666/llama3-ultrafeedback-leven
View all activity
Organizations
None yet
models
16
Sort: Recently updated
jz666/dpo-grad-acc-128-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 21, 2025
jz666/dpo-grad-acc-32-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 21, 2025
•
2
jz666/dpo-grad-acc-64-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 21, 2025
•
4
jz666/dpo-grad-acc-16-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 21, 2025
•
3
jz666/gemma-2-9b-it-dpo-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 20, 2025
•
1
jz666/gemma-2-9b-it-simpo-split-10-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 17, 2025
jz666/simpo-train-large-wrong
Text Generation
•
9B
•
Updated
Oct 16, 2025
•
2
jz666/simpo-train-filtered-full
Text Generation
•
9B
•
Updated
Oct 14, 2025
•
2
jz666/simpo-train-large-correct
Text Generation
•
9B
•
Updated
Oct 14, 2025
•
6
jz666/simpo-train-small-wrong
Text Generation
•
9B
•
Updated
Oct 14, 2025
•
2
View 16 models
datasets
31
Sort: Recently updated
jz666/llama3-ultrafeedback-leven-4
Viewer
•
Updated
6 days ago
•
61.8k
•
10
jz666/llama3-ultrafeedback-leven
Viewer
•
Updated
6 days ago
•
61.8k
•
24
jz666/llama3-ultrafeedback-ches-4
Viewer
•
Updated
7 days ago
•
61.8k
•
10
jz666/llama3-ultrafeedback-ches
Viewer
•
Updated
7 days ago
•
61.8k
•
21
jz666/llama3-ultrafeedback-reward-4-ches
Viewer
•
Updated
7 days ago
•
61.8k
•
36
jz666/llama3-ultrafeedback-templated-ppl-reward-chosen-4
Viewer
•
Updated
24 days ago
•
61.8k
•
20
jz666/llama3-ultrafeedback-flip
Viewer
•
Updated
25 days ago
•
61.8k
•
12
jz666/gemma2-ultrafeedback-flip
Viewer
•
Updated
25 days ago
•
61.5k
•
10
jz666/llama3-ultrafeedback-reward-chosen-4
Viewer
•
Updated
26 days ago
•
61.8k
•
29
jz666/gemma2-ultrafeedback-reward-chosen-4
Viewer
•
Updated
26 days ago
•
61.5k
•
16
View 31 datasets