arxiv:2406.15252
Dongfu Jiang
DongfuJiang
AI & ML interests
NLP, common sense reasoning
Organizations
models
13
DongfuJiang/PairRM-V2-phi-3-4k-mini-all
Updated
DongfuJiang/vapo_lora_all_data_iter_2
Updated
DongfuJiang/vapo_lora_all_data_iter_1
Updated
•
1
DongfuJiang/PairRM-V2-phi3-3-mini-unified-feedback
Updated
•
1
DongfuJiang/PairRM-V2-phi3-3-mini-ultra-feedback-binarized-lora
Updated
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-1600
Text Generation
•
Updated
•
5
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-1200
Text Generation
•
Updated
•
7
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-2000
Text Generation
•
Updated
•
7
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-2400
Text Generation
•
Updated
•
6
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-2882
Text Generation
•
Updated
•
7
datasets
5
DongfuJiang/simpo_v2_ultrafeedback
Viewer
•
Updated
•
59.9k
•
26
DongfuJiang/VAPO
Viewer
•
Updated
•
72.5k
•
25
DongfuJiang/PairRM-data
Viewer
•
Updated
•
586k
•
1
DongfuJiang/WildFeedback
Viewer
•
Updated
•
26.5k
•
3
DongfuJiang/FeTaQA
Viewer
•
Updated
•
10.3k
•
265
•
6