Dongfu Jiang

DongfuJiang

https://jdf-prog.github.io/

AI & ML interests

NLP, common sense reasoning

Organizations

WildFeedback data demo

models 13

DongfuJiang/PairRM-V2-phi-3-4k-mini-all

Updated Aug 5

DongfuJiang/vapo_lora_all_data_iter_2

Updated Aug 1

DongfuJiang/vapo_lora_all_data_iter_1

Updated Jul 31 • 1

DongfuJiang/PairRM-V2-phi3-3-mini-unified-feedback

Updated Jul 30 • 1

DongfuJiang/PairRM-V2-phi3-3-mini-ultra-feedback-binarized-lora

Updated Jul 26

DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-1600

Text Generation • Updated Jul 25 • 5

DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-1200

Text Generation • Updated Jul 25 • 7

DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-2000

Text Generation • Updated Jul 25 • 7

DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-2400

Text Generation • Updated Jul 25 • 6

DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-2882

Text Generation • Updated Jul 25 • 7

datasets 5

DongfuJiang/simpo_v2_ultrafeedback

Viewer • Updated Aug 2 • 59.9k • 26

DongfuJiang/VAPO

Viewer • Updated Jul 31 • 72.5k • 25

DongfuJiang/PairRM-data

Viewer • Updated Jul 30 • 586k • 1

DongfuJiang/WildFeedback

Viewer • Updated Jul 26 • 26.5k • 3

DongfuJiang/FeTaQA

Viewer • Updated May 8, 2023 • 10.3k • 265 • 6

Papers 9

spaces 2

VAPO data demo

WildFeedback data demo
