Alina Lozovskaya

alozowski

AI & ML interests

NLP in all aspects

alozowski's activity

New activity in open-llm-leaderboard/results 2 days ago

Model fail, re-eval request 😊

8
#885 opened about 1 month ago by dnhkng

How to calculate GPQA score?

4
#928 opened 7 days ago by JJaeuk

🚩 Report: Not working

1
#939 opened 3 days ago by Lyte

Why failed

1
#936 opened 4 days ago by DZgas

New activity in open-llm-leaderboard/requests 3 days ago

failed

6
#59 opened 6 days ago by legolasyiu

New activity in open-llm-leaderboard/open_llm_leaderboard 11 days ago

check-submit

5
#920 opened 11 days ago by alozowski

New activity in open-llm-leaderboard/results 11 days ago

Missing Llama 3.1 405B

1
#15 opened 14 days ago by lukestanley

New activity in open-llm-leaderboard/open_llm_leaderboard 14 days ago

Model evaluation failed

1
#916 opened 15 days ago by CoolSpring

bump-up-gradio

5
#918 opened 14 days ago by alozowski

New activity in open-llm-leaderboard/open_llm_leaderboard 17 days ago

Still pending

6
#900 opened 25 days ago by legolasyiu

New activity in open-llm-leaderboard/open_llm_leaderboard 18 days ago

Incomplete model

1
#909 opened 19 days ago by MaziyarPanahi

bump-up-transformers

5
#910 opened 18 days ago by alozowski

New activity in open-llm-leaderboard/open_llm_leaderboard 23 days ago

Model evaluations failed

4
#884 opened about 1 month ago by DavidGF

Incorrect ifeval benchmark

5
#879 opened about 1 month ago by DavidGF

New activity in open-llm-leaderboard/requests 23 days ago

all failed tests

1
#57 opened 24 days ago by legolasyiu

New activity in open-llm-leaderboard/open_llm_leaderboard 23 days ago

Model Failed: StableProse

3
#894 opened 28 days ago by nlpguy