Spaces:
Runtime error
Runtime error
Commit History
Refactor get_leaderboard_df to handle multiple subsets and improve column selection and hiding 303626f
Refactor dataset version column name to lowercase in auto_eval_column_dict f1d5836
Refactor init_leaderboard function to handle multiple subsets and improve column selection and hiding 09c7b10
Refactor load_dataset to include split parameter in populate.py 95e674a
Refactor get_leaderboard_df to handle multiple subsets and improve column selection and hiding d9f262c
Refactor get_leaderboard_df to handle multiple subsets and improve column selection and hiding 4c839ed
chore: Refactor get_leaderboard_df to handle multiple subsets in populate.py 88477a4
chore: Update app.py to include select_columns and hide_columns in init_leaderboard function 37b74a1
chore: Update app.py to include select_columns and hide_columns in init_leaderboard function ab7ee2d
chore: Round numeric columns to two decimal places in get_leaderboard_df 7660cbc
chore: Update envs.py with EVAL_REQUESTS_PATH_BACKEND and EVAL_RESULTS_PATH_BACKEND 2f420b7
chore: Remove commented out code for model information in utils.py 903180b
chore: Update auto_eval_column_dict to use "Total" instead of "Overall" for the Overall column 15d3941
chore: Update model name in auto_eval_column_dict d47aa6d
chore: Update search columns in app.py to include model and license names 94d4dbb
chore: Update envs.py with EVAL_REQUESTS_PATH_BACKEND and EVAL_RESULTS_PATH_BACKEND 3c62a69
chore: Update Tasks enum values in about.py 046ddc7
Update GOOGLE_SHEET_ID in envs.py 93dabac
chore: Remove commented out code for model information in utils.py d598d7d
chore: Remove commented out code for model information in utils.py 65654bf
chore: Update page title to "LiveBench" 8336bbd
chore: Update about page title to "Live Bench" 24c1f06
Revert "Update repository references in envs.py" ce61fc8
Update repository references in envs.py 1d340cf
Update src/envs.py adad63e verified
added leaderboard component to simplify main script 8b28d2b
Clémentine commited on
doc c1b8a96
Clémentine commited on
simplified the template 24622c4
Clémentine commited on
CPU, TOKEN, env variables (#4) 55cc480 verified
Update src/submission/check_validity.py 6eb8bfd
made token a requirement f982b8e
Clémentine commited on
test f0298e1
Clémentine commited on
fix c15e77e
Clémentine commited on
removed quantization to simplify b899767
Clémentine commited on
now with a functionning backend 1ffc326
Clémentine commited on
update read 943f952
Clémentine commited on
fixs 314f91a
Clémentine commited on
updated leaderboard efeee6d
Clémentine commited on
Simplified leaderboard v0 9833cdb
Clémentine commited on
simplified some parts of the code + updated requirements 9d22eee
Clémentine commited on
Added check on tokenizer to prevent submissions which won't run 7302987
Clémentine commited on
Update benchmark count and fix typo (`inetuning->finetuning`) (#395) 7abc6a7
fix order of request file vs request file list, to avoid resubmitting issues 976f398
Clémentine commited on
cache 4ff9eef
Clémentine commited on
update for caching 395eff6
Clémentine commited on
add model architecture as column 3dfaf22
Clémentine commited on
Simplify About eaace79
Clémentine commited on
Refactor 2 - added plotting back b1a1395
Clémentine commited on
fix value error in param size ccefec9
Clémentine commited on