Spaces:
Running
Running
| <br/> | |
| # SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories | |
| <!-- [π arxiv](https://arxiv.org/pdf/2409.07440) | --> | |
| [π» GitHub](https://github.com/allenai/super-benchmark) | [π€ HuggingFace](https://huggingface.co/datasets/allenai/super) | Updated: **{LAST_UPDATED}** | |