Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing Paper • 2509.08721 • Published Sep 10 • 660
Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge Paper • 2506.21506 • Published Jun 26 • 51
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 222
DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers Paper • 2502.18460 • Published Feb 25 • 3
The Prompt Report: A Systematic Survey of Prompting Techniques Paper • 2406.06608 • Published Jun 6, 2024 • 68