Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections Paper • 2603.12180 • Published 9 days ago • 62
In-Context Reinforcement Learning for Tool Use in Large Language Models Paper • 2603.08068 • Published 12 days ago • 39