-
Graph Neural Network Training with Data Tiering
Paper • 2111.05894 • Published -
Graph Neural Networks are Dynamic Programmers
Paper • 2203.15544 • Published • 1 -
Graph Neural Networks for Jamming Source Localization
Paper • 2506.03196 • Published -
Code as Agent Harness
Paper • 2605.18747 • Published • 213
Collections
Discover the best community collections!
Collections including paper arxiv:2603.27771
-
Distributional AGI Safety
Paper • 2512.16856 • Published • 1 -
Soft-Label Governance for Distributional Safety in Multi-Agent Systems
Paper • 2604.19752 • Published • 3 -
Virtual Agent Economies
Paper • 2509.10147 • Published • 27 -
Emergent Social Intelligence Risks in Generative Multi-Agent Systems
Paper • 2603.27771 • Published • 52
-
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Paper • 2411.02337 • Published • 36 -
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
Paper • 2411.04996 • Published • 51 -
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 70 -
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization
Paper • 2410.08815 • Published • 47
-
Monitored Markov Decision Processes
Paper • 2402.06819 • Published -
Generalization in Monitored Markov Decision Processes (Mon-MDPs)
Paper • 2505.08988 • Published -
Bayesian Risk Markov Decision Processes
Paper • 2106.02558 • Published -
Sotopia-RL: Reward Design for Social Intelligence
Paper • 2508.03905 • Published • 23
-
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
Paper • 2510.03222 • Published • 76 -
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
Paper • 2510.05592 • Published • 112 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 516 -
Multi-Agent Tool-Integrated Policy Optimization
Paper • 2510.04678 • Published • 31
-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 70 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 37 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100
-
Graph Neural Network Training with Data Tiering
Paper • 2111.05894 • Published -
Graph Neural Networks are Dynamic Programmers
Paper • 2203.15544 • Published • 1 -
Graph Neural Networks for Jamming Source Localization
Paper • 2506.03196 • Published -
Code as Agent Harness
Paper • 2605.18747 • Published • 213
-
Monitored Markov Decision Processes
Paper • 2402.06819 • Published -
Generalization in Monitored Markov Decision Processes (Mon-MDPs)
Paper • 2505.08988 • Published -
Bayesian Risk Markov Decision Processes
Paper • 2106.02558 • Published -
Sotopia-RL: Reward Design for Social Intelligence
Paper • 2508.03905 • Published • 23
-
Distributional AGI Safety
Paper • 2512.16856 • Published • 1 -
Soft-Label Governance for Distributional Safety in Multi-Agent Systems
Paper • 2604.19752 • Published • 3 -
Virtual Agent Economies
Paper • 2509.10147 • Published • 27 -
Emergent Social Intelligence Risks in Generative Multi-Agent Systems
Paper • 2603.27771 • Published • 52
-
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
Paper • 2510.03222 • Published • 76 -
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
Paper • 2510.05592 • Published • 112 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 516 -
Multi-Agent Tool-Integrated Policy Optimization
Paper • 2510.04678 • Published • 31
-
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Paper • 2411.02337 • Published • 36 -
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
Paper • 2411.04996 • Published • 51 -
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 70 -
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization
Paper • 2410.08815 • Published • 47
-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 70 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 37 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100