Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2603.27771

Graph Neural Network Training with Data Tiering

Paper • 2111.05894 • Published Nov 10, 2021
Graph Neural Networks are Dynamic Programmers

Paper • 2203.15544 • Published Mar 29, 2022 • 1
Graph Neural Networks for Jamming Source Localization

Paper • 2506.03196 • Published Jun 1, 2025
Code as Agent Harness

Paper • 2605.18747 • Published 22 days ago • 213

Distributional AGI Safety

Distributional AGI Safety

Paper • 2512.16856 • Published Dec 18, 2025 • 1
Soft-Label Governance for Distributional Safety in Multi-Agent Systems

Paper • 2604.19752 • Published Mar 19 • 3
Virtual Agent Economies

Paper • 2509.10147 • Published Sep 12, 2025 • 27
Emergent Social Intelligence Risks in Generative Multi-Agent Systems

Paper • 2603.27771 • Published Mar 29 • 52

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4, 2024 • 36
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7, 2024 • 51
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 70
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Paper • 2410.08815 • Published Oct 11, 2024 • 47

Read Later 📚

Interesting papers on AI, LLMs, etc. to add to reading list

Monitored Markov Decision Processes

Paper • 2402.06819 • Published Feb 9, 2024
Generalization in Monitored Markov Decision Processes (Mon-MDPs)

Paper • 2505.08988 • Published May 13, 2025
Bayesian Risk Markov Decision Processes

Paper • 2106.02558 • Published Jun 4, 2021
Sotopia-RL: Reward Design for Social Intelligence

Paper • 2508.03905 • Published Aug 5, 2025 • 23

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3, 2025 • 76
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 112
Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 516
Multi-Agent Tool-Integrated Policy Optimization

Paper • 2510.04678 • Published Oct 6, 2025 • 31

about 17 hours ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 70
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published Feb 9, 2025 • 37
MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20, 2025 • 195
SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20, 2025 • 100

Graph Neural Network Training with Data Tiering

Paper • 2111.05894 • Published Nov 10, 2021
Graph Neural Networks are Dynamic Programmers

Paper • 2203.15544 • Published Mar 29, 2022 • 1
Graph Neural Networks for Jamming Source Localization

Paper • 2506.03196 • Published Jun 1, 2025
Code as Agent Harness

Paper • 2605.18747 • Published 22 days ago • 213

Read Later 📚

Interesting papers on AI, LLMs, etc. to add to reading list

Monitored Markov Decision Processes

Paper • 2402.06819 • Published Feb 9, 2024
Generalization in Monitored Markov Decision Processes (Mon-MDPs)

Paper • 2505.08988 • Published May 13, 2025
Bayesian Risk Markov Decision Processes

Paper • 2106.02558 • Published Jun 4, 2021
Sotopia-RL: Reward Design for Social Intelligence

Paper • 2508.03905 • Published Aug 5, 2025 • 23

Distributional AGI Safety

Distributional AGI Safety

Paper • 2512.16856 • Published Dec 18, 2025 • 1
Soft-Label Governance for Distributional Safety in Multi-Agent Systems

Paper • 2604.19752 • Published Mar 19 • 3
Virtual Agent Economies

Paper • 2509.10147 • Published Sep 12, 2025 • 27
Emergent Social Intelligence Risks in Generative Multi-Agent Systems

Paper • 2603.27771 • Published Mar 29 • 52

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3, 2025 • 76
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 112
Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 516
Multi-Agent Tool-Integrated Policy Optimization

Paper • 2510.04678 • Published Oct 6, 2025 • 31

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4, 2024 • 36
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7, 2024 • 51
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 70
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Paper • 2410.08815 • Published Oct 11, 2024 • 47

about 17 hours ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 70
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published Feb 9, 2025 • 37
MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20, 2025 • 195
SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20, 2025 • 100

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs