arxiv:2510.06557
Milad Aghajohari
miladink
AI & ML interests
NLP, ML, Multi-Agent RL, SSL, AI
Recent Activity
upvoted a paper 26 days ago
Grounding Computer Use Agents on Human Demonstrations
authored a paper 2 months ago
LOQA: Learning with Opponent Q-Learning Awareness
authored a paper 2 months ago
VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment