Milad Aghajohari's picture

2 3 3

Milad Aghajohari

miladink

·

AI & ML interests

NLP, ML, Multi-Agent RL, SSL, AI

Recent Activity

upvoted a paper 27 days ago

Grounding Computer Use Agents on Human Demonstrations

authored a paper 2 months ago

LOQA: Learning with Opponent Q-Learning Awareness

authored a paper 2 months ago

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

View all activity

Organizations

authored 3 papers 2 months ago

LOQA: Learning with Opponent Q-Learning Awareness

Paper • 2405.01035 • Published May 2, 2024

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

Paper • 2410.01679 • Published Oct 2, 2024 • 27

The Markovian Thinker

Paper • 2510.06557 • Published Oct 8 • 30

authored a paper 8 months ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2 • 86