Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Milad Aghajohari's picture
2 3 3

Milad Aghajohari

miladink
Moreza009's profile picture
·
  • maghajohari
  • miladink

AI & ML interests

NLP, ML, Multi-Agent RL, SSL, AI

Recent Activity

upvoted a paper 27 days ago
Grounding Computer Use Agents on Human Demonstrations
authored a paper 2 months ago
LOQA: Learning with Opponent Q-Learning Awareness
authored a paper 2 months ago
VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
View all activity

Organizations

MathMinds AGI's profile picture

authored 3 papers 2 months ago

LOQA: Learning with Opponent Q-Learning Awareness

Paper • 2405.01035 • Published May 2, 2024

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

Paper • 2410.01679 • Published Oct 2, 2024 • 27

The Markovian Thinker

Paper • 2510.06557 • Published Oct 8 • 30
authored a paper 8 months ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2 • 86
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs