Reformulating the RL of reasoning LLMs through Markovian Thinking paradigm.
AI & ML interests
computational linguistics, natural language processing
Recent Activity
View all activity
Papers
Value Drifts: Tracing Value Alignment During LLM Post-Training
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
spaces
7
pinned
Running
6
AfroBench
🥇
Comprehensive benchmark of LLMs on African Languages
pinned
Running
1
mSTEB Leaderboard
🥇
Leaderboard for mSTEB benchmark
pinned
Running
17
WebLINX Explorer
😻
Visualize web interaction recordings
Runtime error
3
Agent Reward Bench Leaderboard
🥇
Leaderboard for AgentRewardBench
Running
4
Agent Reward Bench Demo
💻
Explore agent trajectories and judgments in web benchmarks
Sleeping
3
Safearena Leaderboard
🏃
SafeArena Leaderboard
models
94
McGill-NLP/LLM2Vec-Qwen3-4B-mntp
Updated
•
16
McGill-NLP/LLM2Vec-Qwen3-17B-mntp
Updated
•
19
McGill-NLP/LLM2Vec-Qwen3-06B-mntp
Updated
•
11
McGill-NLP/LLM2Vec-Qwen25-7B-Instruct-mntp-unsup-simcse
Updated
McGill-NLP/LLM2Vec-Qwen25-3B-Instruct-mntp-unsup-simcse
Updated
McGill-NLP/LLM2Vec-Qwen25-15B-Instruct-mntp-unsup-simcse
Updated
McGill-NLP/LLM2Vec-Qwen25-05B-Instruct-mntp-unsup-simcse
Updated
McGill-NLP/LLM2Vec-Qwen25-7B-Instruct-mntp
Updated
•
9
McGill-NLP/LLM2Vec-Qwen25-3B-Instruct-mntp
Updated
•
9
McGill-NLP/LLM2Vec-Qwen25-15B-Instruct-mntp
Updated
•
20
datasets
37
McGill-NLP/RealVQA-w_model_results_crag_mm_validation_50
Viewer
•
Updated
•
50
•
21
McGill-NLP/value-drifts
Viewer
•
Updated
•
10.6k
•
53
McGill-NLP/SSA-MT
Viewer
•
Updated
•
23.3k
•
32
McGill-NLP/SSA-MTE
Viewer
•
Updated
•
92.9k
•
211
•
2
McGill-NLP/openmath-filtered
Viewer
•
Updated
•
200k
•
57
McGill-NLP/WebLINX-full
Updated
•
13.8k
•
6
McGill-NLP/msteb_requests
Updated
•
1.42k
McGill-NLP/msteb_results
Updated
•
1.39k
McGill-NLP/GlobalNLI
Viewer
•
Updated
•
37.2k
•
45
McGill-NLP/WebMMU
Viewer
•
Updated
•
4.24k
•
61
•
1