1 4 1

Shannon Shen

shannons

https://szj.io

AI & ML interests

None yet

Recent Activity

updated a dataset 10 days ago

rl-research/dsqa

published a dataset 10 days ago

rl-research/dsqa

authored a paper about 1 month ago

SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature

View all activity

Organizations

updated a dataset 10 days ago

rl-research/dsqa

Viewer • Updated 10 days ago • 900 • 13

published a dataset 10 days ago

rl-research/dsqa

Viewer • Updated 10 days ago • 900 • 13

authored 3 papers about 1 month ago

SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature

Paper • 2406.07835 • Published Jun 10, 2024 • 2

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

Paper • 2510.09541 • Published Oct 10 • 15

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24 • 60

upvoted a paper about 1 month ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24 • 60

updated a dataset about 1 month ago

rl-research/deep_research_bench_eval

Viewer • Updated Nov 23 • 100 • 75

published a dataset about 1 month ago

rl-research/deep_research_bench_eval

Viewer • Updated Nov 23 • 100 • 75

updated a dataset about 1 month ago

rl-research/webwalker_test

Viewer • Updated Nov 23 • 680 • 32

published 2 datasets about 1 month ago

rl-research/webwalker_test

Viewer • Updated Nov 23 • 680 • 32

rl-research/researchqa_official_subset_ids

Viewer • Updated Nov 23 • 776 • 62

updated a dataset about 1 month ago

rl-research/researchqa_official_subset_ids

Viewer • Updated Nov 23 • 776 • 62

upvoted a paper 3 months ago

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

Paper • 2510.09541 • Published Oct 10 • 15

published a dataset 5 months ago

shannons/ot3-1.2m-10k

Viewer • Updated Jul 16 • 10k • 22

updated 2 datasets 5 months ago

rl-rag/combined-sft-training-data-v20250724

Viewer • Updated Jul 24 • 568 • 15

rl-rag/combined-sft-training-data-v20250724

Viewer • Updated Jul 24 • 568 • 15

updated 2 datasets 6 months ago

shannons/ot3-1.2m-10k

Viewer • Updated Jul 16 • 10k • 22

shannons/ot3-1.2m-10k

Viewer • Updated Jul 16 • 10k • 22

Shannon Shen

AI & ML interests

Recent Activity

Organizations

shannons's activity