SpectralPO

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

ziniuli authored a paper about 1 month ago

Why Transformers Need Adam: A Hessian Perspective

ziniuli authored a paper about 1 month ago

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

ziniuli authored a paper about 1 month ago

CoRT: Code-integrated Reasoning within Thinking

View all activity

Organization Card

Community About org cards

This repo contains all the models for paper -

Spectral Policy Optimization: Coloring your Incorrect Reasoning in GRPO

https://arxiv.org/abs/2505.11595

Please cite

@inproceedings{chen2025spectral,
  title = {Spectral Policy Optimization: Coloring your Incorrect Reasoning in {GRPO}},
  author = {Peter Chen and Xiaopeng Li and Ziniu Li and Xi Chen and Tianyi Lin},
  booktitle = {2nd AI for Math Workshop @ ICML 2025},
  year = {2025},
  url = {https://openreview.net/forum?id=IIBDElbi7s}
}

Collections 7

View 7 collections

models 27

datasets 0

None public yet

AI & ML interests

Recent Activity

Team members 3

Collections 7

models 27 Sort: Recently updated

datasets 0

models 27