Clockz commited on
Commit
d95b6a3
·
verified ·
1 Parent(s): a1ca024

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +43 -0
README.md ADDED
@@ -0,0 +1,43 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ ---
4
+
5
+ <h1 align="center">
6
+ On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
7
+ </h1>
8
+
9
+ <div align="center">
10
+
11
+ <a href="https://chenlong-clock.github.io">Charlie Zhang</a>, <a href="https://www.phontron.com">Graham Neubig</a>,
12
+ <a href="https://xiangyue9607.github.io">Xiang Yue</a>
13
+
14
+ Carnegie Mellon University, Language Technologies Institute
15
+
16
+ </div>
17
+
18
+ <div align="center">
19
+
20
+ [![arXiv](https://img.shields.io/badge/arXiv-2512.07783-b31b1b.svg?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2512.07783)
21
+ [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)
22
+ ![Python](https://img.shields.io/badge/python-3.9%2B-blue)
23
+
24
+ </div>
25
+
26
+
27
+ This repository contains post-trained checkpoints in extrapolation tasks.
28
+
29
+ ## 📚 Citation
30
+
31
+ If you find this work or code useful, please consider citing:
32
+
33
+ ```bibtex
34
+ @misc{zhang2025interplaypretrainingmidtrainingrl,
35
+ title={On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models},
36
+ author={Charlie Zhang and Graham Neubig and Xiang Yue},
37
+ year={2025},
38
+ eprint={2512.07783},
39
+ archivePrefix={arXiv},
40
+ primaryClass={cs.CL},
41
+ url={https://arxiv.org/abs/2512.07783},
42
+ }
43
+ ```