nobrand
/

KULLM-R

Text Generation

text-generation-inference

Model card Files Files and versions

nobrand commited on Aug 6, 2025

Commit

376c644

·

verified ·

1 Parent(s): 258353d

Update README.md

Files changed (1) hide show

README.md +9 -2

README.md CHANGED Viewed

@@ -110,8 +110,15 @@ print("content:", content)
 ## Evaluation
-- **KULLM-R vs Qwen3-8B**: Shows superior reasoning efficiency, shorter reasoning steps, higher readability, and better explanation quality compared to models of similar scale when evaluated on Korean reasoning tasks.
 ## Intended Use

 ## Evaluation
+- Shows superior reasoning efficiency, shorter reasoning steps, higher readability, and better explanation quality compared to models of similar scale when evaluated on HRM-8K.
+| Task       | Score | Think Step Length  | Korean Response Ratio |
+|------------|:-----:|:------------------:|:---------------------:|
+| GSM8k      |   91.9     |   896      |    94.47    |
+| KSM        |   70.9     |   7979     |   80.6      |
+| MATH       |   95.1     |   2668     |   96.12     |
+| OMNI Math  |   61.9     |   7987     |   73.91     |
+<img src="KULLM_R_result.png" width="1000"/>
 ## Intended Use