nobrand commited on
Commit
376c644
·
verified ·
1 Parent(s): 258353d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -2
README.md CHANGED
@@ -110,8 +110,15 @@ print("content:", content)
110
 
111
  ## Evaluation
112
 
113
- - **KULLM-R vs Qwen3-8B**: Shows superior reasoning efficiency, shorter reasoning steps, higher readability, and better explanation quality compared to models of similar scale when evaluated on Korean reasoning tasks.
114
-
 
 
 
 
 
 
 
115
 
116
  ## Intended Use
117
 
 
110
 
111
  ## Evaluation
112
 
113
+ - Shows superior reasoning efficiency, shorter reasoning steps, higher readability, and better explanation quality compared to models of similar scale when evaluated on HRM-8K.
114
+ | Task | Score | Think Step Length | Korean Response Ratio |
115
+ |------------|:-----:|:------------------:|:---------------------:|
116
+ | GSM8k | 91.9 | 896 | 94.47 |
117
+ | KSM | 70.9 | 7979 | 80.6 |
118
+ | MATH | 95.1 | 2668 | 96.12 |
119
+ | OMNI Math | 61.9 | 7987 | 73.91 |
120
+
121
+ <img src="KULLM_R_result.png" width="1000"/>
122
 
123
  ## Intended Use
124