Update README.md
Browse files
README.md
CHANGED
|
@@ -110,8 +110,15 @@ print("content:", content)
|
|
| 110 |
|
| 111 |
## Evaluation
|
| 112 |
|
| 113 |
-
-
|
| 114 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 115 |
|
| 116 |
## Intended Use
|
| 117 |
|
|
|
|
| 110 |
|
| 111 |
## Evaluation
|
| 112 |
|
| 113 |
+
- Shows superior reasoning efficiency, shorter reasoning steps, higher readability, and better explanation quality compared to models of similar scale when evaluated on HRM-8K.
|
| 114 |
+
| Task | Score | Think Step Length | Korean Response Ratio |
|
| 115 |
+
|------------|:-----:|:------------------:|:---------------------:|
|
| 116 |
+
| GSM8k | 91.9 | 896 | 94.47 |
|
| 117 |
+
| KSM | 70.9 | 7979 | 80.6 |
|
| 118 |
+
| MATH | 95.1 | 2668 | 96.12 |
|
| 119 |
+
| OMNI Math | 61.9 | 7987 | 73.91 |
|
| 120 |
+
|
| 121 |
+
<img src="KULLM_R_result.png" width="1000"/>
|
| 122 |
|
| 123 |
## Intended Use
|
| 124 |
|