ykae
/

monarch-bert-base-mnli

Text Classification

monarch-matrices

hardware-efficient

Eval Results (legacy)

text-embeddings-inference

Model card Files Files and versions

ykae commited on Jan 25

Commit

13b6afa

·

verified ·

1 Parent(s): 16cf376

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -70,7 +70,7 @@ Measured on a single NVIDIA H100 using `torch.compile(mode="max-autotune")`.
 | **Parameters** | 85.65M | **28.98M** | 📉 **-66.2%** |
 | **Compute (GFLOPs)** | 696.5 | **232.6** | 📉 **-66.6%** |
 | **Throughput (TPS)** | 7261 | **9029** | 🚀 **+24.3%** |
-| **Latency (Batch 32)** | 4.41 ms | **3.54 ms** | ⚡ **24,6 % Faster** |
 | **Accuracy (MNLI)** | 83.62% | **78.34%** | 📉 **-5.28%** |
 ## Usage

 | **Parameters** | 85.65M | **28.98M** | 📉 **-66.2%** |
 | **Compute (GFLOPs)** | 696.5 | **232.6** | 📉 **-66.6%** |
 | **Throughput (TPS)** | 7261 | **9029** | 🚀 **+24.3%** |
+| **Latency (Batch 32)** | 4.41 ms | **3.54 ms** | ⚡ **+24.6% Faster** |
 | **Accuracy (MNLI)** | 83.62% | **78.34%** | 📉 **-5.28%** |
 ## Usage