gte-modernbert-base GGUF

GGUF conversion of Alibaba-NLP/gte-modernbert-base for use with CrispEmbed.

gte-modernbert-base is a general-purpose English text embedding model from Alibaba's GTE family, built on ModernBERT, with long-context support (up to 8,192 tokens). It reports an MTEB score of 64.38 and a LoCo long-document retrieval score of 88.88.

Model details

  • Architecture: ModernBERT encoder-only transformer (149M params)
  • Embedding dimension: 768
  • Languages: English
  • Context length: 8,192 tokens
  • MTEB score: 64.38
  • License: Apache 2.0

Files

File                             Quantization   Size
gte-modernbert-base.gguf         F32            ~560 MB
gte-modernbert-base-q8_0.gguf    Q8_0           ~150 MB
gte-modernbert-base-q4_k.gguf    Q4_K           ~85 MB
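
The listed sizes roughly track parameter count times bits per weight. The following is a rough Python sketch of that arithmetic, assuming the 149M parameter count from the model details and the nominal ggml block layouts (Q8_0 at 8.5 bits per weight, Q4_K at 4.5). `estimate_mb` is a hypothetical helper, not part of CrispEmbed. Actual files can deviate from these estimates because of file metadata, tensors stored at a different precision, and MB versus MiB reporting.

```python
# Nominal bits per weight, including per-block scale overhead (assumed values):
#   Q8_0: 32 int8 weights + one fp16 scale = 34 bytes per 32 weights -> 8.5 bits
#   Q4_K: 144 bytes per 256-weight super-block                      -> 4.5 bits
BITS_PER_WEIGHT = {"F32": 32.0, "Q8_0": 8.5, "Q4_K": 4.5}

PARAMS = 149_000_000  # parameter count from the model details


def estimate_mb(params: int, quant: str) -> float:
    """Estimate the weight payload in MB (10^6 bytes) for a quantization type."""
    return params * BITS_PER_WEIGHT[quant] / 8 / 1e6


if __name__ == "__main__":
    for quant in BITS_PER_WEIGHT:
        print(f"{quant}: ~{estimate_mb(PARAMS, quant):.0f} MB")
```

As a rule of thumb, Q8_0 is the conservative choice when embedding quality matters most, and Q4_K trades some precision for roughly half that footprint.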

Quick Start

See CrispEmbed for full documentation and CrispASR for speech-to-text.
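
How to produce embeddings is covered in the CrispEmbed documentation and is not reproduced here. Once you have the 768-dimensional vectors, a common way to compare them is cosine similarity. The sketch below uses only the Python standard library; `cosine_similarity` and `rank_documents` are hypothetical helper names, and the short vectors in the usage section are placeholders standing in for real model output.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length, non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def rank_documents(query: Sequence[float],
                   docs: Sequence[Sequence[float]]) -> List[int]:
    """Return document indices ordered from most to least similar to the query."""
    scores = [cosine_similarity(query, d) for d in docs]
    return sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)


if __name__ == "__main__":
    # Placeholder 3-dimensional vectors; real embeddings would have 768 entries.
    query = [1.0, 0.0, 0.0]
    docs = [[0.0, 1.0, 0.0], [0.9, 0.1, 0.0], [-1.0, 0.0, 0.0]]
    print(rank_documents(query, docs))
```

If the vectors are already L2-normalized, the dot product alone gives the same ranking.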
