OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations Paper • 2412.07626 • Published Dec 10, 2024 • 27
τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains Paper • 2406.12045 • Published Jun 17, 2024 • 9
GTE models Collection General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 21 items • Updated Jan 21 • 32
GLiNER Collection Knowledgator GLiNER models for information extraction • 8 items • Updated Aug 19 • 12
GLiNER-BioMed Collection Collection of high-quality GLiNER models tuned for working with biomedical data • 7 items • Updated Apr 2 • 7
GLiNER-biomed: A Suite of Efficient Models for Open Biomedical Named Entity Recognition Paper • 2504.00676 • Published Apr 1 • 5
view article Article Multi-Label Classification Model From Scratch: Step-by-Step Tutorial Jan 8, 2024 • 49
PLAID: An Efficient Engine for Late Interaction Retrieval Paper • 2205.09707 • Published May 19, 2022 • 2
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model Mar 10 • 146
RAFT: Adapting Language Model to Domain Specific RAG Paper • 2403.10131 • Published Mar 15, 2024 • 72
Common 7B Language Models Already Possess Strong Math Capabilities Paper • 2403.04706 • Published Mar 7, 2024 • 20
BitNet: Scaling 1-bit Transformers for Large Language Models Paper • 2310.11453 • Published Oct 17, 2023 • 105
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated Jul 10 • 345