view article Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval +1 Mar 22, 2024 • 123
view article Article MLA: Redefining KV-Cache Through Low-Rank Projections and On-Demand Decompression Feb 4, 2025 • 19