AHN Collection Artificial Hippocampus Networks (AHNs) for Efficient Long-Context Modeling • 9 items • Updated Oct 9, 2025 • 6
VINCIE Collection A diffusion transformer model for in-context image generation and editing • 3 items • Updated Sep 6, 2025 • 7
Pre-training Dataset Samples Collection A collection of pre-training dataset samples of sizes 10M, 100M, and 1B tokens. Ideal for quick experimentation and ablations. • 19 items • Updated 9 days ago • 18
Pivotal Token Search Collection Pivotal Token Search (PTS) identifies tokens in a language model's generation that significantly impact the probability of success • 12 items • Updated 14 days ago • 5
Internal Coherence Maximization Collection Internal Coherence Maximization (ICM): A Label-Free, Unsupervised Training Framework for LLMs • 7 items • Updated Oct 10, 2025 • 4
Ellora Collection Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement • 12 items • Updated Oct 20, 2025 • 4
Poseidon Reasoning Collection (LLM) research, benchmarking, and STEM-focused • 2 items • Updated Jul 19, 2025 • 3
Kontext Dev LoRAs Collection Collection of Kontext Dev LoRAs by fal • 30 items • Updated Jul 27, 2025 • 31
OpenScholar_V1 Collection The set of models, index, and data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated Nov 22, 2024 • 43