Constellation Dataset: Benchmarking High-Altitude Object Detection for an Urban Intersection Paper • 2404.16944 • Published Apr 25, 2024 • 1
The Appeal and Reality of Recycling LoRAs with Adaptive Merging Paper • 2602.12323 • Published Feb 12 • 1
VGG-T^3: Offline Feed-Forward 3D Reconstruction at Scale Paper • 2602.23361 • Published about 1 month ago • 15
Test-Time Training with KV Binding Is Secretly Linear Attention Paper • 2602.21204 • Published Feb 24 • 30
LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation Paper • 2602.11451 • Published Feb 11 • 15
Quantifying the Carbon Emissions of Machine Learning Paper • 1910.09700 • Published Oct 21, 2019 • 39
The Reasoning-Creativity Trade-off: Toward Creativity-Driven Problem Solving Paper • 2601.00747 • Published Jan 2 • 20
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 124
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published Dec 23, 2025 • 18
EasyV2V: A High-quality Instruction-based Video Editing Framework Paper • 2512.16920 • Published Dec 18, 2025 • 18
supertoken Collection The initial checkpoints for the token comparison research. • 20 items • Updated May 22, 2025 • 2
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published Jun 5, 2025 • 60