DiRL: An Efficient Post-Training Framework for Diffusion Language Models Paper • 2512.22234 • Published 8 days ago • 16
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs Paper • 2512.07525 • Published 23 days ago • 55
Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction Paper • 2508.02558 • Published Aug 4 • 10
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs Paper • 2506.14429 • Published Jun 17 • 44
Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache Paper • 2506.11886 • Published Jun 13 • 20