-
CLEX: Continuous Length Extrapolation for Large Language Models
Paper • 2310.16450 • Published • 10 -
E^2-LLM: Efficient and Extreme Length Extension of Large Language Models
Paper • 2401.06951 • Published • 26 -
Data Engineering for Scaling Language Models to 128K Context
Paper • 2402.10171 • Published • 25
Juan Herrera
juampahc
AI & ML interests
None yet
Recent Activity
liked a model 2 days ago
OuteAI/Llama-OuteTTS-1.0-1B upvoted a collection 7 days ago
NVIDIA Nemotron v3 liked a model 27 days ago
xkos/Qwen3-TTS-12Hz-1.7B-ONNX