Antonio (Anthonny) Badilla-Olivas

abotresol

https://github.com/Antonio-Tresol

AI & ML interests

NLP, Deep Reinforcement Learning.

Organizations

None yet

upvoted a paper 5 months ago

StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs

Paper • 2506.03077 • Published Jun 3, 2025 • 17

upvoted 2 papers 8 months ago

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

Paper • 2505.01441 • Published Apr 28, 2025 • 39

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Paper • 2505.02707 • Published May 5, 2025 • 85

upvoted 5 papers 11 months ago

Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information

Paper • 2502.14258 • Published Feb 20, 2025 • 26

MoM: Linear Sequence Modeling with Mixture-of-Memories

Paper • 2502.13685 • Published Feb 19, 2025 • 36

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

Paper • 2502.13922 • Published Feb 19, 2025 • 27

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17, 2025 • 39

Autellix: An Efficient Serving Engine for LLM Agents as General Programs

Paper • 2502.13965 • Published Feb 19, 2025 • 19