amico's picture

30 255

amico

amico

·

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

tencent/WeDLM-8B-Instruct

liked a model 10 days ago

Qwen/Qwen-Image-Layered

liked a model 12 days ago

google/functiongemma-270m-it

View all activity

Organizations

None yet

upvoted a collection 28 days ago

Mistral Large 3

A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated 29 days ago • 80

upvoted an article about 1 month ago

Article

Continuous batching from first principles

+1

Nov 25

•

288

upvoted an article 2 months ago

Article

Building the Open Agent Ecosystem Together: Introducing OpenEnv

+8

Oct 23

•

138

upvoted a collection 3 months ago

Spaces for Audio / Voices

605 items • Updated 22 days ago • 31

upvoted 2 articles 5 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

+10

Aug 5

•

508

Article

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

May 21

•

38

upvoted a paper 5 months ago

Hierarchical Reasoning Model

Paper • 2506.21734 • Published Jun 26 • 46

upvoted an article 7 months ago

Article

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

+2

May 23

•

170

upvoted 2 articles 8 months ago

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

+7

Apr 29

•

43

Article

The 4 Things Qwen-3’s Chat Template Teaches Us

Apr 30

•

81

upvoted a paper 8 months ago

Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time Series

Paper • 2401.03955 • Published Jan 8, 2024 • 12

upvoted 4 papers 9 months ago

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published Apr 11 • 130

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 301

JudgeLRM: Large Reasoning Models as a Judge

Paper • 2504.00050 • Published Mar 31 • 62

Multi-Token Attention

Paper • 2504.00927 • Published Apr 1 • 55

upvoted 2 articles 9 months ago

Article

Introducing Gradio's new Dataframe!

Mar 24

•

29

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

+1

Apr 15, 2024

•

191

upvoted 2 collections 11 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 550

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 7 days ago • 103

upvoted an article 11 months ago

Article

🦸🏻#7: From Agentic AI to Physical AI

Jan 11

•

7