Post: Mistral's new Ministral 3 models can now be run and fine-tuned locally (16GB RAM). The Ministral 3 models have vision support and best-in-class performance for their sizes.
14B Instruct GGUF: unsloth/Ministral-3-14B-Instruct-2512-GGUF
14B Reasoning GGUF: unsloth/Ministral-3-14B-Reasoning-2512-GGUF
🐱 Step-by-step guide: https://docs.unsloth.ai/new/ministral-3
All GGUF, BnB, FP8, etc. variant uploads: https://huggingface.co/collections/unsloth/ministral-3
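For the fine-tuning side, a minimal QLoRA sketch with Unsloth is shown below. The non-GGUF repo id and the LoRA hyperparameters are assumptions (inferred from the GGUF naming above), not taken from the post; follow the step-by-step guide for the exact workflow.

```python
# Minimal sketch, assuming Unsloth's standard QLoRA workflow applies to Ministral 3.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Ministral-3-14B-Instruct-2512",  # assumed repo id; see the collection link for exact names
    max_seq_length=4096,
    load_in_4bit=True,   # 4-bit quantization to keep memory use low
)

# Attach LoRA adapters so only a small fraction of the weights are trained.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    lora_dropout=0,
    bias="none",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)
```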
Domain Adaptation of Llama3-70B-Instruct through Continual Pre-Training and Model Merging: A Comprehensive Evaluation Paper • 2406.14971 • Published Jun 21, 2024
Training-Free Tokenizer Transplantation via Orthogonal Matching Pursuit Paper • 2506.06607 • Published Jun 7 • 2
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published 15 days ago • 242
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices Paper • 2512.01374 • Published 7 days ago • 78
Post: Qwen3-Next can now be run locally (30GB RAM). The models come in Thinking and Instruct versions and use a new architecture, giving roughly 10x faster inference than Qwen3-32B.
Instruct GGUF: unsloth/Qwen3-Next-80B-A3B-Instruct-GGUF
Thinking GGUF: unsloth/Qwen3-Next-80B-A3B-Thinking-GGUF
💜 Step-by-step guide: https://docs.unsloth.ai/models/qwen3-next
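For running the Instruct GGUF locally, here is a minimal sketch using llama-cpp-python (an assumption; the linked guide may use llama.cpp directly). The quant filename is hypothetical, so check the repo for the actual file names.

```python
# Minimal sketch: run the Qwen3-Next Instruct GGUF locally via llama-cpp-python.
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Download one quantized file; the exact filename/quant level below is a placeholder.
model_path = hf_hub_download(
    repo_id="unsloth/Qwen3-Next-80B-A3B-Instruct-GGUF",
    filename="Qwen3-Next-80B-A3B-Instruct-Q4_K_M.gguf",  # hypothetical filename
)

llm = Llama(model_path=model_path, n_ctx=8192)
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize the Qwen3-Next architecture."}]
)
print(out["choices"][0]["message"]["content"])
```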