JulianZhu

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

llm-semantic-router/halugate-sentinel

liked a model 11 months ago

nvidia/DeepSeek-R1-NVFP4

liked a model 11 months ago

qihoo360/TinyR1-32B-Preview

View all activity

Organizations

liked a model about 1 month ago

llm-semantic-router/halugate-sentinel

Text Classification • 0.1B • Updated Dec 4, 2025 • 1.58k • 6

liked 2 models 11 months ago

nvidia/DeepSeek-R1-NVFP4

Text Generation • 397B • Updated Jun 6, 2025 • 9.93k • 267

qihoo360/TinyR1-32B-Preview

Text Generation • 33B • Updated Sep 24, 2025 • 48 • • 331

liked 2 datasets 11 months ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT

Viewer • Updated Feb 19, 2025 • 110k • 206 • 215

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

Viewer • Updated Feb 21, 2025 • 110k • 398 • 720

liked a model 11 months ago

thu-coai/CritiqueLLM-6B

Text Generation • Updated Jun 28, 2024 • 4 • 5

upvoted a collection 11 months ago

Tifa-Deepsex-14b-CoT-V1

Collection

Tifa系列角色扮演模型思维链技术验证模型 • 3 items • Updated Feb 8, 2025 • 44

liked a model 11 months ago

ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4

Reinforcement Learning • 15B • Updated Feb 13, 2025 • 2.18k • 818

upvoted a paper 11 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28, 2025 • 123

liked a Space 12 months ago

DeepSeek-R1 WebGPU

🧠

554

Next-generation reasoning model that runs locally in-browser

liked 2 models 12 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 334k • • 13k

deepseek-ai/DeepSeek-R1-Zero

Text Generation • 685B • Updated Mar 27, 2025 • 2.72k • 941

liked 2 models about 1 year ago

MiniMaxAI/MiniMax-VL-01

Image-Text-to-Text • 456B • Updated Jul 3, 2025 • 56.7k • 282

MiniMaxAI/MiniMax-Text-01

Text Generation • 456B • Updated Jul 3, 2025 • 1.11k • 652

liked 3 models over 1 year ago

liked a Space over 1 year ago

Chattts Zero

🐢

339

Generate audio from text with tuning options

liked 2 models almost 2 years ago

mistral-community/Mixtral-8x22B-Instruct-v0.1-4bit

Text Generation • 143B • Updated Jul 1, 2024 • 27 • 11

xai-org/grok-1

Text Generation • Updated Mar 28, 2024 • 580 • 2.38k

JulianZhu

AI & ML interests

Recent Activity

Organizations

JulianZhu's activity

DeepSeek-R1 WebGPU

Chattts Zero