
Kaizhao Liang PRO

kz919

https://kyleliang919.github.io/

AI & ML interests

None yet

Recent Activity

upvoted an article 8 days ago

PaliGemma – Google's Cutting-Edge Open Vision Language Model

updated a Space about 1 month ago

kz919/trl-lora-without-regret

published a Space about 1 month ago

kz919/trl-lora-without-regret


Organizations

upvoted an article 8 days ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14, 2024 • 277

upvoted 3 papers about 2 months ago

Cautious Weight Decay

Paper • 2510.12402 • Published Oct 14 • 5

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6 • 123

Artificial Hippocampus Networks for Efficient Long-Context Modeling

Paper • 2510.07318 • Published Oct 8 • 30

upvoted a paper 3 months ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28 • 116

upvoted a changelog 4 months ago

Changelog

Introducing HF Jobs: Run scalable compute jobs on Hugging Face

Jul 30 • 201

upvoted an article 4 months ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14, 2024

upvoted a paper 4 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 166

upvoted an article 5 months ago

Article

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

Jul 18

upvoted a paper 5 months ago

Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation

Paper • 2506.19852 • Published Jun 24 • 41

upvoted a paper 7 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 124

upvoted an article 9 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4 • 1.31k

upvoted 3 papers 10 months ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 50

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 151

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published Jan 30 • 29

upvoted 2 articles 10 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28 • 887

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28 • 490

upvoted a paper 11 months ago

Proximal Policy Optimization Algorithms

Paper • 1707.06347 • Published Jul 20, 2017 • 11

upvoted 2 papers 12 months ago

Structured 3D Latents for Scalable and Versatile 3D Generation

Paper • 2412.01506 • Published Dec 2, 2024 • 84

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Paper • 2306.13649 • Published Jun 23, 2023 • 28
