Nemotron-Post-Training-v3 Collection • Collection of datasets used in the post-training phase of Nemotron Nano v3 • 7 items • Updated 8 days ago • 53
Nemotron-Pre-Training-Datasets Collection • Large scale pre-training datasets used in the Nemotron family of models • 11 items • Updated 8 days ago • 84
NVIDIA Nemotron v3 Collection • Open, Production-ready Enterprise Models • 6 items • Updated 1 day ago • 107
Bolmo: Byteifying the Next Generation of Language Models Paper • 2512.15586 • Published 15 days ago • 12
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory Paper • 2504.19413 • Published Apr 28, 2025 • 36
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 123
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Paper • 2512.16093 • Published 14 days ago • 88
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning Paper • 2512.20605 • Published 9 days ago • 59
Long-context post-training Collection • Resources for post-training LLMs with long-context samples • 5 items • Updated Sep 14, 2025 • 6
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language Paper • 2512.10942 • Published 21 days ago • 18
V-JEPA 2 Collection • A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 178