In a Training Loop 🔄

3 31 47

Karsten Kuhnke PRO

mindchain

https://www.linkedin.com/in/jankarstenkuhnke/

AI & ML interests

Mechanistic Interpretability, Sparse Autoencoders, JumpReLU, Reward Modeling, RLHF, AI Alignment, Function Calling, Gemma, Nemotron

Recent Activity

liked a model about 3 hours ago

tencent/WeDLM-8B-Instruct

updated a Space about 4 hours ago

mindchain/react-blog

published a Space about 4 hours ago

mindchain/react-blog

View all activity

Organizations

upvoted 4 papers about 7 hours ago

Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

Paper • 2504.19413 • Published Apr 28 • 36

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20 • 122

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 122

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published 12 days ago • 84

upvoted an article about 7 hours ago

Article

Diffusers welcomes FLUX-2

Nov 25

•

166

upvoted a paper about 8 hours ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published 6 days ago • 54

upvoted 3 collections about 8 hours ago

upvoted a paper about 9 hours ago

VL-JEPA: Joint Embedding Predictive Architecture for Vision-language

Paper • 2512.10942 • Published 18 days ago • 11

upvoted a collection about 9 hours ago

V-JEPA 2

Collection

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13 • 177

upvoted 5 articles 1 day ago

Article

Codex is Open Sourcing AI models

19 days ago

•

Article

New in llama.cpp: Model Management

18 days ago

•

100

Article

AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems

6 days ago

•

Article

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

12 days ago

•

Article

We Got Claude to Fine-Tune an Open Source LLM

26 days ago

•

546

upvoted 4 collections 1 day ago

Google Gemma Scope 2 - Neuronpedia

Collection

Google Gemma Scope 2: JumpReLU SAEs for Gemma 2 interpretability. 270M PT/IT, 1B PT variants. Neuronpedia integration. Mechanistic analysis. • 11 items • Updated 1 day ago • 1

SigLIP2

Collection

36 items • Updated Jul 10 • 101

Gemma 3n

Collection

4 items • Updated Jul 10 • 253

EmbeddingGemma

Collection

3 items • Updated Sep 11 • 105

Karsten Kuhnke PRO

AI & ML interests

Recent Activity

Organizations

mindchain's activity

Diffusers welcomes FLUX-2

Codex is Open Sourcing AI models

New in llama.cpp: Model Management

AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

We Got Claude to Fine-Tune an Open Source LLM