HypeNet Collection: The models for the paper "Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts" • 1 item • Updated 1 day ago
Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts Paper • 2601.22156 • Published Jan 29 • 14
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published Oct 30, 2025 • 132
StateX: Enhancing RNN Recall via Post-training State Expansion Paper • 2509.22630 • Published Sep 26, 2025 • 4