-
If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs
Paper • 2412.04144 • Published • 6 -
Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to Automation
Paper • 2410.08371 • Published • 3 -
MERGE^3: Efficient Evolutionary Merging on Consumer-grade GPUs
Paper • 2502.10436 • Published • 1 -
Mergenetic: a Simple Evolutionary Model Merging Library
Paper • 2505.11427 • Published • 14
Collections
Discover the best community collections!
Collections including paper arxiv:2403.13187
-
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect
Paper • 2403.03853 • Published • 66 -
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
Paper • 2401.15024 • Published • 74 -
Your Transformer is Secretly Linear
Paper • 2405.12250 • Published • 158 -
Yi: Open Foundation Models by 01.AI
Paper • 2403.04652 • Published • 65
-
If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs
Paper • 2412.04144 • Published • 6 -
Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to Automation
Paper • 2410.08371 • Published • 3 -
MERGE^3: Efficient Evolutionary Merging on Consumer-grade GPUs
Paper • 2502.10436 • Published • 1 -
Mergenetic: a Simple Evolutionary Model Merging Library
Paper • 2505.11427 • Published • 14
-
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect
Paper • 2403.03853 • Published • 66 -
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
Paper • 2401.15024 • Published • 74 -
Your Transformer is Secretly Linear
Paper • 2405.12250 • Published • 158 -
Yi: Open Foundation Models by 01.AI
Paper • 2403.04652 • Published • 65