Efficient Training on Multiple Consumer GPUs with RoundPipe Paper • 2604.27085 • Published 11 days ago • 40
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference Paper • 2504.05897 • Published Apr 8, 2025 • 21