Spaces:
Sleeping
Sleeping
Commit History
HIP: add cmake option to enable compiler output of kernel resource usage metrics (llama/15103) 577f7e4
HIP: add GGML_HIP_MMQ_MFMA option to allow disableing the MFMA path. (llama/14930) f9dbd96
HIP: disable rocwmma on gfx12 by default until rocm 7.0 (llama/14202) f95736f
uvos commited on
CUDA/HIP: Share the same unified memory allocation logic. (llama/12934) 143cb70
David Huang commited on
HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (llama/12032) a027c1d
David Huang commited on
CUDA: app option to compile without FlashAttention (llama/12025) fbc5f16
HIP: force max threads per block to be 1024 (llama/11621) f509509
fxzjshm commited on
CUDA: use mma PTX instructions for FlashAttention (llama/11583) f328957
HIP: require at least HIP 5.5 72c425b
uvos commited on
Hip: disable VMM on hip as it seams that it dosent work in some configurations (llama/11420) 2cc4df4
uvos commited on
hip : Add hipGraph and VMM support to ROCM (llama/11362) 089afa0
uvos commited on