whisper.cpp / ggml /src /ggml-hip

Commit History

HIP: bump requirement to rocm 6.1 (llama/15296)
58a3802

uvos commited on

HIP: add cmake option to enable compiler output of kernel resource usage metrics (llama/15103)
577f7e4

uvos commited on

HIP: add GGML_HIP_MMQ_MFMA option to allow disableing the MFMA path. (llama/14930)
f9dbd96

uvos commited on

HIP: disable rocwmma on gfx12 by default until rocm 7.0 (llama/14202)
f95736f

uvos commited on

CUDA/HIP: Share the same unified memory allocation logic. (llama/12934)
143cb70

David Huang commited on

HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (llama/12032)
a027c1d

David Huang commited on

CUDA: app option to compile without FlashAttention (llama/12025)
fbc5f16

JohannesGaessler commited on

HIP: force max threads per block to be 1024 (llama/11621)
f509509

fxzjshm commited on

CUDA: use mma PTX instructions for FlashAttention (llama/11583)
f328957

JohannesGaessler Diego Devesa commited on

HIP: require at least HIP 5.5
72c425b

uvos commited on

Hip: disable VMM on hip as it seams that it dosent work in some configurations (llama/11420)
2cc4df4

uvos commited on

hip : Add hipGraph and VMM support to ROCM (llama/11362)
089afa0

uvos commited on

ggml : do not define GGML_USE_CUDA when building with GGML_BACKEND_DL (llama/11211)
79f750d

rgerganov commited on

ggml : add support for dynamic loading of backends (llama/10469)
b73266f

Diego Devesa ggerganov HF Staff commited on

CUDA: remove DMMV, consolidate F16 mult mat vec (llama/10318)
e446f60

JohannesGaessler commited on

ggml : build backends as libraries (llama/10256)
3dc93f3

Diego Devesa ggerganov HF Staff R0CKSTAR commited on