Commit History
sync : ggml (new ops, new backend, etc) (#1602)
895e87a
unverified
ggml : re-enable blas for src0 != F32 (#1583)
87987de
unverified
sync : ggml (ggml-alloc + linker + gguf fixes) (#1501)
58507b9
unverified
whisper : add full CUDA and Metal offloading (#1472)
da4acca
unverified
ggml : fix MIN / MAX macro re-definition
1344fc4
unverified
sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422)
7006035
unverified
sync : ggml (const correctness)
4ce2d25
unverified
metal : add F32 support + update bench output
02d7878
unverified
whisper : Metal and ggml-alloc support (#1270)
714ee6b
unverified
whisper : fix bench regression + fix performance when using CPU BLAS (#1275)
abbf5f2
unverified
sync : ggml (HBM + Metal + style) (#1264)
88deeba
unverified
build : do not use _GNU_SOURCE gratuitously (#1129)
beefa34
unverified
Przemysław Pawełczyk
committed on
ggml : posixify pagesize (#1251)
4902c26
unverified
Przemysław Pawełczyk
committed on
ggml : sync latest llama.cpp (view_src + alloc improvements) (#1247)
8bb66c1
unverified
ggml : sync (ggml-alloc, GPU, eps, etc.) (#1220)
d41ba35
unverified
ggml : fix compilation errors incurred by -Werror (#1227)
45ef7b5
unverified
ChangSeok Oh
committed on
ggml : fix compiling when SSE3 is available but not SSSE3 (#1210)
b7995b7
unverified
Przemysław Pawełczyk
committed on
ggml : detect SSSE3 (#1211)
82a619c
unverified
Przemysław Pawełczyk
committed on
ci : more platforms coverage (#1101)
c4448fa
unverified
alonfaraj
Alon Faraj
committed on
Revert "ggml : do not use _GNU_SOURCE gratuitously (#1027)"
1e5ddb0
unverified
ggml : sync latest repo (mostly refactoring changes)
d97fd69
unverified
ggml : do not use _GNU_SOURCE gratuitously (#1027)
3a69cdf
unverified
Przemysław Pawełczyk
committed on
ggml : sync latest ggml lib
a100c9a
unverified
ggml : update WASM SIMD
84c1cc7
unverified
ggml : sync latest ggml repo
6ee8740
unverified
ggml : add AVX dot products
7e7b11c
unverified
ggml : sync latest ggml
803e1be
unverified
ggml : fix 32-bit ARM build + quantization
87ee234
ggml : sync ggml (clBLAST + tensor names)
f50d3b3
unverified
ggml : fix UB (int << 31)
8253b98
unverified
whisper : add integer quantization support (#540)
a5f8f3c
unverified
ggml : fix WASM build
c3d7603
unverified
ggml : fix 32-bit ARM NEON (#836)
5fa72ca
unverified
ggml : use vzip instead of vuzp for consistency
741db99
unverified
ggml : fix WASM build
ada8c2d
unverified
ggml : sync with ggml repo (warning fixes + asserts)
caf2759
unverified
ggml : sync latest ggml + llama.cpp updates (quantization)
ede1268
unverified
ggml, ci : fix build on whisper.android (ARM_NEON) + add CI (#764)
dedf05b
unverified
ggml : sync latest ggml
7b8292f
unverified
ggml : fix q4_1 dot product types (#759)
984a856
unverified
ggml : sync latest changes from ggml and llama.cpp
3bd52ce
unverified
ggml : fix WASM build
70332a0
unverified
ggml : backport llama.cpp updates (close #709)
bf6b4f8
unverified
talk-llama : add new example + sync ggml from llama.cpp (#664)
a8c74e6
unverified
whisper : reduce memory usage during inference (#431)
3aa9e6c
unverified
whisper : PPC64 big-endian support (#398)
239569b
unverified
fitzsim
committed on
bench : add memcpy and ggml_mul_mat benchmarks
a660ed9
unverified
ggml : remove obsolete zeroing + comment fixes (#390)
9c35c0d
unverified
ggml : correct behaviour of ggml_vec_sum_f32 (#390)
ffffc6e
unverified
Abitofevrything
committed on