Commits · natasa365/whisper.cpp

examples : add HEAPU8 to all of the exported runtime methods (#3134)

a4fc5fb
unverified

Enes Grahovac committed on May 10, 2025

wasm : add note about worker.js file generation [no ci] (#3133)

c6a619d
unverified

danbev committed on May 9, 2025

whisper : deprecate WHISPER_CCACHE CMake option (#3131)

c4aa3ee
unverified

danbev committed on May 9, 2025

stream.wasm : add HEAPU8 to exported runtime methods (#3130)

df2c5e7
unverified

danbev committed on May 8, 2025

sync : ggml

87f0773

ggerganov committed on May 7, 2025

cuda : remove nrows_x in mul_mat_q_process_tile (llama/13325)

0fd6120

R0CKSTAR committed on May 7, 2025

CUDA: mix virt/real CUDA archs for GGML_NATIVE=OFF (llama/13135)

9fb68a1

JohannesGaessler committed on May 6, 2025

SYCL: Disable reorder optimize by default and stop setting tensor extras when optimize is disabled (llama/13254)

53abb97

Akarshan Biswas committed on May 6, 2025

CUDA: fix bad asserts for partial offload (llama/13337)

23e676b

JohannesGaessler committed on May 6, 2025

CUDA: fix --split-mode row for MMQ (llama/13323)

1136116

JohannesGaessler committed on May 6, 2025

CUDA: fix logic for clearing padding with -ngl 0 (llama/13320)

c3e51a2

JohannesGaessler committed on May 5, 2025

SYCL: Disable mul_mat kernels for noncontiguous tensor b (llama/13308)

3628417

Akarshan Biswas committed on May 5, 2025

rpc : use backend registry, support dl backends (llama/13304)

0286805

Diego Devesa committed on May 4, 2025

ggml : activate s390x simd for Q3_K (llama/13301)

1bfe279

taronaeo committed on May 4, 2025

CUDA: fix race condition in MMQ stream-k fixup (llama/13299)

160742f

JohannesGaessler committed on May 4, 2025

CUDA: fix race condition in MMQ ids_dst (llama/13294)

d249810

JohannesGaessler committed on May 4, 2025

vulkan: Additional type support for unary, binary, and copy (llama/13266)

b9cb11e

jeffbolznv committed on May 4, 2025

ci : add bindings-java jar artifact to release (#3126)

03b0716
unverified

danbev committed on May 7, 2025

cli : avoid std::exchange

ba2be5c

ggerganov committed on May 7, 2025

sync : ggml

27f99b0

ggerganov committed on May 7, 2025

vulkan : fix lint (llama/0)

49be727

ggerganov committed on May 2, 2025

ggml : Enable MMA for BF16 in llamafile_sgemm (llama/13148)

7da5bcc

shalinib committed on May 2, 2025

rpc : avoid uninitialized memory in serialize_tensor (llama/13210)

31cad24

Justin Santa Barbara committed on May 1, 2025

ggml: Don't assert fail when tensor data changes (llama/13222)

af16d74

Jesse Gross committed on May 1, 2025

build : fix build info on windows (llama/13239)

415b9fc

Diego Devesa committed on May 1, 2025

vulkan: Add bfloat16 support (llama/12554)

b21f8a1

jeffbolznv committed on May 1, 2025

vulkan: Handle src1 batch dimension in non-contiguous mat-vec-mul shader (llama/13191)

710fdcf

jeffbolznv committed on May 1, 2025

vulkan : kernels for depthwise 2D convolution (CONV_2D_DW) (ggml/1204)

43d9f3e

Acly committed on May 2, 2025

ci : zip windows artifacts for release uploading (#3124)

3dbef6c
unverified

danbev committed on May 7, 2025

ci : add zip extension to xcframework artifact name (#3120)

a8a2519
unverified

danbev committed on May 7, 2025

whisper: remove MSVC warnings pragmas (#3090)

e0d130c
unverified

danbev committed on May 5, 2025

server: update abort mechanism to handle HTTP connection closure (#3112)

02b25fa
unverified

sachaarbonel committed on May 5, 2025

cli : support "-" for stdout like stdin (#3050)

7e3c27c
unverified

Daniel Tang committed on May 5, 2025

docs : Update cli documentation (#3102)

8566207
unverified

arpitjain96 committed on May 2, 2025

cmake : removed stdc++fs (#3097)

e715962
unverified

JaredTweed committed on May 2, 2025

server : update httplib.h to version 0.20.0 (#3101)

238f652
unverified

sachaarbonel committed on May 2, 2025

ruby : refine HTTP cache feature (#3109)

f1d4a23
unverified

KitaitiMakoto committed on May 1, 2025

talk-llama : sync llama.cpp

05fda4a

ggerganov committed on May 1, 2025

sync : ggml

6d29e32

ggerganov committed on May 1, 2025

CUDA: batched+noncont MMQ, refactor bs>1 MoE code (llama/13199)

a867083

JohannesGaessler committed on Apr 30, 2025

vulkan: use uint array index to avoid glslang bug (llama/13193)

fd2d86d

jeffbolznv committed on Apr 30, 2025

ggml : fix ppc64le build (llama/13176)

07ec79f

shalinib committed on Apr 30, 2025

feat(ggml-cpu): enable z17 compile (llama/13182)

10f7d18

Aaron Teo committed on Apr 30, 2025

CUDA: fix non-cont. inputs for batched mat mul (llama/13155)

d13b876

JohannesGaessler committed on Apr 29, 2025

fix(rpc): Improve input validation and error handling (llama/13069)

9e9f2fe

Ville Vesilehto committed on Apr 28, 2025

SYCL: Add all missing unary kernels (llama/13074)

d2ce872

Akarshan Biswas committed on Apr 28, 2025

musa: fix typo in cc control (llama/13144)

5fb7320

R0CKSTAR committed on Apr 28, 2025

CUDA: fix q_nope_absorbed prec for DS 2 Lite f16 (llama/13137)

e9c9d4b

JohannesGaessler committed on Apr 28, 2025

musa: fix build warning (llama/13129)

3436ba4

R0CKSTAR committed on Apr 27, 2025

ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs (llama/13107)

c47823e

sxx-404 committed on Apr 26, 2025
