Reset schedule earlier to allow overlap with ggml graph computation on device (llama/6933) 3a8eea8 agray3 commited on Apr 26, 2024
ggml : fix ggml_backend_cpu_supports_op() for CPY (llama/0) d645791 ggerganov commited on Apr 21, 2024
llama : add pipeline parallelism support (llama/6017) b5bb3f3 unverified slaren compilade ggerganov commited on Mar 13, 2024
ggml : introduce ggml_status (ggml/750) 151c676 unverified Michael Podvitskiy slaren ggerganov commited on Mar 4, 2024
ci : add an option to fail on compile warning (llama/3952) b5903fc unverified abastola ggerganov commited on Feb 17, 2024
Early return for zero size calls to get_tensor. (llama/5482) f1f5c00 unverified AT ggerganov commited on Feb 13, 2024
ggml : add abort_callback for cpu backend (ggml/725) a8ea91b unverified Michael Podvitskiy commited on Feb 9, 2024
Nomic Vulkan backend (llama/4456) f5fd92d unverified Cebtenzzre niansa manyoso apage43 ToKiNoBug ggerganov slaren commited on Jan 29, 2024
ggml : add Vulkan backend (llama/2059) 5a97aba unverified OccamRazor SlyEcho Concedo slaren ggerganov commited on Jan 28, 2024
ggml : add unified SYCL backend for Intel GPUs (llama/2690) 01169e0 unverified Abhilash Majumder jianyuzh KevinLy hengyu ggerganov commited on Jan 28, 2024
cuda : fix tensor size calculation for non-split buffer (llama/5145) 8f3eb65 unverified slaren commited on Jan 26, 2024
llama : run all KQV ops on the CPU with no KV offload (llama/5049) 97ce95c unverified slaren commited on Jan 20, 2024
ggml : add IQ2 to test-backend-ops + refactoring (llama/4990) 227f2ae unverified ggerganov commited on Jan 17, 2024
ggml : introduce GGML_CALL function annotation (llama/4850) 7815f68 unverified jartine commited on Jan 16, 2024
llama : ggml-backend integration (llama/4766) 362430b unverified slaren ggerganov JohannesGaessler commited on Jan 12, 2024
ggml : add error handling to graph_compute (#1714) 92f24ee unverified finnvoorhees commited on Jan 3, 2024
sync : ggml (VMM, sync-ggml-am, dotprod ARM fixes, CUDA fixes) (#1691) 919a447 unverified ggerganov commited on Dec 29, 2023
sync : ggml (ggml_scale, ggml_row_size, etc.) (#1677) aa86ade unverified ggerganov commited on Dec 22, 2023
sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422) 7006035 unverified ggerganov Chris Raethke commited on Nov 3, 2023