llama.cpp

Prashant Vithule 4806498bf1 ggml: aarch64: implement SVE kernels for q3_K_q8_K vector dot (#11917)	2025-02-20 12:08:32 +02:00
* Added SVE Implementation for Q3_K Kernel in ggml-cpu-quants.c file
* Improved Formating of code in ggml-cpu-quants.c file
* style : minor fixes
* style : less whitespaces
* style : ptr spaceing
---------
Co-authored-by: vithulep <p.m.vithule1517@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
amx	remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797)	2024-12-12 19:02:49 +01:00
cmake	ggml : build backends as libraries (#10256)	2024-11-14 18:04:35 +01:00
llamafile	llamafile: use member variable instead of constant for iq4nlt (#11780)	2025-02-13 18:05:04 +01:00
CMakeLists.txt	ggml : fixes for AVXVNNI instruction set with MSVC and Clang (#11027)	2024-12-31 15:23:33 +01:00
cpu-feats-x86.cpp	ggml : add predefined list of CPU backend variants to build (#10626)	2024-12-04 14:45:40 +01:00
ggml-cpu-aarch64.cpp	ggml-backend : only offload from host buffers (fix) (#11124)	2025-01-07 16:11:57 +01:00
ggml-cpu-aarch64.h	ggml : refactor online repacking (#10446)	2024-12-07 14:37:50 +02:00
ggml-cpu-hbm.cpp	ggml : refactor online repacking (#10446)	2024-12-07 14:37:50 +02:00
ggml-cpu-hbm.h	ggml : refactor online repacking (#10446)	2024-12-07 14:37:50 +02:00
ggml-cpu-impl.h	ggml : optimize and build warning fix for LoongArch (#11709)	2025-02-07 09:38:31 +02:00
ggml-cpu-quants.c	ggml: aarch64: implement SVE kernels for q3_K_q8_K vector dot (#11917)	2025-02-20 12:08:32 +02:00
ggml-cpu-quants.h	ggml : build backends as libraries (#10256)	2024-11-14 18:04:35 +01:00
ggml-cpu-traits.cpp	ggml : refactor online repacking (#10446)	2024-12-07 14:37:50 +02:00
ggml-cpu-traits.h	ggml : refactor online repacking (#10446)	2024-12-07 14:37:50 +02:00
ggml-cpu.c	repo : update links to new url (#11886)	2025-02-15 16:40:57 +02:00
ggml-cpu.cpp	ggml-cpu: Fix duplicate MATMUL_INT8 (#11817)	2025-02-12 13:22:58 +01:00