llama.cpp

History

SXX 77d5e9a76a ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs (#13107 ) * ggml: dynamic x86_64 feature detection for FP32 <-> FP16/BF16 conversion * move fp converter to ggml-cpu * Switch ggml_compute_forward_get_rows_f16/bf16 to new ggml_cpu_fp16/bf16_to_fp32		2025-04-26 16:05:31 +02:00
..
amx	ggml : upgrade init_tensor API to return a ggml_status (#11854 )	2025-02-28 14:41:47 +01:00
cmake	ggml : build backends as libraries (#10256 )	2024-11-14 18:04:35 +01:00
kleidiai	ggml-cpu : update KleidiAI to v1.5.0 (#12568 )	2025-03-25 13:10:18 +02:00
llamafile	llamafile : ppc64le GEMV forwarding for FP32. (#12594 )	2025-03-28 09:43:22 +02:00
binary-ops.cpp	cpu: de-duplicate some of the operators and refactor (ggml/1144)	2025-03-30 08:33:31 +03:00
binary-ops.h	cpu: de-duplicate some of the operators and refactor (ggml/1144)	2025-03-30 08:33:31 +03:00
CMakeLists.txt	ggml : add SSE 4.2 and x64 base variant for CPUs without AVX (#12871 )	2025-04-21 18:13:51 +02:00
common.h	cpu: de-duplicate some of the operators and refactor (ggml/1144)	2025-03-30 08:33:31 +03:00
cpu-feats-x86.cpp	ggml : add SSE 4.2 and x64 base variant for CPUs without AVX (#12871 )	2025-04-21 18:13:51 +02:00
ggml-cpu-aarch64.cpp	ggml : Add AVX512 implementation of GEMM - Q4_Kx8 (#12829 )	2025-04-15 09:22:36 +03:00
ggml-cpu-aarch64.h	ggml : refactor online repacking (#10446 )	2024-12-07 14:37:50 +02:00
ggml-cpu-hbm.cpp	ggml : refactor online repacking (#10446 )	2024-12-07 14:37:50 +02:00
ggml-cpu-hbm.h	ggml : refactor online repacking (#10446 )	2024-12-07 14:37:50 +02:00
ggml-cpu-impl.h	ggml-cpu-impl.h: do not redefine bool on POWER9 (#12856 )	2025-04-10 01:00:34 +02:00
ggml-cpu-quants.c	ggml : riscv: add 128-bit RVV support (#12530 )	2025-03-27 08:38:34 +02:00
ggml-cpu-quants.h	ggml : build backends as libraries (#10256 )	2024-11-14 18:04:35 +01:00
ggml-cpu-traits.cpp	ggml : refactor online repacking (#10446 )	2024-12-07 14:37:50 +02:00
ggml-cpu-traits.h	ggml : refactor online repacking (#10446 )	2024-12-07 14:37:50 +02:00
ggml-cpu.c	ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs (#13107 )	2025-04-26 16:05:31 +02:00
ggml-cpu.cpp	cpu: fix cpu backend's supports-op for GET_ROWS_BACK. fixes a fatal when running test-backend-ops with only the CPU backend (ggml/1190)	2025-04-14 09:26:15 +03:00
ops.cpp	ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs (#13107 )	2025-04-26 16:05:31 +02:00
ops.h	ggml : Depthwise 2D convolution (ggml/1152)	2025-04-24 17:32:47 +03:00
simd-mappings.h	ggml: fix compilation error s390x (#12848 )	2025-04-11 08:20:07 +03:00
unary-ops.cpp	cpu: de-duplicate some of the operators and refactor (ggml/1144)	2025-03-30 08:33:31 +03:00
unary-ops.h	cpu: de-duplicate some of the operators and refactor (ggml/1144)	2025-03-30 08:33:31 +03:00
vec.cpp	cpu: move all the operators into a separate c++ file (except mul_mat) (ggml/1167)	2025-04-07 18:44:17 +03:00
vec.h	cpu: move all the operators into a separate c++ file (except mul_mat) (ggml/1167)	2025-04-07 18:44:17 +03:00