llama.cpp/ggml/src/ggml-cpu
SXX 77d5e9a76a
ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs (#13107)
* ggml: dynamic x86_64 feature detection for FP32 <-> FP16/BF16 conversion

* move fp converter to ggml-cpu

* Switch ggml_compute_forward_get_rows_f16/bf16 to new ggml_cpu_fp16/bf16_to_fp32
2025-04-26 16:05:31 +02:00
| Name | Last commit | Date |
|---|---|---|
| amx | ggml : upgrade init_tensor API to return a ggml_status (#11854) | 2025-02-28 14:41:47 +01:00 |
| cmake | ggml : build backends as libraries (#10256) | 2024-11-14 18:04:35 +01:00 |
| kleidiai | ggml-cpu : update KleidiAI to v1.5.0 (#12568) | 2025-03-25 13:10:18 +02:00 |
| llamafile | llamafile : ppc64le GEMV forwarding for FP32. (#12594) | 2025-03-28 09:43:22 +02:00 |
| binary-ops.cpp | cpu: de-duplicate some of the operators and refactor (ggml/1144) | 2025-03-30 08:33:31 +03:00 |
| binary-ops.h | cpu: de-duplicate some of the operators and refactor (ggml/1144) | 2025-03-30 08:33:31 +03:00 |
| CMakeLists.txt | ggml : add SSE 4.2 and x64 base variant for CPUs without AVX (#12871) | 2025-04-21 18:13:51 +02:00 |
| common.h | cpu: de-duplicate some of the operators and refactor (ggml/1144) | 2025-03-30 08:33:31 +03:00 |
| cpu-feats-x86.cpp | ggml : add SSE 4.2 and x64 base variant for CPUs without AVX (#12871) | 2025-04-21 18:13:51 +02:00 |
| ggml-cpu-aarch64.cpp | ggml : Add AVX512 implementation of GEMM - Q4_Kx8 (#12829) | 2025-04-15 09:22:36 +03:00 |
| ggml-cpu-aarch64.h | ggml : refactor online repacking (#10446) | 2024-12-07 14:37:50 +02:00 |
| ggml-cpu-hbm.cpp | ggml : refactor online repacking (#10446) | 2024-12-07 14:37:50 +02:00 |
| ggml-cpu-hbm.h | ggml : refactor online repacking (#10446) | 2024-12-07 14:37:50 +02:00 |
| ggml-cpu-impl.h | ggml-cpu-impl.h: do not redefine bool on POWER9 (#12856) | 2025-04-10 01:00:34 +02:00 |
| ggml-cpu-quants.c | ggml : riscv: add 128-bit RVV support (#12530) | 2025-03-27 08:38:34 +02:00 |
| ggml-cpu-quants.h | ggml : build backends as libraries (#10256) | 2024-11-14 18:04:35 +01:00 |
| ggml-cpu-traits.cpp | ggml : refactor online repacking (#10446) | 2024-12-07 14:37:50 +02:00 |
| ggml-cpu-traits.h | ggml : refactor online repacking (#10446) | 2024-12-07 14:37:50 +02:00 |
| ggml-cpu.c | ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs (#13107) | 2025-04-26 16:05:31 +02:00 |
| ggml-cpu.cpp | cpu: fix cpu backend's supports-op for GET_ROWS_BACK. fixes a fatal when running test-backend-ops with only the CPU backend (ggml/1190) | 2025-04-14 09:26:15 +03:00 |
| ops.cpp | ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs (#13107) | 2025-04-26 16:05:31 +02:00 |
| ops.h | ggml : Depthwise 2D convolution (ggml/1152) | 2025-04-24 17:32:47 +03:00 |
| simd-mappings.h | ggml: fix compilation error s390x (#12848) | 2025-04-11 08:20:07 +03:00 |
| unary-ops.cpp | cpu: de-duplicate some of the operators and refactor (ggml/1144) | 2025-03-30 08:33:31 +03:00 |
| unary-ops.h | cpu: de-duplicate some of the operators and refactor (ggml/1144) | 2025-03-30 08:33:31 +03:00 |
| vec.cpp | cpu: move all the operators into a separate c++ file (except mul_mat) (ggml/1167) | 2025-04-07 18:44:17 +03:00 |
| vec.h | cpu: move all the operators into a separate c++ file (except mul_mat) (ggml/1167) | 2025-04-07 18:44:17 +03:00 |