llama.cpp

Chen Xi ed67bcb24f [SYCL] fix multi-gpu issue on sycl (#8554). Signed-off-by: Chen Xi <xi2chen@intel.com>. Co-authored-by: Meng, Hengyu <hengyu.meng@intel.com>	2024-07-25 19:45:18 +08:00
dpct	[SYCL] fix multi-gpu issue on sycl (#8554)	2024-07-25 19:45:18 +08:00
backend.hpp	[SYCL] add concat through dim 1/2 (#8483)	2024-07-15 19:32:15 +08:00
common.cpp	llama : reorganize source code + improve CMake (#8006)	2024-06-26 18:33:02 +03:00
common.hpp	[SYCL] fix multi-gpu issue on sycl (#8554)	2024-07-25 19:45:18 +08:00
concat.cpp	[SYCL] add concat through dim 1/2 (#8483)	2024-07-15 19:32:15 +08:00
concat.hpp	[SYCL] add concat through dim 1/2 (#8483)	2024-07-15 19:32:15 +08:00
convert.cpp	[SYCL] Use multi_ptr to clean up deprecated warnings (#8256)	2024-07-10 16:10:49 +01:00
convert.hpp	llama : reorganize source code + improve CMake (#8006)	2024-06-26 18:33:02 +03:00
dequantize.hpp	Dequant improvements rebase (#8255)	2024-07-03 09:55:34 +08:00
dmmv.cpp	[SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)	2024-07-05 13:06:13 +08:00
dmmv.hpp	llama : reorganize source code + improve CMake (#8006)	2024-06-26 18:33:02 +03:00
mmq.cpp	[SYCL] Use multi_ptr to clean up deprecated warnings (#8256)	2024-07-10 16:10:49 +01:00
mmq.hpp	llama : reorganize source code + improve CMake (#8006)	2024-06-26 18:33:02 +03:00
mmvq.cpp	[SYCL] Fix the sub group size of Intel (#8106)	2024-07-02 10:16:00 +08:00
mmvq.hpp	llama : reorganize source code + improve CMake (#8006)	2024-06-26 18:33:02 +03:00
norm.cpp	[SYCL] Use multi_ptr to clean up deprecated warnings (#8256)	2024-07-10 16:10:49 +01:00
norm.hpp	[SYCL] Fix the sub group size of Intel (#8106)	2024-07-02 10:16:00 +08:00
presets.hpp	[SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)	2024-07-05 13:06:13 +08:00
rope.cpp	sycl : fix powf call in device code (#8368)	2024-07-08 14:22:41 +01:00
rope.hpp	[SYCL] Update SYCL-Rope op and Refactor (#8157)	2024-07-01 19:39:06 +08:00
softmax.cpp	[SYCL] fix scratch size of softmax (#8642)	2024-07-23 15:43:28 +08:00
softmax.hpp	[SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)	2024-07-05 13:06:13 +08:00
vecdotq.hpp	CUDA: refactor and optimize IQ MMVQ (#8215)	2024-07-01 20:39:06 +02:00