llama.cpp/ggml/src/ggml-cuda/softmax.cuh at 396856b40029dd6747d2fbdb179e828683418045 - ver4a/llama.cpp - git.uncontrol.me

Johannes Gäßler 9c8dcefe17

CUDA: backwards pass for misc. ops, add tests (#11257)

* CUDA: backwards pass for misc. ops, add tests

* remove restrict from pointers

2025-01-16 16:43:38 +01:00

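#include "common.cuh"

// Upper bound on the number of threads per block used by the softmax kernels.
#define CUDA_SOFT_MAX_BLOCK_SIZE 1024

// Forward softmax: computes dst from the operands attached to it (dst->src).
void ggml_cuda_op_soft_max(ggml_backend_cuda_context & ctx, ggml_tensor * dst);
// Backward pass of softmax: computes the gradient tensor dst from dst->src.
void ggml_cuda_op_soft_max_back(ggml_backend_cuda_context & ctx, ggml_tensor * dst);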