* Add include files for std::min/max and std::toupper/tolower * win32: move _USE_MATH_DEFINES before includes to ensure M_PI is defined * Use GGML_RESTRICT instead of "restrict" keyword everywhere, and use "__restrict" in MSVC plain C mode * win32: only use __restrict in MSVC if C11/C17 support is not enabled --------- Co-authored-by: Marcus Groeber <Marcus.Groeber@cerence.com> |
||
|---|---|---|
| .. | ||
| CMakeLists.txt | ||
| parallel.cpp | ||
| README.md | ||
llama.cpp/example/parallel
Simplified simulation of serving incoming requests in parallel