mirror of
https://github.com/ggerganov/llama.cpp.git
synced 2025-04-19 04:56:11 +00:00

* Add include files for std::min/max and std::toupper/tolower * win32: move _USE_MATH_DEFINES before includes to ensure M_PI is defined * Use GGML_RESTRICT instead of "restrict" keyword everywhere, and use "__restrict" in MSVC plain C mode * win32: only use __restrict in MSVC if C11/C17 support is not enabled --------- Co-authored-by: Marcus Groeber <Marcus.Groeber@cerence.com>
llama.cpp/examples/lookahead
Demonstration of lookahead decoding technique: