mirror of
https://github.com/ggerganov/llama.cpp.git
synced 2025-04-17 12:06:10 +00:00

The readme tells people to use the command line option "-t 8", causing 8 threads to be started. On systems with fewer than 8 cores, this causes a significant slowdown. Remove the option from the example command lines and use /proc/cpuinfo on Linux to determine a sensible default.