mirror of
https://github.com/ggerganov/llama.cpp.git
synced 2025-04-23 20:36:04 +00:00

* convert_hf : faster lazy safetensors This makes '--dry-run' much, much faster. * convert_hf : fix memory leak in lazy MoE conversion The '_lazy' queue was sometimes self-referential, which caused reference cycles of objects old enough to avoid garbage collection until potential memory exhaustion.