llama.cpp/CONTRIBUTING.md

# Pull requests (for contributors)

- llama.cpp uses the ggml tensor library for model evaluation. If you are unfamiliar with ggml, consider taking a look at the [examples in the ggml repository](https://github.com/ggml-org/ggml/tree/master/examples/). [simple](https://github.com/ggml-org/ggml/tree/master/examples/simple) shows the bare minimum for using ggml. [gpt-2](https://github.com/ggml-org/ggml/tree/master/examples/gpt-2) has minimal implementations for language model inference using GPT-2. [mnist](https://github.com/ggml-org/ggml/tree/master/examples/mnist) demonstrates how to train and evaluate a simple image classifier
- Test your changes:
    - Execute [the full CI locally on your machine](ci/README.md) before publishing
    - Verify that the perplexity and the performance are not affected negatively by your changes (use `llama-perplexity` and `llama-bench`)
    - If you modified the `ggml` source, run the `test-backend-ops` tool to check whether different backend implementations of the `ggml` operators produce consistent results (this requires access to at least two different `ggml` backends)
    - If you modified a `ggml` operator or added a new one, add the corresponding test cases to `test-backend-ops`
- Create separate PRs for each feature or fix. Avoid combining unrelated changes in a single PR
- Consider allowing write access to your branch for faster reviews, as reviewers can push commits directly
- If your PR becomes stale, don't hesitate to ping the maintainers in the comments

# Pull requests (for collaborators)

- Squash-merge PRs
- Use the following format for the squashed commit title: `<module> : <commit title> (#<issue_number>)`. For example: `utils : fix typo in utils.py (#1234)`
- Optionally pick a `<module>` from here: https://github.com/ggml-org/llama.cpp/wiki/Modules
- Consider adding yourself to [CODEOWNERS](CODEOWNERS)

# Coding guidelines

- Avoid adding third-party dependencies, extra files, extra headers, etc.
- Always consider cross-compatibility with other operating systems and architectures
- Avoid fancy-looking modern STL constructs, use basic `for` loops, avoid templates, keep it simple
- Vertical alignment makes things more readable and easier to batch edit
- Clean-up any trailing whitespaces, use 4 spaces for indentation, brackets on the same line, `void * ptr`, `int & a`
- Use sized integer types such as `int32_t` in the public API, e.g. `size_t` may also be appropriate for allocation sizes or byte offsets
- Declare structs with `struct foo {}` instead of `typedef struct foo {} foo`
    - In C++ code omit optional `struct` and `enum` keyword whenever they are not necessary
    ```cpp
    // OK
    llama_context * ctx;
    const llama_rope_type rope_type;

    // not OK
    struct llama_context * ctx;
    const enum llama_rope_type rope_type;
    ```

    _(NOTE: this guideline is yet to be applied to the `llama.cpp` codebase. New code should follow this guideline.)_

- Try to follow the existing patterns in the code (indentation, spaces, etc.). In case of doubt use `clang-format` (from clang-tools v15+) to format the added code
- For anything not covered in the current guidelines, refer to the [C++ Core Guidelines](https://isocpp.github.io/CppCoreGuidelines/CppCoreGuidelines)
- Tensors store data in row-major order. We refer to dimension 0 as columns, 1 as rows, 2 as matrices
- Matrix multiplication is unconventional: [`C = ggml_mul_mat(ctx, A, B)`](https://github.com/ggml-org/llama.cpp/blob/880e352277fc017df4d5794f0c21c44e1eae2b84/ggml.h#L1058-L1064) means $C^T = A B^T \Leftrightarrow C = B A^T.$

![matmul](media/matmul.png)

# Naming guidelines

- Use `snake_case` for function, variable and type names
- Naming usually optimizes for longest common prefix (see https://github.com/ggml-org/ggml/pull/302#discussion_r1243240963)

    ```cpp
    // not OK
    int small_number;
    int big_number;

    // OK
    int number_small;
    int number_big;
    ```

- Enum values are always in upper case and prefixed with the enum name

    ```cpp
    enum llama_vocab_type {
        LLAMA_VOCAB_TYPE_NONE = 0,
        LLAMA_VOCAB_TYPE_SPM  = 1,
        LLAMA_VOCAB_TYPE_BPE  = 2,
        LLAMA_VOCAB_TYPE_WPM  = 3,
        LLAMA_VOCAB_TYPE_UGM  = 4,
        LLAMA_VOCAB_TYPE_RWKV = 5,
    };
    ```

- The general naming pattern is `<class>_<method>`, with `<method>` being `<action>_<noun>`

    ```cpp
    llama_model_init();           // class: "llama_model",         method: "init"
    llama_sampler_chain_remove(); // class: "llama_sampler_chain", method: "remove"
    llama_sampler_get_seed();     // class: "llama_sampler",       method: "get_seed"
    llama_set_embeddings();       // class: "llama_context",       method: "set_embeddings"
    llama_n_threads();            // class: "llama_context",       method: "n_threads"
    llama_adapter_lora_free();    // class: "llama_adapter_lora",  method: "free"
    ```

    - The `get` `<action>` can be omitted
    - The `<noun>` can be omitted if not necessary
    - The `_context` suffix of the `<class>` is optional. Use it to disambiguate symbols when needed
    - Use `init`/`free` for constructor/destructor `<action>`

- Use the `_t` suffix when a type is supposed to be opaque to the user - it's not relevant to them if it is a struct or anything else

    ```cpp
    typedef struct llama_context * llama_context_t;

    enum llama_pooling_type llama_pooling_type(const llama_context_t ctx);
    ```

    _(NOTE: this guideline is yet to be applied to the `llama.cpp` codebase. New code should follow this guideline)_

- C/C++ filenames are all lowercase with dashes. Headers use the `.h` extension. Source files use the `.c` or `.cpp` extension
- Python filenames are all lowercase with underscores

- _(TODO: abbreviations usage)_

# Preprocessor directives

- _(TODO: add guidelines with examples and apply them to the codebase)_

    ```cpp
    #ifdef FOO
    #endif // FOO
    ```

# Documentation

- Documentation is a community effort
- When you need to look into the source code to figure out how to use an API consider adding a short summary to the header file for future reference
- When you notice incorrect or outdated documentation, please update it

# Resources

The Github issues, PRs and discussions contain a lot of information that can be useful to get familiar with the codebase. For convenience, some of the more important information is referenced from Github projects:

https://github.com/ggml-org/llama.cpp/projects
contrib : clarify PR squashing + module names (#8630) * contrib : clarify PR squashing * contrib : fix typo + add list of modules 2024-07-23 11:28:38 +03:00			`# Pull requests (for contributors)`
docs: Added initial PR template with directions for doc only changes and squash merges [no ci] (#7700) This commit adds pull_request_template.md and CONTRIBUTING.md . It focuses on explaining to contributors the need to rate PR complexity level, when to add [no ci] and how to format PR title and descriptions. Co-authored-by: Brian <mofosyne@gmail.com> Co-authored-by: compilade <git@compilade.net> 2024-06-09 11:24:29 -04:00
doc: add links to ggml examples [no ci] (#11958) 2025-02-19 20:45:17 +01:00			- llama.cpp uses the ggml tensor library for model evaluation. If you are unfamiliar with ggml, consider taking a look at the [examples in the ggml repository](https://github.com/ggml-org/ggml/tree/master/examples/). [simple](https://github.com/ggml-org/ggml/tree/master/examples/simple) shows the bare minimum for using ggml. [gpt-2](https://github.com/ggml-org/ggml/tree/master/examples/gpt-2) has minimal implementations for language model inference using GPT-2. [mnist](https://github.com/ggml-org/ggml/tree/master/examples/mnist) demonstrates how to train and evaluate a simple image classifier
contributing : update guidelines (#8316) 2024-07-05 09:09:47 +03:00			`- Test your changes:`
contrib : add naming guidelines (#11177) * contrib : add naming guidelines * contrib : expand naming guidelines [no ci] * contrib : cont [no ci] * contrib : add `_t` suffix guideline [no ci] * contrib : cont [no ci] * minor [no ci] * contrib : move coding guidelines to correct section [no ci] * contrib : minor reword coding guidelines [no ci] * contrib : add TODO for preprocessor directives [no ci] * contrib : expand [no ci] * minor [no ci] * contrib : clarify `_context` suffix usage [no ci] * contrib : filename guidelines [no ci] * contrib : fix notes [no ci] 2025-01-13 14:46:36 +02:00			`- Execute [the full CI locally on your machine](ci/README.md) before publishing`
			- Verify that the perplexity and the performance are not affected negatively by your changes (use `llama-perplexity` and `llama-bench`)
			- If you modified the `ggml` source, run the `test-backend-ops` tool to check whether different backend implementations of the `ggml` operators produce consistent results (this requires access to at least two different `ggml` backends)
			- If you modified a `ggml` operator or added a new one, add the corresponding test cases to `test-backend-ops`
doc: update contributing guidelines [no ci] (#11969) 2025-02-21 12:51:25 +01:00			`- Create separate PRs for each feature or fix. Avoid combining unrelated changes in a single PR`
contrib : simplify + minor edits [no ci] 2024-10-06 14:15:27 +03:00			`- Consider allowing write access to your branch for faster reviews, as reviewers can push commits directly`
contrib : clarify PR squashing + module names (#8630) * contrib : clarify PR squashing * contrib : fix typo + add list of modules 2024-07-23 11:28:38 +03:00			`- If your PR becomes stale, don't hesitate to ping the maintainers in the comments`

			`# Pull requests (for collaborators)`

			`- Squash-merge PRs`
			- Use the following format for the squashed commit title: `<module> : <commit title> (#<issue_number>)`. For example: `utils : fix typo in utils.py (#1234)`
repo : update links to new url (#11886) * repo : update links to new url ggml-ci * cont : more urls ggml-ci 2025-02-15 16:40:57 +02:00			- Optionally pick a `<module>` from here: https://github.com/ggml-org/llama.cpp/wiki/Modules
contrib : refresh (#10593) * contrib : refresh * contrib : expand [no ci] * contrib : expand test-backend-ops instructions * contrib : add CODEOWNERS * prs : update template to not have checkbox [no ci] 2024-12-02 08:53:27 +02:00			`- Consider adding yourself to [CODEOWNERS](CODEOWNERS)`
docs: Added initial PR template with directions for doc only changes and squash merges [no ci] (#7700) This commit adds pull_request_template.md and CONTRIBUTING.md . It focuses on explaining to contributors the need to rate PR complexity level, when to add [no ci] and how to format PR title and descriptions. Co-authored-by: Brian <mofosyne@gmail.com> Co-authored-by: compilade <git@compilade.net> 2024-06-09 11:24:29 -04:00
contributing : update guidelines (#8316) 2024-07-05 09:09:47 +03:00			`# Coding guidelines`
docs: Added initial PR template with directions for doc only changes and squash merges [no ci] (#7700) This commit adds pull_request_template.md and CONTRIBUTING.md . It focuses on explaining to contributors the need to rate PR complexity level, when to add [no ci] and how to format PR title and descriptions. Co-authored-by: Brian <mofosyne@gmail.com> Co-authored-by: compilade <git@compilade.net> 2024-06-09 11:24:29 -04:00
contributing : update guidelines (#8316) 2024-07-05 09:09:47 +03:00			`- Avoid adding third-party dependencies, extra files, extra headers, etc.`
			`- Always consider cross-compatibility with other operating systems and architectures`
contrib : simplify + minor edits [no ci] 2024-10-06 14:15:27 +03:00			- Avoid fancy-looking modern STL constructs, use basic `for` loops, avoid templates, keep it simple
contrib : add naming guidelines (#11177) * contrib : add naming guidelines * contrib : expand naming guidelines [no ci] * contrib : cont [no ci] * contrib : add `_t` suffix guideline [no ci] * contrib : cont [no ci] * minor [no ci] * contrib : move coding guidelines to correct section [no ci] * contrib : minor reword coding guidelines [no ci] * contrib : add TODO for preprocessor directives [no ci] * contrib : expand [no ci] * minor [no ci] * contrib : clarify `_context` suffix usage [no ci] * contrib : filename guidelines [no ci] * contrib : fix notes [no ci] 2025-01-13 14:46:36 +02:00			`- Vertical alignment makes things more readable and easier to batch edit`
contributing : update guidelines (#8316) 2024-07-05 09:09:47 +03:00			- Clean-up any trailing whitespaces, use 4 spaces for indentation, brackets on the same line, `void * ptr`, `int & a`
contrib : add naming guidelines (cont) (#11177) 2025-01-13 15:08:44 +02:00			- Use sized integer types such as `int32_t` in the public API, e.g. `size_t` may also be appropriate for allocation sizes or byte offsets
contrib : add naming guidelines (#11177) * contrib : add naming guidelines * contrib : expand naming guidelines [no ci] * contrib : cont [no ci] * contrib : add `_t` suffix guideline [no ci] * contrib : cont [no ci] * minor [no ci] * contrib : move coding guidelines to correct section [no ci] * contrib : minor reword coding guidelines [no ci] * contrib : add TODO for preprocessor directives [no ci] * contrib : expand [no ci] * minor [no ci] * contrib : clarify `_context` suffix usage [no ci] * contrib : filename guidelines [no ci] * contrib : fix notes [no ci] 2025-01-13 14:46:36 +02:00			- Declare structs with `struct foo {}` instead of `typedef struct foo {} foo`
			- In C++ code omit optional `struct` and `enum` keyword whenever they are not necessary
			```cpp
			`// OK`
			`llama_context * ctx;`
			`const llama_rope_type rope_type;`

			`// not OK`
			`struct llama_context * ctx;`
			`const enum llama_rope_type rope_type;`
			```
contrib : add naming guidelines (cont) (#11177) 2025-01-13 15:59:26 +02:00
contrib : add naming guidelines (#11177) * contrib : add naming guidelines * contrib : expand naming guidelines [no ci] * contrib : cont [no ci] * contrib : add `_t` suffix guideline [no ci] * contrib : cont [no ci] * minor [no ci] * contrib : move coding guidelines to correct section [no ci] * contrib : minor reword coding guidelines [no ci] * contrib : add TODO for preprocessor directives [no ci] * contrib : expand [no ci] * minor [no ci] * contrib : clarify `_context` suffix usage [no ci] * contrib : filename guidelines [no ci] * contrib : fix notes [no ci] 2025-01-13 14:46:36 +02:00			_(NOTE: this guideline is yet to be applied to the `llama.cpp` codebase. New code should follow this guideline.)_
contrib : add naming guidelines (cont) (#11177) 2025-01-13 15:59:26 +02:00
ggml : upgrade init_tensor API to return a ggml_status (#11854) * Upgrade init_tensor API to return a ggml_status To prepare for an 'abort-free' ggml (ggml not to abort on OOMs but return a OOM status), as agreeed with Diego in the ggml repo, upgrade the init_tensor() and view_init() APIs to return a ggml_status. * misc fixes --------- Co-authored-by: slaren <slarengh@gmail.com> 2025-02-28 05:41:47 -08:00			- Try to follow the existing patterns in the code (indentation, spaces, etc.). In case of doubt use `clang-format` (from clang-tools v15+) to format the added code
contrib : add naming guidelines (#11177) * contrib : add naming guidelines * contrib : expand naming guidelines [no ci] * contrib : cont [no ci] * contrib : add `_t` suffix guideline [no ci] * contrib : cont [no ci] * minor [no ci] * contrib : move coding guidelines to correct section [no ci] * contrib : minor reword coding guidelines [no ci] * contrib : add TODO for preprocessor directives [no ci] * contrib : expand [no ci] * minor [no ci] * contrib : clarify `_context` suffix usage [no ci] * contrib : filename guidelines [no ci] * contrib : fix notes [no ci] 2025-01-13 14:46:36 +02:00			`- For anything not covered in the current guidelines, refer to the [C++ Core Guidelines](https://isocpp.github.io/CppCoreGuidelines/CppCoreGuidelines)`
contributing : update guidelines (#8316) 2024-07-05 09:09:47 +03:00			`- Tensors store data in row-major order. We refer to dimension 0 as columns, 1 as rows, 2 as matrices`
repo : update links to new url (#11886) * repo : update links to new url ggml-ci * cont : more urls ggml-ci 2025-02-15 16:40:57 +02:00			- Matrix multiplication is unconventional: [`C = ggml_mul_mat(ctx, A, B)`](https://github.com/ggml-org/llama.cpp/blob/880e352277fc017df4d5794f0c21c44e1eae2b84/ggml.h#L1058-L1064) means $C^T = A B^T \Leftrightarrow C = B A^T.$
contributing : update guidelines (#8316) 2024-07-05 09:09:47 +03:00
			`![matmul](media/matmul.png)`
docs: Added initial PR template with directions for doc only changes and squash merges [no ci] (#7700) This commit adds pull_request_template.md and CONTRIBUTING.md . It focuses on explaining to contributors the need to rate PR complexity level, when to add [no ci] and how to format PR title and descriptions. Co-authored-by: Brian <mofosyne@gmail.com> Co-authored-by: compilade <git@compilade.net> 2024-06-09 11:24:29 -04:00
contrib : add naming guidelines (#11177) * contrib : add naming guidelines * contrib : expand naming guidelines [no ci] * contrib : cont [no ci] * contrib : add `_t` suffix guideline [no ci] * contrib : cont [no ci] * minor [no ci] * contrib : move coding guidelines to correct section [no ci] * contrib : minor reword coding guidelines [no ci] * contrib : add TODO for preprocessor directives [no ci] * contrib : expand [no ci] * minor [no ci] * contrib : clarify `_context` suffix usage [no ci] * contrib : filename guidelines [no ci] * contrib : fix notes [no ci] 2025-01-13 14:46:36 +02:00			`# Naming guidelines`

			- Use `snake_case` for function, variable and type names
repo : update links to new url (#11886) * repo : update links to new url ggml-ci * cont : more urls ggml-ci 2025-02-15 16:40:57 +02:00			`- Naming usually optimizes for longest common prefix (see https://github.com/ggml-org/ggml/pull/302#discussion_r1243240963)`
contrib : add naming guidelines (#11177) * contrib : add naming guidelines * contrib : expand naming guidelines [no ci] * contrib : cont [no ci] * contrib : add `_t` suffix guideline [no ci] * contrib : cont [no ci] * minor [no ci] * contrib : move coding guidelines to correct section [no ci] * contrib : minor reword coding guidelines [no ci] * contrib : add TODO for preprocessor directives [no ci] * contrib : expand [no ci] * minor [no ci] * contrib : clarify `_context` suffix usage [no ci] * contrib : filename guidelines [no ci] * contrib : fix notes [no ci] 2025-01-13 14:46:36 +02:00
			```cpp
			`// not OK`
			`int small_number;`
			`int big_number;`

			`// OK`
			`int number_small;`
			`int number_big;`
			```

			`- Enum values are always in upper case and prefixed with the enum name`

			```cpp
			`enum llama_vocab_type {`
			`LLAMA_VOCAB_TYPE_NONE = 0,`
			`LLAMA_VOCAB_TYPE_SPM = 1,`
			`LLAMA_VOCAB_TYPE_BPE = 2,`
			`LLAMA_VOCAB_TYPE_WPM = 3,`
			`LLAMA_VOCAB_TYPE_UGM = 4,`
			`LLAMA_VOCAB_TYPE_RWKV = 5,`
			`};`
			```

			- The general naming pattern is `<class>_<method>`, with `<method>` being `<action>_<noun>`

			```cpp
			`llama_model_init(); // class: "llama_model", method: "init"`
			`llama_sampler_chain_remove(); // class: "llama_sampler_chain", method: "remove"`
			`llama_sampler_get_seed(); // class: "llama_sampler", method: "get_seed"`
			`llama_set_embeddings(); // class: "llama_context", method: "set_embeddings"`
			`llama_n_threads(); // class: "llama_context", method: "n_threads"`
			`llama_adapter_lora_free(); // class: "llama_adapter_lora", method: "free"`
			```

			- The `get` `<action>` can be omitted
			- The `<noun>` can be omitted if not necessary
			- The `_context` suffix of the `<class>` is optional. Use it to disambiguate symbols when needed
			- Use `init`/`free` for constructor/destructor `<action>`

			- Use the `_t` suffix when a type is supposed to be opaque to the user - it's not relevant to them if it is a struct or anything else

			```cpp
			`typedef struct llama_context * llama_context_t;`

			`enum llama_pooling_type llama_pooling_type(const llama_context_t ctx);`
			```

			_(NOTE: this guideline is yet to be applied to the `llama.cpp` codebase. New code should follow this guideline)_

			- C/C++ filenames are all lowercase with dashes. Headers use the `.h` extension. Source files use the `.c` or `.cpp` extension
			`- Python filenames are all lowercase with underscores`

			`- _(TODO: abbreviations usage)_`

			`# Preprocessor directives`

contrib : add naming guidelines (cont) (#11177) 2025-01-13 15:59:26 +02:00			`- _(TODO: add guidelines with examples and apply them to the codebase)_`
contrib : add naming guidelines (#11177) * contrib : add naming guidelines * contrib : expand naming guidelines [no ci] * contrib : cont [no ci] * contrib : add `_t` suffix guideline [no ci] * contrib : cont [no ci] * minor [no ci] * contrib : move coding guidelines to correct section [no ci] * contrib : minor reword coding guidelines [no ci] * contrib : add TODO for preprocessor directives [no ci] * contrib : expand [no ci] * minor [no ci] * contrib : clarify `_context` suffix usage [no ci] * contrib : filename guidelines [no ci] * contrib : fix notes [no ci] 2025-01-13 14:46:36 +02:00
			```cpp
			`#ifdef FOO`
			`#endif // FOO`
			```

			`# Documentation`

			`- Documentation is a community effort`
contrib : add naming guidelines (cont) (#11177) 2025-01-13 15:08:44 +02:00			`- When you need to look into the source code to figure out how to use an API consider adding a short summary to the header file for future reference`
contrib : add naming guidelines (#11177) * contrib : add naming guidelines * contrib : expand naming guidelines [no ci] * contrib : cont [no ci] * contrib : add `_t` suffix guideline [no ci] * contrib : cont [no ci] * minor [no ci] * contrib : move coding guidelines to correct section [no ci] * contrib : minor reword coding guidelines [no ci] * contrib : add TODO for preprocessor directives [no ci] * contrib : expand [no ci] * minor [no ci] * contrib : clarify `_context` suffix usage [no ci] * contrib : filename guidelines [no ci] * contrib : fix notes [no ci] 2025-01-13 14:46:36 +02:00			`- When you notice incorrect or outdated documentation, please update it`

contrib : add Resources section (#9675) 2024-09-29 14:38:18 +03:00			`# Resources`

			`The Github issues, PRs and discussions contain a lot of information that can be useful to get familiar with the codebase. For convenience, some of the more important information is referenced from Github projects:`

repo : update links to new url (#11886) * repo : update links to new url ggml-ci * cont : more urls ggml-ci 2025-02-15 16:40:57 +02:00			`https://github.com/ggml-org/llama.cpp/projects`