mirror of https://github.com/llvm/llvm-project.git synced 2025-04-16 08:06:31 +00:00

History

Ethan Luis McDonough 9e5c136d5a

[PGO][Offload] Profile profraw generation for GPU instrumentation #76587 (#93365 )

This pull request is the second part of an ongoing effort to extends PGO
instrumentation to GPU device code and depends on #76587. This PR makes
the following changes:

- Introduces `__llvm_write_custom_profile` to PGO compiler-rt library.
This is an external function that can be used to write profiles with
custom data to target-specific files.
- Adds `__llvm_write_custom_profile` as weak symbol to libomptarget so
that it can write the collected data to a profraw file.
- Adds `PGODump` debug flag and only displays dump when the
aforementioned flag is set

2025-02-11 23:30:54 -06:00

cmake

[Offload] Fix the offload cache file triggering libc++ / libstdc++ mixing (#126313 )

2025-02-10 13:20:35 -06:00

DeviceRTL

[OpenMP] Replace use of target address space with <gpuintrin.h> local (#126119 )

2025-02-09 10:25:25 -06:00

docs

[Offload][NFC] Factor out and rename the __tgt_offload_entry struct (#123785 )

2025-01-21 12:05:24 -06:00

include

[PGO][Offload] Profile profraw generation for GPU instrumentation #76587 (#93365 )

2025-02-11 23:30:54 -06:00

liboffload

[Offload][NFC] Fix typos discovered by codespell (#125119 )

2025-01-31 09:35:29 -06:00

libomptarget

[Offload][NFC] Rename src/ -> libomptarget/ (#126573 )

2025-02-10 13:22:10 -06:00

plugins-nextgen

[PGO][Offload] Profile profraw generation for GPU instrumentation #76587 (#93365 )

2025-02-11 23:30:54 -06:00

test

[PGO][Offload] Profile profraw generation for GPU instrumentation #76587 (#93365 )

2025-02-11 23:30:54 -06:00

tools

[Offload][NFC] Fix typos discovered by codespell (#125119 )

2025-01-31 09:35:29 -06:00

unittests

Reland #118503 : [Offload] Introduce offload-tblgen and initial new API implementation (#118614 )

2024-12-05 09:34:04 +01:00

utils

…

CMakeLists.txt

[Offload][NFC] Rename src/ -> libomptarget/ (#126573 )

2025-02-10 13:22:10 -06:00

README.md

[Offload][NFC] Update README.md

2024-11-17 07:32:29 -08:00

README.txt

…

README.md

The LLVM/Offload Subproject

The Offload subproject aims at providing tooling, runtimes, and APIs that allow users to execute code on accelerators or other "co-processors" that may or may not match the architecture of their "host". In the long run, all kinds of targets are in scope of this effort, including but not limited to: CPUs, GPUs, FPGAs, AI/ML accelerators, distributed resources, etc.

For OpenMP offload users, the project is ready and fully usable. The final API design is still under development. More content will show up here and on our webpage soon. In the meantime, people are encouraged to participate in our meetings (see below) and check our development board as well as the discussions on Discourse.

Meetings

Every second Wednesday, 7:00 - 8:00am PT, starting Jan 24, 2024. Alternates with the OpenMP in LLVM meeting. invite.ics Meeting Minutes and Agenda