mirror of https://github.com/llvm/llvm-project.git synced 2025-04-17 00:06:33 +00:00

History

Ethan Luis McDonough c50d39f073

[PGO][Offload] Allow PGO flags to be used on GPU targets (#94268 )

This pull request is the third part of an ongoing effort to extends PGO
instrumentation to GPU device code and depends on
https://github.com/llvm/llvm-project/pull/93365. This PR makes the
following changes:

- Allows PGO flags to be supplied to GPU targets
- Pulls version global from device
- Modifies `__llvm_write_custom_profile` and `lprofWriteDataImpl` to
allow the PGO version to be overridden

2025-03-19 19:01:38 -05:00

cmake

Reapply "[Offload][AMDGPU] LLVM_ENABLE_RUNTIMES=flang-rt for amdgpu-offload-*" (#130274 )

2025-03-13 13:21:36 +01:00

DeviceRTL

[OpenMP] Replace utilities with 'gpuintrin.h' definitions (#131644 )

2025-03-19 10:47:21 -05:00

docs

[Offload][NFC] Factor out and rename the __tgt_offload_entry struct (#123785 )

2025-01-21 12:05:24 -06:00

include

[openmp][nfc] Use builtin align in the devicertl (#131918 )

2025-03-18 21:31:49 +00:00

liboffload

[Offload][NFC] Fix typos discovered by codespell (#125119 )

2025-01-31 09:35:29 -06:00

libomptarget

[offload] Remove redundant checks in MappingInfoTy::lookupMapping (#127638 )

2025-02-18 11:01:36 -06:00

plugins-nextgen

[PGO][Offload] Allow PGO flags to be used on GPU targets (#94268 )

2025-03-19 19:01:38 -05:00

test

[PGO][Offload] Allow PGO flags to be used on GPU targets (#94268 )

2025-03-19 19:01:38 -05:00

tools

[Offload][NFC] Fix typos discovered by codespell (#125119 )

2025-01-31 09:35:29 -06:00

unittests

Reland #118503 : [Offload] Introduce offload-tblgen and initial new API implementation (#118614 )

2024-12-05 09:34:04 +01:00

utils

…

CMakeLists.txt

[Offload][NFC] Rename src/ -> libomptarget/ (#126573 )

2025-02-10 13:22:10 -06:00

README.md

[Offload][NFC] Update README.md

2024-11-17 07:32:29 -08:00

README.txt

…

README.md

The LLVM/Offload Subproject

The Offload subproject aims at providing tooling, runtimes, and APIs that allow users to execute code on accelerators or other "co-processors" that may or may not match the architecture of their "host". In the long run, all kinds of targets are in scope of this effort, including but not limited to: CPUs, GPUs, FPGAs, AI/ML accelerators, distributed resources, etc.

For OpenMP offload users, the project is ready and fully usable. The final API design is still under development. More content will show up here and on our webpage soon. In the meantime, people are encouraged to participate in our meetings (see below) and check our development board as well as the discussions on Discourse.

Meetings

Every second Wednesday, 7:00 - 8:00am PT, starting Jan 24, 2024. Alternates with the OpenMP in LLVM meeting. invite.ics Meeting Minutes and Agenda