Mathew Odden 6b35155294
Fix invalid lowerings for ROCm in Pallas (#223)
popcount and clz were effectively broken on ROCm
because their math_dialect lowerings were incorrect.

Use the device intrinsics for these functions, as
well as for exp and absf, which fixes some accuracy issues in
the Pallas tests.
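
For checking the fixed lowerings, a minimal reference sketch in plain Python
may help. It is not part of this change: the helper names popcount32 and
clz32 are hypothetical, and treating clz(0) as 32 is an assumption about the
expected convention rather than something this commit states.

```python
def popcount32(x: int) -> int:
    """Count the set bits in the low 32 bits of x (hypothetical helper)."""
    return bin(x & 0xFFFFFFFF).count("1")


def clz32(x: int) -> int:
    """Count leading zero bits of x viewed as an unsigned 32-bit value.

    Assumption: an all-zero input yields 32.
    """
    return 32 - (x & 0xFFFFFFFF).bit_length()
```

Comparing kernel output against helpers like these on a few edge values
(0, 1, the sign bit, all ones) is one way a broken lowering would show up.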

Docs for OCML/OCKL:

- https://github.com/ROCm/llvm-project/blob/amd-staging/amd/device-libs/doc/OCML.md
- https://github.com/ROCm/llvm-project/blob/amd-staging/amd/device-libs/doc/OCKL.md
2025-02-14 11:27:52 -06:00