mirror of
https://github.com/llvm/llvm-project.git
synced 2025-04-19 03:56:49 +00:00

This commit extends the lowering of amdgpu.mfma to handle the new double-rate MFMAs in gfx950 and adds tests for these operations. It also adds support for MFMAs on small floats (f6 and f4), which are implented using the "scaled" MFMA intrinsic with a scale value of 0 in order to have an unscaled MFMA. This commit does not add a `amdgpu.scaled_mfma` operation, as that is future work. --------- Co-authored-by: Jakub Kuderski <kubakuderski@gmail.com>
Multi-Level Intermediate Representation
See https://mlir.llvm.org/ for more information.