
No codegen regression on either target. The two __builtin_ffs calls implied on nvptx CSE away.

```
define internal i64 @__gpu_read_first_lane_u64(i64 noundef %__lane_mask, i64 noundef %__x) #2 {
entry:
  %shr = lshr i64 %__x, 32
  %conv = trunc nuw i64 %shr to i32
  %conv1 = trunc i64 %__x to i32
  %conv2 = trunc i64 %__lane_mask to i32
  %0 = tail call range(i32 0, 33) i32 @llvm.cttz.i32(i32 %conv2, i1 true)
  %iszero = icmp eq i32 %conv2, 0
  %sub = select i1 %iszero, i32 -1, i32 %0
  %1 = tail call i32 @llvm.nvvm.shfl.sync.idx.i32(i32 %conv2, i32 %conv, i32 %sub, i32 31)
  %conv4 = sext i32 %1 to i64
  %shl = shl nsw i64 %conv4, 32
  %2 = tail call i32 @llvm.nvvm.shfl.sync.idx.i32(i32 %conv2, i32 %conv1, i32 %sub, i32 31)
  %conv7 = zext i32 %2 to i64
  %or = or disjoint i64 %shl, %conv7
  ret i64 %or
}

; becomes

define internal i64 @__gpu_competing_read_first_lane_u64(i64 noundef %__lane_mask, i64 noundef %__x) #2 {
entry:
  %shr = lshr i64 %__x, 32
  %conv = trunc nuw i64 %shr to i32
  %conv1 = trunc i64 %__x to i32
  %conv.i = trunc i64 %__lane_mask to i32
  %0 = tail call range(i32 0, 33) i32 @llvm.cttz.i32(i32 %conv.i, i1 true)
  %iszero = icmp eq i32 %conv.i, 0
  %sub.i = select i1 %iszero, i32 -1, i32 %0
  %1 = tail call i32 @llvm.nvvm.shfl.sync.idx.i32(i32 %conv.i, i32 %conv, i32 %sub.i, i32 31)
  %conv4 = zext i32 %1 to i64
  %shl = shl nuw i64 %conv4, 32
  %2 = tail call i32 @llvm.nvvm.shfl.sync.idx.i32(i32 %conv.i, i32 %conv1, i32 %sub.i, i32 31)
  %conv7 = zext i32 %2 to i64
  %or = or disjoint i64 %shl, %conv7
  ret i64 %or
}
```

The sext vs zext difference is vaguely interesting, but since the bits are immediately discarded in either case it makes no odds. The amdgcn one doesn't need CSE; the readfirstlane function is a single call to an intrinsic.

Drive-by fix to __gpu_match_all_u32: it was calling first_lane_u64 and could use first_lane_u32 instead. Also added the missing call to the gpuintrin.c test case and a stray missing static.
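
For context, the 64-bit read is built out of two 32-bit broadcasts over the high and low halves, which is the shape visible in the IR above. A minimal sketch of that pattern, assuming the __gpu_read_first_lane_u32 primitive from gpuintrin.h (attributes and exact qualifiers omitted):

```
#include <stdint.h>

/* Sketch only: split the 64-bit value into halves, broadcast each half
   from the first active lane, and reassemble. This mirrors the
   lshr/trunc ... shl/or structure in the IR above. */
static inline uint64_t __gpu_read_first_lane_u64(uint64_t __lane_mask,
                                                 uint64_t __x) {
  uint32_t __hi = (uint32_t)(__x >> 32ull);        /* %shr / %conv  */
  uint32_t __lo = (uint32_t)(__x & 0xFFFFFFFFull); /* %conv1        */
  return ((uint64_t)__gpu_read_first_lane_u32(__lane_mask, __hi) << 32ull) |
         ((uint64_t)__gpu_read_first_lane_u32(__lane_mask, __lo));
}
```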
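The CSE claim follows from that structure: each 32-bit read recomputes the first set lane, so inlining it twice into the 64-bit read produces two identical cttz/select chains that fold into one (the single %sub feeding both shfl calls above). A rough sketch of the nvptx lowering, assuming clang's __nvvm_shfl_sync_idx_i32 builtin; the real header may differ in detail:

```
#include <stdint.h>

/* Sketch of the nvptx 32-bit broadcast. __builtin_ffs(mask) - 1 is the
   cttz/select pair in the IR: the index of the first set lane, or -1 when
   the mask is empty. */
static inline uint32_t __gpu_read_first_lane_u32(uint64_t __lane_mask,
                                                 uint32_t __x) {
  uint32_t __mask = (uint32_t)__lane_mask;
  uint32_t __id = __builtin_ffs(__mask) - 1;
  return __nvvm_shfl_sync_idx_i32(__mask, __x, __id, 0x1f /* lanes - 1 */);
}
```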
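The drive-by fix amounts to swapping the broadcast primitive inside __gpu_match_all_u32: a 32-bit comparand only needs the 32-bit first-lane read. A sketch of the shape; the ballot/compare body is an assumption about the surrounding gpuintrin.h code, not a quote of it:

```
#include <stdint.h>

/* Sketch: broadcast the first lane's value, ballot the lanes that match,
   and report the ballot only when every lane in the mask agrees. */
static inline uint64_t __gpu_match_all_u32(uint64_t __lane_mask,
                                           uint32_t __x) {
  uint32_t __first = __gpu_read_first_lane_u32(__lane_mask, __x); /* was _u64 */
  uint64_t __ballot = __gpu_ballot(__lane_mask, __x == __first);
  return __ballot == __lane_mask ? __ballot : 0ull;
}
```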