llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-24 11:36:08 +00:00

Author	SHA1	Message	Date
Joseph Huber	598e882ee8	[libc] Template the writing mode for the writer class (#111559 ) Summary: Currently we dispatch the writing mode off of a runtime enum passed in by the constructor. This causes very unfortunate codegen for the GPU targets where we get worst-case codegen because of the unused function pointer for `sprintf`. Instead, this patch moves all of this to a template so it can be masked out. This results in no dynamic stack and uses 60 VGPRs instead of 117. It also compiles about 5x as fast.	2025-03-12 13:51:44 -05:00
Connector Switch	ab22f652a4	[libc] implement `strings/str{n}casecmp_l` (#130407 ) ref: https://pubs.opengroup.org/onlinepubs/9799919799/functions/strcasecmp_l.html This patch introduces the `strcasecmp_l` function. At present, the locale parameter is ignored, making it a stub implementation. This is consistent with how other locale-related functions, such as `islower_l`, are treated in our codebase as well as in [musl](https://github.com/bminor/musl/blob/master/src/string/strcasecmp.c) and [bionic](https://cs.android.com/android/platform/superproject/main/+/main:bionic/libc/bionic/strings_l.cpp). --------- Co-authored-by: Michael Jones <michaelrj@google.com>	2025-03-12 10:49:04 +08:00
Connector Switch	426caf1182	[libc] add locale proxy header (#130621 ) Address review comments in #130407. This patch is already covered by existing locale test cases.	2025-03-11 08:46:29 +08:00
Vinay Deshmukh	257e483715	[libc] Add `-Wno-sign-conversion` & re-attempt `-Wconversion` (#129811 ) Relates to https://github.com/llvm/llvm-project/issues/119281#issuecomment-2699470459	2025-03-10 11:57:09 -04:00
Ishaan Verma	735afc8ceb	[libc] Added type-generic macros for fixed-point functions (#129371 ) Adds macros `absfx`, `countlsfx` and `roundfx` . ref: #129111	2025-03-09 00:11:34 -05:00
Connector Switch	a5588b6d20	[libc] implement `strings/ffs` (#129892 ) This patch adds the `strings/ffs` function. ref: https://pubs.opengroup.org/onlinepubs/9799919799/functions/ffs.html Closes: #122054.	2025-03-08 09:31:18 +08:00
Alexey Samsonov	d90423e310	[libc][bazel] Minor cleanup to remove unused dependencies. (#130348 ) * strcpy doesn't need to depend on memcpy * qsort_test_helper has been generalized and doesn't need to depend on qsort. This is a small cleanup to unblock the work outlined in #130327.	2025-03-07 13:31:18 -08:00
Connector Switch	dd9a2f0677	[libc] `first_trailing_one(0)` should be `0`. (#130155 ) Fix this minor bug. See more context at #129892.	2025-03-08 00:11:43 +08:00
Augie Fackler	da61b0ddc5	Revert "[libc] Enable -Wconversion for tests. (#127523 )" This reverts commit 1e6e845d49a336e9da7ca6c576ec45c0b419b5f6 because it changed the 1st parameter of adjust() to be unsigned, but libc itself calls adjust() with a negative argument in align_backward() in op_generic.h.	2025-03-05 16:42:40 -05:00
Vinay Deshmukh	1e6e845d49	[libc] Enable -Wconversion for tests. (#127523 ) Relates to: #119281	2025-03-04 10:24:35 -05:00
Michael Jones	64ae0a102f	[libc] implement l64a (#129099 ) Adds l64a, which generates the base 64 string expected by a64l.	2025-02-27 12:48:39 -08:00
lntue	4fd762caa6	[libc] Fix sqrtf128 smoke test for riscv32. (#129094 )	2025-02-27 13:55:22 -05:00
Krishna Pandey	f6bfa33cdb	[libc][stdfix] Implement fixed point bitsfx functions in llvm libc (#128413 ) Fixes #113359 --------- Signed-off-by: krishna2803 <kpandey81930@gmail.com>	2025-02-27 13:05:27 -05:00
Michael Jones	8beec9fc48	[libc] implement a64l (#128758 ) Implement the posix function a64l. Standard: https://pubs.opengroup.org/onlinepubs/9799919799/functions/a64l.html	2025-02-25 13:57:13 -08:00
wldfngrs	3e284554a8	[libc][math][c23] Add acosf16() function (#127731 ) - Implementation of acosf16 (inverse cosine) function for 16-bit inputs. - Exhaustive tests across the 16-bit input range.	2025-02-23 19:46:46 +01:00
lntue	8ea6b735a6	[libc] Fix alignment issue for HermeticTestUtils.cpp. (#128426 ) Full build precommit bots were failing due to mis-alignment of atomics in hermetic tests. This PR enforces the alignment for the bump allocator of hermetic test framework. Fixes https://github.com/llvm/llvm-project/issues/128185.	2025-02-23 13:41:51 -05:00
Joseph Huber	559cb6ba63	[libc] Fix scanf dependencies on the GPU build Summary: The GPU build doesn't have `file` as a struct but does use these targets. We need to add an escape for this.	2025-02-22 13:14:20 -06:00
Michael Jones	3fad640f06	[libc] Skip scanf internals when no file available (#128097 ) A temporary fix based on discussions in #128079 Currently, baremetal targets are failing to build because the scanf internals require FILE* (either from the system's libc or our libc). Normally we'd just turn off the broken entrypoint, but since the scanf internals are built separately that doesn't work. This patch adds extra conditions to building those internals, which we can hopefully remove once we have a proper way to build scanf for embedded.	2025-02-21 12:26:23 -08:00
Michael Jones	5981335d75	[libc] Fix scanf cmake for targets without FILE (#128056 ) Another followup fix to #121215 The new cmake wouldn't define the readerat all if the target wasn't GPU or didn't have a definition of FILE. This patch rewrites the cmake to be more general. As a followup, I'd like to make `use_system_file` consistent between /test and /src. Currently in /src it includes the `COMPILE_OPTIONS` and in /test it does not.	2025-02-20 13:47:45 -08:00
Joseph Huber	6cc7ca084a	[Clang] Fix cross-lane scan when given divergent lanes (#127703 ) Summary: The scan operation implemented here only works if there are contiguous ones in the executation mask that can be used to propagate the result. There are two solutions to this, one is to enter 'whole-wave-mode' and forcibly turn them back on, or to do this serially. This implementation does the latter because it's more portable, but checks to see if the parallel fast-path is applicable. Needs to be backported for correct behavior and because it fixes a failing libc test.	2025-02-19 16:46:59 -06:00
Zaky Hermawan	ef49760fa9	[libc][POSIX][unistd] implement getsid (#127341 ) Fixes https://github.com/llvm/llvm-project/issues/126603 --------- Signed-off-by: ZakyHermawan <zaky.hermawan9615@gmail.com>	2025-02-19 14:26:18 -08:00
Joseph Huber	9a584b07d7	[Clang] Add handlers for 'match_any' and 'match_all' to `gpuintrin.h` (#127504 ) Summary: These helpers are very useful but currently absent. They allow the user to get a bitmask representing the matches within the warp. I have made an executive decision to drop the `predicate` return from `match_all` because it's easily testable with `match_all() == __activemask()`.	2025-02-17 14:06:24 -06:00
Michael Jones	398f865499	[libc] Implement strftime (#122556 ) Implements the posix-specified strftime conversions for the default locale, along with comprehensive unit tests. This reuses a lot of design from printf, as well as the printf writer. Roughly based on #111305, but with major rewrites.	2025-02-14 15:56:55 -08:00
Roland McGrath	e477e568e6	[libc] Make test macros suppress the -Wdangling-else warnings (#127149 ) Use the trick from gtest to allow `ASSERT_...` and `EXPECT_...` macros to be used in braceless `if` without producing warnings about the nested `if`-`else` that results.	2025-02-13 20:05:24 -08:00
lntue	5ee9787730	[libc][math] Improve the performance of sqrtf128. (#122578 ) Use a combination of polynomial approximation and Newton-Raphson iterations in 64-bit and 128-bit integers to improve the performance of sqrtf128. The correct rounding is provided by squaring the result and comparing it with the argument. Performance improvement using the newly added perf test: - My function = the improved implementation from this PR - Other function = current implementation using `libc/src/__support/FPUtil/generic/sqrt.h` ``` Performance tests with inputs in denormal range: -- My function -- Total time : 1260765265 ns Average runtime : 125.951 ns/op Ops per second : 7939623 op/s -- Other function -- Total time : 7160726518 ns Average runtime : 715.357 ns/op Ops per second : 1397902 op/s -- Average runtime ratio -- Mine / Other's : 0.176067 Performance tests with inputs in normal range: -- My function -- Total time : 373003808 ns Average runtime : 37.2631 ns/op Ops per second : 26836189 op/s -- Other function -- Total time : 7353398916 ns Average runtime : 734.605 ns/op Ops per second : 1361275 op/s -- Average runtime ratio -- Mine / Other's : 0.0507254 ``` --------- Co-authored-by: Alexei Sibidanov <sibid@uvic.ca>	2025-02-13 17:49:52 -05:00
Nick Desaulniers	3e02069afe	[libc][pthread] fix -Wmissing-field-initializers (#126314 ) Fixes: llvm-project/libc/test/integration/src/pthread/pthread_rwlock_test.cpp:59:29: warning: missing field '__preference' initializer [-Wmissing-field-initializers] 59 \| pthread_rwlock_t rwlock = PTHREAD_RWLOCK_INITIALIZER; \| ^ Also, add a test that demonstrates the same issue for PTHREAD_MUTEX_INITIALIZER, and fix that, too. PTHREAD_ONCE_INIT does not have this issue and does have test coverage.	2025-02-12 14:28:29 -08:00
Krishna Pandey	1f51038036	[libc][stdfix] Implement `countlsfx` functions in libc. (#126597 ) fixes #113357 --------- Signed-off-by: krishna2803 <kpandey81930@gmail.com>	2025-02-12 14:12:44 -05:00
Nick Desaulniers	a3e2075e50	[libc][bit_test] fix -Wimplicit-int-conversion (#126317 ) Fixes: llvm-project/libc/src/__support/CPP/bit.h:235:28: error: implicit conversion loses integer precision: 'int' to 'cpp::enable_if_t<cpp::is_unsigned_v<unsigned short>, unsigned short>' (aka 'unsigned short') [-Werror,-Wimplicit-int-conversion] 235 \| return (value << rotate) \| (value >> (N - rotate)); \| ~~~~~~ ~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~ llvm-project/libc/src/__support/CPP/bit.h:247:28: error: implicit conversion loses integer precision: 'int' to 'cpp::enable_if_t<cpp::is_unsigned_v<unsigned short>, unsigned short>' (aka 'unsigned short') [-Werror,-Wimplicit-int-conversion] 247 \| return (value >> rotate) \| (value << (N - rotate)); \| ~~~~~~ ~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~ llvm-project/libc/test/src/__support/CPP/bit_test.cpp:45:36: error: implicit conversion loses integer precision: 'int' to 'unsigned char' [-Werror,-Wimplicit-int-conversion] 45 \| EXPECT_FALSE(has_single_bit<T>(two_bits_value)); \| ~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~ Via the libc-cpp-utils-tests ninja target.	2025-02-12 08:59:02 -08:00
Nick Desaulniers	b8ba266820	[libc][test][stdbit] fix -Wimplicit-int-conversion (#126616 ) When cross compiling the libc-stdbit-tests, the existing tests trigger numerous instances of -Wimplicit-int-conversion. The truncation of these implicit promotions is intentional.	2025-02-11 09:31:01 -08:00
wldfngrs	7ee56b9afc	[libc][math][c23] Add asinf16() function (#124212 ) Co-authored-by: OverMighty <its.overmighty@gmail.com>	2025-02-10 12:38:55 +01:00
Petr Hosek	6dbe542290	[libc] Use `${libc_opt_high_flag}` instead of `-O3` (#123233 ) This is preferable since `${libc_opt_high_flag}` will be set correctly for the compiler used.	2025-02-07 13:36:06 -08:00
Joseph Huber	7623d91784	Revert "[libc][stdfix] Fix buildbot failure because of a typo. (#126291 )" This reverts commit bada9220b87e73c0f4a498b82f883e17eda928d1. Revert "[libc][stdfix] Implement fixed point `countlsfx` functions in llvm-libc (#125356)" This reverts commit f2a1103b323492160d7d27a1575fbda709b49036.	2025-02-07 14:34:40 -06:00
Krishna Pandey	f2a1103b32	[libc][stdfix] Implement fixed point `countlsfx` functions in llvm-libc (#125356 ) fixes #113357	2025-02-07 13:53:17 -05:00
Nick Desaulniers	e566313a1f	[libc][signal] clean up usage of sighandler_t (#125745 ) `man 3 signal`'s declaration has a face _only a mother could love_. sighandler_t and __sighandler_t are not defined in the C standard, or POSIX. They are helpful typedefs provided by glibc and the Linux kernel UAPI headers respectively since working with function pointers' syntax can be painful. But we should not rely on them; in C++ we have `auto*` and `using` statements. Remove the proxy header, and only include a typedef for sighandler_t when targeting Linux, for compatibility with glibc. Fixes: #125598	2025-02-07 10:43:56 -08:00
Alan Zhao	4c8acbded1	[libc][minor] Fix assertion in LlvmLibcFILETest.SimpleFileOperations (#126109 ) The file descriptor of the first opened file is not necessarily 3, so we change the assertion so that it's >= 0 (i.e. not an error.) Fixes #126106	2025-02-06 12:57:22 -08:00
c8ef	e1c63bbde5	[libc] implement `sys/uio/readv` (#124718 ) Closes #124694. This patch adds the `sys/uio/readv` function. ref: https://pubs.opengroup.org/onlinepubs/009696699/functions/readv.html	2025-02-06 09:28:20 +08:00
Nick Desaulniers	8e35b3d29b	[libc][POSIX][poll.h] implement poll (#125118 ) Simple syscall. Fixes: #124647	2025-02-05 13:24:39 -08:00
Roland McGrath	4eab219487	[libc] Remove LlvmLibcStackChkFail.Smash test (#125919 ) This test was problematic, and also unnecessary. It's not really a test of the libc functionality or ABI. That's already covered by the LlvmLibcStackChkFail.Death test. The Smash test was in fact just testing that the compiler produces the call in the expected situation. That's a compiler test, not a libc test. It's not really feasible to make a test like this both reliable and safe. Since it's not something libc needs to test, it's not worth trying.	2025-02-05 12:34:19 -08:00
Joseph Huber	2d8106cb5a	[Clang] Add width handling for <gpuintrin.h> shuffle helper (#125896 ) Summary: The CUDA impelementation has long supported the `width` argument on its shuffle instrucitons, which makes it more difficult to replace those uses with this helper. This patch just correctly implements that for AMDGPU and NVPTX so it's equivalent to `__shfl_sync` in CUDA. This will ease porting. Fortunately these get optimized out correctly when passing in known widths.	2025-02-05 12:38:48 -06:00
Krishna Pandey	6f750cfb96	[libc][unistd] Implement setsid (#125704 ) https://man7.org/linux/man-pages/man2/setsid.2.html closes #124632	2025-02-05 10:08:16 -08:00
Roland McGrath	66ce716676	Revert "[libc] Make LlvmLibcStackChkFail.Smash test compatible with asan, hwasan" (#125785 ) Reverts llvm/llvm-project#125763 This causes failures in asan. More thought is needed.	2025-02-04 16:06:21 -08:00
Roland McGrath	1e7624ca4f	[libc] Make LlvmLibcStackChkFail.Smash test compatible with asan, hwasan (#125763 ) Previously this test was entirely disabled under asan, but not hwasan. Instead of disabling the test, make the test compatible with both asan and hwasan by disabling sanitizers only on the subroutine that does the stack-smashing.	2025-02-04 13:42:23 -08:00
Simon Tatham	b53da77c50	[libc] Alternative algorithm for decimal FP printf (#123643 ) The existing options for bin→dec float conversion are all based on the Ryū algorithm, which generates 9 output digits at a time using a table lookup. For users who can't afford the space cost of the table, the table-lookup subroutine is replaced with one that computes the needed table entry on demand, but the algorithm is otherwise unmodified. The performance problem with computing table entries on demand is that now you need to calculate a power of 10 for each 9 digits you output. But if you're calculating a custom power of 10 anyway, it's easier to just compute one, and multiply the _whole_ mantissa by it. This patch adds a header file alongside `float_dec_converter.h`, which replaces the whole Ryū system instead of just the table-lookup routine, implementing this alternative simpler algorithm. The result is accurate enough to satisfy (minimally) the accuracy demands of IEEE 754-2019 even in 128-bit long double. The new float128 test cases demonstrate this by testing the cases closest to the 39-digit rounding boundary. In my tests of generating 39 output digits (the maximum number supported by this algorithm) this code is also both faster and smaller than the USE_DYADIC_FLOAT version of the existing Ryū code.	2025-02-04 08:57:54 +00:00
Simon Tatham	c06d0ff806	[libc] Optimize BigInt→decimal in IntegerToString (#123580 ) When IntegerToString converts a BigInt into decimal, it determines each digit by computing `n % 10` and then resets n to `n / 10`, until the number becomes zero. The div and mod operations are done using `BigInt::divide_unsigned`, which uses the simplest possible bit-by-bit iteration, which is a slow algorithm in general, but especially so if the divisor 10 must first be promoted to a BigInt the same size as the dividend. The effect is to make each division take quadratic time, so that the overall decimal conversion is cubic – and the division is quadratic in the number of _bits_, so the constant of proportionality is also large. In this patch I've provided custom code to extract decimal digits much faster, based on knowing that the divisor is always 10, and processing a word at a time. So each digit extraction is linear-time with a much smaller constant of proportionality. Full comments are in the code. The general strategy is to do the reduction mod 10 first to determine the output digit; then subtract it off, so that what's left is guaranteed to be an exact multiple of 10; and finally divide by 10 using modular-arithmetic techniques rather than reciprocal-approximation-based ordinary integer division. I didn't find any existing tests of IntegerToString on a BigInt, so I've added one.	2025-02-04 08:57:41 +00:00
Roland McGrath	648981f913	[libc] Build with -Wdeprecated, fix some warnings (#125373 ) While GCC's -Wdeprecated is on by default and doesn't do much, Clang's -Wdeprecated enables many more things. More apply in C++20, so switch a test file that tickled one to using that. In future, C++20 should probably be made the baseline for compiling all the libc code.	2025-02-01 18:22:18 -08:00
lntue	c47a57393c	[libc] Fix conversion warnings for float16 tests. (#124830 ) Fixes https://github.com/llvm/llvm-project/issues/124801.	2025-01-28 16:00:05 -05:00
Shourya Goel	7f37b34d31	[libc][complex] Testing infra for MPC (#121261 ) This PR aims to add the groundwork to test the precision of libc complex functions against MPC. I took `cargf` as a test to verify that the infra works fine.	2025-01-28 11:01:16 +05:30
lntue	a976036a10	[libc][NFC] Remove extra ; in exhaustive_test.h. (#124216 ) These cause warnings when running check-libc.	2025-01-24 11:57:43 -05:00
Joseph Huber	256f40d0e6	[libc] Use the NVIDIA device allocator for GPU malloc (#124277 ) Summary: This is a blocker on another patch in the OpenMP runtime. The problem is that NVIDIA truly doesn't handle RPC-based allocations very well. It cannot reliably update the MMU while a kernel is running and it will usually deadlock if called from a separate thread due to internal use of TLS. This patch just removes the definition of `malloc` and `free` for NVPTX. The result here is that they will be undefined, which is the cue for the `nvlink` linker to define them for us. So, as far as `libc` is concerned it still implements malloc.	2025-01-24 10:17:20 -06:00
Nick Desaulniers	631a6e0004	[libc][wchar] implement wcslen (#124150 ) Update string_utils' string_length to work with char* or wchar_t*, so that it may be reusable when implementing wmemchr, wcspbrk, wcsrchr, wcsstr. Link: #121183 Link: #124027 Co-authored-by: Nick Desaulniers <ndesaulniers@google.com> --------- Co-authored-by: Tristan Ross <tristan.ross@midstall.com>	2025-01-23 13:33:04 -08:00

1 2 3 4 5 ...

1721 Commits