llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-22 21:06:04 +00:00

Author	SHA1	Message	Date
Joseph Huber	db6b7a84e6	[libc][NFC] Strip all training whitespace and missing newlines (#124163 )	2025-01-23 12:02:54 -06:00
Nick Desaulniers	431ea2d076	[libc] move bcmp, bzero, bcopy, index, rindex, strcasecmp, strncasecmp to strings.h (#118899 ) docgen relies on the convention that we have a file foo.cpp in libc/src/\<header\>/. Because the above functions weren't in libc/src/strings/ but rather libc/src/string/, docgen could not find that we had implemented these. Rather than add special carve outs to docgen, let's fix up our sources for these 7 functions to stick with the existing conventions the rest of the codebase follows. Link: #118860 Fixes: #118875	2024-12-10 08:58:45 -08:00
Guillaume Chatelet	4873968649	[libc] Remove automemcpy folder (#118781 ) The build is currently broken and we don't have the resources to keep it up to date :-/	2024-12-06 09:30:13 +01:00
David Peixotto	081a80f2b5	Fix build issues with libc mem* benchmarks (#115982 ) Fix a few issues found when trying to build the benchmark: Errors 1. Unable to find include "src/__support/macros/config.h" in LibcMemoryBenchmarkMain.cpp Warnings 2. Unused variable warning `Index` in MemorySizeDistributions.cpp 3. Fix deprecation warning for const-ref version of `DoNotOptimize`. warning: 'DoNotOptimize<void *>' is deprecated: The const-ref version of this method can permit undesired compiler optimizations in benchmarks	2024-11-14 11:10:11 -08:00
Youngsuk Kim	c3d78a7af8	[libc][benchmarks] Tidy uses of raw_string_ostream (NFC) As specified in the docs, 1) raw_string_ostream is always unbuffered and 2) the underlying buffer may be used directly ( 65b13610a5226b84889b923bae884ba395ad084d for further reference ) Avoid unneeded calls to raw_string_ostream::str(), to avoid excess indirection.	2024-09-17 10:25:18 -05:00
jameshu15869	deb6b45c32	[libc][gpu] Add Atan2 Benchmarks (#104708 ) This PR adds benchmarking for `atan2()`, `__nv_atan2()`, and `__ocml_atan2_f64()` using the same setup as `sin()`. This PR also adds support for throughout bencmarking for functions with 2 inputs.	2024-08-18 12:50:30 -05:00
Schrodinger ZHU Yifan	b7c7dbd473	Revert "libc: Remove `extern "C"` from main declarations" (#102827 ) Reverts llvm/llvm-project#102825	2024-08-11 13:40:50 -07:00
David Blaikie	1b71c471c7	libc: Remove `extern "C"` from main declarations (#102825 ) This is invalid in C++, and clang recently started warning on it as of #101853	2024-08-11 13:17:27 -07:00
jameshu15869	2b592b16c1	[libc][gpu] Add Sinf Benchmarks (#102532 ) This PR adds benchmarking for `sinf()` using the same set up as `sin()` but with a smaller range for floats.	2024-08-08 16:26:26 -05:00
jameshu15869	1248698e9b	[libc] [gpu] Fix Minor Benchmark UI Issues (#102529 ) Previously, `AmdgpuSinTwoPow_128` and others were too large for their table cells. This PR shortens the name to `AmdSin...` There were also some `-` missing in the separator. This PR instead creates the separator string using the length of the headers.	2024-08-08 15:32:20 -05:00
jameshu15869	9a070d6d0f	[libc] [gpu] Add Generic, NvSin, and OcmlSinf64 Throughput Benchmark (#101917 ) This PR implements `2a158426d4` to provide better throughput benchmarking for libc `sin()` and `__nv_sin()`. These changes have not been tested on AMDGPU yet, only compiled.	2024-08-08 15:05:34 -05:00
jameshu15869	39826b1030	[libc] [gpu] Change Time To Be Per Iteration (#101919 ) Previously, the time field was the total time take to run all iterations of the benchmark. This PR changes the value displayed to be the average time take by each iteration.	2024-08-05 08:27:31 -05:00
Joseph Huber	ebdcb76d1a	[libc] Only link in the appropriate architecture's device libs	2024-07-30 18:36:41 -05:00
jameshu15869	8f7910a4fc	[libc] Add AMDGPU Sin Benchmark (#101120 ) This PR adds support for benchmarking `__ocml_sin_f64()` against `sin()`. This PR is currently a draft because I do not have access to an AMD GPU and was not able to test the PR, but the code compiled when I ran `ninja gpu-benchmark` from `runtimes-amdgcn-amd-amdhsa-bins` Co-authored-by: Joseph Huber <huberjn@outlook.com>	2024-07-30 10:19:48 -05:00
jameshu15869	677796cab3	[libc] Add Generic and NVPTX Sin Benchmark (#99795 ) This PR adds sin benchmarking for a range of values and on a pregenerated random distribution.	2024-07-29 22:09:11 -05:00
Joseph Huber	79afb94da1	[libc] Make NVPTX benchmarks use LTO for linking Summary: Now that we can do LTO, we can make the benchmarks more accurate by allowing optimization + inlining of the implementation.	2024-07-27 06:53:12 -05:00
jameshu15869	a09c0f676d	[libc] Add Minimum Time and Iterations, Reduce Epsilon (#100838 ) This PR adds minimums (50 iterations, 500 us, and epsilon of 0.0001) to ensure that all benchmarks run at least a set number of times before outputting a final measurement.	2024-07-26 20:30:19 -05:00
Joseph Huber	6911f823ad	[libc] Fix invalid format specifier in benchmark Summary: This value is a uint32_t but is printed as a uint64_t, leading to invalid offsets when done on AMDGPU due to its packed format extending past the buffer.	2024-07-22 11:21:22 -05:00
jameshu15869	197b142232	[libc] Add N Threads Benchmark Helper (#99834 ) This PR adds a `BENCHMARK_N_THREADS()` helper to register benchmarks with a specific number of threads. This PR replaces the flags used originally to allow any amount of threads.	2024-07-21 21:56:40 -05:00
jameshu15869	a964f2e8a1	[libc] Improve Benchmark UI (#99796 ) This PR changes the output to resemble Google Benchmark. e.g. ``` Running Suite: LlvmLibcIsAlNumGpuBenchmark Benchmark \| Cycles \| Min \| Max \| Iterations \| Time (ns) \| Stddev \| Threads \| ----------------------------------------------------------------------------------------------------- IsAlnum \| 92 \| 76 \| 482 \| 23 \| 86500 \| 76 \| 64 \| IsAlnumSingleThread \| 87 \| 76 \| 302 \| 20 \| 72000 \| 49 \| 1 \| IsAlnumSingleWave \| 87 \| 76 \| 302 \| 20 \| 72000 \| 49 \| 32 \| IsAlnumCapital \| 89 \| 76 \| 299 \| 17 \| 78500 \| 52 \| 64 \| IsAlnumNotAlnum \| 87 \| 76 \| 303 \| 20 \| 76000 \| 49 \| 64 \| ```	2024-07-21 16:40:01 -05:00
jameshu15869	ef47bbb471	[libc] Add AMDGPU Timing to CMake (#99603 ) `libc/benchmarks/gpu/timing/CMakeLists.txt` did not correctly build `amdgpu` utils. This PR fixes that issue by adding `amdgpu` to the loop that adds the correct sub directories.	2024-07-19 06:57:55 -05:00
jameshu15869	8badfccefe	[libc] Add Multithreaded GPU Benchmarks (#98964 ) This PR runs benchmarks on a 32 threads (A single warp on NVPTX) by default, adding the option for single threaded benchmarks. We can specify that a benchmark should be run on a single thread using the `SINGLE_THREADED_BENCHMARK()` macro. I chose to use a flag here so that other options could be added in the future.	2024-07-18 07:18:23 -05:00
jameshu15869	1ecffdaf27	[libc] Add Kernel Resource Usage to nvptx-loader (#97503 ) This PR allows `nvptx-loader` to read the resource usage of `_start`, `_begin`, and `_end` when executing CUDA binaries. Example output: ``` $ nvptx-loader --print-resource-usage libc/benchmarks/gpu/src/ctype/libc.benchmarks.gpu.src.ctype.isalnum_benchmark.__build__ [ RUN ] LlvmLibcIsAlNumGpuBenchmark.IsAlnumWrapper [ OK ] LlvmLibcIsAlNumGpuBenchmark.IsAlnumWrapper: 93 cycles, 76 min, 470 max, 23 iterations, 78000 ns, 80 stddev _begin registers: 25 _start registers: 80 _end registers: 62 ``` --------- Co-authored-by: Joseph Huber <huberjn@outlook.com>	2024-07-17 16:07:12 -05:00
jameshu15869	b42c332d73	[libc] Use Atomics in GPU Benchmarks (#98842 ) This PR replaces our old method of reducing the benchmark results by using an array to using atomics instead. This should help us implement single threaded benchmarks.	2024-07-15 07:08:23 -05:00
Petr Hosek	5ff3ff33ff	[libc] Migrate to using LIBC_NAMESPACE_DECL for namespace declaration (#98597 ) This is a part of #97655.	2024-07-12 09:28:41 -07:00
Mehdi Amini	ce9035f5bd	Revert "[libc] Migrate to using LIBC_NAMESPACE_DECL for namespace declaration" (#98593 ) Reverts llvm/llvm-project#98075 bots are broken	2024-07-12 09:12:13 +02:00
Petr Hosek	3f30effe1b	[libc] Migrate to using LIBC_NAMESPACE_DECL for namespace declaration (#98075 ) This is a part of #97655.	2024-07-11 12:35:22 -07:00
jameshu15869	eeed5896de	[libc] Correctly Run Multiple Benchmarks in the Same File (#98467 ) There was previously an issue where registering multiple benchmarks in the same file would only give the results for the last benchmark to run. This PR fixes the issue. @jhuber6	2024-07-11 06:58:10 -05:00
jameshu15869	eb66e31bc2	[libc] Add Timing Utils for AMDGPU (#96828 ) PR for adding AMDGPU timing utils for benchmarking. I was not able to test this code since I do not have an AMD GPU, but I was able to successfully compile this code using -DRUNTIMES_amdgcn-amd-amdhsa_LIBC_GPU_TEST_ARCHITECTURE=gfx90a -DRUNTIMES_amdgcn-amd-amdhsa_LIBC_GPU_LOADER_EXECUTABLE=echo -DRUNTIMES_amdgcn_amd-amdhsa_LIBC_GPU_TARGET_ARCHITECTURE=gfx90a to force the code to compile without having an AMD gpu on my machine. @jhuber6	2024-07-10 16:04:56 -05:00
jameshu15869	f4e6ddbc2e	[libc] Fix Cppcheck Issues (#96999 ) This PR fixes linting issues discovered by `cppcheck`. Fixes: https://github.com/llvm/llvm-project/issues/96863	2024-07-06 17:53:36 -05:00
jameshu15869	02b57dedb7	[libc] NVPTX Profiling (#92009 ) PR for adding microbenchmarking infrastructure for NVPTX. `nvlink` cannot perform LTO, so we cannot inline `libc` functions and this function call overhead is not adjusted for during microbenchmarking.	2024-06-26 16:38:39 -05:00
lntue	73aab2f697	[libc] Revert https://github.com/llvm/llvm-project/pull/83199 since it broke Fuchsia. (#83374 ) With some header fix forward for GPU builds.	2024-02-29 14:43:53 -05:00
Joseph Huber	04e8653f18	[libc] Add "include/" to the LLVM include directories (#83199 ) Summary: Recent changes added an include path in the float128 type that used the internal `libc` path to find the macro. This doesn't work once it's installed because we need to search from the root of the install dir. This patch adds "include/" to the include path so that our inclusion of installed headers always match the internal use.	2024-02-27 17:45:15 -06:00
Joseph Huber	69c0b2febe	[libc][NFC] Remove all trailing spaces from libc (#82831 ) Summary: There are a lot of random training spaces on various lines. This patch just got rid of all of them with `sed 's/\ \+$//g'.	2024-02-23 16:34:00 -06:00
Dmitry Vyukov	605fadf0ca	[libc] Add --sweep-min-size flag for benchmarks (#70302 ) We have --sweep-max-size, it's reasonable to have --sweep-min-size as well. It can be used when working on the logic for larger sizes, or to collect a profile for larger sizes only.	2023-10-26 11:06:15 +02:00
Guillaume Chatelet	b6bc9d72f6	[libc] Mass replace enclosing namespace (#67032 ) This is step 4 of https://discourse.llvm.org/t/rfc-customizable-namespace-to-allow-testing-the-libc-when-the-system-libc-is-also-llvms-libc/73079	2023-09-26 11:45:04 +02:00
Guillaume Chatelet	467077796a	[reland][libc][cmake] Tidy compiler includes (#66783 ) (#66878 ) This is a reland of #66783 a35a3b75b219247eb9ff6784d1a0fe562f72d415 fixing the benchmark breakage.	2023-09-20 11:21:46 +02:00
Vitaly Buka	9abc1e080d	[test][libc] Fix aligned_alloc argument Size must be multiple of Alignment. Reviewed By: gchatelet Differential Revision: https://reviews.llvm.org/D157247	2023-08-07 16:36:20 -07:00
Guillaume Chatelet	eea5d9cc41	[libc][benchmark] Do not force static linking Being able to link statically depends on other CMake options and choice of libc.	2023-05-11 09:09:28 +00:00
Guillaume Chatelet	b51e6bfcb6	[libc] Allows cross compilation of membenchmarks This patch makes sure: - we pass the correct compiler options when building Google benchmarks, - we only import the C++ version of the memory functions. The change in libc/cmake/modules/LLVMLibCTestRules.cmake is here to make sure CMake can generate the right command line in the presence of the CMAKE_CROSSCOMPILING_EMULATOR option. Relevant documentation: https://cmake.org/cmake/help/latest/variable/CMAKE_CROSSCOMPILING_EMULATOR.html https://cmake.org/cmake/help/latest/command/add_custom_command.html#command:add_custom_command " If COMMAND specifies an executable target name (created by the `add_executable()` command), it will automatically be replaced by the location of the executable created at build time if either of the following is true: - The target is not being cross-compiled (i.e. the CMAKE_CROSSCOMPILING variable is not set to true). - New in version 3.6: The target is being cross-compiled and an emulator is provided (i.e. its CROSSCOMPILING_EMULATOR target property is set). In this case, the contents of CROSSCOMPILING_EMULATOR will be prepended to the command before the location of the target executable. " Reviewed By: gchatelet Differential Revision: https://reviews.llvm.org/D150200	2023-05-11 08:57:50 +00:00
Archibald Elliott	d768bf994f	[NFC][TargetParser] Replace uses of llvm/Support/Host.h The forwarding header is left in place because of its use in `polly/lib/External/isl/interface/extract_interface.cc`, but I have added a GCC warning about the fact it is deprecated, because it is used in `isl` from where it is included by Polly.	2023-02-10 09:59:46 +00:00
Kazu Hirata	660c33e51d	[libc] Use std::optional instead of llvm::Optional (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-01-14 21:10:14 -08:00
Kazu Hirata	b5516be056	[libc] Use std::optional instead of llvm::Optional (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-01-14 20:52:00 -08:00
Kazu Hirata	e1cdda57fa	[libc] Use std::optional instead of llvm::Optional (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-01-14 20:22:48 -08:00
serge-sans-paille	984b800a03	Move from llvm::makeArrayRef to ArrayRef deduction guides - last part This is a follow-up to https://reviews.llvm.org/D140896, split into several parts as it touches a lot of files. Differential Revision: https://reviews.llvm.org/D141298	2023-01-10 11:47:43 +01:00
Archibald Elliott	2a261a7b57	[libc] Add Missing TargetParser Dependency	2022-12-20 11:31:14 +00:00
Archibald Elliott	f09cf34d00	[Support] Move TargetParsers to new component This is a fairly large changeset, but it can be broken into a few pieces: - `llvm/Support/TargetParser` are all moved from the LLVM Support component into a new LLVM Component called "TargetParser". This potentially enables using tablegen to maintain this information, as is shown in https://reviews.llvm.org/D137517. This cannot currently be done, as llvm-tblgen relies on LLVM's Support component. - This also moves two files from Support which use and depend on information in the TargetParser: - `llvm/Support/Host.{h,cpp}` which contains functions for inspecting the current Host machine for info about it, primarily to support getting the host triple, but also for `-mcpu=native` support in e.g. Clang. This is fairly tightly intertwined with the information in `X86TargetParser.h`, so keeping them in the same component makes sense. - `llvm/ADT/Triple.h` and `llvm/Support/Triple.cpp`, which contains the target triple parser and representation. This is very intertwined with the Arm target parser, because the arm architecture version appears in canonical triples on arm platforms. - I moved the relevant unittests to their own directory. And so, we end up with a single component that has all the information about the following, which to me seems like a unified component: - Triples that LLVM Knows about - Architecture names and CPUs that LLVM knows about - CPU detection logic for LLVM Given this, I have also moved `RISCVISAInfo.h` into this component, as it seems to me to be part of that same set of functionality. If you get link errors in your components after this patch, you likely need to add TargetParser into LLVM_LINK_COMPONENTS in CMake. Differential Revision: https://reviews.llvm.org/D137838	2022-12-20 11:05:50 +00:00
Kazu Hirata	b950e06149	[libc] Use std::optional instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-10 17:01:55 -08:00
Siva Chandra Reddy	8dcb7f6b6f	[libc][Obvious] Fix memory function benchmarks after removal of None.	2022-12-11 00:31:21 +00:00
Kazu Hirata	f7dffc28b3	Don't include None.h (NFC) I've converted all known uses of None to std::nullopt, so we no longer need to include None.h. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-10 11:24:26 -08:00

1 2 3

113 Commits