llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-28 16:16:07 +00:00

Author	SHA1	Message	Date
Mircea Trofin	0a1f7f0e66	[ctxprof][nfc] Drop `Section` in `writeFlatSection` (#130654 ) It matches `writeContextual`, and also, it's not the whole section that's being written there, just a profile, piecemeal.	2025-03-10 12:50:41 -07:00
Mircea Trofin	c8fd7a8a3a	[ctxprof] Profile section for flat profiles (#129932 ) A section for flat profiles (i.e. non-contextual). This is useful for debugging or for intentional cases where a root isn't identified. This patch adds the reader/writer support. `compiler-rt` changes follow in a subsequent change.	2025-03-06 21:18:57 -08:00
Mircea Trofin	5f70ed5bda	[nfc] Small fixups (coding style) post- PR #129626 (#129935 )	2025-03-06 10:58:14 -08:00
Nikita Popov	979c275097	[IR] Store Triple in Module (NFC) (#129868 ) The module currently stores the target triple as a string. This means that any code that wants to actually use the triple first has to instantiate a Triple, which is somewhat expensive. The change in #121652 caused a moderate compile-time regression due to this. While it would be easy enough to work around, I think that architecturally, it makes more sense to store the parsed Triple in the module, so that it can always be directly queried. For this change, I've opted not to add any magic conversions between std::string and Triple for backwards-compatibilty purses, and instead write out needed Triple()s or str()s explicitly. This is because I think a decent number of them should be changed to work on Triple as well, to avoid unnecessary conversions back and forth. The only interesting part in this patch is that the default triple is Triple("") instead of Triple() to preserve existing behavior. The former defaults to using the ELF object format instead of unknown object format. We should fix that as well.	2025-03-06 10:27:47 +01:00
Mircea Trofin	5223ddd83f	[ctxprof] Prepare profile format for flat profiles (#129626 ) The profile format has now a separate section called "Contexts" - there will be a corresponding one for flat profiles. The root has a separate tag because, in addition to not having a callsite ID as all the other context nodes have under it, it will have additional fields in subsequent patches. The rest of this patch amounts to a bit of refactorings in the reader/writer (for better reuse later) and tests fixups.	2025-03-05 07:22:35 -08:00
Mircea Trofin	2068a18c86	[ctxprof][nfc] Prepare CtxProfAnalysis for flat profiles (#129623 ) Mostly remove the equivalence "no contexts == no CtxProfAnalysis result", and instead check explicitly there are no contextual profiles.	2025-03-04 16:42:47 -08:00
Lei Wang	d38380d3d8	[CSSPGO] Fix redundant reading of profile metadata (#129609 ) Fix a build speed regression due to repeated reading of profile metadata. Before the function `readFuncMetadata(ProfileHasAttribute, Profiles)` reads the metadata for all the functions(`Profiles`), however, it's actually used for on-demand loading, it can be called for multiple times, which leads to redundant reading that causes the build speed regression. Now fix it to read the metadata only for the new loaded functions(functions in the `FuncsToUse`).	2025-03-04 11:39:59 -08:00
Kazu Hirata	192b13bc9f	[ProfileData] Avoid repeated hash lookups (NFC) (#129194 )	2025-02-27 22:41:46 -08:00
Kazu Hirata	e264b0e856	[ProfileData] Avoid repeated hash lookups (NFC) (#128829 )	2025-02-26 00:57:28 -08:00
Ken Matsui	2847e1501e	[PGO] Fix incorrect count threshold calculation when 0% cutoff (#117359 ) DefaultCutoffsData does not have an entry for the 0th percentile. As a result, when the getEntryForPercentile method is called with a percentile argument of 0, it returns a ProfileSummaryEntry for the 1st percentile instead. This behavior affects the threshold calculations, such as getHotCountThreshold, causing them to incorrectly identify some sample profile counts as hot when they should not be. This patch addresses the issue by handling the 0th percentile case in the getEntryForPercentile method. This ensures that when the -profile-summary-cutoff-hot (or -cold) option is set to 0, no sample counts are treated as hot (or all sample counts are treated as cold).	2025-02-18 15:51:17 -05:00
Mike Hommey	e8999309f1	[Coverage] Speed up function record iteration (#122050 ) When iterating over function records, filtered by file name, currently, the iteration goes over all the function records, repeatedly for each source file, essentially giving quadratic behavior. 413647d730972eac9675f695c2ea63fb393a5531 sped up some cases by keeping track of the indices of the function records corresponding to each file name. This change expands the use of that map to FunctionRecordIterator. On a test case with Firefox's libxul.so and a 2.5MB profile, this brings down the runtime of `llvm-cov export $lib --instr-profile $prof -t lcov` from 12 minutes with 90% spent in skipOtherFiles to 19 seconds with no samples in skipOtherFiles at all under a sampling profiler (with a sampling interval of 1ms). Fixes #62079	2025-01-17 07:56:12 +01:00
Mircea Trofin	b15845c005	[ctxprof] dump profiles using yaml (for testing) (#123108 ) This is a follow-up from PR #122545, which enabled converting yaml to contextual profiles. This change uses the lower level yaml APIs because: - the mapping APIs `llvm::yaml` offers don't work with `const` values, because they (the APIs) want to enable both serialization and deserialization - building a helper data structure would be an alternative, but it'd be either memory-consuming or overly-complex design, given the recursive nature of the contextual profiles.	2025-01-15 16:49:59 -08:00
Mircea Trofin	6329355860	[ctxprof] Move test serialization to yaml (#122545 ) We have a textual representation of contextual profiles for test scenarios, mainly. This patch moves that to YAML instead of JSON. YAML is more succinct and readable (some of the .ll tests should be illustrative). In addition, JSON is parse-able by the YAML reader. A subsequent patch will address deserialization. (thanks, @kazutakahirata, for showing me how to use the llvm YAML reader/writer APIs, which I incorrectly thought to be more low-level than the JSON ones!)	2025-01-10 18:04:25 -08:00
NAKAMURA Takumi	61b294aa15	Introduce CounterExpressionBuilder::subst(C, Map) (#112698 ) This return a counter for each term in the expression replaced by ReplaceMap. At the moment, this doesn't update the Map, so Map is marked as `const`.	2025-01-09 16:27:35 +09:00
NAKAMURA Takumi	97097958fd	[Coverage] MCDC: Move findIndependencePairs deferred into MCDCRecord (#121188 ) The result of "Independence pairs" is not mergeable. This change makes defers re-calculation of "Independence pairs" after merging test vectors. No apparent behavior changes.	2025-01-07 08:52:04 +09:00
Zibi Sarbinowski	1d51546635	[SystemZ][z/OS] Open YAML files for read as text files (#121340 ) This patch makes sure YAML files are opened for reading as text file to trigger auto-conversion from EBCDIC encoding into expected ASCII encoding on z/OS platform. This is required to fix the following lit tests: ``` LLVM :: tools/llvm-gsymutil/ARM_AArch64/macho-gsym-callsite-info-exe.yaml LLVM :: tools/llvm-gsymutil/ARM_AArch64/macho-gsym-callsite-info-obj.test LLVM :: tools/llvm-gsymutil/ARM_AArch64/macho-gsym-callsite-info-dsym.yaml LLVM :: Transforms/PGOProfile/memprof_undrift_missing_leaf.ll ```	2024-12-31 07:24:59 -05:00
NAKAMURA Takumi	ee6f10d372	[Coverage] Make `MCDCRecord::Folded` as `[false/true]` with BitVector. NFC. (#121190 ) For merging `MCDCRecord`s, `Folded` is expected to be promoted as "Non-folded".	2024-12-28 17:48:30 +09:00
NAKAMURA Takumi	aa2fdc69d3	[Coverage] Move SingleByteCoverage out of CountedRegion (#110966 ) `SingleByteCoverage` is not per-region attribute at least. Move it into `CoverageData` since it comes from `profdata`. Depends on: #120841	2024-12-27 20:42:26 +09:00
NAKAMURA Takumi	275a27703e	[Coverage][Single] Round Counters to boolean after evaluation (#110972 ) Rounding in merging segments has been done after #75425. Depends on: #113114	2024-12-24 08:01:01 +09:00
Kazu Hirata	2886576944	[memprof] clang-format MemProf-related files (NFC) (#120504 )	2024-12-19 10:25:29 -08:00
NAKAMURA Takumi	e698079658	Allow `CoverageMapping::getCoverageForFile()` to show Branches also outside functions (#120416 ) Fixes #119952	2024-12-19 08:41:28 +09:00
Kazu Hirata	6fb967ec9e	[memprof] Move Frame::hash and hashCallStack to IndexedMemProfData (NFC) (#120365 ) Now that IndexedMemProfData::{addFrame,addCallStack} are the only callers of Frame::hash and hashCallStack, respectively, this patch moves those functions into IndexedMemProfData and makes them private. With this patch, we can obtain FrameId and CallStackId only through addFrame and addCallStack, respectively.	2024-12-18 10:56:45 -08:00
Kazu Hirata	66edefaee5	[memprof] Move YAML support to MemProfYAML.h (NFC) (#119515 ) The YAML support is increasing in size, so this patch moves it to a separate file.	2024-12-11 09:17:16 -08:00
Kazu Hirata	684e79f254	[memprof] Add YAML read/write support to llvm-profdata (#118915 ) This patch adds YAML read/write support to llvm-profdata. The primary intent is to accommodate MemProf profiles in test cases, thereby avoiding the binary format. The read support is via llvm-profdata merge. This is useful when we want to verify that the compiler does the right thing on a given .ll file and a MemProf profile in a test case. In the test case, we would convert the MemProf profile in YAML to an indexed profile and invoke the compiler on the .ll file along with the indexed profile. The write support is via llvm-profdata show --memory. This is useful when we wish to convert an indexed MemProf profile to YAML while writing tests. We would compile a test case in C++, run it for an indexed MemProf profile, and then convert it to the text format.	2024-12-07 20:22:05 -08:00
Fangrui Song	4153c2dc05	[ProfileData] Avoid deprecated is_pod	2024-12-07 16:22:14 -08:00
Kazu Hirata	c5e4e8f87d	[memprof] Add IndexedMemProfData::addCallStack (#118920 ) This patch adds a helper function to replace an idiom like: CallStackId CSId = hashCallStack(CallStack) MemProfData.CallStacks.try_emplace(CSId, CallStack); // Do something with CSId.	2024-12-06 12:10:11 -08:00
Kazu Hirata	bda02096d3	[ProfileData] Add InstrProfWriter::writeBinaryIds (NFC) (#118754 ) The patch makes InstrProfWriter::writeImpl less monolithic by adding InstrProfWriter::writeBinaryIds to serialize binary IDs. This way, InstrProfWriter::writeImpl can simply call the new function instead of handling all the details within writeImpl.	2024-12-05 08:39:27 -08:00
Kazu Hirata	50f8580e2c	[memprof] Add IndexedMemProfData::addFrame (#118724 ) This patch adds a helper function to replace an idiom like: FrameId Id = F.hash(); MemProfData.Frames.try_emplace(Id, F); // Do something with Id.	2024-12-04 20:33:35 -08:00
Kazu Hirata	c3d15188cf	[memprof] Move YAML traits to MemProf.h (NFC) (#118668 ) This patch moves the MemProf YAML traits to MemProf.h so that the YAML writer can access them from outside MemProfReader.cpp in the future.	2024-12-04 12:01:39 -08:00
ronryvchin	ff281f7d37	[PGO] Add option to always instrumenting loop entries (#116789 ) This patch extends the PGO infrastructure with an option to prefer the instrumentation of loop entry blocks. This option is a generalization of `19fb5b467b`, and helps to cover cases where the loop exit is never executed. An example where this can occur are event handling loops. Note that change does NOT change the default behavior.	2024-12-04 07:56:46 +01:00
Kazu Hirata	3ce8b7d220	[memprof] Remove inline call stacks (#117833 ) Now that MemProf format version 1 has been removed, nobody uses: - IndexedAllocationInfo::CallStack - IndexedMemProfRecord::CallSites This patch removed the dead struct fields. You might notice that IndexedMemProfRecord::{clear,merge} do not mention CallSiteIds at all. I think it's an oversight. clear doesn't matter at the moment because we call it during serialization to reduce memory footprint. merge is simply not as well tested as it should be. I'll follow up with a separate patch to address these issues.	2024-11-27 11:10:53 -08:00
Kazu Hirata	9d55e862d9	[memprof] Fix warnings on MSVC MSVC doesn't seem to count a use in static_assert as a use.	2024-11-27 09:50:45 -08:00
Kazu Hirata	e98396f484	Reapply [memprof] Add YAML-based deserialization for MemProf profile (#117829 ) This patch adds YAML-based deserialization for MemProf profile. It's been painful to write tests for MemProf passes because we do not have a text format for the MemProf profile. We would write a test case in C++, run it for a binary MemProf profile, and then finally run a test written in LLVM IR with the binary profile. This patch paves the way toward YAML-based MemProf profile. Specifically, it adds new class YAMLMemProfReader derived from MemProfReader. For now, it only adds a function to parse StringRef pointing to YAML data. Subseqeunt patches will wire it to llvm-profdata and read from a file. The field names are based on various printYAML functions in MemProf.h. I'm not aiming for compatibility with the format used in printYAML, but I don't see a point in changing the field names. This iteration works around the unavailability of ScalarTraits<uintptr_t> on macOS.	2024-11-27 08:19:07 -08:00
Florian Hahn	7e312c3b90	Revert "[memprof] Add YAML-based deserialization for MemProf profile (#117829 )" This reverts commit c00e53208db638c35499fc80b555f8e14baa35f0. It looks like this breaks building LLVM on macOS and some other platform/compiler combos https://lab.llvm.org/buildbot/#/builders/23/builds/5252 https://green.lab.llvm.org/job/llvm.org/job/clang-san-iossim/5356/console In file included from /Users/ec2-user/jenkins/workspace/llvm.org/clang-san-iossim/llvm-project/llvm/lib/ProfileData/MemProfReader.cpp:34: In file included from /Users/ec2-user/jenkins/workspace/llvm.org/clang-san-iossim/llvm-project/llvm/include/llvm/ProfileData/MemProfReader.h:24: In file included from /Users/ec2-user/jenkins/workspace/llvm.org/clang-san-iossim/llvm-project/llvm/include/llvm/ProfileData/InstrProfReader.h:22: In file included from /Users/ec2-user/jenkins/workspace/llvm.org/clang-san-iossim/llvm-project/llvm/include/llvm/ProfileData/InstrProfCorrelator.h:21: /Users/ec2-user/jenkins/workspace/llvm.org/clang-san-iossim/llvm-project/llvm/include/llvm/Support/YAMLTraits.h:1173:36: error: implicit instantiation of undefined template 'llvm::yaml::MissingTrait<unsigned long>' char missing_yaml_trait_for_type[sizeof(MissingTrait<T>)]; ^ /Users/ec2-user/jenkins/workspace/llvm.org/clang-san-iossim/llvm-project/llvm/include/llvm/Support/YAMLTraits.h:961:7: note: in instantiation of function template specialization 'llvm::yaml::yamlize<unsigned long>' requested here yamlize(*this, Val, Required, Ctx); ^ /Users/ec2-user/jenkins/workspace/llvm.org/clang-san-iossim/llvm-project/llvm/include/llvm/Support/YAMLTraits.h:883:11: note: in instantiation of function template specialization 'llvm::yaml::IO::processKey<unsigned long, llvm::yaml::EmptyContext>' requested here this->processKey(Key, Val, true, Ctx); ^ /Users/ec2-user/jenkins/workspace/llvm.org/clang-san-iossim/llvm-project/llvm/include/llvm/ProfileData/MIBEntryDef.inc:55:1: note: in instantiation of function template specialization 'llvm::yaml::IO::mapRequired<unsigned long>' requested here MIBEntryDef(AccessHistogram = 27, AccessHistogram, uintptr_t) ^ /Users/ec2-user/jenkins/workspace/llvm.org/clang-san-iossim/llvm-project/llvm/lib/ProfileData/MemProfReader.cpp:77:8: note: expanded from macro 'MIBEntryDef' Io.mapRequired(KeyStr.str().c_str(), MIB.Name); \ ^ /Users/ec2-user/jenkins/workspace/llvm.org/clang-san-iossim/llvm-project/llvm/include/llvm/Support/YAMLTraits.h:310:8: note: template is declared here struct MissingTrait; ^ 1 error generated.	2024-11-27 09:04:16 +00:00
Kazu Hirata	c00e53208d	[memprof] Add YAML-based deserialization for MemProf profile (#117829 ) This patch adds YAML-based deserialization for MemProf profile. It's been painful to write tests for MemProf passes because we do not have a text format for the MemProf profile. We would write a test case in C++, run it for a binary MemProf profile, and then finally run a test written in LLVM IR with the binary profile. This patch paves the way toward YAML-based MemProf profile. Specifically, it adds new class YAMLMemProfReader derived from MemProfReader. For now, it only adds a function to parse StringRef pointing to YAML data. Subseqeunt patches will wire it to llvm-profdata and read from a file. The field names are based on various printYAML functions in MemProf.h. I'm not aiming for compatibility with the format used in printYAML, but I don't see a point in changing the field names.	2024-11-26 23:48:03 -08:00
Kazu Hirata	5add295fd7	[memprof] Use IndexedMemProfRecord in MemProfReader (NFC) (#117613 ) IndexedMemProfRecord contains a complete package of the MemProf profile, including frames, call stacks, and records. This patch replaces the three member variables of MemProfReader with IndexedMemProfRecord. This transition significantly simplies both the constructor and the final "take" method: MemProfReader(IndexedMemProfData MemProfData) : MemProfData(std::move(MemProfData)) {} IndexedMemProfData takeMemProfData() { return std::move(MemProfData); }	2024-11-26 14:33:45 -08:00
Kazu Hirata	b0ca543532	[memprof] Remove dead code in MemProfReader (NFC) (#117607 ) The only constructor in current use is the one that takes IndexedMemProfData. Likewise, the only accessor in current use is takeMemProfData.	2024-11-25 11:01:26 -08:00
Kazu Hirata	ff7b42c194	[memprof] Speed up llvm-profdata (#117446 ) CallStackRadixTreeBuilder::build takes the parameter MemProfFrameIndexes by value, involving copies: std::optional<const llvm::DenseMap<FrameIdTy, LinearFrameId>> MemProfFrameIndexes Then "build" makes another copy of MemProfFrameIndexe and passes it to encodeCallStack for every call stack, which is painfully slow. This patch changes the type to a pointer so that we don't have to make a copy every time we pass the argument. Without this patch, it takes 553 seconds to run "llvm-profdata merge" on a large MemProf raw profile. This patch shortenes that down to 67 seconds.	2024-11-24 21:08:54 -08:00
Kazu Hirata	9e3215ac16	[memprof] Add an assert to InstrProfWriter::addMemProfData (#117426 ) This patch adds a quick validity check to InstrProfWriter::addMemProfData. Specifically, we check to see if we have all (or none) of the MemProf profile components (frames, call stacks, records). The credit goes to Teresa Johnson for suggesting this assert.	2024-11-24 21:07:59 -08:00
Kazu Hirata	9d8a11fb39	[memprof] Remove verifyIndexedMemProfRecord and verifyFunctionProfileData (#117412 ) This patch removes two functions to verify the consistency between: - IndexedAllocationInfo::CallStack - IndexedAllocationInfo::CSId Now that MemProf format Version 1 has been removed, IndexedAllocationInfo::CallStack doesn't participate in either serialization or deserialization, so we don't care about the consistency between the two fields in IndexAllocationInfo. Subsequent patches will remove uses of the old field and eventually remove the field.	2024-11-22 21:58:01 -08:00
Kazu Hirata	a0153eaa65	[memprof] Fix builds under EXPENSIVE_CHECKS memprof::Version1 has been removed, so the whole block of code is dead.	2024-11-22 17:23:16 -08:00
Teresa Johnson	776476c282	Reapply "[MemProf] Use radix tree for alloc contexts in bitcode summaries" (#117395 ) (#117404 ) This reverts commit fdb050a5024320ec29d2edf3f2bc686c3a84abaa, and restores ccb4702038900d82d1041ff610788740f5cef723, with a fix for build bot failures. Specifically, add ProfileData to the dependences of the BitWriter library, which was causing shared library builds of LLVM to fail. Reproduced the failure with a shared library build and confirmed this change fixes that build failure.	2024-11-22 16:18:30 -08:00
Teresa Johnson	fdb050a502	Revert "[MemProf] Use radix tree for alloc contexts in bitcode summaries" (#117395 ) Reverts llvm/llvm-project#117066 This is causing some build bot failures that need investigation.	2024-11-22 14:57:58 -08:00
Teresa Johnson	ccb4702038	[MemProf] Use radix tree for alloc contexts in bitcode summaries (#117066 ) Leverage the support added to represent allocation contexts in a more compact way via a radix tree in the indexed profile to similarly reduce sizes of the bitcode summaries. For a large target, this reduced the size of the per-module summaries by about 18% and in the distributed combined index files by 28%.	2024-11-22 14:49:55 -08:00
Kazu Hirata	ad2bdd8fab	[memprof] Remove MemProf format Version 1 (#117357 ) This patch removes MemProf format Version 1 now that Version 2 and 3 are working well.	2024-11-22 11:53:31 -08:00
Teresa Johnson	e14827f082	[MemProf] Templatize CallStackRadixTreeBuilder (NFC) (#117014 ) Prepare for usage in the bitcode reader/writer where we already have a LinearFrameId: - templatize input frame id type in CallStackRadixTreeBuilder - templatize input frame id type in computeFrameHistogram - make the map from FrameId to LinearFrameId optional We plan to use the same radix format in the ThinLTO summary records, where we already have a LinearFrameId.	2024-11-20 10:08:58 -08:00
Kazu Hirata	4f1b20f023	[ProfileData] Remove unused includes (NFC) (#116751 ) Identified with misc-include-cleaner.	2024-11-19 19:42:20 -08:00
Kazu Hirata	f97c610d1f	[memprof] Add MemProfReader::takeMemProfData (#116769 ) This patch adds MemProfReader::takeMemProfData, a function to return the complete MemProf profile from the reader. We can directly pass its return value to InstrProfWriter::addMemProfData without having to deal with the indivual components of the MemProf profile. The new function is named "take", but it doesn't do std::move yet because of type differences (DenseMap v.s. MapVector). The end state I'm trying to get to is roughly as follows: - MemProfReader accepts IndexedMemProfData as a parameter as opposed to the three individual components (frames, call stacks, and records). - MemProfReader keeps IndexedMemProfData as a class member without decomposing it into its individual components. - MemProfReader returns IndexedMemProfData like: IndexedMemProfData takeMemProfData() { return std::move(MemProfData); }	2024-11-19 19:33:26 -08:00
Kazu Hirata	6bf8f08989	[memprof] Add InstrProfWriter::addMemProfData (#116528 ) This patch adds InstrProfWriter::addMemProfData, which adds the complete MemProf profile (frames, call stacks, and records) to the writer context. Without this function, functions like loadInput in llvm-profdata.cpp and InstrProfWriter::mergeRecordsFromWriter must add one item (frame, call stack, or record) at a time. The new function std::moves the entire MemProf profile to the writer context if the destination is empty, which is the common use case. Otherwise, we fall back to adding one item at a time behind the scene. Here are a couple of reasons why we should add this function: - We've had a bug where we forgot to add one of the three data structures (frames, call stacks, and records) to the writer context, resulting in a nearly empty indexed profile. We should always package the three data structures together, especially on API boundaries. - We expose a little too much of the MemProf detail to InstrProfWriter. I'd like to gradually transform InstrProfReader/Writer to entities managing buffers (sequences of bytes), with actual serialization/deserialization left to external classes. We already do some of this in InstrProfReader, where InstrProfReader "contracts out" to IndexedMemProfReader to handle MemProf details. I am not changing loadInput or InstrProfWriter::mergeRecordsFromWriter for now because MemProfReader uses DenseMap for frames and call stacks, whereas MemProfData uses MapVector. I'll resolve these mismatches in subsequent patches.	2024-11-18 08:56:25 -08:00
Kazu Hirata	0d38f64e7d	[memprof] Remove MemProf format Version 0 (#116442 ) This patch removes MemProf format Version 0 now that version 2 and 3 seem to be working well. I'm not touching version 1 for now because some tests still rely on version 1. Note that Version 0 is identical to Version 1 except that the MemProf section of the indexed format has a MemProf version field.	2024-11-15 15:37:00 -08:00

1 2 3 4 5 ...

1143 Commits