llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-28 03:16:05 +00:00

Author	SHA1	Message	Date
Jan Svoboda	8bbd0797d4	[clang] Allocate `Module` instances in `BumpPtrAllocator` (#112795 ) In `clang-scan-deps`, we're creating lots of `Module` instances. Allocating them all in a bump-pointer allocator reduces the number of retired instructions by 1-1.5% on my workload.	2024-10-22 08:57:33 -07:00
Abhina Sree	46dc91e7d9	[SystemZ][z/OS] Add new openFileForReadBinary function, and pass IsText parameter to getBufferForFile (#111723 ) This patch adds an IsText parameter to the following getBufferForFile, getBufferForFileImpl. We introduce a new virtual function openFileForReadBinary which defaults to openFileForRead except in RealFileSystem which uses the OF_None flag instead of OF_Text. The default is set to OF_Text instead of OF_None, this change in value does not affect any other platforms other than z/OS. Setting this parameter correctly is required to open files on z/OS in the correct encoding. The IsText parameter is based on the context of where we open files, for example, in the ASTReader, HeaderMap requires that files always be opened in binary even though they might be tagged as text.	2024-10-21 08:20:22 -04:00
Dmitry Polukhin	65780f4d8e	[C++20][Modules] Allow import for a header unit after #pragma (#111662 ) Summary: `#pragma` and headers that finish with them shouldn't prevent `import "header_unit.h"` syntax. Test Plan: check-clang	2024-10-11 08:23:35 +01:00
cor3ntin	75611caff4	[Clang] Improve type traits recognition in `__has_builtin` (#111516 ) `__has_builtin` was relying on reversible identifiers and string matching to recognize builtin-type traits, leading to some newer type traits not being recognized. Fixes #111477	2024-10-08 23:03:32 +02:00
Rainer Orth	b3c1403dc0	[clang] Fix std::tm etc. mangling on Solaris (#106353 ) Recently, Solaris bootstrap got broken because Solaris uses a non-standard mangling of `std::tm` and a few others. This was fixed with a hack in PR #100724. The Solaris ABI requires mangling `std::tm` as `tm` and similarly for `std::div_t`, `std::ldiv_t`, and `std::lconv`, which is what this patch implements. The hack needs to stay in place to allow building with older versions of `clang`. Tested on `amd64-pc-solaris2.11`, `sparcv9-sun-solaris2.11` (2-stage builds with both `clang-19` and `gcc-14` as build compiler), and `x86_64-pc-linux-gnu`.	2024-10-07 19:05:23 +02:00
Jan Svoboda	b1aea98cfa	[clang] Make deprecations of some `FileManager` APIs formal (#110014 ) Some `FileManager` APIs still return `{File,Directory}Entry` instead of the preferred `{File,Directory}EntryRef`. These are documented to be deprecated, but don't have the attribute that warns on their usage. This PR marks them as such with `LLVM_DEPRECATED()` and replaces their usage with the recommended counterparts. NFCI.	2024-09-25 10:36:44 -07:00
Nikolas Klauser	f5be5cdaad	[Clang] Add __builtin_common_type (#99473 ) This implements the logic of the `common_type` base template as a builtin alias. If there should be no `type` member, an empty class is returned. Otherwise a specialization of a `type_identity`-like class is returned. The base template (i.e. `std::common_type`) as well as the empty class and `type_identity`-like struct are given as arguments to the builtin.	2024-09-22 09:25:52 +02:00
Aaron Ballman	1881f648e2	Remove ^^ as a token in OpenCL (#108224 ) OpenCL has a reserved operator (^^), the use of which was diagnosed as an error (735c6cdebdcd4292928079cb18a90f0dd5cd65fb). However, OpenCL also encourages working with the blocks language extension. This token has a parsing ambiguity as a result. Consider: unsigned x=0; unsigned y=x^^{return 0;}(); This should result in y holding the value zero (0^0) through an immediately invoked block call as the right-hand side of the xor operator. However, it causes errors instead because of this reserved token: https://godbolt.org/z/navf7jTv1 This token is still reserved in OpenCL 3.0, so we still wish to issue a diagnostic for its use. However, we do not need to create a token for an extension point that's been unused for about a decade. So this patch moves the diagnostic from a parsing diagnostic to a lexing diagnostic and no longer forms a single token. The diagnostic behavior is slightly worse as a result, but still seems acceptable. Part of the reason this is coming up is because WG21 is considering using ^^ as a token for reflection, so this token may come back in the future.	2024-09-16 07:46:58 -04:00
Mital Ashok	4137309842	[Clang] Warn with -Wpre-c23-compat instead of -Wpre-c++17-compat for u8 character literals in C23 (#97210 ) Co-authored-by: cor3ntin <corentinjabot@gmail.com>	2024-09-05 10:15:54 +02:00
Jan Svoboda	17dc43d623	[clang] Stop adjusting the module cache path (#102540 ) This patch stops adjustments of the module cache path beyond what is done in `ParseHeaderSearchArgs` (making it absolute and removing dots). This enables more efficient implementation of the caching VFS in https://github.com/llvm/llvm-project/pull/88800.	2024-08-13 08:39:11 -07:00
Aaron Ballman	8f0c865d10	Fix a crash with empty escape sequences when lexing (#102339 ) The utilities we use for lexing string and character literals can be run in a mode where we pass a null pointer for the diagnostics engine. This mode is used by the format string checkers, for example. However, there were two places that failed to account for a null diagnostic engine pointer: `\o{}` and `\x{}`. This patch adds a check for a null pointer and correctly handles fallback behavior. Fixes #102218	2024-08-08 07:32:39 -04:00
Kazu Hirata	1fa7f05b70	[clang] Construct SmallVector with ArrayRef (NFC) (#101898 )	2024-08-04 23:46:34 -07:00
Krystian Stasiowski	708a9a06cb	[Clang][Parse] Fix ambiguity with nested-name-specifiers that may declarative (#96364 ) Consider the following: ``` template<typename T> struct A { }; template<typename T> int A<T>::B::* f(); // error: no member named 'B' in 'A<T>' ``` Although this is clearly valid, clang rejects it because the _nested-name-specifier_ `A<T>::` is parsed as-if it was declarative, meaning, we parse it as-if it was the _nested-name-specifier_ in a redeclaration/specialization. However, we don't (and can't) know whether the _nested-name-specifier_ is declarative until we see the '`*`' token, but at that point we have already complained that `A` has no member named `B`! This patch addresses this bug by adding support for _fully_ unannotated _and_ unbounded tentative parsing, which allows for us to parse past tokens without having to cache them until we reach a point where we can guarantee to be past the construct we are disambiguating. I don't know where the approach taken here is ideal -- alternatives are welcome. However, the performance impact (as measured by llvm-compile-time-tracker (https://llvm-compile-time-tracker.com/?config=Overview&stat=instructions%3Au&remote=sdkrystian) is quite minimal (0.09%, which I plan to further improve).	2024-07-29 14:01:00 -04:00
Rainer Orth	599de13555	[clang] Handle tm mangling on Solaris in PPMacroExpansion.cpp (#100724 ) The introduction of `std::put_time` in fad17b43dbc09ac7e0a95535459845f72c2b739a broke the Solaris build: ``` Undefined first referenced symbol in file _ZNKSt8time_putIcSt19ostreambuf_iteratorIcSt11char_traitsIcEEE3putES3_RSt8ios_basecPKSt2tmPKcSB_ lib/libclangLex.a(PPMacroExpansion.cpp.o) ld: fatal: symbol referencing errors ``` As discussed in PR #99075, the problem is that GCC mangles `std::tm` as `tm` on Solaris for binary compatility, while `clang` doesn't (Issue #33114). As a stop-gap measure, this patch introduces an `asm` level alias to the same effect. Tested on `sparcv9-sun-solaris2.11`, `amd64-pc-solaris2.11`, and `x86_64-pc-linux-gnu`.	2024-07-26 16:51:08 +02:00
Yi-Chi Lee	fad17b43db	[clang] replaced the usage of `asctime` with `std::put_time` (#99075 ) In `clang/lib/Lex/PPMacroExpansion.cpp`, replaced the usage of the obsolete `asctime` function with `std::put_time` for generating timestamp strings. Fixes: https://github.com/llvm/llvm-project/issues/98724	2024-07-24 08:56:37 +02:00
Volodymyr Sapsai	cbd5ba20d1	[Modules] Don't search for modulemaps in the immediate sub-directories of search paths for recent Apple SDKs. (#100005 ) Such searches can be costly and non-intuitive. We've seen complaints from developers that they don't expect clang to find modules on their own and not in search paths that developers provide. Keeping the search of modulemaps in subdirectories for code completion as it provides better user experience. If you are defining module "UsefulCode" in "include/UnrelatedName/module.modulemap", it is recommended to rename the directory "UnrelatedName" to "UsefulCode". If you cannot do so, you can add to "include/module.modulemap" a line like `extern module UsefulCode "UnrelatedName/module.modulemap"`, so clang can find module "UsefulCode" without checking each subdirectory in "include/". rdar://106677321 --------- Co-authored-by: Jan Svoboda <jan@svoboda.ai>	2024-07-23 13:59:44 -07:00
Argyrios Kyrtzidis	d19e71db8a	[clang/Lex/DependencyDirectivesScanner] Ignore import/include directives with missing filenames without failing the scan (#100126 ) Follow-up to `34ab855826b8cb0c3b46c770b83390bd1fe95c64`: * Don't fail the scan with an include with missing filename, it may be inside a skipped preprocessor block. Let the compilation provide any related error. * Fix an issue where the lexer was skipping through the next directive, after ignoring the include with missing filename.	2024-07-23 12:59:59 -07:00
Cyndy Ishida	34ab855826	[clang][deps] Ignore import/include directives with missing filenames (#99520 ) Previously source input like `#import ` resulted in infinite calls append the same token into `CurDirTokens`. This patch now ignores those directive lines if they won't actually end up being compiled. (e.g. macro guarded) resolves: rdar://121247565	2024-07-22 21:10:05 -07:00
yronglin	c91e85278c	Revert "[Clang] Implement P3034R1 Module Declarations Shouldn’t be Macros" (#99838 ) Reverts llvm/llvm-project#90574	2024-07-22 13:16:51 +08:00
yronglin	e77a01d79a	[Clang] Implement P3034R1 Module Declarations Shouldn’t be Macros (#90574 ) This PR implement [P3034R1 Module Declarations Shouldn’t be Macros](https://wg21.link/P3034R1), and refactor the convoluted state machines in module name lexical analysis. --------- Signed-off-by: yronglin <yronglin777@gmail.com> Co-authored-by: Aaron Ballman <aaron@aaronballman.com> Co-authored-by: cor3ntin <corentinjabot@gmail.com>	2024-07-20 20:41:00 +08:00
Mariya Podchishchaeva	c813667095	[clang] Fix static analyzer concerns in #embed code (#99331 ) 1. Dead code in `LookupEmbedFile`. The loop always exited on the first iteration. This was also causing a bug of not checking all directories provided by `--embed-dir`. 2. Use of uninitialized variable `CurTok` in `LexEmbedParameters`. It was used to initialize the field which seems to be unused. Removed unused field, this way `CurTok` should be initialized by Lex method.	2024-07-19 09:33:35 +02:00
Michael Spencer	4272847546	[clang][deps] Don't treat ObjC method args as module directives (#97654 ) `import:(type)name` is a method argument decl in ObjC, but the C++20 preprocessing rules say this is a preprocessing line. Because the dependency directive scanner is not language dependent, this patch extends the C++20 rule to exclude `module :` (which is never a valid module decl anyway), and `import :` that is not followed by an identifier. This is ok to do because in C++20 mode the compiler will later error on lines like this anyway, and the dependencies the scanner returns are still correct.	2024-07-18 13:37:01 -07:00
Cyndy Ishida	1c55586e9a	[clang] Fix typo in comments	2024-07-18 10:12:36 -07:00
Sirraide	e46468407a	[Clang] Allow raw string literals in C as an extension (#88265 ) This enables raw R"" string literals in C in some language modes and adds an option to disable or enable them explicitly as an extension. Background: GCC supports raw string literals in C in `-gnuXY` modes starting with gnu99. This pr both enables raw string literals in gnu99 mode and later in C and adds an `-f[no-]raw-string-literals` flag to override this behaviour. The decision not to enable raw string literals in gnu89 mode, according to the GCC devs, is intentional as that mode is supposed to be used for ‘old code’ that they don’t want to break; we’ve decided to match GCC’s behaviour here as well. The `-fraw-string-literals` flag can additionally be used to enable raw string literals in modes where they aren’t enabled by default (such as c99—as opposed to gnu99—or even e.g. C++03); conversely, the negated flag can be used to disable them in any gnuXY modes that do provide them by default, or to override a previous flag. However, we do not support disabling raw string literals (or indeed either of these two options) in C++11 mode and later, because we don’t want to just start supporting disabling features that are actually part of the language in the general case. This fixes #85703.	2024-07-10 12:10:44 +02:00
Mike Rice	54b61adc0c	[NFC][clang] Replace unreachable code in literal processing with assert (#96579 ) Address static verifier concerns about dead code in DoubleUnderscore check. Replace it with an assert.	2024-06-25 07:14:40 -07:00
Fangrui Song	f5b93ae588	[clang] Fix -Wsign-compare in 32-bit builds	2024-06-22 13:40:05 -07:00
Fangrui Song	12c0281f8c	[Lex] Replace hash_combine with a stable hash	2024-06-20 23:26:15 -07:00
Mariya Podchishchaeva	41c6e43792	Reland [clang][Sema, Lex, Parse] Preprocessor embed in C and C++ (#95802 ) This commit implements the entirety of the now-accepted [N3017 -Preprocessor Embed](https://www.open-std.org/jtc1/sc22/wg14/www/docs/n3017.htm) and its sister C++ paper [p1967](https://wg21.link/p1967). It implements everything in the specification, and includes an implementation that drastically improves the time it takes to embed data in specific scenarios (the initialization of character type arrays). The mechanisms used to do this are used under the "as-if" rule, and in general when the system cannot detect it is initializing an array object in a variable declaration, will generate EmbedExpr AST node which will be expanded by AST consumers (CodeGen or constant expression evaluators) or expand embed directive as a comma expression. This reverts commit `682d461d5a`. --------- Co-authored-by: The Phantom Derpstorm <phdofthehouse@gmail.com> Co-authored-by: Aaron Ballman <aaron@aaronballman.com> Co-authored-by: cor3ntin <corentinjabot@gmail.com> Co-authored-by: H. Vetinari <h.vetinari@gmx.com>	2024-06-20 14:38:46 +02:00
Chuanqi Xu	da2ad44119	[HeaderSearch] Introduce LazyIdentifierInfoPtr for Controlling Macro in HeaderFileInfo This patch is helpful to reduce 32 bits for HeaderFileInfo by combining a uint32_t and pointer into a tagged pointer. This is reviewed as part of https://github.com/llvm/llvm-project/pull/92085 and required to be split as a separate commit	2024-06-20 13:13:47 +08:00
Ian Anderson	29b0d57554	[clang][modules] HeaderSearch::MarkFileModuleHeader sets textual headers' HeaderFileInfo non-external when it shouldn't (#89005 ) HeaderSearch::MarkFileModuleHeader is no longer properly checking for no-changes, and so sets the HeaderFileInfo for every `textual header` to non-external.	2024-06-15 07:41:58 -07:00
kadir çetinkaya	8e0ba08448	[clang][HeaderSearch] Fix handling of relative file-paths in suggestPathToFileForDiagnostics (#95121 ) Normalize header-to-be-spelled using WorkingDir, similar to search paths themselves. Addresses https://github.com/llvm/llvm-project/issues/81215.	2024-06-14 16:07:42 +02:00
Ziqing Luo	2e7b95e4c0	[Safe Buffers] Serialize unsafe_buffer_usage pragmas (#92031 ) The commit adds serialization and de-serialization implementations for the stored regions. Basically, the serialized representation of the regions of a PP is a (ordered) sequence of source location encodings. For de-serialization, regions from loaded files are stored by their ASTs. When later one queries if a loaded location L is in an opt-out region, PP looks up the regions of the loaded AST where L is at. (Background if helps: a pair of `#pragma clang unsafe_buffer_usage begin/end` pragmas marks a warning-opt-out region. The begin and end locations (opt-out regions) are stored in preprocessor instances (PP) and will be queried by the `-Wunsafe-buffer-usage` analyzer.) The reported issue at upstream: https://github.com/llvm/llvm-project/issues/90501 rdar://124035402	2024-06-13 22:44:24 -07:00
Aaron Ballman	4f09ac7705	Fix off-by-one issue found by post-commit review	2024-06-13 07:48:08 -04:00
Vitaly Buka	682d461d5a	Revert "✨ [Sema, Lex, Parse] Preprocessor embed in C and C++ (and Obj-C and Obj-C++ by-proxy)" (#95299 ) Reverts llvm/llvm-project#68620 Introduce or expose a memory leak and UB, see llvm/llvm-project#68620	2024-06-12 13:14:26 -07:00
The Phantom Derpstorm	5989450e00	[clang][Sema, Lex, Parse] Preprocessor embed in C and C++ (and Obj-C and Obj-C++ by-proxy) (#68620 ) This commit implements the entirety of the now-accepted [N3017 - Preprocessor Embed](https://www.open-std.org/jtc1/sc22/wg14/www/docs/n3017.htm) and its sister C++ paper [p1967](https://wg21.link/p1967). It implements everything in the specification, and includes an implementation that drastically improves the time it takes to embed data in specific scenarios (the initialization of character type arrays). The mechanisms used to do this are used under the "as-if" rule, and in general when the system cannot detect it is initializing an array object in a variable declaration, will generate EmbedExpr AST node which will be expanded by AST consumers (CodeGen or constant expression evaluators) or expand embed directive as a comma expression. --------- Co-authored-by: Aaron Ballman <aaron@aaronballman.com> Co-authored-by: cor3ntin <corentinjabot@gmail.com> Co-authored-by: H. Vetinari <h.vetinari@gmx.com> Co-authored-by: Podchishchaeva, Mariya <mariya.podchishchaeva@intel.com>	2024-06-12 09:16:02 +02:00
Pavel Samolysov	69e9e779b7	[clang] Replace X && isa<Y>(X) with isa_and_nonnull<Y>(X). NFC (#94987 ) This addresses a clang-tidy suggestion.	2024-06-11 05:30:50 +03:00
Nishith Kumar M Shah	0559eaff5a	Revert "Pass LangOpts from CompilerInstance to DependencyScanningWorker (#93753 )" (#94488 ) This reverts commit 9862080b1cbf685c0d462b29596e3f7206d24aa2.	2024-06-05 11:42:13 -07:00
Nishith Kumar M Shah	9862080b1c	Pass LangOpts from CompilerInstance to DependencyScanningWorker (#93753 ) This commit fixes https://github.com/llvm/llvm-project/issues/88896 by passing LangOpts from the CompilerInstance to DependencyScanningWorker so that the original LangOpts are preserved/respected. This makes for more accurate parsing/lexing when certain language versions or features specific to versions are to be used.	2024-06-03 17:20:43 +02:00
cor3ntin	2ace7bdcfe	[Clang] allow ` @$ `` in raw string delimiters in C++26 (#93216 ) And as an extension in older language modes. Per https://eel.is/c++draft/lex.string#nt:d-char Fixes #93130	2024-05-28 15:38:02 +02:00
Vlad Serebrennikov	84aee95124	[clang][NFC] Remove `const_cast` from `Preprocessor::addModuleMacro()`	2024-05-17 18:11:49 +03:00
Serge Pavlov	f4066fa2dd	[clang] Use constant rounding mode for floating literals (#90877 ) Conversion of floating-point literal to binary representation must be made using constant rounding mode, which can be changed using pragma FENV_ROUND. For example, the literal "0.1F" should be representes by either 0.099999994 or 0.100000001 depending on the rounding direction.	2024-05-17 12:06:34 +07:00
Kazu Hirata	deffae5da1	[clang] Use StringRef::operator== instead of StringRef::equals (NFC) (#91844 ) I'm planning to remove StringRef::equals in favor of StringRef::operator==. - StringRef::operator==/!= outnumber StringRef::equals by a factor of 24 under clang/ in terms of their usage. - The elimination of StringRef::equals brings StringRef closer to std::string_view, which has operator== but not equals. - S == "foo" is more readable than S.equals("foo"), especially for !Long.Expression.equals("str") vs Long.Expression != "str".	2024-05-11 11:38:52 -07:00
cor3ntin	2dbe89d150	[Clang] Implement __reference_converts_from_temporary (#91199 ) This completes the required language support for P2255R2.	2024-05-10 08:50:44 +02:00
David Stone	7d913c5ea9	[clang][Modules] Make `Module::Requirement` a struct (NFC) (#67900 ) `Module::Requirement` was defined as a `std::pair<std::string, bool>`. This required a comment to explain what the data members mean and makes the usage harder to understand. Replace this with a struct with two members, `FeatureName` and `RequiredState`. --------- Co-authored-by: cor3ntin <corentinjabot@gmail.com>	2024-05-05 09:45:04 +02:00
Fangrui Song	6f390ea60d	[Lex] Fix clang -Wparentheses after #89923	2024-04-29 13:47:09 -07:00
Troy Butler	99ce84cef0	[clang][NFC] Reformat suspicious condition (#89923 ) Addresses issue #89805. Assignment + comparison performed in conditional statement. Resolved by parenthesizing comparison operation. Fixes #89805. --------- Signed-off-by: Troy-Butler <squintik@outlook.com>	2024-04-29 21:24:29 +02:00
Jan Svoboda	d609029d6c	[clang][modules] Allow module maps with textual headers to be non-affecting (#89441 ) When writing out a PCM, we skip serializing headers' `HeaderFileInfo` struct whenever this condition evaluates to `true`: ```c++ !HFI \|\| (HFI->isModuleHeader && !HFI->isCompilingModuleHeader) ``` However, when Clang parses a module map file, each textual header gets a `HFI` with `isModuleHeader=false`, `isTextualModuleHeader=true` and `isCompilingModuleHeader=false`. This means the condition evaluates to `false` even if the header was never included and the module map did not affect the compilation. Each PCM file that happened to parse such module map then contains a copy of the `HeaderFileInfo` struct for all textual headers, and considers the containing module map affecting. This patch makes it so that we skip headers that have not been included, essentially removing the virality of textual headers when it comes to PCM serialization.	2024-04-24 09:05:56 -07:00
js324	ca1f1c9572	[BitInt] Expose a _BitInt literal suffix in C++ (#86586 ) This exposes _BitInt literal suffixes __wb and u__wb as an extension in C++. There is a new Extension warning, and the tests are essentially the same as the existing _BitInt literal tests for C but with a few additional cases. Fixes #85223	2024-04-22 14:42:57 -04:00
Jan Svoboda	84df7a09f8	[clang][modules] Do not resolve `HeaderFileInfo` externally in `ASTWriter` (#87848 ) Clang uses the `HeaderFileInfo` struct to track bits of information on header files, which gets used throughout the compiler. We also use this to compute the set of affecting module maps in `ASTWriter` and in the end serialize the information into the `HEADER_SEARCH_TABLE` record of a PCM file, allowing clients to learn about headers from the module. In doing so, Clang asks for existing `HeaderFileInfo` for all known `FileEntries`. Note that this asks the loaded PCM files for the information they have on each header file in question. This seems unnecessary: we only want to serialize information on header files that either belong to the current module or that got included textually. Loaded PCM files can't provide us with any useful information. For explicit modules with lazy loading (using `-fmodule-map-file=<path>` with `-fmodule-file=<name>=<path>`) the compiler knows about header files listed in the module map files on the command-line. This can be a large number. Asking for existing `HeaderFileInfo` can trigger deserialization of `HEADER_SEARCH_TABLE` from loaded PCM files. Keys of the on-disk hash table consist of the header file size and modification time. However, with explicit modules Clang zeroes out the modification time. Moreover, if you import lots of modules, some of their header files end up having identical sizes. This means lots of hash collisions that can only be resolved by running the serialized filename through `FileManager` and comparing equality of the `FileEntry`. This ends up being super expensive, essentially re-stating lots of the transitively loaded SDK header files. This patch cleans up the API for getting `HeaderFileInfo` and makes sure `ASTWriter` uses the version that doesn't ask loaded PCM files for more information. This removes the excessive stat traffic coming from `ASTWriter` hopefully without changing observable behavior.	2024-04-11 14:44:55 -07:00
Bill Wendling	fca51911d4	[NFC][Clang] Improve const correctness for IdentifierInfo (#79365 ) The IdentifierInfo isn't typically modified. Use 'const' wherever possible.	2024-04-11 00:33:40 +00:00

1 2 3 4 5 ...

2872 Commits