llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-05-03 09:16:06 +00:00


Evgeniy Stepanov 93db40a147 Always_inline codegen rewrite.

The current implementation may end up emitting an undefined reference for
an "inline __attribute__((always_inline))" function by generating an
"available_externally alwaysinline" IR function for it and then failing to
inline all the calls. This happens when a call to such a function is in dead
code. As the inliner is an SCC pass, it does not process dead code.

Libc++ relies on the compiler never emitting such an undefined reference.

With this patch, we emit a pair of
1. internal alwaysinline definition (called F.alwaysinline)
2a. A stub F() { musttail call F.alwaysinline }
  -- or, depending on the linkage --
2b. A declaration of F.

The frontend ensures that F.alwaysinline is only used for direct
calls, and the stub is used for everything else (taking the address of
the function, really). Declaration (2b) is emitted in the case when
"inline" is meant for inlining only (like __gnu_inline__ and some
other cases).

This approach, among other nice properties, ensures that alwaysinline
functions are always internal, making it impossible for a direct call
to such a function to produce an undefined symbol reference.
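
As an illustration only, the two kinds of use the message distinguishes (a direct call, here sitting in dead code, and taking the function's address) might look like the sketch below. The names twice, direct_call_in_dead_code and call_through_pointer are hypothetical and do not come from the patch. Whether the original IR function would actually be "available_externally" depends on the language mode and linkage (the message mentions __gnu_inline__, for example), so this shows only the source-level shape, not a reproducer.

```cpp
// Hypothetical always_inline function; the name is illustrative only.
inline __attribute__((always_inline)) int twice(int x) { return 2 * x; }

// Direct call located in dead code. Per the message, such a call could be
// left un-inlined by the old scheme; under the new scheme a direct call
// would refer to the internal F.alwaysinline definition.
int direct_call_in_dead_code(int x) {
  if (false) {
    return twice(x);
  }
  return x;
}

// Taking the address is the "everything else" case, which the message says
// is served by the stub (2a) or the declaration (2b).
int call_through_pointer(int x) {
  int (*fp)(int) = &twice;
  return fp(x);
}
```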

This patch is based on ideas by Chandler Carruth and Richard Smith.

llvm-svn: 247494

2015-09-12 01:07:37 +00:00

ABIInfo.h

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

Address.h

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

BackendUtil.cpp

Convert SampleProfile pass into a Module pass.

2015-08-25 15:25:13 +00:00

CGAtomic.cpp

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

CGBlocks.cpp

When comparing two block captures for layout, don't crash

2015-09-11 22:00:51 +00:00

CGBlocks.h

Move BlockByrefHelpers back to CodeGenModule.h to placate MSVC.

2015-09-08 08:21:11 +00:00

CGBuilder.h

Remove unnecessary braces; this resolves against a

2015-09-08 08:57:00 +00:00

CGBuiltin.cpp

Fix vld1_lane intrinsic generation

2015-09-09 01:37:18 +00:00

CGCall.cpp

Record function attribute "stackrealign" instead of using backend option

2015-09-11 18:55:09 +00:00

CGCall.h

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

CGClass.cpp

Always_inline codegen rewrite.

2015-09-12 01:07:37 +00:00

CGCleanup.cpp

[SEH] Use cleanupendpad so that WinEHPrepare gets the coloring right

2015-09-10 22:11:13 +00:00

CGCleanup.h

[SEH] Use cleanupendpad so that WinEHPrepare gets the coloring right

2015-09-10 22:11:13 +00:00

CGCUDANV.cpp

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

CGCUDARuntime.cpp

Pass expressions instead of argument ranges to EmitCall/EmitCXXConstructorCall.

2014-08-21 20:26:47 +00:00

CGCUDARuntime.h

Revert r240270 ("Fixed/added namespace ending comments using clang-tidy").

2015-06-22 23:07:51 +00:00

CGCXX.cpp

Always_inline codegen rewrite.

2015-09-12 01:07:37 +00:00

CGCXXABI.cpp

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

CGCXXABI.h

[MS ABI] Make member pointers return true for isIncompleteType

2015-09-10 21:52:00 +00:00

CGDebugInfo.cpp

Remove an unnecessary check. NFC

2015-09-11 18:54:31 +00:00

CGDebugInfo.h

Module Debugging: Emit forward declarations for types that are defined in

2015-09-11 17:23:08 +00:00

CGDecl.cpp

clangCodeGen: Fix comments. [-Wdocumentation]

2015-09-08 09:42:41 +00:00

CGDeclCXX.cpp

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

CGException.cpp

[CodeGen] Teach SimplifyPersonality about the updated LandingPadInst

2015-09-11 15:40:05 +00:00

CGExpr.cpp

[OPENMP] Preserve alignment of the original variables for the captured references.

2015-09-11 10:29:41 +00:00

CGExprAgg.cpp

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

CGExprComplex.cpp

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

CGExprConstant.cpp

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

CGExprCXX.cpp

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

CGExprScalar.cpp

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

CGLoopInfo.cpp

Add new llvm.loop.unroll.enable metadata for use with "#pragma unroll".

2015-08-10 17:29:39 +00:00

CGLoopInfo.h

Add new llvm.loop.unroll.enable metadata for use with "#pragma unroll".

2015-08-10 17:29:39 +00:00

CGObjC.cpp

ARC: Fix the precise-lifetime suppression of returns_inner_pointer

2015-09-09 23:37:17 +00:00

CGObjCGNU.cpp

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

CGObjCMac.cpp

Support noreturn in limited contexts on Objective-C message sends.

2015-09-10 22:27:50 +00:00

CGObjCRuntime.cpp

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

CGObjCRuntime.h

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

CGOpenCLRuntime.cpp

…

CGOpenCLRuntime.h

Revert r240270 ("Fixed/added namespace ending comments using clang-tidy").

2015-06-22 23:07:51 +00:00

CGOpenMPRuntime.cpp

Always_inline codegen rewrite.

2015-09-12 01:07:37 +00:00

CGOpenMPRuntime.h

Fix \param in r247251. [-Wdocumentation]

2015-09-11 08:13:32 +00:00

CGRecordLayout.h

Respect alignment of nested bitfields

2015-07-10 17:30:00 +00:00

CGRecordLayoutBuilder.cpp

Respect alignment of nested bitfields

2015-07-10 17:30:00 +00:00

CGStmt.cpp

convert builtin_unpredictable on a switch into metadata for LLVM

2015-09-09 22:39:06 +00:00

CGStmtOpenMP.cpp

[OPENMP] Preserve alignment of the original variables for the captured references.

2015-09-11 10:29:41 +00:00

CGValue.h

Introduce __builtin_nontemporal_store and __builtin_nontemporal_load.

2015-09-08 23:52:33 +00:00

CGVTables.cpp

Revert "Generating assumption loads of vptr after ctor call (fixed)"

2015-09-10 20:18:30 +00:00

CGVTables.h

Header guard canonicalization, clang part.

2014-08-13 16:25:19 +00:00

CGVTT.cpp

Remove and forbid raw_svector_ostream::flush() calls.

2015-08-13 18:12:56 +00:00

CMakeLists.txt

[CMake] Fill up required libs, corresponding to r241653.

2015-07-08 02:06:21 +00:00

CodeGenABITypes.cpp

LLVM API Change: the Module always owns the DataLayout

2015-07-24 16:04:29 +00:00

CodeGenAction.cpp

[CUDA] Postprocess bitcode linked in during device-side CUDA compilation.

2015-09-10 18:24:23 +00:00

CodeGenFunction.cpp

[MS ABI] Make member pointers return true for isIncompleteType

2015-09-10 21:52:00 +00:00

CodeGenFunction.h

Revert "Generating assumption loads of vptr after ctor call (fixed)"

2015-09-10 20:18:30 +00:00

CodeGenModule.cpp

Always_inline codegen rewrite.

2015-09-12 01:07:37 +00:00

CodeGenModule.h

Always_inline codegen rewrite.

2015-09-12 01:07:37 +00:00

CodeGenPGO.cpp

Switch users of the 'for (StmtRange range = stmt->children(); range; ++range)' pattern to range for loops.

2015-07-02 21:03:14 +00:00

CodeGenPGO.h

InstrProf: Cede ownership of createProfileWeights to CGF

2015-05-02 05:00:55 +00:00

CodeGenTBAA.cpp

Remove and forbid raw_svector_ostream::flush() calls.

2015-08-13 18:12:56 +00:00

CodeGenTBAA.h

Header guard canonicalization, clang part.

2014-08-13 16:25:19 +00:00

CodeGenTypeCache.h

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

CodeGenTypes.cpp

LLVM API Change: the Module always owns the DataLayout

2015-07-24 16:04:29 +00:00

CodeGenTypes.h

Remove superfluous private:, TypeCache is private by default.

2015-08-13 07:12:03 +00:00

CoverageMappingGen.cpp

Use llvm::reverse to make a bunch of loops use foreach. NFC.

2015-07-30 17:22:52 +00:00

CoverageMappingGen.h

[cleanup] Re-sort *all* #include lines with llvm/utils/sort_includes.py

2015-01-14 11:29:14 +00:00

EHScopeStack.h

[SEH] Use cleanupendpad so that WinEHPrepare gets the coloring right

2015-09-10 22:11:13 +00:00

ItaniumCXXABI.cpp

Always_inline codegen rewrite.

2015-09-12 01:07:37 +00:00

Makefile

…

MicrosoftCXXABI.cpp

[MS ABI] Make member pointers return true for isIncompleteType

2015-09-10 21:52:00 +00:00

ModuleBuilder.cpp

Rename DescriptionString -> DataLayoutString as it matches the actual

2015-08-05 23:48:05 +00:00

ObjectFilePCHContainerOperations.cpp

Debug Info: Remove an unnecessary debug type visitor.

2015-09-10 17:13:31 +00:00

README.txt

…

SanitizerMetadata.cpp

[ASan] Initial support for Kernel AddressSanitizer

2015-06-19 12:19:07 +00:00

SanitizerMetadata.h

Removing LLVM_DELETED_FUNCTION, as MSVC 2012 was the last reason for requiring the macro. NFC; Clang edition.

2015-02-15 22:54:08 +00:00

TargetInfo.cpp

Compute and preserve alignment more faithfully in IR-generation.

2015-09-08 08:05:57 +00:00

TargetInfo.h

[OPENMP] Introduced type trait "__builtin_omp_required_simd_align" for default simd alignment.

2015-07-02 03:40:19 +00:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates a zext/sext of x, which can easily be avoided.
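
A minimal sketch of why the extension is avoidable, assuming a 16-bit short: because the constant 10 fits in the narrow type, comparing only the low 16 bits gives the same answer as comparing after promotion to int. The helper names below are hypothetical, and the code checks the equivalence at the source level rather than showing what IRgen emits.

```cpp
#include <cstdint>

// The source pattern from the note: x is promoted to int before the compare.
bool is_ten(short x) { return x == 10; }

// Hypothetical narrow form: looks only at the 16-bit pattern of the value.
bool is_ten_narrow_bits(std::uint16_t bits) { return bits == 10u; }

// Exhaustively checks that both forms agree for every 16-bit value.
bool narrow_compare_agrees() {
  for (int v = -32768; v <= 32767; ++v) {
    short s = static_cast<short>(v);
    if (is_ten(s) != is_ten_narrow_bits(static_cast<std::uint16_t>(s))) {
      return false;
    }
  }
  return true;
}
```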

//===---------------------------------------------------------------------===//

Bitfield accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned, then it is a lot shorter to just load the char
directly.
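
The sketch below illustrates the 8-bit case under assumed conditions: a signed 8-bit field occupying bits 8..15 of a 32-bit storage unit stored little-endian, so the field coincides with byte 1. The layout and all function names are hypothetical. One version loads the whole unit and shifts and masks; the other reads the single byte.

```cpp
#include <cstdint>

// Assembles the 32-bit storage unit from four little-endian bytes.
std::uint32_t load_unit_le(const unsigned char* p) {
  return static_cast<std::uint32_t>(p[0]) |
         (static_cast<std::uint32_t>(p[1]) << 8) |
         (static_cast<std::uint32_t>(p[2]) << 16) |
         (static_cast<std::uint32_t>(p[3]) << 24);
}

// Generic access: load the unit, shift the field down, mask, sign-extend.
int field_via_shift_and_mask(const unsigned char* p) {
  std::uint32_t unit = load_unit_le(p);
  std::uint32_t raw = (unit >> 8) & 0xFFu;
  return static_cast<int>(raw ^ 0x80u) - 0x80;  // sign-extend from 8 bits
}

// Shorter access: the field is byte-aligned, so read that byte directly.
int field_via_byte_load(const unsigned char* p) {
  return static_cast<int>(p[1] ^ 0x80u) - 0x80;  // sign-extend from 8 bits
}
```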

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of allocas for formal arguments
for the common situation where the argument is never written to and
never has its address taken. The idea would be to begin generating code
by using the argument directly and, if its address is taken or it is
stored to, then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about here is -O0 -g compile-time
performance, and in that scenario we currently need to emit the alloca
anyway to emit proper debug info. So this is blocked on being able to
emit debug information that refers to an LLVM temporary rather than an
alloca.
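
For illustration, the three functions below (hypothetical names) show the source-level cases the note distinguishes: an argument that is only read, one that is stored to, and one whose address is taken. Only the first would be a candidate for using the incoming value directly under the idea sketched above.

```cpp
// Argument is only read: in principle no stack slot is needed for x.
int add_one(int x) { return x + 1; }

// Argument is stored to: x needs a mutable home (an alloca in IR terms).
int clamp_to_zero(int x) {
  if (x < 0) {
    x = 0;
  }
  return x;
}

// Address of the argument is taken: x must live in memory.
int bump_via_pointer(int x) {
  int* p = &x;
  *p += 1;
  return x;
}
```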

//===---------------------------------------------------------------------===//

We should try to avoid generating basic blocks that contain only
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead) down through code generation and assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!
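
A toy model of the idea, not LLVM's API: the Block struct and both functions below are hypothetical. The sketch counts blocks that hold nothing but an unconditional branch and shows how a branch aimed at such a block could be retargeted to where the chain finally lands.

```cpp
#include <cstddef>
#include <vector>

// Toy basic block: a count of non-terminator instructions plus the index of
// the unconditional successor, or -1 if the block does not end in an
// unconditional branch.
struct Block {
  int num_instructions;
  int succ;
};

// True if the block consists solely of an unconditional branch.
bool is_jump_only(const Block& b) {
  return b.num_instructions == 0 && b.succ >= 0;
}

// Counts the blocks that are just direct branches.
std::size_t count_jump_only(const std::vector<Block>& cfg) {
  std::size_t n = 0;
  for (const Block& b : cfg) {
    if (is_jump_only(b)) ++n;
  }
  return n;
}

// Follows a chain of jump-only blocks starting at index b and returns the
// first block that does real work. The step limit guards against cycles.
int resolve_target(const std::vector<Block>& cfg, int b) {
  std::size_t steps = 0;
  while (steps < cfg.size() && is_jump_only(cfg[static_cast<std::size_t>(b)])) {
    b = cfg[static_cast<std::size_t>(b)].succ;
    ++steps;
  }
  return b;
}
```

In this model, a branch to block 1 in the CFG {2,1}, {0,2}, {0,3}, {1,-1} could be emitted as a branch to block 3, leaving blocks 1 and 2 unnecessary.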

//===---------------------------------------------------------------------===//