llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-05-13 19:06:05 +00:00


Adam Nemet aad816083e [OptRemark,LDist] RFC: Add hotness attribute

Summary:
This is the first set of changes implementing the RFC from
http://thread.gmane.org/gmane.comp.compilers.llvm.devel/98334

This is a cross-sectional patch; rather than implementing the hotness
attribute for all optimization remarks and all passes in a patch set, it
implements it for the 'missed-optimization' remark for Loop
Distribution. My goal is to shake out the design issues before scaling
it up to other types and passes.

Hotness is computed as an integer by multiplying the block frequency by
the function entry count. It is currently printed only by opt, since
clang prints the diagnostic fields directly. E.g.:

remark: /tmp/t.c:3:3: loop not distributed: use -Rpass-analysis=loop-distribute for more info (hotness: 300)
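
The hotness arithmetic described above can be sketched as follows. This is a minimal illustration, not LLVM's implementation: estimateHotness and its parameters are hypothetical names, and dividing by the entry block's frequency (so that the block frequency acts as a relative weight) is an assumption that the commit message does not state. The input values are chosen for illustration only.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not an LLVM API): scales a block frequency by the
// profiled function entry count.
// Assumption: blockFreq is expressed relative to entryFreq, so dividing by
// entryFreq turns the product into an estimated execution count.
// Overflow of the intermediate product is not handled in this sketch.
inline uint64_t estimateHotness(uint64_t entryCount, uint64_t blockFreq,
                                uint64_t entryFreq) {
  assert(entryFreq != 0 && "entry block frequency must be nonzero");
  return entryCount * blockFreq / entryFreq;
}

// Example: a function entered 100 times whose block runs three times per
// entry (relative frequency 24 against an entry frequency of 8).
inline uint64_t exampleHotness() { return estimateHotness(100, 24, 8); }
```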

The new API is similar to emitOptimizationRemarkMissed. The
difference is that it additionally takes a code region that the
diagnostic corresponds to. From this, hotness is computed using BFI.
The new API is exposed via an analysis pass so that it can be made
dependent on LazyBFI. (Thanks to Hal for the analysis pass idea.)

This feature is enabled by setDiagnosticHotnessRequested in the
LLVM context. If this is off, LazyBFI is not calculated (D22141), so
there should be no overhead.

A new command-line option is added to turn this on in opt.

My plan is to switch all users of emitOptimizationRemark* to use this
module instead.

Reviewers: hfinkel

Subscribers: rcox2, mzolotukhin, llvm-commits

Differential Revision: http://reviews.llvm.org/D21771

llvm-svn: 275583

2016-07-15 17:23:20 +00:00

AliasAnalysis.cpp

[AliasAnalysis] Give back AA results for fence instructions

2016-07-15 17:19:24 +00:00

AliasAnalysisEvaluator.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

AliasAnalysisSummary.cpp

[CFLAA] Split out more things from CFLSteens. NFC.

2016-07-06 00:47:21 +00:00

AliasAnalysisSummary.h

[CFLAA] Simplify CFLGraphBuilder. NFC.

2016-07-11 22:59:09 +00:00

AliasSetTracker.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

Analysis.cpp

[OptRemark,LDist] RFC: Add hotness attribute

2016-07-15 17:23:20 +00:00

AssumptionCache.cpp

[PM] Make the AnalysisManager parameter to run methods a reference.

2016-03-11 11:05:24 +00:00

BasicAliasAnalysis.cpp

BasicAA should look through functions with returned arguments

2016-07-11 01:32:20 +00:00

BlockFrequencyInfo.cpp

[BFI] Add new LazyBFI analysis pass

2016-07-13 05:01:48 +00:00

BlockFrequencyInfoImpl.cpp

[BFI]: NFC refactoring

2016-06-22 17:12:12 +00:00

BranchProbabilityInfo.cpp

Re-submit r272891 "Prevent dangling pointer problems in BranchProbabilityInfo"

2016-07-15 14:31:16 +00:00

CallGraph.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

CallGraphSCCPass.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

CallPrinter.cpp

[CG] Rename the DOT printing pass to actually reference "DOT".

2016-03-10 11:04:40 +00:00

CaptureTracking.cpp

[CaptureTracking] Volatile operations capture their memory location

2016-05-26 17:36:22 +00:00

CFG.cpp

Avoid overly large SmallPtrSet/SmallSet

2016-01-30 01:24:31 +00:00

CFGPrinter.cpp

…

CFLAndersAliasAnalysis.cpp

[CFLAA] Split the CFL graph out from CFLSteens. NFC.

2016-07-06 00:36:12 +00:00

CFLGraph.h

Attempt to make buildbots happy.

2016-07-11 23:18:32 +00:00

CFLSteensAliasAnalysis.cpp

[CFLAA] Simplify CFLGraphBuilder. NFC.

2016-07-11 22:59:09 +00:00

CGSCCPassManager.cpp

[NFC] Header cleanup

2016-04-18 09:17:29 +00:00

CMakeLists.txt

[OptRemark,LDist] RFC: Add hotness attribute

2016-07-15 17:23:20 +00:00

CodeMetrics.cpp

use range-based for loop; NFCI

2016-03-08 20:53:48 +00:00

ConstantFolding.cpp

Simplify llvm.masked.load w/ undef masks

2016-07-14 06:58:37 +00:00

CostModel.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

Delinearization.cpp

[NFC] Header cleanup

2016-04-18 09:17:29 +00:00

DemandedBits.cpp

Port DemandedBits to the new pass manager.

2016-04-18 23:55:01 +00:00

DependenceAnalysis.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

DivergenceAnalysis.cpp

DivergenceAnalysis: Fix crash with no return blocks

2016-05-09 16:57:08 +00:00

DominanceFrontier.cpp

[PM] Make the AnalysisManager parameter to run methods a reference.

2016-03-11 11:05:24 +00:00

DomPrinter.cpp

Introduce analysis pass to compute PostDominators in the new pass manager. NFC

2016-02-25 17:54:07 +00:00

EHPersonalities.cpp

X86: permit using SjLj EH on x86 targets as an option

2016-05-31 01:48:07 +00:00

GlobalsModRef.cpp

GlobalsAA: Functions with the argmemonly attribute won't read arbitrary globals

2016-07-14 15:50:27 +00:00

IndirectCallPromotionAnalysis.cpp

Remove another unused variable from r275216

2016-07-12 23:49:17 +00:00

InlineCost.cpp

Implement callsite-hotness based inline cost for Sample-based PGO

2016-07-11 16:48:54 +00:00

InstCount.cpp

…

InstructionSimplify.cpp

Simplify llvm.masked.load w/ undef masks

2016-07-14 06:58:37 +00:00

Interval.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

IntervalPartition.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

IteratedDominanceFrontier.cpp

Correct IDF calculator for ReverseIDF

2016-04-19 06:13:28 +00:00

IVUsers.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

LazyBlockFrequencyInfo.cpp

[BFI] Add new LazyBFI analysis pass

2016-07-13 05:01:48 +00:00

LazyCallGraph.cpp

[LCG] Hoist the definitions of the stream operator friends to be inline

2016-07-07 07:52:07 +00:00

LazyValueInfo.cpp

Reformat blank lines.

2016-07-04 01:26:33 +00:00

Lint.cpp

…

LLVMBuild.txt

Refactor indirect call promotion profitability analysis (NFC)

2016-07-12 21:13:44 +00:00

Loads.cpp

Teach isDereferenceablePointer to look through returned-argument functions

2016-07-11 03:08:49 +00:00

LoopAccessAnalysis.cpp

[LAA] Don't hold on to DominatorTree in the analysis result

2016-07-13 22:36:35 +00:00

LoopInfo.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

LoopPass.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

LoopPassManager.cpp

PM: Check that loop passes preserve a basic set of analyses

2016-05-03 21:35:08 +00:00

LoopUnrollAnalyzer.cpp

[LoopUnrollAnalyzer] Fix a bug in UnrolledInstAnalyzer::visitLoad.

2016-06-23 14:31:31 +00:00

MemDepPrinter.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

MemDerefPrinter.cpp

NFC. Move isDereferenceable to Loads.h/cpp

2016-02-24 12:49:04 +00:00

MemoryBuiltins.cpp

fix formatting; NFC

2016-07-07 16:19:09 +00:00

MemoryDependenceAnalysis.cpp

Typos. NFC.

2016-06-28 17:19:10 +00:00

MemoryLocation.cpp

[TLI] Unify LibFunc signature checking. NFCI.

2016-04-27 19:04:35 +00:00

ModuleDebugInfoPrinter.cpp

…

ModuleSummaryAnalysis.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

ObjCARCAliasAnalysis.cpp

[PM] Make the AnalysisManager parameter to run methods a reference.

2016-03-11 11:05:24 +00:00

ObjCARCAnalysisUtils.cpp

…

ObjCARCInstKind.cpp

…

OptimizationDiagnosticInfo.cpp

[OptRemark,LDist] RFC: Add hotness attribute

2016-07-15 17:23:20 +00:00

OrderedBasicBlock.cpp

…

PHITransAddr.cpp

Annotate dump() methods with LLVM_DUMP_METHOD, addressing Richard Smith r259192 post commit comment.

2016-01-29 20:50:44 +00:00

PostDominators.cpp

[PM] Remove support for omitting the AnalysisManager argument to new

2016-06-17 00:11:01 +00:00

ProfileSummaryInfo.cpp

[PM] Remove support for omitting the AnalysisManager argument to new

2016-06-17 00:11:01 +00:00

PtrUseVisitor.cpp

…

README.txt

…

RegionInfo.cpp

[NFC] Header cleanup

2016-04-18 09:17:29 +00:00

RegionPass.cpp

…

RegionPrinter.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

ScalarEvolution.cpp

Teach SCEV to look through returned-argument functions

2016-07-11 02:48:23 +00:00

ScalarEvolutionAliasAnalysis.cpp

[PM] Make the AnalysisManager parameter to run methods a reference.

2016-03-11 11:05:24 +00:00

ScalarEvolutionExpander.cpp

Fix ScalarEvolutionExpander step scaling bug

2016-07-13 01:28:12 +00:00

ScalarEvolutionNormalization.cpp

Remove emacs mode markers from .cpp files. NFC

2016-04-24 17:55:41 +00:00

ScopedNoAliasAA.cpp

[PM] Make the AnalysisManager parameter to run methods a reference.

2016-03-11 11:05:24 +00:00

SparsePropagation.cpp

Apply clang-tidy's modernize-loop-convert to lib/Analysis.

2016-06-26 17:27:42 +00:00

StratifiedSets.h

[CFLAA] Simplify CFLGraphBuilder. NFC.

2016-07-11 22:59:09 +00:00

TargetLibraryInfo.cpp

Reverting r275284 due to platform-specific test failures

2016-07-13 19:09:16 +00:00

TargetTransformInfo.cpp

This implements a more optimal algorithm for selecting a base constant in

2016-07-14 07:44:20 +00:00

Trace.cpp

Annotate dump() methods with LLVM_DUMP_METHOD, addressing Richard Smith r259192 post commit comment.

2016-01-29 20:50:44 +00:00

TypeBasedAliasAnalysis.cpp

[PM] Make the AnalysisManager parameter to run methods a reference.

2016-03-11 11:05:24 +00:00

TypeMetadataUtils.cpp

[IR] Make getIndexedOffsetInType return a signed result

2016-07-13 03:42:38 +00:00

ValueTracking.cpp

[ValueTracking] Use Instruction::getFunction; NFC

2016-07-14 20:19:01 +00:00

VectorUtils.cpp

SLPVectorizer: Move propagateMetadata to VectorUtils

2016-06-30 21:17:59 +00:00

README.txt

Analysis Opportunities:

//===---------------------------------------------------------------------===//

In test/Transforms/LoopStrengthReduce/quadradic-exit-value.ll, the
ScalarEvolution expression for %r is this:

  {1,+,3,+,2}<loop>

Outside the loop, this could be evaluated simply as (%n * %n); however,
ScalarEvolution currently evaluates it as

  (-2 + (2 * (trunc i65 (((zext i64 (-2 + %n) to i65) * (zext i64 (-1 + %n) to i65)) /u 2) to i64)) + (3 * %n))

In addition to being much more complicated, it involves i65 arithmetic,
which is very inefficient when expanded into code.
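
The claim that both forms describe the same value can be checked with a small sketch. It assumes the exit value is the recurrence evaluated at iteration %n - 1, which is what the longer expression implies; the function names are hypothetical, and plain 64-bit arithmetic stands in for the i65 widening, so it is only meaningful for small n where the product does not wrap.

```cpp
#include <cassert>
#include <cstdint>

// Value of the chain of recurrences {1,+,3,+,2} at 0-based iteration i,
// using the binomial form 1 + 3*C(i,1) + 2*C(i,2) = 1 + 3*i + i*(i-1).
constexpr uint64_t addRecAt(uint64_t i) { return 1 + 3 * i + i * (i - 1); }

// The longer form quoted above: -2 + 2*(((n-2)*(n-1)) / 2) + 3*n.
// Written with the subtraction last; unsigned wraparound keeps it exact
// for the small n used here.
constexpr uint64_t expandedExit(uint64_t n) {
  return 3 * n + 2 * (((n - 2) * (n - 1)) / 2) - 2;
}

// Checks that both forms equal n*n for 1 <= n <= maxN.
inline bool formsAgree(uint64_t maxN) {
  for (uint64_t n = 1; n <= maxN; ++n) {
    if (addRecAt(n - 1) != n * n || expandedExit(n) != n * n)
      return false;
  }
  return true;
}

// Compile-time spot check for n = 5.
static_assert(addRecAt(4) == 25 && expandedExit(5) == 25,
              "both forms should equal n*n for n = 5");
```

Algebraically, 1 + 3(n-1) + (n-1)(n-2) simplifies to n*n. The widening to i65 is presumably there so that the product is not truncated before the division by 2.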

//===---------------------------------------------------------------------===//

In formatValue in test/CodeGen/X86/lsr-delayed-fold.ll,
ScalarEvolution is forming this expression:

((trunc i64 (-1 * %arg5) to i32) + (trunc i64 %arg5 to i32) + (-1 * (trunc i64 undef to i32)))

This could be folded to

(-1 * (trunc i64 undef to i32))
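
The fold rests on truncation commuting with negation in wrapping arithmetic, so the first two terms cancel for any %arg5. The sketch below models trunc i64 to i32 as an unsigned narrowing conversion; the helper names are hypothetical and the undef term is left out because it has no concrete value.

```cpp
#include <cassert>
#include <cstdint>

// Models "trunc i64 ... to i32" as a narrowing conversion (keeps low 32 bits).
constexpr uint32_t trunc32(uint64_t v) { return static_cast<uint32_t>(v); }

// The first two terms of the expression: trunc(-1 * x) + trunc(x),
// computed in wrapping 32-bit arithmetic.
constexpr uint32_t firstTwoTerms(uint64_t x) {
  return static_cast<uint32_t>(trunc32(0 - x) + trunc32(x));
}

// Runtime check that the two terms cancel for a given input.
inline void checkFold(uint64_t x) {
  assert(firstTwoTerms(x) == 0 && "trunc(-x) + trunc(x) should wrap to 0");
}

// Compile-time spot checks, including values with high bits set.
static_assert(firstTwoTerms(1) == 0, "small value cancels");
static_assert(firstTwoTerms(0x123456789ABCDEF0ULL) == 0, "wide value cancels");
```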

//===---------------------------------------------------------------------===//