123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130 |
- ================================
- LLVM Block Frequency Terminology
- ================================
- .. contents::
- :local:
- Introduction
- ============
- Block Frequency is a metric for estimating the relative frequency of different
- basic blocks. This document describes the terminology that the
- ``BlockFrequencyInfo`` and ``MachineBlockFrequencyInfo`` analysis passes use.
- Branch Probability
- ==================
- Blocks with multiple successors have probabilities associated with each
- outgoing edge. These are called branch probabilities. For a given block, the
- sum of its outgoing branch probabilities should be 1.0.
- Branch Weight
- =============
- Rather than storing fractions on each edge, we store an integer weight.
- Weights are relative to the other edges of a given predecessor block. The
- branch probability associated with a given edge is its own weight divided by
- the sum of the weights on the predecessor's outgoing edges.
- For example, consider this IR:
- .. code-block:: llvm
- define void @foo() {
- ; ...
- A:
- br i1 %cond, label %B, label %C, !prof !0
- ; ...
- }
- !0 = metadata !{metadata !"branch_weights", i32 7, i32 8}
- and this simple graph representation::
- A -> B (edge-weight: 7)
- A -> C (edge-weight: 8)
- The probability of branching from block A to block B is 7/15, and the
- probability of branching from block A to block C is 8/15.
- See :doc:`BranchWeightMetadata` for details about the branch weight IR
- representation.
- Block Frequency
- ===============
- Block frequency is a relative metric that represents the number of times a
- block executes. The ratio of a block frequency to the entry block frequency is
- the expected number of times the block will execute per entry to the function.
- Block frequency is the main output of the ``BlockFrequencyInfo`` and
- ``MachineBlockFrequencyInfo`` analysis passes.
- Implementation: a series of DAGs
- ================================
- The implementation of the block frequency calculation analyses each loop,
- bottom-up, ignoring backedges; i.e., as a DAG. After each loop is processed,
- it's packaged up to act as a pseudo-node in its parent loop's (or the
- function's) DAG analysis.
- Block Mass
- ==========
- For each DAG, the entry node is assigned a mass of ``UINT64_MAX`` and mass is
- distributed to successors according to branch weights. Block Mass uses a
- fixed-point representation where ``UINT64_MAX`` represents ``1.0`` and ``0``
- represents a number just above ``0.0``.
- After mass is fully distributed, in any cut of the DAG that separates the exit
- nodes from the entry node, the sum of the block masses of the nodes succeeded
- by a cut edge should equal ``UINT64_MAX``. In other words, mass is conserved
- as it "falls" through the DAG.
- If a function's basic block graph is a DAG, then block masses are valid block
- frequencies. This works poorly in practise though, since downstream users rely
- on adding block frequencies together without hitting the maximum.
- Loop Scale
- ==========
- Loop scale is a metric that indicates how many times a loop iterates per entry.
- As mass is distributed through the loop's DAG, the (otherwise ignored) backedge
- mass is collected. This backedge mass is used to compute the exit frequency,
- and thus the loop scale.
- Implementation: Getting from mass and scale to frequency
- ========================================================
- After analysing the complete series of DAGs, each block has a mass (local to
- its containing loop, if any), and each loop pseudo-node has a loop scale and
- its own mass (from its parent's DAG).
- We can get an initial frequency assignment (with entry frequency of 1.0) by
- multiplying these masses and loop scales together. A given block's frequency
- is the product of its mass, the mass of containing loops' pseudo nodes, and the
- containing loops' loop scales.
- Since downstream users need integers (not floating point), this initial
- frequency assignment is shifted as necessary into the range of ``uint64_t``.
- Block Bias
- ==========
- Block bias is a proposed *absolute* metric to indicate a bias toward or away
- from a given block during a function's execution. The idea is that bias can be
- used in isolation to indicate whether a block is relatively hot or cold, or to
- compare two blocks to indicate whether one is hotter or colder than the other.
- The proposed calculation involves calculating a *reference* block frequency,
- where:
- * every branch weight is assumed to be 1 (i.e., every branch probability
- distribution is even) and
- * loop scales are ignored.
- This reference frequency represents what the block frequency would be in an
- unbiased graph.
- The bias is the ratio of the block frequency to this reference block frequency.
|