pow-base-two-to-exp2

Status: stub. The full-length analysis is queued for a v1.0.x patch release per ADR 0018, section 5, criterion #6. The companion rule page at docs/rules/pow-base-two-to-exp2.md contains the canonical detection logic + GPU reasoning.

TL;DR

pow(2.0, x) compiles to the full transcendental pair exp2(x * log2(2.0)), which simplifies algebraically to exp2(x * 1.0) â€” and then to exp2(x). However, no shipping GPU shader compiler at the HLSL source level recognises this simplification and eliminates the log2 call at compile time; the log2(2.0) is computed at runtime (or at most folded to a constant 1.0 in a separate constant-folding pass that may not combine with the surrounding exp2). On AMD RDNA 3, NVIDIA Ada Lovelace, and Intel Xe-HPG, the pow lowering path always emits both a v_log_f32 and a v_exp_f32 (or their DXIL equivalents), both of which run at one-quarter peak VALU throughput. The resulting instruction cost is the same as pow(17.3, x).

What the rule fires on

Calls to pow(2.0, x) â€” or pow(2, x) â€” where the base is a literal integer or floating-point constant equal to exactly 2.0. The rule fires when the first argument of pow is a numeric literal that evaluates to 2.0 after type coercion and the exponent x is any expression. It does not fire when the base is a variable, a constant-buffer field, or any non-literal expression.

See the What it detects section of the rule page for the full pattern definition.

Why it matters

The full GPU-mechanism analysis lives in the Why it matters on a GPU section of the companion rule page.

Examples

The bad / good code snippets are kept canonical on the rule page; see pow-base-two-to-exp2.md -> Examples.

pow-base-two-to-exp2 ​

TL;DR ​

What the rule fires on ​

Why it matters ​

Examples ​

See also ​

pow-base-two-to-exp2

TL;DR

What the rule fires on

Why it matters

Examples

See also