maybereorderthread-without-payload-shrink

Status: stub. The full-length analysis is queued for a v1.0.x patch release per ADR 0018, section 5, criterion #6. The companion rule page at docs/rules/maybereorderthread-without-payload-shrink.md contains the canonical detection logic and GPU reasoning.

TL;DR

The Shader Execution Reordering (SER) runtime spills the entire ray payload at the reorder point: MaybeReorderThread reorganises lanes, and the per-lane state (the payload, plus any caller-side live registers) has to follow each lane to its new position. NVIDIA's Indiana Jones path-tracer case study quantified this: spill traffic at the reorder is proportional to live-state size, and the study reported 10-25% performance gains from shrinking the payload from 64 bytes to 16 bytes around the reorder, even when the larger payload was needed before and after.
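The proportionality claim can be made concrete with a little arithmetic. The sketch below is a toy model only: it assumes spill traffic is exactly linear in per-lane live-state size (as the case study describes) and a hypothetical wave of 32 lanes. The function name and the lane count are illustrative assumptions, not part of any runtime API, and the model makes no attempt to predict the reported performance gains.

```python
def reorder_spill_bytes(live_bytes_per_lane: int, lanes: int = 32) -> int:
    """Bytes moved one way when each lane's live state follows it through a reorder.

    Toy model: traffic is linear in live-state size. `lanes=32` is an assumed
    wave width, chosen purely for illustration.
    """
    return live_bytes_per_lane * lanes


full = reorder_spill_bytes(64)    # payload left at 64 bytes across the reorder
shrunk = reorder_spill_bytes(16)  # payload shrunk to 16 bytes around the reorder
ratio = full / shrunk             # payload share of the traffic drops by this factor

print(full, shrunk, ratio)
```

Under these assumptions the payload's share of the spill traffic falls by a factor of four. Caller-side live registers add to both figures, so the end-to-end reduction is smaller than the payload ratio alone suggests.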

What the rule fires on

A dx::MaybeReorderThread(...) call whose surrounding payload struct carries state across the reorder that is never read afterwards, i.e., values written before the reorder that are neither consumed by the invocation downstream of the reorder nor read later by the caller. The Phase 7 IR-level live-range analysis (shared with live-state-across-traceray) walks per-lane lifetimes across the reorder and identifies fields that are dead across the call.
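To illustrate what "dead across the call" means, here is a minimal sketch of the idea on straight-line code. It is not the Phase 7 IR analysis: it assumes no control flow, models each payload access as a simple read or write, and folds the downstream invocation's accesses into the "after" list. The `Access` type, the `dead_across_reorder` helper, and all field names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Access:
    field: str  # payload field name (hypothetical)
    kind: str   # "write" or "read"


def dead_across_reorder(before, after):
    """Return fields written before the reorder whose value is never read after it.

    A field overwritten after the reorder before any read is also dead across
    the call, because the pre-reorder value is never observed.
    """
    dead = {a.field for a in before if a.kind == "write"}
    overwritten = set()
    for a in after:
        if a.kind == "read" and a.field not in overwritten:
            dead.discard(a.field)      # pre-reorder value is observed: live
        elif a.kind == "write":
            overwritten.add(a.field)   # later reads see the new value
    return sorted(dead)


before = [Access("radiance", "write"), Access("throughput", "write"),
          Access("debug_albedo", "write"), Access("rng_state", "write")]
after = [Access("throughput", "read"), Access("rng_state", "read"),
         Access("radiance", "write"), Access("radiance", "read")]

print(dead_across_reorder(before, after))  # ['debug_albedo', 'radiance']
```

In this toy input, `debug_albedo` is never read again and `radiance` is overwritten before its next read, so neither needs to travel with the lane through the reorder.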

See the "What it detects" section of the rule page for the full pattern definition.

Why it matters

The full GPU-mechanism analysis lives in the "Why it matters on a GPU" section of the companion rule page.

Examples

The bad / good code snippets are kept canonical on the rule page; see maybereorderthread-without-payload-shrink.md -> Examples.
