Expand description
Block-based iteration engine for strided permutation operations.
Structs§
Constants§
- SMALL_
TENSOR_ THRESHOLD - Maximum total elements for the small tensor fast path.
Functions§
- build_
plan_ fused - Build an execution plan with dimension fusion.
- build_
plan_ fused_ small - Simplified plan for small tensors that fit in L1 cache.
- for_
each_ inner_ block_ preordered - Iterate over blocks with pre-ordered dimensions and initial offsets.
- total_
len - Utility: total number of elements.