Op Mnemonic Master Table

Every MLIR operation mnemonic registered by — or observed in lowerings driven by — the tileiras ELF (CUDA Toolkit 13.1, SHA256 f0eb415767f403c96cbabf0817c3bcf70a50f88dfc8845fe36ebe21635fa6707). Nine dialect namespaces, ~640 first-class ops, alphabetical within each namespace. Columns: verbatim mnemonic in backticks, mnemonic length in bytes (- where the registrar uses a non-flat path that does not pass the literal length to RegisteredOperationName::insert), TypeID singleton sentinel (per-op &unk_NNNNNNN where the registrar exposes a per-op slot, range reference where the dialect uses a contiguous slab without per-op isolation), one-clause semantic, primary wiki page. Sentinel addresses use IDA-style &unk_NNNNNNN form, preserving the verbatim hexadecimal address from .bss/.data. The mnemonic length column matches the second argument passed to sub_4461CA0 (the RegisteredOperationName::insert callee). Where the glossary lists a range without a per-op slot, the entry cites the full range.
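The length column can be cross-checked mechanically: it should equal the byte count of the full mnemonic, dialect prefix and dots included. The following is a minimal sketch of that check, not code recovered from the binary. The `mnemonic_row` struct, the `row_length_ok` helper, and the `-1` encoding for a `-` entry are all hypothetical; the two sample rows are copied from the tables on this page.

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical row type for illustration; not a structure from the binary. */
struct mnemonic_row {
    const char *mnemonic; /* full name, dialect prefix included */
    int length;           /* value from the length column, -1 stands for "-" */
};

/* True when the recorded length equals the byte count of the mnemonic.
 * Rows whose length is "-" (encoded here as -1) cannot be checked and
 * are accepted as-is. */
static bool row_length_ok(const struct mnemonic_row *row) {
    if (row->length < 0)
        return true;
    return strlen(row->mnemonic) == (size_t)row->length;
}

/* Two rows copied from the tables on this page. */
static const struct mnemonic_row sample_rows[] = {
    { "cuda_tile.absf", 14 },
    { "nv_tileas.async.pipeline.yield", 30 },
};
```

Running `row_length_ok` over every row is a cheap way to catch a mnemonic that was split in the wrong place when a table is transcribed.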

How the TypeID column is consumed

Every dispatcher in the binary reads OperationName::TypeID through one double-indirection from the Operation pointer:

#include <stdbool.h>  /* bool */
#include <stdint.h>   /* uint8_t */

/* The OperationName slot sits at fixed offset +0x30 on an mlir::Operation,
 * and the TypeID pointer sits at +0x10 of OperationName::Impl. Both offsets
 * are stable across the binary; every dyn_cast / OpInterface lookup in
 * tileiras decompiles to this same shape. */
static inline const void *operation_typeid(const void *op) {
    const void *opname_impl = *(const void *const *)((const uint8_t *)op + 0x30);
    return *(const void *const *)((const uint8_t *)opname_impl + 0x10);
}

/* Dispatching on an op is therefore one pointer-equality test per arm
 * against a sentinel address from the table below. A reimplementer who
 * wants the same dispatch performance must publish exactly one stable
 * address per op kind for pointer-equality identity. */
static inline bool op_is(const void *op, const void *sentinel) {
    return operation_typeid(op) == sentinel;
}
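
A multi-arm dispatcher built on the same double indirection might look like the sketch below. It is illustrative only: the sentinels `kAddfSentinel` and `kMulfSentinel`, the `op_kind` enum, and the `mock_impl` / `mock_op` structs are hypothetical. The mock structs reproduce only the two offsets (+0x30 and +0x10) so the accessor can be exercised without a real `mlir::Operation`; they do not model the real layouts.

```c
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

/* Same double indirection as described above: +0x30 on the operation,
 * then +0x10 on the OperationName impl. memcpy keeps the loads
 * well-defined regardless of the buffer's declared type. */
static const void *typeid_of(const void *op) {
    const void *impl;
    const void *id;
    memcpy(&impl, (const uint8_t *)op + 0x30, sizeof impl);
    memcpy(&id, (const uint8_t *)impl + 0x10, sizeof id);
    return id;
}

/* Hypothetical sentinels: one stable address per op kind. */
static char kAddfSentinel;
static char kMulfSentinel;

enum op_kind { OP_UNKNOWN, OP_ADDF, OP_MULF };

/* One pointer-equality test per arm. */
static enum op_kind classify(const void *op) {
    const void *id = typeid_of(op);
    if (id == &kAddfSentinel) return OP_ADDF;
    if (id == &kMulfSentinel) return OP_MULF;
    return OP_UNKNOWN;
}

/* Mock objects that only reproduce the two offsets; the padding bytes
 * carry no meaning. */
struct mock_impl { uint8_t pad[0x10]; const void *type_id; };
struct mock_op   { uint8_t pad[0x30]; const struct mock_impl *name; };
```

Because each arm compares addresses rather than strings, the cost of `classify` grows with the number of arms tested, not with mnemonic length.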

Pointer-identity sentinels (the dominant form in the slab columns below) are plain .bss slots: the slot's address is the TypeID, and the byte stored there is never loaded. Meyers-cached sentinels (the cute interface anchors) hold the TypeID in a 64-bit qword that is filled in on first use through the mlir::TypeID::getFullName() interner. For the full sentinel-form breakdown and the address-band index, see TypeID Sentinel Table.
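
The difference between the two sentinel forms can be sketched in C as follows. This is an illustration under stated assumptions, not decompiled code: `kAbsfSentinel`, `pointer_identity_id`, `cached_id`, and `intern_name` are hypothetical names, and the stand-in interner simply returns the address of a static byte rather than reproducing any real interning logic.

```c
#include <stdbool.h>
#include <stdint.h>

/* Form 1: pointer-identity sentinel. The slot's address is the identity;
 * the byte stored in the slot is never read. */
static char kAbsfSentinel; /* hypothetical stand-in for a .bss slot */

static const void *pointer_identity_id(void) {
    return &kAbsfSentinel;
}

/* Hypothetical stand-in for an interner: returns a stable, non-null
 * address and counts how often it is called. */
static int intern_calls;
static char kInternedSlot;

static const void *intern_name(const char *name) {
    (void)name; /* the real lookup key is not modelled here */
    ++intern_calls;
    return &kInternedSlot;
}

/* Form 2: lazily cached sentinel. A 64-bit cell starts at zero and is
 * filled on first use; later calls return the cached value.
 * Not thread-safe: a real function-local-static guard would synchronise
 * the first fill. */
static uint64_t cached_cell;

static const void *cached_id(void) {
    if (cached_cell == 0)
        cached_cell = (uint64_t)(uintptr_t)intern_name("hypothetical.interface");
    return (const void *)(uintptr_t)cached_cell;
}
```

In the first form a comparison needs only the address constant; in the second, a dispatcher must load the cell (and possibly trigger the fill) before it can compare.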

§1 cuda_tile.* (92 ops)

TypeID slab range 0x5785D0..0x57A8E0. Per-op TypeID slots lie in this range, but the registration thunk does not expose isolated per-op &unk_* addresses in the surface decompilation, so every entry cites the range.

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `cuda_tile.absf` | 14 | range 0x5785D0..0x57A8E0 | element-wise abs on float tile | dialects/cuda_tile.md |
| `cuda_tile.absi` | 14 | range 0x5785D0..0x57A8E0 | element-wise abs on integer tile | dialects/cuda_tile.md |
| `cuda_tile.addf` | 14 | range 0x5785D0..0x57A8E0 | element-wise float add | dialects/cuda_tile.md |
| `cuda_tile.addi` | 14 | range 0x5785D0..0x57A8E0 | element-wise integer add | dialects/cuda_tile.md |
| `cuda_tile.andi` | 14 | range 0x5785D0..0x57A8E0 | bitwise AND | dialects/cuda_tile.md |
| `cuda_tile.assert` | 16 | range 0x5785D0..0x57A8E0 | runtime assertion in compiled tile code | dialects/cuda_tile.md |
| `cuda_tile.assume` | 16 | range 0x5785D0..0x57A8E0 | optimizer hint (LLVM assume) | dialects/cuda_tile.md |
| `cuda_tile.atomic_cas_tko` | 24 | range 0x5785D0..0x57A8E0 | atomic compare-and-swap, token-ordered | dialects/cuda_tile.md |
| `cuda_tile.atomic_rmw_tko` | 24 | range 0x5785D0..0x57A8E0 | atomic read-modify-write, token-ordered | dialects/cuda_tile.md |
| `cuda_tile.bitcast` | 17 | range 0x5785D0..0x57A8E0 | bit-pattern-preserving type pun | dialects/cuda_tile.md |
| `cuda_tile.break` | 15 | range 0x5785D0..0x57A8E0 | structured-loop break | dialects/cuda_tile.md |
| `cuda_tile.broadcast` | 19 | range 0x5785D0..0x57A8E0 | scalar / lower-rank to tile | dialects/cuda_tile.md |
| `cuda_tile.cat` | 13 | range 0x5785D0..0x57A8E0 | tile concatenation | dialects/cuda_tile.md |
| `cuda_tile.ceil` | 14 | range 0x5785D0..0x57A8E0 | ceil rounding | dialects/cuda_tile.md |
| `cuda_tile.cmpf` | 14 | range 0x5785D0..0x57A8E0 | float comparison | dialects/cuda_tile.md |
| `cuda_tile.cmpi` | 14 | range 0x5785D0..0x57A8E0 | integer comparison | dialects/cuda_tile.md |
| `cuda_tile.constant` | 18 | range 0x5785D0..0x57A8E0 | dense / splat constant | dialects/cuda_tile.md |
| `cuda_tile.continue` | 18 | range 0x5785D0..0x57A8E0 | structured-loop continue | dialects/cuda_tile.md |
| `cuda_tile.cos` | 13 | range 0x5785D0..0x57A8E0 | elementary cosine | dialects/cuda_tile.md |
| `cuda_tile.cosh` | 14 | range 0x5785D0..0x57A8E0 | hyperbolic cosine | dialects/cuda_tile.md |
| `cuda_tile.divf` | 14 | range 0x5785D0..0x57A8E0 | float division | dialects/cuda_tile.md |
| `cuda_tile.divi` | 14 | range 0x5785D0..0x57A8E0 | integer division | dialects/cuda_tile.md |
| `cuda_tile.entry` | 15 | range 0x5785D0..0x57A8E0 | kernel entry op (1 region) | dialects/cuda_tile.md |
| `cuda_tile.exp` | 13 | range 0x5785D0..0x57A8E0 | natural exponent | dialects/cuda_tile.md |
| `cuda_tile.exp2` | 14 | range 0x5785D0..0x57A8E0 | base-2 exponent | dialects/cuda_tile.md |
| `cuda_tile.exti` | 14 | range 0x5785D0..0x57A8E0 | integer extension | dialects/cuda_tile.md |
| `cuda_tile.extract` | 17 | range 0x5785D0..0x57A8E0 | tile element extract | dialects/cuda_tile.md |
| `cuda_tile.floor` | 15 | range 0x5785D0..0x57A8E0 | floor rounding | dialects/cuda_tile.md |
| `cuda_tile.fma` | 13 | range 0x5785D0..0x57A8E0 | fused multiply-add | dialects/cuda_tile.md |
| `cuda_tile.for` | 13 | range 0x5785D0..0x57A8E0 | structured for loop (1 region) | dialects/cuda_tile.md |
| `cuda_tile.ftof` | 14 | range 0x5785D0..0x57A8E0 | float-to-float cast | dialects/cuda_tile.md |
| `cuda_tile.ftoi` | 14 | range 0x5785D0..0x57A8E0 | float-to-int cast | dialects/cuda_tile.md |
| `cuda_tile.get_global` | 20 | range 0x5785D0..0x57A8E0 | reference module-level global | dialects/cuda_tile.md |
| `cuda_tile.get_index_space_shape` | 31 | range 0x5785D0..0x57A8E0 | shape of the launch index space | dialects/cuda_tile.md |
| `cuda_tile.get_num_tile_blocks` | 29 | range 0x5785D0..0x57A8E0 | tile-block count | dialects/cuda_tile.md |
| `cuda_tile.get_tensor_shape` | 26 | range 0x5785D0..0x57A8E0 | shape of a tensor view | dialects/cuda_tile.md |
| `cuda_tile.get_tile_block_id` | 27 | range 0x5785D0..0x57A8E0 | per-block id | dialects/cuda_tile.md |
| `cuda_tile.global` | 16 | range 0x5785D0..0x57A8E0 | module-level global declaration | dialects/cuda_tile.md |
| `cuda_tile.if` | 12 | range 0x5785D0..0x57A8E0 | structured conditional (2 regions) | dialects/cuda_tile.md |
| `cuda_tile.int_to_ptr` | 20 | range 0x5785D0..0x57A8E0 | integer-to-pointer cast | dialects/cuda_tile.md |
| `cuda_tile.iota` | 14 | range 0x5785D0..0x57A8E0 | sequential-int constant tile | dialects/cuda_tile.md |
| `cuda_tile.itof` | 14 | range 0x5785D0..0x57A8E0 | int-to-float cast | dialects/cuda_tile.md |
| `cuda_tile.join_tokens` | 21 | range 0x5785D0..0x57A8E0 | merge multiple tokens | dialects/cuda_tile.md |
| `cuda_tile.load_ptr_tko` | 22 | range 0x5785D0..0x57A8E0 | pointer load, token-ordered | dialects/cuda_tile.md |
| `cuda_tile.load_view_tko` | 23 | range 0x5785D0..0x57A8E0 | view load, token-ordered | dialects/cuda_tile.md |
| `cuda_tile.log` | 13 | range 0x5785D0..0x57A8E0 | natural log | dialects/cuda_tile.md |
| `cuda_tile.log2` | 14 | range 0x5785D0..0x57A8E0 | base-2 log | dialects/cuda_tile.md |
| `cuda_tile.loop` | 14 | range 0x5785D0..0x57A8E0 | generic structured loop (1 region) | dialects/cuda_tile.md |
| `cuda_tile.make_partition_view` | 29 | range 0x5785D0..0x57A8E0 | construct a partition_view | dialects/cuda_tile.md |
| `cuda_tile.make_tensor_view` | 26 | range 0x5785D0..0x57A8E0 | construct a tensor_view | dialects/cuda_tile.md |
| `cuda_tile.make_token` | 20 | range 0x5785D0..0x57A8E0 | mint a synchronisation token | dialects/cuda_tile.md |
| `cuda_tile.maxf` | 14 | range 0x5785D0..0x57A8E0 | float max | dialects/cuda_tile.md |
| `cuda_tile.maxi` | 14 | range 0x5785D0..0x57A8E0 | integer max | dialects/cuda_tile.md |
| `cuda_tile.minf` | 14 | range 0x5785D0..0x57A8E0 | float min | dialects/cuda_tile.md |
| `cuda_tile.mini` | 14 | range 0x5785D0..0x57A8E0 | integer min | dialects/cuda_tile.md |
| `cuda_tile.mmaf` | 14 | range 0x5785D0..0x57A8E0 | float tile MMA | dialects/cuda_tile.md |
| `cuda_tile.mmai` | 14 | range 0x5785D0..0x57A8E0 | integer tile MMA | dialects/cuda_tile.md |
| `cuda_tile.module` | 16 | range 0x5785D0..0x57A8E0 | top-level container (1 region) | dialects/cuda_tile.md |
| `cuda_tile.mulf` | 14 | range 0x5785D0..0x57A8E0 | float multiply | dialects/cuda_tile.md |
| `cuda_tile.mulhii` | 16 | range 0x5785D0..0x57A8E0 | high-half integer multiply | dialects/cuda_tile.md |
| `cuda_tile.muli` | 14 | range 0x5785D0..0x57A8E0 | integer multiply | dialects/cuda_tile.md |
| `cuda_tile.negf` | 14 | range 0x5785D0..0x57A8E0 | float negation | dialects/cuda_tile.md |
| `cuda_tile.negi` | 14 | range 0x5785D0..0x57A8E0 | integer negation | dialects/cuda_tile.md |
| `cuda_tile.offset` | 16 | range 0x5785D0..0x57A8E0 | view offset arithmetic | dialects/cuda_tile.md |
| `cuda_tile.ori` | 13 | range 0x5785D0..0x57A8E0 | bitwise OR | dialects/cuda_tile.md |
| `cuda_tile.permute` | 17 | range 0x5785D0..0x57A8E0 | tile permutation | dialects/cuda_tile.md |
| `cuda_tile.pow` | 13 | range 0x5785D0..0x57A8E0 | power | dialects/cuda_tile.md |
| `cuda_tile.print` | 15 | range 0x5785D0..0x57A8E0 | tile-aware diagnostic print (renamed from OSS print_tko) | dialects/cuda_tile.md |
| `cuda_tile.ptr_to_int` | 20 | range 0x5785D0..0x57A8E0 | pointer-to-integer cast | dialects/cuda_tile.md |
| `cuda_tile.ptr_to_ptr` | 20 | range 0x5785D0..0x57A8E0 | pointer recast | dialects/cuda_tile.md |
| `cuda_tile.reduce` | 16 | range 0x5785D0..0x57A8E0 | reduction (1 region) | dialects/cuda_tile.md |
| `cuda_tile.remf` | 14 | range 0x5785D0..0x57A8E0 | float remainder | dialects/cuda_tile.md |
| `cuda_tile.remi` | 14 | range 0x5785D0..0x57A8E0 | integer remainder | dialects/cuda_tile.md |
| `cuda_tile.reshape` | 17 | range 0x5785D0..0x57A8E0 | view reshape | dialects/cuda_tile.md |
| `cuda_tile.return` | 16 | range 0x5785D0..0x57A8E0 | terminator | dialects/cuda_tile.md |
| `cuda_tile.rsqrt` | 15 | range 0x5785D0..0x57A8E0 | reciprocal sqrt | dialects/cuda_tile.md |
| `cuda_tile.scan` | 14 | range 0x5785D0..0x57A8E0 | prefix-sum (1 region) | dialects/cuda_tile.md |
| `cuda_tile.select` | 16 | range 0x5785D0..0x57A8E0 | predicated select | dialects/cuda_tile.md |
| `cuda_tile.shli` | 14 | range 0x5785D0..0x57A8E0 | left shift | dialects/cuda_tile.md |
| `cuda_tile.shri` | 14 | range 0x5785D0..0x57A8E0 | right shift | dialects/cuda_tile.md |
| `cuda_tile.sin` | 13 | range 0x5785D0..0x57A8E0 | elementary sine | dialects/cuda_tile.md |
| `cuda_tile.sinh` | 14 | range 0x5785D0..0x57A8E0 | hyperbolic sine | dialects/cuda_tile.md |
| `cuda_tile.sqrt` | 14 | range 0x5785D0..0x57A8E0 | square root | dialects/cuda_tile.md |
| `cuda_tile.store_ptr_tko` | 23 | range 0x5785D0..0x57A8E0 | pointer store, token-ordered | dialects/cuda_tile.md |
| `cuda_tile.store_view_tko` | 24 | range 0x5785D0..0x57A8E0 | view store, token-ordered | dialects/cuda_tile.md |
| `cuda_tile.subf` | 14 | range 0x5785D0..0x57A8E0 | float subtract | dialects/cuda_tile.md |
| `cuda_tile.subi` | 14 | range 0x5785D0..0x57A8E0 | integer subtract | dialects/cuda_tile.md |
| `cuda_tile.tan` | 13 | range 0x5785D0..0x57A8E0 | elementary tangent | dialects/cuda_tile.md |
| `cuda_tile.tanh` | 14 | range 0x5785D0..0x57A8E0 | hyperbolic tangent | dialects/cuda_tile.md |
| `cuda_tile.trunci` | 16 | range 0x5785D0..0x57A8E0 | integer truncation | dialects/cuda_tile.md |
| `cuda_tile.xori` | 14 | range 0x5785D0..0x57A8E0 | bitwise XOR | dialects/cuda_tile.md |
| `cuda_tile.yield` | 15 | range 0x5785D0..0x57A8E0 | terminator for region-bearing ops | dialects/cuda_tile.md |

§2 nv_tileaa.* (73 ops)

Per-op TypeID slots in dense range 0x5B46D28..0x5B46F68 (8-byte stride). The slab anchors below the nv_tileas slab.
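
The slot count implied by a fixed-stride range can be checked with simple arithmetic. The sketch below is only illustrative: the helper names are hypothetical, and treating both endpoints as inclusive is an assumption of this sketch rather than something the decompilation states.

```c
#include <stdint.h>

/* Number of fixed-stride slots in [first, last], both endpoints inclusive.
 * Inclusive endpoints are an assumption made for this sketch. */
static uint64_t slab_slot_count(uint64_t first, uint64_t last, uint64_t stride) {
    return (last - first) / stride + 1;
}

/* Address of the slot at a given zero-based index within the slab. */
static uint64_t slab_slot_address(uint64_t first, uint64_t index, uint64_t stride) {
    return first + index * stride;
}
```

Under that reading, `slab_slot_count(0x5B46D28, 0x5B46F68, 8)` gives (0x240 / 8) + 1 = 73 slots, which matches the 73 ops in this section. Which op occupies which slot is not established by this arithmetic.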

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nv_tileaa.addf` | 14 | range 0x5B46D28..0x5B46F68 | float add | dialects/nv_tileaa.md |
| `nv_tileaa.addptr` | 16 | range 0x5B46D28..0x5B46F68 | pointer + integer offset | dialects/nv_tileaa.md |
| `nv_tileaa.assert` | 16 | range 0x5B46D28..0x5B46F68 | runtime assertion | dialects/nv_tileaa.md |
| `nv_tileaa.assume` | 16 | range 0x5B46D28..0x5B46F68 | optimizer assumption | dialects/nv_tileaa.md |
| `nv_tileaa.atomic_cas` | 20 | range 0x5B46D28..0x5B46F68 | scalar atomic CAS | dialects/nv_tileaa.md |
| `nv_tileaa.atomic_rmw` | 20 | range 0x5B46D28..0x5B46F68 | scalar atomic RMW | dialects/nv_tileaa.md |
| `nv_tileaa.bitcast` | 17 | range 0x5B46D28..0x5B46F68 | bit-preserving type cast | dialects/nv_tileaa.md |
| `nv_tileaa.block_tile` | 20 | range 0x5B46D28..0x5B46F68 | per-CTA tile selection | dialects/nv_tileaa.md |
| `nv_tileaa.broadcast` | 19 | range 0x5B46D28..0x5B46F68 | scalar→tile / rank lift | dialects/nv_tileaa.md |
| `nv_tileaa.call` | 14 | range 0x5B46D28..0x5B46F68 | call into emitted device function | dialects/nv_tileaa.md |
| `nv_tileaa.call_elementwise_intrinsic` | 36 | range 0x5B46D28..0x5B46F68 | call libdevice math intrinsic | dialects/nv_tileaa.md |
| `nv_tileaa.cancel_next_program_id` | 32 | range 0x5B46D28..0x5B46F68 | cluster-launch-control cancel | dialects/nv_tileaa.md |
| `nv_tileaa.cat` | 13 | range 0x5B46D28..0x5B46F68 | tile concat | dialects/nv_tileaa.md |
| `nv_tileaa.clampf` | 16 | range 0x5B46D28..0x5B46F68 | float clamp | dialects/nv_tileaa.md |
| `nv_tileaa.conv_dot` | 18 | range 0x5B46D28..0x5B46F68 | convolution dot helper | dialects/nv_tileaa.md |
| `nv_tileaa.conv_tile` | 19 | range 0x5B46D28..0x5B46F68 | convolution tile helper | dialects/nv_tileaa.md |
| `nv_tileaa.create_mem_token` | 26 | range 0x5B46D28..0x5B46F68 | mint memory-lifetime token | dialects/nv_tileaa.md |
| `nv_tileaa.create_queue` | 22 | range 0x5B46D28..0x5B46F68 | construct typed queue | dialects/nv_tileaa.md |
| `nv_tileaa.divf` | 14 | range 0x5B46D28..0x5B46F68 | float divide | dialects/nv_tileaa.md |
| `nv_tileaa.dot` | 13 | range 0x5B46D28..0x5B46F68 | matrix dot | dialects/nv_tileaa.md |
| `nv_tileaa.elementwise_inline_asm` | 32 | range 0x5B46D28..0x5B46F68 | inline-PTX elementwise emitter | dialects/nv_tileaa.md |
| `nv_tileaa.execute` | 17 | range 0x5B46D28..0x5B46F68 | launch-time execute marker | dialects/nv_tileaa.md |
| `nv_tileaa.exp2` | 14 | range 0x5B46D28..0x5B46F68 | base-2 exponent | dialects/nv_tileaa.md |
| `nv_tileaa.expand_dims` | 21 | range 0x5B46D28..0x5B46F68 | rank lift | dialects/nv_tileaa.md |
| `nv_tileaa.extern_elementwise` | 28 | range 0x5B46D28..0x5B46F68 | external (libdevice) elementwise | dialects/nv_tileaa.md |
| `nv_tileaa.extract` | 17 | range 0x5B46D28..0x5B46F68 | scalar extract | dialects/nv_tileaa.md |
| `nv_tileaa.extract_slice` | 23 | range 0x5B46D28..0x5B46F68 | sub-slice extract | dialects/nv_tileaa.md |
| `nv_tileaa.fma` | 13 | range 0x5B46D28..0x5B46F68 | fused multiply-add | dialects/nv_tileaa.md |
| `nv_tileaa.fp_to_fp` | 18 | range 0x5B46D28..0x5B46F68 | float-to-float cast | dialects/nv_tileaa.md |
| `nv_tileaa.func` | 14 | range 0x5B46D28..0x5B46F68 | function op | dialects/nv_tileaa.md |
| `nv_tileaa.gather_load` | 21 | range 0x5B46D28..0x5B46F68 | indexed gather (global) | dialects/nv_tileaa.md |
| `nv_tileaa.generate` | 18 | range 0x5B46D28..0x5B46F68 | functional generate (region) | dialects/nv_tileaa.md |
| `nv_tileaa.get_dim_size` | 22 | range 0x5B46D28..0x5B46F68 | extract dimension size | dialects/nv_tileaa.md |
| `nv_tileaa.get_global` | 20 | range 0x5B46D28..0x5B46F68 | global lookup | dialects/nv_tileaa.md |
| `nv_tileaa.get_num_programs` | 26 | range 0x5B46D28..0x5B46F68 | grid intrinsic: program count | dialects/nv_tileaa.md |
| `nv_tileaa.get_program_id` | 24 | range 0x5B46D28..0x5B46F68 | grid intrinsic: program id | dialects/nv_tileaa.md |
| `nv_tileaa.global` | 16 | range 0x5B46D28..0x5B46F68 | module-level global | dialects/nv_tileaa.md |
| `nv_tileaa.histogram` | 19 | range 0x5B46D28..0x5B46F68 | parallel histogram primitive | dialects/nv_tileaa.md |
| `nv_tileaa.inject_ir` | 19 | range 0x5B46D28..0x5B46F68 | embed lowered IR fragment | dialects/nv_tileaa.md |
| `nv_tileaa.int_to_ptr` | 20 | range 0x5B46D28..0x5B46F68 | integer-to-pointer cast | dialects/nv_tileaa.md |
| `nv_tileaa.is_valid_program_id` | 29 | range 0x5B46D28..0x5B46F68 | grid intrinsic predicate | dialects/nv_tileaa.md |
| `nv_tileaa.join_mem_token` | 24 | range 0x5B46D28..0x5B46F68 | merge memory tokens | dialects/nv_tileaa.md |
| `nv_tileaa.launch_func` | 21 | range 0x5B46D28..0x5B46F68 | host-side launch op | dialects/nv_tileaa.md |
| `nv_tileaa.load` | 14 | range 0x5B46D28..0x5B46F68 | scalar memory load | dialects/nv_tileaa.md |
| `nv_tileaa.make_memref` | 21 | range 0x5B46D28..0x5B46F68 | construct memref | dialects/nv_tileaa.md |
| `nv_tileaa.make_range` | 20 | range 0x5B46D28..0x5B46F68 | iota-style range | dialects/nv_tileaa.md |
| `nv_tileaa.mark_for_reuse` | 24 | range 0x5B46D28..0x5B46F68 | lifetime-extension marker | dialects/nv_tileaa.md |
| `nv_tileaa.message` | 17 | range 0x5B46D28..0x5B46F68 | host-printable diagnostic | dialects/nv_tileaa.md |
| `nv_tileaa.mulf` | 14 | range 0x5B46D28..0x5B46F68 | float multiply | dialects/nv_tileaa.md |
| `nv_tileaa.mulhiui` | 17 | range 0x5B46D28..0x5B46F68 | unsigned high-half multiply | dialects/nv_tileaa.md |
| `nv_tileaa.optimization_barrier` | 30 | range 0x5B46D28..0x5B46F68 | optimizer barrier | dialects/nv_tileaa.md |
| `nv_tileaa.permute` | 17 | range 0x5B46D28..0x5B46F68 | tile permutation | dialects/nv_tileaa.md |
| `nv_tileaa.plugin` | 16 | range 0x5B46D28..0x5B46F68 | plugin-injection op | dialects/nv_tileaa.md |
| `nv_tileaa.pragma` | 16 | range 0x5B46D28..0x5B46F68 | pragma carrier | dialects/nv_tileaa.md |
| `nv_tileaa.print` | 15 | range 0x5B46D28..0x5B46F68 | tile-aware print | dialects/nv_tileaa.md |
| `nv_tileaa.ptr_to_int` | 20 | range 0x5B46D28..0x5B46F68 | pointer-to-integer cast | dialects/nv_tileaa.md |
| `nv_tileaa.queue.get` | 19 | range 0x5B46D28..0x5B46F68 | typed-queue dequeue | dialects/nv_tileaa.md |
| `nv_tileaa.queue.put` | 19 | range 0x5B46D28..0x5B46F68 | typed-queue enqueue | dialects/nv_tileaa.md |
| `nv_tileaa.queue.yield` | 21 | range 0x5B46D28..0x5B46F68 | typed-queue dataflow yield | dialects/nv_tileaa.md |
| `nv_tileaa.reduce` | 16 | range 0x5B46D28..0x5B46F68 | reduction | dialects/nv_tileaa.md |
| `nv_tileaa.return` | 16 | range 0x5B46D28..0x5B46F68 | function-return terminator | dialects/nv_tileaa.md |
| `nv_tileaa.rsqrt` | 15 | range 0x5B46D28..0x5B46F68 | reciprocal sqrt | dialects/nv_tileaa.md |
| `nv_tileaa.scan` | 14 | range 0x5B46D28..0x5B46F68 | prefix-sum | dialects/nv_tileaa.md |
| `nv_tileaa.scatter_store` | 23 | range 0x5B46D28..0x5B46F68 | indexed scatter (global) | dialects/nv_tileaa.md |
| `nv_tileaa.splat` | 15 | range 0x5B46D28..0x5B46F68 | scalar broadcast | dialects/nv_tileaa.md |
| `nv_tileaa.sqrt` | 14 | range 0x5B46D28..0x5B46F68 | square root | dialects/nv_tileaa.md |
| `nv_tileaa.store` | 15 | range 0x5B46D28..0x5B46F68 | scalar memory store | dialects/nv_tileaa.md |
| `nv_tileaa.subf` | 14 | range 0x5B46D28..0x5B46F68 | float subtract | dialects/nv_tileaa.md |
| `nv_tileaa.tiled_atomic_rmw` | 26 | range 0x5B46D28..0x5B46F68 | tile-wide RMW | dialects/nv_tileaa.md |
| `nv_tileaa.tiled_load` | 20 | range 0x5B46D28..0x5B46F68 | tile load | dialects/nv_tileaa.md |
| `nv_tileaa.tiled_store` | 21 | range 0x5B46D28..0x5B46F68 | tile store | dialects/nv_tileaa.md |
| `nv_tileaa.view` | 14 | range 0x5B46D28..0x5B46F68 | layout-aware view construction | dialects/nv_tileaa.md |
| `nv_tileaa.yield` | 15 | range 0x5B46D28..0x5B46F68 | region terminator | dialects/nv_tileaa.md |

Note: the enumeration follows the registrar walk in p2-C01:441-513 and yields 73 mnemonics, including the queue.* and make_* decompositions. The "61 canonical ops" count cited in the dialect summary collapses make_memref / make_range / view to their corresponding make_* family count. All entries above are first-class.

§3 nv_tileas.* (58 ops)

Anchor &unk_5B44F08. RTTI nv_tile_ir::as. async.pipeline.* cluster dominates the surface area.

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nv_tileas.alloc_tensor` | 22 | anchor &unk_5B44F08 | tensor buffer allocation | dialects/nv_tileas.md |
| `nv_tileas.async.cancel_next_program_id` | 38 | anchor &unk_5B44F08 | async cluster cancel | dialects/nv_tileas.md |
| `nv_tileas.async.copy` | 20 | anchor &unk_5B44F08 | DMA-async copy | dialects/nv_tileas.md |
| `nv_tileas.async.dot` | 19 | anchor &unk_5B44F08 | async MMA | dialects/nv_tileas.md |
| `nv_tileas.async.extract_slice` | 29 | anchor &unk_5B44F08 | async sub-slice extract | dialects/nv_tileas.md |
| `nv_tileas.async.future_wait` | 27 | anchor &unk_5B44F08 | wait on async future | dialects/nv_tileas.md |
| `nv_tileas.async.gather_tma_load` | 31 | anchor &unk_5B44F08 | TMA gather load | dialects/nv_tileas.md |
| `nv_tileas.async.insert_slice` | 28 | anchor &unk_5B44F08 | async slice insert | dialects/nv_tileas.md |
| `nv_tileas.async.load` | 20 | anchor &unk_5B44F08 | async load | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.agent_switch` | 37 | anchor &unk_5B44F08 | warp-specialized agent boundary | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.consume_one` | 36 | anchor &unk_5B44F08 | one-stage consume | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.consume_one_async` | 42 | anchor &unk_5B44F08 | one-stage async consume | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.consumer_read` | 38 | anchor &unk_5B44F08 | consumer protocol read | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.consumer_release` | 41 | anchor &unk_5B44F08 | consumer protocol release | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.consumer_wait` | 38 | anchor &unk_5B44F08 | consumer protocol wait | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.create_iterator` | 40 | anchor &unk_5B44F08 | pipeline iterator construction | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.create_null_token` | 42 | anchor &unk_5B44F08 | null-token constructor | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.create_pipeline` | 40 | anchor &unk_5B44F08 | pipeline constructor | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.inc_iter` | 33 | anchor &unk_5B44F08 | iterator advance | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.produce_one` | 36 | anchor &unk_5B44F08 | one-stage produce | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.produce_one_async` | 42 | anchor &unk_5B44F08 | one-stage async produce | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.producer_acquire` | 41 | anchor &unk_5B44F08 | producer protocol acquire | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.producer_commit` | 40 | anchor &unk_5B44F08 | producer protocol commit | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.producer_write` | 39 | anchor &unk_5B44F08 | producer protocol write | dialects/nv_tileas.md |
| `nv_tileas.async.pipeline.yield` | 30 | anchor &unk_5B44F08 | pipeline-region terminator | dialects/nv_tileas.md |
| `nv_tileas.async.scatter_tma_store` | 33 | anchor &unk_5B44F08 | TMA scatter store | dialects/nv_tileas.md |
| `nv_tileas.async.store` | 21 | anchor &unk_5B44F08 | async store | dialects/nv_tileas.md |
| `nv_tileas.async.tiled_atomic_rmw` | 32 | anchor &unk_5B44F08 | tile RMW (async) | dialects/nv_tileas.md |
| `nv_tileas.async.tiled_load` | 26 | anchor &unk_5B44F08 | async tiled load | dialects/nv_tileas.md |
| `nv_tileas.async.tiled_tma_load` | 30 | anchor &unk_5B44F08 | TMA tile load | dialects/nv_tileas.md |
| `nv_tileas.async.tiled_tma_store` | 31 | anchor &unk_5B44F08 | TMA tile store | dialects/nv_tileas.md |
| `nv_tileas.async.to_async` | 24 | anchor &unk_5B44F08 | future conversion | dialects/nv_tileas.md |
| `nv_tileas.async.token_to_async` | 30 | anchor &unk_5B44F08 | token-to-future conversion | dialects/nv_tileas.md |
| `nv_tileas.async.wait` | 20 | anchor &unk_5B44F08 | async wait barrier | dialects/nv_tileas.md |
| `nv_tileas.cancel_next_program_id` | 32 | anchor &unk_5B44F08 | cluster cancel | dialects/nv_tileas.md |
| `nv_tileas.convert_layout` | 24 | anchor &unk_5B44F08 | layout conversion (smem ↔ rmem ↔ tmem) | dialects/nv_tileas.md |
| `nv_tileas.copy` | 14 | anchor &unk_5B44F08 | sync copy | dialects/nv_tileas.md |
| `nv_tileas.create_none` | 21 | anchor &unk_5B44F08 | null SSA value | dialects/nv_tileas.md |
| `nv_tileas.dot` | 13 | anchor &unk_5B44F08 | sync matrix dot | dialects/nv_tileas.md |
| `nv_tileas.expand_dims` | 21 | anchor &unk_5B44F08 | rank lift | dialects/nv_tileas.md |
| `nv_tileas.extract_slice` | 23 | anchor &unk_5B44F08 | sub-slice extract | dialects/nv_tileas.md |
| `nv_tileas.gather_load` | 21 | anchor &unk_5B44F08 | indexed gather | dialects/nv_tileas.md |
| `nv_tileas.generate` | 18 | anchor &unk_5B44F08 | functional generate (region) | dialects/nv_tileas.md |
| `nv_tileas.insert_slice` | 22 | anchor &unk_5B44F08 | slice insert | dialects/nv_tileas.md |
| `nv_tileas.load` | 14 | anchor &unk_5B44F08 | scalar load | dialects/nv_tileas.md |
| `nv_tileas.make_tiled_tma_desc` | 29 | anchor &unk_5B44F08 | TMA descriptor builder | dialects/nv_tileas.md |
| `nv_tileas.pragma` | 16 | anchor &unk_5B44F08 | pragma carrier | dialects/nv_tileas.md |
| `nv_tileas.reduce` | 16 | anchor &unk_5B44F08 | reduction | dialects/nv_tileas.md |
| `nv_tileas.reinterpret` | 21 | anchor &unk_5B44F08 | reinterpret cast | dialects/nv_tileas.md |
| `nv_tileas.scan` | 14 | anchor &unk_5B44F08 | prefix-sum | dialects/nv_tileas.md |
| `nv_tileas.scatter_store` | 23 | anchor &unk_5B44F08 | indexed scatter | dialects/nv_tileas.md |
| `nv_tileas.shuffle` | 17 | anchor &unk_5B44F08 | warp shuffle | dialects/nv_tileas.md |
| `nv_tileas.store` | 15 | anchor &unk_5B44F08 | scalar store | dialects/nv_tileas.md |
| `nv_tileas.tiled_atomic_rmw` | 26 | anchor &unk_5B44F08 | tile-wide RMW | dialects/nv_tileas.md |
| `nv_tileas.tiled_load` | 20 | anchor &unk_5B44F08 | tile load | dialects/nv_tileas.md |
| `nv_tileas.tiled_store` | 21 | anchor &unk_5B44F08 | tile store | dialects/nv_tileas.md |
| `nv_tileas.view` | 14 | anchor &unk_5B44F08 | view op | dialects/nv_tileas.md |
| `nv_tileas.yield` | 15 | anchor &unk_5B44F08 | region terminator | dialects/nv_tileas.md |

§4 cute.* (59 ops)

Anchor &unk_5B496B8. Hardware-independent CuTe layout algebra.

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `cute.add_offset` | 15 | anchor &unk_5B496B8 | offset addition into a layout/iter | dialects/cute.md |
| `cute.complement` | 15 | anchor &unk_5B496B8 | layout complement | dialects/cute.md |
| `cute.copy` | 9 | anchor &unk_5B496B8 | high-level copy | dialects/cute.md |
| `cute.copy_atom_call` | 19 | anchor &unk_5B496B8 | apply copy atom | dialects/cute.md |
| `cute.cosize` | 11 | anchor &unk_5B496B8 | layout cosize | dialects/cute.md |
| `cute.deref_desc_iter` | 20 | anchor &unk_5B496B8 | dereference descriptor iter | dialects/cute.md |
| `cute.derefine` | 13 | anchor &unk_5B496B8 | layout refinement | dialects/cute.md |
| `cute.fast_divmod.create_divisor` | 31 | anchor &unk_5B496B8 | fast-divmod divisor ctor | dialects/cute.md |
| `cute.fast_divmod.divide` | 23 | anchor &unk_5B496B8 | fast-divmod divide | dialects/cute.md |
| `cute.fast_divmod.get_divisor` | 28 | anchor &unk_5B496B8 | fast-divmod accessor | dialects/cute.md |
| `cute.fast_divmod.make_divisor` | 29 | anchor &unk_5B496B8 | fast-divmod factory | dialects/cute.md |
| `cute.filter_zeros` | 17 | anchor &unk_5B496B8 | strip zero modes | dialects/cute.md |
| `cute.flat_divide` | 16 | anchor &unk_5B496B8 | flat divide | dialects/cute.md |
| `cute.gemm` | 9 | anchor &unk_5B496B8 | GEMM scheduling op | dialects/cute.md |
| `cute.get_iter` | 13 | anchor &unk_5B496B8 | accessor: iter | dialects/cute.md |
| `cute.get_layout` | 15 | anchor &unk_5B496B8 | accessor: layout | dialects/cute.md |
| `cute.get_layouts_from_tile` | 26 | anchor &unk_5B496B8 | accessor: layouts from tile | dialects/cute.md |
| `cute.get_shape` | 14 | anchor &unk_5B496B8 | accessor: shape | dialects/cute.md |
| `cute.get_stride` | 15 | anchor &unk_5B496B8 | accessor: stride | dialects/cute.md |
| `cute.group_modes` | 16 | anchor &unk_5B496B8 | layout shape op | dialects/cute.md |
| `cute.inttoptr` | 13 | anchor &unk_5B496B8 | int-to-pointer | dialects/cute.md |
| `cute.local_partition` | 20 | anchor &unk_5B496B8 | partition view | dialects/cute.md |
| `cute.local_tile` | 15 | anchor &unk_5B496B8 | tile view | dialects/cute.md |
| `cute.logical_divide` | 19 | anchor &unk_5B496B8 | logical divide | dialects/cute.md |
| `cute.make_atom` | 14 | anchor &unk_5B496B8 | atom constructor | dialects/cute.md |
| `cute.make_desc_iter` | 19 | anchor &unk_5B496B8 | descriptor-iter ctor | dialects/cute.md |
| `cute.make_fragment_like` | 23 | anchor &unk_5B496B8 | fragment construction | dialects/cute.md |
| `cute.make_tiled_copy` | 20 | anchor &unk_5B496B8 | tiled-copy constructor | dialects/cute.md |
| `cute.make_tiled_mma` | 19 | anchor &unk_5B496B8 | tiled-MMA constructor | dialects/cute.md |
| `cute.make_tuple` | 15 | anchor &unk_5B496B8 | tuple constructor | dialects/cute.md |
| `cute.make_view` | 14 | anchor &unk_5B496B8 | view constructor | dialects/cute.md |
| `cute.memref.alloc_smem` | 22 | anchor &unk_5B496B8 | smem allocation | dialects/cute.md |
| `cute.memref.alloca` | 18 | anchor &unk_5B496B8 | stack alloca | dialects/cute.md |
| `cute.memref.load` | 16 | anchor &unk_5B496B8 | memref load | dialects/cute.md |
| `cute.memref.store` | 17 | anchor &unk_5B496B8 | memref store | dialects/cute.md |
| `cute.memref.store_vec` | 21 | anchor &unk_5B496B8 | vector memref store | dialects/cute.md |
| `cute.mma_atom_call` | 18 | anchor &unk_5B496B8 | apply MMA atom | dialects/cute.md |
| `cute.prefetch` | 13 | anchor &unk_5B496B8 | prefetch | dialects/cute.md |
| `cute.prefetch_atom_call` | 23 | anchor &unk_5B496B8 | apply prefetch atom | dialects/cute.md |
| `cute.print` | 10 | anchor &unk_5B496B8 | diagnostic print | dialects/cute.md |
| `cute.print_tma_desc_im2col` | 26 | anchor &unk_5B496B8 | print TMA im2col desc | dialects/cute.md |
| `cute.print_tma_desc_tiled` | 25 | anchor &unk_5B496B8 | print TMA tiled desc | dialects/cute.md |
| `cute.ptr.store` | 14 | anchor &unk_5B496B8 | typed pointer store | dialects/cute.md |
| `cute.ptrtoint` | 13 | anchor &unk_5B496B8 | pointer-to-int | dialects/cute.md |
| `cute.recast_iter` | 16 | anchor &unk_5B496B8 | recast iterator | dialects/cute.md |
| `cute.recast_layout` | 18 | anchor &unk_5B496B8 | recast layout | dialects/cute.md |
| `cute.right_inverse` | 18 | anchor &unk_5B496B8 | layout inverse | dialects/cute.md |
| `cute.select` | 11 | anchor &unk_5B496B8 | layout selector | dialects/cute.md |
| `cute.size` | 9 | anchor &unk_5B496B8 | layout size | dialects/cute.md |
| `cute.static` | 11 | anchor &unk_5B496B8 | static-shape attr op | dialects/cute.md |
| `cute.stencil_divide` | 19 | anchor &unk_5B496B8 | stencil divide | dialects/cute.md |
| `cute.tile_to_shape` | 18 | anchor &unk_5B496B8 | tile materialisation | dialects/cute.md |
| `cute.tiled_divide` | 17 | anchor &unk_5B496B8 | tiled divide | dialects/cute.md |
| `cute.tiled.copy.partition_D` | 27 | anchor &unk_5B496B8 | tiled-copy D-partition | dialects/cute.md |
| `cute.tiled.copy.partition_S` | 27 | anchor &unk_5B496B8 | tiled-copy S-partition | dialects/cute.md |
| `cute.tiled.copy.retile` | 22 | anchor &unk_5B496B8 | tiled-copy retile | dialects/cute.md |
| `cute.tiled.mma.partition` | 24 | anchor &unk_5B496B8 | tiled-MMA partition | dialects/cute.md |
| `cute.tiled.mma.partition_shape` | 30 | anchor &unk_5B496B8 | tiled-MMA partition shape | dialects/cute.md |
| `cute.unpack_tuple` | 17 | anchor &unk_5B496B8 | tuple unpacker | dialects/cute.md |

§5 cute_nvgpu.* (73 ops)

TypeID slab range 0x5B47FF8..0x5B481A8 (54 slots, 8-byte stride); the remaining ops fall into per-op accessor singletons in the same arena. Anchor &unk_5B482C8.

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `cute_nvgpu.arch.alloc_rmem` | 26 | range 0x5B47FF8..0x5B481A8 | rmem allocation | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.alloc_smem` | 26 | range 0x5B47FF8..0x5B481A8 | smem allocation | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.copy.SM100.copy_s2t` | 35 | range 0x5B47FF8..0x5B481A8 | smem→tmem copy (Blackwell) | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.copy.SM100.tma_load` | 35 | range 0x5B47FF8..0x5B481A8 | TMA load (Blackwell) | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.copy.SM100.tma_reduce` | 37 | range 0x5B47FF8..0x5B481A8 | TMA reduce (Blackwell) | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.copy.SM100.tma_store` | 36 | range 0x5B47FF8..0x5B481A8 | TMA store (Blackwell) | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.copy.SM100.tmem_load` | 36 | range 0x5B47FF8..0x5B481A8 | TMEM load | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.copy.SM100.tmem_store` | 37 | range 0x5B47FF8..0x5B481A8 | TMEM store | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.copy.SM80.cp_async` | 34 | range 0x5B47FF8..0x5B481A8 | Ampere cp.async | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.copy.ldsm` | 25 | range 0x5B47FF8..0x5B481A8 | ldmatrix family | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.copy.stsm` | 25 | range 0x5B47FF8..0x5B481A8 | stmatrix family | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.get_dyn_smem` | 28 | range 0x5B47FF8..0x5B481A8 | dynamic-smem accessor | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.get_dyn_smem_size` | 33 | range 0x5B47FF8..0x5B481A8 | dynamic-smem size query | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.make_warp_uniform` | 33 | range 0x5B47FF8..0x5B481A8 | warp-uniform marker | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.mma.SM100.umma` | 30 | range 0x5B47FF8..0x5B481A8 | Blackwell UMMA | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.mma.SM100.umma_block_scaled` | 43 | range 0x5B47FF8..0x5B481A8 | Blackwell UMMA block-scaled | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.mma.SM100.umma_block_scaled_sparse` | 50 | range 0x5B47FF8..0x5B481A8 | Blackwell UMMA bs sparse | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.mma.SM100.umma_sparse` | 37 | range 0x5B47FF8..0x5B481A8 | Blackwell UMMA sparse | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.mma.SM120.block_scaled` | 38 | range 0x5B47FF8..0x5B481A8 | sm_120 block-scaled MMA | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.mma.SM80` | 24 | range 0x5B47FF8..0x5B481A8 | Ampere MMA | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.mma.SM80.sparse` | 31 | range 0x5B47FF8..0x5B481A8 | Ampere MMA sparse | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.mma.SM89` | 24 | range 0x5B47FF8..0x5B481A8 | Ada MMA | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.mma.SM90` | 24 | range 0x5B47FF8..0x5B481A8 | Hopper WGMMA | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.prefetch_tma_desc` | 33 | range 0x5B47FF8..0x5B481A8 | TMA desc prefetch | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.sm100.alloc_tmem` | 32 | range 0x5B47FF8..0x5B481A8 | TMEM alloc | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.sm100.dealloc_tmem` | 34 | range 0x5B47FF8..0x5B481A8 | TMEM dealloc | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.sm100.relinquish_tmem_alloc_permit` | 50 | range 0x5B47FF8..0x5B481A8 | TMEM permit release | dialects/cute_nvgpu.md |
| `cute_nvgpu.arch.sm100.retrieve_tmem_ptr` | 39 | range 0x5B47FF8..0x5B481A8 | TMEM pointer retrieval | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.get_copy_s2t_smem_desc_view` | 43 | range 0x5B47FF8..0x5B481A8 | atom accessor: s2t smem-desc | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.get_value` | 25 | range 0x5B47FF8..0x5B481A8 | atom value accessor | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.ldsm` | 20 | range 0x5B47FF8..0x5B481A8 | ldmatrix atom | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.make_exec_tma` | 29 | range 0x5B47FF8..0x5B481A8 | executable TMA atom builder | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.make_non_exec_tiled_tma_load` | 44 | range 0x5B47FF8..0x5B481A8 | non-exec tiled TMA load builder | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.make_non_exec_tiled_tma_reduce` | 46 | range 0x5B47FF8..0x5B481A8 | non-exec tiled TMA reduce builder | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.make_s2t_copy` | 29 | range 0x5B47FF8..0x5B481A8 | s2t copy atom builder | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.make_tma_load` | 29 | range 0x5B47FF8..0x5B481A8 | TMA load atom builder | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.make_tma_reduce` | 31 | range 0x5B47FF8..0x5B481A8 | TMA reduce atom builder | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.make_tma_store` | 30 | range 0x5B47FF8..0x5B481A8 | TMA store atom builder | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.make_tmem_copy` | 30 | range 0x5B47FF8..0x5B481A8 | TMEM copy atom builder | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.non_exec_tiled_tma_load` | 39 | range 0x5B47FF8..0x5B481A8 | non-exec tiled TMA load atom | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.non_exec_tiled_tma_reduce` | 41 | range 0x5B47FF8..0x5B481A8 | non-exec tiled TMA reduce atom | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.non_exec_tiled_tma_store` | 40 | range 0x5B47FF8..0x5B481A8 | non-exec tiled TMA store atom | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.s2t_copy` | 24 | range 0x5B47FF8..0x5B481A8 | s2t copy atom | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.simt_async_copy` | 31 | range 0x5B47FF8..0x5B481A8 | SIMT async copy atom | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.stsm` | 20 | range 0x5B47FF8..0x5B481A8 | stmatrix atom | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.tma_load` | 24 | range 0x5B47FF8..0x5B481A8 | TMA load atom | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.tma_reduce` | 26 | range 0x5B47FF8..0x5B481A8 | TMA reduce atom | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.tma_store` | 25 | range 0x5B47FF8..0x5B481A8 | TMA store atom | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.tmem_load` | 25 | range 0x5B47FF8..0x5B481A8 | TMEM load atom | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.tmem_store` | 26 | range 0x5B47FF8..0x5B481A8 | TMEM store atom | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.universal_copy` | 30 | range 0x5B47FF8..0x5B481A8 | universal copy atom | dialects/cute_nvgpu.md |
| `cute_nvgpu.atom.universal_fma` | 29 | range 0x5B47FF8..0x5B481A8 | universal FMA atom | dialects/cute_nvgpu.md |
| `cute_nvgpu.cast_tma_desc_to_integer` | 35 | range 0x5B47FF8..0x5B481A8 | TMA desc-to-int reinterpret | dialects/cute_nvgpu.md |
| `cute_nvgpu.copy_tma_desc` | 24 | range 0x5B47FF8..0x5B481A8 | TMA desc copy | dialects/cute_nvgpu.md |
| `cute_nvgpu.get_grid_constant_pointer` | 36 | range 0x5B47FF8..0x5B481A8 | nvvm.grid_constant accessor | dialects/cute_nvgpu.md |
| `cute_nvgpu.get_tma_desc_addr` | 28 | range 0x5B47FF8..0x5B481A8 | TMA desc-address probe | dialects/cute_nvgpu.md |
| `cute_nvgpu.make_sm120_mma_bs` | 28 | range 0x5B47FF8..0x5B481A8 | sm_120 block-scaled MMA constructor | dialects/cute_nvgpu.md |
| `cute_nvgpu.make_tma_desc_im2col` | 31 | range 0x5B47FF8..0x5B481A8 | TMA im2col desc builder | dialects/cute_nvgpu.md |
| `cute_nvgpu.make_tma_desc_im2col_at` | 34 | range 0x5B47FF8..0x5B481A8 | TMA im2col desc builder (at) | dialects/cute_nvgpu.md |
| `cute_nvgpu.make_tma_desc_tiled` | 30 | range 0x5B47FF8..0x5B481A8 | TMA tiled desc builder | dialects/cute_nvgpu.md |
| `cute_nvgpu.make_tma_desc_tiled_at` | 33 | range 0x5B47FF8..0x5B481A8 | TMA tiled desc builder (at) | dialects/cute_nvgpu.md |
| `cute_nvgpu.prefetch_tma_desc` | 28 | range 0x5B47FF8..0x5B481A8 | TMA desc prefetch | dialects/cute_nvgpu.md |
| `cute_nvgpu.sm100.mma` | 20 | range 0x5B47FF8..0x5B481A8 | Blackwell MMA | dialects/cute_nvgpu.md |
cute_nvgpu.sm100.mma_bs23range 0x5B47FF8..0x5B481A8Blackwell block-scaled MMAdialects/cute_nvgpu.md
cute_nvgpu.sm100.mma_bs_sp26range 0x5B47FF8..0x5B481A8Blackwell block-scaled sparse MMAdialects/cute_nvgpu.md
cute_nvgpu.sm100.mma_sp23range 0x5B47FF8..0x5B481A8Blackwell sparse MMAdialects/cute_nvgpu.md
cute_nvgpu.SM120.mma_bs23range 0x5B47FF8..0x5B481A8sm_120 block-scaled MMAdialects/cute_nvgpu.md
cute_nvgpu.sm80.mma19range 0x5B47FF8..0x5B481A8Ampere MMAdialects/cute_nvgpu.md
cute_nvgpu.sm80.sparse_mma26range 0x5B47FF8..0x5B481A8Ampere sparse MMAdialects/cute_nvgpu.md
cute_nvgpu.sm89.mma19range 0x5B47FF8..0x5B481A8Ada MMAdialects/cute_nvgpu.md
cute_nvgpu.sm90.mma19range 0x5B47FF8..0x5B481A8Hopper WGMMAdialects/cute_nvgpu.md
cute_nvgpu.smem_desc_view25range 0x5B47FF8..0x5B481A8smem descriptor viewdialects/cute_nvgpu.md
cute_nvgpu.update_tma_desc26range 0x5B47FF8..0x5B481A8TMA desc mutatedialects/cute_nvgpu.md

§6 cutlass.* (84 ops, 38 unique families)

Fold-record range 0x5B47490..0x5B476A0 covers the op-info blocks. Includes block_striped collectives, generic and named barriers, the pipeline state machine, the seq_bar protocol, and the tile_scheduler family (DP, static-persistent, StreamK, MODS-trace).
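
Because these dialects are cited by range rather than by per-op slot, a consumer of this table can only attribute an address to a dialect, not to a single op. The following is a minimal sketch of that range test. The names `slab_range`, `k_slab_ranges`, and `dialect_for_address` are hypothetical, and treating both range bounds as inclusive is an assumption; the text only gives the `lo..hi` notation.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* One address range per dialect, copied from the range column of the tables. */
typedef struct {
    const char *dialect;
    uint64_t lo;   /* first address of the range (assumed inclusive) */
    uint64_t hi;   /* last address of the range (assumed inclusive)  */
} slab_range;

static const slab_range k_slab_ranges[] = {
    { "cutlass",    UINT64_C(0x5B47490), UINT64_C(0x5B476A0) },
    { "cute_nvgpu", UINT64_C(0x5B47FF8), UINT64_C(0x5B481A8) },
};

/* Returns the dialect whose range contains addr, or NULL when none does. */
static const char *dialect_for_address(uint64_t addr) {
    for (size_t i = 0; i < sizeof k_slab_ranges / sizeof k_slab_ranges[0]; ++i) {
        if (addr >= k_slab_ranges[i].lo && addr <= k_slab_ranges[i].hi)
            return k_slab_ranges[i].dialect;
    }
    return NULL;
}
```

The two ranges do not overlap, so the first match is the only match. Resolving an individual op inside a range still requires the mnemonic string.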

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `cutlass.async.exec` | 18 | range 0x5B47490..0x5B476A0 | async-execute wrapper | dialects/cutlass.md |
| `cutlass.barrier_id` | 18 | range 0x5B47490..0x5B476A0 | barrier-id allocator | dialects/cutlass.md |
| `cutlass.block_striped.load` | 26 | range 0x5B47490..0x5B476A0 | block-striped load | dialects/cutlass.md |
| `cutlass.block_striped.load_add` | 30 | range 0x5B47490..0x5B476A0 | block-striped load+add | dialects/cutlass.md |
| `cutlass.block_striped.reduce` | 28 | range 0x5B47490..0x5B476A0 | block-striped reduce | dialects/cutlass.md |
| `cutlass.block_striped.store` | 27 | range 0x5B47490..0x5B476A0 | block-striped store | dialects/cutlass.md |
| `cutlass.generic_barrier.arrive_increment` | 40 | range 0x5B47490..0x5B476A0 | generic-barrier arrive-increment | dialects/cutlass.md |
| `cutlass.generic_barrier_sync` | 28 | range 0x5B47490..0x5B476A0 | generic-barrier sync | dialects/cutlass.md |
| `cutlass.generic_barrier.wait_eq` | 31 | range 0x5B47490..0x5B476A0 | generic-barrier wait-eq | dialects/cutlass.md |
| `cutlass.generic_barrier.wait_less_than` | 38 | range 0x5B47490..0x5B476A0 | generic-barrier wait-less-than | dialects/cutlass.md |
| `cutlass.named_barrier.arrive` | 28 | range 0x5B47490..0x5B476A0 | named-barrier arrive | dialects/cutlass.md |
| `cutlass.named_barrier.arrive_and_wait` | 37 | range 0x5B47490..0x5B476A0 | named-barrier arrive+wait | dialects/cutlass.md |
| `cutlass.pipeline.consume` | 24 | range 0x5B47490..0x5B476A0 | pipeline consume | dialects/cutlass.md |
| `cutlass.pipeline.consumer_release` | 33 | range 0x5B47490..0x5B476A0 | consumer release | dialects/cutlass.md |
| `cutlass.pipeline.consumer_try_wait` | 34 | range 0x5B47490..0x5B476A0 | consumer try-wait | dialects/cutlass.md |
| `cutlass.pipeline.consumer_wait` | 30 | range 0x5B47490..0x5B476A0 | consumer wait | dialects/cutlass.md |
| `cutlass.pipeline.create` | 23 | range 0x5B47490..0x5B476A0 | pipeline ctor | dialects/cutlass.md |
| `cutlass.pipeline.get_producer_barrier` | 37 | range 0x5B47490..0x5B476A0 | producer-barrier query | dialects/cutlass.md |
| `cutlass.pipeline.get_producer_mask` | 34 | range 0x5B47490..0x5B476A0 | producer-mask query | dialects/cutlass.md |
| `cutlass.pipeline.init` | 21 | range 0x5B47490..0x5B476A0 | pipeline init | dialects/cutlass.md |
| `cutlass.pipeline.make_participants` | 34 | range 0x5B47490..0x5B476A0 | participant set construction | dialects/cutlass.md |
| `cutlass.pipeline.produce` | 24 | range 0x5B47490..0x5B476A0 | pipeline produce | dialects/cutlass.md |
| `cutlass.pipeline.producer_acquire` | 33 | range 0x5B47490..0x5B476A0 | producer acquire | dialects/cutlass.md |
| `cutlass.pipeline.producer_commit` | 32 | range 0x5B47490..0x5B476A0 | producer commit | dialects/cutlass.md |
| `cutlass.pipeline.producer_tail` | 30 | range 0x5B47490..0x5B476A0 | producer tail | dialects/cutlass.md |
| `cutlass.pipeline.producer_try_acquire` | 37 | range 0x5B47490..0x5B476A0 | producer try-acquire | dialects/cutlass.md |
| `cutlass.pipeline.state.create` | 29 | range 0x5B47490..0x5B476A0 | state ctor | dialects/cutlass.md |
| `cutlass.pipeline.state.get_count` | 32 | range 0x5B47490..0x5B476A0 | state count accessor | dialects/cutlass.md |
| `cutlass.pipeline.state.get_index` | 32 | range 0x5B47490..0x5B476A0 | state index accessor | dialects/cutlass.md |
| `cutlass.pipeline.state.get_phase` | 32 | range 0x5B47490..0x5B476A0 | state phase accessor | dialects/cutlass.md |
| `cutlass.pipeline.state.increment` | 32 | range 0x5B47490..0x5B476A0 | state increment | dialects/cutlass.md |
| `cutlass.pipeline.switch_by_executor` | 35 | range 0x5B47490..0x5B476A0 | executor-keyed dispatch | dialects/cutlass.md |
| `cutlass.seq_bar.arrive` | 22 | range 0x5B47490..0x5B476A0 | seq-bar arrive | dialects/cutlass.md |
| `cutlass.seq_bar.create` | 22 | range 0x5B47490..0x5B476A0 | seq-bar ctor | dialects/cutlass.md |
| `cutlass.seq_bar.init` | 20 | range 0x5B47490..0x5B476A0 | seq-bar init | dialects/cutlass.md |
| `cutlass.seq_bar.state.create` | 28 | range 0x5B47490..0x5B476A0 | seq-bar state ctor | dialects/cutlass.md |
| `cutlass.seq_bar.wait` | 20 | range 0x5B47490..0x5B476A0 | seq-bar wait | dialects/cutlass.md |
| `cutlass.tile_scheduler.advance_to_next_work` | 43 | range 0x5B47490..0x5B476A0 | scheduler advance | dialects/cutlass.md |
| `cutlass.tile_scheduler.compute_epilogue` | 39 | range 0x5B47490..0x5B476A0 | epilogue trigger | dialects/cutlass.md |
| `cutlass.tile_scheduler.create_dp_params` | 39 | range 0x5B47490..0x5B476A0 | DP scheduler params ctor | dialects/cutlass.md |
| `cutlass.tile_scheduler.create_dp_work_tile_info` | 47 | range 0x5B47490..0x5B476A0 | DP work-tile-info ctor | dialects/cutlass.md |
| `cutlass.tile_scheduler.create_SM100_scheduler` | 45 | range 0x5B47490..0x5B476A0 | sm_100 scheduler factory | dialects/cutlass.md |
| `cutlass.tile_scheduler.create_static_persistent_params` | 54 | range 0x5B47490..0x5B476A0 | static-persistent params ctor | dialects/cutlass.md |
| `cutlass.tile_scheduler.create_static_persistent_work_tile_info` | 62 | range 0x5B47490..0x5B476A0 | static-persistent work-tile-info ctor | dialects/cutlass.md |
| `cutlass.tile_scheduler.create_streamk_params` | 44 | range 0x5B47490..0x5B476A0 | StreamK params ctor | dialects/cutlass.md |
| `cutlass.tile_scheduler.create_streamk_work_tile_info` | 52 | range 0x5B47490..0x5B476A0 | StreamK work-tile-info ctor | dialects/cutlass.md |
| `cutlass.tile_scheduler.fetch_next_work` | 38 | range 0x5B47490..0x5B476A0 | fetch next work | dialects/cutlass.md |
| `cutlass.tile_scheduler.fixup` | 28 | range 0x5B47490..0x5B476A0 | partial-tile fixup | dialects/cutlass.md |
| `cutlass.tile_scheduler.fixup_increment` | 38 | range 0x5B47490..0x5B476A0 | fixup increment | dialects/cutlass.md |
| `cutlass.tile_scheduler.fixup_wait` | 33 | range 0x5B47490..0x5B476A0 | fixup wait | dialects/cutlass.md |
| `cutlass.tile_scheduler.get_current_work` | 39 | range 0x5B47490..0x5B476A0 | current work accessor | dialects/cutlass.md |
| `cutlass.tile_scheduler.get_grid_shape` | 37 | range 0x5B47490..0x5B476A0 | grid-shape accessor | dialects/cutlass.md |
| `cutlass.tile_scheduler.get_workid_response_ptr` | 46 | range 0x5B47490..0x5B476A0 | workid response ptr | dialects/cutlass.md |
| `cutlass.tile_scheduler.get_work_k_tile_count` | 44 | range 0x5B47490..0x5B476A0 | work k-tile count | dialects/cutlass.md |
| `cutlass.tile_scheduler.get_work_k_tile_start` | 44 | range 0x5B47490..0x5B476A0 | work k-tile start | dialects/cutlass.md |
| `cutlass.tile_scheduler.get_workspace_sizes` | 42 | range 0x5B47490..0x5B476A0 | workspace sizes | dialects/cutlass.md |
| `cutlass.tile_scheduler.initial_work_tile_info` | 45 | range 0x5B47490..0x5B476A0 | initial work-tile info | dialects/cutlass.md |
| `cutlass.tile_scheduler.initialize_workspace` | 43 | range 0x5B47490..0x5B476A0 | initialize workspace | dialects/cutlass.md |
| `cutlass.tile_scheduler.make_dp_params` | 37 | range 0x5B47490..0x5B476A0 | DP params builder | dialects/cutlass.md |
| `cutlass.tile_scheduler.make_static_persistent_params` | 52 | range 0x5B47490..0x5B476A0 | static-persistent params builder | dialects/cutlass.md |
| `cutlass.tile_scheduler.make_streamk_params` | 42 | range 0x5B47490..0x5B476A0 | StreamK params builder | dialects/cutlass.md |
| `cutlass.tile_scheduler.mods_report_mainloop_end` | 47 | range 0x5B47490..0x5B476A0 | MODS-trace mainloop end | dialects/cutlass.md |
| `cutlass.tile_scheduler.mods_report_mainloop_start` | 49 | range 0x5B47490..0x5B476A0 | MODS-trace mainloop start | dialects/cutlass.md |
| `cutlass.tile_scheduler.mods_report_smid` | 39 | range 0x5B47490..0x5B476A0 | MODS-trace smid report | dialects/cutlass.md |
| `cutlass.tile_scheduler.mods_throttle` | 36 | range 0x5B47490..0x5B476A0 | MODS-trace throttle | dialects/cutlass.md |
| `cutlass.tile_scheduler.params_get_value` | 39 | range 0x5B47490..0x5B476A0 | params accessor | dialects/cutlass.md |
| `cutlass.tile_scheduler.query_next_work` | 38 | range 0x5B47490..0x5B476A0 | query next work | dialects/cutlass.md |
| `cutlass.tile_scheduler.static_fetch_next_work` | 45 | range 0x5B47490..0x5B476A0 | static fetch next work | dialects/cutlass.md |
| `cutlass.tile_scheduler.work_tile_info_get_value` | 47 | range 0x5B47490..0x5B476A0 | work-tile-info accessor | dialects/cutlass.md |
| `cutlass.tile_scheduler.work_tile_info_set_value` | 47 | range 0x5B47490..0x5B476A0 | work-tile-info mutator | dialects/cutlass.md |
| `cutlass.tile_scheduler.work_tile_info_to_coord_mnkl` | 51 | range 0x5B47490..0x5B476A0 | work-tile-info MNKL coords | dialects/cutlass.md |
| `cutlass.tile_scheduler.work_tile_info_to_cta_coord` | 50 | range 0x5B47490..0x5B476A0 | work-tile-info CTA coords | dialects/cutlass.md |

§7 mlir::nvgpu.* (upstream, observed in lowerings)

The upstream MLIR nvgpu dialect is statically linked into tileiras. The dialect TypeID anchor comes from the upstream registration; tileiras's own registrar does not expose per-op TypeIDs. The list below enumerates every upstream nvgpu.* mnemonic observed in tileiras-driven lowerings (the ops handled by convert-nvgpu-to-nvvm and by equivalent upstream conversions).

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvgpu.device_async_copy` | 23 | upstream | device-async copy | dialects/upstream-nvgpu.md |
| `nvgpu.device_async_create_group` | 31 | upstream | device-async group ctor | dialects/upstream-nvgpu.md |
| `nvgpu.device_async_wait` | 23 | upstream | device-async wait | dialects/upstream-nvgpu.md |
| `nvgpu.ldmatrix` | 14 | upstream | ldmatrix wrapper | dialects/upstream-nvgpu.md |
| `nvgpu.mma.sp.sync` | 17 | upstream | sparse MMA sync | dialects/upstream-nvgpu.md |
| `nvgpu.mma.sync` | 14 | upstream | dense MMA sync | dialects/upstream-nvgpu.md |
| `nvgpu.tma.async.load` | 20 | upstream | TMA async load | dialects/upstream-nvgpu.md |
| `nvgpu.tma.async.store` | 21 | upstream | TMA async store | dialects/upstream-nvgpu.md |
| `nvgpu.tma.create.descriptor` | 27 | upstream | TMA descriptor ctor | dialects/upstream-nvgpu.md |
| `nvgpu.warpgroup.generate.descriptor` | 35 | upstream | warpgroup descriptor ctor | dialects/upstream-nvgpu.md |
| `nvgpu.warpgroup.mma` | 19 | upstream | warpgroup MMA | dialects/upstream-nvgpu.md |
| `nvgpu.warpgroup.mma.init.accumulator` | 36 | upstream | warpgroup MMA acc init | dialects/upstream-nvgpu.md |

§8 NVVM.* (213 ops)

TypeID slab 0x5B8D610..0x5B8DCB8 (1704 bytes / 8 = 213 entries, 8-byte stride, dense). Dialect TypeID &unk_5B8DCC0 sits 8 bytes above the highest op slot. Walked via RegisteredOperationName::insert at sub_4461CA0 from the registrar driver sub_2EFC390. Order below is the categorical roster from p5-HH01 (within each category alphabetical where the registrar permits it; otherwise registrar walk order).
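
The slab arithmetic above can be checked mechanically. The sketch below recomputes the entry count and maps a sentinel address to a zero-based slot index. The helper names are hypothetical. Treating the quoted lower bound as exclusive is an assumption, made because the lowest sentinel that appears in the §8 tables is `&unk_5B8D618` and the highest is `&unk_5B8DCB8`.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define NVVM_SLAB_LO  UINT64_C(0x5B8D610)  /* quoted lower bound (assumed exclusive) */
#define NVVM_SLAB_HI  UINT64_C(0x5B8DCB8)  /* highest per-op slot                    */
#define NVVM_DIALECT  UINT64_C(0x5B8DCC0)  /* dialect TypeID anchor                  */
#define NVVM_STRIDE   UINT64_C(8)          /* one pointer-sized slot per op          */

/* Number of 8-byte entries implied by the quoted span: 1704 / 8. */
static inline uint64_t nvvm_slab_entries(void) {
    return (NVVM_SLAB_HI - NVVM_SLAB_LO) / NVVM_STRIDE;
}

/* True when addr is a stride-aligned address in (LO, HI]. */
static inline bool nvvm_is_op_slot(uint64_t addr) {
    return addr > NVVM_SLAB_LO && addr <= NVVM_SLAB_HI &&
           (addr - NVVM_SLAB_LO) % NVVM_STRIDE == 0;
}

/* Zero-based index counted from the lowest slot, or -1 outside the slab. */
static inline int nvvm_slot_index(uint64_t addr) {
    if (!nvvm_is_op_slot(addr))
        return -1;
    return (int)((addr - NVVM_SLAB_LO) / NVVM_STRIDE) - 1;
}
```

Under these assumptions the index runs from 0 to 212, and the dialect anchor itself is rejected because it lies one stride above the last op slot.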

§8.1 Barriers (10)

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvvm.barrier` | 0xC | &unk_5B8DC80 | block-level barrier | dialects/nvvm.md |
| `nvvm.barrier0` | 0xD | &unk_5B8DCA8 | legacy bar.sync 0 | dialects/nvvm.md |
| `nvvm.barrier.arrive` | 0x13 | &unk_5B8DCA0 | barrier arrive | dialects/nvvm.md |
| `nvvm.barrier.cta.arrive` | 0x17 | &unk_5B8DC98 | CTA barrier arrive | dialects/nvvm.md |
| `nvvm.barrier.cta.red` | 0x14 | &unk_5B8DC90 | CTA barrier reduction | dialects/nvvm.md |
| `nvvm.barrier.cta.sync` | 0x15 | &unk_5B8DC88 | CTA barrier sync | dialects/nvvm.md |
| `nvvm.bar.warp.sync` | 0x12 | &unk_5B8D758 | bar.warp.sync | dialects/nvvm.md |
| `nvvm.cluster.arrive` | 0x13 | &unk_5B8DC10 | cluster arrive | dialects/nvvm.md |
| `nvvm.cluster.arrive.relaxed` | 0x1B | &unk_5B8DC08 | cluster arrive relaxed | dialects/nvvm.md |
| `nvvm.cluster.wait` | 0x11 | &unk_5B8DB70 | cluster wait | dialects/nvvm.md |
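
The barrier sentinels are not contiguous in the slab (`&unk_5B8D758` and `&unk_5B8DB70` sit far from the `0x5B8DC80..0x5B8DCA8` run), so a category test cannot be a single range check. One option, sketched here with hypothetical names, is a table sorted by address and searched with the standard `bsearch`. The addresses and mnemonics are copied from the ten rows above; the lookup strategy is an illustration, not something recovered from the binary.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

typedef struct {
    uint64_t sentinel;      /* address taken from the TypeID singleton column */
    const char *mnemonic;   /* verbatim mnemonic from the same row            */
} sentinel_entry;

/* Must stay sorted by ascending sentinel for bsearch to work. */
static const sentinel_entry k_barrier_ops[] = {
    { UINT64_C(0x5B8D758), "nvvm.bar.warp.sync" },
    { UINT64_C(0x5B8DB70), "nvvm.cluster.wait" },
    { UINT64_C(0x5B8DC08), "nvvm.cluster.arrive.relaxed" },
    { UINT64_C(0x5B8DC10), "nvvm.cluster.arrive" },
    { UINT64_C(0x5B8DC80), "nvvm.barrier" },
    { UINT64_C(0x5B8DC88), "nvvm.barrier.cta.sync" },
    { UINT64_C(0x5B8DC90), "nvvm.barrier.cta.red" },
    { UINT64_C(0x5B8DC98), "nvvm.barrier.cta.arrive" },
    { UINT64_C(0x5B8DCA0), "nvvm.barrier.arrive" },
    { UINT64_C(0x5B8DCA8), "nvvm.barrier0" },
};

/* bsearch comparator: key is a uint64_t address, elem is a sentinel_entry. */
static int cmp_sentinel(const void *key, const void *elem) {
    uint64_t k = *(const uint64_t *)key;
    uint64_t e = ((const sentinel_entry *)elem)->sentinel;
    return (k > e) - (k < e);
}

/* Returns the barrier mnemonic for a sentinel, or NULL for any other op. */
static const char *barrier_mnemonic(uint64_t sentinel) {
    const sentinel_entry *hit = bsearch(
        &sentinel, k_barrier_ops,
        sizeof k_barrier_ops / sizeof k_barrier_ops[0],
        sizeof k_barrier_ops[0], cmp_sentinel);
    return hit ? hit->mnemonic : NULL;
}
```

A NULL result means the address belongs to another category or lies outside the slab; the same pattern extends to the other §8 categories by swapping the table.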

§8.2 mbarrier (20)

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvvm.mbarrier.arrive` | 0x14 | &unk_5B8D870 | mbarrier arrive | dialects/nvvm.md |
| `nvvm.mbarrier.arrive.expect_tx` | 0x1E | &unk_5B8D890 | arrive with tx-count expectation | dialects/nvvm.md |
| `nvvm.mbarrier.arrive.expect_tx.shared` | 0x25 | &unk_5B8D888 | arrive expect_tx (shared) | dialects/nvvm.md |
| `nvvm.mbarrier.arrive.nocomplete` | 0x1F | &unk_5B8D880 | arrive nocomplete | dialects/nvvm.md |
| `nvvm.mbarrier.arrive.nocomplete.shared` | 0x26 | &unk_5B8D878 | arrive nocomplete (shared) | dialects/nvvm.md |
| `nvvm.mbarrier.arrive.shared` | 0x1B | &unk_5B8D868 | arrive (shared) | dialects/nvvm.md |
| `nvvm.mbarrier.init` | 0x12 | &unk_5B8D860 | mbarrier init | dialects/nvvm.md |
| `nvvm.mbarrier.init.shared` | 0x19 | &unk_5B8D858 | mbarrier init (shared) | dialects/nvvm.md |
| `nvvm.mbarrier.inval` | 0x13 | &unk_5B8D850 | mbarrier invalidate | dialects/nvvm.md |
| `nvvm.mbarrier.inval.shared` | 0x1A | &unk_5B8D848 | mbarrier invalidate (shared) | dialects/nvvm.md |
| `nvvm.mbarrier.test.wait` | 0x17 | &unk_5B8D840 | mbarrier test-wait | dialects/nvvm.md |
| `nvvm.mbarrier.test.wait.shared` | 0x1E | &unk_5B8D838 | mbarrier test-wait (shared) | dialects/nvvm.md |
| `nvvm.mbarrier.try_wait.parity` | 0x1D | &unk_5B8D820 | try-wait parity | dialects/nvvm.md |
| `nvvm.mbarrier.try_wait.parity.shared` | 0x24 | &unk_5B8D818 | try-wait parity (shared) | dialects/nvvm.md |
| `nvvm.mbarrier.try_wait.parity.timelimit` | 0x27 | &unk_5B8D810 | try-wait parity timelimit | dialects/nvvm.md |
| `nvvm.mbarrier.try_wait.timelimit` | 0x20 | &unk_5B8D808 | try-wait timelimit | dialects/nvvm.md |
| `nvvm.mbarrier.txn` | 0x11 | &unk_5B8D828 | mbarrier transaction count | dialects/nvvm.md |
| `nvvm.mbarrier.txn.cta` | 0x15 | &unk_5B8D830 | mbarrier transaction (CTA) | dialects/nvvm.md |
| `nvvm.mbarrier.wait` | 0x12 | &unk_5B8D800 | mbarrier wait | dialects/nvvm.md |
| `nvvm.mbarrier.wait.parity` | 0x19 | &unk_5B8D7F8 | mbarrier wait parity | dialects/nvvm.md |

§8.3 TMA / cp.async.bulk (12)

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvvm.cp.async.bulk.commit.group` | 0x1F | &unk_5B8DB20 | bulk commit group | dialects/nvvm.md |
| `nvvm.cp.async.bulk.global.shared.cta` | 0x24 | &unk_5B8DB08 | bulk global←shared.cta | dialects/nvvm.md |
| `nvvm.cp.async.bulk.prefetch` | 0x1B | &unk_5B8DB10 | bulk prefetch | dialects/nvvm.md |
| `nvvm.cp.async.bulk.shared.cluster.global` | 0x28 | &unk_5B8DB18 | bulk shared.cluster←global | dialects/nvvm.md |
| `nvvm.cp.async.bulk.shared.cluster.shared.cta` | 0x2C | &unk_5B8DB00 | bulk shared.cluster←shared.cta | dialects/nvvm.md |
| `nvvm.cp.async.bulk.tensor.global.shared.cta` | 0x2B | &unk_5B8DAD0 | TMA tensor global←shared.cta | dialects/nvvm.md |
| `nvvm.cp.async.bulk.tensor.global.shared.cta.ext` | 0x2F | &unk_5B8DAD8 | TMA tensor global←shared.cta ext | dialects/nvvm.md |
| `nvvm.cp.async.bulk.tensor.prefetch` | 0x22 | &unk_5B8DAE8 | TMA tensor prefetch | dialects/nvvm.md |
| `nvvm.cp.async.bulk.tensor.reduce` | 0x20 | &unk_5B8DAE0 | TMA tensor reduce | dialects/nvvm.md |
| `nvvm.cp.async.bulk.tensor.shared.cluster.global` | 0x2F | &unk_5B8DAF0 | TMA tensor shared.cluster←global | dialects/nvvm.md |
| `nvvm.cp.async.bulk.tensor.shared.cta.global` | 0x2B | &unk_5B8DAF8 | TMA tensor shared.cta←global | dialects/nvvm.md |
| `nvvm.cp.async.bulk.wait_group` | 0x1D | &unk_5B8DAC8 | bulk wait group | dialects/nvvm.md |

§8.4 cp.async (Ampere) (5)

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvvm.cp.async.commit.group` | 0x1A | &unk_5B8DAC0 | cp.async commit group | dialects/nvvm.md |
| `nvvm.cp.async.mbarrier.arrive` | 0x1D | &unk_5B8DAB8 | cp.async mbarrier arrive | dialects/nvvm.md |
| `nvvm.cp.async.mbarrier.arrive.shared` | 0x24 | &unk_5B8DAB0 | cp.async mbarrier arrive (shared) | dialects/nvvm.md |
| `nvvm.cp.async.shared.global` | 0x1B | &unk_5B8DAA8 | cp.async shared←global | dialects/nvvm.md |
| `nvvm.cp.async.wait.group` | 0x18 | &unk_5B8DAA0 | cp.async wait group | dialects/nvvm.md |

§8.5 tcgen05 (Blackwell) (18)

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvvm.tcgen05.alloc` | 0x12 | &unk_5B8D750 | tcgen05 alloc | dialects/nvvm.md |
| `nvvm.tcgen05.commit` | 0x13 | &unk_5B8D740 | tcgen05 commit | dialects/nvvm.md |
| `nvvm.tcgen05.commit.arrive` | 0x1A | &unk_5B8D748 | tcgen05 commit-arrive | dialects/nvvm.md |
| `nvvm.tcgen05.cp` | 0xF | &unk_5B8D738 | tcgen05 copy | dialects/nvvm.md |
| `nvvm.tcgen05.dealloc` | 0x14 | &unk_5B8D730 | tcgen05 dealloc | dialects/nvvm.md |
| `nvvm.tcgen05.fence` | 0x12 | &unk_5B8D728 | tcgen05 fence | dialects/nvvm.md |
| `nvvm.tcgen05.ld` | 0xF | &unk_5B8D720 | tcgen05 load | dialects/nvvm.md |
| `nvvm.tcgen05.mma` | 0x10 | &unk_5B8D710 | tcgen05 MMA | dialects/nvvm.md |
| `nvvm.tcgen05.mma.block_scale` | 0x1C | &unk_5B8D718 | tcgen05 MMA block-scale | dialects/nvvm.md |
| `nvvm.tcgen05.mma_smem_desc` | 0x1A | &unk_5B8D6E8 | tcgen05 mma smem desc | dialects/nvvm.md |
| `nvvm.tcgen05.mma.sp` | 0x13 | &unk_5B8D700 | tcgen05 MMA sparse | dialects/nvvm.md |
| `nvvm.tcgen05.mma.sp.block_scale` | 0x1F | &unk_5B8D708 | tcgen05 MMA sparse block-scale | dialects/nvvm.md |
| `nvvm.tcgen05.mma.ws` | 0x13 | &unk_5B8D6F8 | tcgen05 MMA weight-stationary | dialects/nvvm.md |
| `nvvm.tcgen05.mma.ws.sp` | 0x16 | &unk_5B8D6F0 | tcgen05 MMA ws sparse | dialects/nvvm.md |
| `nvvm.tcgen05.relinquish_alloc_permit` | 0x24 | &unk_5B8D6E0 | tcgen05 relinquish permit | dialects/nvvm.md |
| `nvvm.tcgen05.shift` | 0x12 | &unk_5B8D6D8 | tcgen05 shift | dialects/nvvm.md |
| `nvvm.tcgen05.st` | 0xF | &unk_5B8D6D0 | tcgen05 store | dialects/nvvm.md |
| `nvvm.tcgen05.wait` | 0x11 | &unk_5B8D6C8 | tcgen05 wait | dialects/nvvm.md |

§8.6 wgmma / wmma / mma / ldmatrix-stmatrix (12)

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvvm.wgmma.commit.group.sync.aligned` | 0x24 | &unk_5B8D620 | wgmma commit group sync | dialects/nvvm.md |
| `nvvm.wgmma.fence.aligned` | 0x18 | &unk_5B8D628 | wgmma fence aligned | dialects/nvvm.md |
| `nvvm.wgmma.mma_async` | 0x14 | &unk_5B8D618 | wgmma async MMA | dialects/nvvm.md |
| `nvvm.wmma.load` | 0xE | &unk_5B8D658 | wmma load | dialects/nvvm.md |
| `nvvm.wmma.mma` | 0xD | &unk_5B8D650 | wmma MMA | dialects/nvvm.md |
| `nvvm.wmma.store` | 0xF | &unk_5B8D648 | wmma store | dialects/nvvm.md |
| `nvvm.mma.block_scale` | 0x14 | &unk_5B8D8D8 | MMA block-scale | dialects/nvvm.md |
| `nvvm.mma_smem_desc` | 0x12 | &unk_5B8D7C8 | MMA smem desc | dialects/nvvm.md |
| `nvvm.mma.sparse.block_scale` | 0x1B | &unk_5B8D8D0 | MMA sparse block-scale | dialects/nvvm.md |
| `nvvm.mma.sync` | 0xD | &unk_5B8D7D0 | MMA sync | dialects/nvvm.md |
| `nvvm.ldmatrix` | 0xD | &unk_5B8D898 | ldmatrix | dialects/nvvm.md |
| `nvvm.stmatrix` | 0xD | &unk_5B8D768 | stmatrix | dialects/nvvm.md |

§8.7 shfl / vote / redux / match / elect (5)

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvvm.elect.sync` | 0xF | &unk_5B8DA78 | elect leader | dialects/nvvm.md |
| `nvvm.match.sync` | 0xF | &unk_5B8D7E8 | match.sync | dialects/nvvm.md |
| `nvvm.redux.sync` | 0xF | &unk_5B8D790 | redux.sync | dialects/nvvm.md |
| `nvvm.shfl.sync` | 0xE | &unk_5B8D780 | shfl.sync | dialects/nvvm.md |
| `nvvm.vote.sync` | 0xE | &unk_5B8D660 | vote.sync | dialects/nvvm.md |

§8.8 Convert / cvt.packfloat (11)

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvvm.convert.bf16x2.to.f4x2` | 0x1B | &unk_5B8DB68 | bf16x2→f4x2 | dialects/nvvm.md |
| `nvvm.convert.bf16x2.to.f8x2` | 0x1B | &unk_5B8DB60 | bf16x2→f8x2 | dialects/nvvm.md |
| `nvvm.convert.f16x2.to.f4x2` | 0x1A | &unk_5B8DB58 | f16x2→f4x2 | dialects/nvvm.md |
| `nvvm.convert.f16x2.to.f8x2` | 0x1A | &unk_5B8DB50 | f16x2→f8x2 | dialects/nvvm.md |
| `nvvm.convert.f32x2.to.f4x2` | 0x1A | &unk_5B8DB48 | f32x2→f4x2 | dialects/nvvm.md |
| `nvvm.convert.f32x2.to.f6x2` | 0x1A | &unk_5B8DB40 | f32x2→f6x2 | dialects/nvvm.md |
| `nvvm.convert.f32x2.to.f8x2` | 0x1A | &unk_5B8DB38 | f32x2→f8x2 | dialects/nvvm.md |
| `nvvm.convert.f4x2.to.f16x2` | 0x1A | &unk_5B8DB30 | f4x2→f16x2 | dialects/nvvm.md |
| `nvvm.convert.float.to.tf32` | 0x1A | &unk_5B8DB28 | float→tf32 | dialects/nvvm.md |
| `nvvm.cvt.packfloat` | 0x12 | &unk_5B8DA90 | cvt.packfloat | dialects/nvvm.md |
| `nvvm.cvt.packfloat.f32` | 0x16 | &unk_5B8DA98 | cvt.packfloat.f32 | dialects/nvvm.md |

§8.9 read.ptx.sreg.* (73)

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvvm.read.ptx.sreg.clock` | 0x18 | &unk_5B8DC18 | sreg clock | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.clock64` | 0x1A | &unk_5B8DC20 | sreg clock64 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.cluster.ctaid.x` | 0x22 | &unk_5B8DC48 | cluster.ctaid.x | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.cluster.ctaid.y` | 0x22 | &unk_5B8DC40 | cluster.ctaid.y | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.cluster.ctaid.z` | 0x22 | &unk_5B8DC38 | cluster.ctaid.z | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.cluster.ctarank` | 0x22 | &unk_5B8DBC8 | cluster.ctarank | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.clusterid.x` | 0x1E | &unk_5B8DBC0 | clusterid.x | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.clusterid.y` | 0x1E | &unk_5B8DBB8 | clusterid.y | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.clusterid.z` | 0x1E | &unk_5B8DBB0 | clusterid.z | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.cluster.nctaid.x` | 0x23 | &unk_5B8DBF8 | cluster.nctaid.x | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.cluster.nctaid.y` | 0x23 | &unk_5B8DBF0 | cluster.nctaid.y | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.cluster.nctaid.z` | 0x23 | &unk_5B8DBE8 | cluster.nctaid.z | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.cluster.nctarank` | 0x23 | &unk_5B8DC00 | cluster.nctarank | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.ctaid.x` | 0x1A | &unk_5B8DC60 | ctaid.x | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.ctaid.y` | 0x1A | &unk_5B8DC58 | ctaid.y | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.ctaid.z` | 0x1A | &unk_5B8DC50 | ctaid.z | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg0` | 0x1A | &unk_5B8DA70 | envreg0 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg1` | 0x1A | &unk_5B8DA18 | envreg1 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg10` | 0x1B | &unk_5B8DA68 | envreg10 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg11` | 0x1B | &unk_5B8DA60 | envreg11 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg12` | 0x1B | &unk_5B8DA58 | envreg12 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg13` | 0x1B | &unk_5B8DA50 | envreg13 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg14` | 0x1B | &unk_5B8DA48 | envreg14 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg15` | 0x1B | &unk_5B8DA40 | envreg15 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg16` | 0x1B | &unk_5B8DA38 | envreg16 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg17` | 0x1B | &unk_5B8DA30 | envreg17 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg18` | 0x1B | &unk_5B8DA28 | envreg18 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg19` | 0x1B | &unk_5B8DA20 | envreg19 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg2` | 0x1A | &unk_5B8D9C0 | envreg2 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg20` | 0x1B | &unk_5B8DA10 | envreg20 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg21` | 0x1B | &unk_5B8DA08 | envreg21 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg22` | 0x1B | &unk_5B8DA00 | envreg22 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg23` | 0x1B | &unk_5B8D9F8 | envreg23 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg24` | 0x1B | &unk_5B8D9F0 | envreg24 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg25` | 0x1B | &unk_5B8D9E8 | envreg25 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg26` | 0x1B | &unk_5B8D9E0 | envreg26 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg27` | 0x1B | &unk_5B8D9D8 | envreg27 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg28` | 0x1B | &unk_5B8D9D0 | envreg28 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg29` | 0x1B | &unk_5B8D9C8 | envreg29 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg3` | 0x1A | &unk_5B8D9A8 | envreg3 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg30` | 0x1B | &unk_5B8D9B8 | envreg30 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg31` | 0x1B | &unk_5B8D9B0 | envreg31 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg4` | 0x1A | &unk_5B8D9A0 | envreg4 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg5` | 0x1A | &unk_5B8D998 | envreg5 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg6` | 0x1A | &unk_5B8D990 | envreg6 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg7` | 0x1A | &unk_5B8D988 | envreg7 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg8` | 0x1A | &unk_5B8D980 | envreg8 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.envreg9` | 0x1A | &unk_5B8D978 | envreg9 | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.globaltimer` | 0x1E | &unk_5B8D918 | globaltimer | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.gridid` | 0x19 | &unk_5B8D8F8 | gridid | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.laneid` | 0x19 | &unk_5B8D8C8 | laneid | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.lanemask.eq` | 0x1E | &unk_5B8D8C0 | lanemask.eq | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.lanemask.ge` | 0x1E | &unk_5B8D8B8 | lanemask.ge | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.lanemask.gt` | 0x1E | &unk_5B8D8B0 | lanemask.gt | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.lanemask.le` | 0x1E | &unk_5B8D8A8 | lanemask.le | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.lanemask.lt` | 0x1E | &unk_5B8D8A0 | lanemask.lt | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.nclusterid.x` | 0x1F | &unk_5B8DBE0 | nclusterid.x | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.nclusterid.y` | 0x1F | &unk_5B8DBD8 | nclusterid.y | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.nclusterid.z` | 0x1F | &unk_5B8DBD0 | nclusterid.z | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.nctaid.x` | 0x1B | &unk_5B8D910 | nctaid.x | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.nctaid.y` | 0x1B | &unk_5B8D908 | nctaid.y | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.nctaid.z` | 0x1B | &unk_5B8D900 | nctaid.z | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.nsmid` | 0x18 | &unk_5B8D778 | nsmid | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.ntid.x` | 0x19 | &unk_5B8DC78 | ntid.x | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.ntid.y` | 0x19 | &unk_5B8DC70 | ntid.y | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.ntid.z` | 0x19 | &unk_5B8DC68 | ntid.z | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.nwarpid` | 0x1A | &unk_5B8D640 | nwarpid | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.smid` | 0x17 | &unk_5B8D770 | smid | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.tid.x` | 0x18 | &unk_5B8D678 | tid.x | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.tid.y` | 0x18 | &unk_5B8D670 | tid.y | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.tid.z` | 0x18 | &unk_5B8D668 | tid.z | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.warpid` | 0x19 | &unk_5B8D638 | warpid | dialects/nvvm.md |
| `nvvm.read.ptx.sreg.warpsize` | 0x1B | &unk_5B8D630 | warpsize | dialects/nvvm.md |

§8.10 cluster_launch_ctrl (7)

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvvm.clusterlaunchcontrol.query_cancel.get_first_ctaid` | 0x36 | &unk_5B8DBA8 | query first ctaid | dialects/nvvm.md |
| `nvvm.clusterlaunchcontrol.query_cancel.get_first_ctaid.x` | 0x38 | &unk_5B8DBA0 | query first ctaid.x | dialects/nvvm.md |
| `nvvm.clusterlaunchcontrol.query_cancel.get_first_ctaid.y` | 0x38 | &unk_5B8DB98 | query first ctaid.y | dialects/nvvm.md |
| `nvvm.clusterlaunchcontrol.query_cancel.get_first_ctaid.z` | 0x38 | &unk_5B8DB90 | query first ctaid.z | dialects/nvvm.md |
| `nvvm.clusterlaunchcontrol.query_cancel.is_canceled` | 0x32 | &unk_5B8DB88 | query is-canceled | dialects/nvvm.md |
| `nvvm.clusterlaunchcontrol.try_cancel` | 0x24 | &unk_5B8DB78 | try cancel | dialects/nvvm.md |
| `nvvm.clusterlaunchcontrol.try_cancel.multicast` | 0x2E | &unk_5B8DB80 | try cancel multicast | dialects/nvvm.md |

§8.11 Fences (14)

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvvm.fence.acq_rel.cluster` | 0x1A | &unk_5B8D6B8 | acq_rel cluster | dialects/nvvm.md |
| `nvvm.fence.acq_rel.cta` | 0x16 | &unk_5B8D6B0 | acq_rel CTA | dialects/nvvm.md |
| `nvvm.fence.acq_rel.gpu` | 0x16 | &unk_5B8D6A8 | acq_rel GPU | dialects/nvvm.md |
| `nvvm.fence.acq_rel.sys` | 0x16 | &unk_5B8D6A0 | acq_rel sys | dialects/nvvm.md |
| `nvvm.fence.acquire` | 0x12 | &unk_5B8D948 | acquire fence | dialects/nvvm.md |
| `nvvm.fence.mbarrier.init` | 0x18 | &unk_5B8D940 | mbarrier-init fence | dialects/nvvm.md |
| `nvvm.fence.proxy` | 0x10 | &unk_5B8D930 | proxy fence | dialects/nvvm.md |
| `nvvm.fence.proxy.acquire` | 0x18 | &unk_5B8D938 | proxy acquire | dialects/nvvm.md |
| `nvvm.fence.proxy.release` | 0x18 | &unk_5B8D928 | proxy release | dialects/nvvm.md |
| `nvvm.fence.release` | 0x12 | &unk_5B8D920 | release fence | dialects/nvvm.md |
| `nvvm.fence.sc` | 0xD | &unk_5B8D680 | sc fence | dialects/nvvm.md |
| `nvvm.fence.sc.cluster` | 0x15 | &unk_5B8D698 | sc cluster | dialects/nvvm.md |
| `nvvm.fence.sc.cta` | 0x11 | &unk_5B8D690 | sc CTA | dialects/nvvm.md |
| `nvvm.fence.sc.gpu` | 0x11 | &unk_5B8D688 | sc GPU | dialects/nvvm.md |

§8.12 dot_accum (2)

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvvm.dot.accumulate.2way` | 0x18 | &unk_5B8DA88 | dot accumulate 2-way | dialects/nvvm.md |
| `nvvm.dot.accumulate.4way` | 0x18 | &unk_5B8DA80 | dot accumulate 4-way | dialects/nvvm.md |

§8.13 griddep / proxy / tensormap (5)

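| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvvm.griddepcontrol.launch.dependents` | 0x25 | &unk_5B8D8F0 | griddepcontrol launch dependents | dialects/nvvm.md |
| `nvvm.griddepcontrol.wait` | 0x18 | &unk_5B8D8E8 | griddepcontrol wait | dialects/nvvm.md |
| `nvvm.prefetch` | 0xD | &unk_5B8D7B0 | prefetch | dialects/nvvm.md |
| `nvvm.prefetch.tensormap` | 0x17 | &unk_5B8D7A8 | prefetch tensormap | dialects/nvvm.md |
| `nvvm.tensormap.cp_fenceproxy` | 0x1C | &unk_5B8D6C0 | tensormap cp_fenceproxy | dialects/nvvm.md |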
nvvm.tensormap.cp_fenceproxy0x1C&unk_5B8D6C0tensormap cp_fenceproxydialects/nvvm.md

§8.14 Misc (19)

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `nvvm.add.packed.f32x2` | 0x15 | &unk_5B8DCB8 | packed f32x2 add | dialects/nvvm.md |
| `nvvm.atomicrmw` | 0xE | &unk_5B8DCB0 | LLVM atomicrmw wrapper | dialects/nvvm.md |
| `nvvm.breakpoint` | 0xF | &unk_5B8DC30 | breakpoint | dialects/nvvm.md |
| `nvvm.exit` | 9 | &unk_5B8D970 | thread exit | dialects/nvvm.md |
| `nvvm.fabs` | 9 | &unk_5B8D958 | float abs | dialects/nvvm.md |
| `nvvm.fma.packed.f32x2` | 0x15 | &unk_5B8D950 | packed f32x2 FMA | dialects/nvvm.md |
| `nvvm.fmax` | 9 | &unk_5B8D7E0 | float max | dialects/nvvm.md |
| `nvvm.fmin` | 9 | &unk_5B8D7D8 | float min | dialects/nvvm.md |
| `nvvm.inline_ptx` | 0xF | &unk_5B8D8E0 | inline PTX | dialects/nvvm.md |
| `nvvm.load.ext` | 0xD | &unk_5B8D968 | extended load | dialects/nvvm.md |
| `nvvm.mapa` | 9 | &unk_5B8D7F0 | mapa | dialects/nvvm.md |
| `nvvm.mul` | 8 | &unk_5B8D7C0 | multiply | dialects/nvvm.md |
| `nvvm.mul.packed.f32x2` | 0x15 | &unk_5B8D7B8 | packed f32x2 multiply | dialects/nvvm.md |
| `nvvm.rcp.approx.ftz.f` | 0x15 | &unk_5B8D7A0 | reciprocal approx ftz | dialects/nvvm.md |
| `nvvm.red` (family; TypeID-only, no literal mnemonic string) | 8 | &unk_5B8D798 | atomic reduction family; concrete forms surfaced in the string table are nvvm.redux.sync and nvvm.barrier.cta.red; the variant-3 red_op/red_type parser slots are described in dialects/nvvm/properties-blob-and-attr-parsers.md | dialects/nvvm.md |
| `nvvm.setmaxregister` | 0x13 | &unk_5B8D788 | set-max-register | dialects/nvvm.md |
| `nvvm.st.bulk` | 0xC | &unk_5B8DC28 | bulk store | dialects/nvvm.md |
| `nvvm.store.ext` | 0xE | &unk_5B8D960 | extended store | dialects/nvvm.md |
| `nvvm.sub.packed.f32x2` | 0x15 | &unk_5B8D760 | packed f32x2 subtract | dialects/nvvm.md |
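
The length column mixes notations: short mnemonics carry a plain decimal (`9`, `8`) and longer ones a hex literal (`0x15`). A row can be sanity-checked by parsing the cell and comparing it with the byte length of the mnemonic. The sketch below is one way to do that; `length_cell_matches` is a hypothetical helper, not a function from the binary.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>
#include <string.h>

/* Parses a length cell ("0x15" or "9") and compares it with strlen(mnemonic).
 * Returns false when the cell is not a complete number, e.g. the "-"
 * placeholder used for registrars that do not pass a literal length. */
static bool length_cell_matches(const char *mnemonic, const char *length_cell) {
    char *end = NULL;
    /* Base 0 accepts both the 0x-prefixed and the plain decimal spelling. */
    unsigned long declared = strtoul(length_cell, &end, 0);
    if (end == length_cell || *end != '\0')
        return false;
    return declared == strlen(mnemonic);
}
```

Base 0 also treats a leading `0` as octal, so this check assumes length cells are never zero-padded, which holds for every row in this section.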

§9 llvm-extras (upstream llvm.* ops observed in tileiras lowerings)

The MLIR llvm dialect is statically linked from upstream and registered via addOperations<> chains; tileiras does not surface a per-op &unk_* slot for these ops. The list below enumerates the llvm.* mnemonics emitted by tileiras-driven lowerings. The dialect TypeID anchor is &unk_5BA8F60.
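
With no per-op sentinel available, a tool that consumes this table has to identify these ops by mnemonic string instead of by address. A minimal sketch, assuming the consumer holds the name as a pointer plus byte length: compare the length first, then the bytes. The `upstream_op` type, the `k_llvm_ops` excerpt, and `find_upstream_op` are hypothetical, and only a few rows of the table are included.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

typedef struct {
    const char *mnemonic;  /* verbatim mnemonic from the table */
    size_t length;         /* value of the length column       */
} upstream_op;

/* Excerpt of the llvm.* rows; extend with the remaining rows as needed. */
static const upstream_op k_llvm_ops[] = {
    { "llvm.alloca",        11 },
    { "llvm.call",           9 },
    { "llvm.cmpxchg",       12 },
    { "llvm.getelementptr", 18 },
};

/* Returns the matching entry, or NULL when the name is not in the excerpt.
 * The length test rejects most candidates before any bytes are compared. */
static const upstream_op *find_upstream_op(const char *name, size_t len) {
    for (size_t i = 0; i < sizeof k_llvm_ops / sizeof k_llvm_ops[0]; ++i) {
        if (k_llvm_ops[i].length == len &&
            memcmp(k_llvm_ops[i].mnemonic, name, len) == 0)
            return &k_llvm_ops[i];
    }
    return NULL;
}
```

Looking up the abbreviated `llvm.gep` spelling returns NULL here, consistent with the table listing only the spelled-out `llvm.getelementptr`.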

| mnemonic | length | TypeID singleton | brief semantic | primary wiki page |
|---|---|---|---|---|
| `llvm.alloca` | 11 | upstream | stack alloca | dialects/upstream-llvm.md |
| `llvm.atomicrmw` | 14 | upstream | atomic RMW (the binary has no llvm.atomic_cmpxchg string; compare-and-swap is the separate llvm.cmpxchg op below) | dialects/upstream-llvm.md |
| `llvm.bitcast` | 12 | upstream | bit-pattern type pun | dialects/upstream-llvm.md |
| `llvm.call` | 9 | upstream | LLVM call | dialects/upstream-llvm.md |
| `llvm.cmpxchg` | 12 | upstream | atomic compare-and-swap | dialects/upstream-llvm.md |
| `llvm.dbg.cu` | 11 | upstream | DI compile-unit | dialects/upstream-llvm.md |
| `llvm.extractelement` | 19 | upstream | vector element extract | dialects/upstream-llvm.md |
| `llvm.fence` | 10 | upstream | LLVM fence | dialects/upstream-llvm.md |
| `llvm.func` | 9 | upstream | LLVM function | dialects/upstream-llvm.md |
| `llvm.getelementptr` | 18 | upstream | get-element-ptr (the binary has no abbreviated llvm.gep string; only the spelled-out form is present) | dialects/upstream-llvm.md |
| `llvm.global_ctors` | 17 | upstream | LLVM global constructors array | dialects/upstream-llvm.md |
| `llvm.global_dtors` | 17 | upstream | LLVM global destructors array | dialects/upstream-llvm.md |
| `llvm.global.annotations` | 23 | upstream | LLVM global annotations array | dialects/upstream-llvm.md |
| `llvm.insertelement` | 18 | upstream | vector element insert | dialects/upstream-llvm.md |
| `llvm.intr.coro.align` | 20 | upstream | coroutine intrinsic: frame alignment query | dialects/upstream-llvm.md |
| `llvm.intr.coro.begin` | 20 | upstream | coroutine intrinsic: frame begin | dialects/upstream-llvm.md |
| `llvm.intr.coro.end` | 18 | upstream | coroutine intrinsic: frame end | dialects/upstream-llvm.md |
| `llvm.intr.coro.free` | 19 | upstream | coroutine intrinsic: free frame storage | dialects/upstream-llvm.md |
| `llvm.intr.coro.id` | 17 | upstream | coroutine intrinsic: identity token | dialects/upstream-llvm.md |
| `llvm.intr.coro.promise` | 22 | upstream | coroutine intrinsic: promise/frame conversion | dialects/upstream-llvm.md |
| `llvm.intr.coro.resume` | 21 | upstream | coroutine intrinsic: resume suspended frame | dialects/upstream-llvm.md |
| `llvm.intr.coro.save` | 19 | upstream | coroutine intrinsic: save suspend index | dialects/upstream-llvm.md |
| `llvm.intr.coro.size` | 19 | upstream | coroutine intrinsic: frame size query | dialects/upstream-llvm.md |
| `llvm.intr.coro.suspend` | 22 | upstream | coroutine intrinsic: suspend point | dialects/upstream-llvm.md |
| `llvm.intr.dbg.declare` | 21 | upstream | debug-info declare | dialects/upstream-llvm.md |
| `llvm.intr.dbg.label` | 19 | upstream | debug-info label | dialects/upstream-llvm.md |
| `llvm.intr.dbg.value` | 19 | upstream | debug-info value | dialects/upstream-llvm.md |
| `llvm.inttoptr` | 13 | upstream | int-to-pointer | dialects/upstream-llvm.md |
| `llvm.mlir.constant` | 18 | upstream | MLIR constant for LLVM type | dialects/upstream-llvm.md |
| `llvm.ptrtoint` | 13 | upstream | pointer-to-int | dialects/upstream-llvm.md |
| `llvm.return` | 11 | upstream | return | dialects/upstream-llvm.md |
| `llvm.select` | 11 | upstream | select | dialects/upstream-llvm.md |
| `llvm.shufflevector` | 18 | upstream | vector shuffle | dialects/upstream-llvm.md |