当前位置: 首页 > news >正文

CANN/pto-isa非ISA运算规范

Non-ISA Operations

【免费下载链接】pto-isaParallel Tile Operation (PTO) is a virtual instruction set architecture designed by Ascend CANN, focusing on tile-level operations. This repository offers high-performance, cross-platform tile operations across Ascend platforms.项目地址: https://gitcode.com/cann/pto-isa

1. Scope

This page specifies PTO AS operations from~/pto-isa.txtthat are not represented as ISA instruction pages.

  • Level-1: SSA form, compiler-managed allocation/synchronization.
  • Level-2: DPS form, explicit buffer reuse and synchronization primitives.

2. View Operations

2.1make_tensor_view

// L1 %dst = pto.make_tensor_view %ptr, shape = [sh1,sh2,sh3,sh4,sh5] strides = [st1,st2,st3,st4,st5] : !pto.tensor_view<sh1xsh2xsh3xsh4xsh5xdtype>

2.2partition_view

// L1 %dst = pto.partition_view %src, offsets = [of1,of2,of3,of4,of5], sizes = [sh1,sh2,sh3,sh4,sh5] : !pto.tensor_view<sh1xsh2xsh3xsh4xsh5xdtype> -> !pto.partition_tensor_view<sh1xsh2xsh3xsh4xsh5xdtype>

3. Tile Allocation

3.1alloc_tile(static parameters)

// L2 %dst = pto.alloc_tile : !pto.tile_buf<loc, dtype, rows, cols, v_row, v_col, blayout, slayou, fractal, pad>

3.2alloc_tile(dynamic valid region)

// L2 %dst = pto.alloc_tile valid_row = %vr valid_col = %vc : !pto.tile_buf<loc, dtype, rows, cols, v_row=?, v_col=?, blayout, slayou, fractal, pad>

4. Kernel Parameter Queries

4.1get_block_idx

// L1 / L2 %idx = pto.get_block_idx

4.2get_subblock_idx

// L1 / L2 %idx = pto.get_subblock_idx

4.3get_block_num

// L1 / L2 %num = pto.get_block_num

4.4get_subblock_num

// L1 / L2 %num = pto.get_subblock_num

5. Pointer and Scalar Access

5.1addptr

// L2 %ptr_new = pto.addptr %ptr, %offset

5.2tgetval

// L2 pto.tgetval ins(%src, %index : !pto.tile_buf<...>, dtype) outs(%val : dtype)

5.3tsetval

// L2 pto.tsetval ins(%index, %val : dtype, dtype) outs(%dst : !pto.tile_buf<...>)

6. Synchronization Primitives (Level-2)

6.1record_event

pto.record_event[src_op, dst_op, eventID]

Supported ops in current table:TLOAD,TSTORE_ACC,TSTORE_VEC,TMOV_M2L,TMOV_M2S,TMOV_M2B,TMOV_M2V,TMOV_V2M,TMATMUL,TVEC.

6.2wait_event

pto.wait_event[src_op, dst_op, eventID]

Supported ops in current table:TLOAD,TSTORE_ACC,TSTORE_VEC,TMOV_M2L,TMOV_M2S,TMOV_M2B,TMOV_M2V,TMOV_V2M,TMATMUL,TVEC.

6.3barrier

pto.barrier(op)

Supported ops in current table:TVEC,TMATMUL.

7. Consistency Notes

  • Non-ISA PTO AS operations are documented in this section and are intentionally separate fromdocs/isa/manifest-driven instruction entries.
  • TSYNCinstruction pages indocs/isa/remain the canonical ISA-level synchronization semantics.
  • When table content in~/pto-isa.txtchanges, this page should be updated in the same change set.

【免费下载链接】pto-isaParallel Tile Operation (PTO) is a virtual instruction set architecture designed by Ascend CANN, focusing on tile-level operations. This repository offers high-performance, cross-platform tile operations across Ascend platforms.项目地址: https://gitcode.com/cann/pto-isa

创作声明:本文部分内容由AI辅助生成(AIGC),仅供参考

http://www.jsqmd.com/news/783164/

相关文章:

  • 2026年成都水刀配件一站式采购指南:力好机械与主流品牌深度对标 - 企业名录优选推荐
  • 荆州全城正规上门回收黄金 两区三市三县全覆盖 资质齐全交易放心 - 金掌柜黄金回收
  • 推荐几家国内知名的Ansys代理商 - 品牌2026
  • Ansys哪家代理商价格比较低 - 品牌2026
  • CANN图像处理算子库
  • 大模型训练与数据
  • 3分钟让Windows任务栏变透明:TranslucentTB完全使用指南
  • 可解释AI实战:从LIME、SHAP到反事实解释的技术演进与应用
  • 苏州蔷薇吊装搬运:口碑好的苏州设备搬运公司推荐哪几家 - LYL仔仔
  • 自贡房屋装修预算超支?看看这份2026年全屋定制整装对比评测 - 优质企业观察收录
  • CANN/ops-math ReplicationPad3d算子
  • cann/ops-cv非连续Tensor说明
  • CANN/pypto浮点数取模运算API文档
  • CANN/atvoss默认块配置说明
  • 结构型设计模式——桥接模式
  • 公平AI与领域知识融合:构建可解释的酒驾风险预测模型
  • 2026年成都水刀配件市场深度横评:力好机械一站式供应解决方案 - 企业名录优选推荐
  • CANN算子基础框架库
  • cherry Studio中配置openClaw教程
  • C 语言11 函数传参与递归
  • CANN元数据定义FrameworkType
  • 全域旅游打破资源壁垒,巨有科技一机游赋能区域文旅高质量发展
  • 语义通信赋能AIGC:基于ControlNet的高效可控图像生成框架解析
  • AI赋能分支定界算法:机器学习如何优化混合整数规划求解效率
  • 构建XAI与人类决策的统一框架:从证据积累到可解释AI实践
  • 昇腾CANN PTO ISA 概述
  • CANN运行时TDT通道基础传输
  • CANN/asnumpy 基准测试
  • AI+SPU-Net:机器人辅助脊柱手术的自动切面规划技术详解
  • CANN/ops-transformer FFA算子设计