
CANN/DeepSeek-V3.2-Exp Configuration Guide

news 2026/6/27 7:47:38

YAML Parameter Description

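cann-recipes-infer: This project provides CANN-based optimization samples for typical models and acceleration algorithms used in LLM and multimodal model inference. Project address: https://gitcode.com/cann/cann-recipes-infer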

The configuration instructions in the YAML file can be found below.

Basic Config

model_name: "deepseek_v3.2_exp"  # The model name. String type
model_path: "/data/models/DeepSeek-V3.2-Exp-bf16/"  # The model path. String type
exe_mode: "ge_graph"  # The execution mode. Supports only ["ge_graph", "eager", "acl_graph"]
world_size: 128  # The world size. Int type

Model Config

pa_block_size: 128  # PA block size. Supports [128, 256]
enable_weight_nz: True  # Whether to use the NZ weight format for better performance. Supports [False, True]
with_ckpt: True  # Whether to load the checkpoint. Supports [False, True]
enable_multi_streams: True  # Whether to enable multi-stream execution for better performance. Supports [False, True]
enable_profiler: True  # Whether to enable profiling. Supports [False, True]
enable_cache_compile: False  # Whether to enable cache compile for better performance. Supports [False, True]
prefill_mini_batch_size: 0  # Mini-batch size for the prefill stage
perfect_eplb: False  # Whether to enable; tests the uniform scenario of MoE experts. Supports [False, True]
enable_auto_split_weight: True  # Whether to enable auto-split weight. Supports [False, True]
next_n: 1  # Number of steps using multi-token prediction. Supports [0, 1, 2, 3]
enable_offload: False  # Whether to enable offload. Supports [False, True]

Data Config

dataset: "default"  # Supports ["default", "InfiniteBench", "LongBench"]
input_max_len: 8192  # The maximum input length
max_new_tokens: 100  # The maximum number of new tokens
batch_size: 128  # Global batch size

Parallel Config

cp_size: 128  # Context parallel (CP) size. When using CP, cp_size must equal world_size; otherwise, set cp_size to 1. Only active in the prefill stage
attn_tp_size: 1  # Attention TP size
oproj_tp_size: 8  # Oproj TP size. Supported only when attn_tp_size == 1
dense_tp_size: 1  # Dense MLP TP size
moe_tp_size: 1  # MoE TP size
embed_tp_size: 16  # Embed TP size
lmhead_tp_size: 16  # LMHead TP size
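Several of the parameters above carry constraints (allowed value lists, cp_size versus world_size, oproj_tp_size versus attn_tp_size). The following is a minimal Python sketch of how those documented constraints could be checked before launching a run. It is not part of cann-recipes-infer: the function name check_config, the ALLOWED table, and the flat dictionary layout are assumptions made for illustration, and the real YAML file may nest these keys differently. The oproj_tp_size rule is read here as "a value other than 1 requires attn_tp_size == 1", which is an interpretation of the comment above.

```python
from typing import Any

# Allowed value sets, copied from the parameter comments above.
ALLOWED = {
    "exe_mode": {"ge_graph", "eager", "acl_graph"},
    "pa_block_size": {128, 256},
    "next_n": {0, 1, 2, 3},
    "dataset": {"default", "InfiniteBench", "LongBench"},
}


def check_config(cfg: dict[str, Any]) -> list[str]:
    """Hypothetical helper: return a list of constraint violations (empty if none)."""
    problems = []

    # Enumerated parameters must take one of the documented values.
    for key, allowed in ALLOWED.items():
        if key in cfg and cfg[key] not in allowed:
            problems.append(f"{key}={cfg[key]!r} is not one of {sorted(allowed, key=str)}")

    # cp_size must equal world_size when CP is used, and be 1 otherwise.
    cp_size = cfg.get("cp_size", 1)
    if cp_size != 1 and cp_size != cfg.get("world_size"):
        problems.append("cp_size must be 1 or equal to world_size")

    # Assumed reading: oproj TP is only usable when attn_tp_size == 1.
    if cfg.get("oproj_tp_size", 1) != 1 and cfg.get("attn_tp_size", 1) != 1:
        problems.append("oproj_tp_size requires attn_tp_size == 1")

    return problems


# Values taken from the example configuration above, flattened into one dict.
sample = {
    "exe_mode": "ge_graph", "world_size": 128, "pa_block_size": 128,
    "next_n": 1, "dataset": "default", "cp_size": 128,
    "attn_tp_size": 1, "oproj_tp_size": 8,
}

if __name__ == "__main__":
    for line in check_config(sample) or ["no violations found"]:
        print(line)
```

If the YAML file is loaded with a parser such as PyYAML's yaml.safe_load, the resulting dictionary could be flattened and passed to a check of this kind before the model is started.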
