当前位置：首页 > news >正文

CANN/ops-nn RMS归一化动态量化算子

news 2026/7/5 23:43:47

AddRmsNormDynamicQuantV2

【免费下载链接】ops-nn本项目是CANN提供的神经网络类计算算子库，实现网络在NPU上加速计算。项目地址: https://gitcode.com/cann/ops-nn

算子功能：RmsNorm算子是大模型常用的归一化操作，相比LayerNorm算子，其去掉了减去均值的部分。DynamicQuant算子则是为输入张量进行对称动态量化的算子。AddRmsNormDynamicQuantV2算子将RmsNorm前的Add算子和RmsNorm归一化输出给到的1个或2个DynamicQuant算子融合起来，减少搬入搬出操作。
计算公式：
$$ x=x_{1}+x_{2} $$
$$ y = \operatorname{RmsNorm}(x)=\frac{x}{\operatorname{Rms}(\mathbf{x})}\cdot gamma, \quad \text { where } \operatorname{Rms}(\mathbf{x})=\sqrt{\frac{1}{n} \sum_{i=1}^n x_i^2+epsilon} $$
$$ yFP32=cast(y) $$
- 若smoothScale1Optional和smoothScale2Optional均不输入，则y2Out和scale2Out输出无实际意义。计算过程如下所示：
$$ scale1Out=row_max(abs(y))/127 $$
$$ y1Out=round(y/scale1Out) $$
- 若仅输入smoothScale1Optional，则y2Out和scale2Out输出无实际意义。计算过程如下所示：
$$ input = y\cdot smoothScale1Optional $$
$$ scale1Out=row_max(abs(input))/127 $$
$$ y1Out=round(input/scale1Out) $$
- 若smoothScale1Optional和smoothScale2Optional均输入，则算子的五个输出均为有效输出。计算过程如下所示：
$$ input1 = y\cdot smoothScale1Optional $$
$$ input2 = y\cdot smoothScale2Optional $$
$$ scale1Out=row_max(abs(input1))/127 $$
$$ scale2Out=row_max(abs(input2))/127 $$
$$ y1Out=round(input1/scale1Out) $$
$$ y2Out=round(input2/scale2Out) $$
其中row_max代表每行求最大值。

参数名	输入/输出/属性	描述	数据类型	数据格式
x1	输入	表示标准化过程中的源数据张量，对应公式中的`x1`。	FLOAT16、BFLOAT16	ND
x2	输入	表示标准化过程中的源数据张量，对应公式中的`x2`。	FLOAT16、BFLOAT16	ND
gamma	输入	表示标准化过程中的权重张量，对应公式中的`gamma`。shape需要与`x1`最后一维一致。	FLOAT16、BFLOAT16	ND
smooth_scale1	可选输入	表示量化过程中得到y1使用的smoothScale张量，对应公式中的`smoothScale1Optional`。	FLOAT16、BFLOAT16	ND
smooth_scale2	可选输入	表示量化过程中得到y2使用的smoothScale张量，对应公式中的`smoothScale2Optional`。	FLOAT16、BFLOAT16	ND
epsilon	可选属性	用于防止除0错误，对应公式中的`epsilon`。默认值为1e-6。	FLOAT	-
y1	输出	表示量化输出Tensor，对应公式中的`y1Out`。	INT8	ND
y2	输出	表示量化输出Tensor，对应公式中的`y2Out`。	INT8	ND
y3	输出	表示rmsNorm的FLOAT32类型输出Tensor，对应公式中的`yFP32`。	FLOAT32	ND
y4	输出	表示量化输出Tensor，对应公式中的`y`。	FLOAT16、BFLOAT16	ND
x	输出	表示x1和x2的和，对应公式中的`x`。	FLOAT16、BFLOAT16	ND
scale1	输出	第一路量化的输出，对应公式中的`scale1Out`。	FLOAT32	ND
scale2	输出	第二路量化的输出，对应公式中的`scale2Out`。	FLOAT32	ND