com.microsoft - DynamicQuantizeMatMul#
DynamicQuantizeMatMul - 1 (com.microsoft)#
Version
domain: com.microsoft
since_version: 1
function:
support_level:
shape inference:
This version of the operator has been available since version 1 of domain com.microsoft.
Summary
Inputs
Between 3 and 5 inputs.
A (heterogeneous) - T1: N-dimensional matrix A
B (heterogeneous) - T2: N-dimensional matrix B
b_scale (heterogeneous) - T1: Scale of quantized input ‘B’. It could be a scalar or a 1-D tensor, which means a per-tensor or per-column quantization. If it’s a 1-D tensor, its number of elements should be equal to the number of columns of input ‘B’.
b_zero_point (optional, heterogeneous) - T2: Zero point tensor for input ‘B’. It’s optional and default value is 0. It could be a scalar or a 1-D tensor, which means a per-tensor or per-column quantization. If it’s a 1-D tensor, its number of elements should be equal to the number of columns of input ‘B’.
bias (optional, heterogeneous) - T1: 1D input tensor, whose dimension is same as B’s last dimension
Outputs
Y (heterogeneous) - T1: Matrix multiply results from A * B
Examples