matmulQ8_0
fun matmulQ8_0(input: Tensor&lt;FP32, Float&gt;, weights: Q8_0TensorData, ctx: ExecutionContext): Tensor&lt;FP32, Float&gt;
Matrix multiplication with Q8_0 quantized weights.
Return
FP32 output tensor of shape [batch, outputDim] or [outputDim]
Parameters
input
FP32 input tensor of shape [batch, inputDim] or [inputDim]
weights
Q8_0 quantized weight data of shape [inputDim, outputDim]
ctx
ExecutionContext for creating the output tensor
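To illustrate what a Q8_0 matmul involves, here is a minimal, self-contained sketch assuming the common ggml-style Q8_0 layout: weights are stored in blocks of 32 int8 values sharing one FP32 scale, and each value dequantizes as scale * q. The names below (Q8Block, quantizeBlock, dotQ8) are hypothetical and not part of this library's API; the library's Tensor, Q8_0TensorData, and ExecutionContext types are not reproduced here.

```kotlin
import kotlin.math.abs
import kotlin.math.round

// Block size assumed for the Q8_0 format: 32 int8 quants per FP32 scale.
const val QK8_0 = 32

// One quantized block: a single FP32 scale plus 32 int8 quants.
// Illustrative type, not the library's actual storage layout.
class Q8Block(val scale: Float, val quants: ByteArray)

// Quantize 32 floats starting at `offset`: scale = max|x| / 127, q = round(x / scale).
fun quantizeBlock(x: FloatArray, offset: Int): Q8Block {
    var amax = 0f
    for (i in 0 until QK8_0) amax = maxOf(amax, abs(x[offset + i]))
    val scale = if (amax == 0f) 1f else amax / 127f
    val quants = ByteArray(QK8_0) { i ->
        round(x[offset + i] / scale).toInt().coerceIn(-127, 127).toByte()
    }
    return Q8Block(scale, quants)
}

// Dot product of an FP32 input row with one quantized weight column:
// accumulate q * x within each block, then apply the block's scale once.
fun dotQ8(x: FloatArray, blocks: List<Q8Block>): Float {
    var sum = 0f
    for ((b, blk) in blocks.withIndex()) {
        var acc = 0f
        for (i in 0 until QK8_0) acc += blk.quants[i] * x[b * QK8_0 + i]
        sum += blk.scale * acc
    }
    return sum
}

fun main() {
    val inputDim = 64
    val x = FloatArray(inputDim) { (it - 31) / 17f }   // FP32 input row
    val w = FloatArray(inputDim) { (it % 7 - 3) / 5f } // one weight column

    // Quantize the weight column block by block, as Q8_0TensorData would store it.
    val blocks = (0 until inputDim step QK8_0).map { quantizeBlock(w, it) }

    val exact = x.indices.sumOf { (x[it] * w[it]).toDouble() }.toFloat()
    val approx = dotQ8(x, blocks)
    println("exact=$exact quantized=$approx")
}
```

A full matmul repeats this dot product for every output column (and every batch row); deferring the scale multiplication to once per block is what makes the quantized inner loop cheap.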