the shape of the tensor
the packed byte array (4 ternary values per byte, i.e. 2 bits per value)
the scale factor for FP32 dequantization
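
As a rough illustration of how these three fields could fit together, here is a minimal pure-Python sketch. The container name `PackedTernaryTensor`, the helpers `pack` and `dequantize`, the 2-bit code assignment, and the low-bits-first ordering within a byte are all assumptions made for the example; the source does not specify any of them.

```python
from dataclasses import dataclass
from math import prod
from typing import List, Tuple

# Assumed 2-bit codes for the ternary values; the real encoding may differ.
_ENCODE = {-1: 0b00, 0: 0b01, 1: 0b10}
_DECODE = {code: value for value, code in _ENCODE.items()}


@dataclass
class PackedTernaryTensor:  # hypothetical container name
    shape: Tuple[int, ...]  # the shape of the tensor
    packed: bytes           # 4 ternary values per byte
    scale: float            # scale factor for FP32 dequantization


def pack(values: List[int], shape: Tuple[int, ...], scale: float) -> PackedTernaryTensor:
    """Pack flat ternary values (-1, 0, 1) into bytes, 4 values per byte."""
    if len(values) != prod(shape):
        raise ValueError("value count does not match shape")
    out = bytearray((len(values) + 3) // 4)  # round up to whole bytes
    for i, v in enumerate(values):
        # Value i goes into byte i // 4, at bit offset 2 * (i % 4).
        out[i // 4] |= _ENCODE[v] << (2 * (i % 4))
    return PackedTernaryTensor(shape, bytes(out), scale)


def dequantize(t: PackedTernaryTensor) -> List[float]:
    """Unpack to a flat list of floats: ternary value times scale."""
    n = prod(t.shape)  # read only the real elements, not the padding
    return [
        _DECODE[(t.packed[i // 4] >> (2 * (i % 4))) & 0b11] * t.scale
        for i in range(n)
    ]


t = pack([1, 0, -1, 1, -1, 0], shape=(2, 3), scale=0.5)
print(len(t.packed))   # 2
print(dequantize(t))   # [0.5, 0.0, -0.5, 0.5, -0.5, 0.0]
```

The shape is needed at decode time because the last byte may hold padding: six values occupy two bytes, and only the element count derived from the shape says how many of the eight slots are real.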