PackedBlockStorage
Shared contract for all packed/quantized block tensor storage formats.
Instead of each quantization format (Q4_K, Q8_0, Ternary, …) inventing its own loader, planner, and backend handling path, all packed formats implement this interface. Backends and planners can dispatch on encoding without knowing every possible quantization scheme.
Individual formats still expose format-specific accessors (sub-block scales, code extraction, etc.) through their own sub-interfaces.
Inheritors
Properties
Number of blocks in this storage.
Logical element count.
The physical encoding describing the block layout.
Raw packed byte data containing all blocks.
Physical byte size of the packed data.
Functions
Dequantize a single block to float values.
Dequantize the entire tensor to a FloatArray. Default implementation calls dequantizeBlock for each block.
Convert this packed storage to a TensorStorage descriptor.