Q4_KTensorData

Tensor data interface for Q4_K quantized format.

Q4_K block format (256 elements per block, 144 bytes per block):

Each sub-block (32 elements):

Dequantization: outputi = codei * scale + min

Types

abstract val blockCount: Int

Number of Q4_K blocks in the tensor.

Raw packed data containing all blocks.

abstract fun getBlockD(blockIdx: Int): Float

Get the main scale factor (d) for a block.

abstract fun getBlockDMin(blockIdx: Int): Float

Get the minimum scale factor (dMin) for a block.

abstract fun getCode(blockIdx: Int, elementIdx: Int): Int

Get a 4-bit quantized code value (0..255 elements within block).

abstract fun getSubBlockMin(blockIdx: Int, subBlockIdx: Int): Float

Get the minimum value for a specific sub-block within a block.

abstract fun getSubBlockScale(blockIdx: Int, subBlockIdx: Int): Float

Get the scale for a specific sub-block within a block.

Dequantize Q4_K tensor data to a FloatArray. outputi = codei * scale + min