skainet-io-gguf/sk.ainet.io.gguf.dequant/QuantPolicy

QuantPolicy

enum QuantPolicy : Enum<QuantPolicy> (source)

Controls how quantized tensors are handled during weight loading.

Shared across all weight loaders (LLaMA, Gemma, etc.).

Entries

Keep quantized payloads as raw bytes (Int8 tensor) with quantized shape.

DEQUANTIZE_TO_FP32

DEQUANTIZE_TO_FP32

Dequantize to FP32 on load.

NATIVE_OPTIMIZED

NATIVE_OPTIMIZED

Mixed mode: dequantize F32/F16/BF16 tensors to FP32, but keep quantized weight tensors (Q4_0, Q8_0, etc.) as raw bytes for native kernel consumption.

Properties

val entries: EnumEntries<QuantPolicy>

Returns a representation of an immutable list of all enum entries, in the order they're declared.

Functions

fun valueOf(value: String): QuantPolicy

Returns the enum constant of this type with the specified name. The string must match exactly an identifier used to declare an enum constant in this type. (Extraneous whitespace characters are not permitted.)

fun values(): Array<QuantPolicy>

Returns an array containing the constants of this enum type, in the order they're declared.