DequantOps
Shared dequantization operations for GGUF and SafeTensors weight loading.
All functions are module-visible (internal) and stateless. They convert quantized byte payloads into FP32 float arrays.
Functions
Returns (bytesPerBlock, elemsPerBlock) for a given quantization type. Useful for chunked dequantization on single-threaded platforms (WASM).
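As a rough illustration of how a (bytesPerBlock, elemsPerBlock) pair supports chunked dequantization, the sketch below splits a payload into block-aligned byte ranges. The names `QuantType`, `blockSize`, and `chunkRanges` are hypothetical and are not this module's API; the table uses the commonly documented ggml layouts for a few types only.

```kotlin
// Hypothetical stand-in for the library's quantization type enum.
enum class QuantType { F32, F16, Q4_0, Q8_0 }

// Illustrative (bytesPerBlock, elemsPerBlock) table; an assumption, not the real function.
fun blockSize(type: QuantType): Pair<Int, Int> = when (type) {
    QuantType.F32 -> 4 to 1
    QuantType.F16 -> 2 to 1
    QuantType.Q4_0 -> 18 to 32 // 2-byte FP16 scale + 16 bytes of packed 4-bit values
    QuantType.Q8_0 -> 34 to 32 // 2-byte FP16 scale + 32 signed 8-bit values
}

// Splits a payload into block-aligned byte ranges so that each chunk can be
// dequantized on its own, for example between yields on a single-threaded platform.
fun chunkRanges(totalBytes: Int, type: QuantType, maxBlocksPerChunk: Int): List<IntRange> {
    val (bytesPerBlock, _) = blockSize(type)
    require(maxBlocksPerChunk > 0) { "maxBlocksPerChunk must be positive" }
    require(totalBytes % bytesPerBlock == 0) { "payload is not a whole number of blocks" }
    val chunkBytes = bytesPerBlock * maxBlocksPerChunk
    return (0 until totalBytes step chunkBytes).map { start ->
        start until minOf(start + chunkBytes, totalBytes)
    }
}
```

Because every range starts and ends on a block boundary, no chunk ever cuts through a block's scale or packed values.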
Dispatch dequantization based on tensor type for byte arrays.
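A minimal sketch of what type-based dispatch over a byte array might look like follows. It is not the module's implementation: `TensorType`, `dequantize`, `u16`, and `halfToFloat` are hypothetical names, only three types are covered, and little-endian payloads are assumed.

```kotlin
// Hypothetical subset of tensor types.
enum class TensorType { F32, F16, Q8_0 }

// Reads a little-endian unsigned 16-bit value at offset o.
fun u16(b: ByteArray, o: Int): Int =
    (b[o].toInt() and 0xFF) or ((b[o + 1].toInt() and 0xFF) shl 8)

// Converts IEEE 754 half-precision bits to a Float.
fun halfToFloat(h: Int): Float {
    val sign = (h and 0x8000) shl 16
    val exp = (h shr 10) and 0x1F
    val man = h and 0x3FF
    val bits = when {
        exp == 0 && man == 0 -> sign                        // signed zero
        exp == 0 -> {                                       // subnormal: renormalize
            var e = 113
            var m = man
            while ((m and 0x400) == 0) { m = m shl 1; e-- }
            sign or (e shl 23) or ((m and 0x3FF) shl 13)
        }
        exp == 0x1F -> sign or 0x7F800000 or (man shl 13)   // infinity or NaN
        else -> sign or ((exp + 112) shl 23) or (man shl 13) // normal: rebias 15 -> 127
    }
    return Float.fromBits(bits)
}

// Dispatches on the tensor type and returns `count` FP32 values.
fun dequantize(type: TensorType, bytes: ByteArray, count: Int): FloatArray = when (type) {
    TensorType.F32 -> FloatArray(count) { i ->
        val o = i * 4
        Float.fromBits(u16(bytes, o) or (u16(bytes, o + 2) shl 16))
    }
    TensorType.F16 -> FloatArray(count) { i -> halfToFloat(u16(bytes, i * 2)) }
    TensorType.Q8_0 -> FloatArray(count) { i ->
        val block = (i / 32) * 34 // 34-byte block: FP16 scale, then 32 signed bytes
        val scale = halfToFloat(u16(bytes, block))
        scale * bytes[block + 2 + i % 32].toFloat()
    }
}
```

An exhaustive `when` over an enum keeps the dispatch stateless, and the compiler flags any type that is added later without a matching branch.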
Dispatch dequantization based on tensor type for List input.
Handle column-major to row-major conversion for GGUF tensors. Data layout is unchanged — only the shape interpretation changes.
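The sketch below illustrates why the data can stay in place, assuming GGUF lists dimensions with the fastest-varying one first, so that reversing the dimension list yields the row-major shape. The helper names are hypothetical and are not part of this module.

```kotlin
// Hypothetical helper: reinterpret GGUF dims (fastest-varying first) as a row-major shape.
fun toRowMajorShape(ggufDims: List<Int>): List<Int> = ggufDims.reversed()

// Flat offset of an index in a row-major tensor (last dimension varies fastest).
fun rowMajorOffset(shape: List<Int>, index: List<Int>): Int =
    shape.indices.fold(0) { acc, d -> acc * shape[d] + index[d] }

// Flat offset in GGUF order (first dimension varies fastest).
fun ggufOffset(dims: List<Int>, index: List<Int>): Int =
    dims.indices.reversed().fold(0) { acc, d -> acc * dims[d] + index[d] }

fun main() {
    val ggufDims = listOf(4, 3)            // 4 elements per row, 3 rows
    val shape = toRowMajorShape(ggufDims)  // [3, 4]
    val ggufIndex = listOf(2, 1)           // column 2, row 1 in GGUF order
    val rowMajorIndex = ggufIndex.reversed()
    // Both interpretations address the same flat position in the unchanged buffer.
    println(ggufOffset(ggufDims, ggufIndex) == rowMajorOffset(shape, rowMajorIndex))
}
```

Since both offset formulas agree once the dimensions and indices are reversed, no element needs to move.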