Package-level declarations
Types
GGML quantization types.
Parsed model metadata from a GGUF file.
GGUF model parser extending BaseModelParser. Provides metadata-first parsing of GGUF model files using SKaiNET's GGUF I/O capabilities.
ParametersLoader implementation backed by the legacy GGUFReader.
Streaming GGUF parameters loader — the recommended path for loading GGUF models.
Streaming GGUF reader that parses metadata without loading the entire file.
Tensor metadata for streaming access.
Abstracts the mapping from logical tensor roles to GGUF tensor name strings.
Properties
This is a kotlin gguf reader related logic interpreted from python code "gguf-py/gguf/constants.py" of github repo "https://github.com/ggerganov/llama.cpp"
This is a kotlin gguf reader interpreted from python code "gguf-py/gguf/gguf_reader.py" of github repo "https://github.com/ggerganov/llama.cpp"
Functions
Convert a physical byte shape back to a logical element shape.
Android implementation of createRandomAccessSource.
Platform-specific factory for creating RandomAccessSource instances.
JS implementation of createRandomAccessSource.
JVM implementation of createRandomAccessSource.
Native implementation of createRandomAccessSource.
WasmJS implementation of createRandomAccessSource.
WasmWASI implementation of createRandomAccessSource.
Check whether a quantization type uses block quantization (vs element-wise).
Get the block size for a quantization type.
Compute the total byte size for a quantized tensor.
Compute the total number of logical elements from a shape.
Convert a logical element shape to a physical byte shape for quantized storage.
Get the byte size per block for a quantization type.
Validate that a byte array has the correct size for a given quantized tensor.