loadToMap

Load model weights as mmap-backed tensors.

Return

LlamaWeights containing mmap-backed tensors

Parameters

T

the data type (must be FP32 for mmap loading)

V

the value type (Float for FP32)

ctx

execution context (used to wrap TensorData with ops)