loadToMap
Loads model weights as memory-mapped (mmap-backed) tensors.
Return
LlamaWeights containing mmap-backed tensors
Parameters
T
the data type (must be FP32 for mmap loading)
V
the value type (Float for FP32)
ctx
execution context (used to wrap the loaded TensorData with tensor operations)
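
To illustrate what "mmap-backed" means here, the following is a minimal sketch using only the JVM standard library, not this library's actual implementation. The helper name mapFloats and the raw little-endian file layout are assumptions made for the example. It maps a file of FP32 values into memory and reads them through a FloatBuffer view, so values are paged in on access rather than copied into the heap up front. Presumably this is also why the FP32 restriction exists: the mapped bytes can be read as Float values directly, with no conversion step.

```kotlin
import java.nio.ByteBuffer
import java.nio.ByteOrder
import java.nio.FloatBuffer
import java.nio.channels.FileChannel
import java.nio.file.Files
import java.nio.file.Path
import java.nio.file.StandardOpenOption

// Hypothetical helper, not part of the library: map a file of raw
// little-endian FP32 values into memory and view it as a FloatBuffer.
fun mapFloats(path: Path): FloatBuffer =
    FileChannel.open(path, StandardOpenOption.READ).use { channel ->
        // The mapping remains valid after the channel is closed.
        channel.map(FileChannel.MapMode.READ_ONLY, 0, channel.size())
            .order(ByteOrder.LITTLE_ENDIAN)
            .asFloatBuffer()
    }

fun main() {
    // Write four FP32 values to a temporary file to stand in for a weight tensor.
    val path = Files.createTempFile("weights", ".bin")
    val bytes = ByteBuffer.allocate(4 * Float.SIZE_BYTES).order(ByteOrder.LITTLE_ENDIAN)
    floatArrayOf(1.0f, 2.0f, 3.0f, 4.0f).forEach { bytes.putFloat(it) }
    Files.write(path, bytes.array())

    // Reads go through the mapping; nothing is copied into a FloatArray.
    val weights = mapFloats(path)
    println(weights.get(2))
}
```

The returned buffer is read-only, which matches the usual way weights are consumed at inference time. A real loader would also need to handle tensor offsets and shapes within the file, which this sketch leaves out.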