loadLlamaRuntimeWeightsDequantizedStreaming

Load LLaMA runtime weights using streaming API with dequantization. Suitable for large models >2GB.


Backward-compatible overload defaulting to FP32.