loadKeyStorageRaw

fun loadKeyStorageRaw(layer: Int, startPos: Int = 0, endPos: Int = cache.currentSeqLen): TensorStorage(source)

Load raw TensorStorage for keys, preserving the cache's native encoding.

This is the zero-copy path for backends that can fuse decompression with attention computation (e.g., Metal fused dequant+SDPA).