loadKeyStorageRaw
fun loadKeyStorageRaw(layer: Int, startPos: Int = 0, endPos: Int = cache.currentSeqLen): TensorStorage(source)
Load raw TensorStorage for keys, preserving the cache's native encoding.
This is the zero-copy path for backends that can fuse decompression with attention computation (e.g., Metal fused dequant+SDPA).