Inference mode uses ForwardValue which doesn't track gradients. This is more memory-efficient when only forward pass is needed.