AccelerateCpuOps
CPU operations accelerated by Apple's Accelerate framework. Overrides hot-path operations (matmul, elementwise, reductions) with hardware-optimized routines that leverage ARM NEON and AMX.
Falls through to DefaultCpuOpsBase for non-FP32, non-contiguous, or complex broadcasting cases.