balanced

fun balanced(numLayers: Int, numHeads: Int, headDim: Int, maxSeqLen: Int): TurboQuantPreset(source)

Balanced preset: TurboQuant-4 for both keys and values.

Symmetric 4-bit compression for both K and V. Good balance between compression ratio and quality.