generateDenseLayer
Generates C code for Dense layer operations using nested loops. Follows existing DefaultCpuOps implementation patterns for matrix-vector multiplication.
The generated code implements: output = input * weight^T + bias This matches the Linear layer forward pass: input.matmul(weight.t()) + bias
Return
LayerCode containing generated C code fragment
Parameters
node
GraphNode representing a Dense/Linear layer