Creates a weight matrix for a linear layer with He initialization (good for ReLU)
number of input features
number of output features