Creates a weight matrix for a linear layer with Xavier/Glorot initialization
number of input features
number of output features
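
A minimal sketch of what such a function might look like, using only the Python standard library. The source does not say which Xavier/Glorot variant is used or how the matrix is laid out, so this assumes the uniform variant (weights drawn from U(-a, a) with a = sqrt(6 / (fan_in + fan_out))) and a matrix with one row per input feature and one column per output feature. The names `xavier_uniform`, `fan_in`, `fan_out`, and `seed` are hypothetical and chosen for illustration only.

```python
import math
import random


def xavier_uniform(fan_in, fan_out, seed=None):
    """Return a fan_in x fan_out weight matrix as a list of lists.

    Hypothetical helper: assumes the uniform Glorot variant and a
    (fan_in, fan_out) layout, neither of which is stated in the source.
    """
    if fan_in <= 0 or fan_out <= 0:
        raise ValueError("fan_in and fan_out must be positive")

    # A private generator keeps results reproducible when a seed is given.
    rng = random.Random(seed)

    # Glorot uniform bound: a = sqrt(6 / (fan_in + fan_out)).
    limit = math.sqrt(6.0 / (fan_in + fan_out))

    # One row per input feature, one column per output feature.
    return [
        [rng.uniform(-limit, limit) for _ in range(fan_out)]
        for _ in range(fan_in)
    ]


# Example: a layer mapping 4 input features to 3 output features.
weights = xavier_uniform(4, 3, seed=0)
```

Under these assumptions every entry lies within ±sqrt(6 / (fan_in + fan_out)), so the bound shrinks as the layer gets wider. The normal variant would instead draw from a Gaussian with standard deviation sqrt(2 / (fan_in + fan_out)).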