Creates a square convolution kernel
number of output channels
number of input channels
size of the square kernel (both height and width)