Which of the following is true?
- Perceptron networks always have a hidden layer.
- The information gain of an attribute `A` is the entropy of the data before testing on that attribute less
the expected entropy left after testing on that attribute.
- Our perceptron training algorithm assumed the function `g` in the perceptron was a step function.