Which of the following is true?
- Not being able to fit even a small dataset to your NN model might imply a software defect rather than underfitting.
- NNs are inherently serial so do not benefit from GPU implementations.
- Data parallelism in NNs is when multiple machines work on the same input at the same time, each running different parts of the model.