what's the convention on the meaning of "pre-training" vs "training from scratch" ?
Is this a nomenclature shift?
training from scratch would be initializing a neural network, and training it to add numbers directly.
what's the convention on the meaning of "pre-training" vs "training from scratch" ?
Is this a nomenclature shift?