Two methods that attempt to remove useless weights after training are reviewed and compared. Both make use of the second derivative information but use different approaches. Both lead, under the same set of hypothesis, to the same selection criterion for pruning unrelevant weights. Practical comparison is also carried out on a small toy problem.
展开▼