Training networks to have interpretable gradients is shown to improve their robustness to adversarial perturbations, especially against cross-norm attacks and under heavy perturbation.
D. Dou
Boyang Albert Li
Adam Noack
Isaac Ahern