Why does VGG16 use ReLU after each convolution layer?


In the CS231n course, it is said that we want zero-centered data to prevent the local gradient from always having the same sign as the upstream gradient, which causes inefficient, zig-zagging gradient updates. But applying ReLU in every layer produces only non-negative activations as input to the next layer, so how is the inefficient-gradient-update problem solved?
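To make the concern concrete, here is a minimal NumPy sketch of the sign coupling described above (the sizes and values are made up for illustration). For a single neuron y = w·x + b, the gradient of the loss with respect to each weight is dL/dw_i = x_i * dL/dy, so if every x_i is positive, as it is for post-ReLU activations, every component of dL/dw shares the sign of the scalar upstream gradient dL/dy:

```python
import numpy as np

rng = np.random.default_rng(0)

x = rng.random(5)        # all-positive inputs, as after a ReLU (illustrative values)
upstream = -0.7          # hypothetical scalar upstream gradient dL/dy

grad_w = x * upstream    # dL/dw_i = x_i * dL/dy

print(grad_w)            # every entry is negative here
# all weight-gradient components share the sign of the upstream gradient:
print(np.all(np.sign(grad_w) == np.sign(upstream)))  # True
```

So within a single update step, all weights of that neuron can only move in the same direction together, which is the zig-zag inefficiency the course warns about.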


There are 0 answers