Tuesday, April 25, 2023

Relu functions

Despite their practical success, Relu activations are theoretically ill-defined. This is due to the existence of numerous trivial local optima where the gradient is zero, and we are optimizing them with approximate local optimizers.

No comments:

Python Terminal Animation for Waiting

A nice small tool for terminal animations from yaspin import yaspin with yaspin(): do_stuff()