Aug 16 Deep Learning 101: Transformer Activation Functions Explainer - Sigmoid, ReLU, GELU, Swish Liz McQuillan Deep Learning