Dynamic Tanh Explained - Same or better performance with 8% efficiency improvement

Bunny Labs · Advanced ·📄 Research Papers Explained ·1y ago
This video talks about Dynamic Tanh. How we can achieve same or better result with replacing layer normalization while improving efficiency by around 8%. Research Paper: https://arxiv.org/pdf/2503.10622 Bunny Labs is a division of Bunny Choo Choo, a NLP-based startup focused on education. We created this course to share the knowledge and experience we gained when building Bunny Choo Choo. We are exploring AI voice technology. Please like the video and subscribe us if you cannot distinguish whether the voice is from AI. Please comment if you know that this voice is generated by AI. IG: @bunn…
Watch on YouTube ↗ (saves to browser)
Julia Minson | How to Disagree Better  | Talks at Google
Next Up
Julia Minson | How to Disagree Better | Talks at Google
Talks at Google