Kolmogorov-Arnold Networks: MLP vs KAN, Math, B-Splines, Universal Approximation Theorem

Umar Jamil · Beginner ·📐 ML Fundamentals ·2y ago

Skills: ML Maths Basics90%Neural Network Basics80%

In this video, I will be explaining Kolmogorov-Arnold Networks, a new type of network that was presented in the paper "KAN: Kolmogorov-Arnold Networks" by Liu et al. I will start the video by reviewing Multilayer Perceptrons, to show how the typical Linear layer works in a neural network. I will then introduce the concept of data fitting, which is necessary to understand Bézier Curves and then B-Splines. Before introducing Kolmogorov-Arnold Networks, I will also explain what is the Universal Approximation Theorem for Neural Networks and its equivalent for Kolmogorov-Arnold Networks called Kolmogorov-Arnold Representation Theorem. In the final part of the video, I will explain the structure of this new type of network, by deriving its structure step by step from the formula of the Kolmogorov-Arnold Representation Theorem, while comparing it with Multilayer Perceptrons at the same time. We will also explore some properties of this type of network, for example the easy interpretability and the possibility to perform continual learning. Paper: https://arxiv.org/abs/2404.19756 Slides PDF: https://github.com/hkproj/kan-notes Chapters 00:00:00 - Introduction 00:01:10 - Multilayer Perceptron 00:11:08 - Introduction to data fitting 00:15:36 - Bézier Curves 00:28:12 - B-Splines 00:40:42 - Universal Approximation Theorem 00:45:10 - Kolmogorov-Arnold Representation Theorem 00:46:17 - Kolmogorov-Arnold Networks 00:51:55 - MLP vs KAN 00:55:20 - Learnable functions 00:58:06 - Parameters count 01:00:44 - Grid extension 01:03:37 - Interpretability 01:10:42 - Continual learning

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

More on: ML Maths Basics

View skill →

Coding the GARCH Model : Time Series Talk

Coding the GARCH Model : Time Series Talk

Important Steps I Have Followed To Improve My Data Science Skills- Sharing My Experience

Important Steps I Have Followed To Improve My Data Science Skills- Sharing My Experience

Learn Python FAST for Beginners 🚀#coding #conditionals #loops #functions

Learn Python FAST for Beginners 🚀#coding #conditionals #loops #functions

ChethanAIChronicles

“Hello, world” from scratch on a 6502 — Part 1

“Hello, world” from scratch on a 6502 — Part 1

PCA (Principal Component Analysis) in Python - Machine Learning From Scratch 11 - Python Tutorial

PCA (Principal Component Analysis) in Python - Machine Learning From Scratch 11 - Python Tutorial

ROC and AUC in R

ROC and AUC in R

StatQuest with Josh Starmer

Related AI Lessons

Bitcoin Has Moods

Learn how to build a model to detect Bitcoin mood swings using Python

Medium · Python

Matrix Multiplication at Scale: The Unreasonable Emergence of Intelligence

Discover how matrix multiplication is the foundation of AI and how it enables intelligence at scale

AUROC vs PR-AUC Explained with Coffee Filters and Fraud Detection

Learn to evaluate machine learning models using AUROC and PR-AUC with a coffee filter analogy, crucial for fraud detection and other applications

AUROC vs PR-AUC Explained with Coffee Filters and Fraud Detection

Learn to evaluate model performance using AUROC and PR-AUC with a coffee filter analogy, crucial for fraud detection and other classification tasks

Medium · Machine Learning

Chapters (14)

Introduction

1:10 Multilayer Perceptron

11:08 Introduction to data fitting

15:36 Bézier Curves

28:12 B-Splines

40:42 Universal Approximation Theorem

45:10 Kolmogorov-Arnold Representation Theorem

46:17 Kolmogorov-Arnold Networks

51:55 MLP vs KAN

55:20 Learnable functions

58:06 Parameters count

1:00:44 Grid extension

1:03:37 Interpretability

1:10:42 Continual learning

Advanced Data Structures and Problem-Solving Techniques