Principal Component Analysis (PCA) Explained With Eigenvectors (Math + Python)#machinelearning
Principal Component Analysis (PCA) is one of the most important mathematical techniques in Machine Learning and Data Science for reducing the number of features while keeping the most important information.
It is widely used in:
• Data preprocessing
• Feature engineering
• Image compression
• Visualization of high-dimensional data
• Noise reduction
🔹 1. The Problem PCA Solves
Real datasets often have many correlated features.
Example dataset:
Height Weight
170 65
175 70
180 75
These variables contain redundant information.
PCA transforms data into new independent variables.
🔹 2. PCA Core Idea
PCA finds directions where the data varies the most.
These directions are called Principal Components.
Mathematically:
First principal component:
max Var(wᵀX)
Subject to:
||w|| = 1
This means we find a direction w that maximizes data variance.
🔹 3. Covariance Matrix
PCA begins with the covariance matrix:
Cov(X) = (1/n) XᵀX
The covariance matrix measures how features vary together.
Example:
High covariance → strong relationship between variables.
🔹 4. Eigenvectors and Eigenvalues
PCA solves:
Cov(X) v = λ v
Where:
v = eigenvector
λ = eigenvalue
Interpretation:
Eigenvector → direction of maximum variance
Eigenvalue → magnitude of variance
🔹 5. Dimensionality Reduction
If we have 10 features, PCA can reduce them to 2 or 3 components while preserving most information.
Example:
Original dataset:
1000 samples × 50 features
After PCA:
1000 samples × 5 features
This speeds up machine learning models.
🔹 6. Python Implementation
In the code above we performed:
1️⃣ Data centering
2️⃣ Covariance calculation
3️⃣ Eigen decomposition
4️⃣ PCA using sklearn
Important tool:
from sklearn.decomposition import PCA
🔹 7. Where PCA Is Used
PCA is used in many real applications:
✔ Image compression
✔ Face recognition (Eigenfaces)
✔ Data visualization (2D/3D plots)
✔ Removing noise from datasets
✔ Preprocessing before ML models
🎯 INTERVIEW QUESTIONS (
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
More on: Unsupervised Learning
View skill →Related AI Lessons
🎓
Tutor Explanation
DeepCamp AI