Foundations

ML Fundamentals

Neural networks, backpropagation, gradient descent — the maths behind AI

12,012
lessons
Skills in this topic
View full skill map →
ML Maths Basics
beginner
Manipulate vectors and matrices
Supervised Learning
beginner
Train decision trees, random forests, and neural nets
Unsupervised Learning
intermediate
Apply k-means and DBSCAN clustering
ML Pipelines
intermediate
Engineer features and handle missing data

Showing 1,239 reads from curated sources

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper 1mo ago
On the "Causality" Step in Policy Gradient Derivations: A Pedagogical Reconciliation of Full Return and Reward-to-Go
arXiv:2604.04686v1 Announce Type: new Abstract: In introductory presentations of policy gradients, one often derives the REINFORCE estimator using the full traj
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Fine-tuning DeepSeek-OCR-2 for Molecular Structure Recognition
arXiv:2604.03476v1 Announce Type: cross Abstract: Optical Chemical Structure Recognition (OCSR) is critical for converting 2D molecular diagrams from printed li
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
RDEx-CMOP: Feasibility-Aware Indicator-Guided Differential Evolution for Fixed-Budget Constrained Multiobjective Optimization
arXiv:2604.03708v1 Announce Type: cross Abstract: Constrained multiobjective optimisation requires fast feasibility attainment together with stable convergence
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
An Improved Last-Iterate Convergence Rate for Anchored Gradient Descent Ascent
arXiv:2604.03782v1 Announce Type: cross Abstract: We analyze the last-iterate convergence of the Anchored Gradient Descent Ascent algorithm for smooth convex-co
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Supervised Dimensionality Reduction Revisited: Why LDA on Frozen CNN Features Deserves a Second Look
arXiv:2604.03928v1 Announce Type: cross Abstract: Effective ride-hailing dispatch requires anticipating demand patterns that vary substantially across time-of-d
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Parent Selection Mechanisms in Elitist Crossover-Based Algorithms
arXiv:2604.04083v1 Announce Type: cross Abstract: Parent selection methods are widely used in evolutionary computation to accelerate the optimization process, y
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Fine-grained Analysis of Stability and Generalization for Stochastic Bilevel Optimization
arXiv:2604.04090v1 Announce Type: cross Abstract: Stochastic bilevel optimization (SBO) has been integrated into many machine learning paradigms recently, inclu
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Learning Robust Visual Features in Computed Tomography Enables Efficient Transfer Learning for Clinical Tasks
arXiv:2604.04133v1 Announce Type: cross Abstract: There is substantial interest in developing artificial intelligence systems to support radiologists across tas
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Incomplete Multi-View Multi-Label Classification via Shared Codebook and Fused-Teacher Self-Distillation
arXiv:2604.04170v1 Announce Type: cross Abstract: Although multi-view multi-label learning has been extensively studied, research on the dual-missing scenario,
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Good Rankings, Wrong Probabilities: A Calibration Audit of Multimodal Cancer Survival Models
arXiv:2604.04239v1 Announce Type: cross Abstract: Multimodal deep learning models that fuse whole-slide histopathology images with genomic data have achieved st
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
A Persistent Homology Design Space for 3D Point Cloud Deep Learning
arXiv:2604.04299v1 Announce Type: cross Abstract: Persistent Homology (PH) offers stable, multi-scale descriptors of intrinsic shape structure by capturing conn
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Boosted Distributional Reinforcement Learning: Analysis and Healthcare Applications
arXiv:2604.04334v1 Announce Type: cross Abstract: Researchers and practitioners are increasingly considering reinforcement learning to optimize decisions in com
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Integer-Only Operations on Extreme Learning Machine Test Time Classification
arXiv:2604.04363v1 Announce Type: cross Abstract: We present a theoretical analysis and empirical evaluations of a novel set of techniques for computational cos
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Context is All You Need
arXiv:2604.04364v1 Announce Type: cross Abstract: Artificial Neural Networks (ANNs) are increasingly deployed across diverse real-world settings, where they mus
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Reproducibility study on how to find Spurious Correlations, Shortcut Learning, Clever Hans or Group-Distributional non-robustness and how to fix them
arXiv:2604.04518v1 Announce Type: cross Abstract: Deep Neural Networks (DNNs) are increasingly utilized in high-stakes domains like medical diagnostics and auto
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Cardinality Estimation for High Dimensional Similarity Queries with Adaptive Bucket Probing
arXiv:2604.04603v1 Announce Type: cross Abstract: In this work, we address the problem of cardinality estimation for similarity search in high-dimensional space
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
A Clinical Point Cloud Paradigm for In-Hospital Mortality Prediction from Multi-Level Incomplete Multimodal EHRs
arXiv:2604.04614v1 Announce Type: cross Abstract: Deep learning-based modeling of multimodal Electronic Health Records (EHRs) has become an important approach f
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Training-Free Refinement of Flow Matching with Divergence-based Sampling
arXiv:2604.04646v1 Announce Type: cross Abstract: Flow-based models learn a target distribution by modeling a marginal velocity field, defined as the average of
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Grokking as Dimensional Phase Transition in Neural Networks
arXiv:2604.04655v1 Announce Type: cross Abstract: Neural network grokking -- the abrupt memorization-to-generalization transition -- challenges our understandin
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
The Infinite-Dimensional Nature of Spectroscopy and Why Models Succeed, Fail, and Mislead
arXiv:2604.04717v1 Announce Type: cross Abstract: Machine learning (ML) models have achieved strikingly high accuracies in spectroscopic classification tasks, o
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Sampling Parallelism for Fast and Efficient Bayesian Learning
arXiv:2604.04736v1 Announce Type: cross Abstract: Machine learning models, and deep neural networks in particular, are increasingly deployed in risk-sensitive d
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Selecting Decision-Relevant Concepts in Reinforcement Learning
arXiv:2604.04808v1 Announce Type: cross Abstract: Training interpretable concept-based policies requires practitioners to manually select which human-understand
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Muon Dynamics as a Spectral Wasserstein Flow
arXiv:2604.04891v1 Announce Type: cross Abstract: Gradient normalization is central in deep-learning optimization because it stabilizes training and reduces sen
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Post-detection inference for sequential changepoint localization
arXiv:2502.06096v5 Announce Type: replace-cross Abstract: This paper addresses a fundamental but largely unexplored challenge in sequential changepoint analysis
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Causality-Based Scores Alignment in Explainable Data Management
arXiv:2503.14469v5 Announce Type: replace-cross Abstract: Different attribution scores have been proposed to quantify the relevance of database tuples for query
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Bayesian Hierarchical Invariant Prediction
arXiv:2505.11211v3 Announce Type: replace-cross Abstract: We propose Bayesian Hierarchical Invariant Prediction (BHIP) reframing Invariant Causal Prediction (IC
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Understanding Task Representations in Neural Networks via Bayesian Ablation
arXiv:2505.13742v2 Announce Type: replace-cross Abstract: Neural networks are powerful tools for cognitive modeling due to their flexibility and emergent proper
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
PRISM: Lightweight Multivariate Time-Series Classification through Symmetric Multi-Resolution Convolutional Layers
arXiv:2508.04503v3 Announce Type: replace-cross Abstract: Multivariate time series classification supports applications from wearable sensing to biomedical moni
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Challenges in Deep Learning-Based Small Organ Segmentation: A Benchmarking Perspective for Medical Research with Limited Datasets
arXiv:2509.05892v2 Announce Type: replace-cross Abstract: Accurate segmentation of carotid artery structures in histopathological images is vital for cardiovasc
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
A fine-grained look at causal effects in causal spaces
arXiv:2512.11919v3 Announce Type: replace-cross Abstract: The notion of causal effect is fundamental across many scientific disciplines. Traditionally, quantita
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Teaching Machine Learning Fundamentals with LEGO Robotics
arXiv:2601.19376v2 Announce Type: replace-cross Abstract: This paper presents the web-based platform Machine Learning with Bricks and an accompanying two-day co
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
ST-BiBench: Benchmarking Multi-Stream Multimodal Coordination in Bimanual Embodied Tasks for MLLMs
arXiv:2602.08392v2 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) have significantly advanced the landscape of embodied AI, yet
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
The Malignant Tail: Spectral Segregation of Label Noise in Over-Parameterized Networks
arXiv:2603.02293v2 Announce Type: replace-cross Abstract: While implicit regularization facilitates benign overfitting in low-noise regimes, recent theoretical
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Hindsight-Anchored Policy Optimization: Turning Failure into Feedback in Sparse Reward Settings
arXiv:2603.11321v2 Announce Type: replace-cross Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a promising paradigm for post-tra
InfoQ AI/ML 📐 ML Fundamentals ⚡ AI Lesson 1mo ago
Pinterest Reduces Spark OOM Failures by 96% Through Auto Memory Retries
Pinterest Engineering cut Apache Spark out-of-memory failures by 96% using improved observability, configuration tuning, and automatic memory retries. Staged ro
InfoQ AI/ML 📐 ML Fundamentals ⚡ AI Lesson 1mo ago
Article: A Better Alternative to Reducing CI Regression Test Suite Sizes
How can you focus in a sea of results from a large regression test suite? This article describes a stochastic approach that relies on some degree of redundancy
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
TRACE: Traceroute-based Internet Route change Analysis with Ensemble Learning
arXiv:2604.02361v1 Announce Type: cross Abstract: Detecting Internet routing instability is a critical yet challenging task, particularly when relying solely on
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Generative models on phase space
arXiv:2604.02415v1 Announce Type: cross Abstract: Deep generative models such as diffusion and flow matching are powerful machine learning tools capable of lear
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Self-Directed Task Identification
arXiv:2604.02430v1 Announce Type: cross Abstract: In this work, we present a novel machine learning framework called Self-Directed Task Identification (SDTI), w
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
From Elevation Maps To Contour Lines: SVM and Decision Trees to Detect Violin Width Reduction
arXiv:2604.02446v1 Announce Type: cross Abstract: We explore the automatic detection of violin width reduction using 3D photogrammetric meshes. We compare SVM a
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Managing Diabetic Retinopathy with Deep Learning: A Data Centric Overview
arXiv:2604.02448v1 Announce Type: cross Abstract: Diabetic Retinopathy (DR) is a serious microvascular complication of diabetes, and one of the leading causes o
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Sparse Bayesian Learning Algorithms Revisited: From Learning Majorizers to Structured Algorithmic Learning using Neural Networks
arXiv:2604.02513v1 Announce Type: cross Abstract: Sparse Bayesian Learning is one of the most popular sparse signal recovery methods, and various algorithms exi
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training
arXiv:2604.02651v1 Announce Type: cross Abstract: Graph neural networks (GNNs) are widely used for learning on graph datasets derived from various real-world sc
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Rethinking Forward Processes for Score-Based Data Assimilation in High Dimensions
arXiv:2604.02889v1 Announce Type: cross Abstract: Data assimilation is the process of estimating the time-evolving state of a dynamical system by integrating mo
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Learning from Synthetic Data via Provenance-Based Input Gradient Guidance
arXiv:2604.02946v1 Announce Type: cross Abstract: Learning methods using synthetic data have attracted attention as an effective approach for increasing the div
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
User-Aware Conditional Generative Total Correlation Learning for Multi-Modal Recommendation
arXiv:2604.03014v1 Announce Type: cross Abstract: Multi-modal recommendation (MMR) enriches item representations by introducing item content, e.g., visual and t
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
f-INE: A Hypothesis Testing Framework for Estimating Influence under Training Randomness
arXiv:2510.10510v2 Announce Type: replace-cross Abstract: Influence estimation methods promise to explain and debug machine learning by estimating the impact of
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Integrated representational signatures strengthen specificity in brains and models
arXiv:2510.20847v2 Announce Type: replace-cross Abstract: The extent to which different neural or artificial neural networks (models) rely on equivalent represe