Universal Approximation Theorem cover illustrationA clean cover illustration with a shallow neural network on the left and a target curve with a close approximation on the right.Universal ApproximationSimple unitscompose intorich curvesshallow networksumtarget and approximationtargetapproximation

Universal Approximation Theorem Explained: Why Neural Networks Can Approximate Any Continuous Function

The Universal Approximation Theorem (UAT) gets quoted constantly, but it is usually described in a fuzzier way than it deserves. It does not say neural networks are magically good at every task. It does not say a shallow network is the most practical architecture. It does not say gradient descent will easily find the right weights. What it does say is still important: With a suitable nonlinear activation and enough hidden units, a feedforward network can approximate any continuous function on a bounded domain as closely as we want. ...

April 3, 2026 · 8 min · Nitin

Multi-Layer Perceptron Explained: Dense Networks from First Principles

A multi-layer perceptron (MLP) is one of the simplest and most important neural network architectures. It is not flashy. It is not state of the art for language or vision by itself. But if you do not understand MLPs, a lot of modern deep learning stays blurry. MLPs teach the core structure of neural networks: inputs become vectors layers apply learned linear transforms activations add nonlinearity deeper layers build more useful internal representations They also still matter in practice. Even transformers contain MLP blocks. Recommendation systems, tabular models, and many small classifiers still use dense networks directly. ...

April 3, 2026 · 7 min · Nitin
Forward pass cover illustrationA visual summary of the forward pass from inputs through weighted sum and activation to output, then scaling to matrix form.Forward PassFrom One Neuron to Matrix FormStart with one weighted sum, add a bias, apply an activation,then scale the same idea into a full dense layer.x1x2x3sum+ bReLUa = f(z)yx1w1x2w2x3w3Matrix formz = Wx + ba = f(z)many neuronsThe same computation repeats: inputs -> weighted sum -> activation -> output

Forward Pass Explained: From a Single Neuron to Matrix Form

The forward pass is the part of a neural network that actually produces a prediction. You feed inputs into the model, the model applies a sequence of mathematical operations, and an output comes out the other side. That sounds trivial, but it is one of the most important ideas in deep learning because everything else depends on it: the loss compares the forward-pass output to the target backpropagation differentiates through the forward pass training is just repeating the forward pass and improving it The easiest way to understand the forward pass is to start with a single neuron and then scale it up into a full layer written in matrix form. ...

April 3, 2026 · 7 min · Nitin
Backpropagation cover illustrationA cover image showing the forward pass flowing left to right and gradients flowing backward from loss to parameters.BackpropagationHow Neural Networks LearnForward pass computes values. Backward pass computes how mucheach earlier choice contributed to the final error.xwbzweighted sumReLUa = f(z)Lforward passbackward gradientsBackpropagation is the chain rule applied efficiently over the computation graph.

Backpropagation Explained Visually: How Neural Networks Actually Learn

Backpropagation is the core algorithm that makes neural networks trainable. The forward pass tells the model what prediction it currently makes. Backpropagation tells the model how each weight contributed to the error so the optimizer can update those weights in the right direction. People often hear that backpropagation is “just the chain rule,” which is true but not especially helpful. The useful mental model is this: the forward pass computes values the backward pass computes sensitivities each node only needs its own local derivative the full gradient is built by multiplying those local derivatives along the path If that sounds abstract, it becomes much clearer once you look at one neuron first and then scale up. ...

April 3, 2026 · 8 min · Nitin

Learning from Introduction to Deep Learning

Introduction Into to deep learning Intelligence: The ability to process information and use it for future decision-making. Artificial Intelligence (AI): Empowering computers with the ability to process information and make decisions. Machine Learning (ML): A subset of AI focused on teaching computers to learn from data. Deep Learning (DL): A subset of ML utilizing neural networks to process raw data and inform decisions. Why Deep Learning Now? The recent surge in deep learning’s capabilities can be attributed to three key factors: ...

May 4, 2024 · 7 min · Nitin