The Core Problem: What Happens Between the Measurements?
Let’s begin with a simple, practical scenario. Imagine you have a set of precise measurements. These could be the altitude of a drone at specific times, the pressure inside an engine cylinder at different crank angles, or the value of a stock at the close of each day. You have a set of points:

$$(x_0, y_0), (x_1, y_1), \dots, (x_n, y_n)$$
You know the value at these points. But the crucial question is: what is the value between these points? If you plot these points on a graph, your brain almost automatically connects them, imagining a smooth curve passing through them. The task of interpolation is to make this imagined curve mathematically concrete.
![](/Semester-3/Numerical-Methods-for-Computer-Science/Lecture-Notes/attachments/Pasted-image-20250923180140.png)
The Interpolation Task: A Formal Statement
Given a set of data points $(x_i, y_i)$ for $i = 0, 1, \dots, n$, where all the $x_i$ are distinct, the goal is to find a function $p(x)$ that satisfies the interpolation conditions:

$$p(x_i) = y_i \quad \text{for } i = 0, 1, \dots, n$$
We are assuming our measurements are exact for now. The case of noisy data, where the curve shouldn’t necessarily pass through every point, is a problem of regression or approximation, which we’ll discuss later.
The First Idea: Just Connect the Dots
The most straightforward way to create a curve is to draw straight lines between consecutive points. This is called piecewise linear interpolation. It’s simple and intuitive.
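As a quick illustration, here is a minimal sketch of this idea using NumPy’s `np.interp`; the sample times and altitudes are invented for demonstration:

```python
import numpy as np

# Invented sample data: drone altitude (m) measured at times (s)
t = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
h = np.array([0.0, 4.8, 9.1, 12.5, 14.2])

# np.interp evaluates the piecewise linear interpolant:
# straight lines between consecutive data points
t_query = np.linspace(0.0, 4.0, 9)
print(np.interp(t_query, t, h))
```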
However, as was pointed out in the lecture, this approach has a significant drawback: the resulting function has sharp corners at each data point. It is continuous, but it is not differentiable at the points $x_i$. For many physical systems, like the trajectory of a drone, we expect a smooth path without instantaneous changes in direction. We need a smoother solution.
The Framework: Choosing Our Reality from Function Spaces
The problem is that there are infinitely many possible smooth curves that can pass through a given set of points. How do we choose just one? We need a systematic approach.
This is where the idea of function spaces comes in. We assume that our discrete measurements are just samples from some “true” but unknown underlying function, let’s call it $f$. This function lives in some vast, infinite-dimensional space of functions, $\mathcal{F}$. For example, $\mathcal{F}$ could be the space of all continuous functions, $C([a, b])$.
Since a computer cannot work with an infinite-dimensional object, we must simplify. We choose a much smaller, finite-dimensional subspace, let’s call it $V \subset \mathcal{F}$, and search for our solution only within that subspace. This is a modeling choice. We are saying, “I don’t know what the true function is, but I’m going to assume it can be well-approximated by a function from my chosen simple family, $V$.”
Basis Functions: The Building Blocks of Our Model
How do we define this simpler space $V$? We define it with a basis. Just as the vectors $e_1, e_2, e_3$ form a basis for 3D space, we can choose a set of basis functions $\phi_0, \phi_1, \dots, \phi_n$ to span our function space $V$.
Any function $p$ in our chosen space $V$ can then be written as a unique linear combination of these basis functions:

$$p(x) = \sum_{j=0}^{n} c_j \,\phi_j(x)$$
The interpolation problem is now transformed from a vague search for a “function” into a concrete, algebraic problem: find the specific coefficients $c_0, c_1, \dots, c_n$ that make our model satisfy the interpolation conditions.
The choice of basis is the most critical decision we will make. It defines the character of our solution and the difficulty of finding it.
- Polynomials: A natural choice for smooth functions.
- Trigonometric Functions (Sines and Cosines): The ideal choice for periodic phenomena. This leads to Fourier analysis and the Fast Fourier Transform (FFT), one of the most important algorithms ever developed.
- Splines: Piecewise polynomials that are smoothly connected. These are used everywhere in computer graphics and engineering design.
For now, we will focus on the simplest and most fundamental choice: polynomials.
Polynomial Interpolation: The Naive Approach (and Why It Fails)
Let’s try to find a single polynomial that passes through all of our data points. To satisfy $n+1$ conditions, we need $n+1$ degrees of freedom. This leads us to choose a polynomial of degree at most $n$.
The Monomial Basis
The most obvious basis for the space of polynomials of degree at most $n$ is the monomial basis: $1, x, x^2, \dots, x^n$. Our interpolating polynomial is then:

$$p(x) = a_0 + a_1 x + a_2 x^2 + \dots + a_n x^n$$
To find the coefficients $a_0, \dots, a_n$, we enforce the interpolation conditions $p(x_i) = y_i$ for each point. This gives us a system of $n+1$ linear equations in $n+1$ unknowns.
![](/Semester-3/Numerical-Methods-for-Computer-Science/Lecture-Notes/attachments/Pasted-image-20250923180255.png)
This system, $V\mathbf{a} = \mathbf{y}$, involves the Vandermonde matrix $V$. In theory, this matrix is invertible as long as the points $x_i$ are distinct, which means a unique solution for the coefficients exists.
What is a Vandermonde Matrix?
A Vandermonde matrix is a very structured matrix that shows up whenever you express a polynomial in the monomial basis and enforce interpolation conditions. For interpolation points $x_0, x_1, \dots, x_n$, it looks like this:

$$V = \begin{pmatrix} 1 & x_0 & x_0^2 & \cdots & x_0^n \\ 1 & x_1 & x_1^2 & \cdots & x_1^n \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 1 & x_n & x_n^2 & \cdots & x_n^n \end{pmatrix}$$
Each row corresponds to plugging one $x_i$ into the polynomial, and each column corresponds to one power of $x$.
Why does it show up here? Because when we write

$$p(x) = a_0 + a_1 x + a_2 x^2 + \dots + a_n x^n$$

and demand $p(x_i) = y_i$, we are literally saying:

$$a_0 + a_1 x_i + a_2 x_i^2 + \dots + a_n x_i^n = y_i$$

for each $i$. If you stack these equations for all $i = 0, \dots, n$, you get the matrix-vector system $V\mathbf{a} = \mathbf{y}$.
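As a sketch of what this looks like in practice (the data values are invented), NumPy can assemble and solve the system directly; note that `np.vander` orders columns from the highest power down by default, so `increasing=True` is needed to match the layout above:

```python
import numpy as np

# Invented sample data
x = np.array([0.0, 1.0, 2.0, 3.0])
y = np.array([1.0, 2.0, 0.0, 5.0])

# One row per point x_i, columns are the powers x_i^0 ... x_i^n
V = np.vander(x, increasing=True)

# Solve V a = y for the monomial coefficients a_0, ..., a_n
a = np.linalg.solve(V, y)

# Sanity check: the polynomial reproduces the data
print(np.allclose(V @ a, y))  # True
```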
Why are Vandermonde matrices “bad”? Two main reasons:
- Ill-conditioning: The determinant of $V$ is the product of differences $\prod_{0 \le i < j \le n} (x_j - x_i)$. If the $x_i$ are close to each other or $n$ is large, the determinant becomes very small, meaning the matrix is nearly singular. That makes numerical inversion unstable: small errors in input create huge errors in output.
- Growth of powers: In each row, you have powers like $x_i^n$. For moderately large $n$, these numbers can explode or vanish depending on the size of $|x_i|$. This amplifies rounding errors when working in floating-point arithmetic.
In short: Vandermonde matrices are the unavoidable result of using the monomial basis in interpolation, but they are numerically treacherous.
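You can observe this treachery directly: a small sketch that prints the condition number of the Vandermonde matrix for equally spaced nodes on $[-1, 1]$ shows it exploding as $n$ grows:

```python
import numpy as np

# Condition number of the Vandermonde matrix for equally
# spaced nodes on [-1, 1]: it grows explosively with n.
for n in [5, 10, 15, 20]:
    x = np.linspace(-1.0, 1.0, n + 1)
    V = np.vander(x, increasing=True)
    print(n, np.linalg.cond(V))
```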
Problem solved? Not at all. This approach is a numerical disaster.
Why the Monomial Basis is a Trap
- Computational Cost: Solving the dense Vandermonde system with Gaussian elimination costs $O(n^3)$ operations. For many points, this is far too slow.
- Numerical Instability: The Vandermonde matrix is famously ill-conditioned. This means it is “almost singular.” Tiny rounding errors in your input data get magnified into enormous errors in the computed coefficients. As the lecturer stated, for even a moderate number of points, the results are “brutal schlecht” (brutally bad) and “völlig sinnlos” (completely senseless). Standard numerical software will even warn you not to trust the solution.
Even if we could find the coefficients exactly, evaluating the polynomial in its standard form is also prone to numerical issues. A much more stable and efficient method for evaluation is Horner’s scheme, which uses a nested form:

$$p(x) = a_0 + x\left(a_1 + x\left(a_2 + \dots + x\left(a_{n-1} + x\, a_n\right)\dots\right)\right)$$
This reduces the number of multiplications and is less susceptible to overflow and rounding errors.
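A minimal sketch of Horner’s scheme, with coefficients ordered $a_0, \dots, a_n$ as above:

```python
def horner(coeffs, x):
    """Evaluate a_0 + a_1*x + ... + a_n*x^n by nested multiplication.

    coeffs: sequence [a_0, a_1, ..., a_n] in increasing-power order.
    """
    result = 0.0
    for a in reversed(coeffs):  # start with a_n, nest inward
        result = result * x + a
    return result

# Example: p(x) = 1 + 2x + 3x^2, so p(2) = 1 + 4 + 12 = 17
print(horner([1.0, 2.0, 3.0], 2.0))  # 17.0
```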
A Better Way: The Newton Basis
The failure of the monomial basis does not mean polynomial interpolation is a bad idea. It means we chose the wrong building blocks. We need a smarter basis.
The Newton form of the interpolating polynomial is built on a beautifully simple, incremental idea. Imagine you are receiving your data points one by one.
- First point $(x_0, y_0)$: The simplest polynomial that passes through this point is a constant function, a polynomial of degree 0: $p_0(x) = y_0$.
- Second point $(x_1, y_1)$: We want to find $p_1(x)$ that passes through both points. We can write it as a correction to our previous polynomial. The correction term must be zero at $x_0$ so we don’t mess up our first condition. The simplest way to ensure this is to include a factor of $(x - x_0)$. So, we try: $p_1(x) = p_0(x) + c_1 (x - x_0)$. We find the constant $c_1$ by enforcing $p_1(x_1) = y_1$, which gives $c_1 = \frac{y_1 - y_0}{x_1 - x_0}$.
- Third point $(x_2, y_2)$: We update again. The correction term must now be zero at both $x_0$ and $x_1$. The simplest way is to include factors of $(x - x_0)$ and $(x - x_1)$: $p_2(x) = p_1(x) + c_2 (x - x_0)(x - x_1)$. We find $c_2$ by enforcing $p_2(x_2) = y_2$; a short worked example follows this list.
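For instance, with the (invented) points $(0, 1)$, $(1, 2)$, $(2, 0)$, the construction runs:

$$p_0(x) = 1, \qquad p_1(x) = 1 + \frac{2 - 1}{1 - 0}\,(x - 0) = 1 + x,$$

$$p_2(x) = 1 + x + c_2\, x (x - 1), \qquad p_2(2) = 3 + 2 c_2 = 0 \;\Rightarrow\; c_2 = -\tfrac{3}{2}.$$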
This incremental process defines the Newton basis functions:

$$N_0(x) = 1, \qquad N_k(x) = \prod_{j=0}^{k-1} (x - x_j) \quad \text{for } k = 1, \dots, n$$
The interpolating polynomial is then written in the Newton form:

$$p(x) = c_0 N_0(x) + c_1 N_1(x) + \dots + c_n N_n(x) = \sum_{k=0}^{n} c_k N_k(x)$$
The coefficients $c_k$ are called divided differences.
![](/Semester-3/Numerical-Methods-for-Computer-Science/Lecture-Notes/attachments/Pasted-image-20250923180317.png)
The divided differences are defined recursively and represent the leading coefficient of the polynomial interpolating a subset of the points.
- 0-th order: $[y_i] = y_i$
- k-th order: $[y_i, y_{i+1}, \dots, y_{i+k}] = \dfrac{[y_{i+1}, \dots, y_{i+k}] - [y_i, \dots, y_{i+k-1}]}{x_{i+k} - x_i}$
The coefficients $c_k$ in the Newton form are the divided differences along the top diagonal of the computation table: $c_k = [y_0, y_1, \dots, y_k]$.
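Putting the pieces together, here is a sketch (with invented data) that computes the divided-difference table column by column, reads off the coefficients $c_k$ from its top row, and evaluates the Newton form with Horner-like nesting:

```python
import numpy as np

def divided_differences(x, y):
    """Return the Newton coefficients c_0, ..., c_n."""
    n = len(x)
    table = np.array(y, dtype=float)  # current column of the table
    coeffs = [table[0]]
    for k in range(1, n):
        # After this step, table[i] holds [y_i, ..., y_{i+k}]
        table = (table[1:] - table[:-1]) / (x[k:] - x[:-k])
        coeffs.append(table[0])
    return np.array(coeffs)

def newton_eval(coeffs, x_nodes, t):
    """Evaluate the Newton form at t via Horner-like nesting."""
    result = coeffs[-1]
    for k in range(len(coeffs) - 2, -1, -1):
        result = result * (t - x_nodes[k]) + coeffs[k]
    return result

x = np.array([0.0, 1.0, 2.0, 3.0])
y = np.array([1.0, 2.0, 0.0, 5.0])
c = divided_differences(x, y)
print([newton_eval(c, x, xi) for xi in x])  # reproduces y
```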
Good video to grasp the concept…
Why the Newton Basis is Superior
This approach elegantly solves the problems of the monomial basis.
- Efficient and Stable Coefficient Calculation: If we write out the interpolation conditions using the Newton basis, the resulting system matrix is lower triangular, because $N_k(x_i) = 0$ whenever $i < k$. A triangular system is trivial to solve: we can find the coefficients using forward substitution in only $O(n^2)$ operations, which is much faster than $O(n^3)$ and, more importantly, is numerically stable.
- Adaptive: If a new data point $(x_{n+1}, y_{n+1})$ arrives, we don’t have to start from scratch. We simply keep our existing polynomial and add one new term, $c_{n+1} N_{n+1}(x)$. We only need to compute one new row in the divided difference table (a short sketch follows this list). This is impossible with the monomial basis, where every single coefficient would change.
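To make the adaptivity concrete, the following snippet reuses `x`, `y`, `c`, and `divided_differences` from the sketch above. For brevity it recomputes the whole table, but it verifies the key point: appending a point leaves all previously computed coefficients untouched, which is what makes a cheap incremental update possible.

```python
# Append a hypothetical new measurement (x, y) = (4.0, 3.0)
x_new = np.append(x, 4.0)
y_new = np.append(y, 3.0)

c_new = divided_differences(x_new, y_new)
print(np.allclose(c_new[:len(c)], c))  # True: old coefficients unchanged
print(c_new[-1])                       # the single genuinely new coefficient
```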
Summary: Monomial vs. Newton Basis
| Feature | Monomial Basis ($1, x, \dots, x^n$) | Newton Basis ($N_0, \dots, N_n$) |
| --- | --- | --- |
| System matrix | Vandermonde (dense, ill-conditioned) | Lower triangular (well-conditioned) |
| Cost to find coefficients | $O(n^3)$ | $O(n^2)$ |
| Adding a new point | Requires complete re-computation | Easy and efficient update |
| Practical use | Avoid! For theoretical use only. | Recommended. Stable and efficient. |
The Newton form of the interpolating polynomial provides a practical, stable, and efficient method for polynomial interpolation, addressing the severe shortcomings of the naive monomial approach.
Continue here: 03 Polynomial Interpolation using Lagrange Form, Barycentric Lagrange Form, Runge’s Phenomenon, Chebyshev Nodes