12 Undecidability and Reductions

Lecture from: 24.10.2025 | Video: Videos ETHZ

We pick up where we left off: Nondeterministic Turing Machines (NTMs). It is crucial to dispel a common myth immediately: Nondeterminism is not randomness.

In a randomized algorithm, you flip a coin, and we analyze the probability of getting the right answer. In Nondeterminism, the machine has a “magical” ability to guess the correct path among many options. If there exists any sequence of choices that leads to the accept state, the NTM finds it.

The Computation Tree

For a Deterministic TM (DTM), the computation is a straight line: Configuration $C_{0} \to C_{1} \to C_{2} \dots$ . For an NTM, because the transition function $δ$ maps to a set of possible moves, the computation forms a tree.

Root: Start configuration.
Branches: Different choices made at each step.
Acceptance: An input $w$ is accepted if at least one node in the entire tree is an accepting configuration.
Rejection: An input is rejected only if every branch ends in a rejecting state.

Example: The Traveling Salesperson Problem (TSP)

To see the power of this model, consider TSP: Given a graph, is there a tour that visits every node exactly once?

Deterministic approach: We must iterate through all $n!$ permutations of nodes and check them. This is incredibly slow ( $O (n!)$ ).
Nondeterministic approach:
1. Guess: The NTM nondeterministically writes a permutation of numbers $1 \dots n$ on the tape. Because of the branching nature, effectively every permutation is generated on some branch of the computation tree.
2. Verify: The machine deterministically checks if the sequence on the tape is a valid tour in the graph. This verification takes polynomial time.

If a Hamiltonian cycle exists, one branch will guess it correctly and accept. Thus, the NTM solves TSP efficiently. This leads to the complexity class NP (Nondeterministic Polynomial time), which we will discuss later.

Equivalence of NTM and DTM

Since NTMs seem so powerful (solving TSP instantly!), do they allow us to solve problems that are fundamentally impossible for DTMs? The answer is no.

Theorem: For every Nondeterministic Turing Machine $N$ , there exists an equivalent Deterministic Turing Machine $D$ . $L (N) = L (D)$

The Simulation Problem

How do we simulate the NTM $N$ using a DTM $D$ ? If we simply use Depth-First Search (DFS) on the computation tree of $N$ , we might fail. Why? Because a specific branch of $N$ might run forever (infinite loop). If $D$ follows that path, it will never return to check the other branches, potentially missing an accepting state elsewhere.

The Solution: Breadth-First Search (BFS)

We must simulate the tree layer by layer.

Check all computations of 1 step.
Check all computations of 2 steps.
…and so on.

Construction of $D$ : We use a 3-tape DTM (which we know is equivalent to a 1-tape DTM):

Input Tape: Stores the original input $w$ .
Simulation Tape: Used to simulate one specific branch of $N$ .
Address Tape: Stores a sequence of numbers (e.g., 1-3-2) telling $D$ which choices to make in the tree.

The Algorithm:

Generate the next “address” string in canonical order (lexicographically: $ϵ, 0, 1, 00, 01, \dots$ ).
Clear the simulation tape and copy $w$ onto it.
Run $N$ on the simulation tape. Whenever $N$ needs to make a nondeterministic choice, look at the next digit on the Address Tape to decide which move to make.
If the simulation reaches $q_{a cc}$ , Accept.
If the simulation rejects or runs out of address digits (meaning this branch is valid but hasn’t accepted yet), abort this branch and loop back to Step 1.

If $N$ accepts $w$ , the accepting configuration exists at some finite depth. Our BFS enumeration will eventually generate the address string that leads to that node, and $D$ will accept. Thus, nondeterminism does not increase computability, only efficiency.

Countability: The Size of Infinity

To prove that there are problems computers cannot solve, we must compare the number of possible programs vs. the number of possible problems.

Countable Sets

A set is countable if we can list its elements: $a_{1}, a_{2}, a_{3}, \dots$ .

$N$ (Natural numbers) is countable.
$Σ^{*}$ (All binary strings) is countable (Canonical order: $λ, 0, 1, 00, 01, \dots$ ).
$N \times N$ (Pairs of numbers) is countable. (Cantor’s “zigzag” or anti-diagonal argument proves we can map 2D coordinates to a 1D list).

The Set of Turing Machines is Countable

Every Turing Machine can be encoded as a finite binary string.

We assume a standardized encoding (e.g., encoding states, transitions, and alphabets into binary).
Therefore, the set of all Turing Machines is a subset of ${0, 1}^{*}$ .
Since ${0, 1}^{*}$ is countable, the set of all Turing Machines is countable. We can enumerate them: $M_{1}, M_{2}, M_{3}, \dots$ .

The Set of Problems is Uncountable

A “problem” is a language (a set of strings). The set of all possible languages over ${0, 1}$ is the power set $P ({0, 1}^{*})$ .

Using Cantor’s Diagonalization argument (similar to proving the real numbers are uncountable), we can show that the set of all languages is uncountable.

Conclusion: There are countably many algorithms ( $ℵ_{0}$ ), but uncountably many problems ( $2^{ℵ_{0}}$ ). $∣ Algorithms ∣ ≪ ∣ Problems ∣$ Therefore, there are infinitely many problems that cannot be solved by any algorithm.

The Diagonal Language ( $L_{d ia g}$ )

We know unsolvable problems exist. Now, we will construct one explicitly using Diagonalization.

Let’s define an infinite boolean matrix $A$ where:

Rows ( $i$ ) represent the enumeration of all Turing Machines: $M_{1}, M_{2}, M_{3} \dots$
Columns ( $j$ ) represent the enumeration of all input strings: $w_{1}, w_{2}, w_{3} \dots$

The entry $A_{ij} = 1$ if machine $M_{i}$ accepts word $w_{j}$ . It is $0$ otherwise (rejects or loops).

We define the Diagonal Language $L_{d ia g}$ to be the set of strings that “flip” the behavior on the diagonal:

L_{d ia g} = {w_{i} \in {0, 1}^{*} ∣ M_{i} does NOT accept w_{i}}

If $M_{i}$ accepts $w_{i}$ , then $w_{i} \in / L_{d ia g}$ . If $M_{i}$ does not accept $w_{i}$ , then $w_{i} \in L_{d ia g}$ .

Theorem

$L_{d ia g}$ is not Recursively Enumerable (RE). Meaning: No Turing Machine exists that recognizes this language.

Proof by Contradiction

Assume $L_{d ia g}$ is RE. Then there exists some TM $M$ such that $L (M) = L_{d ia g}$ .
Since we enumerated all TMs, this machine $M$ must appear in our list at some index $k$ . So, $M = M_{k}$ .
Consider the input string $w_{k}$ (the $k$ -th word). Is $w_{k} \in L_{d ia g}$ ?
Case A: Assume $w_{k} \in L_{d ia g}$ .
- By definition of $L_{d ia g}$ , this means $M_{k}$ does not accept $w_{k}$ .
- But since $L (M_{k}) = L_{d ia g}$ , if $w_{k}$ is in the language, $M_{k}$ must accept it.
- Contradiction.
Case B: Assume $w_{k} \in / L_{d ia g}$ .
- By definition of $L_{d ia g}$ , this means $M_{k}$ accepts $w_{k}$ .
- But since $L (M_{k}) = L_{d ia g}$ , if $M_{k}$ accepts it, it must be in the language.
- Contradiction.

Both cases lead to a contradiction. Therefore, the machine $M_{k}$ cannot exist. $L_{d ia g}$ is undecidable and not even RE.

The Method of Reduction

We have successfully constructed one specific language, $L_{d ia g}$ , that is provably not recursively enumerable. This is a massive breakthrough, we’ve found a crack in the foundation of computability. But constructing diagonalization arguments from scratch for every new problem is tedious and difficult.

To explore the landscape of undecidable problems further, we need a scalable tool. That tool is Reduction.

The Core Intuition

Reduction is a method of converting one problem into another. It’s a concept you use constantly in programming: “I don’t want to write a sorting algorithm from scratch; I’ll just transform my data into a list and call the standard library’s .sort() function.”

In theoretical computer science, we use this logic in reverse to prove hardness:

“If I had a machine that could solve Problem B, I could easily use it to solve Problem A. But I already know Problem A is impossible to solve. Therefore, the machine for Problem B cannot exist.”

This implies that Problem B is at least as hard as Problem A.

Recursive Reduction ( $\leq_{R}$ )

The most general form of this idea is Recursive Reduction.

Imagine we have two languages, $L_{1}$ and $L_{2}$ . We want to know if we can solve $L_{1}$ . We don’t know how, but let’s pretend we have a magic “Black Box” (or Oracle) for $L_{2}$ . We don’t know how the Black Box works, but we know that if we feed it a string $y$ , it instantly and correctly tells us whether $y \in L_{2}$ .

Definition: We say $L_{1}$ is recursively reducible to $L_{2}$ (written $L_{1} \leq_{R} L_{2}$ ) if we can construct a Turing Machine $M$ that decides $L_{1}$ , provided $M$ is allowed to query the Oracle for $L_{2}$ .

The Logic Chain:

We want to prove $L_{n e w}$ is undecidable.
We take a known undecidable problem, like $L_{d ia g}$ .
We show $L_{d ia g} \leq_{R} L_{n e w}$ . (i.e., “If I could solve $L_{n e w}$ , I could solve $L_{d ia g}$ ”).
Since solving $L_{d ia g}$ is impossible, solving $L_{n e w}$ must also be impossible.

Many-One Reduction ( $\leq_{m}$ )

While recursive reduction is powerful, we often use a stricter, more structured version called Many-One Reduction (or Input-to-Input Reduction). This is the standard tool for proving undecidability.

Instead of a general algorithm that can call the Oracle multiple times, we demand a simple translation. We want a function that translates instances of Problem A directly into instances of Problem B.

Definition: Language $L_{1}$ is Many-One reducible to $L_{2}$ (denoted $L_{1} \leq_{m} L_{2}$ ) if there exists a computable function $f : Σ^{*} \to Σ^{*}$ such that for every input string $x$ :

x \in L_{1} ⟺ f (x) \in L_{2}

What this means

computable function: There is a Turing Machine that, given input $x$ , halts and leaves just $f (x)$ on its tape. The translation process itself must be doable!
$⟺$ condition: The translation must preserve the answer.
- If $x$ is a “Yes” instance of $L_{1}$ , then $f (x)$ must be a “Yes” instance of $L_{2}$ .
- If $x$ is a “No” instance of $L_{1}$ , then $f (x)$ must be a “No” instance of $L_{2}$ .

The Reduction Machine

If $L_{1} \leq_{m} L_{2}$ , and we had a hypothetical decider $M_{2}$ for $L_{2}$ , we could build a decider $M_{1}$ for $L_{1}$ like this:

Input: $x$ .
Run the translator: Compute $y = f (x)$ .
Run the oracle: Run $M_{2}$ on input $y$ .
Output: Return whatever $M_{2}$ returns.

Because $x \in L_{1} ⟺ y \in L_{2}$ , this machine $M_{1}$ correctly decides $L_{1}$ .

The Contradiction Strategy

To prove $L_{n e w}$ is undecidable:

Start with $L_{d ia g}$ (which we know is undecidable).
Construct a computable function $f$ that transforms any string $w$ into a string $f (w)$ .
Prove that $w \in L_{d ia g} ⟺ f (w) \in L_{n e w}$ .
This establishes $L_{d ia g} \leq_{m} L_{n e w}$ .
If $L_{n e w}$ were decidable, the machine described above would solve $L_{d ia g}$ .
This is impossible. Therefore, $L_{n e w}$ is undecidable.

Lemma 5.3 (Connection)

If $L_{1} \leq_{m} L_{2}$ , then $L_{1} \leq_{R} L_{2}$ .

Many-One reduction is a special case of recursive reduction where we call the oracle exactly once at the very end and return its answer directly. For proving basic undecidability, this “simple” translation is almost always what we use.

This framework is the sledgehammer of theoretical computer science. Once we have $L_{d ia g}$ , we never have to use diagonalization again. We just keep reducing: $L_{d ia g} \leq_{m} H \leq_{m} L_{u ni v ers a l} \leq_{m} \dots$ Every problem in this chain inherits the “unsolvability” of the ones before it.

CS Notes

Explorer

12 Undecidability and Reductions

The Computation Tree

Example: The Traveling Salesperson Problem (TSP)

Equivalence of NTM and DTM

The Simulation Problem

The Solution: Breadth-First Search (BFS)

Countability: The Size of Infinity

Countable Sets

The Set of Turing Machines is Countable

The Set of Problems is Uncountable

The Diagonal Language ( $L_{d ia g}$ )

Theorem

Proof by Contradiction

The Method of Reduction

The Core Intuition

Recursive Reduction ( $\leq_{R}$ )

Many-One Reduction ( $\leq_{m}$ )

What this means

The Reduction Machine

The Contradiction Strategy

Lemma 5.3 (Connection)

Table of Contents

Graph View

CS Notes

Explorer

12 Undecidability and Reductions

The Computation Tree

Example: The Traveling Salesperson Problem (TSP)

Equivalence of NTM and DTM

The Simulation Problem

The Solution: Breadth-First Search (BFS)

Countability: The Size of Infinity

Countable Sets

The Set of Turing Machines is Countable

The Set of Problems is Uncountable

The Diagonal Language (Ldiag​)

Theorem

Proof by Contradiction

The Method of Reduction

The Core Intuition

Recursive Reduction (≤R​)

Many-One Reduction (≤m​)

What this means

The Reduction Machine

The Contradiction Strategy

Lemma 5.3 (Connection)

Table of Contents

Graph View

The Diagonal Language ( $L_{d ia g}$ )

Recursive Reduction ( $\leq_{R}$ )

Many-One Reduction ( $\leq_{m}$ )