The concept of chance has been a subject of philosophical inquiry and scientific investigation for centuries. While classical physics often favored a deterministic view, the 20th century, particularly with quantum mechanics and evolutionary biology, embraced randomness as an inherent aspect of nature. In computer science, too, randomness has emerged not as a sign of incomplete knowledge, but as a powerful tool for designing algorithms that are often simpler, more efficient, and more robust than their deterministic counterparts. Unlike deterministic algorithms that follow a fixed path, randomized algorithms can “explore” possibilities, often finding solutions much faster or handling adversarial inputs more effectively.

In previous chapters, we’ve seen the limitations of deterministic algorithms, especially when facing NP-hard problems or adversarial inputs. Randomized algorithms introduce an element of chance into their logic, allowing them to make choices based on random bits. This unpredictability can help algorithms escape worst-case scenarios, achieve significant speedups, or solve problems that are otherwise intractable.

This chapter explores the fundamental ideas behind randomized algorithms. We will start with a brief review of elementary probability theory, then demonstrate the power of randomization through practical examples: an efficient communication protocol for comparing large databases, a randomized primality test, and a method for checking polynomial equivalence. These examples will illustrate key paradigms like the method of frequent witnesses and the method of fingerprints, showcasing how randomness can lead to surprisingly effective solutions.

8.1 Aims

By the end of this chapter, you will be able to:

  • Understand Elementary Probability: Grasp basic concepts of probability spaces, distributions, and independent events.
  • Model Randomized Algorithms: Understand how randomized algorithms are formally defined and analyzed.
  • Appreciate Efficiency Gains: See how randomization can lead to algorithms that are exponentially more efficient than their deterministic counterparts for certain tasks.
  • Apply the Method of Frequent Witnesses: Learn this paradigm for designing randomized algorithms, exemplified by the Solovay-Strassen primality test.
  • Apply the Method of Fingerprints: Understand this technique for solving equivalence problems, demonstrated through polynomial equivalence testing.
  • Analyze Error Probability: Learn how to analyze and reduce the error probability of randomized algorithms through repetition.

8.2 Elementary Probability Theory

To understand randomized algorithms, a basic grasp of probability is essential.

Definition 8.1 (Probability Space)

A probability space is a pair $(S, \mathrm{Prob})$, where:

  1. $S$ is a finite set of elementary events (all possible outcomes of an experiment). (Think of these as the most basic, indivisible results, like “rolling a 3” on a die.)
  2. $\mathrm{Prob}$ is a probability distribution (or measure) that assigns a probability to each event $A$ (subset of $S$), satisfying:
    • $\mathrm{Prob}(\{e\}) \geq 0$ for every elementary event $e \in S$.
    • $\mathrm{Prob}(S) = 1$. (This means that one of the possible outcomes must occur.)
    • $\mathrm{Prob}(A \cup B) = \mathrm{Prob}(A) + \mathrm{Prob}(B)$ for any two disjoint events $A, B \subseteq S$.

The probability of an event $A \subseteq S$ is the sum of the probabilities of its elementary events: $\mathrm{Prob}(A) = \sum_{e \in A} \mathrm{Prob}(\{e\})$. A uniform probability distribution assigns equal probability to all elementary events: $\mathrm{Prob}(\{e\}) = 1/|S|$ for all $e \in S$.
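To make the definition concrete, here is a minimal sketch in Python (the names `S`, `dist`, and `prob` are chosen here for illustration) that models the uniform probability space of a fair six-sided die and computes the probability of the event “the outcome is even”.

```python
from fractions import Fraction

# Elementary events of a fair six-sided die.
S = [1, 2, 3, 4, 5, 6]

# Uniform distribution: every elementary event gets probability 1/|S|.
dist = {e: Fraction(1, len(S)) for e in S}

def prob(event):
    """Probability of an event (a subset of S): sum of its elementary probabilities."""
    return sum(dist[e] for e in event)

# The event "the outcome is even" = {2, 4, 6}.
even = {2, 4, 6}
print(prob(even))        # 1/2
print(prob(set(S)))      # 1  (Prob(S) = 1)
print(prob(set()))       # 0  (probability of the empty event)
```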

Exercise 8.1 (Properties of Probability)

Prove the following properties for any probability space $(S, \mathrm{Prob})$:

  1. $\mathrm{Prob}(\emptyset) = 0$.
  2. $\mathrm{Prob}(S \setminus A) = 1 - \mathrm{Prob}(A)$ for any $A \subseteq S$.
  3. If $A \subseteq B$, then $\mathrm{Prob}(A) \leq \mathrm{Prob}(B)$.
  4. $\mathrm{Prob}(A \cup B) = \mathrm{Prob}(A) + \mathrm{Prob}(B) - \mathrm{Prob}(A \cap B) \leq \mathrm{Prob}(A) + \mathrm{Prob}(B)$ for any $A, B \subseteq S$.

Modeling Randomized Algorithms

A randomized algorithm can be viewed in two ways: (These perspectives are equivalent and offer different insights into how randomness is incorporated.)

  1. As a nondeterministic algorithm where each nondeterministic choice is made with a certain probability. The probability of a computation path is the product of the probabilities of its individual random decisions.
  2. As a probability distribution over a set of deterministic algorithms. This is often implemented by providing a deterministic algorithm with a sequence of random bits as an additional input. Each sequence of random bits effectively selects a specific deterministic execution path.

The error probability of a randomized algorithm is typically defined as the maximum probability of producing an incorrect output over all possible inputs of a given size. (This is a crucial metric, as it quantifies the reliability of the algorithm’s probabilistic choices.)
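As an illustration of the second view, the following sketch (function and variable names are assumptions made here, not taken from the text) writes a tiny randomized procedure as a deterministic function that receives its random bits as an explicit extra input, so that each bit string selects one deterministic execution path and the output distribution is obtained by ranging over all bit strings.

```python
import itertools
import random

def deterministic_run(x, random_bits):
    """The randomized algorithm, written as a deterministic algorithm that
    takes its random bits as an explicit additional input.  Each bit string
    selects one deterministic execution path."""
    index = int("".join(map(str, random_bits)), 2) % len(x)
    return x[index]

x = ["a", "b", "c", "d"]

# View 2: the probability distribution over deterministic runs is obtained
# by ranging over all bit strings (here: all 3-bit strings, uniformly).
counts = {}
for bits in itertools.product([0, 1], repeat=3):
    result = deterministic_run(x, bits)
    counts[result] = counts.get(result, 0) + 1
print({r: c / 8 for r, c in counts.items()})  # probability of each output

# In practice, one simply draws fresh random bits for every run.
print(deterministic_run(x, [random.randint(0, 1) for _ in range(3)]))
```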


8.3 A Randomized Communication Protocol

Consider the task of comparing the contents of two large databases, $x$ and $y$, stored on two different computers, RI and RII. Both databases contain $n$ bits. A deterministic protocol would require exchanging at least $n$ bits in the worst case to guarantee correctness. For $n = 10^{16}$ bits, this is infeasible.

Here’s a randomized protocol, Protocol R, that achieves an exponential saving: RI randomly chooses a prime $p$ from the set of primes smaller than $n^2$, computes $s = \mathrm{Number}(x) \bmod p$ (where $\mathrm{Number}(x)$ denotes the number whose binary representation is $x$), and sends $s$ and $p$ to RII. RII computes $\mathrm{Number}(y) \bmod p$ and outputs “equal” if it matches $s$, and “unequal” otherwise.
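A minimal sketch of this protocol in Python follows (the helper names are chosen here; the prime is drawn by enumerating all primes below $n^2$, which is fine for small demo values of $n$ but would be replaced by sampling an $O(\log n)$-bit prime for realistic database sizes):

```python
import random

def primes_below(m):
    """All primes smaller than m (simple sieve; adequate for small demo values)."""
    sieve = [True] * m
    sieve[0:2] = [False, False]
    for i in range(2, int(m ** 0.5) + 1):
        if sieve[i]:
            sieve[i * i:m:i] = [False] * len(range(i * i, m, i))
    return [i for i, is_p in enumerate(sieve) if is_p]

def protocol_r(x: str, y: str) -> bool:
    """One round of Protocol R on two n-bit strings x and y.
    Returns True ('equal') or False ('unequal')."""
    n = len(x)
    # RI: pick a random prime p < n^2 and send the fingerprint s = Number(x) mod p.
    p = random.choice(primes_below(max(n * n, 3)))
    s = int(x, 2) % p
    # RII: compare with Number(y) mod p.
    return s == int(y, 2) % p

# Demo: equal strings are always accepted; unequal ones are rejected,
# except with small probability (when p happens to divide the difference).
x = "1011010011110001"
y = "1011010011110011"
print(protocol_r(x, x))  # True
print(protocol_r(x, y))  # almost always False
```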

Analysis of Protocol R

  • Communication Complexity: RI sends $s$ and $p$. Since $s < p < n^2$, each of the two numbers can be written with at most $2\lceil \log_2 n \rceil$ bits, so the length of the message is at most $4\lceil \log_2 n \rceil$ bits. For $n = 10^{16}$, this is only 216 bits, a tiny message. (This logarithmic communication is a massive improvement over the $n$ bits required by deterministic protocols, especially for very large $n$.)
  • Error Probability:
    • If $x = y$: Then $\mathrm{Number}(x) = \mathrm{Number}(y)$, so $\mathrm{Number}(x) \bmod p = \mathrm{Number}(y) \bmod p$ for any prime $p$. RII always outputs “equal”. Error probability is 0.
    • If $x \neq y$: RII outputs “equal” (a wrong answer) only if $\mathrm{Number}(x) \bmod p = \mathrm{Number}(y) \bmod p$, which means $\mathrm{Number}(x) - \mathrm{Number}(y) \equiv 0 \pmod{p}$. This implies $p$ divides the difference. (This is the only way for a mismatch to go undetected: the difference between the numbers happens to be a multiple of the chosen prime $p$.) Let $w = |\mathrm{Number}(x) - \mathrm{Number}(y)|$. Since $\mathrm{Number}(x), \mathrm{Number}(y) < 2^n$, we have $w < 2^n$. Also, $w \neq 0$. A number smaller than $2^n$ has fewer than $n$ distinct prime factors. So $w$ has at most $n - 1$ prime factors. By the Prime Number Theorem, there are roughly $n^2 / \ln(n^2)$ primes smaller than $n^2$, so the probability that a randomly chosen prime divides $w$ is at most $(n-1)/(n^2/\ln n^2) \leq (2 \ln n)/n$. For $n = 10^{16}$, this error is extremely small, around $0.74 \cdot 10^{-14}$.

Error Reduction by Repetition

The error probability can be further reduced by repeating the protocol multiple times with independent random choices. If we repeat the protocol $t$ times with $t$ independently chosen primes, and all choices of $p$ lead to $\mathrm{Number}(x) \bmod p = \mathrm{Number}(y) \bmod p$, the error probability drops to $((2 \ln n)/n)^t$. For $t = 10$, the error for $n = 10^{16}$ becomes less than $10^{-140}$.
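These bounds are easy to check numerically. The short computation below (plain Python, using the $(2 \ln n)/n$ bound derived above) evaluates the single-round bound for $n = 10^{16}$ and the bound after 10 independent repetitions.

```python
import math

n = 10 ** 16
single_round = 2 * math.log(n) / n   # upper bound on the one-round error probability
print(single_round)                  # ~7.4e-15

t = 10
print(single_round ** t)             # ~5e-142 after 10 independent repetitions
```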

This demonstrates a key advantage of randomized algorithms: they can achieve extremely low error probabilities with minimal communication or computation, often exponentially better than deterministic approaches.


8.4 The Method of Frequent Witnesses and the Randomized Primality Test

The efficiency of Protocol R stems from the method of frequent witnesses.

  • A witness for a property is additional information that allows efficient deterministic verification of that property. (Think of it as a piece of evidence that quickly confirms a claim.)
  • For a randomized algorithm, we need a set of witness candidates such that, for any input, a sufficiently large fraction of these candidates are actual witnesses. (This “frequent witnesses” property is what makes random sampling effective: you’re likely to pick a witness by chance.)

The Primality Test

Determining the primality of a large number $n$ is a fundamental problem in cryptography. The naive deterministic algorithm (trial division by all numbers up to $\sqrt{n}$) is too slow.

Fermat’s Little Theorem provides a basis for a randomized test:

Theorem 8.1 (Fermat's Little Theorem)

For every prime number $p$ and every integer $a$ such that $\gcd(a, p) = 1$, it holds that $a^{p-1} \bmod p = 1$.

If $n$ is composite, it may still happen that $a^{n-1} \bmod n = 1$ for some $a$. Worse, there are composite numbers $n$, called Carmichael numbers, for which $a^{n-1} \bmod n = 1$ holds for every $a$ with $\gcd(a, n) = 1$. (These numbers are “Fermat pseudoprimes” to every base and can fool the basic Fermat test, making it unreliable for proving primality.) A stronger test is needed.
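To see why the plain Fermat test is not enough, the sketch below (helper names are chosen here) applies it to $561 = 3 \cdot 11 \cdot 17$, the smallest Carmichael number: every base $a$ coprime to 561 satisfies the Fermat condition even though 561 is composite.

```python
import math

def fermat_condition(n: int, a: int) -> bool:
    """Check a^(n-1) ≡ 1 (mod n) for a base a with gcd(a, n) = 1."""
    return pow(a, n - 1, n) == 1

n = 561  # = 3 * 11 * 17: composite, but a Carmichael number
coprime_bases = [a for a in range(2, n) if math.gcd(a, n) == 1]
print(all(fermat_condition(n, a) for a in coprime_bases))  # True: the Fermat test is fooled
```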

For an odd number $n$ with $(n-1)/2$ odd (i.e., $n \bmod 4 = 3$):

Theorem 8.2

  1. If $n$ is prime, then $a^{(n-1)/2} \bmod n \in \{1, n-1\}$ for all $a \in \{1, \dots, n-1\}$.
  2. If $n$ is composite, then $a^{(n-1)/2} \bmod n \notin \{1, n-1\}$ for at least half of the numbers $a \in \{1, \dots, n-1\}$.

This theorem defines a “witness” for compositeness: a number $a \in \{1, \dots, n-1\}$ such that $a^{(n-1)/2} \bmod n \notin \{1, n-1\}$.
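A single round of this witness test can be sketched as follows (function names are chosen here; the sketch follows the simplified test for odd $n$ with $n \bmod 4 = 3$ described above, where $a$ certifies compositeness exactly when $a^{(n-1)/2} \bmod n \notin \{1, n-1\}$):

```python
import random

def is_witness(a: int, n: int) -> bool:
    """True iff a certifies that n is composite (simplified Solovay-Strassen witness)."""
    return pow(a, (n - 1) // 2, n) not in (1, n - 1)

def one_round(n: int) -> str:
    """One round of the test for an odd n with n % 4 == 3."""
    a = random.randint(1, n - 1)
    return "composite" if is_witness(a, n) else "prime number"

print(one_round(2003))  # 2003 is prime (2003 % 4 == 3): always "prime number"
print(one_round(91))    # 91 = 7 * 13 is composite: "composite" with probability >= 1/2
```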

Error Analysis of Solovay-Strassen

  • If $n$ is prime: The algorithm always outputs “prime number”. Error probability is 0.
  • If $n$ is composite: The probability that a randomly chosen $a$ is not a witness (i.e., $a^{(n-1)/2} \bmod n \in \{1, n-1\}$) is at most $1/2$. So the error probability is at most $1/2$.

By repeating the test $t$ times with independent choices of $a$, the error probability for composite $n$ drops to $2^{-t}$. For $t = 100$, the error is less than $10^{-30}$. This makes the test highly reliable for practical purposes.
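Error amplification by repetition is then a thin wrapper around the single round; a sketch under the same assumptions as above (assumed names, repeating the witness check $t$ times):

```python
import random

def solovay_strassen(n: int, t: int = 100) -> str:
    """Repeat the witness test t times; a composite n survives all rounds
    (and is wrongly reported as prime) with probability at most 2**(-t)."""
    for _ in range(t):
        a = random.randint(1, n - 1)
        if pow(a, (n - 1) // 2, n) not in (1, n - 1):
            return "composite"
    return "prime number"

print(solovay_strassen(2003))  # prime
print(solovay_strassen(91))    # composite (wrong answer with probability <= 2**-100)
```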


8.5 The Method of Fingerprints and the Equivalence of Two Polynomials

The method of fingerprints is a special application of the frequent witnesses paradigm, particularly useful for solving equivalence problems between large objects. (Think of a fingerprint as a much smaller, unique-enough identifier for a large object, allowing for quick comparisons without needing to compare the entire objects.)

The key idea is that the fingerprints $h(O_1)$ and $h(O_2)$ are significantly shorter representations of the objects $O_1$ and $O_2$, making their comparison much faster. The risk of error arises if $O_1$ and $O_2$ are not equivalent but $h(O_1) = h(O_2)$ (a “collision”). The set $H$ of candidate fingerprinting functions must be chosen such that, for any pair of non-equivalent objects, collisions are rare for a randomly chosen $h \in H$. (This rarity is crucial; if collisions were common, the fingerprint wouldn’t be a reliable indicator of equivalence.)
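The general schema can be written down abstractly. The sketch below (interfaces and names are assumptions made here) picks a fingerprinting function $h$ at random from a candidate set $H$ and compares the two fingerprints; the instantiation shown reuses the Protocol R idea of comparing large integers by their residues modulo a random prime.

```python
import random
from typing import Callable, Sequence, TypeVar

T = TypeVar("T")

def fingerprint_equivalence(o1: T, o2: T,
                            H: Sequence[Callable[[T], object]]) -> bool:
    """Generic fingerprinting schema: choose h from H at random and compare
    the (short) fingerprints h(o1) and h(o2) instead of the objects themselves."""
    h = random.choice(H)
    return h(o1) == h(o2)

# Example instantiation: compare large integers by their residues modulo a
# random prime (the same idea as Protocol R).
H = [lambda z, p=p: z % p for p in (101, 103, 107, 109, 113)]
print(fingerprint_equivalence(10**50 + 7, 10**50 + 7, H))  # True (always)
print(fingerprint_equivalence(10**50 + 7, 10**50 + 9, H))  # False (a collision would need
                                                           # the difference to be divisible
                                                           # by the chosen prime)
```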

Equivalence of Two Polynomials

Consider two polynomials $p_1(x_1, \dots, x_n)$ and $p_2(x_1, \dots, x_n)$ over a finite field $\mathbb{Z}_p$, for a prime $p$. We want to check whether $p_1 \equiv p_2$, meaning they evaluate to the same value for all inputs $(a_1, \dots, a_n) \in (\mathbb{Z}_p)^n$. Comparing coefficients directly requires converting both polynomials to normal form, which can be exponentially slow.
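Algorithm AQP itself is not listed in this excerpt; read as “evaluate both polynomials at a random point of $(\mathbb{Z}_p)^n$ and compare the two values”, one round can be sketched as follows (names are chosen here; polynomials are represented as Python callables).

```python
import random
from typing import Callable

def aqp(p1: Callable[..., int], p2: Callable[..., int],
        num_vars: int, p: int) -> bool:
    """One round of AQP: evaluate both polynomials at a random point of (Z_p)^n
    and compare the two values (the 'fingerprints')."""
    point = [random.randrange(p) for _ in range(num_vars)]
    return p1(*point) % p == p2(*point) % p

# Example over Z_7: (x + y)^2 versus x^2 + 2xy + y^2 (equivalent),
# and versus x^2 + y^2 (not equivalent).
p = 7
f = lambda x, y: (x + y) ** 2
g = lambda x, y: x * x + 2 * x * y + y * y
h = lambda x, y: x * x + y * y

print(aqp(f, g, 2, p))  # always True: the polynomials are equivalent
print(aqp(f, h, 2, p))  # usually False (an error occurs only if the random
                        # point happens to be a root of the difference 2xy)
```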

Error Analysis of AQP

  • If $p_1 \equiv p_2$: The algorithm always outputs “equivalent”. Error probability is 0.
  • If $p_1 \not\equiv p_2$: Let $q = p_1 - p_2$. Since $p_1 \not\equiv p_2$, $q$ is not identically zero. The algorithm makes an error if $q(a_1, \dots, a_n) = 0$ for the randomly chosen point $(a_1, \dots, a_n) \in (\mathbb{Z}_p)^n$. A key result (Theorem 8.4) states that a non-zero polynomial $q$ in $n$ variables of degree at most $d$ has at most $n \cdot d \cdot p^{n-1}$ roots in $(\mathbb{Z}_p)^n$. Thus, the probability of choosing a root (and making an error) is at most $n \cdot d \cdot p^{n-1} / p^n = n \cdot d / p$. If we choose $p > 2nd$, this error probability is less than $1/2$. Again, repeating the test $t$ times reduces the error below $2^{-t}$. (A small worked example follows this list.)
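As a small worked example (the numbers are chosen here purely for illustration): for polynomials in $n = 3$ variables of degree at most $d = 5$, one needs a prime $p > 2nd = 30$; choosing $p = 31$ gives an error probability of at most $nd/p = 15/31 < 1/2$ per round, and ten independent rounds push the error below $2^{-10} < 10^{-3}$.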

This method provides an efficient randomized solution for a problem for which no polynomial-time deterministic algorithm is known.


Summary

  • Randomized algorithms leverage chance to achieve efficiency, simplicity, and robustness, often outperforming deterministic algorithms.
  • They are modeled either as nondeterministic algorithms with probabilistic choices or as deterministic algorithms with random bit sequences as input.
  • The Stochastic Communication Protocol R for database comparison demonstrates exponential efficiency gains over deterministic protocols, achieving extremely low error probabilities with minimal communication.
  • The Method of Frequent Witnesses is a paradigm where an algorithm relies on finding a “witness” that efficiently verifies a property. If witnesses are frequent among candidates, random sampling is effective.
  • The Solovay-Strassen Primality Test is a randomized algorithm based on frequent witnesses, efficiently determining if a number is prime with a controllable error probability.
  • The Method of Fingerprints is a specialized technique for equivalence problems, where large objects are mapped to small “fingerprints” via random functions. Comparing fingerprints is efficient, with a small, controllable risk of collision.
  • The Algorithm AQP for polynomial equivalence testing uses fingerprints to efficiently determine if two polynomials are equivalent, a problem for which deterministic polynomial-time algorithms are not known.
  • Error amplification through repetition is a common technique to reduce the error probability of randomized algorithms to arbitrarily low levels.

Randomization is a powerful tool in algorithm design, offering practical solutions to problems that are otherwise intractable.



Exercises

Exercise 8.1 (Coin Tossing)

Model the experiment of five-fold coin tossing. What is the probability that the number of heads and the number of tails differ by at most 1?

Exercise 8.2 (Database Comparison)

Two computers have each stored a word of 18 bits. They use Protocol R to check for equality.

  1. From how many prime numbers is one chosen randomly?

  2. How many bits are communicated?

  3. What is the error probability if the words are different?

Exercise 8.3 (Solovay-Strassen Error Reduction)

How far can one reduce the error probability of the Solovay-Strassen algorithm if, instead of one attempt, $t$ independent attempts are made?

Exercise 8.4 (Polynomial Equivalence)

Describe how Algorithm AQP would test whether two given polynomials are equivalent over $\mathbb{Z}_5$, and illustrate the test on an example of your choice.