06 Formalizing Finite Automata

A Formal Model of Computation

We’ve spent some time building an intuition for finite automata through state diagrams. Now, it’s time to put this on a solid mathematical foundation. Why? Because if we want to prove things about the limits of what these machines can do, we need a precise, unambiguous definition.

Historically, it’s interesting to note that the more powerful Turing Machine was formalized before the simpler finite automaton. However, starting with automata is a fantastic way to learn the vocabulary and mechanics of formal models. We can zoom in and see exactly what it means to “compute” in a very constrained world.

Definition: Deterministic Finite Automaton (DFA)

A Deterministic Finite Automaton (DFA) is a mathematical object that we can define as a 5-tuple (a quintuple). We’ll call our automaton $A$ :

$A = (Q, Σ, δ, q_{0}, F)$

Let’s break down these five components.

$Q$: The Set of States. This is a finite, non-empty set of states. In our diagrams, these are the circles. The finiteness of $Q$ is the most crucial, and limiting, feature of this model. Unlike a Turing machine with its infinite tape, a DFA can only store a finite amount of information, determined by which of its states it is currently in.
$Q = \{q_{0}, q_{1}, q_{2}, \dots, q_{n}\}$
$Σ$: The Input Alphabet. This is a finite set of symbols that the automaton can read. The input words are strings from $Σ^{*}$.
$q_{0}$: The Start State. This is one special state from $Q$ where every computation begins. In diagrams, we mark it with an incoming arrow.
$q_{0} \in Q$
$F$: The Set of Accepting States. This is a subset of the states. If the automaton finishes reading the input word and is in one of these states, the word is accepted. In diagrams, we mark them with a double circle. Note that $F$ can be empty, though such an automaton wouldn’t be very interesting.
$F \subseteq Q$
$δ$: The Transition Function. This is the “program” of the automaton. It’s a function that takes the current state and the next input symbol and returns the next state.
$δ : Q \times Σ \to Q$
For example, $δ(q_{2}, 1) = q_{3}$ means “if in state $q_{2}$ and you read a 1, transition to state $q_{3}$.” The fact that $δ$ is a function is what makes the automaton deterministic: for any given state and input symbol, there is exactly one, unambiguous next state. This implies two rules for our diagrams:
- Every state must have an outgoing arrow for every symbol in the alphabet $Σ$, because $δ$ is defined on all of $Q \times Σ$.
- No state may have two outgoing arrows with the same label, because $δ$ returns a single state for each pair.
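
As a minimal sketch, the 5-tuple can be written down directly as a data structure. The class name `DFA`, the method `check`, and the two-state example automaton are illustrative assumptions, not part of the formal definition.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DFA:
    """A DFA as the 5-tuple (Q, Sigma, delta, q0, F)."""
    states: frozenset      # Q
    alphabet: frozenset    # Sigma
    delta: dict            # maps (state, symbol) -> state
    start: str             # q0
    accepting: frozenset   # F

    def check(self) -> bool:
        """Return True if the components satisfy the definition."""
        if not self.states or self.start not in self.states:
            return False
        if not self.accepting <= self.states:
            return False
        # delta must be a total function Q x Sigma -> Q: one entry for
        # every (state, symbol) pair, and every target must lie in Q.
        # A dict stores one value per key, so uniqueness is automatic.
        expected_keys = {(q, a) for q in self.states for a in self.alphabet}
        return (set(self.delta) == expected_keys
                and all(p in self.states for p in self.delta.values()))


# Hypothetical example: q1 means "the last symbol read was 1".
example = DFA(
    states=frozenset({"q0", "q1"}),
    alphabet=frozenset({"0", "1"}),
    delta={("q0", "0"): "q0", ("q0", "1"): "q1",
           ("q1", "0"): "q0", ("q1", "1"): "q1"},
    start="q0",
    accepting=frozenset({"q1"}),
)

# Dropping one transition makes delta partial, so the check fails.
partial = DFA(example.states, example.alphabet,
              {k: v for k, v in example.delta.items() if k != ("q1", "1")},
              example.start, example.accepting)

print(example.check())  # True
print(partial.check())  # False
```

The `check` method mirrors the two diagram rules: a missing dictionary key corresponds to a missing arrow, and the dictionary itself rules out two arrows with the same label.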

How an Automaton Computes

Now that we have the static definition, let’s formalize the dynamic process of computation.

Configurations

A configuration is a snapshot of the automaton’s state at any point during a computation. It captures everything we need to know to continue:

The current state the automaton is in.
The remaining part of the input word that still needs to be read.

A configuration is a pair $(q, w)$ where $q \in Q$ and $w \in Σ^{*}$ .

Start Configuration: For an input word $w$ , the computation always begins in the configuration $(q_{0}, w)$ .
End Configuration: Any configuration where the remaining word is empty is an end configuration. It looks like $(q, λ)$ for some state $q$ .

Steps and Computations

A step is the process of moving from one configuration to the next. We use the notation $⊢$ to represent a single step.

$(q, a w) ⊢ (p, w)$

This means we transition from configuration $(q, a w)$ to $(p, w)$ . This step is valid if and only if:

$a$ is a single symbol from $Σ$ .
$w$ is the rest of the input string.
The transition function dictates this move: $δ (q, a) = p$ .

A computation is a sequence of steps starting from the start configuration. We use $⊢^{*}$ for the reflexive, transitive closure of the step relation, meaning “can reach in zero or more steps.”
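
The step relation can be sketched in a few lines of Python. The function name `computation` and the two-state example automaton are assumptions made for illustration; the loop simply applies one step $(q, a w) ⊢ (p, w)$ at a time until an end configuration is reached.

```python
def computation(delta, start, word):
    """Return the list of configurations (state, remaining input) on `word`."""
    state, remaining = start, word
    configs = [(state, remaining)]       # start configuration (q0, w)
    while remaining:                     # an empty remainder is an end configuration
        a, rest = remaining[0], remaining[1:]
        state = delta[(state, a)]        # one step, dictated by delta(q, a) = p
        remaining = rest
        configs.append((state, remaining))
    return configs


# Hypothetical two-state automaton: q1 means "the last symbol read was 1".
delta = {("q0", "0"): "q0", ("q0", "1"): "q1",
         ("q1", "0"): "q0", ("q1", "1"): "q1"}

for state, remaining in computation(delta, "q0", "101"):
    print(state, repr(remaining))
# q0 '101'
# q1 '01'
# q0 '1'
# q1 ''
```

Each printed line is one configuration, and consecutive lines are related by a single step. The first and last lines together witness $(q_{0}, 101) ⊢^{*} (q_{1}, λ)$.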

The Language of an Automaton

We can now formally define the language accepted by a DFA.

The language accepted by an automaton $A$ , denoted $L (A)$ , is the set of all input words $w$ for which the computation of $A$ on $w$ starts in the initial configuration and ends in a configuration with an accepting state.
$L(A) = \{w \in Σ^{*} \mid (q_{0}, w) ⊢^{*} (q, λ) \text{ for some } q \in F\}$

The Extended Transition Function $\hat{δ}$

It’s often cumbersome to talk about sequences of steps. We can extend the transition function $δ$ to a new function, $\hat{δ}$ , that works on entire words instead of single symbols.

$\hat{δ} : Q \times Σ^{*} \to Q$ tells us which state we land in after starting at state $q$ and reading the entire word $w$ . It’s defined inductively:

Base Case: Reading the empty word doesn’t change the state. $\hat{δ} (q, λ) = q$
Inductive Step: To read a word $w a$ (a word $w$ followed by a symbol $a$ ), we first see where $w$ takes us, and then apply the original $δ$ for the final symbol $a$ . $\hat{δ} (q, w a) = δ (\hat{δ} (q, w), a)$

Using this, we can write a much cleaner definition for the language of an automaton:

$L(A) = \{w \in Σ^{*} \mid \hat{δ}(q_{0}, w) \in F\}$
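
The inductive definition translates almost literally into a recursive function. This is a sketch under assumptions: the names `delta_hat` and `accepts` and the example automaton are hypothetical, and words are represented as Python strings with `""` standing for $λ$.

```python
def delta_hat(delta, q, word):
    """Extended transition function, following the inductive definition."""
    if word == "":                          # base case: reading lambda keeps the state
        return q
    w, a = word[:-1], word[-1]              # split the word as w followed by a
    return delta[(delta_hat(delta, q, w), a)]   # delta(delta_hat(q, w), a)


def accepts(delta, start, accepting, word):
    """A word is in L(A) iff delta_hat(q0, w) is an accepting state."""
    return delta_hat(delta, start, word) in accepting


# Hypothetical two-state automaton: q1 means "the last symbol read was 1".
delta = {("q0", "0"): "q0", ("q0", "1"): "q1",
         ("q1", "0"): "q0", ("q1", "1"): "q1"}

print(delta_hat(delta, "q0", "101"))         # q1
print(accepts(delta, "q0", {"q1"}, "101"))   # True
print(accepts(delta, "q0", {"q1"}, "10"))    # False
```

The recursion peels symbols off the right end, exactly as the definition does. A left-to-right loop computes the same state and avoids deep recursion on long words.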

The Class of Regular Languages

We can now define a fundamental class of languages.

The class of regular languages, denoted by the calligraphic $\mathcal{L}_{REG}$, is the set of all languages that are accepted by some Deterministic Finite Automaton.
$\mathcal{L}_{REG} = \{L \mid \exists \text{ a DFA } A \text{ such that } L(A) = L\}$

The name “regular” is no coincidence. It turns out that this class of languages is precisely the same class that can be described by regular expressions. One can prove that for any regular expression, you can build a DFA that accepts the same language, and for any DFA, you can derive an equivalent regular expression.

Proving an Automaton Correct: The Method of State Classes

How can we be certain that an automaton we’ve designed accepts the correct language? We need a formal proof technique. A systematic method is to define the class of each state.

The class of a state $q$ , denoted $C (q)$ , is the set of all words that drive the automaton from the start state $q_{0}$ to state $q$ .

$C(q) = \{w \in Σ^{*} \mid \hat{δ}(q_{0}, w) = q\}$

These classes have two crucial properties that stem from the deterministic nature of the automaton:

They cover all strings: Every word $w \in Σ^{*}$ must end up in exactly one state, so the union of all state classes is $Σ^{*}$. $\bigcup_{q \in Q} C(q) = Σ^{*}$
They are disjoint: A single word cannot end in two different states. $\forall p \neq q: C(p) \cap C(q) = \emptyset$

Together, these mean that the state classes form a partition of $Σ^{*}$ .

The language of the automaton is then simply the union of the classes of its accepting states:

$L(A) = \bigcup_{q \in F} C(q)$
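
For a concrete automaton, the state classes can be approximated by brute force: enumerate all words up to some length and file each one under the state it reaches. The sketch below assumes a hypothetical two-state automaton and hypothetical helper names (`final_state`, `state_classes`); it only inspects finitely many words, so it illustrates the partition rather than proving it.

```python
from itertools import product


def final_state(delta, start, word):
    """Compute the state reached from `start` after reading `word`."""
    state = start
    for symbol in word:
        state = delta[(state, symbol)]
    return state


def state_classes(states, alphabet, delta, start, max_len):
    """Collect, for each state q, the words of length <= max_len in C(q)."""
    classes = {q: set() for q in states}
    for n in range(max_len + 1):
        for letters in product(sorted(alphabet), repeat=n):
            word = "".join(letters)
            # Determinism: each word is filed under exactly one state.
            classes[final_state(delta, start, word)].add(word)
    return classes


# Hypothetical automaton: q1 means "the last symbol read was 1".
states, alphabet = {"q0", "q1"}, {"0", "1"}
delta = {("q0", "0"): "q0", ("q0", "1"): "q1",
         ("q1", "0"): "q0", ("q1", "1"): "q1"}
accepting = {"q1"}

classes = state_classes(states, alphabet, delta, "q0", max_len=4)
all_words = set().union(*classes.values())
language = set().union(*(classes[q] for q in accepting))

print(len(all_words))                                         # 31
print(sum(len(c) for c in classes.values()) == len(all_words))  # True
print(sorted(language, key=lambda w: (len(w), w))[:3])        # ['1', '01', '11']
```

The first line counts all 31 binary words of length 0 to 4, so the classes cover them. The second confirms that no word was counted twice, so the classes are disjoint. The third shows the accepted words as the union of the accepting classes.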

Example: Proving the Parity Checker Correct

Let’s formally prove that our parity automaton accepts exactly the binary strings of even length.

Claim: The classes of the states are:

$C(q_{0}) = \{w \in \{0, 1\}^{*} \mid |w|_{0} \text{ is even and } |w|_{1} \text{ is even}\}$
$C(q_{1}) = \{w \in \{0, 1\}^{*} \mid |w|_{0} \text{ is even and } |w|_{1} \text{ is odd}\}$
$C(q_{2}) = \{w \in \{0, 1\}^{*} \mid |w|_{0} \text{ is odd and } |w|_{1} \text{ is even}\}$
$C(q_{3}) = \{w \in \{0, 1\}^{*} \mid |w|_{0} \text{ is odd and } |w|_{1} \text{ is odd}\}$

Proof by Induction on Word Length $∣ w ∣$

Base Case ( $∣ w ∣ = 0$ ): The only word is $w = λ$ . It has zero 0s and zero 1s (both even). The automaton starts and ends in $q_{0}$ . So $λ \in C (q_{0})$ . The claim holds.
Inductive Step: Assume the claim holds for all words of length $i$ . Let $w$ be a word of length $i + 1$ . We can write $w = z a$ , where $∣ z ∣ = i$ and $a$ is a single symbol. By the inductive hypothesis, we know $z$ is in the correct class based on its parity counts. We must check all 8 cases (4 states for $z$ , 2 choices for $a$ ). Let’s do one:
- Suppose $z \in C (q_{0})$ (even 0s, even 1s) and $a = 0$ . The word $w = z 0$ has an odd number of 0s and an even number of 1s. Our claim says $w$ should be in $C (q_{2})$ . Does the automaton agree? From state $q_{0}$ , reading a 0 transitions to $q_{2}$ . Yes, it matches.

By checking all 8 cases, we can confirm the claim is correct.
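
The eight cases are mechanical enough to check with a short script. Only the transition $δ(q_{0}, 0) = q_{2}$ is stated in the text above, so the full transition table below is an assumption reconstructed from the claimed classes; the function name `inductive_step_holds` is likewise hypothetical.

```python
# Claimed class of each state as (parity of 0s, parity of 1s); 0 = even, 1 = odd.
claimed = {"q0": (0, 0), "q1": (0, 1), "q2": (1, 0), "q3": (1, 1)}

# Assumed transition table: reading 0 flips the 0-parity, reading 1 the 1-parity.
delta = {
    ("q0", "0"): "q2", ("q0", "1"): "q1",
    ("q1", "0"): "q3", ("q1", "1"): "q0",
    ("q2", "0"): "q0", ("q2", "1"): "q3",
    ("q3", "0"): "q1", ("q3", "1"): "q2",
}


def inductive_step_holds(state, symbol):
    """If z is in C(state), does z + symbol land in the class the claim predicts?"""
    zeros, ones = claimed[state]
    if symbol == "0":
        zeros = (zeros + 1) % 2      # one more 0 flips the parity of the 0 count
    else:
        ones = (ones + 1) % 2        # one more 1 flips the parity of the 1 count
    return claimed[delta[(state, symbol)]] == (zeros, ones)


cases = [(q, a) for q in claimed for a in "01"]
print(len(cases))                                          # 8
print(all(inductive_step_holds(q, a) for q, a in cases))   # True
```

Each case compares two things: the parities that $z a$ must have according to arithmetic, and the parities the claim assigns to the state that $δ$ actually moves to. The case worked out by hand above is `("q0", "0")`.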

Conclusion: The accepting states are $F = \{q_{0}, q_{3}\}$, so the language is $L(A) = C(q_{0}) \cup C(q_{3})$.

Words in $C(q_{0})$ have (even 0s + even 1s) $⟹$ even total length.
Words in $C(q_{3})$ have (odd 0s + odd 1s) $⟹$ even total length.

Conversely, words in $C(q_{1})$ and $C(q_{2})$ have one even and one odd count, hence odd total length, so no even-length word ends in a rejecting state. Therefore, $L(A)$ is exactly the language of all binary strings of even length.

Building an Automaton: A Worked Example

Let’s design a DFA for the following language:

$L = \{w \in \{0, 1\}^{*} \mid (|w|_{0} = 3) \land (|w|_{1} \geq 2 \lor |w|_{1} = 0)\}$

The language consists of words with exactly three 0s AND either at least two 1s OR exactly zero 1s.

The key to tackling such a problem is to realize that the automaton must track the counts of 0s and 1s simultaneously. We can imagine a grid of states, where each state represents a pair of counts seen so far.

The number of 0s can be 0, 1, 2, 3, or “$> 3$” (more than 3). This gives us 5 conditions to track, one per column.
The number of 1s can be 0, 1, or “$\geq 2$”. This gives us 3 conditions to track, one per row.

This conceptual grid forms the structure of our automaton. Each state $q_{i, j}$ will represent having seen $i$ zeros and $j$ ones.

Let’s analyze the structure of this state diagram:

Rows: The rows count the number of 1s.
- Top row: We have seen zero 1s.
- Middle row: We have seen exactly one 1.
- Bottom row: We have seen two or more 1s.
Columns: The columns count the number of 0s.
- First column: Zero 0s.
- Second column: One 0.
- Third column: Two 0s.
- Fourth column: Exactly three 0s.
- Fifth column: More than three 0s (a “trap” column).

Transitions:

Reading a 0 moves us one step to the right within the same row (increasing the 0 count).
Reading a 1 moves us one step down within the same column (increasing the 1 count).

Special States:

Start State: The top-left state, $q_{0, 0}$ , represents having seen zero 0s and zero 1s.
Trap States: The rightmost column represents having seen more than three 0s. Once we enter this column, we can never leave, because the condition $∣ w ∣_{0} = 3$ can no longer be met.
Loops: The bottom row represents having seen two or more 1s. Since the condition is $∣ w ∣_{1} \geq 2$ , reading more 1s doesn’t change the satisfaction of this condition, so we loop on 1s in this row. Similarly, we loop on 0s in the rightmost column.

Accepting States: To be accepted, a word must end in a state that satisfies both conditions.

The 0s condition: $∣ w ∣_{0} = 3$ . This means we must be in the fourth column.
The 1s condition: $∣ w ∣_{1} = 0$ OR $∣ w ∣_{1} \geq 2$ . This means we must be in the top row OR the bottom row.

The states that satisfy both are the intersection of these requirements:

$q_{3, 0}$ (top row, fourth column)
$q_{3, \geq 2}$ (bottom row, fourth column)

These are marked with double circles in the diagram.
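
The grid can be sketched in code and compared against the language definition on all short words. The encoding is an assumption made for this sketch: a state is a pair `(i, j)`, where `i = 4` stands for “more than three 0s” and `j = 2` stands for “two or more 1s”.

```python
from itertools import product

START = (0, 0)                   # top-left state: zero 0s, zero 1s
ACCEPTING = {(3, 0), (3, 2)}     # fourth column, top row or bottom row


def step(state, symbol):
    """Transition function of the 5 x 3 grid automaton."""
    i, j = state
    if symbol == "0":
        return (min(i + 1, 4), j)    # move right; loop inside the trap column
    return (i, min(j + 1, 2))        # move down; loop inside the bottom row


def accepts(word):
    state = START
    for symbol in word:
        state = step(state, symbol)
    return state in ACCEPTING


def in_language(word):
    """Direct test of the definition of L."""
    zeros, ones = word.count("0"), word.count("1")
    return zeros == 3 and (ones >= 2 or ones == 0)


words = ["".join(p) for n in range(9) for p in product("01", repeat=n)]
print(all(accepts(w) == in_language(w) for w in words))   # True
print(accepts("000"), accepts("0001"), accepts("01010"))  # True False True
```

The two `min` calls are where the loops in the bottom row and the trap column come from. Agreement on all words of length at most 8 is evidence, not proof; the proof is the state-class argument.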

State Minimization (A Quick Look)

One might notice that the three states in the rightmost “trap” column are functionally identical: once entered, they are non-accepting and can never be left. We could simplify the automaton by merging these three states into a single “trap” or “garbage” state ($q_{\text{Abfall}}$, named after the German word for garbage).

(Note: This is a conceptual simplification; the original diagram is also a valid and correct DFA for the language.)
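
A sketch of the merged variant, under the same assumed encoding of states as pairs `(i, j)` with `j = 2` meaning “two or more 1s”, plus a single hypothetical state named `"trap"` that replaces the whole rightmost column:

```python
from itertools import product

START = (0, 0)
ACCEPTING = {(3, 0), (3, 2)}


def step(state, symbol):
    """Transition function with the trap column merged into one state."""
    if state == "trap":
        return "trap"                    # never leave the trap state
    i, j = state
    if symbol == "0":
        return "trap" if i == 3 else (i + 1, j)   # a fourth 0 is fatal
    return (i, min(j + 1, 2))


def accepts(word):
    state = START
    for symbol in word:
        state = step(state, symbol)
    return state in ACCEPTING


def in_language(word):
    zeros, ones = word.count("0"), word.count("1")
    return zeros == 3 and (ones >= 2 or ones == 0)


words = ["".join(p) for n in range(9) for p in product("01", repeat=n)]
reachable = {START}
for w in words:
    state = START
    for symbol in w:
        state = step(state, symbol)
    reachable.add(state)

print(all(accepts(w) == in_language(w) for w in words))   # True
print(len(reachable))                                     # 13
```

The merged automaton has 13 states instead of 15 (twelve grid states plus the trap) and still agrees with the language definition on every word of length at most 8.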

Defining the State Classes

To formally prove this automaton is correct, we would define the class for each state. Let’s name the states $q_{i, j}$ where $i$ is the count of 0s and $j$ is the count of 1s.

For $i \in \{0, 1, 2, 3\}$ and $j \in \{0, 1\}$: $C(q_{i, j}) = \{w \mid |w|_{0} = i \text{ and } |w|_{1} = j\}$
For the “at least” states: $C(q_{i, \geq 2}) = \{w \mid |w|_{0} = i \text{ and } |w|_{1} \geq 2\}$
For the trap states: $C(q_{> 3, j}) = \{w \mid |w|_{0} > 3 \text{ and } |w|_{1} = j\}$ (and similarly for $j \geq 2$)

The language of the automaton is the union of the classes of the accepting states:

$L(A) = C(q_{3, 0}) \cup C(q_{3, \geq 2})$

This precisely matches the formal definition of our target language.

Defining State Classes for the Infix Automaton

A typical exam question would be not only to design an automaton but also to formally define the class of each state, that is, to describe precisely which words land you in which state. Let’s do this for the infix automaton: the DFA over $\{A, B\}$ that accepts every word containing the infix AABA.

The key is to remember what each state represents: the longest suffix of the word seen so far that is also a prefix of our target pattern, AABA.

The Accepting State: $C(q_{AABA})$. This one is straightforward. The only way to land in the accepting state is to have successfully found the infix AABA. Once there, we never leave. Therefore, the class of this state is simply the language $L$ itself.
$C(q_{AABA}) = L = \{w \in \{A, B\}^{*} \mid w \text{ contains the infix } AABA\}$
The “Almost There” State: $C(q_{AAB})$. This is the class of all words that have put us on the brink of success.
$C(q_{AAB}) = \{w \in \{A, B\}^{*} \mid w \text{ has suffix } AAB, \text{ but } w \notin L\}$
We must explicitly exclude the words that are already in $L$. For example, the word AABAAB ends in AAB, but it also contains AABA, so it would be in $C(q_{AABA})$, not here.
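The “Two As” State: $C(q_{AA})$. These are words where the longest relevant suffix is AA.
$C(q_{AA}) = \{w \in \{A, B\}^{*} \mid w \text{ has suffix } AA, \text{ but } w \notin L \cup C(q_{AAB})\}$
The exclusion of $L$ is what matters here. A word ending in AA cannot also end in AAB, so excluding $C(q_{AAB})$ removes nothing; it is listed only so that every class follows the same pattern of excluding all longer relevant suffixes.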
The “Single A” State: $C(q_{A})$. These are words where the longest relevant suffix is just a single A.
$C(q_{A}) = \{w \in \{A, B\}^{*} \mid w \text{ has suffix } A, \text{ but } w \notin L \cup C(q_{AAB}) \cup C(q_{AA})\}$
The Start State: $C(q_{λ})$. This class contains everything else. These are the words that do not contain the AABA infix and do not end in A, AA, or AAB.
$C(q_{λ}) = \{w \in \{A, B\}^{*} \mid w \notin (L \cup C(q_{AAB}) \cup C(q_{AA}) \cup C(q_{A}))\}$
This includes words that end in B, like BB or ABAB, and the empty word $λ$ .
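
These class descriptions can be checked against a machine on all short words. The sketch below is an assumption-laden reconstruction, not the original diagram: it names each state by the pattern prefix it represents and derives every transition from the “longest suffix that is also a prefix” rule stated above. The function names `step`, `final_state`, and `described_class` are hypothetical.

```python
from itertools import product

PATTERN = "AABA"
# One state per prefix of the pattern: "", "A", "AA", "AAB", "AABA".
PREFIXES = [PATTERN[:k] for k in range(len(PATTERN) + 1)]


def step(state, symbol):
    """Move to the longest suffix of state + symbol that is a pattern prefix."""
    if state == PATTERN:               # the accepting state is never left
        return state
    candidate = state + symbol
    while candidate not in PREFIXES:   # terminates because "" is a prefix
        candidate = candidate[1:]
    return candidate


def final_state(word):
    state = ""
    for symbol in word:
        state = step(state, symbol)
    return state


def described_class(word):
    """State predicted by the class definitions, checked longest-first."""
    if PATTERN in word:
        return "AABA"
    for suffix in ("AAB", "AA", "A"):
        if word.endswith(suffix):
            return suffix
    return ""                          # everything else, including lambda


words = ["".join(p) for n in range(9) for p in product("AB", repeat=n)]
print(all(final_state(w) == described_class(w) for w in words))  # True
print(final_state("AABAAB"), final_state("ABAB") == "")          # AABA True
```

Checking the suffixes from longest to shortest plays the role of the “but $w \notin \dots$” clauses: a word is assigned to the first class that fits, so it can never be counted again for a shorter suffix.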

This completes the formal analysis of our automaton. We have not only designed a machine to solve the problem but have also rigorously defined what each component of the machine is “thinking” at every step of the computation.

Continue here: 07 Combining Automata and Proving Non-Regularity
