The theory of formal languages is a cornerstone of theoretical computer science, with profound connections to linguistics and even evolutionary biology. While our previous computational models, such as finite automata and Turing machines, were primarily conceived as acceptors of languages (recognizing whether a given word belongs to a language), grammars offer an alternative perspective: they are mechanisms for generating (producing) words.
Grammars provide a finite way to describe potentially infinite languages. For linguists, they are crucial for formally describing the syntax of natural languages. In computer science, grammars, especially context-free grammars, are central to defining the syntax of programming languages and are indispensable in compiler construction. (They allow us to precisely define the valid structure of code, enabling compilers to parse and understand programs.) From a computability standpoint, the most general type of grammars (Type-0) are as powerful as Turing machines, further reinforcing Church’s thesis.
This chapter will introduce the fundamental concept of grammars, explain their generative mechanism, and classify them according to the Chomsky Hierarchy. We will explore each type of grammar, relating its generative power to the corresponding automaton model we’ve studied.
10.1 Aims
By the end of this chapter, you will be able to:
- Understand the Concept of Grammars: Grasp how grammars axiomatically define and generate languages.
- Formally Define Grammars: Understand the components of a grammar (nonterminals, terminals, production rules, start symbol) and the process of derivation.
- Classify Grammars by the Chomsky Hierarchy: Differentiate between Type-0 (unrestricted), Type-1 (context-sensitive), Type-2 (context-free), and Type-3 (regular) grammars based on their rule restrictions.
- Relate Grammars to Automata: Understand the equivalence between regular grammars and finite automata, and context-free grammars and pushdown automata.
- Apply Normal Forms: Understand Chomsky and Greibach normal forms for context-free grammars.
- Prove Non-Context-Freeness: Utilize the Pumping Lemma for Context-Free Languages and Ogden’s Lemma to demonstrate that certain languages are not context-free.
10.2 The Concept of Grammars
Grammars provide an axiomatic method for defining classes of objects. Just as Boolean formulas can be defined recursively, languages can be defined by a set of rules that generate their words.
Generation Procedure for $\{a^nb^n \mid n \in \mathbb{N}\}$
Consider the following rules for generating a language $L$ over $\{a, b\}$:
- $\lambda \in L$.
- If $w \in L$, then $awb \in L$.
- No other words belong to $L$.
Starting with $\lambda$, applying rule (2) once yields $ab$, twice yields $aabb$, and so on. This procedure generates exactly the language $L = \{a^nb^n \mid n \in \mathbb{N}\}$.
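The three rules above can be sketched directly in code (a minimal illustration; the function name is ours):

```python
# Enumerate the first few words of L by the generation procedure:
# rule (1) contributes the empty word, rule (2) maps w to a w b.
def generate(max_steps):
    """Return all words obtained by applying rule (2) up to max_steps times."""
    words = [""]          # rule (1): the empty word is in L
    w = ""
    for _ in range(max_steps):
        w = "a" + w + "b"  # rule (2): if w is in L, then awb is in L
        words.append(w)
    return words

print(generate(3))  # -> ['', 'ab', 'aabb', 'aaabbb']
```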
This idea is formalized using terminal symbols (the alphabet of the language, which are the final characters in a generated word), nonterminal symbols (variables that are replaced during generation, acting as placeholders for structures), production rules (the rewrite rules), and a start symbol.
Definition 10.1 (Grammar)
- A grammar is a 4-tuple $G = (\Sigma_N, \Sigma_T, P, S)$, where:
  a. $\Sigma_N$ is the nonterminal alphabet (set of nonterminals).
  b. $\Sigma_T$ is the terminal alphabet (set of terminal symbols), with $\Sigma_N \cap \Sigma_T = \emptyset$.
  c. $S \in \Sigma_N$ is the start symbol.
  d. $P$ is a finite set of production rules (or productions), where each rule is of the form $\alpha \to \beta$, with $\alpha \in (\Sigma_N \cup \Sigma_T)^*\Sigma_N(\Sigma_N \cup \Sigma_T)^*$ and $\beta \in (\Sigma_N \cup \Sigma_T)^*$.
- Let $\gamma, \delta \in (\Sigma_N \cup \Sigma_T)^*$. We say $\delta$ is derivable from $\gamma$ in one step in $G$, denoted $\gamma \Rightarrow_G \delta$, if $\gamma = \omega_1\alpha\omega_2$ and $\delta = \omega_1\beta\omega_2$ for some rule $\alpha \to \beta \in P$ and $\omega_1, \omega_2 \in (\Sigma_N \cup \Sigma_T)^*$.
- We say $\delta$ is derivable from $\gamma$ in $G$, denoted $\gamma \Rightarrow_G^* \delta$, if $\gamma = \delta$ or there is a sequence of one-step derivations $\gamma \Rightarrow_G \omega_1 \Rightarrow_G \cdots \Rightarrow_G \omega_k \Rightarrow_G \delta$.
- The language generated by $G$ is $L(G) = \{w \in \Sigma_T^* \mid S \Rightarrow_G^* w\}$.
Grammar for $\{w \in \{a, b\}^* \mid |w|_a = |w|_b\}$
Let $G = (\{S\}, \{a, b\}, P, S)$ with $P = \{S \to \lambda,\ S \to aSbS,\ S \to bSaS\}$. A derivation for
baabaabb: $S \Rightarrow bSaS \Rightarrow baS \Rightarrow baaSbS \Rightarrow baabS \Rightarrow baabaSbS \Rightarrow baabaaSbSbS \Rightarrow baabaabSbS \Rightarrow baabaabbS \Rightarrow baabaabb$. This grammar generates all words with an equal number of 'a's and 'b's.
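The one-step derivation relation can be checked mechanically. The sketch below (our own illustration; rules are stored as a dict from a nonterminal to its right-hand sides, with `""` standing for $\lambda$) verifies the derivation of baabaabb step by step:

```python
def is_one_step(cur, nxt, rules):
    """Check whether nxt is derivable from cur in one step: some occurrence
    of a left-hand side is replaced by one of its right-hand sides."""
    for lhs, rhs_list in rules.items():
        for i in range(len(cur)):
            if cur.startswith(lhs, i):
                for rhs in rhs_list:
                    if cur[:i] + rhs + cur[i + len(lhs):] == nxt:
                        return True
    return False

rules = {"S": ["", "aSbS", "bSaS"]}  # S -> lambda | aSbS | bSaS
derivation = ["S", "bSaS", "baS", "baaSbS", "baabS", "baabaSbS",
              "baabaaSbSbS", "baabaabSbS", "baabaabbS", "baabaabb"]
ok = all(is_one_step(derivation[k], derivation[k + 1], rules)
         for k in range(len(derivation) - 1))
```

Here `ok` evaluates to `True`, confirming that each listed step is a legal application of one production.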
Grammars are inherently nondeterministic: multiple rules might apply, or multiple nonterminals could be chosen for replacement. (This flexibility allows a single grammar to describe a wide range of valid strings, reflecting the ambiguity often present in natural and programming languages.)
10.3 Regular Grammars and Finite Automata
The simplest type of grammars in the Chomsky Hierarchy are regular grammars.
Definition 10.2 (Chomsky Hierarchy - Part 1)
Let $G = (\Sigma_N, \Sigma_T, P, S)$ be a grammar.
- $G$ is a regular grammar (Type-3 grammar) if all rules in $P$ are of the form $X \to uY$ or $X \to u$, where $X, Y \in \Sigma_N$ and $u \in \Sigma_T^*$. (These simple rules essentially allow generating a terminal string and then optionally transitioning to another nonterminal, mimicking the state transitions of a finite automaton.) (Typically, $u$ is restricted to a single terminal symbol or the empty string, i.e., $u \in \Sigma_T \cup \{\lambda\}$.)
A language is regular if it is generated by a regular grammar. The class of regular languages is denoted $\mathcal{L}_3$.
Closure Properties of Regular Languages
The class $\mathcal{L}_3$ is closed under many operations, mirroring the closure properties of the regular languages defined by finite automata.
Lemma 10.2 ($\mathcal{L}_3$ is closed under union)
If $L_1, L_2 \in \mathcal{L}_3$, then $L_1 \cup L_2 \in \mathcal{L}_3$.
Proof of Closure under Union
Let $G_1 = (\Sigma_{N,1}, \Sigma_T, P_1, S_1)$ and $G_2 = (\Sigma_{N,2}, \Sigma_T, P_2, S_2)$ be regular grammars for $L_1$ and $L_2$, respectively. Assume $\Sigma_{N,1} \cap \Sigma_{N,2} = \emptyset$. Construct $G = (\Sigma_{N,1} \cup \Sigma_{N,2} \cup \{S\}, \Sigma_T, P, S)$, where $S$ is a new start symbol and $P = P_1 \cup P_2 \cup \{S \to S_1, S \to S_2\}$. $G$ is regular and $L(G) = L_1 \cup L_2$.
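The union construction can be sketched as follows (a minimal illustration with our own representation: a grammar is a dict from nonterminals to right-hand sides, and the fresh start symbol gets chain rules to both original start symbols):

```python
def union_grammar(p1, s1, p2, s2, new_start="S"):
    """Combine two regular grammars (as rule dicts) for the union language."""
    assert not set(p1) & set(p2)                 # nonterminal alphabets disjoint
    assert new_start not in p1 and new_start not in p2
    p = {new_start: [s1, s2]}                    # S -> S1 | S2 (rules with u = lambda)
    p.update(p1)
    p.update(p2)
    return p

# G1 generates {a^n : n >= 1}, G2 generates {b^n : n >= 1} (illustrative).
g = union_grammar({"X": ["aX", "a"]}, "X", {"Y": ["bY", "b"]}, "Y")
```

Every derivation in `g` first commits to one of the two subgrammars, so the generated language is exactly the union.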
Lemma 10.3 ($\mathcal{L}_3$ is closed under concatenation)
If $L_1, L_2 \in \mathcal{L}_3$, then $L_1L_2 \in \mathcal{L}_3$.
Proof of Closure under Concatenation
Let $G_1 = (\Sigma_{N,1}, \Sigma_T, P_1, S_1)$ and $G_2 = (\Sigma_{N,2}, \Sigma_T, P_2, S_2)$ be regular grammars for $L_1$ and $L_2$. Assume $\Sigma_{N,1} \cap \Sigma_{N,2} = \emptyset$. Construct $G = (\Sigma_{N,1} \cup \Sigma_{N,2}, \Sigma_T, P, S_1)$ as follows. $P$ contains all rules from $P_2$. For each rule $X \to u$ in $P_1$ (where $u \in \Sigma_T^*$), add $X \to uS_2$ to $P$. For each rule $X \to uY$ in $P_1$, add it to $P$. $G$ is regular and $L(G) = L_1L_2$.
Equivalence with Finite Automata
A cornerstone result is that regular grammars generate precisely the class of languages accepted by finite automata.
Theorem 10.1 (FA to Regular Grammar)
To each finite automaton $A$, there exists a regular grammar $G$ such that $L(A) = L(G)$.
Proof of FA to Regular Grammar Equivalence
Let $A = (Q, \Sigma, \delta, q_0, F)$ be an FA. Construct $G = (Q, \Sigma, P, q_0)$ where:
- For each transition $\delta(q, a) = p$, add rule $q \to ap$ to $P$.
- For each accepting state $q \in F$, add rule $q \to \lambda$ to $P$. The nonterminals of $G$ are the states of $A$. A derivation in $G$ simulates a computation in $A$.
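The construction is purely mechanical, as the sketch below shows (our own illustration; `""` stands for the rule $q \to \lambda$, and the example DFA, accepting words over $\{0, 1\}$ that end in 1, is an assumption):

```python
def fa_to_grammar(states, delta, start, accepting):
    """Theorem 10.1: states become nonterminals; delta(q, a) = p yields
    the rule q -> a p; each accepting state q yields q -> lambda."""
    rules = {q: [] for q in states}
    for (q, a), p in delta.items():
        rules[q].append(a + p)   # rule q -> a p
    for q in accepting:
        rules[q].append("")      # rule q -> lambda
    return rules, start

# Illustrative DFA: state Z = "last symbol was not 1", N = "last symbol was 1".
delta = {("Z", "1"): "N", ("Z", "0"): "Z",
         ("N", "1"): "N", ("N", "0"): "Z"}
rules, start = fa_to_grammar({"Z", "N"}, delta, "Z", {"N"})
```

A derivation such as $Z \Rightarrow 1N \Rightarrow 11N \Rightarrow 11$ then retraces the accepting run of the automaton on 11.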
To prove the reverse, we first need to normalize regular grammars.
Definition 10.3 (Normalized Regular Grammar)
A regular grammar is normalized if all rules are of one of the following forms:
- $S \to \lambda$ (if $\lambda \in L(G)$)
- $X \to aY$ for $X, Y \in \Sigma_N$, $a \in \Sigma_T$
- $X \to a$ for $X \in \Sigma_N$, $a \in \Sigma_T$
Theorem 10.2 (Normalization of Regular Grammars)
To every regular grammar, there exists an equivalent normalized regular grammar.
Proof of Normalization
The proof involves a series of transformations:
- Eliminate chain rules ($X \to Y$ with $X, Y \in \Sigma_N$): For each $X$, find all $Z$ such that $X \Rightarrow^* Z$ using only chain rules. For every rule $Z \to \beta$ (where $\beta$ is not a single nonterminal), add $X \to \beta$.
- Eliminate $\lambda$-rules ($X \to \lambda$ for $X \neq S$): For every rule $Y \to uX$, add $Y \to u$.
- Break long terminal strings: Replace $X \to a_1a_2\cdots a_kY$ (for $k \geq 2$) with $X \to a_1X_1, X_1 \to a_2X_2, \ldots, X_{k-1} \to a_kY$ using new nonterminals $X_1, \ldots, X_{k-1}$. Similarly for $X \to a_1a_2\cdots a_k$.
Theorem 10.3 (Regular Grammar to NFA)
To each regular grammar $G$, there exists a nondeterministic finite automaton (NFA) $M$ such that $L(G) = L(M)$.
Proof of Regular Grammar to NFA Equivalence
Let $G = (\Sigma_N, \Sigma_T, P, S)$ be a normalized regular grammar. Construct $M = (\Sigma_N \cup \{p_f\}, \Sigma_T, \delta, S, F)$, where $p_f$ is a new final state.
- $F = \{p_f\}$ (and $F = \{p_f, S\}$ if $S \to \lambda \in P$).
- For each rule $X \to aY$, add transition $Y \in \delta(X, a)$.
- For each rule $X \to a$, add transition $p_f \in \delta(X, a)$. The nonterminals of $G$ become states in $M$. A derivation in $G$ corresponds to an accepting path in $M$.
These theorems establish that regular grammars and finite automata have equivalent expressive power. (This equivalence is fundamental: it means we can either describe simple patterns using a generative grammar or recognize them using a finite-state machine, offering two complementary perspectives on the same class of languages.)
10.4 Context-Free Grammars and Pushdown Automata
Context-free grammars (CFGs) are more powerful than regular grammars and are fundamental to describing the syntax of programming languages.
Definition 10.2 (Chomsky Hierarchy - Part 2)
- $G$ is a context-free grammar (Type-2 grammar) if all rules in $P$ are of the form $X \to \beta$, where $X \in \Sigma_N$ and $\beta \in (\Sigma_N \cup \Sigma_T)^*$.
A language is context-free if it is generated by a context-free grammar. The class of context-free languages is denoted $\mathcal{L}_2$. The key difference from regular grammars is that $\beta$ can contain multiple nonterminals, allowing for recursive structures. (This added flexibility allows CFGs to describe languages with nested dependencies, like correctly matched parentheses or arithmetic expressions, which regular grammars cannot handle.)
Context-Free Language $\{a^nb^n \mid n \in \mathbb{N}\}$
The grammar $G = (\{S\}, \{a, b\}, \{S \to aSb,\ S \to \lambda\}, S)$ generates $L(G) = \{a^nb^n \mid n \in \mathbb{N}\}$. This language is not regular.
Syntax Trees
Derivations in CFGs can be represented by syntax trees (or parse trees), which visually depict the hierarchical structure of the generated word. (These trees are invaluable for understanding the structure of programming language constructs and are a core concept in compiler design.)
Definition 10.4 (Syntax Tree)
A syntax tree for a CFG is an ordered tree where:
- The root is labeled with the start symbol.
- Internal nodes are labeled with nonterminals.
- Leaves are labeled with terminals or $\lambda$.
- If an internal node labeled $X$ has children labeled $\alpha_1, \ldots, \alpha_k$ (from left to right), then $X \to \alpha_1\cdots\alpha_k$ must be a production rule in $P$. The concatenation of the labels of the leaves (read from left to right) forms the generated word.
Syntax Tree for Arithmetic Expression
For the grammar $E \to E + T \mid T$, $T \to T * F \mid F$, $F \to (E) \mid \mathrm{id}$, the expression
(id * id) + id has a syntax tree showing its structure.
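A small recursive-descent sketch makes the tree concrete (our own illustration: trees are nested tuples `(nonterminal, children...)`, and the left-recursive rules are implemented iteratively, producing the same left-leaning trees):

```python
def parse_E(tokens, i):
    node, i = parse_T(tokens, i)
    node = ("E", node)                        # E -> T
    while i < len(tokens) and tokens[i] == "+":
        right, i = parse_T(tokens, i + 1)
        node = ("E", node, "+", right)        # E -> E + T
    return node, i

def parse_T(tokens, i):
    node, i = parse_F(tokens, i)
    node = ("T", node)                        # T -> F
    while i < len(tokens) and tokens[i] == "*":
        right, i = parse_F(tokens, i + 1)
        node = ("T", node, "*", right)        # T -> T * F
    return node, i

def parse_F(tokens, i):
    if tokens[i] == "(":
        inner, i = parse_E(tokens, i + 1)
        assert tokens[i] == ")"
        return ("F", "(", inner, ")"), i + 1  # F -> ( E )
    assert tokens[i] == "id"
    return ("F", "id"), i + 1                 # F -> id

def leaves(node):
    """Concatenate leaf labels left to right: this is the generated word."""
    if isinstance(node, str):
        return node
    return "".join(leaves(child) for child in node[1:])

tree, _ = parse_E(["(", "id", "*", "id", ")", "+", "id"], 0)
```

Reading the leaves of `tree` from left to right reproduces `(id*id)+id`, exactly as Definition 10.4 demands.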
Normal Forms for Context-Free Grammars
For theoretical analysis and parsing algorithms, CFGs are often converted into simpler forms.
Definition 10.5 (Chomsky Normal Form and Greibach Normal Form)
Let $G = (\Sigma_N, \Sigma_T, P, S)$ be a context-free grammar with $\lambda \notin L(G)$ (the empty string is not in $L(G)$).
- $G$ is in Chomsky Normal Form (CNF) if all rules are of the form $X \to a$ (where $a \in \Sigma_T$) or $X \to YZ$ (where $Y, Z \in \Sigma_N$). (CNF ensures that every production either generates a single terminal or combines exactly two nonterminals, simplifying parsing algorithms.)
- $G$ is in Greibach Normal Form (GNF) if all rules are of the form $X \to a\alpha$ (where $a \in \Sigma_T$ and $\alpha \in \Sigma_N^*$). (GNF ensures that every production starts with a terminal, which is useful for top-down parsing.)
Theorem 10.4 (Existence of CNF and GNF)
For every context-free grammar $G$ with $\lambda \notin L(G)$, there exist equivalent grammars in Chomsky Normal Form and in Greibach Normal Form.
Proof of Existence of CNF and GNF
The conversion to CNF typically involves:
- Eliminating useless symbols (nonterminals that cannot be reached or cannot derive a terminal string).
- Eliminating $\lambda$-productions (rules $X \to \lambda$; in the general setting, $S \to \lambda$ is kept when $\lambda \in L(G)$).
- Eliminating chain rules ($X \to Y$ with $X, Y \in \Sigma_N$).
- Replacing rules with mixed terminals/nonterminals or long right-hand sides with new nonterminals and binary rules.
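The last step above can be sketched concisely (our own illustration: the right-hand side is given as a list of at least two nonterminal names, and the fresh names `C1, C2, ...` are hypothetical):

```python
def binarize(lhs, rhs):
    """Break X -> Y1 Y2 ... Yk (k >= 2, all nonterminals) into binary rules
    X -> Y1 C1, C1 -> Y2 C2, ..., using fresh nonterminals C1, C2, ..."""
    rules, cur = [], lhs
    for k in range(len(rhs) - 2):
        fresh = "C%d" % (k + 1)        # fresh nonterminal (illustrative naming)
        rules.append((cur, [rhs[k], fresh]))
        cur = fresh
    rules.append((cur, rhs[-2:]))      # final binary rule
    return rules

print(binarize("X", ["A", "B", "C", "D"]))
# -> [('X', ['A', 'C1']), ('C1', ['B', 'C2']), ('C2', ['C', 'D'])]
```

Each fresh nonterminal derives exactly the suffix of the original right-hand side it stands for, so the generated language is unchanged.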
Pumping Lemma for Context-Free Languages
Similar to regular languages, there’s a pumping lemma for CFGs to prove that a language is not context-free.
Lemma 10.6 (Pumping Lemma for Context-Free Languages)
If $L$ is a context-free language, then there exists a constant $n_L$ (depending only on $L$) such that for all words $z \in L$ with $|z| \geq n_L$, there exists a decomposition $z = uvwxy$ such that:
- $|vx| \geq 1$ (at least one of $v$ or $x$ is non-empty).
- $|vwx| \leq n_L$.
- $uv^iwx^iy \in L$ for all $i \in \mathbb{N}$.
(Important Note: The Pumping Lemma can only be used to prove that a language is not context-free. It cannot be used to prove that a language is context-free.)
Proof of Pumping Lemma for CFLs
The proof relies on the fact that in any sufficiently long derivation tree, some nonterminal must repeat on a path from the root to a leaf. This repeating nonterminal allows for “pumping” (repeating or removing) the corresponding segments of the word.
Proving $\{a^nb^nc^n \mid n \in \mathbb{N}\}$ is not context-free
Assume $L = \{a^nb^nc^n \mid n \in \mathbb{N}\}$ is context-free. Let $n_L$ be the pumping length. Choose $z = a^{n_L}b^{n_L}c^{n_L}$. By the pumping lemma, $z = uvwxy$ with $|vwx| \leq n_L$ and $|vx| \geq 1$. Since $|vwx| \leq n_L$, the segment $vwx$ cannot contain all three types of symbols ($a$, $b$, $c$); it can contain at most two types. Whether $vx$ contains only $a$'s and $b$'s (or only $b$'s and $c$'s, or only one type of symbol), pumping $v$ and $x$ (i.e., forming $uv^2wx^2y$) increases the count of only one or two types of symbols, while the third remains unchanged. This breaks the pattern, so $uv^2wx^2y \notin L$. This contradicts condition (3) of the pumping lemma, so $L$ is not context-free.
Pushdown Automata
Nondeterministic Pushdown Automata (NPdAs) are the machine model equivalent to context-free grammars. They extend finite automata with a stack (a LIFO data structure) for memory. (This stack memory is crucial for handling nested structures, like matching parentheses, which finite automata cannot do.)
Definition 10.6 (Nondeterministic Pushdown Automaton)
An NPdA is a 6-tuple $M = (Q, \Sigma, \Gamma, \delta, q_0, Z_0)$, where:
- $Q$ is a finite set of states.
- $\Sigma$ is the input alphabet.
- $\Gamma$ is the stack alphabet.
- $\delta$ is the transition function: $\delta$ maps $Q \times (\Sigma \cup \{\lambda\}) \times \Gamma$ to finite subsets of $Q \times \Gamma^*$.
- $q_0 \in Q$ is the initial state.
- $Z_0 \in \Gamma$ is the initial stack symbol. An NPdA accepts a word $w$ if, starting in $q_0$ with $Z_0$ on the stack, it can read all of $w$ and empty its stack.
NPdA for $\{0^n1^n \mid n \in \mathbb{N}\}$
An NPdA for $L = \{0^n1^n \mid n \in \mathbb{N}\}$ would push a '0' onto the stack for each '0' read from the input. When a '1' is read, it pops a '0' from the stack. If the stack is empty (after popping $Z_0$ by a $\lambda$-move) when all '1's are read, the word is accepted.
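The behaviour of this automaton can be simulated directly; on this language the NPdA has no real choices, so a deterministic sketch suffices (function name and symbol names are our own):

```python
def accepts_0n1n(word):
    """Simulate the NPdA: push for each 0, pop for each 1,
    accept iff the input is consumed and the stack is emptied."""
    stack = ["Z0"]                # initial stack symbol Z0
    seen_one = False
    for c in word:
        if c == "0":
            if seen_one:
                return False      # a 0 after a 1: no transition, reject
            stack.append("0")     # push a 0 for each 0 read
        elif c == "1":
            seen_one = True
            if stack[-1] != "0":
                return False      # more 1s than 0s: reject
            stack.pop()
        else:
            return False
    stack.pop()                   # final lambda-move: pop Z0 (or a leftover 0)
    return not stack              # accept iff the stack is now empty
```

For example, `accepts_0n1n("000111")` is `True`, while `accepts_0n1n("001")` and `accepts_0n1n("10")` are `False`.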
Equivalence of CFGs and NPdAs
A language $L$ is context-free if and only if there exists a nondeterministic pushdown automaton $M$ such that $L = L(M)$.
Proof of Equivalence of CFGs and NPdAs
- CFG to NPdA: Given a CFG in Greibach Normal Form, an NPdA can be constructed with a single state. For a rule $X \to a\alpha$, the NPdA reads $a$, pops $X$ from the stack, and pushes $\alpha$ (reversed, so that the leftmost nonterminal of $\alpha$ ends up on top).
- NPdA to CFG: This is more complex. It involves constructing a CFG whose nonterminals represent the state changes and stack contents of the NPdA.
10.5 Context-Sensitive and Unrestricted Grammars
Beyond context-free grammars lie more powerful types, completing the Chomsky Hierarchy.
Definition 10.2 (Chomsky Hierarchy - Part 3)
- $G$ is a context-sensitive grammar (Type-1 grammar) if all rules $\alpha \to \beta$ satisfy $|\alpha| \leq |\beta|$, and $\alpha$ contains at least one nonterminal. (This means a nonterminal can only be replaced if it is in a specific "context" of surrounding symbols, and the restriction $|\alpha| \leq |\beta|$ means rules cannot shorten strings.)
- $G$ is an unrestricted grammar (Type-0 grammar) if it has no restrictions on its rules other than that each left-hand side contains at least one nonterminal.
A language is context-sensitive (resp. recursively enumerable) if it is generated by a context-sensitive (resp. unrestricted) grammar. The classes are denoted $\mathcal{L}_1$ and $\mathcal{L}_0$.
Generative Power and Automata Equivalence
- Context-Sensitive Grammars ($\mathcal{L}_1$): These grammars can model languages where the replacement of a nonterminal depends on its surrounding context. They are equivalent in power to Linear Bounded Automata (LBAs), which are Turing machines whose tape head cannot move beyond the portion of the tape initially occupied by the input. Context-sensitive languages are decidable.
- Unrestricted Grammars ($\mathcal{L}_0$): These are the most powerful grammars, equivalent to Turing Machines. They can generate any recursively enumerable language. Languages generated by unrestricted grammars are not necessarily decidable.
The Chomsky Hierarchy forms a strict inclusion: $\mathcal{L}_3 \subsetneq \mathcal{L}_2 \subsetneq \mathcal{L}_1 \subsetneq \mathcal{L}_0$. (Each level is strictly more powerful than the next more restricted one: for each $i \in \{0, 1, 2\}$, there are languages generated by a Type-$i$ grammar but by no Type-$(i+1)$ grammar.)
Summary
- Grammars are generative models for languages, defined by nonterminals, terminals, production rules, and a start symbol.
- The Chomsky Hierarchy classifies grammars into four types based on restrictions on their production rules, each corresponding to a different class of languages and computational power.
- Type-3 (Regular Grammars): Generate regular languages, equivalent to finite automata. Rules are $X \to uY$ or $X \to u$ with $u \in \Sigma_T^*$.
- Type-2 (Context-Free Grammars): Generate context-free languages, equivalent to pushdown automata. Rules are $X \to \beta$ with $\beta \in (\Sigma_N \cup \Sigma_T)^*$. Crucial for programming language syntax.
- Type-1 (Context-Sensitive Grammars): Generate context-sensitive languages, equivalent to linear bounded automata. Rules $\alpha \to \beta$ with $|\alpha| \leq |\beta|$.
- Type-0 (Unrestricted Grammars): Generate recursively enumerable languages, equivalent to Turing machines. No restrictions on rules.
- Syntax trees provide a hierarchical representation of derivations in context-free grammars.
- Normal forms (Chomsky Normal Form, Greibach Normal Form) simplify CFGs for analysis and parsing.
- Pumping lemmas (for regular and context-free languages) are powerful tools for proving that a language does not belong to a particular class.
- The hierarchy demonstrates a fundamental trade-off between the expressiveness of a language model and the complexity of the automaton required to process it.
Exercises
Exercise 10.1 (Regular Grammar Construction)
Construct a regular grammar for the language $L = \{w \in \{0, 1\}^* \mid w \text{ contains an even number of 0s}\}$.
Solution
Let $G = (\{E, O\}, \{0, 1\}, P, E)$, where $E$ means an even number of 0s seen so far, and $O$ means an odd number of 0s seen so far. $P = \{E \to 1E,\ E \to 0O,\ O \to 1O,\ O \to 0E,\ E \to \lambda\}$.
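The solution can be sanity-checked by enumeration (our own sketch: since the grammar is regular, each sentential form has at most one nonterminal, and `""` stands for $\lambda$):

```python
# Enumerate every word derivable within a few steps and confirm
# each one contains an even number of 0s.
rules = {"E": ["1E", "0O", ""], "O": ["1O", "0E"]}

def derive_all(start, depth):
    """All terminal words reachable from `start` within `depth` rewrite steps."""
    forms, words = {start}, set()
    for _ in range(depth):
        nxt = set()
        for f in forms:
            for i, ch in enumerate(f):
                if ch in rules:                    # rewrite the (only) nonterminal
                    for rhs in rules[ch]:
                        nxt.add(f[:i] + rhs + f[i + 1:])
                    break
        words |= {f for f in nxt if not any(c in rules for c in f)}
        forms = nxt
    return words

words = derive_all("E", 6)
```

Every enumerated word has an even number of 0s, and words like `0` are never produced.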
Exercise 10.2 (Context-Free Grammar for Palindromes)
Construct a context-free grammar for the language of palindromes over {a, b}, i.e., $L = \{w \in \{a, b\}^* \mid w = w^R\}$.
Solution
$G = (\{S\}, \{a, b\}, P, S)$ with $P = \{S \to \lambda,\ S \to a,\ S \to b,\ S \to aSa,\ S \to bSb\}$. (Note: $S \to a$ and $S \to b$ are for odd-length palindromes.)
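A quick enumeration confirms the grammar generates only palindromes (our own sketch; `""` stands for $\lambda$, and each sentential form contains at most one `S`):

```python
rhs_list = ["", "a", "b", "aSa", "bSb"]   # the five rules for S

def words_up_to(depth):
    """All terminal words derivable from S within `depth` rewrite steps."""
    forms, words = {"S"}, set()
    for _ in range(depth):
        nxt = set()
        for f in forms:
            i = f.find("S")
            if i >= 0:                     # rewrite the single occurrence of S
                for rhs in rhs_list:
                    nxt.add(f[:i] + rhs + f[i + 1:])
        words |= {f for f in nxt if "S" not in f}
        forms = nxt
    return words

pals = words_up_to(4)
```

Every word produced equals its own reversal, while non-palindromes such as `ab` never appear.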
Exercise 10.3 (Pumping Lemma for Context-Free Languages)
Prove that the language $L = \{a^nb^mc^nd^m \mid n, m \in \mathbb{N}\}$ is not context-free using the Pumping Lemma for Context-Free Languages.
Solution
Assume $L$ is context-free. Let $n_L$ be the pumping length. Choose $z = a^{n_L}b^{n_L}c^{n_L}d^{n_L}$. By the pumping lemma, $z = uvwxy$ with $|vwx| \leq n_L$ and $|vx| \geq 1$. Since $|vwx| \leq n_L$, the segment $vwx$ can contain symbols from at most two of the four blocks of identical symbols.
Case 1: $vx$ contains symbols from only one block (e.g., only $a$'s). Pumping $v$ and $x$ (i.e., forming $uv^2wx^2y$) will increase the count of only $a$'s, breaking the pattern. Case 2: $vwx$ contains symbols from two adjacent blocks (e.g., $a$'s and $b$'s). Pumping will increase the counts of $a$'s and/or $b$'s, but not $c$'s or $d$'s, breaking the pattern. Case 3: $vwx$ contains symbols from two non-adjacent blocks (e.g., $a$'s and $c$'s). This is impossible, since $vwx$ is a contiguous substring and $|vwx| \leq n_L$.
In all valid cases, pumping $v$ and $x$ leads to a string not in $L$, contradicting the pumping lemma. Therefore, $L$ is not context-free.
Exercise 10.4 (Pushdown Automaton Design)
Design a nondeterministic pushdown automaton (NPdA) for the language $L = \{w \in \{a, b\}^* \mid |w|_a = |w|_b\}$.
Solution
This is a classic example where an NPdA can be designed. The NPdA would use its stack to keep track of the imbalance between 'a's and 'b's. Let $M = (\{q\}, \{a, b\}, \{Z_0, A, B\}, \delta, q, Z_0)$.
- If an 'a' is read: if $B$ is on top of the stack, pop $B$; otherwise, push $A$.
- If a 'b' is read: if $A$ is on top of the stack, pop $A$; otherwise, push $B$.
- Accept if the stack is empty (only $Z_0$ remains, which is then popped by a $\lambda$-move) after reading the entire input.
This requires a bit more detail in the transition function to handle $Z_0$ and ensure correct pushing/popping. For example:
- $\delta(q, a, Z_0) \ni (q, AZ_0)$, $\delta(q, a, A) \ni (q, AA)$, $\delta(q, a, B) \ni (q, \lambda)$
- $\delta(q, b, Z_0) \ni (q, BZ_0)$, $\delta(q, b, B) \ni (q, BB)$, $\delta(q, b, A) \ni (q, \lambda)$
- $\delta(q, \lambda, Z_0) \ni (q, \lambda)$ (for acceptance if only $Z_0$ remains on the stack)
This NPdA accepts the language.
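On this language the transitions leave no real choice, so the automaton's behaviour can be simulated deterministically (a sketch with our own naming; the stack records the current surplus of $a$'s as $A$'s or of $b$'s as $B$'s above $Z_0$):

```python
def accepts_equal_ab(word):
    """Simulate the one-state NPdA for words with equally many a's and b's."""
    stack = ["Z0"]
    for c in word:
        if c not in "ab":
            return False
        opposite = "B" if c == "a" else "A"
        own = "A" if c == "a" else "B"
        if stack[-1] == opposite:
            stack.pop()            # cancel one surplus symbol of the other kind
        else:
            stack.append(own)      # record one more surplus symbol
    stack.pop()                    # lambda-move: pop Z0 (or a leftover symbol)
    return not stack               # accept iff the stack is now empty
```

For instance, `accepts_equal_ab("abba")` is `True`, while `accepts_equal_ab("aab")` is `False`.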