Real Computer Science begins where we almost stop reading ...: TOC Chapter 4

Saturday 26 January 2013

TOC Chapter 4

Chapter 4. Finite State Automata: Characterization, Properties, and Decidability

In this chapter, the equivalence between right linear grammars and finite state automata (FSA) is proved. The pumping lemma for regular sets is considered, and is used to give a method for showing a language not to be regular. We also consider some basic closure properties and decidability results.

Regular Grammars

Theorem 4.1

If a language L is accepted by a non-deterministic finite automaton, then L is generated by a right linear grammar, and conversely.

Proof Let L be a language accepted by a finite non-deterministic automaton M = (K, Σ, δ, q₀, F) where K = {q₀, . . ., q_n}. If w ∊ L, then w is obtained by concatenating the symbols read along a sequence of transitions that starts at q₀ and ends at a final state. Hence, each transition made by M while reading a symbol of w must correspond to a production of a right linear grammar G. The construction is as shown below:

G = ({S₀, S₁, . . ., S_n}, Σ, P, S₀)

where productions in P are

S_i → aS_j if δ(q_i, a) contains q_j for q_j ∉ F
S_i → aS_j and S_i → a if δ(q_i, a) contains q_j, q_j ∊ F.

We now prove that L(G) = L = L(M).

From the construction of P, one can see that S_i ⇒ aS_j if and only if δ(q_i, a) contains q_j, and S_i ⇒ a if and only if δ(q_i, a) contains a state of F. Hence, S₀ ⇒ a₁S₁ ⇒ a₁a₂S₂ ⇒ . . . ⇒ a₁ . . . a_n if and only if δ(q₀, a₁) contains q₁, δ(q₁, a₂) contains q₂, . . ., δ(q_{n-1}, a_n) contains q_n, where q_n ∊ F.

Hence, w ∊ L(G) if and only if w ∊ L(M).
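The construction above can be sketched in Python as follows. This is a minimal illustration, not part of the original text: the function names, the encoding of δ as a dictionary, and the example automaton (strings over {0, 1} ending in 01) are all assumptions made for the sketch. Each non-terminal S_i is simply named after its state q_i.

```python
def nfa_to_grammar(delta, finals):
    """Build right linear productions from an NFSA.

    delta maps (state, symbol) -> set of states. A production is a triple
    (head, symbol, tail); tail is None for productions of the form S_i -> a.
    """
    productions = set()
    for (q, a), targets in delta.items():
        for p in targets:
            productions.add((q, a, p))         # S_i -> a S_j
            if p in finals:
                productions.add((q, a, None))  # S_i -> a, because q_j is final
    return productions


def generate(productions, start, max_len):
    """Return all terminal words of length 1..max_len derivable from start."""
    words, frontier = set(), {("", start)}
    for _ in range(max_len):
        next_frontier = set()
        for prefix, head in frontier:
            for h, a, tail in productions:
                if h != head:
                    continue
                if tail is None:
                    words.add(prefix + a)
                else:
                    next_frontier.add((prefix + a, tail))
        frontier = next_frontier
    return words


def nfa_accepts(delta, start, finals, word):
    """Simulate the NFSA by tracking the set of reachable states."""
    current = {start}
    for a in word:
        current = set().union(*(delta.get((q, a), set()) for q in current))
    return bool(current & finals)


# Hypothetical example automaton: accepts words over {0, 1} that end in "01".
DELTA = {("q0", "0"): {"q0", "q1"}, ("q0", "1"): {"q0"}, ("q1", "1"): {"q2"}}
FINALS = {"q2"}
GRAMMAR = nfa_to_grammar(DELTA, FINALS)
```

Comparing `generate(GRAMMAR, "q0", n)` with `nfa_accepts` on every word of length at most n gives a quick sanity check of L(G) = L(M) for small n.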

Conversely, let G = (N, T, P, S) be a right linear grammar. An equivalent non-deterministic finite state automaton (NFSA) with ε-moves is constructed as below:

Let M = (K, T, δ, [S], [ε]), where

K = {[α]|α is S or suffix of some right-hand side of a production in P, the suffix need not be proper}.

The transition function δ is defined as follows:

δ([A], ε) = {[α]|A → α ∊ P}
If a ∊ T and α ∊ T* ∪ T*N, then δ([aα], a) = {[α]}. Clearly, [α] ∊ δ([S], w) if and only if S ⇒* xA ⇒ xyα, where A → yα ∊ P and xy = w. Since [ε] is the only final state, if w ∊ L(M), then α = ε. That is, M accepts w if and only if S ⇒* xA ⇒ w. Hence the converse follows.
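The grammar-to-automaton direction can be sketched in the same spirit. The code below is an illustrative assumption, not the text's own implementation: states are tuples of symbols standing for the bracketed suffixes [α], the empty tuple plays the role of [ε], and the empty string is used as the label of ε-moves. The example grammar S → abS | a is the one used in Problem 3 of this chapter.

```python
def grammar_to_nfa(productions, start):
    """productions maps a non-terminal to a list of right-hand sides (tuples)."""
    nonterminals = set(productions)
    states = {(start,)}
    for bodies in productions.values():
        for body in bodies:
            for i in range(len(body) + 1):
                states.add(body[i:])  # every suffix, proper or not, including ()
    delta = {}
    for state in states:
        if len(state) == 1 and state[0] in nonterminals:
            # delta([A], eps) = {[alpha] | A -> alpha in P}
            delta[(state, "")] = set(productions[state[0]])
        elif state and state[0] not in nonterminals:
            # delta([a alpha], a) = {[alpha]}
            delta[(state, state[0])] = {state[1:]}
    return delta, (start,), ()


def eps_closure(delta, states):
    """All states reachable from the given states by eps-moves only."""
    stack, seen = list(states), set(states)
    while stack:
        q = stack.pop()
        for p in delta.get((q, ""), set()):
            if p not in seen:
                seen.add(p)
                stack.append(p)
    return seen


def accepts(delta, start, final, word):
    current = eps_closure(delta, {start})
    for a in word:
        moved = set()
        for q in current:
            moved |= delta.get((q, a), set())
        current = eps_closure(delta, moved)
    return final in current


# Grammar from Problem 3: S -> abS | a, which generates (ab)*a.
P3 = {"S": [("a", "b", "S"), ("a",)]}
DELTA, START, FINAL = grammar_to_nfa(P3, "S")
```

With these definitions, `accepts(DELTA, START, FINAL, "aba")` holds, while `"ab"` and the empty word are rejected.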

4.2. Pumping Lemma for Regular Sets

A language is said to be regular if it is accepted by a finite automaton or, equivalently, generated by a regular grammar. The most commonly used technique for proving that a language is not regular is the pumping lemma. The lemma states a pumping property: every sufficiently long word of a regular language has a non-empty subword that can be pumped, that is, repeated any number of times with the resulting word still in the language. The property is only a necessary condition: the fact that a language satisfies the pumping lemma does not mean that it is regular.
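For reference, the pumping property is usually stated as below, followed by the standard example of its use. The constant p and the names x, y, z are conventional notation assumed here; the text above does not fix them.

```latex
\textbf{Pumping lemma.} If $L$ is regular, then there is a constant $p \ge 1$
such that every $w \in L$ with $|w| \ge p$ can be written as $w = xyz$ with
\[
  |xy| \le p, \qquad |y| \ge 1, \qquad xy^{i}z \in L \quad \text{for all } i \ge 0 .
\]

\textbf{Example.} Let $L = \{a^{n}b^{n} \mid n \ge 0\}$ and suppose $L$ is regular
with constant $p$. Take $w = a^{p}b^{p}$. Since $|xy| \le p$, the subword $y$ lies
inside the block of $a$'s, so $y = a^{k}$ for some $k \ge 1$. Then
\[
  xy^{0}z = a^{p-k}b^{p} \notin L ,
\]
which contradicts the lemma. Hence $L$ is not regular.
```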

4.3. Closure Properties

Theorem 4.3

The family of regular languages is closed under the following operations: (1) union (2) intersection (3) complementation (4) catenation (5) star, and (6) reversal.

Proof The six closure properties are proved below using either finite automata or regular grammars, which were shown to be equivalent in Theorem 4.1.

Union: Let L₁ and L₂ be two regular languages generated by two right linear grammars G₁ = (N₁, T₁, P₁, S₁) and G₂ = (N₂, T₂, P₂, S₂). Without loss of generality, let N₁ ∩ N₂ = ∅, and let S be a new non-terminal not in N₁ ∪ N₂. Then L₁ ∪ L₂ is generated by the right linear grammar G′ = (N₁ ∪ N₂ ∪ {S}, T₁ ∪ T₂, P₁ ∪ P₂ ∪ {S → S₁, S → S₂}, S). L(G′) = L(G₁) ∪ L(G₂) because the start symbol of G′ is S, from which we reach S₁ or S₂ using the rules S → S₁, S → S₂. After this step one can use only rules from P₁ or only rules from P₂, hence deriving a word in L₁ or in L₂.
Intersection: Let L₁, L₂ be any two regular languages accepted by two DFSAs M₁ = (K₁, Σ₁, δ₁, q₁, F₁) and M₂ = (K₂, Σ₂, δ₂, q₂, F₂). Then the DFSA M constructed as below accepts L₁ ∩ L₂. Let M = (K, Σ, δ, q₀, F) where K = K₁ × K₂, Σ = Σ₁ ∩ Σ₂, q₀ = (q₁, q₂), F = F₁ × F₂, and δ: K × Σ → K is defined by δ((p₁, p₂), a) = (δ₁(p₁, a), δ₂(p₂, a)).

One can see that for each input word w, M runs M₁ and M₂ in parallel, starting from q₁ and q₂, respectively. Having finished reading the input, M accepts if and only if both M₁ and M₂ accept. Hence, L(M) = L(M₁) ∩ L(M₂).
Complementation: Let L be a regular language accepted by DFSA M = (K, Σ, δ, q₀, F). Then the complement of L is accepted by the DFSA M^c = (K, Σ, δ, q₀, K − F), because M^c follows the same path as M on every input and accepts exactly when M does not.
Concatenation: We prove this property using regular grammars. Let L₁, L₂, G₁, and G₂ be defined as in the proof for union in this theorem. Then the type 3 grammar G constructed as below satisfies the requirement that L(G) = L(G₁)L(G₂). G = (N₁ ∪ N₂, T₁ ∪ T₂, P₂ ∪ P, S₁) where P = {A → aB | A → aB ∊ P₁} ∪ {A → aS₂ | A → a ∊ P₁}. Clearly, L(G) = L(G₁)L(G₂), because any derivation starting from S₁ in G₁ derives a word w ∊ L₁, and for G the corresponding derivation gives S₁ ⇒* wS₂. Hence, if S₂ ⇒* w′ by G₂, then S₁ ⇒* ww′ by G.
Catenation closure: Here also we prove the closure using regular grammars. Let L₁ be a regular language generated by G₁ = (N₁, T₁, P₁, S₁). Consider the type 3 grammar G = (N₁ ∪ {S₀}, T₁, {S₀ → ε, S₀ → S₁} ∪ {A → aS₁ | A → a ∊ P₁} ∪ P₁, S₀). Clearly, G generates L₁*.
Reversal: The proof is given using the NFSA model. Let L be a language accepted by an NFSA with ε-transitions that has exactly one final state.

(Exercise: For any NFSA, there exists an equivalent NFSA with ε-transitions with exactly one final state.) Let it be M = (K, Σ, δ, q₀, {q_f}). Then the reversal automaton is M′ = (K, Σ, δ′, q_f, {q₀}), where δ′(q, a) contains p if and only if δ(p, a) contains q, for any p, q ∊ K and a ∊ Σ ∪ {ε}. One can see that w ∊ L(M) if and only if w^R ∊ L(M′), since each transition of M′ traverses a transition of M backward.
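The intersection and complementation constructions from the proof above lend themselves to a short sketch. The encoding below (a DFSA as a triple of transition dictionary, start state, and set of final states) and the two example automata are assumptions chosen for illustration; both automata are taken over the same alphabet so that every transition is defined.

```python
from itertools import product


def run(dfa, word):
    """Return True if the DFSA accepts word."""
    delta, start, finals = dfa
    state = start
    for a in word:
        state = delta[(state, a)]
    return state in finals


def intersection(dfa1, dfa2, alphabet):
    """Product automaton: K = K1 x K2, F = F1 x F2, componentwise transitions."""
    d1, s1, f1 = dfa1
    d2, s2, f2 = dfa2
    states1 = {q for q, _ in d1}
    states2 = {q for q, _ in d2}
    delta = {
        ((p1, p2), a): (d1[(p1, a)], d2[(p2, a)])
        for p1 in states1 for p2 in states2 for a in alphabet
    }
    return delta, (s1, s2), set(product(f1, f2))


def complement(dfa, states):
    """Same automaton with final states K - F; relies on delta being total."""
    delta, start, finals = dfa
    return delta, start, set(states) - set(finals)


# Hypothetical examples over {a, b}:
# EVEN_A accepts words with an even number of a's; ENDS_B accepts words ending in b.
EVEN_A = ({("e", "a"): "o", ("e", "b"): "e", ("o", "a"): "e", ("o", "b"): "o"}, "e", {"e"})
ENDS_B = ({("n", "a"): "n", ("n", "b"): "y", ("y", "a"): "n", ("y", "b"): "y"}, "n", {"y"})
BOTH = intersection(EVEN_A, ENDS_B, "ab")
ODD_A = complement(EVEN_A, {"e", "o"})
```

Here `run(BOTH, "aab")` holds because the word has two a's and ends in b, while `run(ODD_A, "aab")` does not.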

We prove that regular languages are also closed under homomorphism and right quotient.

4.4. Decidability Theorems

In this section, we address the basic decidability issues for regular languages. These are the membership problem, the emptiness problem, and the equivalence problem.

Theorem 4.7

Given a regular language L over T and w ∊ T*, there exists an algorithm for determining whether or not w is in L.

Proof Let L be accepted by a DFSA M. Running M on input w, with one transition per symbol, shows whether or not w is accepted by M. The complexity of this algorithm is O(n), where |w| = n. Hence, the membership problem for regular sets can be solved in linear time.
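The linear-time bound is visible in a direct simulation: one table lookup per input symbol. The sketch below is an illustration under assumed names; the example DFSA (binary numerals, most significant bit first, whose value is divisible by 3) is hypothetical and not taken from the text.

```python
def accepts(delta, start, finals, word):
    """Decide membership with exactly len(word) transition lookups."""
    state = start
    for symbol in word:
        state = delta[(state, symbol)]
    return state in finals


# Hypothetical example: state r is the value read so far modulo 3.
# Reading bit b maps the value v to 2*v + b, so r moves to (2*r + b) mod 3.
DIV3_DELTA = {(r, b): (2 * r + int(b)) % 3 for r in range(3) for b in "01"}
DIV3_START = 0
DIV3_FINALS = {0}
```

For instance, `accepts(DIV3_DELTA, DIV3_START, DIV3_FINALS, "110")` checks the numeral for 6.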

Problems and Solutions

Let Σ be an alphabet. Define I_Σ to be the collection of all infinite languages over Σ. Note that I_Σ does not include any finite language over Σ. Prove or give counter examples to the following:

I_Σ is closed under union
I_Σ is closed under intersection

Solution.

I_Σ is closed under union. Let L₁ and L₂ be in I_Σ, so L₁ and L₂ are infinite sets. L₁ ∪ L₂ = {x | x ∊ L₁ or x ∊ L₂} includes L₁ and also L₂. Hence L₁ ∪ L₂ is infinite.
I_Σ is not closed under intersection. Consider Σ = {a}.

L₁ = {a²ⁿ|n ≥ 1} is an infinite set.

L₂ = {a^p|p is a prime} is an infinite set.

L₁, L₂ ∊ I_Σ

L₁ ∩ L₂ = {a²}, because 2 is the only even prime. This is a finite set and hence it is not in I_Σ.
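Because Σ = {a}, a word is determined by its length, so the counterexample can be checked on lengths alone. The following is a bounded sanity check, not a proof; the bound and the helper name are arbitrary choices for the sketch.

```python
def is_prime(k):
    """Trial division; adequate for the small bound used here."""
    if k < 2:
        return False
    return all(k % d for d in range(2, int(k ** 0.5) + 1))


BOUND = 1000  # arbitrary cut-off for the check

# Lengths of words in L1 = {a^(2n) | n >= 1} and L2 = {a^p | p prime}, up to BOUND.
even_lengths = {2 * n for n in range(1, BOUND // 2 + 1)}
prime_lengths = {p for p in range(BOUND + 1) if is_prime(p)}

common_lengths = even_lengths & prime_lengths
```

Both length sets keep growing with the bound, while `common_lengths` stays at the single length 2.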

Construct regular grammar equivalent to the following NFSA (Figure 4.2).

Figure 4.2. State diagram for Problem 2

Solution.

Let G = (N, T, P, S₀) where N = {S₀, S₁, S₂}, T = {0, 1}. P consists of the following rules:

S₀ → 0S₁, S₁ → 0S₂ | 0S₀ | 0, S₂ → 1S₁ | 1S₂ | 1.

Construct an equivalent NFSA for the following grammar S → abS|a.

Solution.

Figure 4.3. Solution to Problem 3

Exercises

1.	Let Σ be an alphabet. Define I_Σ to be the collection of all infinite languages over Σ. Note that I_Σ does not include any finite language over Σ. Prove or give counter examples to the following: (a) I_Σ is closed under complementation. (b) I_Σ is closed under concatenation. (c) I_Σ is closed under Kleene closure.
2.	If a collection of languages is closed under intersection, does it follow that it is closed under union? Prove or give a counter example.
3.	If L is accepted by a NFSA, is it necessarily true that all subsets of L are accepted by a NFSA? Prove or give counter examples.
4.	Let N_Σ denote the collection of languages such that no L ∊ N_Σ is accepted by a NFSA. Prove or give counter examples to the following: (a) N_Σ is closed under union. (b) N_Σ is closed under catenation. (c) N_Σ is closed under Kleene closure.
5.	We have shown that the union of two regular languages is regular. Is the union of a collection of regular languages always regular? Justify your answer.
6.	Let M be a DFSA accepting L₁ and G be a regular grammar generating L₂. Using only M and G show that L₁ ∩ L₂ is regular.
7.	Let P = {x | |x| is prime} and let I(L) be defined by I(L) = L ∩ P. Let D_Σ denote the collection of all languages recognized by a DFSA. (a) Show that D_Σ is not closed under I. (b) Prove or disprove: N_Σ is closed under I.
8.	Given any alphabet Σ and a DFSA M, show that it is decidable whether M accepts even length strings.
9.	Given any alphabet Σ and regular expressions r₁ and r₂ over Σ, show that it is decidable whether r₁ and r₂ describe any common strings.
10.	Given any alphabet Σ and a regular expression r₁ over Σ, show that it is decidable whether there is a DFSA with less than 31 states that accepts the language described by r₁.
11.	Give a regular grammar for: (a + b)c(d + (ab)) (a + b)a(a + b)
12.	Construct a regular grammar equivalent to each of the following NFSA (Figure 4.4). Figure 4.4. State diagrams for Exercise 12
13.	Construct an equivalent NFSA for each of the following grammars: (a) S → abS₁, S₁ → abS₁ | S₂, S₂ → a. (b) S → abA, A → baB, B → aA | bb.
