RIPAL: Responsive and Intuitive Parsing for the Analysis of Language

Overview

An ε-nondeterministic finite automaton (ε-NFA for short) is another tool that can be used to recognize strings as part of a regular language.

Example of an ε-NFA

The following is an example of an ε-NFA:

Example ε-NFA. This ε-NFA containers 6 states - s0, s1, s2, s3, s4 and s5. s0 transitions to s1 on input string a. s1 transitions to s2 and s4 on input symbol ε s2 transitions to s3 on input symbol b. s4 transitions to s5 on input symbol b.

It contains 6 states and 5 state transitions.

State s₀ is the initial state, while s₃ and s₅ are accepting states.

State s₁ illustrates a construction that is not allowed in an NFA but is allowed in an ε-NFA: transitions with ε as input. These ε transitions do not consume an input symbol.

This ε-NFA accepts the language represented by the following regular expression:

a(b ∪ c)

Anatomy

In general, an ε-NFA consists of:

A set of states, S, typically labelled s₀ through s_{n - 1}
1. One state is the initial state, s_i - typically s₀ by convention
2. Any number of states can belong to the set of final states, S_f
  1. S_f ⊆ S
A set of transitions, T, between states, consisting of a start state s_s ∈ S, an end state s_f ∈ S and a transition symbol t ∈ Z ∪ {ε}
1. This differs from NFAs which don't allow ε as an input symbol for transitions

In an ε-NFA, we have essentially no inherent structural restrictions, with the following key notes about its construction:

T = T[s_i, t] = {s_f₁, s_f₂, ..., s_{f_n}}, a set of states
t ∈ Z ∪ {ε}, a single symbol in our alphabet or the empty symbol

The ε-NFA parsing algorithm

To determine whether s = {σ₁σ₂...σ_n} belongs to the language recognized by ε-NFA with states S, transitions T[s_i, t], initial state s_i:

Set S_current = {s_i} While some state s_{ε_next} is reachable through ε transition from some state s_{ε_current} in S_current such that s_{ε_next} ∉ S_current Add s_{ε_next} to S_current For i = 1 to n symbol = σ_i S_next = {} For each s_current in S_current s_next = T[s_current, symbol] If s_next exists Add s_next to S_next While some state s_{ε_next} is reachable through ε transition from some state s_{ε_current} in S_next such that s_{ε_next} ∉ S_current Add s_{ε_next} to S_next If S_next is empty Reject S_current = S_next End for If S_current contains an accepting state Accept Otherwise Reject

Informally, we perform the NFA parsing algorithm but in addition, whenever a state is added to a set we are considering, we also add all states reachable directly or indirectly through ε transitions from it.

Note that this parsing algorithm is becoming somewhat complicated and is not often used in practice. We will later show how to reduce ε-NFAs into NFAs and then DFAs to simplify the parsing algorithm.

Parsing examples

Example

For s = a

States	Input symbol	Action
{s₀}	a	Transition to {s₁, s₂, s₄} (since s₂ and s₄ are reachable via ε transition)
{s₁, s₂, s₄}	$ (end of string)	Reject (since S_current does not contain an accepting state)