RIPAL: Responsive and Intuitive Parsing for the Analysis of Language

Background

Finite languages are easy to understand, specify and recognize, but they're not very expressive.

In practice, many useful languages are not limited to a fixed number of strings. As a result, we need a way to represent infinite languages.

Language set operators

To discuss regular languages, we need to first define the following set operators:

Concatenation: if R₁ = {s_1,1, s_1,2, ...} and R₂ = {s_2,1, s_2,2, ...} then R = R₁R₂ = {s_1,1s_2,1, s_1,2s_2,1, s_1,1s_2,2, ...}
Informally, the concatenation of two sets contains every string formed by taking one element from each set and joining them in the order of the operands.
Union: if R₁ = {s_1,1, s_1,2, ...} and R₂ = {s_2,1, s_2,2, ...} then R = R₁ ∪ R₂ = {s_1,1, s_2,1, s_1,2, s_2,2, ...}
Informally, the union of two sets contains every element that appears in either set.
Kleene closure: if R₁ = {s₁, s₂, ...} then R = R₁^* = {ε, s₁, s₂, s₁s₁, s₁s₂, s₂s₁, s₂s₂, s₁s₁s₁, ...}
Informally, the Kleene closure of a set contains every string formed by concatenating zero or more elements of that set, with repetition allowed; choosing zero elements gives the empty string ε.
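
As a rough illustration, the three operators can be sketched on small finite sets of strings in Python. The function names (concat, union, kleene_bounded) are made up for this example. Because a true Kleene closure is infinite, the sketch only builds strings from at most max_reps elements, which is an assumption of the example rather than part of the definition.

```python
from itertools import product

def concat(r1, r2):
    """Concatenation: each string of r1 followed by each string of r2."""
    return {a + b for a, b in product(r1, r2)}

def union(r1, r2):
    """Union: every string that appears in either set."""
    return r1 | r2

def kleene_bounded(r, max_reps):
    """Finite slice of r*: strings built from at most max_reps elements of r."""
    result = {""}  # zero elements gives the empty string (epsilon)
    layer = {""}
    for _ in range(max_reps):
        layer = concat(layer, r)  # strings built from one more element
        result |= layer
    return result

r1 = {"a", "b"}
r2 = {"c", "d"}
print(sorted(concat(r1, r2)))
print(sorted(union(r1, r2)))
print(sorted(kleene_bounded(r1, 2), key=lambda s: (len(s), s)))
```

Note that concat(r1, r2) and concat(r2, r1) generally differ, which is why the order of operands matters for concatenation but not for union.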

Definition

For any alphabet Σ, the regular languages over Σ are defined inductively:

The empty language, L = {}, is regular
The language containing the empty string, L = {ε}, is regular
For any σ ∈ Σ, the language containing a fixed single-symbol string, L = {σ}, is regular
For any two regular languages R₁ and R₂, the concatenation of those languages, L = R₁R₂, is regular
For any two regular languages R₁ and R₂, the union of those languages, L = R₁ ∪ R₂, is regular
For any regular language R₁, the Kleene closure of that language, L = R₁^*, is regular
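
The six rules can be mirrored by a small recursive membership test, sketched here in Python. The tuple encoding ("empty", "eps", "sym", "cat", "union", "star") and the function name matches are assumptions made for this illustration. The sketch is deliberately naive: it tries every way of splitting the string and is not an efficient recognizer.

```python
def matches(lang, s):
    """Return True if string s belongs to the language described by lang."""
    kind = lang[0]
    if kind == "empty":   # L = {}
        return False
    if kind == "eps":     # L = {epsilon}
        return s == ""
    if kind == "sym":     # L = {sigma}
        return s == lang[1]
    if kind == "cat":     # L = R1 R2: try every split point
        return any(matches(lang[1], s[:i]) and matches(lang[2], s[i:])
                   for i in range(len(s) + 1))
    if kind == "union":   # L = R1 U R2
        return matches(lang[1], s) or matches(lang[2], s)
    if kind == "star":    # L = R1*
        if s == "":
            return True
        # Peel off a non-empty prefix so the recursion always shrinks s.
        return any(matches(lang[1], s[:i]) and matches(lang, s[i:])
                   for i in range(1, len(s) + 1))
    raise ValueError(f"unknown language kind: {kind!r}")

# (a U b)* a: strings over {a, b} that end in "a"
a, b = ("sym", "a"), ("sym", "b")
ends_in_a = ("cat", ("star", ("union", a, b)), a)

for word in ["a", "abba", "", "abb"]:
    print(repr(word), matches(ends_in_a, word))
```

Each branch corresponds to one rule of the definition, so any language built from the rules can be written as a nested tuple and tested this way.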

Regular language representations

There are two straightforward ways of specifying a regular language:

A regular expression
A finite automaton

These constructions will be explained in detail on subsequent pages.
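
As a small preview of the first representation, here is a sketch using Python's standard re module. The pattern and the helper name in_language are chosen only for this example. Python's regex syntax also offers features that go beyond regular languages (such as backreferences); this pattern uses only union, concatenation and Kleene closure.

```python
import re

# (a|b)*a: strings over {a, b} that end in "a"
pattern = re.compile(r"(a|b)*a")

def in_language(s):
    """True if the whole string s matches the pattern."""
    return pattern.fullmatch(s) is not None

for s in ["a", "abba", "", "abb"]:
    print(repr(s), in_language(s))
```

fullmatch is used rather than search so that the entire string, not just a substring, must belong to the language.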
