RIPAL: Responsive and Intuitive Parsing for the Analysis of Language

Background

We've seen the motivations for constructing an LR(0) parse table from a context-free grammar. In this section, we will provide motivations for the underlying construct of the production closure.

Recall

In our LR(0) parsing procedure:

When performing a shift action, we add one terminal and one state to the parse stack
When performing a reduce action, we remove a number of terminals and states from the parse stack equal to the number of symbols on the right-hand side of the relevant production

Motivating example

Example

Observe the following grammar:

S → A
A → B
B → C
C → c

This grammar has the following augmented grammar:

S' → S $
S → A
A → B
B → C
C → c

It has the following LR(0) parse table:

	c	$	S	A	B	C	S'
state₁	shift₂		goto₆	goto₅	goto₄	goto₃
state₂	reduce₅	reduce₅
state₃	reduce₄	reduce₄
state₄	reduce₃	reduce₃
state₅	reduce₂	reduce₂
state₆		accept

Parsing input string c:

Input queue	Parse stack	Action
c	1	Apply action of shift₂ which corresponds to state₁ and c in our parse table
	1 c 2	Apply action of reduce₅ which corresponds to state₂ and $ in our parse table
	1 C	Apply action of goto₃ which corresponds to state₁ and C in our parse table
	1 C 3	Apply action of reduce₄ which corresponds to state₃ and $ in our parse table
	1 B	Apply action of goto₄ which corresponds to state₁ and B in our parse table
	1 B 4	Apply action of reduce₃ which corresponds to state₄ and $ in our parse table
	1 A	Apply action of goto₅ which corresponds to state₁ and A in our parse table
	1 A 5	Apply action of reduce₂ which corresponds to state₅ and $ in our parse table
	1 S	Apply action of goto₆ which corresponds to state₁ and S in our parse table
	1 S 6	Accept, since this action corresponds to state₆ and $ in our parse table

In the above example, observe how frequently the parser is in state₁.

Since we are chaining productions that don't require the processing of additional terminals for each step, such as:

S → A
A → B
B → C

we are able to apply a goto action followed by a reduce action multiple times in sequence. This parsing process is centered around state₁.

Motivating example

Example

Observe the following grammar:

S → A
A → B
B → C D
C → c
D → d

This grammar has the following augmented grammar:

S' → S
S → A
A → B
B → C D
C → c
D → d

It has the following LR(0) parse table:

	c	d	$	S	A	B	C	D	S'
state₁	shift₂			goto₈	goto₇	goto₆	goto₃
state₂	reduce₅	reduce₅	reduce₅
state₃		shift₄						goto₅
state₄	reduce₆	reduce₆	reduce₆
state₅	reduce₄	reduce₄	reduce₄
state₆	reduce₃	reduce₃	reduce₃
state₇	reduce₂	reduce₂	reduce₂
state₈			accept

Parsing input string cd:

Input queue	Parse stack	Action
c d	1	Apply action of shift₂ which corresponds to state₁ and c in our parse table
d	1 c 2	Apply action of reduce₅ which corresponds to state₂ and d in our parse table
d	1 C	Apply action of goto₃ which corresponds to state₁ and C in our parse table
d	1 C 3	Apply action of shift₄ which corresponds to state₃ and d in our parse table
	1 C 3 d 4	Apply action of reduce₆ which corresponds to state₄ and $ in our parse table
	1 C 3 D	Apply action of goto₅ which corresponds to state₃ and D in our parse table
	1 C 3 D 5	Apply action of reduce₄ which corresponds to state₅ and $ in our parse table
	1 B	Apply action of goto₆ which corresponds to state₁ and B in our parse table
	1 B 6	Apply action of reduce₃ which corresponds to state₆ and $ in our parse table
	1 A	Apply action of goto₇ which correspoinds to state₁ and A in our parse table
	1 A 7	Apply action of reduce₂ which corresponds to state₇ and $ in our parse table
	1 S	Apply action of goto₈ which corresponds to state₁ and S in our parse table
	1 S 8	Accept, since this action corresponds to state₈ and $ in our parse table

Observe the apperance of state₁ in the above example.

When processing terminal input symbols a and b, the parser leaves state₁ until CD is reduced to B.

When applying the reduction of B to A and A to S, the parser is able to use the repeated goto and reduce actions to stay centered around state₁ as observed in the previous example.

The closure concept

In considering the set of productions to begin processing within a particular LR(0) parse state, we need a way to expand a given set as was shown in the above examples.

In particular, we can append productions to the set to be considered if they are reachable from the existing set without processing additional terminals. We will call this extended set of production rules the closure of the original set.

Conclusion

Now, we've seen motivation for the closure concept when constructing an LR(0) parse table. Next, we will introduce a special symbol to represent where we are in production processing so that we can handle closures appropriately.

RIPAL: Responsive and Intuitive Parsing for the Analysis of Language

Pages

LR(0) closure motivations

Background

Recall

Motivating example

Motivating example

The closure concept

Conclusion