Construction of Syntax Trees

This document discusses syntax trees and directed acyclic graphs (DAGs) used to represent the structure of expressions in a programming language. It describes how syntax trees and DAGs are constructed bottom-up using syntax-directed definitions. Nodes in the trees and graphs represent operators and operands, with interior nodes for operators pointing to child nodes for operands. The document provides examples of constructing the syntax tree and DAG for an expression like "a - 4 + c".

Uploaded by

Shammer Sha

CONSTRUCTION OF SYNTAX TREES

**************************************

Introduction

A syntax tree is a condensed form of parse tree useful for representing language constructs. Syntax trees are created with the
help of syntax-directed definitions. The use of syntax trees as an intermediate form helps to dissociate translation from
parsing. Translation routines that are invoked during parsing must operate under two kinds of restrictions. First, a grammar that
is suited for parsing may not reflect the natural hierarchical structure of the constructs in the language. Second, the parsing
method constrains the order in which nodes in a parse tree are considered. This order may not match the order in which
information about a construct becomes available.

Syntax Trees

In a syntax tree, operators and keywords do not appear as leaves, but rather are associated with the interior node that
would be the parent of those leaves in the parse tree. Another simplification found in syntax trees is that chains of single
productions may be collapsed (see Figure 1 and Figure 2). Syntax-directed translation can be based on syntax trees as well
as on parse trees. The approach is the same in each case; we attach attributes to the nodes as in a parse tree.

Figure 1
Figure 2

Constructing Syntax Trees for Expressions

The construction of a syntax tree for an expression is similar to the translation of the expression into postfix form. We
construct subtrees for the subexpressions by creating a node for each operator and operand. The children of an operator
node are the roots of the subtrees representing the subexpressions constituting the operands of that operator.

Each node in a syntax tree can be implemented as a record with several fields. In the node for an operator, one field
identifies the operator and the remaining fields contain pointers to the nodes for the operands. The operator is often called the
label of the node. When used for translation, the nodes in a syntax tree may have additional fields to hold the values of
attributes attached to the node. Usually there are a number of functions defined to create the nodes of syntax trees. Each
function returns a pointer to a newly created node.

Consider, for example, the expression a - 4 + c. Here we make use of the following functions to create the nodes of syntax
trees for expressions with binary operators.

a) mknode(op, left, right) creates an operator node with label op and two fields containing pointers to left and right.

b) mkleaf(id, entry) creates an identifier node with label id and a field containing entry, a pointer to the symbol-table
entry for the identifier.

c) mkleaf(num, val) creates a number node with label num and a field containing val, the value of the number.

The following sequence of function calls creates the syntax tree for the expression a - 4 + c. In this sequence, p1, p2, p3,
p4, and p5 are pointers to nodes, and entrya and entryc are pointers to the symbol-table entries for identifiers a and
c, respectively.

i) p1 := mkleaf(id, entrya);
ii) p2 := mkleaf(num, 4);
iii) p3 := mknode('-', p1, p2);
iv) p4 := mkleaf(id, entryc);
v) p5 := mknode('+', p3, p4);

The tree is constructed bottom up. The function calls mkleaf(id, entrya) and mkleaf(num, 4) construct the leaves for a and
4; the pointers to these nodes are saved using p1 and p2. The call mknode('-', p1, p2) then constructs the interior node
with the leaves for a and 4 as children. After two more steps, p5 is left pointing to the root.
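
The five calls above can be sketched in Python as follows. This is only an illustration under stated assumptions: the Node record, its field names, and the use of plain strings in place of symbol-table entries are not part of the original notes. The postfix helper is added to show the similarity to postfix translation mentioned earlier.

```python
from dataclasses import dataclass
from typing import Optional, Union


@dataclass
class Node:
    label: str                           # operator, "id", or "num"
    left: Optional["Node"] = None        # operand pointers (interior nodes only)
    right: Optional["Node"] = None
    value: Union[str, int, None] = None  # symbol-table entry or number (leaves only)


def mknode(op, left, right):
    """Create an operator node with label op and pointers to its two operands."""
    return Node(op, left, right)


def mkleaf(label, value):
    """Create a leaf; value stands in for a symbol-table entry (id) or a number (num)."""
    return Node(label, value=value)


def postfix(n):
    """Postorder walk of the tree, which yields the postfix form of the expression."""
    if n.label in ("id", "num"):
        return str(n.value)
    return f"{postfix(n.left)} {postfix(n.right)} {n.label}"


# Hypothetical stand-ins for the symbol-table entries of a and c.
entrya, entryc = "a", "c"

p1 = mkleaf("id", entrya)
p2 = mkleaf("num", 4)
p3 = mknode("-", p1, p2)
p4 = mkleaf("id", entryc)
p5 = mknode("+", p3, p4)   # p5 points to the root

print(postfix(p5))  # a 4 - c +
```

Because subtraction and addition associate to the left, the node for '-' ends up as the left child of the root '+'.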

Figure 3

A Syntax-Directed Definition for Constructing Syntax Trees

Figure 4 contains an S-attributed definition for constructing a syntax tree for an expression containing the operators + and
-. It uses the underlying productions of the grammar to schedule the calls of the functions mknode and mkleaf to construct
the tree. The synthesized attribute nptr for E and T keeps track of the pointers returned by the function calls.

Figure 4

An annotated parse tree depicting the construction of a syntax tree for the expression a - 4 + c is shown in Figure
5. The parse tree is shown dotted. The parse-tree nodes labeled by the nonterminals E and T use the synthesized attribute
nptr to hold a pointer to the syntax-tree node for the expression represented by the nonterminal.

Figure 5

The semantic rules associated with the productions T ---> id and T ---> num define attribute T.nptr to be a pointer to a new
leaf for an identifier and a number, respectively. Attributes id.entry and num.val are the lexical values assumed to be returned
by the lexical analyzer with the tokens id and num.

In Figure 5, when an expression E is a single term, corresponding to a use of the production E ---> T, the attribute E.nptr gets
the value of T.nptr. When the semantic rule E.nptr := mknode('-', E1.nptr, T.nptr) associated with the production E --->
E1 - T is invoked, previous rules have set E1.nptr and T.nptr to be pointers to the leaves for a and 4, respectively.
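
The way the productions schedule the calls to mknode and mkleaf can be sketched with a small hand-written parser. This is a sketch under assumptions, not the definition of Figure 4 itself: it covers only the productions named in the text (E ---> E1 + T, E ---> E1 - T, E ---> T, T ---> id, T ---> num), it replaces the left recursion in E by a loop, and the tokenizer, the tuple representation of nodes, and the function names tokenize, parse_expr, parse_term, and build are hypothetical.

```python
import re


def mknode(op, left, right):
    return (op, left, right)


def mkleaf(label, value):
    return (label, value)


def tokenize(text):
    """Return (token, lexical value) pairs, as a lexical analyzer would supply them."""
    tokens = []
    for lexeme in re.findall(r"\d+|[A-Za-z_]\w*|[-+]", text):
        if lexeme.isdigit():
            tokens.append(("num", int(lexeme)))   # num.val
        elif lexeme in ("+", "-"):
            tokens.append((lexeme, None))
        else:
            tokens.append(("id", lexeme))         # id.entry (the name stands in for the entry)
    tokens.append(("$", None))                    # end marker
    return tokens


def parse_term(tokens, pos):
    """T ---> id | num: T.nptr is a pointer to a new leaf."""
    kind, lexval = tokens[pos]
    if kind in ("id", "num"):
        return mkleaf(kind, lexval), pos + 1
    raise SyntaxError(f"unexpected token {kind!r}")


def parse_expr(tokens, pos=0):
    """E ---> T, then repeatedly E ---> E1 op T with E.nptr := mknode(op, E1.nptr, T.nptr)."""
    nptr, pos = parse_term(tokens, pos)           # E.nptr := T.nptr
    while tokens[pos][0] in ("+", "-"):
        op = tokens[pos][0]
        t_nptr, pos = parse_term(tokens, pos + 1)
        nptr = mknode(op, nptr, t_nptr)
    return nptr, pos


def build(text):
    tokens = tokenize(text)
    nptr, pos = parse_expr(tokens)
    if tokens[pos][0] != "$":
        raise SyntaxError("trailing input")
    return nptr


print(build("a - 4 + c"))
```

Each return value plays the role of the synthesized attribute nptr: it is computed only from the nptr values of the symbols on the right side of the production being applied.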

Directed Acyclic Graphs for Expressions

A directed acyclic graph (dag) for an expression identifies the common subexpressions in the expression. Like a syntax
tree, a dag has a node for every subexpression of the expression; an interior node represents an operator and its children
represent its operands. The difference is that a node in a dag representing a common subexpression has more than one
"parent"; in a syntax tree, the common subexpression would be represented as a duplicated subtree.

Figure 6 shows a dag for the expression a + a * ( b - c ) + ( b - c ) * d.

Figure 6

The leaf for a has two parents because a is common to the two subexpressions a and a * ( b - c ). Likewise, both
occurrences of the common subexpression b - c are represented by the same node, which also has two parents.

The syntax-directed definition of Figure 4 will construct a dag instead of a syntax tree if we modify the operations for
constructing nodes. A dag is obtained if the function constructing a node first checks to see whether an identical node already
exists. For example, before constructing a new node with label op and fields with pointers to left and right, mknode(op, left,
right) can check whether such a node has already been constructed. If so, mknode(op, left, right) can return a pointer
to the previously constructed node. The leaf-constructing function mkleaf can behave similarly.

The sequence of instructions for constructing the dag in Figure 6 is listed below. These calls construct the
dag provided that mknode and mkleaf create new nodes only when necessary, returning pointers to existing nodes with the
correct label and children whenever possible.

(1) p1 := mkleaf(id, a);
(2) p2 := mkleaf(id, a);
(3) p3 := mkleaf(id, b);
(4) p4 := mkleaf(id, c);
(5) p5 := mknode('-', p3, p4);
(6) p6 := mknode('*', p2, p5);
(7) p7 := mknode('+', p1, p6);
(8) p8 := mkleaf(id, b);
(9) p9 := mkleaf(id, c);
(10) p10 := mknode('-', p8, p9);
(11) p11 := mkleaf(id, d);
(12) p12 := mknode('*', p10, p11);
(13) p13 := mknode('+', p7, p12);

When the call mkleaf(id, a) is repeated on line 2, the node constructed by the previous call mkleaf(id, a) is returned, so
p1 = p2. Similarly, the nodes returned on lines 8 and 9 are the same as those returned on lines 3 and 4, respectively. Hence, the
node returned on line 10 must be the same one constructed by the call of mknode on line 5.
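
The thirteen calls above can be replayed in Python to check the sharing they describe. This is a minimal sketch, assuming a dictionary named existing (hypothetical) that maps the label and children of every node built so far to that node; the notes do not prescribe this lookup structure, and identifier names stand in for symbol-table entries.

```python
class Node:
    def __init__(self, label, left=None, right=None, value=None):
        self.label, self.left, self.right, self.value = label, left, right, value


existing = {}   # (label, value) or (op, id(left), id(right)) -> node already built


def mkleaf(label, value):
    """Return the existing leaf with this label and value, or create it."""
    key = (label, value)
    if key not in existing:
        existing[key] = Node(label, value=value)
    return existing[key]


def mknode(op, left, right):
    """Return the existing node with this label and these children, or create it."""
    key = (op, id(left), id(right))   # children are identified by object identity
    if key not in existing:
        existing[key] = Node(op, left, right)
    return existing[key]


# a + a * (b - c) + (b - c) * d
p1 = mkleaf("id", "a")
p2 = mkleaf("id", "a")
p3 = mkleaf("id", "b")
p4 = mkleaf("id", "c")
p5 = mknode("-", p3, p4)
p6 = mknode("*", p2, p5)
p7 = mknode("+", p1, p6)
p8 = mkleaf("id", "b")
p9 = mkleaf("id", "c")
p10 = mknode("-", p8, p9)
p11 = mkleaf("id", "d")
p12 = mknode("*", p10, p11)
p13 = mknode("+", p7, p12)

print(p1 is p2, p8 is p3, p9 is p4, p10 is p5)  # True True True True
print(len(existing))                            # 9 distinct nodes from 13 calls
```

The dag has four leaves (a, b, c, d) and five interior nodes, so only nine nodes are created.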

In many applications, nodes are implemented as records stored in an array, as in Figure 7. In the figure, each record
has a label field that determines the nature of the node. We can refer to a node by its index in the array. The integer index of a
node is often called its value number. For example, using value numbers, we can say node 3 has label +, its left child is node
1, and its right child is node 2. The following algorithm can be used to create nodes for a dag representation of an expression.

Figure 7

Algorithm: Value-number method for constructing a node in a dag.

Suppose that nodes are stored in an array and that each node is referred to by its value number. Let the signature of an
operator node be a triple <op, l, r> consisting of its label op, left child l, and right child r.

Input. Label op, node l, and node r.

Output. A node with signature <op, l, r>.

Method. Search the array for a node m with label op, left child l, and right child r. If there is such a node, return m;
otherwise, create a new node n with label op, left child l, and right child r, and return n.
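
A direct transcription of the method, searching the whole array on every call, might look like the sketch below. The names nodes and node are hypothetical, leaves are stored in the same array with the right-child field left empty, and the expression i + 10 is an assumed example chosen so that the value numbers match the "node 3 has label +" description above.

```python
# nodes[m] holds the signature of the node with value number m.
# Index 0 is left unused so that value numbers start at 1, as in the text's example.
nodes = [None]


def node(op, l, r=None):
    """Return the value number of the node with signature <op, l, r>, creating it if needed."""
    signature = (op, l, r)
    for m in range(1, len(nodes)):    # linear search of every node created so far
        if nodes[m] == signature:
            return m
    nodes.append(signature)
    return len(nodes) - 1


# Hypothetical expression i + 10, with the operator node requested twice.
i = node("id", "i")
ten = node("num", 10)
first = node("+", i, ten)
second = node("+", i, ten)

print(i, ten, first, second)  # 1 2 3 3
```

The second request for <+, 1, 2> finds node 3 and returns it instead of allocating a fourth record.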

An obvious way to determine if node m is already in the array is to keep all previously created nodes on a list and to
check each node on the list to see if it has the desired signature. The search for m can be made more efficient by using k
lists, called buckets, and using a hashing function h to determine which bucket to search.

The hash function h computes the number of a bucket from the value of op, l, and r. It will always return the same bucket
number, given the same arguments. If m is not in the bucket h(op, l, r), then a new node n is created and added to this
bucket, so subsequent searches will find it there. Several signatures may hash into the same bucket number, but in practice we
expect each bucket to contain a small number of nodes.

Each bucket can be implemented as a linked list, as shown in Figure 8. Each cell in a linked list represents a node. The bucket
headers, consisting of pointers to the first cell in a list, are stored in an array. The bucket number returned by h(op, l, r) is an
index into this array of bucket headers.
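
The bucket organization can be sketched as follows. The choices here are assumptions made for illustration: K = 8 buckets, a hash built from zlib.crc32 over the printed signature (any function that is deterministic in op, l, and r would do), value numbers starting at 0, and the names Cell, headers, h, and find_or_create.

```python
import zlib
from dataclasses import dataclass
from typing import Optional

K = 8   # number of buckets (arbitrary choice)


@dataclass
class Cell:
    node: int                # value number of the node this cell represents
    next: Optional["Cell"]   # next cell in the same bucket, or None


nodes = []            # node records, indexed by value number
headers = [None] * K  # bucket headers: pointer to the first cell of each list


def h(op, l, r):
    """Bucket number for a signature; the same arguments always give the same bucket."""
    return zlib.crc32(repr((op, l, r)).encode()) % K


def find_or_create(op, l, r=None):
    signature = (op, l, r)
    b = h(op, l, r)
    cell = headers[b]
    while cell is not None:              # search only the one bucket
        if nodes[cell.node] == signature:
            return cell.node
        cell = cell.next
    nodes.append(signature)              # not found: create node n
    n = len(nodes) - 1
    headers[b] = Cell(n, headers[b])     # add it at the front of this bucket
    return n


b_leaf = find_or_create("id", "b")
c_leaf = find_or_create("id", "c")
first = find_or_create("-", b_leaf, c_leaf)
second = find_or_create("-", b_leaf, c_leaf)

print(first == second, len(nodes))  # True 3
```

Only the cells of one bucket are compared on each call, so the cost of a lookup depends on the length of that list rather than on the total number of nodes.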

Figure 8

This algorithm can be adapted to apply to nodes that are not allocated sequentially from an array. In many compilers,
nodes are allocated as they are needed, to avoid preallocating an array that may hold too many nodes most of the time and
not enough nodes some of the time. In this case, we cannot assume that nodes are in sequential storage, so we have to use
pointers to refer to nodes. If the hash function can be made to compute the bucket number from the label and pointers to
children, then we can number the nodes in any way and use this number as the value number of the node.
