Compiler Design Code Generation

Code Generation

Uploaded by

Nera Ajahh

COMPILER DESIGN - CODE GENERATION

http://www.tutorialspoint.com/compiler_design/compiler_design_code_generation.htm Copyright © tutorialspoint.com

Code generation can be considered the final phase of compilation. Optimization can still be
applied to the code after it has been generated, but that can be seen as a part of the
code generation phase itself. The code generated by the compiler is object code in some
lower-level language, for example, assembly language. We have seen that source code written
in a higher-level language is transformed into a lower-level language, which results in
lower-level object code. That object code should have the following minimum properties:

It should carry the exact meaning of the source code.

It should be efficient in terms of CPU usage and memory management.

We will now see how the intermediate code is transformed into target object code
(assembly code, in this case).

Directed Acyclic Graph

A Directed Acyclic Graph (DAG) is a tool that depicts the structure of basic blocks, helps to see
how values flow among the basic blocks, and offers opportunities for optimization as well. A DAG
makes transformations on basic blocks easy. A DAG can be understood as follows:

Leaf nodes represent identifiers, names or constants.

Interior nodes represent operators.

Interior nodes also represent the results of expressions or the identifiers/name where the
values are to be stored or assigned.

Example:

t0 = a + b
t1 = t0 + c
d = t0 + t1

[Figures: the DAG after each statement: t0 = a + b; t1 = t0 + c; d = t0 + t1]
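
The DAG for the example block can be built mechanically. The following is a minimal Python sketch, not a definitive implementation: the function name build_dag and the tuple-based node representation are assumptions made for illustration. It creates one node per distinct (operator, left, right) triple, so a repeated subexpression is shared instead of duplicated.

```python
def build_dag(block):
    """Build a DAG for a basic block of three-address statements.

    `block` is a list of (target, left, op, right) tuples, e.g.
    ("t0", "a", "+", "b") for t0 = a + b.  Returns (nodes, labels):
    nodes is a list of ("leaf", name) or (op, left_id, right_id) tuples,
    labels maps each name to the id of the node holding its current value.
    """
    nodes = []     # node id -> node description
    index = {}     # node description -> node id (this is what enables sharing)
    labels = {}    # identifier -> node id of its current value

    def node_id(desc):
        # Reuse an existing node when the same description was seen before.
        if desc not in index:
            index[desc] = len(nodes)
            nodes.append(desc)
        return index[desc]

    def operand(name):
        # An operand is either a value computed earlier in the block or a leaf.
        if name in labels:
            return labels[name]
        return node_id(("leaf", name))

    for target, left, op, right in block:
        left_id, right_id = operand(left), operand(right)
        labels[target] = node_id((op, left_id, right_id))
    return nodes, labels


# The example block from the text.
block = [("t0", "a", "+", "b"),
         ("t1", "t0", "+", "c"),
         ("d", "t0", "+", "t1")]
nodes, labels = build_dag(block)
for i, node in enumerate(nodes):
    print(i, node)
```

In this sketch the leaves are a, b and c, and the interior node for t0 is referenced twice: once by t1 and once by d.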

Peephole Optimization
This optimization technique works locally on the code to transform it into optimized
code. By locally, we mean a small portion of the code block at hand. These methods can be
applied to intermediate code as well as to target code. A small group of statements is analyzed
and checked for the following possible optimizations:

Redundant instruction elimination

At source code level, the following can be done by the user, step by step:

int add_ten(int x)
{
    int y, z;
    y = 10;
    z = x + y;
    return z;
}

int add_ten(int x)
{
    int y;
    y = 10;
    y = x + y;
    return y;
}

int add_ten(int x)
{
    int y = 10;
    return x + y;
}

int add_ten(int x)
{
    return x + 10;
}

At compilation level, the compiler searches for instructions that are redundant in nature. A
sequence of load and store instructions may carry the same meaning even after some of them are
removed. For example:

MOV x, R0
MOV R0, R1

Provided that R0 is not used afterwards, we can delete the first instruction and rewrite the
sequence as:

MOV x, R1
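
A peephole pass for this pattern can be sketched in Python as follows. This is an illustrative sketch under stated assumptions: instructions are modeled as (op, src, dst) tuples, the code is straight-line, the target machine accepts the combined MOV, and the names eliminate_redundant_moves and live_out are hypothetical. The pass merges the two moves only when the intermediate register is not read again.

```python
def _read_later(rest, reg):
    """Conservatively check whether `reg` is read in `rest` before being overwritten."""
    for op, src, dst in rest:
        if src == reg:
            return True
        if dst == reg:
            # MOV overwrites dst without reading it; other two-address
            # operations (e.g. ADD src, dst) read dst as well.
            return op != "MOV"
    return False


def eliminate_redundant_moves(code, live_out=frozenset()):
    """Rewrite  MOV s, Rt ; MOV Rt, d  into  MOV s, d  when Rt is dead afterwards.

    `live_out` names registers that are still needed after this code.
    """
    result = []
    i = 0
    while i < len(code):
        cur = code[i]
        nxt = code[i + 1] if i + 1 < len(code) else None
        if (nxt is not None
                and cur[0] == "MOV" and nxt[0] == "MOV"
                and cur[2] == nxt[1]               # second move reads what the first wrote
                and cur[2] not in live_out
                and not _read_later(code[i + 2:], cur[2])):
            result.append(("MOV", cur[1], nxt[2]))
            i += 2
        else:
            result.append(cur)
            i += 1
    return result


print(eliminate_redundant_moves([("MOV", "x", "R0"), ("MOV", "R0", "R1")]))
```

The liveness check is what makes the rewrite safe: if R0 is read later, both instructions are kept.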

Unreachable code
Unreachable code is a part of the program that is never executed because of the programming
constructs around it. Programmers may have accidentally written a piece of code that can never
be reached.

Example:

int add_ten(int x)
{
    return x + 10;
    printf("value of x is %d", x);
}

In this code segment, the printf statement will never be executed because control returns to
the caller before reaching it; hence the printf can be removed.
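
At the instruction level, the same situation arises after an unconditional transfer of control: anything between it and the next label cannot be reached. The Python sketch below illustrates this under assumptions of its own: instructions are tuples whose first element is the opcode, and the opcode names LABEL, GOTO and RET are hypothetical. Labels are treated conservatively as always reachable.

```python
def remove_unreachable(code):
    """Drop instructions that follow a GOTO or RET and precede the next label."""
    result = []
    reachable = True
    for instr in code:
        if instr[0] == "LABEL":
            reachable = True       # a label may be the target of some jump
        if reachable:
            result.append(instr)
        if instr[0] in ("GOTO", "RET"):
            reachable = False      # control never falls through to the next instruction
    return result


code = [("ADD", "#10", "R0"),
        ("RET",),
        ("CALL", "printf"),        # can never execute
        ("LABEL", "L1"),
        ("INC", "R0")]
print(remove_unreachable(code))
```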

Flow of control optimization

There are places in the code where program control jumps back and forth without performing
any significant task. These jumps can be removed. Consider the following chunk of code:

...
MOV R1, R2
GOTO L1
...
L1 : GOTO L2
L2 : INC R1

In this code, label L1 can be removed as it merely passes control on to L2. So instead of
jumping to L1 and then to L2, control can directly reach L2, as shown below:

...
MOV R1, R2
GOTO L2
...
L2 : INC R1
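
This retargeting of jumps can be sketched in Python. The representation is an assumption made for illustration: each instruction is a (label, op, arg) tuple with label set to None when the instruction is unlabeled, and the function name thread_jumps is hypothetical. The elided parts of the listing ("...") are left out of the sample input.

```python
def thread_jumps(code):
    """Retarget each GOTO whose destination is itself an unconditional GOTO."""
    # Labels whose instruction is an unconditional jump, and where that jump goes.
    forwards = {label: arg for label, op, arg in code
                if label is not None and op == "GOTO"}

    def final_target(label):
        seen = set()
        while label in forwards and label not in seen:   # guard against jump cycles
            seen.add(label)
            label = forwards[label]
        return label

    threaded = [(label, op, final_target(arg) if op == "GOTO" else arg)
                for label, op, arg in code]

    # A labeled GOTO that nothing jumps to any more is dead, provided control
    # cannot fall into it (the instruction before it is itself a GOTO).
    targets = {arg for _, op, arg in threaded if op == "GOTO"}
    result = []
    for i, (label, op, arg) in enumerate(threaded):
        falls_in = i == 0 or threaded[i - 1][1] != "GOTO"
        if op == "GOTO" and label is not None and label not in targets and not falls_in:
            continue
        result.append((label, op, arg))
    return result


code = [(None, "MOV", "R1, R2"),
        (None, "GOTO", "L1"),
        ("L1", "GOTO", "L2"),
        ("L2", "INC", "R1")]
for instr in thread_jumps(code):
    print(instr)
```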

Algebraic expression simplification

There are occasions where algebraic expressions can be made simpler. For example, the
statement a = a + 0 can be removed altogether, since it leaves a unchanged, and the statement
a = a + 1 can simply be replaced by INC a.
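
A sketch of such a simplifier in Python is shown below. It is illustrative only: the statement format (target, left, op, right), the function name simplify and the emitted MOV and INC tuples are assumptions, and only integer operands are considered.

```python
def simplify(stmt):
    """Simplify one three-address statement (target, left, op, right).

    Returns None when the statement can be deleted, a cheaper instruction
    tuple when one exists, and the statement unchanged otherwise.
    """
    target, left, op, right = stmt
    is_identity = (op == "+" and right == 0) or (op == "*" and right == 1)
    if is_identity:
        # a = a + 0 does nothing; a = b + 0 is just a copy of b into a.
        return None if target == left else ("MOV", left, target)
    if op == "+" and right == 1 and target == left:
        # a = a + 1 maps onto a single increment instruction.
        return ("INC", target)
    return stmt


print(simplify(("a", "a", "+", 0)))    # statement is dropped
print(simplify(("a", "a", "+", 1)))    # becomes an increment
```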

Strength reduction
There are operations that consume more time and space. Their ‘strength’ can be reduced by
replacing them with other operations that consume less time and space, but produce the same
result.

For example, x * 2 can be replaced by x << 1, which involves only one left shift. Similarly,
though the output of a * a and a² is the same, a * a is usually much more efficient than
computing a² through a general power routine.
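
The multiplication-to-shift rewrite can be sketched in Python as follows. The sketch assumes integer operands and a statement format of (target, left, op, right); the function name reduce_strength is hypothetical. It handles multiplication by any power of two greater than one, of which x * 2 is the simplest case.

```python
def reduce_strength(stmt):
    """Replace integer multiplication by a power of two with a left shift."""
    target, left, op, right = stmt
    if (op == "*" and isinstance(right, int)
            and right > 1 and (right & (right - 1)) == 0):
        # right == 2**k, so multiplying by it equals shifting left by k bits.
        return (target, left, "<<", right.bit_length() - 1)
    return stmt


print(reduce_strength(("y", "x", "*", 2)))
print(reduce_strength(("y", "x", "*", 8)))
print(reduce_strength(("y", "x", "*", 6)))   # not a power of two: unchanged
```

The check (right & (right - 1)) == 0 holds exactly when right has a single bit set, which is the condition under which the shift gives the same result as the multiplication.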

Accessing machine instructions

The target machine can provide more sophisticated instructions, which can perform specific
operations much more efficiently. If the target code can use those instructions directly,
that will not only improve the quality of the code, but also yield more efficient results.

Code Generator
A code generator is expected to have an understanding of the target machine’s runtime
environment and its instruction set. The code generator should take the following things into
consideration to generate the code:

Target language : The code generator has to be aware of the nature of the target language
into which the code is to be transformed. That language may provide some machine-specific
instructions to help the compiler generate the code in a more convenient way. The target
machine can have either CISC or RISC processor architecture.

IR Type : Intermediate representation has various forms. It can be an Abstract Syntax Tree
(AST) structure, Reverse Polish Notation, or 3-address code.

Selection of instruction : The code generator takes the Intermediate Representation as input
and converts (maps) it into the target machine’s instruction set. One representation can be
converted in many ways (instructions), so it becomes the responsibility of the code generator
to choose the appropriate instructions wisely.

Register allocation : A program has a number of values to be maintained during
execution. The target machine’s architecture may not allow all of the values to be kept in
the CPU registers. The code generator decides which values to keep in the registers. It also
decides which registers to use for these values.

Ordering of instructions : Finally, the code generator decides the order in which the
instructions will be executed. It creates schedules for the instructions to execute them.

Descriptors
The code generator has to track both the registers (for availability) and the addresses
(location of values) while generating the code. For both of them, the following two descriptors
are used:

Register descriptor : Register descriptor is used to inform the code generator about the
availability of registers. Register descriptor keeps track of values stored in each register.
Whenever a new register is required during code generation, this descriptor is consulted for
register availability.

Address descriptor : Values of the names (identifiers) used in the program might be stored at
different locations during execution. Address descriptors are used to keep track of the memory
locations where the values of identifiers are stored. These locations may include CPU
registers, heaps, stacks, memory or a combination of the mentioned locations.

The code generator keeps both descriptors up to date as it emits code. For a load statement,
LD R1, x, the code generator:
updates the Register Descriptor of R1 to show that it holds the value of x, and
updates the Address Descriptor of x to show that one instance of x is in R1.
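
The two updates for a load can be sketched in Python. This is a minimal illustration, not a definitive design: the class name Descriptors, its attribute names and the use of the string "mem" to stand for a value's home in memory are all assumptions.

```python
from collections import defaultdict


class Descriptors:
    """Register and address descriptors kept side by side."""

    def __init__(self, registers):
        self.register = {r: set() for r in registers}   # register -> names it holds
        self.address = defaultdict(set)                  # name -> locations of its value

    def load(self, reg, name):
        """Record the effect of  LD reg, name."""
        # Whatever reg held before is overwritten, so it is no longer found there.
        for old in self.register[reg]:
            self.address[old].discard(reg)
        # Register descriptor: reg now holds only the value of name.
        self.register[reg] = {name}
        # Address descriptor: name is in memory (it was loaded from there) and in reg.
        self.address[name].update({"mem", reg})


d = Descriptors(["R1", "R2"])
d.load("R1", "x")
print(d.register["R1"], sorted(d.address["x"]))
```

Loading a different name into R1 afterwards removes R1 from the address descriptor of x, which keeps the two descriptors consistent with each other.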

Code Generation
A basic block consists of a sequence of three-address instructions. The code generator takes
this sequence of instructions as input.

Note : If the value of a name is found at more than one place (register, cache, or memory), the
register’s value is preferred over the cache and main memory. Likewise, the cache’s value is
preferred over main memory. Main memory is given the least preference.

getReg : The code generator uses the getReg function to determine the status of available
registers and the location of name values. getReg works as follows:

If variable Y is already in register R, it uses that register.

Else if some register R is available, it uses that register.

Else, if neither of the above options is possible, it chooses a register that requires the
minimal number of load and store instructions.
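
The three rules of getReg can be sketched in Python. The sketch is an assumption-laden illustration: the register descriptor is a dictionary from register name to the set of names it holds, and spill_cost is a hypothetical table giving the number of loads and stores needed to free each register (a real compiler would derive it from liveness information).

```python
def get_reg(y, register_desc, spill_cost):
    """Pick a register for operand y following the three rules above."""
    # Rule 1: y is already in some register.
    for reg, names in register_desc.items():
        if y in names:
            return reg
    # Rule 2: some register is empty.
    for reg, names in register_desc.items():
        if not names:
            return reg
    # Rule 3: take the register that is cheapest to free.
    return min(register_desc, key=lambda reg: spill_cost[reg])


regs = {"R0": {"a"}, "R1": {"b"}, "R2": set()}
cost = {"R0": 2, "R1": 1, "R2": 0}
print(get_reg("a", regs, cost))   # rule 1
print(get_reg("c", regs, cost))   # rule 2
```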

For an instruction x = y OP z, the code generator may perform the following actions. Let us assume
that L is the location (preferably a register) where the output of y OP z is to be saved:

1. Call the function getReg to decide the location of L.

2. Determine the present location (register or memory) of y by consulting the Address Descriptor of y.

3. If y is not presently in register L, then generate the following instruction to copy the value of
y to L:

MOV y’, L

where y’ denotes the present location of y, whose value is copied.

4. Determine the present location of z using the same method used in step 2 for y and
generate the following instruction:

OP z’, L

where z’ denotes the present location of z.

5. Now L contains the value of y OP z, which is intended to be assigned to x. So, if L is a register,
update its descriptor to indicate that it contains the value of x. Update the descriptor of x to
indicate that it is stored at location L.

6. If y and z have no further use, the registers holding them can be given back to the system.
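
The actions for x = y OP z can be put together in a small Python sketch. It is a simplified illustration under several assumptions: the function name gen_statement is hypothetical, a name without a recorded location is taken to live in memory under its own name, the register choice is a cut-down stand-in for getReg that never spills, and the final action (releasing y and z when they have no further use) is omitted because it needs liveness information.

```python
def gen_statement(x, y, op, z, reg_desc, addr_desc):
    """Emit two-address code for  x = y op z  and update both descriptors.

    reg_desc  maps register -> set of names it currently holds.
    addr_desc maps name -> set of locations holding its current value.
    """
    def locations(name):
        # Assumption: an unknown name lives in memory under its own name.
        return addr_desc.setdefault(name, {name})

    def best_location(name):
        # Prefer a register copy over the memory copy.
        regs = sorted(loc for loc in locations(name) if loc in reg_desc)
        return regs[0] if regs else name

    code = []
    # Choose L. Reuse y's register only when y also survives somewhere else.
    y_loc = best_location(y)
    if y_loc in reg_desc and len(locations(y)) > 1:
        L = y_loc
    else:
        free = [r for r, names in reg_desc.items() if not names]
        if not free:
            raise RuntimeError("no free register; spilling is not sketched here")
        L = free[0]
        code.append(f"MOV {y_loc}, {L}")       # y is not in L yet, so copy it
    # Apply the operator using z's best current location.
    code.append(f"{op} {best_location(z)}, {L}")
    # L now holds x and nothing else; x is found only in L.
    for name in reg_desc[L]:
        locations(name).discard(L)
    for names in reg_desc.values():
        names.discard(x)
    reg_desc[L] = {x}
    addr_desc[x] = {L}
    return code


reg_desc = {"R0": set(), "R1": set(), "R2": set()}
addr_desc = {}
for stmt in [("t0", "a", "ADD", "b"), ("t1", "t0", "ADD", "c"), ("d", "t0", "ADD", "t1")]:
    print("\n".join(gen_statement(*stmt, reg_desc, addr_desc)))
```

Because the sketch never frees registers, it runs out of them quickly; a fuller version would release the registers of names that are no longer live and spill when none is free.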

Other code constructs like loops and conditional statements are translated into assembly
language in the usual way, using labels and jump instructions.