Course Goals
•  To provide students with an understanding of the major phases of a compiler.
•  To introduce students to the theory behind the various phases, including regular expressions, context-free grammars, and finite state automata.
•  To provide students with an understanding of the design and implementation of a compiler.
•  To have the students build a compiler, through type checking and intermediate code generation, for a small language.
•  To provide students with an opportunity to work in a group on a large project.
Course Outcomes
•  Students will have experience using current compiler generation tools.
•  Students will be familiar with the different phases of compilation.
•  Students will have experience defining and specifying the semantic rules of a programming language.
Prerequisites
•  In-depth knowledge of at least one structured programming language.
•  Strong background in algorithms, data structures, and abstract data types, including stacks, binary trees, and graphs.
•  Understanding of grammar theories.
•  Understanding of data types and control structures, their design and implementation.
•  Understanding of the design and implementation of subprograms, parameter passing mechanisms, and scope.
Major Topics Covered in the Course
•  Overview & Lexical Analysis (Scanning)
•  Grammars & Syntax Analysis: Top-Down Parsing
•  Syntax Analysis: Bottom-Up Parsing
•  Semantic Analysis
•  Symbol Tables and Run-time Systems
•  Code Generation
•  Introduction to Optimization and Control Flow Analysis
Textbook
“Compilers: Principles, Techniques, and Tools” by Aho, Lam, Sethi, and Ullman, 2nd edition.
GRADING
Assignments & project: 40
Midterm Exam: 20
Final Exam: 40
Compilers and Interpreters
“Compilation”: translation of a program written in a source language into a semantically equivalent program written in a target language.
[Diagram: Source Program → Compiler → Target Program; the compiler also emits error messages, and the target program maps Input to Output.]
Compilers and Interpreters (cont’d)
“Interpretation”: performing the operations implied by the source program.
[Diagram: Source Program and Input → Interpreter → Output; the interpreter also emits error messages.]
The Analysis-Synthesis Model of Compilation
There are two parts to compilation:
•  Analysis determines the operations implied by the source program, which are recorded in a tree structure.
•  Synthesis takes the tree structure and translates the operations therein into the target program.
Preprocessors, Compilers, Assemblers, and Linkers
Skeletal Source Program → Preprocessor → Source Program → Compiler → Target Assembly Program → Assembler → Relocatable Object Code → Linker → Absolute Machine Code
The linker also takes libraries and relocatable object files as input.
Try for example: gcc -v myprog.c
The Phases of a Compiler
Each phase and its output, with samples for the source string A=B+C;
•  Programmer (source code producer) → source string:  A=B+C;
•  Scanner (performs lexical analysis) → token string:  ‘A’, ‘=’, ‘B’, ‘+’, ‘C’, ‘;’, plus a symbol table with names
•  Parser (performs syntax analysis based on the grammar of the programming language) → parse tree or abstract syntax tree:
        ;
        |
        =
       / \
      A   +
         / \
        B   C
•  Semantic analyzer (type checking, etc.) → annotated parse tree or abstract syntax tree
•  Intermediate code generator → three-address code, quads, or RTL:
        int2fp B  t1
        +      t1 C  t2
        :=     t2 A
•  Optimizer → three-address code, quads, or RTL:
        int2fp B  t1
        +      t1 #2.3 A
•  Code generator → assembly code:
        MOVF   #2.3,r1
        ADDF2  r1,r2
        MOVF   r2,A
•  Peephole optimizer → assembly code:
        ADDF2  #2.3,r2
        MOVF   r2,A
The Grouping of Phases
Compiler front and back ends:
•  Front end: analysis (machine independent)
•  Back end: synthesis (machine dependent)
Compiler passes: a collection of phases is done only once (single pass) or multiple times (multi pass)
•  Single pass: usually requires everything to be defined before being used in the source program
•  Multi pass: the compiler may have to keep the entire program representation in memory
Compiler-Construction Tools
Software development tools are available to implement one or more compiler phases:
•  Scanner generators
•  Parser generators
•  Syntax-directed translation engines
•  Automatic code generators
•  Data-flow engines
What qualities do you want in a compiler that you buy?
1. Correct code
2. Output runs fast
3. Compiler runs fast
4. Compile time proportional to program size
5. Support for separate compilation
6. Good diagnostics for syntax errors
7. Works well with the debugger
8. Good diagnostics for flow anomalies
9. Good diagnostics for storage leaks
10. Consistent, predictable optimization
High-level View of a Compiler
[Diagram: Source code → Compiler → Machine code; the compiler reports errors.]
Implications:
•  Must recognize legal (and illegal) programs
•  Must generate correct code
•  Must manage storage of all variables (and code)
•  Must agree with OS & linker on format for object code
Traditional Two-pass Compiler
[Diagram: Source code → Front End → IR → Back End → Machine code; both ends report errors.]
•  Use an intermediate representation (IR)
•  Front end maps legal source code into IR
•  Back end maps IR into target machine code
•  Admits multiple front ends & multiple passes (better code)
The Front End
[Diagram: Source code → Scanner → tokens → Parser → IR; both report errors.]
Responsibilities:
•  Recognize legal (& illegal) programs
•  Report errors in a useful way
•  Produce IR & preliminary storage map
•  Shape the code for the back end
Much of front end construction can be automated.
The Front End: Scanner
•  Maps the character stream into words, the basic unit of syntax
•  Produces words & their parts of speech:
        x = x + y ;   becomes   <id,x> <op,=> <id,x> <op,+> <id,y> ;
   word ≈ lexeme; part of speech ≈ token type. In casual speech, we call the pair a token.
•  Typical tokens include number, identifier, +, -, while, if
•  Scanner eliminates white space
•  Speed is important ⇒ use a specialized recognizer
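As a concrete illustration, here is a minimal hand-written scanner sketch in C that maps the stream x = x + y ; into <part of speech, lexeme> pairs. The token categories and layout are choices made for this sketch, not the course's reference implementation:

    #include <ctype.h>
    #include <stdio.h>

    /* Minimal hand-written scanner sketch: reads a fixed input string,
       skips white space, and prints <part of speech, lexeme> pairs.
       The categories (id, number, op) are illustrative choices. */
    int main(void) {
        const char *p = "x = x + y ;";
        char lexeme[64];

        while (*p) {
            while (isspace((unsigned char)*p)) p++;   /* scanner eliminates white space */
            if (*p == '\0') break;
            int n = 0;
            if (isalpha((unsigned char)*p)) {         /* identifier: letter then alphanumerics */
                while (isalnum((unsigned char)*p)) lexeme[n++] = *p++;
                lexeme[n] = '\0';
                printf("<id,%s> ", lexeme);
            } else if (isdigit((unsigned char)*p)) {  /* number: a run of digits */
                while (isdigit((unsigned char)*p)) lexeme[n++] = *p++;
                lexeme[n] = '\0';
                printf("<number,%s> ", lexeme);
            } else {                                  /* operator/punctuation: one character */
                printf("<op,%c> ", *p++);
            }
        }
        printf("\n");   /* prints: <id,x> <op,=> <id,x> <op,+> <id,y> <op,;> */
        return 0;
    }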
The Front End: Parser
•  Recognizes context-free syntax & reports errors
•  Guides context-sensitive analysis (type checking)
•  Builds IR for the source program
Hand-coded parsers are fairly easy to build; most books advocate using automatic parser generators.
The Front End: Abstract Syntax Trees
Compilers often use an abstract syntax tree; it is much more concise than a parse tree. ASTs are one form of intermediate representation (IR). The AST summarizes grammatical structure, without including detail about the derivation. For example, for (x - 2) + y:
          +
         / \
        -   <id,y>
       / \
  <id,x>  <number,2>
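One way such an AST might be realized in code; a minimal sketch, with node kinds invented for this example:

    /* Minimal AST node: interior nodes hold an operator and two children;
       leaves hold an identifier name or a number. */
    typedef enum { AST_OP, AST_ID, AST_NUM } NodeKind;

    typedef struct Node {
        NodeKind kind;
        char op;                     /* used when kind == AST_OP, e.g. '+' or '-' */
        const char *name;            /* used when kind == AST_ID */
        int value;                   /* used when kind == AST_NUM */
        struct Node *left, *right;   /* children of an AST_OP node */
    } Node;

    /* (x - 2) + y is then the tree:
       op('+', op('-', id("x"), num(2)), id("y")) */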
The Back End
[Diagram: IR → Instruction Selection → IR → Instruction Scheduling → IR → Register Allocation → Machine code; errors reported throughout.]
Responsibilities:
•  Translate IR into target machine code
•  Choose instructions to implement each IR operation
•  Decide which values to keep in registers
•  Ensure conformance with system interfaces
Automation has been much less successful in the back end.
The Back End: Instruction Selection
•  Produce fast, compact code
•  Take advantage of target features such as addressing modes
•  Usually viewed as a pattern matching problem: ad hoc methods, pattern matching, dynamic programming
This was “the problem of the future” in 1978, spurred by the transition from the PDP-11 to the VAX-11; the orthogonality of RISC simplified this problem.
The Back End: Instruction Scheduling
•  Avoid hardware stalls and interlocks
•  Use all functional units productively
•  Can increase lifetimes of variables (changing the allocation)
Optimal scheduling is NP-Complete in nearly all cases, but good heuristic techniques are well understood.
The Back End: Register Allocation
•  Have each value in a register when it is used
•  Manage a limited set of resources
•  Can change instruction choices & insert LOADs & STOREs
Optimal allocation is NP-Complete (for 1 or k registers); compilers approximate solutions to NP-Complete problems.
Traditional Three-pass Compiler
[Diagram: Source code → Front End → IR → Middle End → IR → Back End → Machine code; all stages report errors.]
Code improvement (or optimization):
•  Analyzes IR and rewrites (or transforms) IR
•  Primary goal is to reduce the running time of the compiled code
•  May also improve space, power consumption, …
•  Must preserve the “meaning” of the code, as measured by the values of named variables
The Optimizer (or Middle End)
Modern optimizers are structured as a series of passes.
[Diagram: IR → Opt 1 → Opt 2 → Opt 3 → … → Opt n → IR; each pass reports errors.]
Typical transformations:
•  Discover & propagate some constant value
•  Move a computation to a less frequently executed place
•  Discover a redundant computation & remove it
•  Remove useless or unreachable code
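To make these transformations concrete, here is a small source-level analogue (a fragment invented for illustration; real optimizers perform these rewrites on the IR):

    int f(int y) {
        int x = 4;            /* constant discovered ...               */
        int dead = y * y;     /* useless: the result is never used     */
        if (x > 5)            /* ... and propagated: always false here */
            return y / x;     /* unreachable                           */
        return y + x;         /* becomes y + 4                         */
    }
    /* After constant propagation plus useless- and unreachable-code
       removal, the body reduces to: return y + 4; */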
The Big Picture
Why study lexical analysis? We want to avoid writing scanners by hand.
Goals:
•  To simplify specification & implementation of scanners
•  To understand the underlying techniques and technologies
[Diagram: specifications → Scanner Generator → tables or code; source code → Scanner → parts of speech.]
Lexical Analysis
The lexical analyzer reads the stream of characters making up the source program and groups the characters into meaningful sequences called lexemes. For each lexeme, the lexical analyzer produces as output a token of the form (token-name, attribute-value): the first component, token-name, is an abstract symbol that is used during syntax analysis, and the second component, attribute-value, points to an entry in the symbol table for this token.
Example
Suppose a source program contains the assignment statement:
    position = initial + rate * 60
The characters in this assignment could be grouped into the following lexemes and mapped into the following tokens passed on to the syntax analyzer:
•  position is a lexeme that would be mapped into the token (id, 1).
•  The assignment symbol = is a lexeme that is mapped into the token (=).
•  initial is a lexeme that is mapped into the token (id, 2).
•  + is a lexeme that is mapped into the token (+).
•  rate is a lexeme that is mapped into the token (id, 3).
•  * is a lexeme that is mapped into the token (*).
•  60 is a lexeme that is mapped into the token (60).
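A sketch of how those attribute values might index a symbol table; the interning scheme here is an assumption for illustration:

    #include <stdio.h>
    #include <string.h>

    /* Toy symbol table: the attribute value k in a token (id, k) is the
       1-based index of the lexeme's entry, as in the example above. */
    static const char *symtab[16];
    static int nsyms = 0;

    static int intern(const char *lexeme) {
        for (int i = 0; i < nsyms; i++)
            if (strcmp(symtab[i], lexeme) == 0)
                return i + 1;          /* already present: reuse the entry */
        symtab[nsyms++] = lexeme;      /* otherwise add a new entry */
        return nsyms;
    }

    int main(void) {
        /* position = initial + rate * 60 */
        int p = intern("position");
        int i = intern("initial");
        int r = intern("rate");
        printf("(id,%d) (=) (id,%d) (+) (id,%d) (*) (60)\n", p, i, r);
        return 0;   /* prints: (id,1) (=) (id,2) (+) (id,3) (*) (60) */
    }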
Specifying Lexical Patterns (micro-syntax)
A scanner recognizes the language’s parts of speech. Some parts are easy:
•  White space
        WhiteSpace → blank | tab | WhiteSpace blank | WhiteSpace tab
•  Keywords and operators: specified as literal patterns: if, then, else, while, =, +, …
•  Comments: opening and (perhaps) closing delimiters
        /* followed by */ in C
        // in C++
        % in LaTeX
Specifying Lexical Patterns (micro-syntax)
A scanner recognizes the language’s parts of speech. Some parts are more complex:
•  Identifiers: alphabetic followed by alphanumerics + _, &, $, …; may have limited length
•  Numbers
        Integers: 0, or a digit from 1-9 followed by digits from 0-9
        Decimals: integer . digits from 0-9, or . digits from 0-9
        Reals: (integer or decimal) E (+ or -) digits from 0-9
        Complex: ( real , real )
We need a notation for specifying these patterns, and we would like the notation to lead to an implementation.
Regular Expressions
Patterns form a regular language  ***  any finite language is regular  ***
Regular expressions (REs) describe regular languages.
Regular expression (over alphabet Σ):
•  ε is a RE denoting the set { ε }
•  If a is in Σ, then a is a RE denoting { a }
•  If x and y are REs denoting L(x) and L(y), then
        (x) is a RE denoting L(x)
        x | y is a RE denoting L(x) ∪ L(y)
        xy is a RE denoting L(x)L(y)
        x* is a RE denoting L(x)*
Precedence is closure, then concatenation, then alternation; so a | bc* reads as a | (b(c*)).
Ever type “rm *.o a.out”?
Set Operations (refresher)
You need to know these definitions:
•  Union: L ∪ M = { s | s is in L or s is in M }
•  Concatenation: LM = { st | s is in L and t is in M }
•  Kleene closure: L* = L⁰ ∪ L¹ ∪ L² ∪ … (zero or more concatenations of L)
•  Positive closure: L⁺ = L¹ ∪ L² ∪ … (one or more concatenations of L)
Examples of Regular Expressions
Identifiers:
        Letter → ( a | b | c | … | z | A | B | C | … | Z )
        Digit → ( 0 | 1 | 2 | … | 9 )
        Identifier → Letter ( Letter | Digit )*
Numbers:
        Integer → ( + | - | ε ) ( 0 | ( 1 | 2 | 3 | … | 9 ) Digit* )
        Decimal → Integer . Digit*
        Real → ( Integer | Decimal ) E ( + | - | ε ) Digit*
        Complex → ( Real , Real )
Numbers can get much more complicated!
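Assuming a POSIX system, these patterns translate almost directly into extended regular expressions; a small sketch using <regex.h>, where the pattern spellings are this sketch's choices:

    #include <regex.h>
    #include <stdio.h>

    /* Match a string against a POSIX extended RE, anchored at both ends. */
    static int matches(const char *pattern, const char *s) {
        regex_t re;
        if (regcomp(&re, pattern, REG_EXTENDED | REG_NOSUB) != 0)
            return 0;
        int ok = (regexec(&re, s, 0, NULL, 0) == 0);
        regfree(&re);
        return ok;
    }

    int main(void) {
        const char *identifier = "^[A-Za-z][A-Za-z0-9]*$";  /* Letter ( Letter | Digit )* */
        const char *integer    = "^[+-]?(0|[1-9][0-9]*)$";  /* ( + | - | eps ) ( 0 | 1-9 Digit* ) */

        printf("%d\n", matches(identifier, "rate"));        /* 1 */
        printf("%d\n", matches(identifier, "9lives"));      /* 0 */
        printf("%d\n", matches(integer, "-60"));            /* 1 */
        return 0;
    }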
Regular Expressions (the point)
To make scanning tractable, programming languages differentiate between parts of speech by controlling their spelling (as opposed to dictionary lookup). The difference between Identifier and Keyword is entirely lexical:
•  While is a Keyword
•  Whilst is an Identifier
The lexical patterns used in programming languages are regular. Using results from automata theory, we can automatically build recognizers from regular expressions.
⇒ We study REs to automate scanner construction!
Example: Recognizer for Register
Consider the problem of recognizing register names:
        Register → r ( 0 | 1 | 2 | … | 9 ) ( 0 | 1 | 2 | … | 9 )*
•  Allows registers of arbitrary number
•  Requires at least one digit
The RE corresponds to a recognizer (or DFA):
        s0 —r→ s1 —(0|1|2|…|9)→ s2 (accepting state), with s2 —(0|1|2|…|9)→ s2
With implicit transitions on other inputs to an error state, se.
Example (continued)
DFA operation:
•  Start in state s0 and take transitions on each input character
•  The DFA accepts a word x iff x leaves it in a final state (s2)
So, for the Register recognizer above:
•  r17 takes it through s0, s1, s2 and accepts
•  r takes it through s0, s1 and fails
•  a takes it straight to se
Example (continued)
The recognizer translates directly into code; to change DFAs, just change the tables.

        char ← next character
        state ← s0
        call action(state, char)
        while (char ≠ eof)
            state ← δ(state, char)
            call action(state, char)
            char ← next character
        if type(state) = final
            then report acceptance
            else report failure

        action(state, char)
            switch (type(state))
                case start:  word ← char
                case normal: word ← word + char
                case final:  word ← char
                case error:  report error
            end
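A concrete C rendering of this table-driven skeleton for the Register DFA above, with the action routine reduced to tracking acceptance; the state numbering and input classification are choices made for this sketch:

    #include <stdio.h>

    /* Table-driven recognizer for Register -> r (0|1|...|9)(0|1|...|9)*.
       States: 0 = s0, 1 = s1 (saw 'r'), 2 = s2 (accepting), 3 = se (error).
       Input classes: 0 = 'r', 1 = digit, 2 = anything else. */
    static int classify(char c) {
        if (c == 'r') return 0;
        if (c >= '0' && c <= '9') return 1;
        return 2;
    }

    static const int delta[4][3] = {
        /* 'r' digit other */
        {   1,   3,    3 },   /* s0 */
        {   3,   2,    3 },   /* s1 */
        {   3,   2,    3 },   /* s2: further digits stay in s2 */
        {   3,   3,    3 },   /* se: the error state is a sink */
    };

    static int accepts(const char *word) {
        int state = 0;                       /* start in s0 */
        for (; *word; word++)
            state = delta[state][classify(*word)];
        return state == 2;                   /* accept iff we end in s2 */
    }

    int main(void) {
        printf("%d %d %d\n", accepts("r17"), accepts("r"), accepts("a")); /* 1 0 0 */
        return 0;
    }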
What if we need a tighter specification?
r Digit Digit* allows arbitrary numbers:
•  Accepts r00000
•  Accepts r99999
What if we want to limit it to r0 through r31? Write a tighter regular expression:
        Register → r ( ( 0 | 1 | 2 ) ( Digit | ε ) | ( 4 | 5 | 6 | 7 | 8 | 9 ) | ( 3 | 30 | 31 ) )
        Register → r0 | r1 | r2 | … | r31 | r00 | r01 | r02 | … | r09
This produces a more complex DFA:
•  Has more states
•  Same cost per transition
•  Same basic implementation
Tighter register specification (continued)
The DFA for
        Register → r ( ( 0 | 1 | 2 ) ( Digit | ε ) | ( 4 | 5 | 6 | 7 | 8 | 9 ) | ( 3 | 30 | 31 ) )
accepts a more constrained set of registers; same set of actions, more states:
        s0 —r→ s1
        s1 —0,1,2→ s2;  s2 —(0|1|2|…|9)→ s3
        s1 —3→ s5;  s5 —0,1→ s6
        s1 —4,5,6,7,8,9→ s4
(accepting states: s2, s3, s4, s5, s6; all other transitions go to the error state se)
Tighter register specification (continued)
To implement the recognizer:
•  Use the same code skeleton
•  Use transition and action tables for the new RE
•  Bigger tables, more space, same asymptotic costs
•  Better (micro-)syntax checking at the same cost
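Following that recipe, only the tables change for the tighter RE; a sketch reusing the skeleton from the earlier sketch, where the state numbering follows the DFA above and state 7 is the error sink:

    #include <stdio.h>

    /* Tables for Register -> r ( (0|1|2)(Digit|eps) | (4|...|9) | 3 | 30 | 31 ).
       States: 0..6 = s0..s6, 7 = se. Accepting states: s2..s6.
       Input classes: 0='r', 1='0'|'1', 2='2', 3='3', 4='4'..'9', 5=other. */
    static int classify(char c) {
        if (c == 'r') return 0;
        if (c == '0' || c == '1') return 1;
        if (c == '2') return 2;
        if (c == '3') return 3;
        if (c >= '4' && c <= '9') return 4;
        return 5;
    }

    static const int delta[8][6] = {
        /*  r  0-1   2    3  4-9 other */
        {   1,  7,   7,   7,  7,  7 },  /* s0: only 'r' advances            */
        {   7,  2,   2,   5,  4,  7 },  /* s1: first digit picks the branch */
        {   7,  3,   3,   3,  3,  7 },  /* s2: r0|r1|r2; one more digit ok  */
        {   7,  7,   7,   7,  7,  7 },  /* s3: r00..r29, done               */
        {   7,  7,   7,   7,  7,  7 },  /* s4: r4..r9, done                 */
        {   7,  6,   7,   7,  7,  7 },  /* s5: r3; only 0 or 1 may follow   */
        {   7,  7,   7,   7,  7,  7 },  /* s6: r30 or r31, done             */
        {   7,  7,   7,   7,  7,  7 },  /* se: error sink                   */
    };

    static int accepts(const char *word) {
        int state = 0;
        for (; *word; word++)
            state = delta[state][classify(*word)];
        return state >= 2 && state <= 6;     /* accept iff in s2..s6 */
    }

    int main(void) {
        printf("%d %d %d\n", accepts("r31"), accepts("r17"), accepts("r99")); /* 1 1 0 */
        return 0;
    }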
