1. What is a compiler?
A compiler is a program that reads a program written in one language –the source language and translates it into an equivalent program in another language-the target language. The compiler reports to its user the presence of errors in the source program.
2. What are the two parts of a compilation? Explain briefly.
Analysis and Synthesis are the two parts of compilation.
The analysis part breaks up the source program into constituent pieces and creates an intermediate representation of the source program.
The synthesis part constructs the desired target program from the intermediate representation.
3. List the subparts or phases of analysis part.
Analysis consists of three phases:
- Linear Analysis.
- Hierarchical Analysis.
- Semantic Analysis.
4. Depict diagrammatically how a language is processed.
Skeletal source program
↓
Preprocessor
↓
Source program
↓
Compiler
↓
Target assembly program
↓
Assembler
↓
Relocatable machine code
↓
Loader/ link editor ←library, relocatable object files
↓
Absolute machine code
5. What is linear analysis?
Linear analysis is one in which the stream of characters making up the source program is read from left to right and grouped into tokens that are sequences of characters having a collective meaning.
Also called lexical analysis or scanning.
6. List the various phases of a compiler.
The following are the various phases of a compiler:
- Lexical Analyzer
- Syntax Analyzer
- Semantic Analyzer
- Intermediate code generator
- Code optimizer
- Code generator
7. What are the classifications of a compiler?
Compilers are classified as:
- · Single- pass
- · Multi-pass
- · Load-and-go
- · Debugging or optimizing
8. What is a symbol table?
A symbol table is a data structure containing a record for each identifier, with fields for the attributes of the identifier. The data structure allows us to find the record for each identifier quickly and to store or retrieve data from that record quickly.
Whenever an identifier is detected by a lexical analyzer, it is entered into the symbol table. The attributes of an identifier cannot be determined by the lexical analyzer.
9. Mention some of the cousins of a compiler.
Cousins of the compiler are:
- · Preprocessors
- · Assemblers
- · Loaders and Link-Editors
10. List the phases that constitute the front end of a compiler.
The front end consists of those phases or parts of phases that depend primarily on the source language and are largely independent of the target machine. These include
- · Lexical and Syntactic analysis
- · The creation of symbol table
- · Semantic analysis
- · Generation of intermediate code
A certain amount of code optimization can be done by the front end as well. Also includes error handling that goes along with each of these phases.
11. Mention the back-end phases of a compiler.
The back end of compiler includes those portions that depend on the target machine and generally those portions do not depend on the source language, just the intermediate language. These include
- · Code optimization
- · Code generation, along with error handling and symbol- table operations.
12. Define compiler-compiler.
Systems to help with the compiler-writing process are often been referred to as compiler-compilers, compiler-generators or translator-writing systems.
Largely they are oriented around a particular model of languages , and they are suitable for generating compilers of languages similar model.
13. List the various compiler construction tools.
The following is a list of some compiler construction tools:
- · Parser generators
- · Scanner generators
- · Syntax-directed translation engines
- · Automatic code generators
- · Data-flow engines
14. Differentiate tokens, patterns, lexeme.
- · Tokens- Sequence of characters that have a collective meaning.
- · Patterns- There is a set of strings in the input for which the same token is produced as output. This set of strings is described by a rule called a pattern associated with the token
- · Lexeme- A sequence of characters in the source program that is matched by the pattern for a token.
15. List the operations on languages.
- · Union – L U M ={s | s is in L or s is in M}
- · Concatenation – LM ={st | s is in L and t is in M}
- · Kleene Closure – L* (zero or more concatenations of L)
- · Positive Closure – L+ ( one or more concatenations of L)
16. Write a regular expression for an identifier.
An identifier is defined as a letter followed by zero or more letters or digits.
The regular expression for an identifier is given as
letter (letter | digit)*
17. Mention the various notational shorthands for representing regular expressions.
- · One or more instances (+)
- · Zero or one instance (?)
- · Character classes ([abc] where a,b,c are alphabet symbols denotes the regular expressions a | b | c.)
- · Non regular sets
18. What is the function of a hierarchical analysis?
Hierarchical analysis is one in which the tokens are grouped hierarchically into nested collections with collective meaning.
Also termed as Parsing.
19. What does a semantic analysis do?
Semantic analysis is one in which certain checks are performed to ensure that components of a program fit together meaningfully.
Mainly performs type checking.
20. List the various error recovery strategies for a lexical analysis.
Possible error recovery actions are:
- · Panic mode recovery
- · Deleting an extraneous character
- · Inserting a missing character
- · Replacing an incorrect character by a correct character
- · Transposing two adjacent characters
No comments:
Post a Comment