Specification of Token

•

0 likes•75 views

A. S. M. Shafi

Compiler Design

Engineering

SPECIFICATION OF TOKENS
There are 3 specifications of tokens:
1. Strings
2. Language
3. Regular expression
An alphabet is any finite set of symbols. Typical examples of symbols are letters, digits, and punctuation.
The set {0,1) is the binary alphabet.
A string over an alphabet is a finite sequence of symbols drawn from that alphabet. In language theory,
the terms "sentence" and "word" are often used as synonyms for "string."
The length of a string s, usually written |s|, is the number of occurrences of symbols in s. For example,
banana is a string of length six. The empty string, denoted ∊, is the string of length zero.
A language is any countable set of strings over some fixed alphabet. This definition is very broad. Abstract
languages like ∅, the empty set, or {∊}, the set containing only the empty string, are languages under this
definition.
Operations on strings
The following string-related terms are commonly used:
 A prefix of string s is any string obtained by removing zero or more symbols from the end of string
s. For example, ban is a prefix of banana.
 A suffix of string s is any string obtained by removing zero or more symbols from the beginning of
s. For example, nana is a suffix of banana.
 A substring of s is obtained by deleting any prefix and any suffix from s. For example, nan is a
substring of banana.
 The proper prefixes, suffixes, and substrings of a string s are those prefixes, suffixes, and
substrings, respectively of s that are not ε or not equal to s itself.
 A subsequence of s is any string formed by deleting zero or more not necessarily consecutive
positions of s. For example, baan is a subsequence of banana.
Operations on languages:
The following are the operations that can be applied to languages:
1. Union
2. Concatenation
3. Kleene closure
4. Positive closure
which are defined formally in Fig. 1.

Figure 1: Definitions of operations on languages
Union is the familiar operation on sets. The concatenation of languages is all strings formed by taking a
string from the first language and a string from the second language, in all possible ways, and concatenating
them.
The (Kleene) closure of a language L, denoted L*, is the set of strings we get by concatenating L zero or
more times. Note that Lo, the "concatenation of L zero times," is defined to be {∊), and inductively, Li is Li-1.
Finally, the positive closure, denoted L+, is the same as the Kleene closure, but without the term Lo. That
is, ∊ will not be in L+ unless it is in L itself.
The following example shows the operations on strings: Let L={0,1} and S={a,b,c}

What's hot

Regular expressionsShiraz316

1.5 & 1.6 regular languages & regular expressionSampath Kumar S

Lecture 5shah zeb

Lecture 6shah zeb

Regular ExpressionA. S. M. Shafi

Theory of AutomataFarooq Mian

Lecture 7shah zeb

Final formal languagesMegha Khanna

LanguageMobeen Mustafa

01 alphabets strings and languagesJohnDevaPrasanna1

FLAT Notesdilip kumar

WordsSarah Abdussalam

4.7. chomskian hierarchy of languagesSampath Kumar S

Generalized transition graphsArham Khan G

Ch3 4 regular expression and grammarmeresie tesfay

Lecture 3,4shah zeb

Formal languageRajendran

AssigmentAmna Awan

Lecture 8shah zeb

What's hot (19)

Regular expressions

1.5 & 1.6 regular languages & regular expression

Lecture 5

Lecture 6

Regular Expression

Theory of Automata

Lecture 7

Final formal languages

Language

01 alphabets strings and languages

FLAT Notes

Words

4.7. chomskian hierarchy of languages

Generalized transition graphs

Ch3 4 regular expression and grammar

Lecture 3,4

Formal language

Assigment

Lecture 8

Similar to Specification of Token

Lexical analyzerPrincess Doll

Ch2 automata.pptxTemesgenAzezew

Automata definitionsSajid Marwat

Flat unit 1VenkataRaoS1

Chapter2CDpdf__2021_11_26_09_19_08.pdfDrIsikoIsaac

Regular expressionlahirubanuka1

Lecture: Regular Expressions and Regular LanguagesMarina Santini

theory of computation lecture 028threspecter

Lecture Notes-Are Natural Languages Regular.pdfDeptii Chaudhari

01-Introduction&Languages.pdfTariqSaeed80

Theory of Automata Lesson 02hamzamughal39

, Phonological systems are rule-governed; that is, they operat.docxdurantheseldine

A Statistical Model for Morphology Inspired by the Amis Languagedannyijwest

A statistical model for morphology inspired by the Amis languageIJwest

Module 1 TOC.pptxMohitJain21BCE1523

Lesson 02maamir farooq

Lesson 02University of Haripur

Free Ebooks Download ! EdholeEdhole.com

Mba ebooks ! EdholeEdhole.com

A language can be seen as a system suitable for expression of certai.pdfanudamobileshopee

Similar to Specification of Token (20)

Lexical analyzer

Ch2 automata.pptx

Automata definitions

Flat unit 1

Chapter2CDpdf__2021_11_26_09_19_08.pdf

Regular expression

Lecture: Regular Expressions and Regular Languages

theory of computation lecture 02

Lecture Notes-Are Natural Languages Regular.pdf

01-Introduction&Languages.pdf

Theory of Automata Lesson 02

, Phonological systems are rule-governed; that is, they operat.docx

A Statistical Model for Morphology Inspired by the Amis Language

A statistical model for morphology inspired by the Amis language

Module 1 TOC.pptx

Lesson 02

Free Ebooks Download ! Edhole

Mba ebooks ! Edhole

A language can be seen as a system suitable for expression of certai.pdf

Recently uploaded

Introduction to Multiple Access Protocol.pptxupamatechverse

Extrusion Processes and Their Limitations120cr0395

High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat

(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat

High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile

The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat

Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis

Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control

Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia

Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile

Introduction and different types of Ethernet.pptxupamatechverse

Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile

Introduction to IEEE STANDARDS and its different types.pptxupamatechverse

AKTU Computer Networks notes --- Unit 3.pdfankushspencer015

(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat

CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani

Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth

The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat

Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N

Recently uploaded (20)

Introduction to Multiple Access Protocol.pptx

Extrusion Processes and Their Limitations

High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts

(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts

High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts

The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...

Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...

Water Industry Process Automation & Control Monthly - April 2024

Software Development Life Cycle By Team Orange (Dept. of Pharmacy)

Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...

Introduction and different types of Ethernet.pptx

Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...

Introduction to IEEE STANDARDS and its different types.pptx

AKTU Computer Networks notes --- Unit 3.pdf

(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...

CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record

Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...

The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...

Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE

Specification of Token

1. SPECIFICATION OF TOKENS There are 3 specifications of tokens: 1. Strings 2. Language 3. Regular expression An alphabet is any finite set of symbols. Typical examples of symbols are letters, digits, and punctuation. The set {0,1) is the binary alphabet. A string over an alphabet is a finite sequence of symbols drawn from that alphabet. In language theory, the terms "sentence" and "word" are often used as synonyms for "string." The length of a string s, usually written |s|, is the number of occurrences of symbols in s. For example, banana is a string of length six. The empty string, denoted ∊, is the string of length zero. A language is any countable set of strings over some fixed alphabet. This definition is very broad. Abstract languages like ∅, the empty set, or {∊}, the set containing only the empty string, are languages under this definition. Operations on strings The following string-related terms are commonly used:  A prefix of string s is any string obtained by removing zero or more symbols from the end of string s. For example, ban is a prefix of banana.  A suffix of string s is any string obtained by removing zero or more symbols from the beginning of s. For example, nana is a suffix of banana.  A substring of s is obtained by deleting any prefix and any suffix from s. For example, nan is a substring of banana.  The proper prefixes, suffixes, and substrings of a string s are those prefixes, suffixes, and substrings, respectively of s that are not ε or not equal to s itself.  A subsequence of s is any string formed by deleting zero or more not necessarily consecutive positions of s. For example, baan is a subsequence of banana. Operations on languages: The following are the operations that can be applied to languages: 1. Union 2. Concatenation 3. Kleene closure 4. Positive closure which are defined formally in Fig. 1.

2. Figure 1: Definitions of operations on languages Union is the familiar operation on sets. The concatenation of languages is all strings formed by taking a string from the first language and a string from the second language, in all possible ways, and concatenating them. The (Kleene) closure of a language L, denoted L*, is the set of strings we get by concatenating L zero or more times. Note that Lo, the "concatenation of L zero times," is defined to be {∊), and inductively, Li is Li-1. Finally, the positive closure, denoted L+, is the same as the Kleene closure, but without the term Lo. That is, ∊ will not be in L+ unless it is in L itself. The following example shows the operations on strings: Let L={0,1} and S={a,b,c}

Specification of Token

Recommended

Recommended

More Related Content

What's hot

What's hot (19)

Similar to Specification of Token

Similar to Specification of Token (20)

More from A. S. M. Shafi

More from A. S. M. Shafi (20)

Recently uploaded

Recently uploaded (20)

Specification of Token