SlideShare a Scribd company logo
IOSR Journal of Computer Engineering (IOSR-JCE)
e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 2, Ver. III (Mar – Apr. 2015), PP 27-29
www.iosrjournals.org
DOI: 10.9790/0661-17232729 www.iosrjournals.org 27 | Page
Class Diagram Extraction from Textual Requirements Using NLP
Techniques
Vrushali Adhav1
, Darshana Ahire2
, Aarti Jadhav3
, Dipali Lokhande4
1,2,3,4,
(Computer Engineering, MET’s BKC IOE/ Savitribai Phule Pune University, India)
Abstract: A new method for translating software requirements to object-oriented model is proposed. Software
requirements specified in a natural language using scenario like format will be transformed into class diagrams
of Unified Modeling Language using specification transformation rule. The transformation rule is based on a
grammatical analysis, more specifically syntactical analysis, of requirements specification. This paper proposes
a new method for translating software requirements specified using natural language to formal specification (in
this context is executable and translatable Unified Modeling Language Class Diagram). Requirements
specification written in a scenario like format will be transformed into class diagram’s components the
automation of class generation from natural language requirements is highly challenging. This paper proposes
a method and a tool to facilitate requirements analysis process and class diagram extraction from textual
requirements supporting natural language processing techniques. Requirements engineers analyze requirements
manually to come out with analysis artifacts such as class diagram.
Keywords: NL (Natural Language), NLP (Natural Language processing), POS (Part Of Speech), RACE
(Requirement Analysis and Class Diagram Extraction), UML (Unified Modeling Language)
I. Introduction
Requirements are often poorly understood and articulated, even by domain experts. It is common to
find requirements specifications that are incomplete, inconsistent, ambiguous, and unstable. This problem
emerges since software requirements are inherently complex and involving various stakeholders. The fact that
specifying requirements should involve various stakeholders, who normally have no technical knowledge on
software analysis and design, imposes developer (software analyst and designer) to use specification media
which can be easily and intuitively understood by novice. In most practices, therefore, practitioners choose to
specify software requirements using natural languages. The reason why natural language is preferred as the
specification medium is because most stakeholders are more familiar with natural language compared to using
other types of media (such as formal notation or modeling language). Requirements specified in a natural
language are, however, normally written in a free form style. They still therefore need to be further analyzed and
transformed into more formal specification especially for development and documentation purposes. In a
number of software development methodologies, such as translate approach, formal specification is important
since software (source) code is obtained by translating it.
II. Related Work
a. Need of system [4]
One of the most challenging and costly functions performed by IT staff today is the deployment of
various software‟s to new or existing client computers. Currently, organizations spend a great deal of time and
expense planning, designing, and rolling out the latest version of the operating system throughout the
organization. Often this process is done manually, requiring a help desk professional to physically visit each
computer. This paper will assist analyst by providing an efficient and fast way to produce the class diagram for
their natural language requirement.
b. Existing Tools [3]
There have been several efforts for the analysis of natural language requirements. However, few are
focused on class diagram extraction from natural language requirements. Thus, few tools exist to assist analysts in
the extraction of class diagram. In this section we survey the works that use NLP or domain ontology techniques
to analyze NL requirements, and the works that aim to extract class diagram based on NLP or domain ontology
techniques. Following are some tools:
i. CIRCE
Ambriola and Gervasi present a Web-based environment called Circe. Circe helps in the elicitation,
selection, and validation of the software requirements. It can build semi-formal models, extract information from
the NL requirements, and measure the consistency of these models. Circe gives the user a complete environment
Class Diagram Extraction from Textual Requirements Using NLP Techniques
DOI: 10.9790/0661-17232729 www.iosrjournals.org 28 | Page
that integrates a number of tools. Cico is the main tool that is considered as a front-end for the other components;
it recognizes the NL sentences and extracts some facts from them. These facts are handed to the remaining tools
for graphical representation and analysis. Natural language sentence of requirements is parsed converted into a
forest of parse trees. These parse trees closely correspond to the original requirements but many surface features
of requirement document are not included in abstract UML model.
ii. Zhou and Zhou
It proposes a methodology that uses NLP and domain ontology. It is based on that the core classes are
always semantically connected to each other‟s by one to one, one to many, or many to many relationships in the
domain. This methodology Class Diagram from textual requirements using NLP techniques finds candidate
classes using NLP through a part of speech (POS) tagger, a link grammar ,parser, linguistic patterns and parallel
structure, and then the domain ontology is used to refine the result.
iii. LOLITA
Mich L. proposes a NLP system, LOLITA to generate an object model automatically from natural
language. This approach considers nouns as objects and use links to find relationships amongst objects. LOLITA
system is built on a large scale Semantic Network (SN) that does not distinguish between classes, attributes, and
objects. This approach is limited to extract objects and cannot identify classes. Hence for to overcome this
problem we are trying developing our system that mean we think how to solve this problem and we find solution
on this problem and we can apply on this our paper.
c. RACE System [2]
RACE system is decomposed into internal and external components and sub-systems. Figure 1
illustrates the architecture model of RACE:
i. OPEN NLP Parser
OpenNLP is an open-source and re-usable algorithm. It provides our system with lexical and syntactic
parsers. OpenNLP POS tagger (lexical) takes the English text as input and outputs the corresponding POS tags
for each word; On the other hand, OpenNLP Chunker (syntactic) chunks the sentence into phrases (Noun
phrase, verb phrase, etc.) according to English language grammar. The high accuracy and speed in OpenNLP
encouraged us to choose it rather than other existing parsers. OpenNLP uses lexical and syntactic annotations to
denote to the part of speech of the terms; for example, NN denotes to Proper Noun, VB denotes to Verb, and NP
denotes to Noun Phrase. OpenNLP parser supports our system with an efficient way to find the terms‟ part of
speech (POS) which we need in order to accomplish the noun and verb analysis.
ii. RACE Stemming Algorithm
Stemming is a technique that abbreviates word by removing affixes and suffixes. In RACE system, it is
very important to return words back to its base form; this will reduce the redundancy and increase the efficiency
of the system. To perform the stemming, we implemented a new stemming algorithm. Based on the stemming
result, we find that our stemming algorithm is efficient and sufficient to be used in the morphological analysis of
requirements in RACE system. Our stemming algorithm is simple and re-usable.
iii. WORDNET
WordNet is used to validate the semantic correctness of the sentences generated at the syntactic analysis.
It also enables users to display all hypernyms for a selected noun. We used this feature to verify Generalization
relationship where a noun phrase is supposed to be „a kind of‟ another noun phrase. WordNet can used to find
semantically similar terms, and for the acquisition of synonyms. We used synonyms to extract words which are
semantically related to each other. We calculated the words frequency to keep the synonyms with high
frequency in the document.
iv. Concept Extraction Engine
The aim of this module is to extract concepts according to the requirements document. This module uses
OpenNLP parser in, RACE stemming algorithm, and WordNet into extract concepts related to the given
requirements.
i. Class Extraction Engine
This module uses the output of concept extraction engine module and applies different heuristic rules to
extract the class diagram; However, We use domain ontology in this module to refine the extracted class
diagram. We can summarize the heuristic rules.
Class Diagram Extraction from Textual Requirements Using NLP Techniques
DOI: 10.9790/0661-17232729 www.iosrjournals.org 29 | Page
i. Domain Ontology
As mentioned early in this paper, domain ontology is used to improve the performance of concepts
identification. In RACE system we use a Library system Ontology as sample ontology. We gathered our
ontology based on a survey we conducted on some library systems, and then we organize the ontology in such
way to be easily maintained and re-used. The information in the ontology includes concepts related to classes
for Library system, Attributes, and relationships. We used the XML to build the ontology.
III. Figures And Tables
Fig 1: race system
IV. Conclusion
In this paper, we propose an enhanced approach that is based on NLP techniques to support the
extraction of class diagram from NL requirements. We validate our approach by implementing a system called
RACE referred to as “Requirements Analysis and class diagram Extraction”. RACE system efficiently
demonstrates the using of NLP techniques in the extraction of class diagram from informal requirements.
Acknowledgements
We have taken efforts in this paper. However, it would not have been possible without the kind support
and help of many individual. Our thanks and appreciation also goes to our beloved parents for their blessings, all
the staff members, friends and colleagues for providing help and support in our work.
References
Journal Papers:
[1] Booch, G. Object-Oriented Analysis and Design with Applications, 2nd Ed., Benjamin Cummings.
[2] Ambriola, V. and Gervasi, V. “Processing natural language requirements”, Proc. 12th IEEE Intl. Conf. on Automated Software
Engineering, pp.
[3] Farid Meziane, Nikos Athanasakis, Sophia Ananiadou, 2007,Generating Natural Language specifications from UML class diagrams,
Springer-Verlag London Limited .
[4] F Elizabeth D. Liddy & Jennifer H. Liddy, “An NLP Approach for Improving Access to Statistical Information for the Masses”.

More Related Content

What's hot

Oop
OopOop
Integrating natural language processing and software engineering
Integrating natural language processing and software engineeringIntegrating natural language processing and software engineering
Integrating natural language processing and software engineering
Nakul Sharma
 
Benchmarking nlp toolkits for enterprise application
Benchmarking nlp toolkits for enterprise applicationBenchmarking nlp toolkits for enterprise application
Benchmarking nlp toolkits for enterprise application
Conference Papers
 
Hindi language as a graphical user interface to relational database for tran...
Hindi language as a graphical user interface to relational  database for tran...Hindi language as a graphical user interface to relational  database for tran...
Hindi language as a graphical user interface to relational database for tran...
IRJET Journal
 
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
ijnlc
 
Requirement Analysis - ijcee 2(3)
Requirement Analysis - ijcee 2(3)Requirement Analysis - ijcee 2(3)
Requirement Analysis - ijcee 2(3)
IT Industry
 
Oops
OopsOops
Cohesive Software Design
Cohesive Software DesignCohesive Software Design
Cohesive Software Design
ijtsrd
 
POP vs OOP Introduction
POP vs OOP IntroductionPOP vs OOP Introduction
POP vs OOP Introduction
Hashni T
 
Icsme16.ppt
Icsme16.pptIcsme16.ppt
Unit 1 OOSE
Unit 1 OOSE Unit 1 OOSE
Unit 1 OOSE
ChhayaShelake
 
Principles of object oriented programming
Principles of object oriented programmingPrinciples of object oriented programming
Principles of object oriented programming
Amogh Kalyanshetti
 
NL based Object Oriented modeling - EJSR 35(1)
NL based Object Oriented modeling - EJSR 35(1)NL based Object Oriented modeling - EJSR 35(1)
NL based Object Oriented modeling - EJSR 35(1)
IT Industry
 
IRJET- An Efficient Way to Querying XML Database using Natural Language
IRJET-  	  An Efficient Way to Querying XML Database using Natural LanguageIRJET-  	  An Efficient Way to Querying XML Database using Natural Language
IRJET- An Efficient Way to Querying XML Database using Natural Language
IRJET Journal
 

What's hot (15)

Oop
OopOop
Oop
 
Integrating natural language processing and software engineering
Integrating natural language processing and software engineeringIntegrating natural language processing and software engineering
Integrating natural language processing and software engineering
 
Benchmarking nlp toolkits for enterprise application
Benchmarking nlp toolkits for enterprise applicationBenchmarking nlp toolkits for enterprise application
Benchmarking nlp toolkits for enterprise application
 
Hindi language as a graphical user interface to relational database for tran...
Hindi language as a graphical user interface to relational  database for tran...Hindi language as a graphical user interface to relational  database for tran...
Hindi language as a graphical user interface to relational database for tran...
 
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
 
Requirement Analysis - ijcee 2(3)
Requirement Analysis - ijcee 2(3)Requirement Analysis - ijcee 2(3)
Requirement Analysis - ijcee 2(3)
 
Oops
OopsOops
Oops
 
Cohesive Software Design
Cohesive Software DesignCohesive Software Design
Cohesive Software Design
 
POP vs OOP Introduction
POP vs OOP IntroductionPOP vs OOP Introduction
POP vs OOP Introduction
 
Icsme16.ppt
Icsme16.pptIcsme16.ppt
Icsme16.ppt
 
Unit 1 OOSE
Unit 1 OOSE Unit 1 OOSE
Unit 1 OOSE
 
Principles of object oriented programming
Principles of object oriented programmingPrinciples of object oriented programming
Principles of object oriented programming
 
NL based Object Oriented modeling - EJSR 35(1)
NL based Object Oriented modeling - EJSR 35(1)NL based Object Oriented modeling - EJSR 35(1)
NL based Object Oriented modeling - EJSR 35(1)
 
IRJET- An Efficient Way to Querying XML Database using Natural Language
IRJET-  	  An Efficient Way to Querying XML Database using Natural LanguageIRJET-  	  An Efficient Way to Querying XML Database using Natural Language
IRJET- An Efficient Way to Querying XML Database using Natural Language
 
Java chapter 3
Java   chapter 3Java   chapter 3
Java chapter 3
 

Viewers also liked

B018110915
B018110915B018110915
B018110915
IOSR Journals
 
F012463754
F012463754F012463754
F012463754
IOSR Journals
 
F1303053542
F1303053542F1303053542
F1303053542
IOSR Journals
 
M010527678
M010527678M010527678
M010527678
IOSR Journals
 
Design analysis and Commissioning Of High Mast Lighting Poles
Design analysis and Commissioning Of High Mast Lighting PolesDesign analysis and Commissioning Of High Mast Lighting Poles
Design analysis and Commissioning Of High Mast Lighting Poles
IOSR Journals
 
F0433338
F0433338F0433338
F0433338
IOSR Journals
 
A Novel Edge Detection Technique for Image Classification and Analysis
A Novel Edge Detection Technique for Image Classification and AnalysisA Novel Edge Detection Technique for Image Classification and Analysis
A Novel Edge Detection Technique for Image Classification and Analysis
IOSR Journals
 
I01245562
I01245562I01245562
I01245562
IOSR Journals
 
Y01226152157
Y01226152157Y01226152157
Y01226152157
IOSR Journals
 
D012311522
D012311522D012311522
D012311522
IOSR Journals
 
F1102013841
F1102013841F1102013841
F1102013841
IOSR Journals
 
J017316670
J017316670J017316670
J017316670
IOSR Journals
 
J012647278
J012647278J012647278
J012647278
IOSR Journals
 
F012552934
F012552934F012552934
F012552934
IOSR Journals
 
G013144149
G013144149G013144149
G013144149
IOSR Journals
 
Development and Characterization of Adsorbent from Rice Husk Ash to Bleach Ve...
Development and Characterization of Adsorbent from Rice Husk Ash to Bleach Ve...Development and Characterization of Adsorbent from Rice Husk Ash to Bleach Ve...
Development and Characterization of Adsorbent from Rice Husk Ash to Bleach Ve...
IOSR Journals
 
G013135664
G013135664G013135664
G013135664
IOSR Journals
 
C013112225
C013112225C013112225
C013112225
IOSR Journals
 
C012512230
C012512230C012512230
C012512230
IOSR Journals
 

Viewers also liked (20)

B018110915
B018110915B018110915
B018110915
 
F012463754
F012463754F012463754
F012463754
 
F1303053542
F1303053542F1303053542
F1303053542
 
M010527678
M010527678M010527678
M010527678
 
Design analysis and Commissioning Of High Mast Lighting Poles
Design analysis and Commissioning Of High Mast Lighting PolesDesign analysis and Commissioning Of High Mast Lighting Poles
Design analysis and Commissioning Of High Mast Lighting Poles
 
F0433338
F0433338F0433338
F0433338
 
C0951827
C0951827C0951827
C0951827
 
A Novel Edge Detection Technique for Image Classification and Analysis
A Novel Edge Detection Technique for Image Classification and AnalysisA Novel Edge Detection Technique for Image Classification and Analysis
A Novel Edge Detection Technique for Image Classification and Analysis
 
I01245562
I01245562I01245562
I01245562
 
Y01226152157
Y01226152157Y01226152157
Y01226152157
 
D012311522
D012311522D012311522
D012311522
 
F1102013841
F1102013841F1102013841
F1102013841
 
J017316670
J017316670J017316670
J017316670
 
J012647278
J012647278J012647278
J012647278
 
F012552934
F012552934F012552934
F012552934
 
G013144149
G013144149G013144149
G013144149
 
Development and Characterization of Adsorbent from Rice Husk Ash to Bleach Ve...
Development and Characterization of Adsorbent from Rice Husk Ash to Bleach Ve...Development and Characterization of Adsorbent from Rice Husk Ash to Bleach Ve...
Development and Characterization of Adsorbent from Rice Husk Ash to Bleach Ve...
 
G013135664
G013135664G013135664
G013135664
 
C013112225
C013112225C013112225
C013112225
 
C012512230
C012512230C012512230
C012512230
 

Similar to D017232729

Ontology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval SystemOntology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval System
IJTET Journal
 
IRJET- A Novel Approch Automatically Categorizing Software Technologies
IRJET- A Novel Approch Automatically Categorizing Software TechnologiesIRJET- A Novel Approch Automatically Categorizing Software Technologies
IRJET- A Novel Approch Automatically Categorizing Software Technologies
IRJET Journal
 
team10.ppt.pptx
team10.ppt.pptxteam10.ppt.pptx
team10.ppt.pptx
REMEGIUSPRAVEENSAHAY
 
Finding Bad Code Smells with Neural Network Models
Finding Bad Code Smells with Neural Network Models Finding Bad Code Smells with Neural Network Models
Finding Bad Code Smells with Neural Network Models
IJECEIAES
 
IRJET- Speech Based Answer Sheet Evaluation System
IRJET- Speech Based Answer Sheet Evaluation SystemIRJET- Speech Based Answer Sheet Evaluation System
IRJET- Speech Based Answer Sheet Evaluation System
IRJET Journal
 
Novel Database-Centric Framework for Incremental Information Extraction
Novel Database-Centric Framework for Incremental Information ExtractionNovel Database-Centric Framework for Incremental Information Extraction
Novel Database-Centric Framework for Incremental Information Extraction
ijsrd.com
 
IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...
IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...
IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...
IRJET Journal
 
Requirment
RequirmentRequirment
Requirmentstat
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUECOMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
Journal For Research
 
Re2018 Semios for Requirements
Re2018 Semios for RequirementsRe2018 Semios for Requirements
Re2018 Semios for Requirements
Clément Portet
 
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
IRJET Journal
 
Software_Engineering_Presentation (1).pptx
Software_Engineering_Presentation (1).pptxSoftware_Engineering_Presentation (1).pptx
Software_Engineering_Presentation (1).pptx
ArifaMehreen1
 
An efficient tool for reusable software
An efficient tool for reusable softwareAn efficient tool for reusable software
An efficient tool for reusable softwareprjpublications
 
Automatic Text Summarization using Natural Language Processing
Automatic Text Summarization using Natural Language ProcessingAutomatic Text Summarization using Natural Language Processing
Automatic Text Summarization using Natural Language Processing
IRJET Journal
 
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
kevig
 
Overlapping optimization with parsing through metagrammars
Overlapping optimization with parsing through metagrammarsOverlapping optimization with parsing through metagrammars
Overlapping optimization with parsing through metagrammarsIAEME Publication
 
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
SBGC
 
Implementation of Semantic Analysis Using Domain Ontology
Implementation of Semantic Analysis Using Domain OntologyImplementation of Semantic Analysis Using Domain Ontology
Implementation of Semantic Analysis Using Domain Ontology
IOSR Journals
 

Similar to D017232729 (20)

Ontology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval SystemOntology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval System
 
IRJET- A Novel Approch Automatically Categorizing Software Technologies
IRJET- A Novel Approch Automatically Categorizing Software TechnologiesIRJET- A Novel Approch Automatically Categorizing Software Technologies
IRJET- A Novel Approch Automatically Categorizing Software Technologies
 
team10.ppt.pptx
team10.ppt.pptxteam10.ppt.pptx
team10.ppt.pptx
 
Finding Bad Code Smells with Neural Network Models
Finding Bad Code Smells with Neural Network Models Finding Bad Code Smells with Neural Network Models
Finding Bad Code Smells with Neural Network Models
 
IRJET- Speech Based Answer Sheet Evaluation System
IRJET- Speech Based Answer Sheet Evaluation SystemIRJET- Speech Based Answer Sheet Evaluation System
IRJET- Speech Based Answer Sheet Evaluation System
 
Novel Database-Centric Framework for Incremental Information Extraction
Novel Database-Centric Framework for Incremental Information ExtractionNovel Database-Centric Framework for Incremental Information Extraction
Novel Database-Centric Framework for Incremental Information Extraction
 
IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...
IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...
IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...
 
Requirment
RequirmentRequirment
Requirment
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUECOMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
 
Re2018 Semios for Requirements
Re2018 Semios for RequirementsRe2018 Semios for Requirements
Re2018 Semios for Requirements
 
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
 
Software_Engineering_Presentation (1).pptx
Software_Engineering_Presentation (1).pptxSoftware_Engineering_Presentation (1).pptx
Software_Engineering_Presentation (1).pptx
 
An efficient tool for reusable software
An efficient tool for reusable softwareAn efficient tool for reusable software
An efficient tool for reusable software
 
Automatic Text Summarization using Natural Language Processing
Automatic Text Summarization using Natural Language ProcessingAutomatic Text Summarization using Natural Language Processing
Automatic Text Summarization using Natural Language Processing
 
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
 
Je2516241630
Je2516241630Je2516241630
Je2516241630
 
Je2516241630
Je2516241630Je2516241630
Je2516241630
 
Overlapping optimization with parsing through metagrammars
Overlapping optimization with parsing through metagrammarsOverlapping optimization with parsing through metagrammars
Overlapping optimization with parsing through metagrammars
 
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
 
Implementation of Semantic Analysis Using Domain Ontology
Implementation of Semantic Analysis Using Domain OntologyImplementation of Semantic Analysis Using Domain Ontology
Implementation of Semantic Analysis Using Domain Ontology
 

More from IOSR Journals

A011140104
A011140104A011140104
A011140104
IOSR Journals
 
M0111397100
M0111397100M0111397100
M0111397100
IOSR Journals
 
L011138596
L011138596L011138596
L011138596
IOSR Journals
 
K011138084
K011138084K011138084
K011138084
IOSR Journals
 
J011137479
J011137479J011137479
J011137479
IOSR Journals
 
I011136673
I011136673I011136673
I011136673
IOSR Journals
 
G011134454
G011134454G011134454
G011134454
IOSR Journals
 
H011135565
H011135565H011135565
H011135565
IOSR Journals
 
F011134043
F011134043F011134043
F011134043
IOSR Journals
 
E011133639
E011133639E011133639
E011133639
IOSR Journals
 
D011132635
D011132635D011132635
D011132635
IOSR Journals
 
C011131925
C011131925C011131925
C011131925
IOSR Journals
 
B011130918
B011130918B011130918
B011130918
IOSR Journals
 
A011130108
A011130108A011130108
A011130108
IOSR Journals
 
I011125160
I011125160I011125160
I011125160
IOSR Journals
 
H011124050
H011124050H011124050
H011124050
IOSR Journals
 
G011123539
G011123539G011123539
G011123539
IOSR Journals
 
F011123134
F011123134F011123134
F011123134
IOSR Journals
 
E011122530
E011122530E011122530
E011122530
IOSR Journals
 
D011121524
D011121524D011121524
D011121524
IOSR Journals
 

More from IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
D011132635
D011132635D011132635
D011132635
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 

Recently uploaded

Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
UiPathCommunity
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
Vlad Stirbu
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 

Recently uploaded (20)

Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 

D017232729

  • 1. IOSR Journal of Computer Engineering (IOSR-JCE) e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 2, Ver. III (Mar – Apr. 2015), PP 27-29 www.iosrjournals.org DOI: 10.9790/0661-17232729 www.iosrjournals.org 27 | Page Class Diagram Extraction from Textual Requirements Using NLP Techniques Vrushali Adhav1 , Darshana Ahire2 , Aarti Jadhav3 , Dipali Lokhande4 1,2,3,4, (Computer Engineering, MET’s BKC IOE/ Savitribai Phule Pune University, India) Abstract: A new method for translating software requirements to object-oriented model is proposed. Software requirements specified in a natural language using scenario like format will be transformed into class diagrams of Unified Modeling Language using specification transformation rule. The transformation rule is based on a grammatical analysis, more specifically syntactical analysis, of requirements specification. This paper proposes a new method for translating software requirements specified using natural language to formal specification (in this context is executable and translatable Unified Modeling Language Class Diagram). Requirements specification written in a scenario like format will be transformed into class diagram’s components the automation of class generation from natural language requirements is highly challenging. This paper proposes a method and a tool to facilitate requirements analysis process and class diagram extraction from textual requirements supporting natural language processing techniques. Requirements engineers analyze requirements manually to come out with analysis artifacts such as class diagram. Keywords: NL (Natural Language), NLP (Natural Language processing), POS (Part Of Speech), RACE (Requirement Analysis and Class Diagram Extraction), UML (Unified Modeling Language) I. Introduction Requirements are often poorly understood and articulated, even by domain experts. It is common to find requirements specifications that are incomplete, inconsistent, ambiguous, and unstable. This problem emerges since software requirements are inherently complex and involving various stakeholders. The fact that specifying requirements should involve various stakeholders, who normally have no technical knowledge on software analysis and design, imposes developer (software analyst and designer) to use specification media which can be easily and intuitively understood by novice. In most practices, therefore, practitioners choose to specify software requirements using natural languages. The reason why natural language is preferred as the specification medium is because most stakeholders are more familiar with natural language compared to using other types of media (such as formal notation or modeling language). Requirements specified in a natural language are, however, normally written in a free form style. They still therefore need to be further analyzed and transformed into more formal specification especially for development and documentation purposes. In a number of software development methodologies, such as translate approach, formal specification is important since software (source) code is obtained by translating it. II. Related Work a. Need of system [4] One of the most challenging and costly functions performed by IT staff today is the deployment of various software‟s to new or existing client computers. Currently, organizations spend a great deal of time and expense planning, designing, and rolling out the latest version of the operating system throughout the organization. Often this process is done manually, requiring a help desk professional to physically visit each computer. This paper will assist analyst by providing an efficient and fast way to produce the class diagram for their natural language requirement. b. Existing Tools [3] There have been several efforts for the analysis of natural language requirements. However, few are focused on class diagram extraction from natural language requirements. Thus, few tools exist to assist analysts in the extraction of class diagram. In this section we survey the works that use NLP or domain ontology techniques to analyze NL requirements, and the works that aim to extract class diagram based on NLP or domain ontology techniques. Following are some tools: i. CIRCE Ambriola and Gervasi present a Web-based environment called Circe. Circe helps in the elicitation, selection, and validation of the software requirements. It can build semi-formal models, extract information from the NL requirements, and measure the consistency of these models. Circe gives the user a complete environment
  • 2. Class Diagram Extraction from Textual Requirements Using NLP Techniques DOI: 10.9790/0661-17232729 www.iosrjournals.org 28 | Page that integrates a number of tools. Cico is the main tool that is considered as a front-end for the other components; it recognizes the NL sentences and extracts some facts from them. These facts are handed to the remaining tools for graphical representation and analysis. Natural language sentence of requirements is parsed converted into a forest of parse trees. These parse trees closely correspond to the original requirements but many surface features of requirement document are not included in abstract UML model. ii. Zhou and Zhou It proposes a methodology that uses NLP and domain ontology. It is based on that the core classes are always semantically connected to each other‟s by one to one, one to many, or many to many relationships in the domain. This methodology Class Diagram from textual requirements using NLP techniques finds candidate classes using NLP through a part of speech (POS) tagger, a link grammar ,parser, linguistic patterns and parallel structure, and then the domain ontology is used to refine the result. iii. LOLITA Mich L. proposes a NLP system, LOLITA to generate an object model automatically from natural language. This approach considers nouns as objects and use links to find relationships amongst objects. LOLITA system is built on a large scale Semantic Network (SN) that does not distinguish between classes, attributes, and objects. This approach is limited to extract objects and cannot identify classes. Hence for to overcome this problem we are trying developing our system that mean we think how to solve this problem and we find solution on this problem and we can apply on this our paper. c. RACE System [2] RACE system is decomposed into internal and external components and sub-systems. Figure 1 illustrates the architecture model of RACE: i. OPEN NLP Parser OpenNLP is an open-source and re-usable algorithm. It provides our system with lexical and syntactic parsers. OpenNLP POS tagger (lexical) takes the English text as input and outputs the corresponding POS tags for each word; On the other hand, OpenNLP Chunker (syntactic) chunks the sentence into phrases (Noun phrase, verb phrase, etc.) according to English language grammar. The high accuracy and speed in OpenNLP encouraged us to choose it rather than other existing parsers. OpenNLP uses lexical and syntactic annotations to denote to the part of speech of the terms; for example, NN denotes to Proper Noun, VB denotes to Verb, and NP denotes to Noun Phrase. OpenNLP parser supports our system with an efficient way to find the terms‟ part of speech (POS) which we need in order to accomplish the noun and verb analysis. ii. RACE Stemming Algorithm Stemming is a technique that abbreviates word by removing affixes and suffixes. In RACE system, it is very important to return words back to its base form; this will reduce the redundancy and increase the efficiency of the system. To perform the stemming, we implemented a new stemming algorithm. Based on the stemming result, we find that our stemming algorithm is efficient and sufficient to be used in the morphological analysis of requirements in RACE system. Our stemming algorithm is simple and re-usable. iii. WORDNET WordNet is used to validate the semantic correctness of the sentences generated at the syntactic analysis. It also enables users to display all hypernyms for a selected noun. We used this feature to verify Generalization relationship where a noun phrase is supposed to be „a kind of‟ another noun phrase. WordNet can used to find semantically similar terms, and for the acquisition of synonyms. We used synonyms to extract words which are semantically related to each other. We calculated the words frequency to keep the synonyms with high frequency in the document. iv. Concept Extraction Engine The aim of this module is to extract concepts according to the requirements document. This module uses OpenNLP parser in, RACE stemming algorithm, and WordNet into extract concepts related to the given requirements. i. Class Extraction Engine This module uses the output of concept extraction engine module and applies different heuristic rules to extract the class diagram; However, We use domain ontology in this module to refine the extracted class diagram. We can summarize the heuristic rules.
  • 3. Class Diagram Extraction from Textual Requirements Using NLP Techniques DOI: 10.9790/0661-17232729 www.iosrjournals.org 29 | Page i. Domain Ontology As mentioned early in this paper, domain ontology is used to improve the performance of concepts identification. In RACE system we use a Library system Ontology as sample ontology. We gathered our ontology based on a survey we conducted on some library systems, and then we organize the ontology in such way to be easily maintained and re-used. The information in the ontology includes concepts related to classes for Library system, Attributes, and relationships. We used the XML to build the ontology. III. Figures And Tables Fig 1: race system IV. Conclusion In this paper, we propose an enhanced approach that is based on NLP techniques to support the extraction of class diagram from NL requirements. We validate our approach by implementing a system called RACE referred to as “Requirements Analysis and class diagram Extraction”. RACE system efficiently demonstrates the using of NLP techniques in the extraction of class diagram from informal requirements. Acknowledgements We have taken efforts in this paper. However, it would not have been possible without the kind support and help of many individual. Our thanks and appreciation also goes to our beloved parents for their blessings, all the staff members, friends and colleagues for providing help and support in our work. References Journal Papers: [1] Booch, G. Object-Oriented Analysis and Design with Applications, 2nd Ed., Benjamin Cummings. [2] Ambriola, V. and Gervasi, V. “Processing natural language requirements”, Proc. 12th IEEE Intl. Conf. on Automated Software Engineering, pp. [3] Farid Meziane, Nikos Athanasakis, Sophia Ananiadou, 2007,Generating Natural Language specifications from UML class diagrams, Springer-Verlag London Limited . [4] F Elizabeth D. Liddy & Jennifer H. Liddy, “An NLP Approach for Improving Access to Statistical Information for the Masses”.