Integrated Intelligent Research (IIR) International Journal of Data Mining Techniques and Applications
Volume: 01 Issue: 02 December 2012 Page No.36-39
ISSN: 2278-2419
Improvement of Software Maintenance and
Reliability using Data Mining Techniques
Yethiraj N G
Assistant Professor, Department of Computer Science
Maharani’s Science College for Women, Bangalore, India
Abstract- Software is ubiquitous in our daily life. It brings us
great convenience, and a big headache about software reliability
as well: software is never bug-free, and software bugs keep
incurring monetary loss or even catastrophes. In the pursuit of
better reliability, software engineering researchers have found that
a huge amount of data in various forms can be collected from
software systems, and that these data, when properly analyzed, can
help improve software reliability. Unfortunately, the huge
volume of complex data makes simple analysis techniques
inadequate; consequently, studies have been
resorting to data mining for more effective analysis. In the past
few years, we have witnessed many studies on mining for
software reliability reported in data mining as well as software
engineering forums. These studies either develop new data mining
techniques or apply existing ones to tackle reliability problems
from different angles. To keep data mining researchers
abreast of the latest developments in this growing research area,
we propose this paper on data mining for software reliability.
In this paper, we present a comprehensive overview of this
area, examine representative studies, and lay out challenges to
data mining researchers.
Key words- Software, Software Reliability, Data Mining,
Frequent Item Set, Extracting Rules.
I. INTRODUCTION
The economies of all developed nations are dependent on
software, and more and more systems are software controlled.
Software engineering is an engineering discipline concerned
with all aspects of software production, including the theories,
methods and tools for professional software development. Software
engineers should adopt a systematic and organized approach to their
work and use appropriate tools and techniques depending on
the problem to be solved, the development constraints and the
resources available. Software reliability, unlike many other
quality factors, can be measured directly and estimated using
historical and developmental data [1]. Software reliability is
defined in statistical terms as “the probability of failure-free
operation of a computer program in a specified environment
for a specified time”.
Measures of reliability- For a computer-based system, a simple
measure of reliability is mean-time-between-failure (MTBF), where
MTBF = MTTF + MTTR; the acronyms MTTF and MTTR stand for
mean-time-to-failure and mean-time-to-repair respectively [2].
Software reliability specification- Reliability is a complex concept that
should always be considered at the system level rather than the
individual component level. Because the components in a
system are interdependent, a failure in one component can
propagate through the system and affect the operation of other
components. In a computer-based system, we have to consider
three dimensions when specifying the overall system
reliability:
(i) Hardware reliability- What is the probability of a hardware
component failing, and how long would it take to repair that
component?
(ii) Software reliability- How likely is it that a
software component will produce an incorrect output?
Software failures are different from hardware failures in that
software does not wear out: it can continue operating correctly
after producing an incorrect result.
(iii) Operator reliability- How likely is it that the operator of a
system will make an error? [1].
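As a small worked illustration of the MTBF relation given above, the sketch below simply adds the two quantities. The figures of 950 and 50 hours are made-up inputs chosen for the example, not measurements from any system.

```python
def mtbf(mttf_hours: float, mttr_hours: float) -> float:
    """MTBF = MTTF + MTTR, as defined in the text."""
    return mttf_hours + mttr_hours


# Hypothetical figures: the system runs 950 hours on average before
# failing, and each repair takes 50 hours on average.
example_mtbf = mtbf(950.0, 50.0)
print(f"MTBF = {example_mtbf} hours")
```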
Following are the basic terminologies that are frequently used
for reliability:
Table-1
Term            Description
System Failure  Occurs when the system does not perform as per the user expectations.
System Error    Occurs when the system gives a result in an unexpected manner.
System Fault    A property of the system that can lead to a system error.
Human Error     Human activity that causes a system fault to occur.
Mining Software Engineering Data- The main goal is to
transform static, record-keeping software engineering data into
active data so that hidden patterns and trends can be
explored. Software is normally “full of bugs”: Windows
2000, containing 35 million lines of code, had 63,000
known bugs at the time of release, about 2 per 1000 lines. Software
failure costs are becoming very high. A study by the National
Institute of Standards and Technology found that software
errors cost the U.S. economy about $59.5 billion annually.
Testing and debugging are therefore laborious and expensive: “50% of
my company employees are testers, and the rest spends 50% of
their time testing!” (Bill Gates, 1995). Software is also complex
in general; for example, MySQL has 1.2 million LOC, and its
runtime data is larger and more complex. Finding bugs
is challenging: it requires
specifications/properties, which often don’t exist, and
substantial human effort in analysing data [3].
Software Reliability Methods are:
• Static Bug Detection - Without running the code, detect
bugs in code,
• Dynamic Bug Detection (aka Testing) - Run the code with
some test inputs and detect failures/bugs, and
• Debugging - Given known test failures (symptoms),
pinpoint the bug locations in the code.
Mining for Software Reliability is needed because:
i. Finding bugs is challenging. It requires
specifications/properties, which often don’t exist, and
also requires substantial human effort in analyzing
data.
ii. We can mine common patterns as likely
specifications/properties and detect violations of patterns
as likely bugs.
iii. We can mine huge data for patterns or locations to
narrow down the scope of human inspection.
II. TECHNIQUES
The software engineering tasks helped by data mining are (i)
programming, (ii) defect detection, (iii) testing, (iv) debugging
and (v) maintenance. The data mining techniques
are (i) Classification, (ii) Association, (iii) Pattern Detection and
(iv) Clustering [4].
The software engineering data
considered are (i) code bases, (ii)
change history, (iii) program states, (iv)
structural entities and (v) bug reports [5].
III. ANALYSIS
Data mining for software bug detection relies on frequent
pattern mining. Automated debugging in software
programs then proceeds from frequent patterns to software bugs
and statistical debugging. Further, automated debugging in
computer systems covers (i) automated diagnosis
of system misconfigurations and (ii) performance debugging
[6].
A. Software Bug Detection
Common approach: mining rules/patterns from source
code/revision histories and detecting bugs as rule/pattern
violations.
B. Mining rules from source code
i. Bugs as deviant behaviour [Engler et al., SOSP’01]
ii. Mining programming rules with PR-Miner [Li et al.,
FSE’05]
iii. Mining function precedence protocols [Ramanathan et
al., ICSE’07]
iv. Revealing neglected conditions [Chang et al.,
ISSTA’07]
C. Mining rules from revision histories
i. DynaMine [Livshits & Zimmermann, FSE’05]
D. Mining copy-paste patterns from source code
i. CP-Miner [Li et al., OSDI’04] to find copy-paste bugs
[7].
Bugs as Deviant Behaviour
Static verification tools need rules to check against program
code. To find errors without knowing the truth:
• Contradiction in belief. To find lies, cross-examine
one witness or many witnesses. Any contradiction is an
error (internal consistency).
• Deviation from common behaviour. To infer correct
behaviour: if 1 person does X, it might be right or a
coincidence. If 1000s do X and 1 does Y, Y is probably an
error (statistical analysis).
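The "deviation from common behaviour" idea can be sketched in a few lines. This is only an illustration under stated assumptions: the site names, the behaviour labels "X" and "Y", and the 0.9 ratio are hypothetical, and this is not the checker described by Engler et al.

```python
from collections import Counter


def find_deviants(observations, min_ratio=0.9):
    """observations maps a code site to the behaviour seen there.

    Returns (dominant behaviour, sorted deviating sites), or (None, [])
    when no behaviour is common enough to be treated as a belief.
    """
    if not observations:
        return None, []
    counts = Counter(observations.values())
    behaviour, freq = counts.most_common(1)[0]
    if freq / len(observations) < min_ratio:
        return None, []
    deviants = sorted(s for s, b in observations.items() if b != behaviour)
    return behaviour, deviants


# Hypothetical data: 1000 sites do X and a single site does Y.
sites = {f"site_{i}": "X" for i in range(1000)}
sites["site_odd"] = "Y"
print(find_deviants(sites))
```

The ratio threshold stands in for the statistical ranking step: a behaviour seen at only half of the sites is not treated as a rule, so nothing is flagged.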
IV. A BRIEF METHODOLOGY: SOFTWARE BUG
DETECTION
Based on the discussion presented in the previous section, the
following steps for software bug detection are presented.
Step 1: Mining rules from source code [8]
• Bugs as deviant behaviour [Engler et al., SOSP’01]
• Mining techniques: Statistical analysis
• Mining programming rules with PR-Miner [Li et al.,
FSE’05]
• Mining function precedence protocols [Ramanathan et
al., ICSE’07]
• Revealing neglected conditions [Chang et al.,
ISSTA’07]
Step 2: Mining copy-paste patterns from source code
• CP-Miner [Li et al., OSDI’04] to find copy-paste bugs
An Overview of Extracting Rules-
Observation: program elements are usually used
together. Idea: find associations among elements that
are frequently used together in source code, which implies frequent
item set mining [9]. Example: spin_lock_irqsave and
spin_unlock_irqrestore
appear together within the same function more than 3600
times.
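The observation above amounts to counting how often two elements appear in the same function. A minimal sketch follows, assuming each function has already been reduced to the list of names it uses; the four toy functions and the names list_add and kfree are hypothetical inputs, not data from the paper.

```python
from collections import Counter
from itertools import combinations


def pair_support(functions):
    """Count, for every pair of elements, how many functions use both."""
    counts = Counter()
    for elements in functions.values():
        for pair in combinations(sorted(set(elements)), 2):
            counts[pair] += 1
    return counts


# Hypothetical function bodies reduced to the names they use.
functions = {
    "f1": ["spin_lock_irqsave", "list_add", "spin_unlock_irqrestore"],
    "f2": ["spin_lock_irqsave", "spin_unlock_irqrestore"],
    "f3": ["spin_lock_irqsave", "kfree", "spin_unlock_irqrestore"],
    "f4": ["kfree"],
}
support = pair_support(functions)
print(support[("spin_lock_irqsave", "spin_unlock_irqrestore")])
```

Pairs are stored as sorted tuples so that the same two names always map to the same key regardless of the order in which they appear.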
Step 3: Mining Programming Patterns and Generation of Rules -
Parsing Source Code - Purpose: building an item set database.
Each element (function call, variable, data type, etc.) is mapped to a
number, so the source code is mapped to an item set database. A
frequent item set mining algorithm is then applied to the
item set database, and each frequent sub-item set corresponds to a
programming pattern.
E.g., {39, 68, 36, 92}:27 corresponds to the pattern
{Scsi_Host, host_alloc, add_host, scan_host}
• Tradeoff: consider order or not
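The parsing step can be pictured as follows. This sketch assigns sequential integer IDs for readability, whereas the flowchart of the paper mentions hashing; the ID scheme and the two toy functions are assumptions made for the example.

```python
def build_itemset_db(functions):
    """Map each element name to an integer and each function to an item set.

    functions is a list of lists of element names (one list per function).
    Order inside a function is discarded, which is the "do not consider
    order" side of the tradeoff.
    """
    ids = {}
    db = []
    for elements in functions:
        itemset = set()
        for name in elements:
            ids.setdefault(name, len(ids))  # assign the next free ID
            itemset.add(ids[name])
        db.append(frozenset(itemset))
    return ids, db


# Hypothetical functions reduced to the elements they use.
ids, db = build_itemset_db([
    ["Scsi_Host", "host_alloc", "add_host", "scan_host"],
    ["Scsi_Host", "host_alloc", "add_host"],
])
print(ids)
print(db)
```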
Step 4: Generating Programming Rules - Programming patterns
are turned into programming rules.
E.g., Patterns: {a, b, d} : 3,
{a} : 4
Fig.1 Flowchart of Extracting Rules: Source files -> Pre-Processing (parsing & hashing) -> Itemsets -> Mining -> Programming patterns -> Post-Processing (generating rules) -> Programming rules
Rules:
{a} => {b,d} with confidence = 3/4 = 75%
{b} => {a,d} with confidence = 100%
{d} => {a,b} with confidence = 100%
{a,b} => {d} with confidence = 100%
{a,d} => {b} with confidence = 100%
{b,d} => {a} with confidence = 100%
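The confidences listed above can be reproduced with a short sketch. The four transactions are a hypothetical database chosen to be consistent with the supports {a, b, d}:3 and {a}:4; the function names are illustrative.

```python
from itertools import combinations


def support(itemset, transactions):
    """Number of transactions that contain every item of itemset."""
    return sum(1 for t in transactions if itemset <= t)


def rules_from_pattern(pattern, transactions):
    """Return {(lhs, rhs): confidence} for every split of the pattern.

    Assumes the pattern occurs at least once, so no support is zero.
    """
    pattern = frozenset(pattern)
    total = support(pattern, transactions)
    rules = {}
    for size in range(1, len(pattern)):
        for lhs in combinations(sorted(pattern), size):
            rhs = tuple(sorted(pattern - set(lhs)))
            rules[(lhs, rhs)] = total / support(frozenset(lhs), transactions)
    return rules


# Hypothetical database: {a, b, d} occurs 3 times, {a} alone once.
transactions = [frozenset("abd")] * 3 + [frozenset("a")]
rules = rules_from_pattern("abd", transactions)
for (lhs, rhs), conf in rules.items():
    print(set(lhs), "=>", set(rhs), f"{conf:.0%}")
```

Confidence here is support(pattern) divided by support(left-hand side), which is why only the rule starting from {a} drops below 100%.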
Rule Explosion Problem
• Exponential number of rules
• Solution: closed mining (a pattern is closed if no proper
superset has the same support)
Example:
{a,b,d}:3, {a}:4
{a,b}:3, {a,d}:3, {b,d}:3 are not closed
• Closed patterns:
{a,b,d}:3 | {a}:4
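A minimal sketch of the closedness check on the example above: a pattern is dropped when some proper superset has the same support. The support table is typed in by hand from the example, and the quadratic scan is an illustration only, not how a closed-pattern miner works internally.

```python
def closed_itemsets(supports):
    """Keep only item sets with no proper superset of equal support."""
    closed = {}
    for itemset, count in supports.items():
        has_equal_superset = any(
            itemset < other and count == other_count
            for other, other_count in supports.items()
        )
        if not has_equal_superset:
            closed[itemset] = count
    return closed


# Supports taken from the example in the text.
supports = {
    frozenset("a"): 4,
    frozenset("ab"): 3,
    frozenset("ad"): 3,
    frozenset("bd"): 3,
    frozenset("abd"): 3,
}
closed = closed_itemsets(supports)
print(closed)
```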
Detection of Violations
A programming rule is considered to have violations when:
(i) The rule holds for most cases
• Confidence > threshold
(ii) The rule is violated for a few cases
• Confidence < 100%
Example: Detecting Violations
Step 5: Programming patterns:
{Scsi_Host, host_alloc, add_host, scan_host}: 27
{Scsi_Host, host_alloc, add_host}: 29
Programming rule:
{Scsi_Host, host_alloc, add_host} =>
{scan_host}
with confidence 27/29 = 93%
The 2 cases that match the left-hand side but are missing
scan_host are flagged as violations.
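The violation check for the rule {Scsi_Host, host_alloc, add_host} => {scan_host} can be sketched as below. The function names ok_0 ... ok_26, bad_0 and bad_1 and the 0.9 threshold are hypothetical; only the counts 27 and 29 come from the example.

```python
def find_violations(lhs, rhs, functions, threshold=0.9):
    """Return (confidence, violating functions) for the rule lhs => rhs.

    Violators are reported only when threshold < confidence < 100%.
    """
    lhs, rhs = set(lhs), set(rhs)
    matching = {n: set(items) for n, items in functions.items() if lhs <= set(items)}
    if not matching:
        return 0.0, []
    violators = sorted(n for n, items in matching.items() if not rhs <= items)
    confidence = (len(matching) - len(violators)) / len(matching)
    if threshold < confidence < 1.0:
        return confidence, violators
    return confidence, []


full = ["Scsi_Host", "host_alloc", "add_host", "scan_host"]
# Hypothetical code base: 27 functions follow the rule, 2 omit scan_host.
functions = {f"ok_{i}": full for i in range(27)}
functions["bad_0"] = full[:3]
functions["bad_1"] = full[:3]

confidence, violators = find_violations(full[:3], ["scan_host"], functions)
print(f"{confidence:.0%}", violators)
```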
Table 2: Some Results of Bug Detection
Software     #C files   LOC         #functions
Linux        3,538      3,037,403   73,607
PostgreSQL   409        381,192     6,964
Apache       160        84,724      1,912
Table-3: Inspected (top 60)
Software     Bugs   Anomalies   False Positives
Linux        16     20          24
PostgreSQL   6      9           45
Apache       1      0           6
V. LIMITATIONS OF PR-MINER
Rules across multiple functions
• Not using inter-procedural analysis
False negatives of violations in control paths
• Not using sophisticated analysis techniques
• Inter-procedural, path-sensitive inference of function
precedence protocols addresses these limitations
[Ramanathan et al., ICSE’07] [10].
We shall now discuss Mining Function Precedence Protocols.
fp = fopen(…);
fclose(…);
a) Definition:- Precedence protocol:
A call to fclose is always preceded by a call to fopen
b) Definition:- Successor protocol:
A call to fopen is always succeeded by a call to fclose
c) Violation of Precedence Protocols
fp = fopen(…);
if (fp == NULL)
    exit(-1);
fclose(…);
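The two definitions can be checked on individual execution paths, each written as a list of call names. This is only a sketch of the definitions: the traces are hypothetical, and it does not reproduce the static, inter-procedural analysis that CHRONICLER performs.

```python
def violates_precedence(trace, first, second):
    """True if `second` is called on this path before any call to `first`."""
    seen_first = False
    for call in trace:
        if call == first:
            seen_first = True
        elif call == second and not seen_first:
            return True
    return False


def violates_successor(trace, first, second):
    """True if the last call to `first` on this path is never followed by `second`."""
    pending = False
    for call in trace:
        if call == first:
            pending = True
        elif call == second:
            pending = False
    return pending


# Hypothetical paths through code like the snippet above.
normal_path = ["fopen", "fclose"]
exit_path = ["fopen", "exit"]   # fp == NULL branch
bad_path = ["fclose"]           # fclose with no earlier fopen

print(violates_precedence(bad_path, "fopen", "fclose"))
print(violates_successor(exit_path, "fopen", "fclose"))
```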
d) Tool Implementation/Evaluation
CHRONICLER, a tool implemented in C, has the following
features:
Tested on open source C programs:
Apache, Linux, OpenSSH, GIMP, PostgreSQL
Lines of code vary from 66K to 2M
Number of call-sites varies from 10K to 110K
e) Some Results of Precedence-Related Bug Detection
Case Study: Linux
Hardware Bug
• Difficult to detect using traditional testing techniques
• Platform-dependent error
• Transparently identified using CHRONICLER
Performance Bug
• Cache lookup operation was absent
• Not easily specified as a bug for testing
• Deviation delays data write flushes [11].
f) Limitation of Precedence-Related Bug Detection
• Does not take data flow or data dependency into
account
• A new approach to discovering neglected conditions
[Chang et al., ISSTA’07] addresses the issue
• Based on dependence analysis, frequent item set, and
frequent subgraph mining
g) Crucial Observation
Things that are frequently changed together often form a
pattern, also known as co-change. Co-changed items = patterns
h) Finding Patterns
Find “frequent itemsets” (with Apriori)
o.enterAlignment()
o.exitAlignment()
o.redoAlignment()
iter.hasNext()
iter.next()
{enterAlignment(), exitAlignment(),
redoAlignment()}
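A compact sketch of Apriori-style frequent item set mining over co-changes follows. The four check-ins and the minimum support of 3 are hypothetical; the method names are the ones listed above. It is a textbook-style illustration, not the DynaMine implementation.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for all item sets meeting min_support."""
    transactions = [frozenset(t) for t in transactions]
    candidates = {frozenset([i]) for t in transactions for i in t}
    frequent = {}
    k = 1
    while candidates:
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        level = {c: n for c, n in counts.items() if n >= min_support}
        frequent.update(level)
        k += 1
        # Join: unions of frequent (k-1)-sets that contain exactly k items.
        joined = {a | b for a, b in combinations(level, 2) if len(a | b) == k}
        # Prune: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in joined
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        }
    return frequent


# Hypothetical check-ins, each listing the method calls added together.
commits = [
    ["enterAlignment", "exitAlignment", "redoAlignment"],
    ["enterAlignment", "exitAlignment", "redoAlignment", "hasNext"],
    ["enterAlignment", "exitAlignment", "redoAlignment"],
    ["hasNext", "next"],
]
frequent = apriori(commits, min_support=3)
print(max(frequent, key=len))
```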
i) Ranking Patterns
Support count = #occurrences of a pattern
Confidence count = strength of a pattern, P(A|B)
j) Pattern classification
Post-process: each pattern has v validations and e violations.
Usage patterns:    e < v/10
Error patterns:    v/10 <= e <= 2v
Unlikely patterns: otherwise
Fig. 2
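The thresholds of Fig. 2 translate directly into a small function. The function name and the sample counts are illustrative.

```python
def classify_pattern(v, e):
    """Classify a pattern from v validations and e violations (Fig. 2)."""
    if e < v / 10:
        return "usage"
    if e <= 2 * v:
        return "error"
    return "unlikely"


# Hypothetical counts for three patterns.
for v, e in [(100, 5), (100, 40), (100, 250)]:
    print(v, e, classify_pattern(v, e))
```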
Results of Mining Patterns
Usage Pattern - 15
Error Pattern - 8
Unlikely Pattern - 11
Not Hit - 24
Total - 56 Patterns
Mining into Computer Systems
Huge volume of data from
computer systems:
Persistent state interactions, event logs, network logs, CPU
usage …
Mining system data for …
Reliability
Performance
Manageability …
VI. CONCLUSION
Challenges in data mining:
• Statistical modelling of computer systems
• Online, scalability, interpretability …
Data Mining for Software Bug Detection:
• Frequent pattern mining
Automated Debugging in Software Programs:
• From frequent patterns to software bugs
• Statistical debugging
Automated Debugging in Computer Systems:
• Automated diagnosis of system misconfigurations
Limitations of Bugs as Deviant Behaviour:
• Fixed rule templates
• Need specific knowledge about the software
• Only 2 elements
PR-Miner [Li et al., FSE’05] (mining implicit programming rules)
was developed to address these limitations:
• General method (no prior knowledge; no templates)
• General rules (different types: function, variable,
data type, etc.; multiple elements)
Ubiquitous computing demands reliable software, which motivates
mining for software reliability:
• Mining program source code/version histories to find bugs
• Mining program runtime data to locate why an execution fails
• Mining system snapshots to diagnose misconfigurations and
performance problems
An active and rewarding research area:
• International Workshop on Mining Software Repositories since 2004
• SIGCOMM Workshop on Mining Network Data since 2005
• Systems and Machine Learning Workshop since 2006
• Workshop on Statistical Learning Techniques for Solving Systems
Problems, co-located with NIPS
REFERENCES
[1] Ian Sommerville, Software Engineering, 8th edition, Pearson Education
Publications, 2007.
[2] Roger S. Pressman, Software Engineering: A Practitioner’s Approach, 6th
edition, McGraw-Hill International edition Publications, 2005.
[3] James S. Peters & Witold Pedrycz, Software Engineering: An Engineering
Approach, Wiley Publications, 2000.
[4] Jiawei Han & Micheline Kamber, Data Mining: Concepts and Techniques,
2nd edition, Elsevier Publications, March 2006.
[5] Chao Liu, Long Fei, Xifeng Yan, Jiawei Han and Samuel Midkiff,
Statistical Debugging: A Hypothesis Testing-based Approach, IEEE TSE
2006.
[6] Dawson Engler, David Yu Chen, Seth Hallem, Andy Chou and Benjamin
Chelf, Bugs as Deviant Behaviour: A General Approach to Inferring
Errors in Systems Code, SOSP 2001.
[7] Zhenmin Li, Shan Lu, Suvda Myagmar and Yuanyuan Zhou, CP-Miner: A
Tool for Finding Copy-paste and Related Bugs in Operating System Code,
OSDI 2004.
[8] Prof. S. Chitra & Dr. M. Rajaram, A Software Reliability Estimation Tool
using Artificial Immune Recognition System, Proceedings of the
International MultiConference of Engineers and Computer Scientists 2008,
vol 1, IMECS 2008, 19-21 March 2008, Hong Kong.
[9] Leon Wu, Boyi Xie, Gail Kaiser & Rebecca Passonneau, Department of
Computer Science, Columbia University, New York NY 10027 USA,
BUGMINER: Software Reliability Analysis via Data Mining of Bug
Reports, 2007.
[10] Swapna S. Gokhale, Member, IEEE, A Simulation Approach to
Structure-based Software Reliability Analysis, IEEE Transactions on
Software Engineering, vol 31, No. 8, August 2005.
[11] Simon P. Wilson and Francisco J. Samaniego, Nonparametric Analysis of
the Order-statistic Model in Software Reliability, IEEE Transactions on
Software Engineering, vol 33, No. 3, March 2007.

More Related Content

What's hot

A method for detecting abnormal program behavior on embedded devices
A method for detecting abnormal program behavior on embedded devicesA method for detecting abnormal program behavior on embedded devices
A method for detecting abnormal program behavior on embedded devicesRaja Ram
 
Using Fuzzy Clustering and Software Metrics to Predict Faults in large Indust...
Using Fuzzy Clustering and Software Metrics to Predict Faults in large Indust...Using Fuzzy Clustering and Software Metrics to Predict Faults in large Indust...
Using Fuzzy Clustering and Software Metrics to Predict Faults in large Indust...IOSR Journals
 
A PROPOSED MODEL FOR DIMENSIONALITY REDUCTION TO IMPROVE THE CLASSIFICATION C...
A PROPOSED MODEL FOR DIMENSIONALITY REDUCTION TO IMPROVE THE CLASSIFICATION C...A PROPOSED MODEL FOR DIMENSIONALITY REDUCTION TO IMPROVE THE CLASSIFICATION C...
A PROPOSED MODEL FOR DIMENSIONALITY REDUCTION TO IMPROVE THE CLASSIFICATION C...IJNSA Journal
 
Implementation of Secured Network Based Intrusion Detection System Using SVM ...
Implementation of Secured Network Based Intrusion Detection System Using SVM ...Implementation of Secured Network Based Intrusion Detection System Using SVM ...
Implementation of Secured Network Based Intrusion Detection System Using SVM ...IRJET Journal
 
A self adaptive learning approach for optimum path evaluation of process for ...
A self adaptive learning approach for optimum path evaluation of process for ...A self adaptive learning approach for optimum path evaluation of process for ...
A self adaptive learning approach for optimum path evaluation of process for ...Alexander Decker
 
A STATIC MALWARE DETECTION SYSTEM USING DATA MINING METHODS
A STATIC MALWARE DETECTION SYSTEM USING DATA MINING METHODSA STATIC MALWARE DETECTION SYSTEM USING DATA MINING METHODS
A STATIC MALWARE DETECTION SYSTEM USING DATA MINING METHODSijaia
 
COMPARISON OF MALWARE CLASSIFICATION METHODS USING CONVOLUTIONAL NEURAL NETWO...
COMPARISON OF MALWARE CLASSIFICATION METHODS USING CONVOLUTIONAL NEURAL NETWO...COMPARISON OF MALWARE CLASSIFICATION METHODS USING CONVOLUTIONAL NEURAL NETWO...
COMPARISON OF MALWARE CLASSIFICATION METHODS USING CONVOLUTIONAL NEURAL NETWO...IJNSA Journal
 
J034057065
J034057065J034057065
J034057065ijceronline
 
Critical analysis of genetic algorithm based IDS and an approach for detecti...
Critical analysis of genetic algorithm based IDS and an approach  for detecti...Critical analysis of genetic algorithm based IDS and an approach  for detecti...
Critical analysis of genetic algorithm based IDS and an approach for detecti...IOSR Journals
 
Dynamic Taint Analysis Tools: A Review
Dynamic Taint Analysis Tools: A ReviewDynamic Taint Analysis Tools: A Review
Dynamic Taint Analysis Tools: A ReviewCSCJournals
 
IRJET- Two Factor Authentication using User Behavioural Analytics
IRJET- Two Factor Authentication using User Behavioural AnalyticsIRJET- Two Factor Authentication using User Behavioural Analytics
IRJET- Two Factor Authentication using User Behavioural AnalyticsIRJET Journal
 
A Study of Intrusion Detection System Methods in Computer Networks
A Study of Intrusion Detection System Methods in Computer NetworksA Study of Intrusion Detection System Methods in Computer Networks
A Study of Intrusion Detection System Methods in Computer NetworksEditor IJCATR
 
Zero day malware detection
Zero day malware detectionZero day malware detection
Zero day malware detectionsujeeshkumarj
 
IRJET- Proximity Detection Warning System using Ray Casting
IRJET- Proximity Detection Warning System using Ray CastingIRJET- Proximity Detection Warning System using Ray Casting
IRJET- Proximity Detection Warning System using Ray CastingIRJET Journal
 
A new approach for formal behavioral
A new approach for formal behavioralA new approach for formal behavioral
A new approach for formal behavioralijfcstjournal
 
IRJET- An Intrusion Detection Framework based on Binary Classifiers Optimized...
IRJET- An Intrusion Detection Framework based on Binary Classifiers Optimized...IRJET- An Intrusion Detection Framework based on Binary Classifiers Optimized...
IRJET- An Intrusion Detection Framework based on Binary Classifiers Optimized...IRJET Journal
 
To use the concept of Data Mining and machine learning concept for Cyber secu...
To use the concept of Data Mining and machine learning concept for Cyber secu...To use the concept of Data Mining and machine learning concept for Cyber secu...
To use the concept of Data Mining and machine learning concept for Cyber secu...Nishant Mehta
 
Comparative Performance Analysis of Machine Learning Techniques for Software ...
Comparative Performance Analysis of Machine Learning Techniques for Software ...Comparative Performance Analysis of Machine Learning Techniques for Software ...
Comparative Performance Analysis of Machine Learning Techniques for Software ...csandit
 

What's hot (20)

A method for detecting abnormal program behavior on embedded devices
A method for detecting abnormal program behavior on embedded devicesA method for detecting abnormal program behavior on embedded devices
A method for detecting abnormal program behavior on embedded devices
 
Using Fuzzy Clustering and Software Metrics to Predict Faults in large Indust...
Using Fuzzy Clustering and Software Metrics to Predict Faults in large Indust...Using Fuzzy Clustering and Software Metrics to Predict Faults in large Indust...
Using Fuzzy Clustering and Software Metrics to Predict Faults in large Indust...
 
A PROPOSED MODEL FOR DIMENSIONALITY REDUCTION TO IMPROVE THE CLASSIFICATION C...
A PROPOSED MODEL FOR DIMENSIONALITY REDUCTION TO IMPROVE THE CLASSIFICATION C...A PROPOSED MODEL FOR DIMENSIONALITY REDUCTION TO IMPROVE THE CLASSIFICATION C...
A PROPOSED MODEL FOR DIMENSIONALITY REDUCTION TO IMPROVE THE CLASSIFICATION C...
 
Implementation of Secured Network Based Intrusion Detection System Using SVM ...
Implementation of Secured Network Based Intrusion Detection System Using SVM ...Implementation of Secured Network Based Intrusion Detection System Using SVM ...
Implementation of Secured Network Based Intrusion Detection System Using SVM ...
 
A self adaptive learning approach for optimum path evaluation of process for ...
A self adaptive learning approach for optimum path evaluation of process for ...A self adaptive learning approach for optimum path evaluation of process for ...
A self adaptive learning approach for optimum path evaluation of process for ...
 
A STATIC MALWARE DETECTION SYSTEM USING DATA MINING METHODS
A STATIC MALWARE DETECTION SYSTEM USING DATA MINING METHODSA STATIC MALWARE DETECTION SYSTEM USING DATA MINING METHODS
A STATIC MALWARE DETECTION SYSTEM USING DATA MINING METHODS
 
COMPARISON OF MALWARE CLASSIFICATION METHODS USING CONVOLUTIONAL NEURAL NETWO...
COMPARISON OF MALWARE CLASSIFICATION METHODS USING CONVOLUTIONAL NEURAL NETWO...COMPARISON OF MALWARE CLASSIFICATION METHODS USING CONVOLUTIONAL NEURAL NETWO...
COMPARISON OF MALWARE CLASSIFICATION METHODS USING CONVOLUTIONAL NEURAL NETWO...
 
J034057065
J034057065J034057065
J034057065
 
Critical analysis of genetic algorithm based IDS and an approach for detecti...
Critical analysis of genetic algorithm based IDS and an approach  for detecti...Critical analysis of genetic algorithm based IDS and an approach  for detecti...
Critical analysis of genetic algorithm based IDS and an approach for detecti...
 
Dynamic Taint Analysis Tools: A Review
Dynamic Taint Analysis Tools: A ReviewDynamic Taint Analysis Tools: A Review
Dynamic Taint Analysis Tools: A Review
 
IRJET- Two Factor Authentication using User Behavioural Analytics
IRJET- Two Factor Authentication using User Behavioural AnalyticsIRJET- Two Factor Authentication using User Behavioural Analytics
IRJET- Two Factor Authentication using User Behavioural Analytics
 
A Study of Intrusion Detection System Methods in Computer Networks
A Study of Intrusion Detection System Methods in Computer NetworksA Study of Intrusion Detection System Methods in Computer Networks
A Study of Intrusion Detection System Methods in Computer Networks
 
Zero day malware detection
Zero day malware detectionZero day malware detection
Zero day malware detection
 
IRJET- Proximity Detection Warning System using Ray Casting
IRJET- Proximity Detection Warning System using Ray CastingIRJET- Proximity Detection Warning System using Ray Casting
IRJET- Proximity Detection Warning System using Ray Casting
 
A new approach for formal behavioral
A new approach for formal behavioralA new approach for formal behavioral
A new approach for formal behavioral
 
IRJET- An Intrusion Detection Framework based on Binary Classifiers Optimized...
IRJET- An Intrusion Detection Framework based on Binary Classifiers Optimized...IRJET- An Intrusion Detection Framework based on Binary Classifiers Optimized...
IRJET- An Intrusion Detection Framework based on Binary Classifiers Optimized...
 
To use the concept of Data Mining and machine learning concept for Cyber secu...
To use the concept of Data Mining and machine learning concept for Cyber secu...To use the concept of Data Mining and machine learning concept for Cyber secu...
To use the concept of Data Mining and machine learning concept for Cyber secu...
 
Msc dare journal 1
Msc dare journal 1Msc dare journal 1
Msc dare journal 1
 
A035401010
A035401010A035401010
A035401010
 
Comparative Performance Analysis of Machine Learning Techniques for Software ...
Comparative Performance Analysis of Machine Learning Techniques for Software ...Comparative Performance Analysis of Machine Learning Techniques for Software ...
Comparative Performance Analysis of Machine Learning Techniques for Software ...
 

Similar to Improve Software Reliability Data Mining

A Review on Software Mining: Current Trends and Methodologies
A Review on Software Mining: Current Trends and MethodologiesA Review on Software Mining: Current Trends and Methodologies
A Review on Software Mining: Current Trends and MethodologiesIJERA Editor
 
1. Emergence of Software EngineeringIn the software industry, we.docx
1. Emergence of Software EngineeringIn the software industry, we.docx1. Emergence of Software EngineeringIn the software industry, we.docx
1. Emergence of Software EngineeringIn the software industry, we.docxjackiewalcutt
 
Software engineering study materials
Software engineering study materialsSoftware engineering study materials
Software engineering study materialssmruti sarangi
 
Privacy Preserving Mining in Code Profiling Data
Privacy Preserving Mining in Code Profiling DataPrivacy Preserving Mining in Code Profiling Data
Privacy Preserving Mining in Code Profiling DataDr. Amarjeet Singh
 
CRIME EXPLORATION AND FORECAST
CRIME EXPLORATION AND FORECASTCRIME EXPLORATION AND FORECAST
CRIME EXPLORATION AND FORECASTIRJET Journal
 
Bt0081 software engineering
Bt0081 software engineeringBt0081 software engineering
Bt0081 software engineeringTechglyphs
 
INTERNAL Assign no 207( JAIPUR NATIONAL UNI)
INTERNAL Assign no   207( JAIPUR NATIONAL UNI)INTERNAL Assign no   207( JAIPUR NATIONAL UNI)
INTERNAL Assign no 207( JAIPUR NATIONAL UNI)Partha_bappa
 
lake city institute of technology
lake city institute of technology lake city institute of technology
lake city institute of technology RaviKalola786
 
Sofware Engineering Important Past Paper 2019
Sofware Engineering Important Past Paper 2019Sofware Engineering Important Past Paper 2019
Sofware Engineering Important Past Paper 2019MuhammadTalha436
 
David vernon software_engineering_notes
David vernon software_engineering_notesDavid vernon software_engineering_notes
David vernon software_engineering_notesmitthudwivedi
 
Clone of an organization
Clone of an organizationClone of an organization
Clone of an organizationIRJET Journal
 
Application of Data Mining Techniques for Improving Continuous Integration
Application of Data Mining Techniques for Improving Continuous IntegrationApplication of Data Mining Techniques for Improving Continuous Integration
Application of Data Mining Techniques for Improving Continuous IntegrationDr. Amarjeet Singh
 
Introduction to Software Reverse Engineering
Introduction to Software Reverse EngineeringIntroduction to Software Reverse Engineering
Introduction to Software Reverse EngineeringTeodoro Cipresso
 
Software Bug Detection Algorithm using Data mining Techniques
Software Bug Detection Algorithm using Data mining TechniquesSoftware Bug Detection Algorithm using Data mining Techniques
Software Bug Detection Algorithm using Data mining TechniquesAM Publications
 
Comparative performance analysis
Comparative performance analysisComparative performance analysis
Comparative performance analysiscsandit
 
Parameter Estimation of GOEL-OKUMOTO Model by Comparing ACO with MLE Method
Parameter Estimation of GOEL-OKUMOTO Model by Comparing ACO with MLE MethodParameter Estimation of GOEL-OKUMOTO Model by Comparing ACO with MLE Method
Parameter Estimation of GOEL-OKUMOTO Model by Comparing ACO with MLE MethodIRJET Journal
 
Different Methodologies For Testing Web Application Testing
Different Methodologies For Testing Web Application TestingDifferent Methodologies For Testing Web Application Testing
Different Methodologies For Testing Web Application TestingRachel Davis
 

Improve Software Reliability Data Mining

Integrated Intelligent Research (IIR) International Journal of Data Mining Techniques and Applications
Volume: 01 Issue: 02 December 2012 Page No.36-39
ISSN: 2278-2419

Improvement of Software Maintenance and Reliability using Data Mining Techniques

Yethiraj N G
Assistant Professor, Department of Computer Science
Maharani’s Science College for Women, Bangalore, India

Abstract- Software is ubiquitous in our daily life. It brings us great convenience, but also a big headache about software reliability: software is never bug-free, and software bugs keep incurring monetary loss or even catastrophes. In the pursuit of better reliability, software engineering researchers found that a huge amount of data in various forms can be collected from software systems, and that these data, when properly analyzed, can help improve software reliability. Unfortunately, the huge volume of complex data renders analysis by simple techniques inadequate; consequently, studies have been resorting to data mining for more effective analysis. In the past few years, we have witnessed many studies on mining for software reliability reported in data mining as well as software engineering forums. These studies either develop new data mining techniques or apply existing ones to tackle reliability problems from different angles. In order to keep data mining researchers abreast of the latest developments in this growing research area, we present this paper on data mining for software reliability. In this paper, we present a comprehensive overview of this area, examine representative studies, and lay out challenges to data mining researchers.

Key words- Software, Software Reliability, Data Mining, Frequent Item Set, Extracting Rules.

I. INTRODUCTION
The economies of all developed nations are dependent on software. More and more systems are software controlled. Software Engineering is concerned with theories, methods and tools for professional software development.
Software Engineering is an engineering discipline concerned with all aspects of software production. Software engineers should adopt a systematic and organized approach to their work and use appropriate tools and techniques depending on the problem to be solved, the development constraints and the resources available.

Software reliability, unlike many other quality factors, can be measured directly and estimated using historical and developmental data [1]. Software reliability is defined in statistical terms as “the probability of failure-free operation of a computer program in a specified environment for a specific time”.

Measures of reliability- For a computer-based system, a simple measure of reliability is the mean time between failures (MTBF), where MTBF = MTTF + MTTR; MTTF and MTTR are the mean time to failure and the mean time to repair, respectively [2].

Software reliability specification- Reliability is a complex concept that should always be considered at the system level rather than at the level of the individual component. Because the components in a system are interdependent, a failure in one component can propagate through the system and affect the operation of other components. In a computer-based system, we have to consider three dimensions when specifying the overall system reliability:
(i) Hardware reliability- What is the probability of a hardware component failing, and how long would it take to repair that component?
(ii) Software reliability- How likely is it that a software component will produce an incorrect output? Software failures differ from hardware failures in that software does not wear out: it can continue operating correctly after producing an incorrect result.
(iii) Operator reliability- How likely is it that the operator of a system will make an error? [1].
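The relation MTBF = MTTF + MTTR given above can be illustrated with a short calculation. The following Python sketch is only an illustration: the failure log and the function name `reliability_measures` are hypothetical and do not come from the paper.

```python
from statistics import mean


def reliability_measures(cycles):
    """Compute MTTF, MTTR and MTBF from (hours_up, hours_to_repair) pairs.

    Each pair describes one failure cycle: how long the system ran
    before failing, and how long the repair took.
    """
    mttf = mean(up for up, _ in cycles)
    mttr = mean(repair for _, repair in cycles)
    # MTBF = MTTF + MTTR, as defined in the text.
    return {"MTTF": mttf, "MTTR": mttr, "MTBF": mttf + mttr}


# Hypothetical failure log with three failure/repair cycles (hours).
log = [(100, 2), (140, 4), (120, 3)]
measures = reliability_measures(log)
print(measures)
```

With this assumed log, the mean time to failure is 120 hours and the mean time to repair is 3 hours, so one failure is expected roughly every 123 hours of elapsed time.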
The following basic terms are frequently used in discussing reliability.

Table-1
Term             Description
System Failure   Occurs when the system does not perform as the user expects.
System Error     Occurs when the system gives a result in an unexpected manner.
System Fault     The probability that a failure in the system leads to a system error.
Human Error      Human activity that causes a system fault to occur.

Mining Software Engineering Data- The main goal is to transform static, record-keeping software engineering data into active data so that hidden patterns and trends can be explored. Software is normally “full of bugs”: Windows 2000, containing 35 million lines of code, had 63,000 known bugs at the time of release, about 2 per 1000 lines. Software failure costs are becoming very high. A study by the National Institute of Standards and Technology found that software errors cost the U.S. economy about $59.5 billion annually. Testing and debugging are therefore laborious and expensive. “50% of my company employees are testers, and the rest spends 50% of their time testing!” (Bill Gates, in 1995). Software is also complex: for example, MySQL has 1.2 million LOC, and its runtime data is larger and more complex still. Finding bugs is challenging because it requires specifications/properties, which often do not exist, and substantial human effort in analysing data [3].

Software reliability methods are:
• Static Bug Detection - Without running the code, detect bugs in the code,
• Dynamic Bug Detection (a.k.a. Testing) - Run the code with some test inputs and detect failures/bugs, and
• Debugging - Given known test failures (symptoms), pinpoint the bug locations in the code.

Mining for software reliability is needed because:
i. Finding bugs is challenging. It requires specifications/properties, which often do not exist, and substantial human effort in analyzing data.
ii. We can mine common patterns as likely specifications/properties and detect violations of those patterns as likely bugs.
iii. We can mine huge data for patterns or locations to narrow down the scope of human inspection.

II. TECHNIQUES
The software engineering tasks helped by data mining are (i) programming, (ii) defect detection, (iii) testing, (iv) debugging and (v) maintenance. The data mining techniques are (i) classification, (ii) association, (iii) pattern detection and (iv) clustering [4]. The software engineering data considered are (i) code bases, (ii) change history, (iii) program states, (iv) structural entities and (v) bug reports [5].

III. ANALYSIS
Data mining for software bug detection relies on frequent pattern mining. Automated debugging in software programs then proceeds from frequent patterns to software bugs and statistical debugging. Further, automated debugging in computer systems covers (i) automated diagnosis of system misconfigurations and (ii) performance debugging [6].

A. Software Bug Detection
Common approach: mining rules/patterns from source code/revision histories and detecting bugs as rule/pattern violations.

B. Mining rules from source code
i. Bugs as deviant behaviour [Engler et al., SOSP’01]
ii. Mining programming rules with PR-Miner [Li et al., FSE’05]
iii. Mining function precedence protocols [Ramanathan et al., ICSE’07]
iv. Revealing neglected conditions [Chang et al., ISSTA’07]

C. Mining rules from revision histories
i. DynaMine [Livshits & Zimmermann, FSE’05]

D. Mining copy-paste patterns from source code
i. CP-Miner [Li et al., OSDI’04] to find copy-paste bugs [7].

Bugs as Deviant Behaviour
Static verification tools need rules to check against program code. To find errors without knowing the truth:
• Contradiction in belief. To find lies, cross-examine one witness or many witnesses. Any contradiction is an error (internal consistency).
• Deviation from common behaviour. To infer correct behaviour: if 1 person does X, it might be right or a coincidence. If 1000s do X and 1 does Y, it is probably an error (statistical analysis).

IV. A BRIEF METHODOLOGY: SOFTWARE BUG DETECTION
Based on the discussion presented in the previous section, the following steps for software bug detection are presented.

Step 1: Mining rules from source code [8]
• Bugs as deviant behaviour [Engler et al., SOSP’01]
• Mining techniques: statistical analysis
• Mining programming rules with PR-Miner [Li et al., FSE’05]
• Mining function precedence protocols [Ramanathan et al., ICSE’07]
• Revealing neglected conditions [Chang et al., ISSTA’07]

Step 2: Mining copy-paste patterns from source code
• CP-Miner [Li et al., OSDI’04] to find copy-paste bugs

An Overview of Extracting Rules- Observation: elements are usually used together. Idea: find associations among elements that are frequently used together in source code, which implies frequent item set mining [9]. Example: spin_lock_irqsave and spin_unlock_irqrestore appear together within the same function more than 3600 times.

Step 3: Mining Programming Patterns and Generation of Rules
Parsing Source Code- Purpose: building an item set database. Each element (function call, variable, data type, etc.) is mapped to a number.
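The parsing step can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the function bodies, the names `probe_a` to `probe_c`, and the regex-based extraction of calls are hypothetical simplifications, whereas a real tool works on properly parsed C code and also maps variables and data types.

```python
import re

# Hypothetical, simplified function bodies keyed by function name.
FUNCTIONS = {
    "probe_a": "host = host_alloc(); add_host(host); scan_host(host);",
    "probe_b": "host = host_alloc(); add_host(host); scan_host(host);",
    "probe_c": "host = host_alloc(); add_host(host);",
}

# Matches an identifier that is immediately followed by "(".
CALL = re.compile(r"\b([A-Za-z_]\w*)\s*\(")


def build_itemset_db(functions):
    """Map every called function to a number; return (ids, item set database)."""
    ids = {}
    db = {}
    for name, body in functions.items():
        items = set()
        for callee in CALL.findall(body):
            # Assign the next free number the first time an element is seen.
            items.add(ids.setdefault(callee, len(ids) + 1))
        db[name] = frozenset(items)
    return ids, db


ids, db = build_itemset_db(FUNCTIONS)
print(ids)
print(db)
```

Each function becomes one row of the database, so a mining algorithm only ever sees sets of integers and never the source text itself.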
The source code is thus mapped to an item set database. A frequent item set mining algorithm is applied to this database, and each frequent sub-item set corresponds to a programming pattern. E.g., {39, 68, 36, 92}:27 corresponds to the pattern {Scsi_Host, host_alloc, add_host, scan_host}.
• Tradeoff: consider order or not

Step 4: Generating Programming Rules
Programming patterns -> programming rules
E.g., Patterns: {a, b, d} : 3, {a} : 4
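The pattern counts in this example can be reproduced by plain support counting. The sketch below assumes a hypothetical four-row database chosen to be consistent with {a, b, d} : 3 and {a} : 4; it enumerates every candidate item set by brute force, which is exponential in the number of items and is meant for illustration only, not as the algorithm used by any of the cited tools.

```python
from itertools import combinations

# Hypothetical item set database: one item set per function.
DB = [{"a", "b", "d"}, {"a", "b", "d"}, {"a", "b", "d"}, {"a"}]


def frequent_itemsets(db, min_support):
    """Return {item set: support} for all item sets with support >= min_support."""
    items = sorted(set().union(*db))
    result = {}
    for size in range(1, len(items) + 1):
        for candidate in combinations(items, size):
            # Support = number of rows that contain every item of the candidate.
            count = sum(1 for row in db if set(candidate) <= row)
            if count >= min_support:
                result[frozenset(candidate)] = count
    return result


patterns = frequent_itemsets(DB, min_support=3)
for itemset, count in sorted(patterns.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), count)
```

Under these assumptions the result contains {a} with support 4 and {a, b, d} with support 3, together with the smaller subsets of {a, b, d}, each with support 3.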
Fig. 1 Flowchart of Extracting Rules: Source files, Parsing & hashing (pre-processing), Item sets, Mining, Programming patterns, Generating rules (post-processing), Programming rules

Rules:
{a} => {b,d} with confidence = ¾ = 75%
{b} => {a,d} with confidence = 100%
{d} => {a,b} with confidence = 100%
{a,b} => {d} with confidence = 100%
{a,d} => {b} with confidence = 100%
{b,d} => {a} with confidence = 100%

Rule Explosion Problem
• Exponential number of rules
• Solution: closed mining
Example: given {a,b,d}:3 and {a}:4, the item sets {a,b}:3, {a,d}:3 and {b,d}:3 are not closed, because each has the same support as its superset {a,b,d}.
• Closed item sets: {a,b,d}:3 and {a}:4

Detection of Violations
A programming rule is considered violated when:
(i) The rule holds for most cases
• Confidence > threshold
(ii) The rule is violated for a few cases
• Confidence < 100%

Step 5: Example of Detecting Violations
Programming patterns:
{Scsi_Host, host_alloc, add_host, scan_host}: 27
{Scsi_Host, host_alloc, add_host}: 29
Programming rule:
{Scsi_Host, host_alloc, add_host} => {scan_host} with confidence 27/29 = 93%
The rule is violated where scan_host is missing, i.e. in the remaining 29 - 27 = 2 cases.

Table 2: Some Results of Bug Detection
Software      #C files   LOC         #functions
Linux         3,538      3,037,403   73,607
PostgreSQL    409        381,192     6,964
Apache        160        84,724      1,912

Table 3: Violations Inspected (top 60)
Software      Bugs   Anomalies   False Positives
Linux         16     20          24
PostgreSQL    6      9           45
Apache        1      0           6

V. LIMITATIONS OF PR-MINER
Rules across multiple functions
• Not using inter-procedural analysis
False negatives of violations in control paths
• Not using sophisticated analysis techniques
• Inter-procedural, path-sensitive inference of function precedence protocols addresses these limitations [Ramanathan et al., ICSE’07] [10].
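The rule generation of Step 4 and the violation test of Step 5 can be sketched in Python as follows. This is a minimal sketch under stated assumptions: the helper names (generate_rules, suspicious_rules) are hypothetical, the support table is the {a, b, d} example from the text with the supports of the sub-item sets written out, and the threshold of 0.7 is an arbitrary illustrative value.

```python
from itertools import combinations


def generate_rules(pattern, support):
    """Return (lhs, rhs, confidence) for every split of a pattern into lhs => rhs."""
    pattern = frozenset(pattern)
    rules = []
    for k in range(1, len(pattern)):
        for lhs in combinations(sorted(pattern), k):
            lhs = frozenset(lhs)
            # confidence = support(whole pattern) / support(left-hand side)
            confidence = support[pattern] / support[lhs]
            rules.append((lhs, pattern - lhs, confidence))
    return rules


def suspicious_rules(rules, threshold):
    """Keep rules that hold in most cases (> threshold) but not in all (< 100%)."""
    return [r for r in rules if threshold < r[2] < 1.0]


# Support counts for the example in the text: {a,b,d}:3 and {a}:4.
support = {
    frozenset("a"): 4,
    frozenset("b"): 3,
    frozenset("d"): 3,
    frozenset("ab"): 3,
    frozenset("ad"): 3,
    frozenset("bd"): 3,
    frozenset("abd"): 3,
}
rules = generate_rules("abd", support)
candidates = suspicious_rules(rules, threshold=0.7)  # threshold is an assumed value

# The SCSI example: 27 occurrences of the full pattern, 29 of its left-hand side.
scsi_confidence = 27 / 29
```

Only {a} => {b,d} falls strictly between the threshold and 100%, so it is the only rule whose exceptions would be reported for inspection in this toy example.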
We shall now discuss Mining Function Precedence Protocols.
    fp = fopen(…);
    fclose(…);
a) Definition: Precedence protocol: a call to fclose is always preceded by a call to fopen.
b) Definition: Successor protocol: a call to fopen is always succeeded by a call to fclose.
c) Violation of Precedence Protocols
    fp = fopen(…);
    if (fp == NULL) exit(-1);
    fclose(…);
d) Tool Implementation/Evaluation
CHRONICLER, a tool implemented in C, has the following features:
• Tested on the open source C programs Apache, Linux, OpenSSH, GIMP and PostgreSQL
• Lines of code vary from 66K to 2M
• Number of call-sites varies from 10K to 110K
e) Some Results of Precedence-Related Bug Detection
Case Study: Linux
Hardware Bug
• Difficult to detect using traditional testing techniques
• Platform-dependent error
• Transparently identified using CHRONICLER
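The precedence and successor protocols defined above can be illustrated with a small Python sketch. It is only an approximation under stated assumptions: CHRONICLER analyses program source statically, whereas this sketch checks explicit call traces, one list of function names per execution path. The function names and the two traces, which are one possible reading of the fopen/exit/fclose snippet, are hypothetical.

```python
def precedence_confidence(traces, first, second):
    """Fraction of calls to `second` that come after a call to `first` in the same trace."""
    total = preceded = 0
    for trace in traces:
        seen_first = False
        for call in trace:
            if call == first:
                seen_first = True
            elif call == second:
                total += 1
                preceded += seen_first  # True counts as 1
    return preceded / total if total else None


def successor_confidence(traces, first, second):
    """Fraction of calls to `first` that are followed later in the trace by `second`."""
    total = followed = 0
    for trace in traces:
        for i, call in enumerate(trace):
            if call == first:
                total += 1
                followed += second in trace[i + 1:]
    return followed / total if total else None


# Two assumed paths through the snippet: fopen fails and the program exits,
# or fopen succeeds and the file is closed.
traces = [
    ["fopen", "exit"],
    ["fopen", "fclose"],
]
precedence = precedence_confidence(traces, "fopen", "fclose")
successor = successor_confidence(traces, "fopen", "fclose")
```

On these two traces every fclose is preceded by fopen, while only half of the fopen calls are followed by fclose, so the exit path is the one that stands out as a deviation.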
Performance Bug
• Cache lookup operation was absent
• Not easily specified as a bug for testing
• Deviation delays data write flushes [11].
f) Limitation of Precedence-Related Bug Detection
• Does not take data flow or data dependency into account
• A new approach to discovering neglected conditions [Chang et al., ISSTA’07] addresses the issue
• Based on dependence analysis, frequent item set mining, and frequent subgraph mining
g) Crucial Observation
Things that are frequently changed together often form a pattern, also known as co-change.
Co-changed items = patterns
h) Finding Patterns
Find “frequent itemsets” (with Apriori) among co-changed calls such as:
    o.enterAlignment()
    o.exitAlignment()
    o.redoAlignment()
    iter.hasNext()
    iter.next()
Resulting pattern: {enterAlignment(), exitAlignment(), redoAlignment()}
i) Ranking Patterns
Support count = number of occurrences of a pattern
Confidence = strength of a pattern, P(A|B)
j) Pattern Classification
Post-processing with v validations and e violations:
• Usage patterns: e < v/10
• Error patterns: v/10 <= e <= 2v
• Unlikely patterns: otherwise

Fig. 2 Results of Mining Patterns: Usage patterns: 15; Error patterns: 8; Unlikely patterns: 11; Not hit: 24; Total: 56 patterns

Pattern Mining in Computer Systems
Huge volume of data from computer systems:
• Persistent state interactions, event logs, network logs, CPU usage …
Mining system data for:
• Reliability
• Performance
• Manageability
• …

VI.
CONCLUSION
Challenges in data mining:
• Statistical modelling of computer systems
• Online, scalability, interpretability …
Data mining for software bug detection:
• Frequent pattern mining
Automated debugging in software programs:
• From frequent patterns to software bugs
• Statistical debugging
Automated debugging in computer systems:
• Automated diagnosis of system misconfigurations
Limitations of Bugs as Deviant Behaviour:
• Fixed rule templates
• Need for specific knowledge about the software
• Rules limited to 2 elements
PR-Miner [Li et al., FSE’05] (mining implicit programming rules) was developed to address these limitations:
• General method (no prior knowledge; no templates)
• General rules (different types: function, variable, data type, etc.; multiple elements)
Ubiquitous computing demands reliable software. Mining for software reliability includes:
• Mining program source code and version histories to find bugs
• Mining program runtime data to locate why an execution fails
• Mining system snapshots to diagnose misconfigurations and performance problems
This is an active and rewarding research area:
• International Workshop on Mining Software Repositories, since 2004
• SIGCOMM Workshop on Mining Network Data, since 2005
• Systems and Machine Learning Workshop, since 2006
• Workshop on Statistical Learning Techniques for Solving Systems Problems, co-located with NIPS

REFERENCES
[1] Ian Sommerville, Software Engineering, 8th edition, Pearson Education Publications, 2007.
[2] Roger S. Pressman, Software Engineering: A Practitioner’s Approach, 6th edition, McGraw-Hill International edition Publications, 2005.
[3] James F. Peters & Witold Pedrycz, Software Engineering: An Engineering Approach, Wiley Publications, 2000.
[4] Jiawei Han & Micheline Kamber, Data Mining: Concepts and Techniques, 2nd edition, Elsevier Publications, March 2006.
[5] Chao Liu, Long Fei, Xifeng Yan, Jiawei Han and Samuel Midkiff, Statistical Debugging: A Hypothesis Testing-Based Approach, IEEE TSE, 2006.
[6] Dawson Engler, David Yu Chen, Seth Hallem, Andy Chou and Benjamin Chelf, Bugs as Deviant Behaviour: A General Approach to Inferring Errors in Systems Code, SOSP 2001.
[7] Zhenmin Li, Shan Lu, Suvda Myagmar and Yuanyuan Zhou, CP-Miner: A Tool for Finding Copy-Paste and Related Bugs in Operating System Code, OSDI 2004.
[8] Prof. S. Chitra & Dr. M. Rajaram, A Software Reliability Estimation Tool using Artificial Immune Recognition System, Proceedings of the International MultiConference of Engineers and Computer Scientists 2008, vol. 1, IMECS 2008, 19-21 March 2008, Hong Kong.
[9] Leon Wu, Boyi Xie, Gail Kaiser & Rebecca Passonneau, Department of Computer Science, Columbia University, New York, NY 10027, USA, BUGMINER: Software Reliability Analysis via Data Mining of Bug Reports, 2007.
[10] Swapna S. Gokhale, Member, IEEE, A Simulation Approach to Structure-Based Software Reliability Analysis, IEEE Transactions on Software Engineering, vol. 31, no. 8, August 2005.
[11] Simon P. Wilson and Francisco J. Samaniego, Nonparametric Analysis of the Order-Statistic Model in Software Reliability, IEEE Transactions on Software Engineering, vol. 33, no. 3, March 2007.