SlideShare a Scribd company logo
1 of 12
Download to read offline
David Bracewell, et al. (Eds): AIAA 2011,CS & IT 03, pp. 157–168 , 2011.
© CS & IT-CSCP 2011 DOI : 10.5121/csit.2011.1314
A STUDY ON SIMILARITY MEASURE
FUNCTIONS ON ENGINEERING
MATERIALS SELECTION
Doreswamy1
, Manohar M G2
, Hemanth.K.S3
1,2,3
Department of Computer Science
Mangalore University
Mangalagangotri-574 199
doreswamyh@yahoo.com, manoharg.phd@gmail.com,
reachhemanthmca@gmail.com
ABSTRACT
While designing a new type of engineering material one has to search for some existing
materials which suits design requirement and then he can try to produce new kind of
engineering material. This selection process itself is tedious as he has to select few numbers of
materials out of a set of lakhs of materials. Therefore in this paper a model is proposed to select
a particular material which suits the user requirement, by using some similarity/distance
measuring functionalities. Here thirteen different types of similarity/distance measuring
functionalities are examined. Performance Index Measure(PIM) is calculated to verify the
relative performance of the selected material with the target material. Then all the results are
normalised for the purpose of analysing the results. Hence the proposed model reduces the
wastage of time in selection and also avoids the haphazardly selection of the materials in
materials design and manufacturing industries.
KEYWORDS
Similarity measure, Performance index, Materials database, Materials selection, Knowledge
Discovery
1. INTRODUCTION
Processed information is playing a major role in the success of the any industry. Only who can
apply the suitable newest information for his product development, is able to survive in the global
competition[11]. The latest developments in storage and network technologies have enabled the
fast access to information resources and repositories, but there is a lack of facilities to effectively
and efficiently search for the required information. To make use of the huge data repositories and
information, the field of knowledge extraction from these data repositories will be of major
importance in the next decade. Therefore, research on Data Mining and Knowledge discovery has
become one of the most important domains of computer science.
Usually the modern Database Management Systems (DBMS) provide fundamental utilities such
as searching, retrieving and indexing mechanisms for the management of such relational data
[12], which are well-understood and widely applied in many commercial and industrials
applications. The traditional DBMS can help the user If the structure of the information to be
searched is sufficiently simple and manageable with available data manipulation mechanism of
158 Computer Science & Information Technology (CS & IT)
the DBMS. Requirements of traditional operational data management such as accounting and
billing are perfectly met by a commercial DBMS. Therefore, the information infrastructure of
most enterprises is based on the latest Relational Database Management Systems (RDBMS).
Recently, an increase in the number of applications, processing large amounts of complex and
application specific data objects have been observed. Therefore it is the necessity of the day as
the latest trends in the industry require to analyse and use the existing huge complex data.
In application domains such as multimedia, medical imaging, Computer Aided Design
(AMPTIAC Material EASE 2 CAMS Report), Material Informatics [13], Nanoinformatics [14],
Bioinformatics [15], marketing and purchasing assistance, etc., a high efficiency of query
processing is crucial due to the immense and even increasing size of current databases. The
search in such databases, called non-standard databases, is rarely based on an exact match of
objects. Instead, the search is often based on some notion of similarity which is specific to the
application. For applications which do not only support transaction oriented search operations,
but also provide high-level strategic information or decision making. It is necessary not only to
search for objects which are similar to a given query object but rather to analyze the data set as a
whole for knowledge discovery. Information which is interesting in the decision making process
include common patterns, classifications of data, knowledge about collections of similar objects
and exceptional data.
Similar search and Data Mining have become widespread problems of modern database
applications. Similar pattern search in databases is a problem of searching extract patterns that
matches the target object/Patterns[2]. A general approach is to translate complex object into
single or multi dimensional vector by feature transformation function and then to employ search
mechanism to retrieve similar object to a given query object. Analysis of similar patterns involves
data mining techniques such as association analysis, correlation analysis, classification, cluster
analysis, outlier analysis and distance or similarity functions.
This kind of information is commonly referred to as knowledge and the process of deriving such
knowledge or higher-level information from a vast amount of transactional data is called data
mining or knowledge discovery in databases (KDD)[6]. Because such applications on top of
modern databases are also depend on similarity search [16]. The difference to traditional
similarity search applications is, however, that these applications do not only raise few, single
similarity queries but rather a high number of such queries.
This paper is organized as fallows, A brief introduction to similarity and distance measuring
functions are discussed, and a list of 13 such functions are tabled in section 2. Section 3 describes
briefly about the engineering material database. Further the performance index measure(PIM)
for material selection, Normalized measures and algorithm for similar material selection are
elaborated in the section 4. Finally experimental results and conclusion are discuses in section 5
and section 6 respectively
2. SIMILARITY AND DISTANCE FUNCTIONS
From the scientific and mathematical point of view, similarity/distance is defined as a quantitative
degree that enumerates the logical separation of two objects represented by a set of measurable
attributes/characteristics[4][5]. Measuring similarity or distance between two data points is a core
requirement for several data mining and knowledge discovery tasks that involve distance
computation. Examples include clustering (k-means), distance-based outlier detection,
classification (KNN, SVM), and several other data mining tasks. These algorithms typically treat
the similarity computation as an orthogonal step and can make use of any measure. For
continuous data sets, the Minkowski Distance is a general method used to compute distance
between two multivariate points. In particular, the Minkowski Distance of order 1 (Manhattan)
and order 2 (Euclidean) are the two most widely used distance measures for continuous data. The
key observation about the above measures is that they are independent of the underlying data set
Computer Science & Information Technology (CS & IT) 159
to which the two points belong. Several data driven measures have also been explored for
continuous data. The notion of similarity or distance for categorical data is not as straightforward
as for continuous data. The key characteristic of categorical data is that the different values that a
categorical attribute takes are not inherently ordered[7]. Thus, it is not possible to directly
compare two different categorical values. The simplest way to address similarity between two
categorical attributes is to assign a similarity of 1 if the values are identical and a similarity of 0 if
the values are not identical. For two multivariate categorical data points, the similarity between
them will be directly proportional to the number of attributes in which they match. Various
similarity measure functions are enumerated in the literature [16][17] and whose applications are
widespread in retrieving information or data from databases.
Properties of similarity / distance measure function are as follows:
0),( ≥YXD Distance is a non-negative number.
0),( =XXD The distance of an attribute to itself is zero.
),(),( XYDYXD = the distance is symmetric function
),(),(),( YXDYZDZXD ≥+ It does obey triangular inequality.
where { }niiiiiii xxxxxxX ,5,4,3,2,1, ..........,,,,,= and { }niiiiiii yyyyyyY ,5,4,3,2,1, ..........,,,,,=
are two n dimensional data sets. Any function is said to be a distance metric if it satisfies the
properties from 1 to 4, however the similarity function may not satisfy the 4th property.
Some of the distance or similarity function frequently used for information retrieval from the
databases are listed below and employed for material selection from the engineering materials
database.
Table 1 . Various Similarity/ Distance functions
1. Euclidean
Distance ( ) 





−= ∑=
n
i
iiEud YXYXD
1
2
),(
, where i = 1 . . n
(1)
2. Squared Euclidean
Distance ∑=
−=
n
i
iiEd YXYXD
1
2
),(
where i = 1 . . . n,
(2)
3. City Block
Distance ∑=
−=
n
i
iiCBd YXYXD
1
),(
, where i = 1. . . n
(3)
4. Minkowski
Distance,
pn
i
p
iiMkd YXYXD
1
1
),( 





−= ∑=
, where i = 1. .. n
(4)
5. Chebyshev
Distance ii
i
Chebd YXYXD −= max),(
, where i = 1. . . n (5)
6. Exponential
Similarity Measure ∑=
−−
+
−
=
n
i
YX
ii
iiESM
ii
e
YX
YXD
1 1
),(
, where i = 1 . .. n
(6)
160 Computer Science & Information Technology (CS & IT)
7. P-Inverse Similarity 3
4.1
1
2
),( 





−= ∑=
n
i
iiMkd YXYXD
, where i = 1 . . . n
(7)
8. Camberra
Similarity ∑= +
−
=
n
i ii
ii
yx
yx
yxC
1
),(
, where i = 1 . . . n
(8)
9. Max-Min Similarity
Measure
∑
∑
=
=
= n
i
ii
n
i
ii
MMM
YX
YX
YXD
1
1
),max(
),min(
),(
, where i = 1 . . . n
(9)
10. Geometric Average
Minimum
Similarity [ ]∑
∑
=
=
= n
i
ii
n
i
ii
Min
YX
YXMin
YXGA
1
2/1
1
.
),(
),(
, where i = 1 .. . n
(10)
11. Geometric Average
Maximum
Similarity [ ]∑
∑
=
=
= n
i
ii
n
i
ii
Max
YX
YXMax
YXGA
1
2/1
1
.
),(
),(
, where i = 1 . . n
(11)
12. Cosine Amplitude
Similarity
∑∑
∑
==
=
=
m
k
jk
m
k
ik
m
k
jkik
ji
xx
xx
r
1
2
1
2
1
,
))((
, where i, j = 1. . . n (12)
13. Correlation
Coefficient
Similarity
2/1
1
2
,
2/1
1
2
,
,
1
,
)()(
),(






−





−
−−
=
∑∑
∑
=
−
=
−
−
=
−
p
k
jkj
p
k
jki
jkj
p
k
iki
ji
yyxx
yyxx
YXγ
where ,
1
,
1
1
,
1
, ∑∑ =
−
=
−
==
p
k
kjj
p
k
kii y
p
yandx
p
x
(13)
3. ENGINEERING MATERIALS DATABASE(EMD)
It is a group of related records of engineering materials data set. Each record or an object is a
group of related attributes of materials. The attributes of materials are the characteristics or
behaviour of synthesized materials and these are measured by design engineer and manufacturing
industries following the industrial standards. Data related to engineering materials are gathered
from different information sources that include materials handbook [18] scientific literature[19]
and Internet WebPages www.matweb.com. Information structure for organizing these gathered
data are organized with scheme of Object Oriented Data Model .
4. PERFORMANCE INDEX MEASURE(PIM) FOR MATERIAL SELECTION
Performance index measure is proposed for the selection of the best among the similar materials.
Performance Index Measure is the logical difference between the absolute aggregated values of
attribute/characteristics of target material and its matching material. The best material is the one
whose performance index measure value is the minimum among the most similar materials.
Let { }niiiiiii xxxxxxX ,5,4,3,2,1, ..........,,,,,= and { }niiiiiii yyyyyyY ,5,4,3,2,1, ..........,,,,,= be a
target material and matching material respectively. PIM is mathematically expressed as
∑∑ −=
−=
n
i
i
n
i
i yxYXPIM
11
),( (14)
The best materials, whose Minimum Performance Index Measure(MPIM) value matches the
target materials is defined by
Computer Science & Information Technology (CS & IT) 161
MPIM(X,Y) = 





− ∑∑ ==
=
n
i
i
n
i
i
m
j
yx
11
1
min (15)
4.1 Normalized Measures
Normalization is the mechanism for transforming measure values to another range normally -1 to
+1. Normalization of PIM and SM values of similar materials are required to make relative
comparison in decision making on material selection. Normalized Performance Index Measure is
defined to transform PIM values to unique range.
NPIM=
)(
1
PIMXYMax
PIMXYj
− (16)
where, PIMXY is the performance index value of X and Y. and Max(PIMXY) is the maximum
Performance Index value among the similar materials.
NSM=
)(
1
YMax
Yj
− (17)
where Yj is the distance/similarity measure values, Max(Y) is the maximum similarity value
among the materials similarity measure values.
4.2 Algorithm For Similar Material Selection
Input : Target Material (TM): { }niiiiii xxxxxX ,5,4,2,1, ..........,,,,= ,Materials Database(MD)
Output: A List of materials Material matches the target materials
Method:
1. Start
2. PMX = Call PERFORMANCE_ MEASURE_ FUNCTION (X) //Calculates ∑=
n
i
ix
1
3. Y [] = Call SIMILARITY-FUNCTION( SM[],TM, MD) // Function gives top ten similarity
materials
4.
5. for j=1 to m do
6. Begin
7. PMY[j] = Call PERFORMANCE_ MEASURE_ FUNCTION ( Y [j]) // Calculates
∑−
n
j
jy
1
8. // PMY Performance Measure of Y
9. PIMX Y[j]) = PMX - PMY[j] // PIM = Performance Index Measure
10. End
11.
12. For j=1 to m do // Normalization of Measured Values
13.
14. Begin
15. NPIMXY[j] =
)(
1
PIMXYMax
PIMXYj
− // NPIMXY = Normalized Performance Index of
X,Y
162 Computer Science & Information Technology (CS & IT)
16. NY[j] =
)(
1
YMax
Yj
− // NY = Normalized Measured Value
17. End
18.
19. //Selection of a Material
20.
21. MNPIMXY = 0
22. for j=1 to m do
23. if (NPIMXY[j] > MNPIMXY) then // MNPIMXY = Max of Normalized Performance
Index MNPIMXY = PIMXY[j]
24. End
25. Stop
5. EXPERIMENTAL RESULTS
In this experiment, Materials Database(MD) consisting of 5670 data sets of metal, ceramic and
polymer type is considered for the selection of materials that very similar to target material
features set, say X. A target material has several attributes that include maximum of 25 various
kinds of properties as shown in table 2.
Table 2 . Target Material Properties with values
TS YS MP CS IS HRDS TM CS MUT DNSTY ELGN MACHN CE
122 230 921 80 5 765 100 3 52 0.90 81 3 5
TCE FS WA EI CR CORR SM CH EXTRN MOLD CAST MANFT
3 3 1 1 2 2 5 4 5 3 5 2
Each similarity function is applied for measuring the logical separation between the target
and a material data set in the database. Top ten materials that are very closer to the target
materials are considered and the relative Performance Index Measure(PIM) with all the materials
data set is computed for material selection. A material whose behaviour similarity is closure to
the behaviour of the target material is considered as the selected material.
The procedure is experimented with each similarity/distance function listed in the table 1.
Experimental results corresponding to each function are depicted in figure from 1 to 13. In each
graphical representation, similar group of materials are plotted along the X-axis and the
normalized similarity measure values and the normalized performance index measure values are
plotted along the Y-axis. The least normalized logical distance(The highest similarity) and the
highest normalized performance index measure are considered as materials selection measures.
Several similarity measure functions are deployed for material selection. Materials selected by
different methods, their normalized similarity measure values and normalized performance index
measure are tabulated in the table 3. Graphical representation of mined data are depicted in the
figure 14. From the figure14, it is depicted that material M-7 is the ideal material for the target
material since this has been commonly selected by six distance /similarity functions.
Computer Science & Information Technology (CS & IT) 163
Figure 1. Performance measures of Similar
Group of materials retrieved throug
Euclidian distance function
Figure 2. Performance measures of Similar
Group of materials retrieved throug Sqauired
Eucledian distance function
Figure 3 .Performance measures of Similar
Group of materials retrieved throug City
Block Ditance function
Figure 4 .Performance measures of Similar
Group of materials retrieved throug
Mickowski Distance Function
Figure 5 .Performance measures of Similar
Group of materials retrieved throug
Chebyshev Distance function
Figure 6 .Performance measures of Similar
Group of materials retrieved through
exponetial simialrity Measure function
164 Computer Science & Information Technology (CS & IT)
Figure 7 .Performance measures of Similar
Group of materials retrieved throug
Mickowski Distance Function.
Figure 8. Performance measures of Similar
Group of materials retrieved throug
camberrral Function.
Figure 9.Performance measures of Similar
Group of materials retrieved through Max-
Min Similarity Function.
Figure 10 . Performance measures of Similar
Group of materials retrieved through
Geometric Average Minimum Function.
Figure 11. Performance measures of Similar
Group of materials retrieved through
Geometric Average Maximum Function.
Figure 12 . Performance measures of Similar
Group of materials retrieved through Cosine
Appritute Function.
Computer Science & Information Technology (CS & IT) 165
Figure 13 . Performance measures of Similar Group of materials retrieved through Correlation
Coeefficent Method
Table 3 . Material selection through similarity measure functions and on normalized Performance
Index Measure
Methods Similarity Measures
Selected
Material
Normalized
Measured
Values
Normalized
Performance
Index
1 Euclidean Distance M-7 0.072133127 0.993781934
2 Squared Euclidean Distance M-7 0.138720149 0.993781934
3 City Block Distance M-7 0.072336620 0.994697549
4 Minkowski Distance, M-7 0.02788523 0.993781934
5 Chebyshev Distance M-262 0.00221239 0.67139835
6 Exponential Similarity Measure M-7 0.072559289 0.987868806
7 P-Inverse Similarity M-7 0.068632226 0.993781934
8 Camberra Similarity M-26 0.24896733 0.949039872
9 Max-Min Similarity Measure M-10 0.67304755 0.988623333
10 Geometric Average Minimum Similarity M-10 0.346172027 0.985798568
11
Geometric Average Maximum
Similarity
M-10 0.006477780 0.988623333
12 Cosine Amplitude Similarity M-290 0.92800612 0.815339607
13 Correlation Coefficient Similarity M-290 0.91467073 0.81524154
166 Computer Science & Information Technology (CS & IT)
Figure 14. Comparison of all the similarity measure functions with normalized similarity and
Performance Index Measure(PIM) values
6. CONCLUSION AND FUTURE SCOPE
In this paper similarity measures functionalities are used to select a required material, Before
applying the similarity measure functions the dataset is properly classified into metals, polymers
and ceramics using some techniques of classifications in data mining. Hence proposed a model
for retrieving selective information from the vast amount of data using similarity measure
functions. Systematic study of various similarity measure functions on engineering materials
database is done. The material which gives a high performance index value and less distance
between selected and target material is determined as an ideal material. The proposed approach is
suitable for the selection of engineering materials that suit for the required materials design
requirement specifications. Thus can overcome the haphazardly selecting the materials in
materials design and manufacturing industries.
Further it proposed to find a model which would be the exact ideal distance or similarity
function for the selection of materials that suit for complex applications in real empirical world.
and also the computational complexity such as time complexity and space complexity can be
examined for the model.
ACKNOWLEDGEMENTS
This work has been supported by the University Grant Commission(UGC), India under Major
Research Project entitled “Scientific Knowledge Discovery Systems (SKDS) For Advanced
Engineering Materials Design Applications” vide reference F.No. 34-992008 (SR), 30th
December 2008. The authors gratefully acknowledge the support and thank the authors.
REFERENCES
[1]. Doreswamy, Sharma, S.C.: "An expert decision support system for engineering materials selections
and their performance classifications on design parameters". International Journal of Computing and
Applications (ICJA)vol.1, pp.17-34,2006.
[2]. Doreswamy.: "Machine learning classification and selection model for engineering materials design
applications". Published in the proceedings of International Conference on Systemics: Cybernetics
and Informatics, ICSCI pp.no. , INDIA 2008.
Computer Science & Information Technology (CS & IT) 167
[3]. Eli Shechtman & Michal Irani .:" Matching Local Self-Similarities across Images and Videos"
Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE ,pp.1 - 8 .17-22 June 2007 .
[4]. Hariri B B, Abolhassani H, and Khodaei A.: "A new Structural Similarity Measure for Ontology
Alignment", in Proc. SWWS, 2006, pp.36-42.
[5]. Hongmei Wang, Sanghyuk Lee, and Jaehyung Kim.: " Quantitative Comparison of Similarity
Measure and Entropy for Fuzzy Sets" ADMA '09 Proceedings of the 5th International Conference on
Advanced Data Mining and Applications, pp. 688–695, 2009.
[6]. Jiawei Han, Micheeline Kamber: Data Mining Concepts & Techniques, Morgan Kaufamann Pubisher
2009.
[7]. Jouni Sampo and Pasi Luukka .: "Similarity Classifier with Generalized Mean; Ideal " Fuzzy Systems
And Knowledge Discovery Lecture Notes in Computer Science, 2006, Volume 4223/2006, 1140-
1147.
[8]. Kutz, Myer , Handbook of Materials Selection.. John Wiley & Sons,.2002.
[9]. Michel F Ashby.: Materials Selection in Mechanical Design, Elsevier 3rd edt 2005.
[10]. Sung-Hyuk Cha, "Comprehensive Survey on Distance/Similarity Measures between Probability
Density Function", International Journal of Mathematical Models and Methods in Applied Sciences,
Issues 4, Volume 1, 2007.
[11]. Schiele O. H.: Forschung und Entwicklung im Maschinenbauauf dem Weg in die
nformationsgesellschaft (in German,translation by the author), Bundesministerium für Bildung,
Wissenschaft, Forschung und Technologie, 1995.
[12]. Bohm C., Berchtold S., Kriegel H.-P., Michel U.: Multidimensional Index Structures in Relational
Databases, Journal of Intelligent Information Systems (JIIS) 15 (1), 2000.
[13]. Hrubiak.R., George,L., Saxena,S.K., Rajan, K., "A Materials Database for Exploring Material
Properties", Journal of Materials, Vol. 61, pp. 59-62, 2009.
[14]. Jarolan Mackerle, Nanomaterials, Nanomacanics and Finite elements: a bibliography (1995-2004),
Modeling Simulation Materials Science Engineering,13, pp. 123-158,2005
[15]. Burcu Yılmaz and Mehmet Gokturk, “Interactive Data Mining For Molecular Graphs”, Journal of
Automated Methods and Management in Chemistry, Vol. 2009, pp.1-12,2009.
[16]. Sung-Hyuk Cha, "Comprehensive Survey on Distance/Similarity Measures between Probability
Density Function", International Journal of Mathematical Models and Methods in Applied Sciences,
Issues 4, Volume 1, 2007.
[17]. Randall Wilson D, Tony R. Martinez, Improved Heterogeneous Distance Functions Journal of
Artificial Intelligence Research ,Vol. 6, pp. 1-34, 1997.
[18]. Kutz, Myer , Handbook of Materials Selection.. John Wiley & Sons,.2002.
[19]. Westbrook, J W, MATERIALS DATA ON THE INTERNET, Data Science Journal, Volume 2, 25
November, 2003.
168 Computer Science & Information Technology (CS & IT)
Authors
Doreswamy received B.Sc degree in Computer Science and M.Sc Degree in
Computer Science from University of Mysore in 1993 and 1995 respectively.
After completion of his Post-Graduation Degree, he subsequently joined and served
as Lecturer in Computer Science at St.Joseph’s College, Bangalore from 1996-1999
and at Yuvaraja’s College, a constituent college of University of Mysore from
1999-2002. Then he has elevated to the position Reader in Computer Science at
Mangalore University in year 2003. He was the Chairman of the Department of
Post-Graduate Studies and Research in Computer Science from 2003-2005 & from
2008-2009 and served at varies capacities in Mangalore University and at present
he is the Chairman of Board of Studies in Computer Science of Mangalore
University. His areas of research interests include Data Mining and Knowledge Discovery, Artificial
Intelligence and Expert Systems, Bioinformatics, Molecular Modeling and Simulation, Computational
Intelligence, Nanotechnology, Image Processing and Pattern Recognition. He has been granted a Major
Research project entitled “Scientific Knowledge Discovery Systems(SKDS) for advanced Engineering
Materials Design Applications” from the funding Agency University Grant Commission, New Delhi,
INDIA. He has published about 30 contributed peer reviewed papers at National/International Journals
and Conferences. He received SHIKSHA RATTAN PURSKAR for his outstanding achievements in the
year 2009 & RASTRIYA VIDYA SARAWATHI AWARD for outstanding achievement in chosen field
of activity in the year 2010.
Manohar M G received B.Sc and M.C.A Degree from Karnataka University
Dharwad in 1999 and 2002 respectively. After completion of his Post-Graduation
Degree, he subsequently joined Honeywell Software technology Solutions Pvt. Ltd.
Bangalore India and served for an around a year as software engineer. Then in the
year 2003 he joined Mangalore University and currently working as System
Programmer/Analyst. Meanwhile he received the MPhil Degree in Computer
Science in the year 2011 and continuing the research work for his Ph.D degree in
Computer Science under the guidance of Dr. Doreswamy in the Department of
Post-Graduate Studies and Research in Computer Science, Mangalore University
Hemanth K S received B.Sc degree and MCA degree in the years 2006 and 2009
respectively. Currently working as Project Fellow of UGC Major Research Project
and is working towards his Ph.D degree in Computer Science under the guidance of
Dr. Doreswamy in the Department of Post-Graduate Studies and Research in
Computer Science, Mangalore University.

More Related Content

What's hot

Ijatcse71852019
Ijatcse71852019Ijatcse71852019
Ijatcse71852019loki536577
 
Effective data mining for proper
Effective data mining for properEffective data mining for proper
Effective data mining for properIJDKP
 
A unified approach for spatial data query
A unified approach for spatial data queryA unified approach for spatial data query
A unified approach for spatial data queryIJDKP
 
Recommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduceRecommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduceIJDKP
 
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...IJDKP
 
Data mining techniques a survey paper
Data mining techniques a survey paperData mining techniques a survey paper
Data mining techniques a survey papereSAT Publishing House
 
New proximity estimate for incremental update of non uniformly distributed cl...
New proximity estimate for incremental update of non uniformly distributed cl...New proximity estimate for incremental update of non uniformly distributed cl...
New proximity estimate for incremental update of non uniformly distributed cl...IJDKP
 
GCUBE INDEXING
GCUBE INDEXINGGCUBE INDEXING
GCUBE INDEXINGIJDKP
 
Introduction to feature subset selection method
Introduction to feature subset selection methodIntroduction to feature subset selection method
Introduction to feature subset selection methodIJSRD
 
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-MiningLinking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-Miningertekg
 
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...ertekg
 
Enhancement techniques for data warehouse staging area
Enhancement techniques for data warehouse staging areaEnhancement techniques for data warehouse staging area
Enhancement techniques for data warehouse staging areaIJDKP
 
Influence over the Dimensionality Reduction and Clustering for Air Quality Me...
Influence over the Dimensionality Reduction and Clustering for Air Quality Me...Influence over the Dimensionality Reduction and Clustering for Air Quality Me...
Influence over the Dimensionality Reduction and Clustering for Air Quality Me...IJAEMSJORNAL
 
Enhance The Technique For Searching Dimension Incomplete Databases
Enhance The Technique For Searching Dimension Incomplete DatabasesEnhance The Technique For Searching Dimension Incomplete Databases
Enhance The Technique For Searching Dimension Incomplete Databasespaperpublications3
 
Combined mining approach to generate patterns for complex data
Combined mining approach to generate patterns for complex dataCombined mining approach to generate patterns for complex data
Combined mining approach to generate patterns for complex datacsandit
 
COMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATA
COMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATACOMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATA
COMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATAcscpconf
 
Variance rover system
Variance rover systemVariance rover system
Variance rover systemeSAT Journals
 
Variance rover system web analytics tool using data
Variance rover system web analytics tool using dataVariance rover system web analytics tool using data
Variance rover system web analytics tool using dataeSAT Publishing House
 

What's hot (20)

Ijatcse71852019
Ijatcse71852019Ijatcse71852019
Ijatcse71852019
 
Effective data mining for proper
Effective data mining for properEffective data mining for proper
Effective data mining for proper
 
A unified approach for spatial data query
A unified approach for spatial data queryA unified approach for spatial data query
A unified approach for spatial data query
 
Recommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduceRecommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduce
 
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
 
Data mining techniques a survey paper
Data mining techniques a survey paperData mining techniques a survey paper
Data mining techniques a survey paper
 
New proximity estimate for incremental update of non uniformly distributed cl...
New proximity estimate for incremental update of non uniformly distributed cl...New proximity estimate for incremental update of non uniformly distributed cl...
New proximity estimate for incremental update of non uniformly distributed cl...
 
GCUBE INDEXING
GCUBE INDEXINGGCUBE INDEXING
GCUBE INDEXING
 
F04463437
F04463437F04463437
F04463437
 
H04564550
H04564550H04564550
H04564550
 
Introduction to feature subset selection method
Introduction to feature subset selection methodIntroduction to feature subset selection method
Introduction to feature subset selection method
 
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-MiningLinking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
 
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
 
Enhancement techniques for data warehouse staging area
Enhancement techniques for data warehouse staging areaEnhancement techniques for data warehouse staging area
Enhancement techniques for data warehouse staging area
 
Influence over the Dimensionality Reduction and Clustering for Air Quality Me...
Influence over the Dimensionality Reduction and Clustering for Air Quality Me...Influence over the Dimensionality Reduction and Clustering for Air Quality Me...
Influence over the Dimensionality Reduction and Clustering for Air Quality Me...
 
Enhance The Technique For Searching Dimension Incomplete Databases
Enhance The Technique For Searching Dimension Incomplete DatabasesEnhance The Technique For Searching Dimension Incomplete Databases
Enhance The Technique For Searching Dimension Incomplete Databases
 
Combined mining approach to generate patterns for complex data
Combined mining approach to generate patterns for complex dataCombined mining approach to generate patterns for complex data
Combined mining approach to generate patterns for complex data
 
COMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATA
COMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATACOMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATA
COMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATA
 
Variance rover system
Variance rover systemVariance rover system
Variance rover system
 
Variance rover system web analytics tool using data
Variance rover system web analytics tool using dataVariance rover system web analytics tool using data
Variance rover system web analytics tool using data
 

Similar to A Study on Similarity Measures for Materials Selection

Survey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning AlgorithmsSurvey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning AlgorithmsIRJET Journal
 
Feature Selection : A Novel Approach for the Prediction of Learning Disabilit...
Feature Selection : A Novel Approach for the Prediction of Learning Disabilit...Feature Selection : A Novel Approach for the Prediction of Learning Disabilit...
Feature Selection : A Novel Approach for the Prediction of Learning Disabilit...csandit
 
Feature selection a novel
Feature selection a novelFeature selection a novel
Feature selection a novelcsandit
 
Frequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaFrequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaIJERA Editor
 
Frequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaFrequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaIJERA Editor
 
IRJET- Diverse Approaches for Document Clustering in Product Development Anal...
IRJET- Diverse Approaches for Document Clustering in Product Development Anal...IRJET- Diverse Approaches for Document Clustering in Product Development Anal...
IRJET- Diverse Approaches for Document Clustering in Product Development Anal...IRJET Journal
 
Regression with Microsoft Azure & Ms Excel
Regression with Microsoft Azure & Ms ExcelRegression with Microsoft Azure & Ms Excel
Regression with Microsoft Azure & Ms ExcelDr. Abdul Ahad Abro
 
Vertical intent prediction approach based on Doc2vec and convolutional neural...
Vertical intent prediction approach based on Doc2vec and convolutional neural...Vertical intent prediction approach based on Doc2vec and convolutional neural...
Vertical intent prediction approach based on Doc2vec and convolutional neural...IJECEIAES
 
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...IOSR Journals
 
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...acijjournal
 
Cross Domain Data Fusion
Cross Domain Data FusionCross Domain Data Fusion
Cross Domain Data FusionIRJET Journal
 
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerStudy and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerIJERA Editor
 
Pak eko 4412ijdms01
Pak eko 4412ijdms01Pak eko 4412ijdms01
Pak eko 4412ijdms01hyuviridvic
 
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILESAN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILEScscpconf
 
An efficient feature selection in
An efficient feature selection inAn efficient feature selection in
An efficient feature selection incsandit
 
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...Editor IJCATR
 
A Novel Multi- Viewpoint based Similarity Measure for Document Clustering
A Novel Multi- Viewpoint based Similarity Measure for Document ClusteringA Novel Multi- Viewpoint based Similarity Measure for Document Clustering
A Novel Multi- Viewpoint based Similarity Measure for Document ClusteringIJMER
 
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-MiningLinking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-MiningGurdal Ertek
 

Similar to A Study on Similarity Measures for Materials Selection (20)

Survey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning AlgorithmsSurvey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning Algorithms
 
Feature Selection : A Novel Approach for the Prediction of Learning Disabilit...
Feature Selection : A Novel Approach for the Prediction of Learning Disabilit...Feature Selection : A Novel Approach for the Prediction of Learning Disabilit...
Feature Selection : A Novel Approach for the Prediction of Learning Disabilit...
 
Feature selection a novel
Feature selection a novelFeature selection a novel
Feature selection a novel
 
Frequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaFrequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social Media
 
Frequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaFrequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social Media
 
IRJET- Diverse Approaches for Document Clustering in Product Development Anal...
IRJET- Diverse Approaches for Document Clustering in Product Development Anal...IRJET- Diverse Approaches for Document Clustering in Product Development Anal...
IRJET- Diverse Approaches for Document Clustering in Product Development Anal...
 
Regression with Microsoft Azure & Ms Excel
Regression with Microsoft Azure & Ms ExcelRegression with Microsoft Azure & Ms Excel
Regression with Microsoft Azure & Ms Excel
 
G1803054653
G1803054653G1803054653
G1803054653
 
Vertical intent prediction approach based on Doc2vec and convolutional neural...
Vertical intent prediction approach based on Doc2vec and convolutional neural...Vertical intent prediction approach based on Doc2vec and convolutional neural...
Vertical intent prediction approach based on Doc2vec and convolutional neural...
 
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
 
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
 
I1802055259
I1802055259I1802055259
I1802055259
 
Cross Domain Data Fusion
Cross Domain Data FusionCross Domain Data Fusion
Cross Domain Data Fusion
 
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerStudy and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
 
Pak eko 4412ijdms01
Pak eko 4412ijdms01Pak eko 4412ijdms01
Pak eko 4412ijdms01
 
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILESAN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
 
An efficient feature selection in
An efficient feature selection inAn efficient feature selection in
An efficient feature selection in
 
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
 
A Novel Multi- Viewpoint based Similarity Measure for Document Clustering
A Novel Multi- Viewpoint based Similarity Measure for Document ClusteringA Novel Multi- Viewpoint based Similarity Measure for Document Clustering
A Novel Multi- Viewpoint based Similarity Measure for Document Clustering
 
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-MiningLinking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
 

More from cscpconf

ANALYSIS OF LAND SURFACE DEFORMATION GRADIENT BY DINSAR
ANALYSIS OF LAND SURFACE DEFORMATION GRADIENT BY DINSAR ANALYSIS OF LAND SURFACE DEFORMATION GRADIENT BY DINSAR
ANALYSIS OF LAND SURFACE DEFORMATION GRADIENT BY DINSAR cscpconf
 
4D AUTOMATIC LIP-READING FOR SPEAKER'S FACE IDENTIFCATION
4D AUTOMATIC LIP-READING FOR SPEAKER'S FACE IDENTIFCATION4D AUTOMATIC LIP-READING FOR SPEAKER'S FACE IDENTIFCATION
4D AUTOMATIC LIP-READING FOR SPEAKER'S FACE IDENTIFCATIONcscpconf
 
MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...
MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...
MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...cscpconf
 
PROMOTING STUDENT ENGAGEMENT USING SOCIAL MEDIA TECHNOLOGIES
PROMOTING STUDENT ENGAGEMENT USING SOCIAL MEDIA TECHNOLOGIESPROMOTING STUDENT ENGAGEMENT USING SOCIAL MEDIA TECHNOLOGIES
PROMOTING STUDENT ENGAGEMENT USING SOCIAL MEDIA TECHNOLOGIEScscpconf
 
A SURVEY ON QUESTION ANSWERING SYSTEMS: THE ADVANCES OF FUZZY LOGIC
A SURVEY ON QUESTION ANSWERING SYSTEMS: THE ADVANCES OF FUZZY LOGICA SURVEY ON QUESTION ANSWERING SYSTEMS: THE ADVANCES OF FUZZY LOGIC
A SURVEY ON QUESTION ANSWERING SYSTEMS: THE ADVANCES OF FUZZY LOGICcscpconf
 
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS cscpconf
 
INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS
INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS
INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS cscpconf
 
TWO DISCRETE BINARY VERSIONS OF AFRICAN BUFFALO OPTIMIZATION METAHEURISTIC
TWO DISCRETE BINARY VERSIONS OF AFRICAN BUFFALO OPTIMIZATION METAHEURISTICTWO DISCRETE BINARY VERSIONS OF AFRICAN BUFFALO OPTIMIZATION METAHEURISTIC
TWO DISCRETE BINARY VERSIONS OF AFRICAN BUFFALO OPTIMIZATION METAHEURISTICcscpconf
 
DETECTION OF ALGORITHMICALLY GENERATED MALICIOUS DOMAIN
DETECTION OF ALGORITHMICALLY GENERATED MALICIOUS DOMAINDETECTION OF ALGORITHMICALLY GENERATED MALICIOUS DOMAIN
DETECTION OF ALGORITHMICALLY GENERATED MALICIOUS DOMAINcscpconf
 
GLOBAL MUSIC ASSET ASSURANCE DIGITAL CURRENCY: A DRM SOLUTION FOR STREAMING C...
GLOBAL MUSIC ASSET ASSURANCE DIGITAL CURRENCY: A DRM SOLUTION FOR STREAMING C...GLOBAL MUSIC ASSET ASSURANCE DIGITAL CURRENCY: A DRM SOLUTION FOR STREAMING C...
GLOBAL MUSIC ASSET ASSURANCE DIGITAL CURRENCY: A DRM SOLUTION FOR STREAMING C...cscpconf
 
IMPORTANCE OF VERB SUFFIX MAPPING IN DISCOURSE TRANSLATION SYSTEM
IMPORTANCE OF VERB SUFFIX MAPPING IN DISCOURSE TRANSLATION SYSTEMIMPORTANCE OF VERB SUFFIX MAPPING IN DISCOURSE TRANSLATION SYSTEM
IMPORTANCE OF VERB SUFFIX MAPPING IN DISCOURSE TRANSLATION SYSTEMcscpconf
 
EXACT SOLUTIONS OF A FAMILY OF HIGHER-DIMENSIONAL SPACE-TIME FRACTIONAL KDV-T...
EXACT SOLUTIONS OF A FAMILY OF HIGHER-DIMENSIONAL SPACE-TIME FRACTIONAL KDV-T...EXACT SOLUTIONS OF A FAMILY OF HIGHER-DIMENSIONAL SPACE-TIME FRACTIONAL KDV-T...
EXACT SOLUTIONS OF A FAMILY OF HIGHER-DIMENSIONAL SPACE-TIME FRACTIONAL KDV-T...cscpconf
 
AUTOMATED PENETRATION TESTING: AN OVERVIEW
AUTOMATED PENETRATION TESTING: AN OVERVIEWAUTOMATED PENETRATION TESTING: AN OVERVIEW
AUTOMATED PENETRATION TESTING: AN OVERVIEWcscpconf
 
CLASSIFICATION OF ALZHEIMER USING fMRI DATA AND BRAIN NETWORK
CLASSIFICATION OF ALZHEIMER USING fMRI DATA AND BRAIN NETWORKCLASSIFICATION OF ALZHEIMER USING fMRI DATA AND BRAIN NETWORK
CLASSIFICATION OF ALZHEIMER USING fMRI DATA AND BRAIN NETWORKcscpconf
 
VALIDATION METHOD OF FUZZY ASSOCIATION RULES BASED ON FUZZY FORMAL CONCEPT AN...
VALIDATION METHOD OF FUZZY ASSOCIATION RULES BASED ON FUZZY FORMAL CONCEPT AN...VALIDATION METHOD OF FUZZY ASSOCIATION RULES BASED ON FUZZY FORMAL CONCEPT AN...
VALIDATION METHOD OF FUZZY ASSOCIATION RULES BASED ON FUZZY FORMAL CONCEPT AN...cscpconf
 
PROBABILITY BASED CLUSTER EXPANSION OVERSAMPLING TECHNIQUE FOR IMBALANCED DATA
PROBABILITY BASED CLUSTER EXPANSION OVERSAMPLING TECHNIQUE FOR IMBALANCED DATAPROBABILITY BASED CLUSTER EXPANSION OVERSAMPLING TECHNIQUE FOR IMBALANCED DATA
PROBABILITY BASED CLUSTER EXPANSION OVERSAMPLING TECHNIQUE FOR IMBALANCED DATAcscpconf
 
CHARACTER AND IMAGE RECOGNITION FOR DATA CATALOGING IN ECOLOGICAL RESEARCH
CHARACTER AND IMAGE RECOGNITION FOR DATA CATALOGING IN ECOLOGICAL RESEARCHCHARACTER AND IMAGE RECOGNITION FOR DATA CATALOGING IN ECOLOGICAL RESEARCH
CHARACTER AND IMAGE RECOGNITION FOR DATA CATALOGING IN ECOLOGICAL RESEARCHcscpconf
 
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...cscpconf
 
SOCIAL NETWORK HATE SPEECH DETECTION FOR AMHARIC LANGUAGE
SOCIAL NETWORK HATE SPEECH DETECTION FOR AMHARIC LANGUAGESOCIAL NETWORK HATE SPEECH DETECTION FOR AMHARIC LANGUAGE
SOCIAL NETWORK HATE SPEECH DETECTION FOR AMHARIC LANGUAGEcscpconf
 
GENERAL REGRESSION NEURAL NETWORK BASED POS TAGGING FOR NEPALI TEXT
GENERAL REGRESSION NEURAL NETWORK BASED POS TAGGING FOR NEPALI TEXTGENERAL REGRESSION NEURAL NETWORK BASED POS TAGGING FOR NEPALI TEXT
GENERAL REGRESSION NEURAL NETWORK BASED POS TAGGING FOR NEPALI TEXTcscpconf
 

More from cscpconf (20)

ANALYSIS OF LAND SURFACE DEFORMATION GRADIENT BY DINSAR
ANALYSIS OF LAND SURFACE DEFORMATION GRADIENT BY DINSAR ANALYSIS OF LAND SURFACE DEFORMATION GRADIENT BY DINSAR
ANALYSIS OF LAND SURFACE DEFORMATION GRADIENT BY DINSAR
 
4D AUTOMATIC LIP-READING FOR SPEAKER'S FACE IDENTIFCATION
4D AUTOMATIC LIP-READING FOR SPEAKER'S FACE IDENTIFCATION4D AUTOMATIC LIP-READING FOR SPEAKER'S FACE IDENTIFCATION
4D AUTOMATIC LIP-READING FOR SPEAKER'S FACE IDENTIFCATION
 
MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...
MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...
MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...
 
PROMOTING STUDENT ENGAGEMENT USING SOCIAL MEDIA TECHNOLOGIES
PROMOTING STUDENT ENGAGEMENT USING SOCIAL MEDIA TECHNOLOGIESPROMOTING STUDENT ENGAGEMENT USING SOCIAL MEDIA TECHNOLOGIES
PROMOTING STUDENT ENGAGEMENT USING SOCIAL MEDIA TECHNOLOGIES
 
A SURVEY ON QUESTION ANSWERING SYSTEMS: THE ADVANCES OF FUZZY LOGIC
A SURVEY ON QUESTION ANSWERING SYSTEMS: THE ADVANCES OF FUZZY LOGICA SURVEY ON QUESTION ANSWERING SYSTEMS: THE ADVANCES OF FUZZY LOGIC
A SURVEY ON QUESTION ANSWERING SYSTEMS: THE ADVANCES OF FUZZY LOGIC
 
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
 
INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS
INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS
INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS
 
TWO DISCRETE BINARY VERSIONS OF AFRICAN BUFFALO OPTIMIZATION METAHEURISTIC
TWO DISCRETE BINARY VERSIONS OF AFRICAN BUFFALO OPTIMIZATION METAHEURISTICTWO DISCRETE BINARY VERSIONS OF AFRICAN BUFFALO OPTIMIZATION METAHEURISTIC
TWO DISCRETE BINARY VERSIONS OF AFRICAN BUFFALO OPTIMIZATION METAHEURISTIC
 
DETECTION OF ALGORITHMICALLY GENERATED MALICIOUS DOMAIN
DETECTION OF ALGORITHMICALLY GENERATED MALICIOUS DOMAINDETECTION OF ALGORITHMICALLY GENERATED MALICIOUS DOMAIN
DETECTION OF ALGORITHMICALLY GENERATED MALICIOUS DOMAIN
 
GLOBAL MUSIC ASSET ASSURANCE DIGITAL CURRENCY: A DRM SOLUTION FOR STREAMING C...
GLOBAL MUSIC ASSET ASSURANCE DIGITAL CURRENCY: A DRM SOLUTION FOR STREAMING C...GLOBAL MUSIC ASSET ASSURANCE DIGITAL CURRENCY: A DRM SOLUTION FOR STREAMING C...
GLOBAL MUSIC ASSET ASSURANCE DIGITAL CURRENCY: A DRM SOLUTION FOR STREAMING C...
 
IMPORTANCE OF VERB SUFFIX MAPPING IN DISCOURSE TRANSLATION SYSTEM
IMPORTANCE OF VERB SUFFIX MAPPING IN DISCOURSE TRANSLATION SYSTEMIMPORTANCE OF VERB SUFFIX MAPPING IN DISCOURSE TRANSLATION SYSTEM
IMPORTANCE OF VERB SUFFIX MAPPING IN DISCOURSE TRANSLATION SYSTEM
 
EXACT SOLUTIONS OF A FAMILY OF HIGHER-DIMENSIONAL SPACE-TIME FRACTIONAL KDV-T...
EXACT SOLUTIONS OF A FAMILY OF HIGHER-DIMENSIONAL SPACE-TIME FRACTIONAL KDV-T...EXACT SOLUTIONS OF A FAMILY OF HIGHER-DIMENSIONAL SPACE-TIME FRACTIONAL KDV-T...
EXACT SOLUTIONS OF A FAMILY OF HIGHER-DIMENSIONAL SPACE-TIME FRACTIONAL KDV-T...
 
AUTOMATED PENETRATION TESTING: AN OVERVIEW
AUTOMATED PENETRATION TESTING: AN OVERVIEWAUTOMATED PENETRATION TESTING: AN OVERVIEW
AUTOMATED PENETRATION TESTING: AN OVERVIEW
 
CLASSIFICATION OF ALZHEIMER USING fMRI DATA AND BRAIN NETWORK
CLASSIFICATION OF ALZHEIMER USING fMRI DATA AND BRAIN NETWORKCLASSIFICATION OF ALZHEIMER USING fMRI DATA AND BRAIN NETWORK
CLASSIFICATION OF ALZHEIMER USING fMRI DATA AND BRAIN NETWORK
 
VALIDATION METHOD OF FUZZY ASSOCIATION RULES BASED ON FUZZY FORMAL CONCEPT AN...
VALIDATION METHOD OF FUZZY ASSOCIATION RULES BASED ON FUZZY FORMAL CONCEPT AN...VALIDATION METHOD OF FUZZY ASSOCIATION RULES BASED ON FUZZY FORMAL CONCEPT AN...
VALIDATION METHOD OF FUZZY ASSOCIATION RULES BASED ON FUZZY FORMAL CONCEPT AN...
 
PROBABILITY BASED CLUSTER EXPANSION OVERSAMPLING TECHNIQUE FOR IMBALANCED DATA
PROBABILITY BASED CLUSTER EXPANSION OVERSAMPLING TECHNIQUE FOR IMBALANCED DATAPROBABILITY BASED CLUSTER EXPANSION OVERSAMPLING TECHNIQUE FOR IMBALANCED DATA
PROBABILITY BASED CLUSTER EXPANSION OVERSAMPLING TECHNIQUE FOR IMBALANCED DATA
 
CHARACTER AND IMAGE RECOGNITION FOR DATA CATALOGING IN ECOLOGICAL RESEARCH
CHARACTER AND IMAGE RECOGNITION FOR DATA CATALOGING IN ECOLOGICAL RESEARCHCHARACTER AND IMAGE RECOGNITION FOR DATA CATALOGING IN ECOLOGICAL RESEARCH
CHARACTER AND IMAGE RECOGNITION FOR DATA CATALOGING IN ECOLOGICAL RESEARCH
 
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...
 
SOCIAL NETWORK HATE SPEECH DETECTION FOR AMHARIC LANGUAGE
SOCIAL NETWORK HATE SPEECH DETECTION FOR AMHARIC LANGUAGESOCIAL NETWORK HATE SPEECH DETECTION FOR AMHARIC LANGUAGE
SOCIAL NETWORK HATE SPEECH DETECTION FOR AMHARIC LANGUAGE
 
GENERAL REGRESSION NEURAL NETWORK BASED POS TAGGING FOR NEPALI TEXT
GENERAL REGRESSION NEURAL NETWORK BASED POS TAGGING FOR NEPALI TEXTGENERAL REGRESSION NEURAL NETWORK BASED POS TAGGING FOR NEPALI TEXT
GENERAL REGRESSION NEURAL NETWORK BASED POS TAGGING FOR NEPALI TEXT
 

Recently uploaded

ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.arsicmarija21
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxEyham Joco
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxUnboundStockton
 

Recently uploaded (20)

ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptx
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docx
 

A Study on Similarity Measures for Materials Selection

  • 1. David Bracewell, et al. (Eds): AIAA 2011,CS & IT 03, pp. 157–168 , 2011. © CS & IT-CSCP 2011 DOI : 10.5121/csit.2011.1314 A STUDY ON SIMILARITY MEASURE FUNCTIONS ON ENGINEERING MATERIALS SELECTION Doreswamy1 , Manohar M G2 , Hemanth.K.S3 1,2,3 Department of Computer Science Mangalore University Mangalagangotri-574 199 doreswamyh@yahoo.com, manoharg.phd@gmail.com, reachhemanthmca@gmail.com ABSTRACT While designing a new type of engineering material one has to search for some existing materials which suits design requirement and then he can try to produce new kind of engineering material. This selection process itself is tedious as he has to select few numbers of materials out of a set of lakhs of materials. Therefore in this paper a model is proposed to select a particular material which suits the user requirement, by using some similarity/distance measuring functionalities. Here thirteen different types of similarity/distance measuring functionalities are examined. Performance Index Measure(PIM) is calculated to verify the relative performance of the selected material with the target material. Then all the results are normalised for the purpose of analysing the results. Hence the proposed model reduces the wastage of time in selection and also avoids the haphazardly selection of the materials in materials design and manufacturing industries. KEYWORDS Similarity measure, Performance index, Materials database, Materials selection, Knowledge Discovery 1. INTRODUCTION Processed information is playing a major role in the success of the any industry. Only who can apply the suitable newest information for his product development, is able to survive in the global competition[11]. The latest developments in storage and network technologies have enabled the fast access to information resources and repositories, but there is a lack of facilities to effectively and efficiently search for the required information. To make use of the huge data repositories and information, the field of knowledge extraction from these data repositories will be of major importance in the next decade. Therefore, research on Data Mining and Knowledge discovery has become one of the most important domains of computer science. Usually the modern Database Management Systems (DBMS) provide fundamental utilities such as searching, retrieving and indexing mechanisms for the management of such relational data [12], which are well-understood and widely applied in many commercial and industrials applications. The traditional DBMS can help the user If the structure of the information to be searched is sufficiently simple and manageable with available data manipulation mechanism of
  • 2. 158 Computer Science & Information Technology (CS & IT) the DBMS. Requirements of traditional operational data management such as accounting and billing are perfectly met by a commercial DBMS. Therefore, the information infrastructure of most enterprises is based on the latest Relational Database Management Systems (RDBMS). Recently, an increase in the number of applications, processing large amounts of complex and application specific data objects have been observed. Therefore it is the necessity of the day as the latest trends in the industry require to analyse and use the existing huge complex data. In application domains such as multimedia, medical imaging, Computer Aided Design (AMPTIAC Material EASE 2 CAMS Report), Material Informatics [13], Nanoinformatics [14], Bioinformatics [15], marketing and purchasing assistance, etc., a high efficiency of query processing is crucial due to the immense and even increasing size of current databases. The search in such databases, called non-standard databases, is rarely based on an exact match of objects. Instead, the search is often based on some notion of similarity which is specific to the application. For applications which do not only support transaction oriented search operations, but also provide high-level strategic information or decision making. It is necessary not only to search for objects which are similar to a given query object but rather to analyze the data set as a whole for knowledge discovery. Information which is interesting in the decision making process include common patterns, classifications of data, knowledge about collections of similar objects and exceptional data. Similar search and Data Mining have become widespread problems of modern database applications. Similar pattern search in databases is a problem of searching extract patterns that matches the target object/Patterns[2]. A general approach is to translate complex object into single or multi dimensional vector by feature transformation function and then to employ search mechanism to retrieve similar object to a given query object. Analysis of similar patterns involves data mining techniques such as association analysis, correlation analysis, classification, cluster analysis, outlier analysis and distance or similarity functions. This kind of information is commonly referred to as knowledge and the process of deriving such knowledge or higher-level information from a vast amount of transactional data is called data mining or knowledge discovery in databases (KDD)[6]. Because such applications on top of modern databases are also depend on similarity search [16]. The difference to traditional similarity search applications is, however, that these applications do not only raise few, single similarity queries but rather a high number of such queries. This paper is organized as fallows, A brief introduction to similarity and distance measuring functions are discussed, and a list of 13 such functions are tabled in section 2. Section 3 describes briefly about the engineering material database. Further the performance index measure(PIM) for material selection, Normalized measures and algorithm for similar material selection are elaborated in the section 4. Finally experimental results and conclusion are discuses in section 5 and section 6 respectively 2. SIMILARITY AND DISTANCE FUNCTIONS From the scientific and mathematical point of view, similarity/distance is defined as a quantitative degree that enumerates the logical separation of two objects represented by a set of measurable attributes/characteristics[4][5]. Measuring similarity or distance between two data points is a core requirement for several data mining and knowledge discovery tasks that involve distance computation. Examples include clustering (k-means), distance-based outlier detection, classification (KNN, SVM), and several other data mining tasks. These algorithms typically treat the similarity computation as an orthogonal step and can make use of any measure. For continuous data sets, the Minkowski Distance is a general method used to compute distance between two multivariate points. In particular, the Minkowski Distance of order 1 (Manhattan) and order 2 (Euclidean) are the two most widely used distance measures for continuous data. The key observation about the above measures is that they are independent of the underlying data set
  • 3. Computer Science & Information Technology (CS & IT) 159 to which the two points belong. Several data driven measures have also been explored for continuous data. The notion of similarity or distance for categorical data is not as straightforward as for continuous data. The key characteristic of categorical data is that the different values that a categorical attribute takes are not inherently ordered[7]. Thus, it is not possible to directly compare two different categorical values. The simplest way to address similarity between two categorical attributes is to assign a similarity of 1 if the values are identical and a similarity of 0 if the values are not identical. For two multivariate categorical data points, the similarity between them will be directly proportional to the number of attributes in which they match. Various similarity measure functions are enumerated in the literature [16][17] and whose applications are widespread in retrieving information or data from databases. Properties of similarity / distance measure function are as follows: 0),( ≥YXD Distance is a non-negative number. 0),( =XXD The distance of an attribute to itself is zero. ),(),( XYDYXD = the distance is symmetric function ),(),(),( YXDYZDZXD ≥+ It does obey triangular inequality. where { }niiiiiii xxxxxxX ,5,4,3,2,1, ..........,,,,,= and { }niiiiiii yyyyyyY ,5,4,3,2,1, ..........,,,,,= are two n dimensional data sets. Any function is said to be a distance metric if it satisfies the properties from 1 to 4, however the similarity function may not satisfy the 4th property. Some of the distance or similarity function frequently used for information retrieval from the databases are listed below and employed for material selection from the engineering materials database. Table 1 . Various Similarity/ Distance functions 1. Euclidean Distance ( )       −= ∑= n i iiEud YXYXD 1 2 ),( , where i = 1 . . n (1) 2. Squared Euclidean Distance ∑= −= n i iiEd YXYXD 1 2 ),( where i = 1 . . . n, (2) 3. City Block Distance ∑= −= n i iiCBd YXYXD 1 ),( , where i = 1. . . n (3) 4. Minkowski Distance, pn i p iiMkd YXYXD 1 1 ),(       −= ∑= , where i = 1. .. n (4) 5. Chebyshev Distance ii i Chebd YXYXD −= max),( , where i = 1. . . n (5) 6. Exponential Similarity Measure ∑= −− + − = n i YX ii iiESM ii e YX YXD 1 1 ),( , where i = 1 . .. n (6)
  • 4. 160 Computer Science & Information Technology (CS & IT) 7. P-Inverse Similarity 3 4.1 1 2 ),(       −= ∑= n i iiMkd YXYXD , where i = 1 . . . n (7) 8. Camberra Similarity ∑= + − = n i ii ii yx yx yxC 1 ),( , where i = 1 . . . n (8) 9. Max-Min Similarity Measure ∑ ∑ = = = n i ii n i ii MMM YX YX YXD 1 1 ),max( ),min( ),( , where i = 1 . . . n (9) 10. Geometric Average Minimum Similarity [ ]∑ ∑ = = = n i ii n i ii Min YX YXMin YXGA 1 2/1 1 . ),( ),( , where i = 1 .. . n (10) 11. Geometric Average Maximum Similarity [ ]∑ ∑ = = = n i ii n i ii Max YX YXMax YXGA 1 2/1 1 . ),( ),( , where i = 1 . . n (11) 12. Cosine Amplitude Similarity ∑∑ ∑ == = = m k jk m k ik m k jkik ji xx xx r 1 2 1 2 1 , ))(( , where i, j = 1. . . n (12) 13. Correlation Coefficient Similarity 2/1 1 2 , 2/1 1 2 , , 1 , )()( ),(       −      − −− = ∑∑ ∑ = − = − − = − p k jkj p k jki jkj p k iki ji yyxx yyxx YXγ where , 1 , 1 1 , 1 , ∑∑ = − = − == p k kjj p k kii y p yandx p x (13) 3. ENGINEERING MATERIALS DATABASE(EMD) It is a group of related records of engineering materials data set. Each record or an object is a group of related attributes of materials. The attributes of materials are the characteristics or behaviour of synthesized materials and these are measured by design engineer and manufacturing industries following the industrial standards. Data related to engineering materials are gathered from different information sources that include materials handbook [18] scientific literature[19] and Internet WebPages www.matweb.com. Information structure for organizing these gathered data are organized with scheme of Object Oriented Data Model . 4. PERFORMANCE INDEX MEASURE(PIM) FOR MATERIAL SELECTION Performance index measure is proposed for the selection of the best among the similar materials. Performance Index Measure is the logical difference between the absolute aggregated values of attribute/characteristics of target material and its matching material. The best material is the one whose performance index measure value is the minimum among the most similar materials. Let { }niiiiiii xxxxxxX ,5,4,3,2,1, ..........,,,,,= and { }niiiiiii yyyyyyY ,5,4,3,2,1, ..........,,,,,= be a target material and matching material respectively. PIM is mathematically expressed as ∑∑ −= −= n i i n i i yxYXPIM 11 ),( (14) The best materials, whose Minimum Performance Index Measure(MPIM) value matches the target materials is defined by
  • 5. Computer Science & Information Technology (CS & IT) 161 MPIM(X,Y) =       − ∑∑ == = n i i n i i m j yx 11 1 min (15) 4.1 Normalized Measures Normalization is the mechanism for transforming measure values to another range normally -1 to +1. Normalization of PIM and SM values of similar materials are required to make relative comparison in decision making on material selection. Normalized Performance Index Measure is defined to transform PIM values to unique range. NPIM= )( 1 PIMXYMax PIMXYj − (16) where, PIMXY is the performance index value of X and Y. and Max(PIMXY) is the maximum Performance Index value among the similar materials. NSM= )( 1 YMax Yj − (17) where Yj is the distance/similarity measure values, Max(Y) is the maximum similarity value among the materials similarity measure values. 4.2 Algorithm For Similar Material Selection Input : Target Material (TM): { }niiiiii xxxxxX ,5,4,2,1, ..........,,,,= ,Materials Database(MD) Output: A List of materials Material matches the target materials Method: 1. Start 2. PMX = Call PERFORMANCE_ MEASURE_ FUNCTION (X) //Calculates ∑= n i ix 1 3. Y [] = Call SIMILARITY-FUNCTION( SM[],TM, MD) // Function gives top ten similarity materials 4. 5. for j=1 to m do 6. Begin 7. PMY[j] = Call PERFORMANCE_ MEASURE_ FUNCTION ( Y [j]) // Calculates ∑− n j jy 1 8. // PMY Performance Measure of Y 9. PIMX Y[j]) = PMX - PMY[j] // PIM = Performance Index Measure 10. End 11. 12. For j=1 to m do // Normalization of Measured Values 13. 14. Begin 15. NPIMXY[j] = )( 1 PIMXYMax PIMXYj − // NPIMXY = Normalized Performance Index of X,Y
  • 6. 162 Computer Science & Information Technology (CS & IT) 16. NY[j] = )( 1 YMax Yj − // NY = Normalized Measured Value 17. End 18. 19. //Selection of a Material 20. 21. MNPIMXY = 0 22. for j=1 to m do 23. if (NPIMXY[j] > MNPIMXY) then // MNPIMXY = Max of Normalized Performance Index MNPIMXY = PIMXY[j] 24. End 25. Stop 5. EXPERIMENTAL RESULTS In this experiment, Materials Database(MD) consisting of 5670 data sets of metal, ceramic and polymer type is considered for the selection of materials that very similar to target material features set, say X. A target material has several attributes that include maximum of 25 various kinds of properties as shown in table 2. Table 2 . Target Material Properties with values TS YS MP CS IS HRDS TM CS MUT DNSTY ELGN MACHN CE 122 230 921 80 5 765 100 3 52 0.90 81 3 5 TCE FS WA EI CR CORR SM CH EXTRN MOLD CAST MANFT 3 3 1 1 2 2 5 4 5 3 5 2 Each similarity function is applied for measuring the logical separation between the target and a material data set in the database. Top ten materials that are very closer to the target materials are considered and the relative Performance Index Measure(PIM) with all the materials data set is computed for material selection. A material whose behaviour similarity is closure to the behaviour of the target material is considered as the selected material. The procedure is experimented with each similarity/distance function listed in the table 1. Experimental results corresponding to each function are depicted in figure from 1 to 13. In each graphical representation, similar group of materials are plotted along the X-axis and the normalized similarity measure values and the normalized performance index measure values are plotted along the Y-axis. The least normalized logical distance(The highest similarity) and the highest normalized performance index measure are considered as materials selection measures. Several similarity measure functions are deployed for material selection. Materials selected by different methods, their normalized similarity measure values and normalized performance index measure are tabulated in the table 3. Graphical representation of mined data are depicted in the figure 14. From the figure14, it is depicted that material M-7 is the ideal material for the target material since this has been commonly selected by six distance /similarity functions.
  • 7. Computer Science & Information Technology (CS & IT) 163 Figure 1. Performance measures of Similar Group of materials retrieved throug Euclidian distance function Figure 2. Performance measures of Similar Group of materials retrieved throug Sqauired Eucledian distance function Figure 3 .Performance measures of Similar Group of materials retrieved throug City Block Ditance function Figure 4 .Performance measures of Similar Group of materials retrieved throug Mickowski Distance Function Figure 5 .Performance measures of Similar Group of materials retrieved throug Chebyshev Distance function Figure 6 .Performance measures of Similar Group of materials retrieved through exponetial simialrity Measure function
  • 8. 164 Computer Science & Information Technology (CS & IT) Figure 7 .Performance measures of Similar Group of materials retrieved throug Mickowski Distance Function. Figure 8. Performance measures of Similar Group of materials retrieved throug camberrral Function. Figure 9.Performance measures of Similar Group of materials retrieved through Max- Min Similarity Function. Figure 10 . Performance measures of Similar Group of materials retrieved through Geometric Average Minimum Function. Figure 11. Performance measures of Similar Group of materials retrieved through Geometric Average Maximum Function. Figure 12 . Performance measures of Similar Group of materials retrieved through Cosine Appritute Function.
  • 9. Computer Science & Information Technology (CS & IT) 165 Figure 13 . Performance measures of Similar Group of materials retrieved through Correlation Coeefficent Method Table 3 . Material selection through similarity measure functions and on normalized Performance Index Measure Methods Similarity Measures Selected Material Normalized Measured Values Normalized Performance Index 1 Euclidean Distance M-7 0.072133127 0.993781934 2 Squared Euclidean Distance M-7 0.138720149 0.993781934 3 City Block Distance M-7 0.072336620 0.994697549 4 Minkowski Distance, M-7 0.02788523 0.993781934 5 Chebyshev Distance M-262 0.00221239 0.67139835 6 Exponential Similarity Measure M-7 0.072559289 0.987868806 7 P-Inverse Similarity M-7 0.068632226 0.993781934 8 Camberra Similarity M-26 0.24896733 0.949039872 9 Max-Min Similarity Measure M-10 0.67304755 0.988623333 10 Geometric Average Minimum Similarity M-10 0.346172027 0.985798568 11 Geometric Average Maximum Similarity M-10 0.006477780 0.988623333 12 Cosine Amplitude Similarity M-290 0.92800612 0.815339607 13 Correlation Coefficient Similarity M-290 0.91467073 0.81524154
  • 10. 166 Computer Science & Information Technology (CS & IT) Figure 14. Comparison of all the similarity measure functions with normalized similarity and Performance Index Measure(PIM) values 6. CONCLUSION AND FUTURE SCOPE In this paper similarity measures functionalities are used to select a required material, Before applying the similarity measure functions the dataset is properly classified into metals, polymers and ceramics using some techniques of classifications in data mining. Hence proposed a model for retrieving selective information from the vast amount of data using similarity measure functions. Systematic study of various similarity measure functions on engineering materials database is done. The material which gives a high performance index value and less distance between selected and target material is determined as an ideal material. The proposed approach is suitable for the selection of engineering materials that suit for the required materials design requirement specifications. Thus can overcome the haphazardly selecting the materials in materials design and manufacturing industries. Further it proposed to find a model which would be the exact ideal distance or similarity function for the selection of materials that suit for complex applications in real empirical world. and also the computational complexity such as time complexity and space complexity can be examined for the model. ACKNOWLEDGEMENTS This work has been supported by the University Grant Commission(UGC), India under Major Research Project entitled “Scientific Knowledge Discovery Systems (SKDS) For Advanced Engineering Materials Design Applications” vide reference F.No. 34-992008 (SR), 30th December 2008. The authors gratefully acknowledge the support and thank the authors. REFERENCES [1]. Doreswamy, Sharma, S.C.: "An expert decision support system for engineering materials selections and their performance classifications on design parameters". International Journal of Computing and Applications (ICJA)vol.1, pp.17-34,2006. [2]. Doreswamy.: "Machine learning classification and selection model for engineering materials design applications". Published in the proceedings of International Conference on Systemics: Cybernetics and Informatics, ICSCI pp.no. , INDIA 2008.
  • 11. Computer Science & Information Technology (CS & IT) 167 [3]. Eli Shechtman & Michal Irani .:" Matching Local Self-Similarities across Images and Videos" Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE ,pp.1 - 8 .17-22 June 2007 . [4]. Hariri B B, Abolhassani H, and Khodaei A.: "A new Structural Similarity Measure for Ontology Alignment", in Proc. SWWS, 2006, pp.36-42. [5]. Hongmei Wang, Sanghyuk Lee, and Jaehyung Kim.: " Quantitative Comparison of Similarity Measure and Entropy for Fuzzy Sets" ADMA '09 Proceedings of the 5th International Conference on Advanced Data Mining and Applications, pp. 688–695, 2009. [6]. Jiawei Han, Micheeline Kamber: Data Mining Concepts & Techniques, Morgan Kaufamann Pubisher 2009. [7]. Jouni Sampo and Pasi Luukka .: "Similarity Classifier with Generalized Mean; Ideal " Fuzzy Systems And Knowledge Discovery Lecture Notes in Computer Science, 2006, Volume 4223/2006, 1140- 1147. [8]. Kutz, Myer , Handbook of Materials Selection.. John Wiley & Sons,.2002. [9]. Michel F Ashby.: Materials Selection in Mechanical Design, Elsevier 3rd edt 2005. [10]. Sung-Hyuk Cha, "Comprehensive Survey on Distance/Similarity Measures between Probability Density Function", International Journal of Mathematical Models and Methods in Applied Sciences, Issues 4, Volume 1, 2007. [11]. Schiele O. H.: Forschung und Entwicklung im Maschinenbauauf dem Weg in die nformationsgesellschaft (in German,translation by the author), Bundesministerium für Bildung, Wissenschaft, Forschung und Technologie, 1995. [12]. Bohm C., Berchtold S., Kriegel H.-P., Michel U.: Multidimensional Index Structures in Relational Databases, Journal of Intelligent Information Systems (JIIS) 15 (1), 2000. [13]. Hrubiak.R., George,L., Saxena,S.K., Rajan, K., "A Materials Database for Exploring Material Properties", Journal of Materials, Vol. 61, pp. 59-62, 2009. [14]. Jarolan Mackerle, Nanomaterials, Nanomacanics and Finite elements: a bibliography (1995-2004), Modeling Simulation Materials Science Engineering,13, pp. 123-158,2005 [15]. Burcu Yılmaz and Mehmet Gokturk, “Interactive Data Mining For Molecular Graphs”, Journal of Automated Methods and Management in Chemistry, Vol. 2009, pp.1-12,2009. [16]. Sung-Hyuk Cha, "Comprehensive Survey on Distance/Similarity Measures between Probability Density Function", International Journal of Mathematical Models and Methods in Applied Sciences, Issues 4, Volume 1, 2007. [17]. Randall Wilson D, Tony R. Martinez, Improved Heterogeneous Distance Functions Journal of Artificial Intelligence Research ,Vol. 6, pp. 1-34, 1997. [18]. Kutz, Myer , Handbook of Materials Selection.. John Wiley & Sons,.2002. [19]. Westbrook, J W, MATERIALS DATA ON THE INTERNET, Data Science Journal, Volume 2, 25 November, 2003.
  • 12. 168 Computer Science & Information Technology (CS & IT) Authors Doreswamy received B.Sc degree in Computer Science and M.Sc Degree in Computer Science from University of Mysore in 1993 and 1995 respectively. After completion of his Post-Graduation Degree, he subsequently joined and served as Lecturer in Computer Science at St.Joseph’s College, Bangalore from 1996-1999 and at Yuvaraja’s College, a constituent college of University of Mysore from 1999-2002. Then he has elevated to the position Reader in Computer Science at Mangalore University in year 2003. He was the Chairman of the Department of Post-Graduate Studies and Research in Computer Science from 2003-2005 & from 2008-2009 and served at varies capacities in Mangalore University and at present he is the Chairman of Board of Studies in Computer Science of Mangalore University. His areas of research interests include Data Mining and Knowledge Discovery, Artificial Intelligence and Expert Systems, Bioinformatics, Molecular Modeling and Simulation, Computational Intelligence, Nanotechnology, Image Processing and Pattern Recognition. He has been granted a Major Research project entitled “Scientific Knowledge Discovery Systems(SKDS) for advanced Engineering Materials Design Applications” from the funding Agency University Grant Commission, New Delhi, INDIA. He has published about 30 contributed peer reviewed papers at National/International Journals and Conferences. He received SHIKSHA RATTAN PURSKAR for his outstanding achievements in the year 2009 & RASTRIYA VIDYA SARAWATHI AWARD for outstanding achievement in chosen field of activity in the year 2010. Manohar M G received B.Sc and M.C.A Degree from Karnataka University Dharwad in 1999 and 2002 respectively. After completion of his Post-Graduation Degree, he subsequently joined Honeywell Software technology Solutions Pvt. Ltd. Bangalore India and served for an around a year as software engineer. Then in the year 2003 he joined Mangalore University and currently working as System Programmer/Analyst. Meanwhile he received the MPhil Degree in Computer Science in the year 2011 and continuing the research work for his Ph.D degree in Computer Science under the guidance of Dr. Doreswamy in the Department of Post-Graduate Studies and Research in Computer Science, Mangalore University Hemanth K S received B.Sc degree and MCA degree in the years 2006 and 2009 respectively. Currently working as Project Fellow of UGC Major Research Project and is working towards his Ph.D degree in Computer Science under the guidance of Dr. Doreswamy in the Department of Post-Graduate Studies and Research in Computer Science, Mangalore University.