SlideShare a Scribd company logo
International Association of Scientific Innovation and Research (IASIR)
(An Association Unifying the Sciences, Engineering, and Applied Research)
International Journal of Emerging Technologies in Computational
and Applied Sciences (IJETCAS)
www.iasir.net
IJETCAS 14-347; © 2014, IJETCAS All Rights Reserved Page 174
ISSN (Print): 2279-0047
ISSN (Online): 2279-0055
Improved BSP Clustering Algorithm for Social Network Analysis
1
T.B.Saranya Preetha, 2
G.T.Prabavathi
1
Research Scholar, 2
Assistant Professor in Computer science
Department of Computer Science
Gobi Arts and Science College
Gobichettipalayam, Tamil Nadu 638453, India
_________________________________________________________________________________________
Abstract: A social network is a social structure of individuals, who are linked (directly or indirectly to each
other) depending on a common relation of interest, e.g. friendship, trust, etc. Social network analysis is the
study of social networks to recognize the structure and behavior of social actor. Social network analysis has
gained importance due to its usage in different applications - from product marketing to search engines and
organizational dynamics. The conventional clustering approaches group objects based only on objects’
similarity which does not suite for social network data. Social network objects should be grouped into classes
depending on their links as well as their attributes. In this paper, a clustering algorithm based on BSP (Business
System Planning) clustering with Principal Component Analysis (PCA) technique is proposed. This algorithm
produces significant improvement in clusters, as it groups objects in a social network into different classes
based on their links and identify relation among classes.
Keywords: Social Network Analysis, BSP Clustering, PCA
__________________________________________________________________________________________
I. INTRODUCTION
Social Networks are graph structures whose nodes or vertices represent people or other entities embedded
in a social context, and whose edges represent interaction or collaboration between these entities [10]. Social
networks are highly dynamic, evolving relationships among people or other entities. This dynamic property of
social networks makes studying these graphs a challenging task. A lot of research has been done recently to
study different properties of these networks. Such complex analysis of large, heterogeneous, multi-relational
social networks has led to an interesting field of study known as Social Network Analysis (SNA).
Social network analysis, which can be applied to analyze the structure and the property of personal
relationship, web page links, and the spread of messages, is a research field in sociology. Recently social
network analysis has attracted increasing attention in the data mining [1] research community. From the
viewpoint of data mining, a social network is a heterogeneous and multi-relational dataset represented by graph
[3].Research on social network analysis in the data mining community includes following areas: clustering
analysis [1], classification [8], link prediction [7], Page Rank [9] and Hub-Authority [4] in web search engine.
This paper deals with the social network cluster analysis. In the second section, a social network clustering
based on Business System Planning (BSP) clustering algorithm is given. The algorithm can group objects in a
social network into different classes based on their links, and it can also identify the relations among classes. In
the third section, importance of Principle Component Analysis technique is dealt. Fourth section describes the
methodology of BSP clustering. Section five shows how BSP clustering is improved by applying the PCA
reduction technique.
II. BSP Clustering in Social Networks
There has been extensive research work on clustering in data mining. Traditional clustering algorithms [1]
divide objects into classes based on their similarity. Objects in a class are similar to each other and are very
dissimilar from objects in different classes. Social network clustering analysis, which is different from
traditional clustering problem, divides objects into classes based on their links as well as their attributes. The
biggest challenge of social network clustering analysis is how to divide objects into classes based on objects’
links, and to find algorithms that can meet this challenge.
The BSP clustering algorithm [11] is proposed by IBM. It is designed to define information architecture for
the firm in business system planning. This algorithm analyses business process and their data classes, cluster
business process into subsystems, and define the relationship of these sub-systems.
Basically BSP clustering algorithm uses objects (business processes) and links among objects (data classes) to
make clustering analysis. Similarly social network also includes objects and links among these objects. In view
of the same pre-condition, the BSP clustering algorithm can be used in social network clustering analysis.
According to graph theory, social network is a directed graph composed by objects and their relationship. Figure
1 shows a sample of social network, the circle in the figure represents an object; the line with arrow is an edge
T. Preetha et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(2), March-May, 2014, pp. 174-
178
IJETCAS 14-347; © 2014, IJETCAS All Rights Reserved Page 175
of the graph, and it represents directed link between two objects, so a social network is a directed graph. In
figure 1, Let Oi be an object in social network (i = 1...m ), let Ej which means directed link between two objects,
be a directed edge of the graph ( j = 1...n ).After definition of objects and directed edges, reachable relation
between two objects is also defined.
Figure 1: A Sample of Social Network
There are two kinds of reachable relation among objects, shown as following:
1) One-step reachable relation: if there is directed link from Oi to Oj through one and only one directed edge,
then Oi to Oj is a one-step reachable relation. For instance in Figure 1 there is a directed link from O1 to O2
through the directed edge E1, O1 to O2 is one-step reachable relation.
2) Multi-steps reachable relation: if there is directed link from O i to O j through two or more directed edges,
then O i to O j is a multi-steps reachable relation. There is a directed link from O1 to O4 through directed edges
E1 and E5, then O1 to O4 is a 2-steps reachable relation. After these relations, BSP analyses a social network by
generating edge pointed matrix and calculates one-step and multi-step reachable matrix between objects.
III. PRINCIPAL COMPONENT ANALYSIS
Principal component analysis (PCA) [13] in multivariate statistics is widely adopted as an effective
unsupervised dimension reduction method and is extended in many different directions. The main justification
of dimension reduction is that PCA uses singular value decomposition (SVD) which gives the best low rank
approximation to original data. However, this essentially noise reduction perspective alone is inadequate to
explain the effectiveness of PCA. In this paper, we provide a new perspective of PCA based on its close
relationship with the BSP clustering algorithm. We show that the principal components are actually relaxed
cluster membership indicators.
The central idea of Principal Component Analysis (PCA) is to reduce the dimensionality of the data set while
retaining as much as possible the variation in the data set. Principal components (PC’s) are linear
transformations of the original set of variables. PC’s are uncorrelated and ordered so that the first few PC’s
contain most of the variations in the original data set [Jolliffe, 1986].
PCA is a classic tool for analyzing large scale multivariate data. It seeks linear combinations of the data
variables (often called factors or principal components) that capture a maximum amount of variance.
Numerically, PCA only amounts to computing a few leading eigenvectors of the data’s covariance matrix, so it
can be applied to very large scale data sets. One of the key shortcomings of PCA however is that these factors
are linear combinations of all variables; that is, all factor coefficients (or loadings) are non-zero. This means that
while PCA facilitates model interpretation and visualization by concentrating the information in a few key
factors, the factors themselves are still constructed using all observed variables. In many applications of PCA,
the coordinate axes have a direct physical interpretation; in finance or biology for example, each axis might
correspond to a specific financial asset or gene. In such cases, having only a few nonzero coefficients in the
principal components would greatly improve the relevance and interpretability of the factors. In sparse PCA,
there is a trade-off between the two goals of expressive power (explaining most of the variance or information in
the data) and interpretability (making sure that the factors involve only a few coordinate axes or variables).
When PCA is used as a clustering tool, sparse actors identify the clusters with the action of only a few variables.
The original n data points in m-dimensional space are contained in the data matrix In
general data is not centered around the origin. The centered data matrix , where and
. The covarance matrix is given by
The principal eigenvectors o are the principal directions of the data Y. The principal eigenvectors of
the Gram matrix are the principal components; entries of each are the projected values of data points on
the principal direction . and are related via: : where is the eigenvalue of the covarance
matrix . Thus after applying PCA to the data, the principal components are identified from the data to be
clustered. Thus only the important and the vital data are going to be clustered. In this proposed approach, the
social network data is analyzed based on the BSP and PCA.
O3
O1
E3
E2
E5E1 E7
O5
E6
O4
E10 O6
E9
E8
E4
O2
T. Preetha et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(2), March-May, 2014, pp. 174-
178
IJETCAS 14-347; © 2014, IJETCAS All Rights Reserved Page 176
IV METHODOLOGY
a) Generate edge creation matrix and edge pointed matrix
First according to the objects and edges in the graph, define two matrixes Lc(m× n matrix) and Lp(m× n
matrix). Lc means the creation of edges and Lp denote the pointed relations of edges. In the matrix, Lc (i, j) =1
denotes object Oi connects with the tail of edge Ej, which means that object Oi creates the directed edge Ej . Lc
(i, j)=0 denotes object Oi has no connection with the tail of edge Ej, which means Ej isn’t created by object Oi.
In the matrix, Lp(i, j) =1 denotes object Oi connects with the head of edge Ej, which means object Oi is pointed
to by the directed edge Ej. Lp (i, j)=0 denotes Oi doesn’t connect with the head of edge Ej, which means Ej
doesn’t point to Oi. After defining the matrix reachable matrices are calculated.
b) Calculate one-step reachable matrix between objects
After the definition of Lc and Lp, one-step reachable matrix between objects is calculated through the
following equation.
V is Boolean sum and ^is Boolean product.
G(i, j)=1 means Oi to Oj is a one-step reachable relation, G(i, j) = 0 means there is no one-step reachable
relation from Oi to Oj . All one-step reachable relation between objects is calculated.
c) Calculate multi-steps reachable matrix between objects
Besides one-step reachable relation, there are multi-steps reachable relations between objects. According to
graph theory and the BSP clustering algorithm, multi-steps reachable matrix G2, G3, G4,…., Gm−1 is
calculated. Following equations show the calculation of multi-steps reachable matrix:
These matrices include 2-steps, 3-steps… m-1-steps reachable relations between objects.
Now n-steps reachable relation between two objects through G2G3G4...Gm−1 is calculated.
d) Calculate reachable matrix
The algorithm considers whether reachable relations exist between two objects, but do not care these relations
are one-step or multi-steps, so reachable matrix R based on G,G2 ,G3 ,G4 ,...,Gm−1 is calculated as
R=IVGVG2...VGm−1 where V is Boolean sum and I is unit matrix. R(i, j) = 1 means reachable relation
exists from Oi to Oj, but the reachable relations existing in matrix R is not mutual, for instance R(i, j) = 1 means
reachable relation exists from Oi to Oj, but it doesn’t mean reachable relation exists from Oj to Oi. Mutual
reachable relations between two objects are important in a social network, so mutual reachable matrix based on
R is calculated.
e) Mutual reachable matrix and cluster generation
The mutual reachable matrix can be calculated through Q=R^RT where ^means Boolean product, RT means
Reachable Transpose matrix.
In the matrix Q(i, j) = 1 indicates there are mutual reachable relation between Oi and Oj . In a social network
if two objects that have mutual reachable relation, they should belong to the same class, thus cluster based on Q
is generated.
Thus according to mutual reachable matrix Q, a social network is divided into classes based on strong sub
matrices in Q or adjusted Q. If all elements in a sub-matrix of Q are 1, then that matrix is a strong sub matrix.
f) Identify relationships among classes
After clustering social network nodes, there is a need to identify relationship among clusters. This can be
done through generated clusters and one-step reachable matrix G. If there is one-step reachable relation between
two objects in different classes, directed links exist between classes. Through G all relations among classes is
identifed.
After performing these steps, a social network is divided into classes. Social network clustering analysis
algorithm can be given as Q− > Ck means generating clusters through mutual reachable matrix Q, and (C
k ,Q )- >Relation(Ck) means identifying relationships among clusters based on clusters and one-step reachable
matrix G.
V. PROPOSED ALGORITHM
BSP clusters divide a social network into different classes according to objects in the social network and links
between objects, and it also can identify relations among clusters. Main disadvantage of this algorithm is that it
uses matrices to store edges and reachable relations. In a real social network these matrices will be very huge,
T. Preetha et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(2), March-May, 2014, pp. 174-
178
IJETCAS 14-347; © 2014, IJETCAS All Rights Reserved Page 177
which can’t be loaded into main memory. But these matrices are normally sparse, so this feature is used in our
work to reduce the size of the matrix.
In this paper we propose a modification of existing BSP algorithm using Link list data structure. Using this
data structure we have overcome the shortcomings of BSP. Following procedure is required for converting this
sparse matrix in to link list:
A matrix is a two-dimensional data object made of m rows and n columns, therefore having m, n values.
When m=n, it is a square matrix. The most natural representation is to use two-dimensional array A[m][n] and
access the element of ith
row and jth
column as A[i][j]. If large number of elements of the matrix are zero
elements, then it is called a sparse matrix. Representing a sparse matrix by using a two-dimensional array leads
to the wastage of a substantial amount of space. Therefore, an alternative representation must be used for sparse
matrices.
One such representation is to store only non- zero elements along with their row positions and column
positions. That means representing every non-zero element by using triples(i,j,value), where i is a row position
and j is a column position, and store these triples in a linear list. It is possible to arrange these triples in the
increasing order of row indices, and for the same row index in the increasing order of column indices. Each
triple can be represented using a node having four fields as shown:
Struct snode{Int row,col,val;
Struct snode *next;}
The algorithm to reduce the dimensionality of the matrix is given below:
Input
Edge creation Lists
Edge pointed List
Begin
Swap(Lu)
For k=3 to m do
Gk-1=Gk-2*G
Edge pointed Lists
Begin
Swap(Lu)
For k=3 to m do
Gk-1=Gk-2*G
R=IVGVG2……Gm-1
Swap(R)
Qk ->C
( Ck,Q)->Relation(Ck)
End
End
1. In order to add two sparse matrices represented using the sorted linked lists, the lists are traversed until the
end of one of the lists is reached.
2. In the process of traversal, the row indices stored in the nodes of these lists are compared. If they don't
match, a new node is created and inserted into the resultant list by copying the contents of a node with a
lower value of row index. The pointer in the list containing a node with a lower value of row index is
advanced to make it point to the next node.
3. If the row indices match, column indices for the corresponding row positions are compared. If they don't
match, a new node is created and inserted into the resultant list by copying the contents of a node with a
lower value of column index. The pointer in the list containing a node with a lower value of column index
is advanced to make it point to the next node.
4. If the column indices match, a new node is created and inserted into the resultant list by copying the row
and column indices from any of the nodes and the value equal to the sum of the values in the two nodes.
5. After this, the pointers in both the lists are advanced to make them point to the next nodes in the respective
lists. This process is repeated in each iteration. After reaching the end of any one of the lists, the iterations
come to an end and the remaining nodes in the list whose end has not been reached are copied, as it is in the
resultant list.
We performed the above mentioned steps to reduce the dimensionality of the matrix representation, so that, the
input to the BSP clustering is reduced in large number and improves the efficiency of the clusters.
T. Preetha et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(2), March-May, 2014, pp. 174-
178
IJETCAS 14-347; © 2014, IJETCAS All Rights Reserved Page 178
VI. EXPERIMENTAL RESULTS
The performance of the proposed social network using the PCA and BSP clustering is evaluated on the
Zachary's Karate Club dataset. This data set is a social network of friendships links between 34 members of a
karate club at a US university in the 1970. The values ‘1’ in the data set denotes that there is relationship
between the two members in the data set. Alternatively, the values ‘0’ denotes that there is no relationship
among the members in the dataset. The proposed PCA and BSP clustering is compared with the existing BSP
clustering algorithm. From the dataset, 1156 links were extracted. But after applying PCA technique it was
reduced to 700 links which was provided as data to the BSP clustering algorithm. The reachable matrix is
obtained from the data set. G34 reachable matrix is computed. Then, clusters are obtained. Cluster 1 and cluster
2 is obtained. If, there is relation between the two clusters, then a relation is formed between them. The
clustering accuracy of the data sets is obtained. Moreover, the time taken by the proposed approach for
clustering the data set is also calculated and is displayed.
The proposed PCA and BSP clustering is compared with the BSP clustering algorithm. The performance of
the proposed system is evaluated based on the parameters like clustering accuracy and classification time. The
accuracy in clusters between BSP and PCA with BSP is shown in Figure.1. It is observed from the graph that
the classification accuracy of the proposed approach using PCA and BSP approach is high when compared to
the BSP approach. Moreover the time taken by proposed approach using PCA and BSP approach is very less
when compared to the existing BSP approach due to the reduction in matrix size.
74
76
78
80
82
84
86
88
BSP clustering PCA and BSP
Clustering
Clustering Approaches
Accuracy(%)
Accuracy
Figure 1. Cluster Comparison
The data set taken for this experiment is small in size. Our future work is to check whether the same
efficiency and time reduction exists for a large data set and to combine other reduction techniques with BSP.
VII CONCLUSION
This paper proposes an improved social network clustering technique based on PCA and BSP clustering
algorithm. The main idea of this approach is introducing the PCA approach before applying BSP clustering. The
main advantage of the PCA technique is that the main and the principal components are identified. Thus after
the PCA technique is applied, the important components are given to the BSP clustering. The performance of
the proposed approach is evaluated using the Zachary’s karate club dataset. The parameters used for the
evaluation are clustering accuracy and the time taken for clustering. From the experimental observation, it is
clearly observed that the proposed BSP clustering with PCA shows significant performance in terms of accuracy
and classification time. Thus the proposed approach is very significant in providing better social network
analysis.
REFERENCES
[1] Gao X, Wu S, Yu B. “Management Information System. Beijing: Economy and Management Press (in Chinese)”, 2002.
[2] Han J, Kamber M. “Data Mining: Concepts and Techniques 2nd edition San Francisco: The Morgan Kaufmann Publishers, 2006.
[3] W. W. Zachary, “An information flow model for conflict and fission in small groups”, Journal of Anthropological Research 33,
452-473 (1977).
[4] Krebs V. “Mapping networks of terrorist cells”, Connections, Vol. 24, pages 43-52, 2002.
[5] Kubica J, Moore A and Schneider J. “Tractable Group Detection on Large Link Data Sets”, Proceeding 3rd IEEE international
conference on data mining, Melbourne, FL, pages 573-576, 2003.
[6] Liben-Nowell D and Kleinberg J. “The Link prediction problem for social networks”, Proceeding 2003 international conference
on information and knowledge management, New Orleans, LA, pages 556-559, 2003.
[7] Lu Q and Getoor L. “Link-based classification” Proceeding 2003 international conference on machine learning, Washington DC,
pages 496-503, 2003.
[8] Page L, Brin S, Motwani R and Winograd T. “The PageRank citation ranking: Bring order to the web. Technical report, Stanford
University”, 1998.
[9] The Link Prediction Problem for Social Networks (2003) David Liben-Nowell, Jon Kleinberg, 556 – 559.
[10] Communications of the IIMA (2007) Volume 7 Issue 4 “Social Network Analysis Based on BSP Clustering Algorithm” Gong
Yu School of Business Administration China University of Petroleum.
[11] Algorithms by thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, Prentice-Hall India (1999).
[12] Yeung, K. Y. and Ruzzo, W. L., “An empirical study of Principal Component Analysis for clustering gene expression data”,
2001.
[13] Jiangtao Qiu, Zhangxi Lin, Changjie Tang and Shaojie Qiao, “Discovering Organizational Structure in Dynamic Social
Network”, Ninth IEEE International Conference on Data Mining, 2009.

More Related Content

What's hot

G1803054653
G1803054653G1803054653
G1803054653
IOSR Journals
 
Web Based Fuzzy Clustering Analysis
Web Based Fuzzy Clustering AnalysisWeb Based Fuzzy Clustering Analysis
Web Based Fuzzy Clustering Analysis
inventy
 
Semantic Annotation Framework For Intelligent Information Retrieval Using KIM...
Semantic Annotation Framework For Intelligent Information Retrieval Using KIM...Semantic Annotation Framework For Intelligent Information Retrieval Using KIM...
Semantic Annotation Framework For Intelligent Information Retrieval Using KIM...
dannyijwest
 
A comprehensive survey of link mining and anomalies detection
A comprehensive survey of link mining and anomalies detectionA comprehensive survey of link mining and anomalies detection
A comprehensive survey of link mining and anomalies detection
csandit
 
An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...
An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...
An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...
IOSR Journals
 
A PROCESS OF LINK MINING
A PROCESS OF LINK MININGA PROCESS OF LINK MINING
A PROCESS OF LINK MINING
csandit
 
Community detection of political blogs network based on structure-attribute g...
Community detection of political blogs network based on structure-attribute g...Community detection of political blogs network based on structure-attribute g...
Community detection of political blogs network based on structure-attribute g...
IJECEIAES
 
IRJET- Evidence Chain for Missing Data Imputation: Survey
IRJET- Evidence Chain for Missing Data Imputation: SurveyIRJET- Evidence Chain for Missing Data Imputation: Survey
IRJET- Evidence Chain for Missing Data Imputation: Survey
IRJET Journal
 
2014 USA
2014 USA2014 USA
2014 USALI HE
 
Graph-based Analysis and Opinion Mining in Social Network
Graph-based Analysis and Opinion Mining in Social NetworkGraph-based Analysis and Opinion Mining in Social Network
Graph-based Analysis and Opinion Mining in Social Network
Khan Mostafa
 
IRJET- Foster Hashtag from Image and Text
IRJET-  	  Foster Hashtag from Image and TextIRJET-  	  Foster Hashtag from Image and Text
IRJET- Foster Hashtag from Image and Text
IRJET Journal
 
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
cscpconf
 
Incentive Compatible Privacy Preserving Data Analysis
Incentive Compatible Privacy Preserving Data AnalysisIncentive Compatible Privacy Preserving Data Analysis
Incentive Compatible Privacy Preserving Data Analysis
rupasri mupparthi
 
Semantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with IdiomsSemantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with Idioms
Waqas Tariq
 
Cross Domain Data Fusion
Cross Domain Data FusionCross Domain Data Fusion
Cross Domain Data Fusion
IRJET Journal
 
Answering approximate-queries-over-xml-data
Answering approximate-queries-over-xml-dataAnswering approximate-queries-over-xml-data
Answering approximate-queries-over-xml-data
Shakas Technologies
 
Predicting_new_friendships_in_social_networks
Predicting_new_friendships_in_social_networksPredicting_new_friendships_in_social_networks
Predicting_new_friendships_in_social_networksAnvardh Nanduri
 
Ijarcet vol-2-issue-7-2252-2257
Ijarcet vol-2-issue-7-2252-2257Ijarcet vol-2-issue-7-2252-2257
Ijarcet vol-2-issue-7-2252-2257Editor IJARCET
 

What's hot (20)

G1803054653
G1803054653G1803054653
G1803054653
 
Web Based Fuzzy Clustering Analysis
Web Based Fuzzy Clustering AnalysisWeb Based Fuzzy Clustering Analysis
Web Based Fuzzy Clustering Analysis
 
Semantic Annotation Framework For Intelligent Information Retrieval Using KIM...
Semantic Annotation Framework For Intelligent Information Retrieval Using KIM...Semantic Annotation Framework For Intelligent Information Retrieval Using KIM...
Semantic Annotation Framework For Intelligent Information Retrieval Using KIM...
 
A comprehensive survey of link mining and anomalies detection
A comprehensive survey of link mining and anomalies detectionA comprehensive survey of link mining and anomalies detection
A comprehensive survey of link mining and anomalies detection
 
An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...
An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...
An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...
 
A PROCESS OF LINK MINING
A PROCESS OF LINK MININGA PROCESS OF LINK MINING
A PROCESS OF LINK MINING
 
Community detection of political blogs network based on structure-attribute g...
Community detection of political blogs network based on structure-attribute g...Community detection of political blogs network based on structure-attribute g...
Community detection of political blogs network based on structure-attribute g...
 
IRJET- Evidence Chain for Missing Data Imputation: Survey
IRJET- Evidence Chain for Missing Data Imputation: SurveyIRJET- Evidence Chain for Missing Data Imputation: Survey
IRJET- Evidence Chain for Missing Data Imputation: Survey
 
2014 USA
2014 USA2014 USA
2014 USA
 
Graph-based Analysis and Opinion Mining in Social Network
Graph-based Analysis and Opinion Mining in Social NetworkGraph-based Analysis and Opinion Mining in Social Network
Graph-based Analysis and Opinion Mining in Social Network
 
IRJET- Foster Hashtag from Image and Text
IRJET-  	  Foster Hashtag from Image and TextIRJET-  	  Foster Hashtag from Image and Text
IRJET- Foster Hashtag from Image and Text
 
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
 
Incentive Compatible Privacy Preserving Data Analysis
Incentive Compatible Privacy Preserving Data AnalysisIncentive Compatible Privacy Preserving Data Analysis
Incentive Compatible Privacy Preserving Data Analysis
 
Semantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with IdiomsSemantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with Idioms
 
A4 elanjceziyan
A4 elanjceziyanA4 elanjceziyan
A4 elanjceziyan
 
Cross Domain Data Fusion
Cross Domain Data FusionCross Domain Data Fusion
Cross Domain Data Fusion
 
Answering approximate-queries-over-xml-data
Answering approximate-queries-over-xml-dataAnswering approximate-queries-over-xml-data
Answering approximate-queries-over-xml-data
 
Predicting_new_friendships_in_social_networks
Predicting_new_friendships_in_social_networksPredicting_new_friendships_in_social_networks
Predicting_new_friendships_in_social_networks
 
Data Convergence White Paper
Data Convergence White PaperData Convergence White Paper
Data Convergence White Paper
 
Ijarcet vol-2-issue-7-2252-2257
Ijarcet vol-2-issue-7-2252-2257Ijarcet vol-2-issue-7-2252-2257
Ijarcet vol-2-issue-7-2252-2257
 

Viewers also liked

Ijetcas14 528
Ijetcas14 528Ijetcas14 528
Ijetcas14 528
Iasir Journals
 
Ijetcas14 508
Ijetcas14 508Ijetcas14 508
Ijetcas14 508
Iasir Journals
 
Ijetcas14 545
Ijetcas14 545Ijetcas14 545
Ijetcas14 545
Iasir Journals
 
Ijetcas14 578
Ijetcas14 578Ijetcas14 578
Ijetcas14 578
Iasir Journals
 

Viewers also liked (10)

Aijrfans14 228
Aijrfans14 228Aijrfans14 228
Aijrfans14 228
 
Ijebea14 244
Ijebea14 244Ijebea14 244
Ijebea14 244
 
Ijetcas14 528
Ijetcas14 528Ijetcas14 528
Ijetcas14 528
 
Ijetcas14 508
Ijetcas14 508Ijetcas14 508
Ijetcas14 508
 
Ijetcas14 545
Ijetcas14 545Ijetcas14 545
Ijetcas14 545
 
Ijetcas14 430
Ijetcas14 430Ijetcas14 430
Ijetcas14 430
 
Ijetcas14 578
Ijetcas14 578Ijetcas14 578
Ijetcas14 578
 
Aijrfans14 257
Aijrfans14 257Aijrfans14 257
Aijrfans14 257
 
Aijrfans14 259
Aijrfans14 259Aijrfans14 259
Aijrfans14 259
 
Ijebea14 287
Ijebea14 287Ijebea14 287
Ijebea14 287
 

Similar to Ijetcas14 347

INTELLIGENT SOCIAL NETWORKS MODEL BASED ON SEMANTIC TAG RANKING
INTELLIGENT SOCIAL NETWORKS MODEL BASED  ON SEMANTIC TAG RANKINGINTELLIGENT SOCIAL NETWORKS MODEL BASED  ON SEMANTIC TAG RANKING
INTELLIGENT SOCIAL NETWORKS MODEL BASED ON SEMANTIC TAG RANKING
dannyijwest
 
A Survey On Link Prediction In Social Networks
A Survey On Link Prediction In Social NetworksA Survey On Link Prediction In Social Networks
A Survey On Link Prediction In Social Networks
April Smith
 
Spatio textual similarity join
Spatio textual similarity joinSpatio textual similarity join
Spatio textual similarity join
IJDKP
 
GRAPH ALGORITHM TO FIND CORE PERIPHERY STRUCTURES USING MUTUAL K-NEAREST NEIG...
GRAPH ALGORITHM TO FIND CORE PERIPHERY STRUCTURES USING MUTUAL K-NEAREST NEIG...GRAPH ALGORITHM TO FIND CORE PERIPHERY STRUCTURES USING MUTUAL K-NEAREST NEIG...
GRAPH ALGORITHM TO FIND CORE PERIPHERY STRUCTURES USING MUTUAL K-NEAREST NEIG...
ijaia
 
Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...
Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...
Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...
gerogepatton
 
Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...
Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...
Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...
gerogepatton
 
Distributed Link Prediction in Large Scale Graphs using Apache Spark
Distributed Link Prediction in Large Scale Graphs using Apache SparkDistributed Link Prediction in Large Scale Graphs using Apache Spark
Distributed Link Prediction in Large Scale Graphs using Apache Spark
Anastasios Theodosiou
 
Influence over the Dimensionality Reduction and Clustering for Air Quality Me...
Influence over the Dimensionality Reduction and Clustering for Air Quality Me...Influence over the Dimensionality Reduction and Clustering for Air Quality Me...
Influence over the Dimensionality Reduction and Clustering for Air Quality Me...
IJAEMSJORNAL
 
Study on Theoretical Aspects of Virtual Data Integration and its Applications
Study on Theoretical Aspects of Virtual Data Integration and its ApplicationsStudy on Theoretical Aspects of Virtual Data Integration and its Applications
Study on Theoretical Aspects of Virtual Data Integration and its Applications
IJERA Editor
 
Bat-Cluster: A Bat Algorithm-based Automated Graph Clustering Approach
Bat-Cluster: A Bat Algorithm-based Automated Graph Clustering Approach Bat-Cluster: A Bat Algorithm-based Automated Graph Clustering Approach
Bat-Cluster: A Bat Algorithm-based Automated Graph Clustering Approach
IJECEIAES
 
Sub-Graph Finding Information over Nebula Networks
Sub-Graph Finding Information over Nebula NetworksSub-Graph Finding Information over Nebula Networks
Sub-Graph Finding Information over Nebula Networks
ijceronline
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)ijceronline
 
Frequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaFrequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social Media
IJERA Editor
 
Frequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaFrequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social Media
IJERA Editor
 
Annotating Search Results from Web Databases
Annotating Search Results from Web Databases Annotating Search Results from Web Databases
Annotating Search Results from Web Databases
Mohit Sngg
 
The Mathematics of Social Network Analysis: Metrics for Academic Social Networks
The Mathematics of Social Network Analysis: Metrics for Academic Social NetworksThe Mathematics of Social Network Analysis: Metrics for Academic Social Networks
The Mathematics of Social Network Analysis: Metrics for Academic Social Networks
Editor IJCATR
 
Clustering in Aggregated User Profiles across Multiple Social Networks
Clustering in Aggregated User Profiles across Multiple Social Networks Clustering in Aggregated User Profiles across Multiple Social Networks
Clustering in Aggregated User Profiles across Multiple Social Networks
IJECEIAES
 
OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...
OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...
OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...
csandit
 
OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...
OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...
OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...
cscpconf
 
Entity Resolution in Large Graphs
Entity Resolution in Large GraphsEntity Resolution in Large Graphs
Entity Resolution in Large Graphs
Apurva Kumar
 

Similar to Ijetcas14 347 (20)

INTELLIGENT SOCIAL NETWORKS MODEL BASED ON SEMANTIC TAG RANKING
INTELLIGENT SOCIAL NETWORKS MODEL BASED  ON SEMANTIC TAG RANKINGINTELLIGENT SOCIAL NETWORKS MODEL BASED  ON SEMANTIC TAG RANKING
INTELLIGENT SOCIAL NETWORKS MODEL BASED ON SEMANTIC TAG RANKING
 
A Survey On Link Prediction In Social Networks
A Survey On Link Prediction In Social NetworksA Survey On Link Prediction In Social Networks
A Survey On Link Prediction In Social Networks
 
Spatio textual similarity join
Spatio textual similarity joinSpatio textual similarity join
Spatio textual similarity join
 
GRAPH ALGORITHM TO FIND CORE PERIPHERY STRUCTURES USING MUTUAL K-NEAREST NEIG...
GRAPH ALGORITHM TO FIND CORE PERIPHERY STRUCTURES USING MUTUAL K-NEAREST NEIG...GRAPH ALGORITHM TO FIND CORE PERIPHERY STRUCTURES USING MUTUAL K-NEAREST NEIG...
GRAPH ALGORITHM TO FIND CORE PERIPHERY STRUCTURES USING MUTUAL K-NEAREST NEIG...
 
Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...
Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...
Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...
 
Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...
Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...
Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neig...
 
Distributed Link Prediction in Large Scale Graphs using Apache Spark
Distributed Link Prediction in Large Scale Graphs using Apache SparkDistributed Link Prediction in Large Scale Graphs using Apache Spark
Distributed Link Prediction in Large Scale Graphs using Apache Spark
 
Influence over the Dimensionality Reduction and Clustering for Air Quality Me...
Influence over the Dimensionality Reduction and Clustering for Air Quality Me...Influence over the Dimensionality Reduction and Clustering for Air Quality Me...
Influence over the Dimensionality Reduction and Clustering for Air Quality Me...
 
Study on Theoretical Aspects of Virtual Data Integration and its Applications
Study on Theoretical Aspects of Virtual Data Integration and its ApplicationsStudy on Theoretical Aspects of Virtual Data Integration and its Applications
Study on Theoretical Aspects of Virtual Data Integration and its Applications
 
Bat-Cluster: A Bat Algorithm-based Automated Graph Clustering Approach
Bat-Cluster: A Bat Algorithm-based Automated Graph Clustering Approach Bat-Cluster: A Bat Algorithm-based Automated Graph Clustering Approach
Bat-Cluster: A Bat Algorithm-based Automated Graph Clustering Approach
 
Sub-Graph Finding Information over Nebula Networks
Sub-Graph Finding Information over Nebula NetworksSub-Graph Finding Information over Nebula Networks
Sub-Graph Finding Information over Nebula Networks
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Frequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaFrequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social Media
 
Frequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaFrequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social Media
 
Annotating Search Results from Web Databases
Annotating Search Results from Web Databases Annotating Search Results from Web Databases
Annotating Search Results from Web Databases
 
The Mathematics of Social Network Analysis: Metrics for Academic Social Networks
The Mathematics of Social Network Analysis: Metrics for Academic Social NetworksThe Mathematics of Social Network Analysis: Metrics for Academic Social Networks
The Mathematics of Social Network Analysis: Metrics for Academic Social Networks
 
Clustering in Aggregated User Profiles across Multiple Social Networks
Clustering in Aggregated User Profiles across Multiple Social Networks Clustering in Aggregated User Profiles across Multiple Social Networks
Clustering in Aggregated User Profiles across Multiple Social Networks
 
OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...
OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...
OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...
 
OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...
OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...
OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTR...
 
Entity Resolution in Large Graphs
Entity Resolution in Large GraphsEntity Resolution in Large Graphs
Entity Resolution in Large Graphs
 

More from Iasir Journals

ijetcas14 650
ijetcas14 650ijetcas14 650
ijetcas14 650
Iasir Journals
 
Ijetcas14 648
Ijetcas14 648Ijetcas14 648
Ijetcas14 648
Iasir Journals
 
Ijetcas14 647
Ijetcas14 647Ijetcas14 647
Ijetcas14 647
Iasir Journals
 
Ijetcas14 643
Ijetcas14 643Ijetcas14 643
Ijetcas14 643
Iasir Journals
 
Ijetcas14 641
Ijetcas14 641Ijetcas14 641
Ijetcas14 641
Iasir Journals
 
Ijetcas14 639
Ijetcas14 639Ijetcas14 639
Ijetcas14 639
Iasir Journals
 
Ijetcas14 632
Ijetcas14 632Ijetcas14 632
Ijetcas14 632
Iasir Journals
 
Ijetcas14 624
Ijetcas14 624Ijetcas14 624
Ijetcas14 624
Iasir Journals
 
Ijetcas14 619
Ijetcas14 619Ijetcas14 619
Ijetcas14 619
Iasir Journals
 
Ijetcas14 615
Ijetcas14 615Ijetcas14 615
Ijetcas14 615
Iasir Journals
 
Ijetcas14 608
Ijetcas14 608Ijetcas14 608
Ijetcas14 608
Iasir Journals
 
Ijetcas14 605
Ijetcas14 605Ijetcas14 605
Ijetcas14 605
Iasir Journals
 
Ijetcas14 604
Ijetcas14 604Ijetcas14 604
Ijetcas14 604
Iasir Journals
 
Ijetcas14 598
Ijetcas14 598Ijetcas14 598
Ijetcas14 598
Iasir Journals
 
Ijetcas14 594
Ijetcas14 594Ijetcas14 594
Ijetcas14 594
Iasir Journals
 
Ijetcas14 593
Ijetcas14 593Ijetcas14 593
Ijetcas14 593
Iasir Journals
 
Ijetcas14 591
Ijetcas14 591Ijetcas14 591
Ijetcas14 591
Iasir Journals
 
Ijetcas14 589
Ijetcas14 589Ijetcas14 589
Ijetcas14 589
Iasir Journals
 
Ijetcas14 585
Ijetcas14 585Ijetcas14 585
Ijetcas14 585
Iasir Journals
 
Ijetcas14 584
Ijetcas14 584Ijetcas14 584
Ijetcas14 584
Iasir Journals
 

More from Iasir Journals (20)

ijetcas14 650
ijetcas14 650ijetcas14 650
ijetcas14 650
 
Ijetcas14 648
Ijetcas14 648Ijetcas14 648
Ijetcas14 648
 
Ijetcas14 647
Ijetcas14 647Ijetcas14 647
Ijetcas14 647
 
Ijetcas14 643
Ijetcas14 643Ijetcas14 643
Ijetcas14 643
 
Ijetcas14 641
Ijetcas14 641Ijetcas14 641
Ijetcas14 641
 
Ijetcas14 639
Ijetcas14 639Ijetcas14 639
Ijetcas14 639
 
Ijetcas14 632
Ijetcas14 632Ijetcas14 632
Ijetcas14 632
 
Ijetcas14 624
Ijetcas14 624Ijetcas14 624
Ijetcas14 624
 
Ijetcas14 619
Ijetcas14 619Ijetcas14 619
Ijetcas14 619
 
Ijetcas14 615
Ijetcas14 615Ijetcas14 615
Ijetcas14 615
 
Ijetcas14 608
Ijetcas14 608Ijetcas14 608
Ijetcas14 608
 
Ijetcas14 605
Ijetcas14 605Ijetcas14 605
Ijetcas14 605
 
Ijetcas14 604
Ijetcas14 604Ijetcas14 604
Ijetcas14 604
 
Ijetcas14 598
Ijetcas14 598Ijetcas14 598
Ijetcas14 598
 
Ijetcas14 594
Ijetcas14 594Ijetcas14 594
Ijetcas14 594
 
Ijetcas14 593
Ijetcas14 593Ijetcas14 593
Ijetcas14 593
 
Ijetcas14 591
Ijetcas14 591Ijetcas14 591
Ijetcas14 591
 
Ijetcas14 589
Ijetcas14 589Ijetcas14 589
Ijetcas14 589
 
Ijetcas14 585
Ijetcas14 585Ijetcas14 585
Ijetcas14 585
 
Ijetcas14 584
Ijetcas14 584Ijetcas14 584
Ijetcas14 584
 

Recently uploaded

Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
AzmatAli747758
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
PedroFerreira53928
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
PART A. Introduction to Costumer Service
PART A. Introduction to Costumer ServicePART A. Introduction to Costumer Service
PART A. Introduction to Costumer Service
PedroFerreira53928
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
kaushalkr1407
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
How to Break the cycle of negative Thoughts
How to Break the cycle of negative ThoughtsHow to Break the cycle of negative Thoughts
How to Break the cycle of negative Thoughts
Col Mukteshwar Prasad
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
Fundacja Rozwoju Społeczeństwa Przedsiębiorczego
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
Celine George
 

Recently uploaded (20)

Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
PART A. Introduction to Costumer Service
PART A. Introduction to Costumer ServicePART A. Introduction to Costumer Service
PART A. Introduction to Costumer Service
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
How to Break the cycle of negative Thoughts
How to Break the cycle of negative ThoughtsHow to Break the cycle of negative Thoughts
How to Break the cycle of negative Thoughts
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
 

Ijetcas14 347

  • 1. International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS) www.iasir.net IJETCAS 14-347; © 2014, IJETCAS All Rights Reserved Page 174 ISSN (Print): 2279-0047 ISSN (Online): 2279-0055 Improved BSP Clustering Algorithm for Social Network Analysis 1 T.B.Saranya Preetha, 2 G.T.Prabavathi 1 Research Scholar, 2 Assistant Professor in Computer science Department of Computer Science Gobi Arts and Science College Gobichettipalayam, Tamil Nadu 638453, India _________________________________________________________________________________________ Abstract: A social network is a social structure of individuals, who are linked (directly or indirectly to each other) depending on a common relation of interest, e.g. friendship, trust, etc. Social network analysis is the study of social networks to recognize the structure and behavior of social actor. Social network analysis has gained importance due to its usage in different applications - from product marketing to search engines and organizational dynamics. The conventional clustering approaches group objects based only on objects’ similarity which does not suite for social network data. Social network objects should be grouped into classes depending on their links as well as their attributes. In this paper, a clustering algorithm based on BSP (Business System Planning) clustering with Principal Component Analysis (PCA) technique is proposed. This algorithm produces significant improvement in clusters, as it groups objects in a social network into different classes based on their links and identify relation among classes. Keywords: Social Network Analysis, BSP Clustering, PCA __________________________________________________________________________________________ I. INTRODUCTION Social Networks are graph structures whose nodes or vertices represent people or other entities embedded in a social context, and whose edges represent interaction or collaboration between these entities [10]. Social networks are highly dynamic, evolving relationships among people or other entities. This dynamic property of social networks makes studying these graphs a challenging task. A lot of research has been done recently to study different properties of these networks. Such complex analysis of large, heterogeneous, multi-relational social networks has led to an interesting field of study known as Social Network Analysis (SNA). Social network analysis, which can be applied to analyze the structure and the property of personal relationship, web page links, and the spread of messages, is a research field in sociology. Recently social network analysis has attracted increasing attention in the data mining [1] research community. From the viewpoint of data mining, a social network is a heterogeneous and multi-relational dataset represented by graph [3].Research on social network analysis in the data mining community includes following areas: clustering analysis [1], classification [8], link prediction [7], Page Rank [9] and Hub-Authority [4] in web search engine. This paper deals with the social network cluster analysis. In the second section, a social network clustering based on Business System Planning (BSP) clustering algorithm is given. The algorithm can group objects in a social network into different classes based on their links, and it can also identify the relations among classes. In the third section, importance of Principle Component Analysis technique is dealt. Fourth section describes the methodology of BSP clustering. Section five shows how BSP clustering is improved by applying the PCA reduction technique. II. BSP Clustering in Social Networks There has been extensive research work on clustering in data mining. Traditional clustering algorithms [1] divide objects into classes based on their similarity. Objects in a class are similar to each other and are very dissimilar from objects in different classes. Social network clustering analysis, which is different from traditional clustering problem, divides objects into classes based on their links as well as their attributes. The biggest challenge of social network clustering analysis is how to divide objects into classes based on objects’ links, and to find algorithms that can meet this challenge. The BSP clustering algorithm [11] is proposed by IBM. It is designed to define information architecture for the firm in business system planning. This algorithm analyses business process and their data classes, cluster business process into subsystems, and define the relationship of these sub-systems. Basically BSP clustering algorithm uses objects (business processes) and links among objects (data classes) to make clustering analysis. Similarly social network also includes objects and links among these objects. In view of the same pre-condition, the BSP clustering algorithm can be used in social network clustering analysis. According to graph theory, social network is a directed graph composed by objects and their relationship. Figure 1 shows a sample of social network, the circle in the figure represents an object; the line with arrow is an edge
  • 2. T. Preetha et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(2), March-May, 2014, pp. 174- 178 IJETCAS 14-347; © 2014, IJETCAS All Rights Reserved Page 175 of the graph, and it represents directed link between two objects, so a social network is a directed graph. In figure 1, Let Oi be an object in social network (i = 1...m ), let Ej which means directed link between two objects, be a directed edge of the graph ( j = 1...n ).After definition of objects and directed edges, reachable relation between two objects is also defined. Figure 1: A Sample of Social Network There are two kinds of reachable relation among objects, shown as following: 1) One-step reachable relation: if there is directed link from Oi to Oj through one and only one directed edge, then Oi to Oj is a one-step reachable relation. For instance in Figure 1 there is a directed link from O1 to O2 through the directed edge E1, O1 to O2 is one-step reachable relation. 2) Multi-steps reachable relation: if there is directed link from O i to O j through two or more directed edges, then O i to O j is a multi-steps reachable relation. There is a directed link from O1 to O4 through directed edges E1 and E5, then O1 to O4 is a 2-steps reachable relation. After these relations, BSP analyses a social network by generating edge pointed matrix and calculates one-step and multi-step reachable matrix between objects. III. PRINCIPAL COMPONENT ANALYSIS Principal component analysis (PCA) [13] in multivariate statistics is widely adopted as an effective unsupervised dimension reduction method and is extended in many different directions. The main justification of dimension reduction is that PCA uses singular value decomposition (SVD) which gives the best low rank approximation to original data. However, this essentially noise reduction perspective alone is inadequate to explain the effectiveness of PCA. In this paper, we provide a new perspective of PCA based on its close relationship with the BSP clustering algorithm. We show that the principal components are actually relaxed cluster membership indicators. The central idea of Principal Component Analysis (PCA) is to reduce the dimensionality of the data set while retaining as much as possible the variation in the data set. Principal components (PC’s) are linear transformations of the original set of variables. PC’s are uncorrelated and ordered so that the first few PC’s contain most of the variations in the original data set [Jolliffe, 1986]. PCA is a classic tool for analyzing large scale multivariate data. It seeks linear combinations of the data variables (often called factors or principal components) that capture a maximum amount of variance. Numerically, PCA only amounts to computing a few leading eigenvectors of the data’s covariance matrix, so it can be applied to very large scale data sets. One of the key shortcomings of PCA however is that these factors are linear combinations of all variables; that is, all factor coefficients (or loadings) are non-zero. This means that while PCA facilitates model interpretation and visualization by concentrating the information in a few key factors, the factors themselves are still constructed using all observed variables. In many applications of PCA, the coordinate axes have a direct physical interpretation; in finance or biology for example, each axis might correspond to a specific financial asset or gene. In such cases, having only a few nonzero coefficients in the principal components would greatly improve the relevance and interpretability of the factors. In sparse PCA, there is a trade-off between the two goals of expressive power (explaining most of the variance or information in the data) and interpretability (making sure that the factors involve only a few coordinate axes or variables). When PCA is used as a clustering tool, sparse actors identify the clusters with the action of only a few variables. The original n data points in m-dimensional space are contained in the data matrix In general data is not centered around the origin. The centered data matrix , where and . The covarance matrix is given by The principal eigenvectors o are the principal directions of the data Y. The principal eigenvectors of the Gram matrix are the principal components; entries of each are the projected values of data points on the principal direction . and are related via: : where is the eigenvalue of the covarance matrix . Thus after applying PCA to the data, the principal components are identified from the data to be clustered. Thus only the important and the vital data are going to be clustered. In this proposed approach, the social network data is analyzed based on the BSP and PCA. O3 O1 E3 E2 E5E1 E7 O5 E6 O4 E10 O6 E9 E8 E4 O2
  • 3. T. Preetha et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(2), March-May, 2014, pp. 174- 178 IJETCAS 14-347; © 2014, IJETCAS All Rights Reserved Page 176 IV METHODOLOGY a) Generate edge creation matrix and edge pointed matrix First according to the objects and edges in the graph, define two matrixes Lc(m× n matrix) and Lp(m× n matrix). Lc means the creation of edges and Lp denote the pointed relations of edges. In the matrix, Lc (i, j) =1 denotes object Oi connects with the tail of edge Ej, which means that object Oi creates the directed edge Ej . Lc (i, j)=0 denotes object Oi has no connection with the tail of edge Ej, which means Ej isn’t created by object Oi. In the matrix, Lp(i, j) =1 denotes object Oi connects with the head of edge Ej, which means object Oi is pointed to by the directed edge Ej. Lp (i, j)=0 denotes Oi doesn’t connect with the head of edge Ej, which means Ej doesn’t point to Oi. After defining the matrix reachable matrices are calculated. b) Calculate one-step reachable matrix between objects After the definition of Lc and Lp, one-step reachable matrix between objects is calculated through the following equation. V is Boolean sum and ^is Boolean product. G(i, j)=1 means Oi to Oj is a one-step reachable relation, G(i, j) = 0 means there is no one-step reachable relation from Oi to Oj . All one-step reachable relation between objects is calculated. c) Calculate multi-steps reachable matrix between objects Besides one-step reachable relation, there are multi-steps reachable relations between objects. According to graph theory and the BSP clustering algorithm, multi-steps reachable matrix G2, G3, G4,…., Gm−1 is calculated. Following equations show the calculation of multi-steps reachable matrix: These matrices include 2-steps, 3-steps… m-1-steps reachable relations between objects. Now n-steps reachable relation between two objects through G2G3G4...Gm−1 is calculated. d) Calculate reachable matrix The algorithm considers whether reachable relations exist between two objects, but do not care these relations are one-step or multi-steps, so reachable matrix R based on G,G2 ,G3 ,G4 ,...,Gm−1 is calculated as R=IVGVG2...VGm−1 where V is Boolean sum and I is unit matrix. R(i, j) = 1 means reachable relation exists from Oi to Oj, but the reachable relations existing in matrix R is not mutual, for instance R(i, j) = 1 means reachable relation exists from Oi to Oj, but it doesn’t mean reachable relation exists from Oj to Oi. Mutual reachable relations between two objects are important in a social network, so mutual reachable matrix based on R is calculated. e) Mutual reachable matrix and cluster generation The mutual reachable matrix can be calculated through Q=R^RT where ^means Boolean product, RT means Reachable Transpose matrix. In the matrix Q(i, j) = 1 indicates there are mutual reachable relation between Oi and Oj . In a social network if two objects that have mutual reachable relation, they should belong to the same class, thus cluster based on Q is generated. Thus according to mutual reachable matrix Q, a social network is divided into classes based on strong sub matrices in Q or adjusted Q. If all elements in a sub-matrix of Q are 1, then that matrix is a strong sub matrix. f) Identify relationships among classes After clustering social network nodes, there is a need to identify relationship among clusters. This can be done through generated clusters and one-step reachable matrix G. If there is one-step reachable relation between two objects in different classes, directed links exist between classes. Through G all relations among classes is identifed. After performing these steps, a social network is divided into classes. Social network clustering analysis algorithm can be given as Q− > Ck means generating clusters through mutual reachable matrix Q, and (C k ,Q )- >Relation(Ck) means identifying relationships among clusters based on clusters and one-step reachable matrix G. V. PROPOSED ALGORITHM BSP clusters divide a social network into different classes according to objects in the social network and links between objects, and it also can identify relations among clusters. Main disadvantage of this algorithm is that it uses matrices to store edges and reachable relations. In a real social network these matrices will be very huge,
  • 4. T. Preetha et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(2), March-May, 2014, pp. 174- 178 IJETCAS 14-347; © 2014, IJETCAS All Rights Reserved Page 177 which can’t be loaded into main memory. But these matrices are normally sparse, so this feature is used in our work to reduce the size of the matrix. In this paper we propose a modification of existing BSP algorithm using Link list data structure. Using this data structure we have overcome the shortcomings of BSP. Following procedure is required for converting this sparse matrix in to link list: A matrix is a two-dimensional data object made of m rows and n columns, therefore having m, n values. When m=n, it is a square matrix. The most natural representation is to use two-dimensional array A[m][n] and access the element of ith row and jth column as A[i][j]. If large number of elements of the matrix are zero elements, then it is called a sparse matrix. Representing a sparse matrix by using a two-dimensional array leads to the wastage of a substantial amount of space. Therefore, an alternative representation must be used for sparse matrices. One such representation is to store only non- zero elements along with their row positions and column positions. That means representing every non-zero element by using triples(i,j,value), where i is a row position and j is a column position, and store these triples in a linear list. It is possible to arrange these triples in the increasing order of row indices, and for the same row index in the increasing order of column indices. Each triple can be represented using a node having four fields as shown: Struct snode{Int row,col,val; Struct snode *next;} The algorithm to reduce the dimensionality of the matrix is given below: Input Edge creation Lists Edge pointed List Begin Swap(Lu) For k=3 to m do Gk-1=Gk-2*G Edge pointed Lists Begin Swap(Lu) For k=3 to m do Gk-1=Gk-2*G R=IVGVG2……Gm-1 Swap(R) Qk ->C ( Ck,Q)->Relation(Ck) End End 1. In order to add two sparse matrices represented using the sorted linked lists, the lists are traversed until the end of one of the lists is reached. 2. In the process of traversal, the row indices stored in the nodes of these lists are compared. If they don't match, a new node is created and inserted into the resultant list by copying the contents of a node with a lower value of row index. The pointer in the list containing a node with a lower value of row index is advanced to make it point to the next node. 3. If the row indices match, column indices for the corresponding row positions are compared. If they don't match, a new node is created and inserted into the resultant list by copying the contents of a node with a lower value of column index. The pointer in the list containing a node with a lower value of column index is advanced to make it point to the next node. 4. If the column indices match, a new node is created and inserted into the resultant list by copying the row and column indices from any of the nodes and the value equal to the sum of the values in the two nodes. 5. After this, the pointers in both the lists are advanced to make them point to the next nodes in the respective lists. This process is repeated in each iteration. After reaching the end of any one of the lists, the iterations come to an end and the remaining nodes in the list whose end has not been reached are copied, as it is in the resultant list. We performed the above mentioned steps to reduce the dimensionality of the matrix representation, so that, the input to the BSP clustering is reduced in large number and improves the efficiency of the clusters.
  • 5. T. Preetha et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(2), March-May, 2014, pp. 174- 178 IJETCAS 14-347; © 2014, IJETCAS All Rights Reserved Page 178 VI. EXPERIMENTAL RESULTS The performance of the proposed social network using the PCA and BSP clustering is evaluated on the Zachary's Karate Club dataset. This data set is a social network of friendships links between 34 members of a karate club at a US university in the 1970. The values ‘1’ in the data set denotes that there is relationship between the two members in the data set. Alternatively, the values ‘0’ denotes that there is no relationship among the members in the dataset. The proposed PCA and BSP clustering is compared with the existing BSP clustering algorithm. From the dataset, 1156 links were extracted. But after applying PCA technique it was reduced to 700 links which was provided as data to the BSP clustering algorithm. The reachable matrix is obtained from the data set. G34 reachable matrix is computed. Then, clusters are obtained. Cluster 1 and cluster 2 is obtained. If, there is relation between the two clusters, then a relation is formed between them. The clustering accuracy of the data sets is obtained. Moreover, the time taken by the proposed approach for clustering the data set is also calculated and is displayed. The proposed PCA and BSP clustering is compared with the BSP clustering algorithm. The performance of the proposed system is evaluated based on the parameters like clustering accuracy and classification time. The accuracy in clusters between BSP and PCA with BSP is shown in Figure.1. It is observed from the graph that the classification accuracy of the proposed approach using PCA and BSP approach is high when compared to the BSP approach. Moreover the time taken by proposed approach using PCA and BSP approach is very less when compared to the existing BSP approach due to the reduction in matrix size. 74 76 78 80 82 84 86 88 BSP clustering PCA and BSP Clustering Clustering Approaches Accuracy(%) Accuracy Figure 1. Cluster Comparison The data set taken for this experiment is small in size. Our future work is to check whether the same efficiency and time reduction exists for a large data set and to combine other reduction techniques with BSP. VII CONCLUSION This paper proposes an improved social network clustering technique based on PCA and BSP clustering algorithm. The main idea of this approach is introducing the PCA approach before applying BSP clustering. The main advantage of the PCA technique is that the main and the principal components are identified. Thus after the PCA technique is applied, the important components are given to the BSP clustering. The performance of the proposed approach is evaluated using the Zachary’s karate club dataset. The parameters used for the evaluation are clustering accuracy and the time taken for clustering. From the experimental observation, it is clearly observed that the proposed BSP clustering with PCA shows significant performance in terms of accuracy and classification time. Thus the proposed approach is very significant in providing better social network analysis. REFERENCES [1] Gao X, Wu S, Yu B. “Management Information System. Beijing: Economy and Management Press (in Chinese)”, 2002. [2] Han J, Kamber M. “Data Mining: Concepts and Techniques 2nd edition San Francisco: The Morgan Kaufmann Publishers, 2006. [3] W. W. Zachary, “An information flow model for conflict and fission in small groups”, Journal of Anthropological Research 33, 452-473 (1977). [4] Krebs V. “Mapping networks of terrorist cells”, Connections, Vol. 24, pages 43-52, 2002. [5] Kubica J, Moore A and Schneider J. “Tractable Group Detection on Large Link Data Sets”, Proceeding 3rd IEEE international conference on data mining, Melbourne, FL, pages 573-576, 2003. [6] Liben-Nowell D and Kleinberg J. “The Link prediction problem for social networks”, Proceeding 2003 international conference on information and knowledge management, New Orleans, LA, pages 556-559, 2003. [7] Lu Q and Getoor L. “Link-based classification” Proceeding 2003 international conference on machine learning, Washington DC, pages 496-503, 2003. [8] Page L, Brin S, Motwani R and Winograd T. “The PageRank citation ranking: Bring order to the web. Technical report, Stanford University”, 1998. [9] The Link Prediction Problem for Social Networks (2003) David Liben-Nowell, Jon Kleinberg, 556 – 559. [10] Communications of the IIMA (2007) Volume 7 Issue 4 “Social Network Analysis Based on BSP Clustering Algorithm” Gong Yu School of Business Administration China University of Petroleum. [11] Algorithms by thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, Prentice-Hall India (1999). [12] Yeung, K. Y. and Ruzzo, W. L., “An empirical study of Principal Component Analysis for clustering gene expression data”, 2001. [13] Jiangtao Qiu, Zhangxi Lin, Changjie Tang and Shaojie Qiao, “Discovering Organizational Structure in Dynamic Social Network”, Ninth IEEE International Conference on Data Mining, 2009.