SlideShare a Scribd company logo
Network Measures
 Molecules are represented as nodes and some similarity or dissimilarity measure is used to define the
connections between them.
 Degree k: The number of connections (links) between a node and other nodes.
 Degree distribution P(k): The probability that a selected node has exactly k links.
― count the number of nodes with k = 1,2... links and divide by the total number of nodes.
― allows us to distinguish between different classes of networks.
 In random (Erdӧs-Renyi) networks the node degrees follow a Poisson distribution; most nodes have
approximately the same number of links (close to the average degree).
 The tail of the degree distribution decreases exponentially P(k) ~ exp(-k); nodes that significantly deviate
from the average are extremely rare.
 For scale-free networks, the probability that a node has k links follows a power-law P(k) ~ k-. Such
networks are often characterized by a small number of highly connected nodes (hubs). A scale-free
networks’ degree distribution is a straight line on a log–log plot.
 Clustering coefficient: CI = 2nI/k(k-1), where nI is the number of links connecting the k neighbors of
node I to each other.
― CI is the number of triangles that go through node I divided by the total number of possible triangles
that can pass through node I.
― C(k) is the average clustering coefficient of all nodes with k links. This property measures the
cliquishness of the network.
 The average path length or diameter is the shortest path connecting a pair of nodes averaged over all pairs
of nodes in the network.
 For Erdӧs-Renyi random networks, the average path length is proportional to the logarithm of the network
size: this is the small-world property.
 Scale-free networks are ultra-small; the average path length follows ℓ ~ log (log N)
 Existence of the small-world property in a network can be characterized by the following metrics:
 C(G) ≫ CE(G), where C(G) = Average Clustering coefficient of a network G and CE(G) = Average
Clustering coefficient of the Erdӧs-Renyi random network constructed from the same nodes at equivalent
edge density,
 Average Path Length L(G) < LE(G), where LE(G)= Average Path Length of the corresponding Erdӧs-Renyi
random network constructed from the same nodes at equivalent edge density,
 Average Path Length L(G) ∝ Log N (not relevant for this study).
 Degree assortativity is the correlation coefficient between the degrees
of connected nodes:
 where Aij is an element of the adjacency matrix of the network, ki and kj are the degrees of node i and j,
respectively; n and m are the order (total number of nodes) and the size (total number of edges) in the
network.
 Degree assortativity provides information about the nodes of high degree connecting nodes of high
degree and nodes of low degree connecting nodes of low degree; negative values of degree assortativity
represent nodes of high degree connecting to low degree nodes in a dissortative network.
 Modularity reflects the community structures of a network:
 Modularity ranges between -1 and 1. A network with more community structure reflects high modularity.
Disintegration of the small world property with
increasing diversity of chemical libraries
Ganesh Prabhu1, Sudeepto Bhattacharya2,3, Michael Krein4, and N. Sukumar1,2
1Department of Chemistry, Shiv Nadar University, Dadri, Gautam Buddha Nagar, UP, India
2Center for Informatics, Shiv Nadar University, Dadri, Gautam Buddha Nagar, UP, India
3Department of Mathematics, Shiv Nadar University, Dadri, Gautam Buddha Nagar, UP, India
4Rensselaer Exploratory Center for Cheminformatics Research, Rensselaer Polytechnic Institute, Troy, NY 12180
Conclusions:
Abstract:
In recent years factors such as solubility, permeability, polymorphism, cytotoxicity, mutation and drug resistance have forced chemists to broaden the spectrum
of new chemical entities for drug design. Diversity-oriented synthesis (DOS) has emerged as a popular strategy to synthesize molecular libraries possessing
structural complexity as well as skeletal and stereochemical diversity, targeting various dimensions of chemical and biological space. Topological properties of
chemical library graphs, such as the average clustering co-efficient, average path length and existence of hubs, are useful in identifying the presence or absence
of the small world property. The focus of the present study was to analyze the existence and disintegration of small world behaviour in graphs created on the
basis of similarity and dissimilarity scales. We generated similarity and dissimilarity graphs from diverse chemical libraries at various Tanimoto similarity
coefficients (tc) using FP2 and MACCS fingerprints. In comparison to Erdos-Reyni random graph, the dissimilarity graphs exhibited low average clustering co-
efficient and low average path length. The fitting of cumulative distribution function verses the degree of the dissimilarity graphs have demonstrated
exponential, log-normal and also possible power-law distribution statics validated using Kolmogorov-Smirnov’s (KS) goodness of fit. The similarity graphs at high
tc threshold show high clustering co-efficient and similar average path length as those of Erdӧs-Reyni graphs, without any sign of hubs. The dissimilarity graphs
at very low tc threshold displayed loss of small world character, as evidenced by low average clustering co-efficient and low average path length in comparison to
Erdӧs-Reyni random graphs. Graph theory and statistical metrics thus represent a simple and efficient approach to analyze the diversity in a chemical library.
Motivation
 The objective of the present work is to
determine whether different chemical
libraries exhibit different signatures in
terms of their network topologies.
 Another goal of this work is to better
understand the impact of library
diversity: as to whether or not such
small-world and scale-free properties
are common to certain classes of
chemical libraries.
 Similarity networks constructed on
large compound collections using
different sets of descriptors have
revealed some common features such
as the small-world property and scale-
free degree distributions:
Michael P. Krein, N. Sukumar, Exploration of
the topology of chemical spaces with network
measures. J. Phys. Chem. A 115(45),
2011,12905–12918.
R. W. Benz, J. Swamidass, P. Baldi, Discovery
of Power-Laws in Chemical Space. J. Chem.
Inf. Model. 2008, 48, 1138-1151.
Network properties of DOS libraries (N=118, 32, 41) and Focussed library (FL, N=41)
at various dissimilarity and similarity thresholds using MACCS fingerprints
Degree assortativity and Modularity of various threshold networks from DOS libraries (N=118, 41, 32) and
Focussed library (FL, N=41) generated using FP2 and MACCS fingerprints and their corresponding ERN(Erdӧs-Renyi
random network) constructed from the same nodes at equivalent edge density.
Datasets & Computational Methodology
• FP2 and MACCS fingerprints were
computed for DOS (Diversity-Oriented
Synthesis), NPDB (ZINC Natural
Products Database) and focused
(antimalarial) chemical libraries.
Mamidala, R., et al., Pyrrolidine and piperidine
based chiral spiro and fused scaffolds via
build/couple/pair approach. RSC Advances,
2014. 4(21): p. 10619.
Irwin, J.J., et al., ZINC: a free tool to discover
chemistry for biology. J Chem Inf Model,
2012. 52(7): p. 1757-68.
Cruz-Monteagudo, M., et al., Computational
modeling tools for the design of potent
antimalarial bisbenzamidines: overcoming the
antimalarial potential of pentamidine. Bioorg
Med Chem, 2007. 15(15): p. 5322-39.
• Edgelists based on Tanimotosimilarity
coefficients (tc) of the fingerprints
were generated and transformed to
networks in R-studio using the ‘igraph’
package.
• Similarity and dissimilarity networks
were generated at various thresholds.
 Dissimilarity networks from DOS and focussed libraries using FP2 and
MACCS fingerprints demonstrate disintegration of small world
behaviour.
 Hubs were detected in both similarity and dissimilarity networks.
 Hubs in dissimilarity networks show dissortative behaviour, whereas
hubs in similarity networks show assortative behaviour.
Assortative and dissortative behaviour in DOS libraries
Combined graph DOS118_M_tc ≤ 0.2 ≥ 0.9 generated from MACCS fingerprints showing dissortative
(red edges) and assortative hubs (blue edges)
 The diversity of a chemical library assessed by
combining similarity and dissimilarity threshold graphs.
 High average clustering coefficient, assortativity and
high modularity of the network are hallmarks of a high
similarity chemical library.
 Low average clustering coefficient, dissortativity and low
modularity of the network are the signature of a highly
diverse chemical library.
 The combined graph here exhibits C(G) >> CE(G) and
L(G) = LE(G)
Dissimilarity Network-MACCS C(G) L(G) C(G) > CE(G) C(G) ≫ CE(G) L(G) < LE(G) Avg. degree No. of edges D(G)
DOS118_M_tc < 0.2 0 1.81
N N Y
2 28 0.004
ERN(118, 0.0045) 0 2.11 0.52 31 0.0045
DOS41_M_tc < 0.2 0 1.66
N N N
1.66 5 0.0061
ERN(41, 0.0061) 0 1.4 0.3 6 0.0097
DOS32_M_tc < 0.2 0 1.86
N N N
1.66 5 0.01
ERN(32, 0.01) 0 1.25 0.19 3 0.006
FL41_M_tc < 0.3 0 1.92
N N Y
3.64 31 0.038
ERN(41, 0.038) 0 2.63 1.27 26 0.032
Similarity Networks-MACCS
DOS118_M_tc > 0.9 0.67 1.1
Y Y Y
1.6 28 0.004
ERN(118, 4-E03) 0 2.11 0.52 31 0.0045
DOS41_M_tc > 0.8 0.5 1.37
Y Y Y
1.21 14 0.02
ERN(41, 0.02) 0 3.3 0.97 20 0.024
DOS32_M_tc > 0.8 0.75 1.16
Y Y Y
1.54 10 0.02
ERN(32, 0.02) 0 1.38 0.56 9 0.018
FL41_M_tc > 0.98 1 1
Y Y Y
4.9 22 0.026
ERN(41, 0.026) 0 1.77 0.78 16 0.019
Dissimilarity Networks-FP2
Assortativity
degree
( r )
Modularity
( Q and QE )
D(G)
Dissimilarity Network-
MACCS
Assortativity
degree
( r )
Modularity
( Q and QE )
D(G)
DOS118_tc < 0.22 -0.158 0.64 0.0016 tc < 0.2 -0.53 0.47 4.05 E-03
ERN(118, 0.0016) -0.166 0.81 0.001 ERN(118, 4.5-E03) 0.2 0.93 4.5 E-03
DOS41_tc < 0.37 -0.53 0.15 0.017 tc < 0.2 -1 0 6.1-E03
ERN(41, 0.017) -0.41 0.7 0.018 ERN(41, 0.0061) -0.61 0.61 9.7-E03
DOS32_tc < 0.4 -0.58 0.18 0.11 tc < 0.2 -0.74 0.22 0.01
ERN(32, 0.11) -0.23 0.4 0.1 ERN(32, 0.01) -0.5 0.44 0.006
FL41_tc < 0.5 -0.668 0.062 0.013 tc < 0.3 -0.72 0.087 0.038
ERN(41, 0.013) -0.095 0.58 0.017 ERN(41, 0.038) -0.03 0.66 0.032
Similarity Networks-FP2 Similarity Networks-MACCS
DOS118_tc > 0.95 0.70 0.95 0.004 tc > 0.9 0.8 0.85 4.0-E03
ERN(118, 0.004) 0.2 0.88 0.003 ERN(118, 4-E03) 0.2 0.88 4.5-E03
DOS41_tc > 0.95 1 0.73 0.012 tc > 0.8 0.69 0.82 0.02
ERN(41, 0.012) 0.3 0.73 0.015 ERN(41, 0.02) 0.14 0.74 0.024
DOS32_tc > 0.8 0.59 0.77 0.06 tc > 0.8 0.35 0.72 0.02
ERN(32, 0.06) -0.09 0.46 0.07 ERN(32, 0.02) -0.53 0.64 0.018
FL41_tc > 0.995 1 0.28 0.022 tc > 0.98 1 0.2 0.026
ERN(41, 0.022) 0.144 0.74 0.04 ERN(41, 0.026) -0.19 0.73 0.019
Fitting cumulative distribution
functions of the dissimilarity
networks versus degree
Similarity and dissimilarity networks
Fruchterman Reingold layout (force field directed layout) used to visualize the undirected
threshold networks. Red and blue edges represent dissimilarity and similarity edges.
Networks with dissimilarity edges (a-b) show the absence of cliquishness, loss of small-
world character, presence of hubs (star subnetworks) with limited communities and links of
high degree nodes to low degree nodes reflecting dissortativity. Networks with similarity
edges (c-d) show the presence of cliques reflecting small-worldness with high modularity.
a
b
c
d

More Related Content

What's hot

SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
csandit
 
Dimensionality Reduction Techniques for Document Clustering- A Survey
Dimensionality Reduction Techniques for Document Clustering- A SurveyDimensionality Reduction Techniques for Document Clustering- A Survey
Dimensionality Reduction Techniques for Document Clustering- A Survey
IJTET Journal
 
Important spreaders in networks: Exact results for small graphs
Important spreaders in networks: Exact results for small graphsImportant spreaders in networks: Exact results for small graphs
Important spreaders in networks: Exact results for small graphs
Petter Holme
 
A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms
A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms
A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms
ijcseit
 
A paradox of importance in network epidemiology
A paradox of importance in network epidemiologyA paradox of importance in network epidemiology
A paradox of importance in network epidemiology
Petter Holme
 
2018 algorithms for the minmax regret path problem with interval data
2018   algorithms for the minmax regret path problem with interval data2018   algorithms for the minmax regret path problem with interval data
2018 algorithms for the minmax regret path problem with interval data
Francisco Pérez
 
Spreading processes on temporal networks
Spreading processes on temporal networksSpreading processes on temporal networks
Spreading processes on temporal networks
Petter Holme
 
Exploiting rhetorical relations to
Exploiting rhetorical relations toExploiting rhetorical relations to
Exploiting rhetorical relations to
IJNSA Journal
 
2224d_final
2224d_final2224d_final
2224d_final
Seth Hildick-Smith
 
Identifying the Coding and Non Coding Regions of DNA Using Spectral Analysis
Identifying the Coding and Non Coding Regions of DNA Using Spectral AnalysisIdentifying the Coding and Non Coding Regions of DNA Using Spectral Analysis
Identifying the Coding and Non Coding Regions of DNA Using Spectral Analysis
IJMER
 
A SURVEY ON SIMILARITY MEASURES IN TEXT MINING
A SURVEY ON SIMILARITY MEASURES IN TEXT MINING A SURVEY ON SIMILARITY MEASURES IN TEXT MINING
A SURVEY ON SIMILARITY MEASURES IN TEXT MINING
mlaij
 
Graphical Structure Learning accelerated with POWER9
Graphical Structure Learning accelerated with POWER9Graphical Structure Learning accelerated with POWER9
Graphical Structure Learning accelerated with POWER9
Ganesan Narayanasamy
 
International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)
CSCJournals
 
A Novel Framework for Short Tandem Repeats (STRs) Using Parallel String Matching
A Novel Framework for Short Tandem Repeats (STRs) Using Parallel String MatchingA Novel Framework for Short Tandem Repeats (STRs) Using Parallel String Matching
A Novel Framework for Short Tandem Repeats (STRs) Using Parallel String Matching
IJERA Editor
 
Temporal Networks of Human Interaction
Temporal Networks of Human InteractionTemporal Networks of Human Interaction
Temporal Networks of Human Interaction
Petter Holme
 
Improving the data recovery for short length LT codes
Improving the data recovery for short length LT codes Improving the data recovery for short length LT codes
Improving the data recovery for short length LT codes
IJECEIAES
 
JMM_Poster_2015
JMM_Poster_2015JMM_Poster_2015
JMM_Poster_2015
Jennifer Kiser
 

What's hot (17)

SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
 
Dimensionality Reduction Techniques for Document Clustering- A Survey
Dimensionality Reduction Techniques for Document Clustering- A SurveyDimensionality Reduction Techniques for Document Clustering- A Survey
Dimensionality Reduction Techniques for Document Clustering- A Survey
 
Important spreaders in networks: Exact results for small graphs
Important spreaders in networks: Exact results for small graphsImportant spreaders in networks: Exact results for small graphs
Important spreaders in networks: Exact results for small graphs
 
A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms
A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms
A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms
 
A paradox of importance in network epidemiology
A paradox of importance in network epidemiologyA paradox of importance in network epidemiology
A paradox of importance in network epidemiology
 
2018 algorithms for the minmax regret path problem with interval data
2018   algorithms for the minmax regret path problem with interval data2018   algorithms for the minmax regret path problem with interval data
2018 algorithms for the minmax regret path problem with interval data
 
Spreading processes on temporal networks
Spreading processes on temporal networksSpreading processes on temporal networks
Spreading processes on temporal networks
 
Exploiting rhetorical relations to
Exploiting rhetorical relations toExploiting rhetorical relations to
Exploiting rhetorical relations to
 
2224d_final
2224d_final2224d_final
2224d_final
 
Identifying the Coding and Non Coding Regions of DNA Using Spectral Analysis
Identifying the Coding and Non Coding Regions of DNA Using Spectral AnalysisIdentifying the Coding and Non Coding Regions of DNA Using Spectral Analysis
Identifying the Coding and Non Coding Regions of DNA Using Spectral Analysis
 
A SURVEY ON SIMILARITY MEASURES IN TEXT MINING
A SURVEY ON SIMILARITY MEASURES IN TEXT MINING A SURVEY ON SIMILARITY MEASURES IN TEXT MINING
A SURVEY ON SIMILARITY MEASURES IN TEXT MINING
 
Graphical Structure Learning accelerated with POWER9
Graphical Structure Learning accelerated with POWER9Graphical Structure Learning accelerated with POWER9
Graphical Structure Learning accelerated with POWER9
 
International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)
 
A Novel Framework for Short Tandem Repeats (STRs) Using Parallel String Matching
A Novel Framework for Short Tandem Repeats (STRs) Using Parallel String MatchingA Novel Framework for Short Tandem Repeats (STRs) Using Parallel String Matching
A Novel Framework for Short Tandem Repeats (STRs) Using Parallel String Matching
 
Temporal Networks of Human Interaction
Temporal Networks of Human InteractionTemporal Networks of Human Interaction
Temporal Networks of Human Interaction
 
Improving the data recovery for short length LT codes
Improving the data recovery for short length LT codes Improving the data recovery for short length LT codes
Improving the data recovery for short length LT codes
 
JMM_Poster_2015
JMM_Poster_2015JMM_Poster_2015
JMM_Poster_2015
 

Similar to Disintegration of the small world property with increasing diversity of chemical libraries

Interpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approachInterpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approach
Elena Sügis
 
Application of graph theory in drug design
Application of graph theory in drug designApplication of graph theory in drug design
Application of graph theory in drug design
Reihaneh Safavi
 
9517cnc01
9517cnc019517cnc01
9517cnc01
IJCNCJournal
 
Socialnetworkanalysis (Tin180 Com)
Socialnetworkanalysis (Tin180 Com)Socialnetworkanalysis (Tin180 Com)
Socialnetworkanalysis (Tin180 Com)
Tin180 VietNam
 
An Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents Approach
An Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents ApproachAn Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents Approach
An Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents Approach
jake henry
 
An Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents Approach
An Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents ApproachAn Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents Approach
An Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents Approach
ijasuc
 
AN OPPORTUNISTIC AODV ROUTING SCHEME: A COGNITIVE MOBILE AGENTS APPROACH
AN OPPORTUNISTIC AODV ROUTING SCHEME: A COGNITIVE MOBILE AGENTS APPROACHAN OPPORTUNISTIC AODV ROUTING SCHEME: A COGNITIVE MOBILE AGENTS APPROACH
AN OPPORTUNISTIC AODV ROUTING SCHEME: A COGNITIVE MOBILE AGENTS APPROACH
ijasuc
 
Using spectral radius ratio for node degree
Using spectral radius ratio for node degreeUsing spectral radius ratio for node degree
Using spectral radius ratio for node degree
IJCNCJournal
 
Quantum persistent k cores for community detection
Quantum persistent k cores for community detectionQuantum persistent k cores for community detection
Quantum persistent k cores for community detection
Colleen Farrelly
 
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphs
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphsUse of eigenvalues and eigenvectors to analyze bipartivity of network graphs
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphs
csandit
 
Topology ppt
Topology pptTopology ppt
Topology ppt
Daksh Bapna
 
Paper Explained: Understanding the wiring evolution in differentiable neural ...
Paper Explained: Understanding the wiring evolution in differentiable neural ...Paper Explained: Understanding the wiring evolution in differentiable neural ...
Paper Explained: Understanding the wiring evolution in differentiable neural ...
Devansh16
 
An Efficient Algorithm to Calculate The Connectivity of Hyper-Rings Distribut...
An Efficient Algorithm to Calculate The Connectivity of Hyper-Rings Distribut...An Efficient Algorithm to Calculate The Connectivity of Hyper-Rings Distribut...
An Efficient Algorithm to Calculate The Connectivity of Hyper-Rings Distribut...
ijitcs
 
Topology ppt
Topology pptTopology ppt
Topology ppt
karan saini
 
Topology ppt
Topology pptTopology ppt
Topology ppt
boocse11
 
TopologyPPT.ppt
TopologyPPT.pptTopologyPPT.ppt
TopologyPPT.ppt
ssuser933685
 
Medicinal Applications of Quantum Computing
Medicinal Applications of Quantum ComputingMedicinal Applications of Quantum Computing
Medicinal Applications of Quantum Computing
bGeniusLLC
 
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKSEVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
ijcsit
 
Simplicial closure & higher-order link prediction
Simplicial closure & higher-order link predictionSimplicial closure & higher-order link prediction
Simplicial closure & higher-order link prediction
Austin Benson
 
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
cscpconf
 

Similar to Disintegration of the small world property with increasing diversity of chemical libraries (20)

Interpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approachInterpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approach
 
Application of graph theory in drug design
Application of graph theory in drug designApplication of graph theory in drug design
Application of graph theory in drug design
 
9517cnc01
9517cnc019517cnc01
9517cnc01
 
Socialnetworkanalysis (Tin180 Com)
Socialnetworkanalysis (Tin180 Com)Socialnetworkanalysis (Tin180 Com)
Socialnetworkanalysis (Tin180 Com)
 
An Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents Approach
An Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents ApproachAn Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents Approach
An Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents Approach
 
An Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents Approach
An Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents ApproachAn Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents Approach
An Opportunistic AODV Routing Scheme : A Cognitive Mobile Agents Approach
 
AN OPPORTUNISTIC AODV ROUTING SCHEME: A COGNITIVE MOBILE AGENTS APPROACH
AN OPPORTUNISTIC AODV ROUTING SCHEME: A COGNITIVE MOBILE AGENTS APPROACHAN OPPORTUNISTIC AODV ROUTING SCHEME: A COGNITIVE MOBILE AGENTS APPROACH
AN OPPORTUNISTIC AODV ROUTING SCHEME: A COGNITIVE MOBILE AGENTS APPROACH
 
Using spectral radius ratio for node degree
Using spectral radius ratio for node degreeUsing spectral radius ratio for node degree
Using spectral radius ratio for node degree
 
Quantum persistent k cores for community detection
Quantum persistent k cores for community detectionQuantum persistent k cores for community detection
Quantum persistent k cores for community detection
 
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphs
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphsUse of eigenvalues and eigenvectors to analyze bipartivity of network graphs
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphs
 
Topology ppt
Topology pptTopology ppt
Topology ppt
 
Paper Explained: Understanding the wiring evolution in differentiable neural ...
Paper Explained: Understanding the wiring evolution in differentiable neural ...Paper Explained: Understanding the wiring evolution in differentiable neural ...
Paper Explained: Understanding the wiring evolution in differentiable neural ...
 
An Efficient Algorithm to Calculate The Connectivity of Hyper-Rings Distribut...
An Efficient Algorithm to Calculate The Connectivity of Hyper-Rings Distribut...An Efficient Algorithm to Calculate The Connectivity of Hyper-Rings Distribut...
An Efficient Algorithm to Calculate The Connectivity of Hyper-Rings Distribut...
 
Topology ppt
Topology pptTopology ppt
Topology ppt
 
Topology ppt
Topology pptTopology ppt
Topology ppt
 
TopologyPPT.ppt
TopologyPPT.pptTopologyPPT.ppt
TopologyPPT.ppt
 
Medicinal Applications of Quantum Computing
Medicinal Applications of Quantum ComputingMedicinal Applications of Quantum Computing
Medicinal Applications of Quantum Computing
 
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKSEVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
 
Simplicial closure & higher-order link prediction
Simplicial closure & higher-order link predictionSimplicial closure & higher-order link prediction
Simplicial closure & higher-order link prediction
 
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
 

Recently uploaded

Cytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptxCytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptx
Hitesh Sikarwar
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
HongcNguyn6
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
KrushnaDarade1
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
muralinath2
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Leonel Morgado
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
University of Maribor
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
Sérgio Sacani
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
MAGOTI ERNEST
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
Leonel Morgado
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
Abdul Wali Khan University Mardan,kP,Pakistan
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Texas Alliance of Groundwater Districts
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
IshaGoswami9
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
Sharon Liu
 
Basics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different formsBasics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different forms
MaheshaNanjegowda
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
David Osipyan
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
Daniel Tubbenhauer
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
Anagha Prasad
 

Recently uploaded (20)

Cytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptxCytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptx
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
 
Basics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different formsBasics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different forms
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
 

Disintegration of the small world property with increasing diversity of chemical libraries

  • 1. Network Measures  Molecules are represented as nodes and some similarity or dissimilarity measure is used to define the connections between them.  Degree k: The number of connections (links) between a node and other nodes.  Degree distribution P(k): The probability that a selected node has exactly k links. ― count the number of nodes with k = 1,2... links and divide by the total number of nodes. ― allows us to distinguish between different classes of networks.  In random (Erdӧs-Renyi) networks the node degrees follow a Poisson distribution; most nodes have approximately the same number of links (close to the average degree).  The tail of the degree distribution decreases exponentially P(k) ~ exp(-k); nodes that significantly deviate from the average are extremely rare.  For scale-free networks, the probability that a node has k links follows a power-law P(k) ~ k-. Such networks are often characterized by a small number of highly connected nodes (hubs). A scale-free networks’ degree distribution is a straight line on a log–log plot.  Clustering coefficient: CI = 2nI/k(k-1), where nI is the number of links connecting the k neighbors of node I to each other. ― CI is the number of triangles that go through node I divided by the total number of possible triangles that can pass through node I. ― C(k) is the average clustering coefficient of all nodes with k links. This property measures the cliquishness of the network.  The average path length or diameter is the shortest path connecting a pair of nodes averaged over all pairs of nodes in the network.  For Erdӧs-Renyi random networks, the average path length is proportional to the logarithm of the network size: this is the small-world property.  Scale-free networks are ultra-small; the average path length follows ℓ ~ log (log N)  Existence of the small-world property in a network can be characterized by the following metrics:  C(G) ≫ CE(G), where C(G) = Average Clustering coefficient of a network G and CE(G) = Average Clustering coefficient of the Erdӧs-Renyi random network constructed from the same nodes at equivalent edge density,  Average Path Length L(G) < LE(G), where LE(G)= Average Path Length of the corresponding Erdӧs-Renyi random network constructed from the same nodes at equivalent edge density,  Average Path Length L(G) ∝ Log N (not relevant for this study).  Degree assortativity is the correlation coefficient between the degrees of connected nodes:  where Aij is an element of the adjacency matrix of the network, ki and kj are the degrees of node i and j, respectively; n and m are the order (total number of nodes) and the size (total number of edges) in the network.  Degree assortativity provides information about the nodes of high degree connecting nodes of high degree and nodes of low degree connecting nodes of low degree; negative values of degree assortativity represent nodes of high degree connecting to low degree nodes in a dissortative network.  Modularity reflects the community structures of a network:  Modularity ranges between -1 and 1. A network with more community structure reflects high modularity. Disintegration of the small world property with increasing diversity of chemical libraries Ganesh Prabhu1, Sudeepto Bhattacharya2,3, Michael Krein4, and N. Sukumar1,2 1Department of Chemistry, Shiv Nadar University, Dadri, Gautam Buddha Nagar, UP, India 2Center for Informatics, Shiv Nadar University, Dadri, Gautam Buddha Nagar, UP, India 3Department of Mathematics, Shiv Nadar University, Dadri, Gautam Buddha Nagar, UP, India 4Rensselaer Exploratory Center for Cheminformatics Research, Rensselaer Polytechnic Institute, Troy, NY 12180 Conclusions: Abstract: In recent years factors such as solubility, permeability, polymorphism, cytotoxicity, mutation and drug resistance have forced chemists to broaden the spectrum of new chemical entities for drug design. Diversity-oriented synthesis (DOS) has emerged as a popular strategy to synthesize molecular libraries possessing structural complexity as well as skeletal and stereochemical diversity, targeting various dimensions of chemical and biological space. Topological properties of chemical library graphs, such as the average clustering co-efficient, average path length and existence of hubs, are useful in identifying the presence or absence of the small world property. The focus of the present study was to analyze the existence and disintegration of small world behaviour in graphs created on the basis of similarity and dissimilarity scales. We generated similarity and dissimilarity graphs from diverse chemical libraries at various Tanimoto similarity coefficients (tc) using FP2 and MACCS fingerprints. In comparison to Erdos-Reyni random graph, the dissimilarity graphs exhibited low average clustering co- efficient and low average path length. The fitting of cumulative distribution function verses the degree of the dissimilarity graphs have demonstrated exponential, log-normal and also possible power-law distribution statics validated using Kolmogorov-Smirnov’s (KS) goodness of fit. The similarity graphs at high tc threshold show high clustering co-efficient and similar average path length as those of Erdӧs-Reyni graphs, without any sign of hubs. The dissimilarity graphs at very low tc threshold displayed loss of small world character, as evidenced by low average clustering co-efficient and low average path length in comparison to Erdӧs-Reyni random graphs. Graph theory and statistical metrics thus represent a simple and efficient approach to analyze the diversity in a chemical library. Motivation  The objective of the present work is to determine whether different chemical libraries exhibit different signatures in terms of their network topologies.  Another goal of this work is to better understand the impact of library diversity: as to whether or not such small-world and scale-free properties are common to certain classes of chemical libraries.  Similarity networks constructed on large compound collections using different sets of descriptors have revealed some common features such as the small-world property and scale- free degree distributions: Michael P. Krein, N. Sukumar, Exploration of the topology of chemical spaces with network measures. J. Phys. Chem. A 115(45), 2011,12905–12918. R. W. Benz, J. Swamidass, P. Baldi, Discovery of Power-Laws in Chemical Space. J. Chem. Inf. Model. 2008, 48, 1138-1151. Network properties of DOS libraries (N=118, 32, 41) and Focussed library (FL, N=41) at various dissimilarity and similarity thresholds using MACCS fingerprints Degree assortativity and Modularity of various threshold networks from DOS libraries (N=118, 41, 32) and Focussed library (FL, N=41) generated using FP2 and MACCS fingerprints and their corresponding ERN(Erdӧs-Renyi random network) constructed from the same nodes at equivalent edge density. Datasets & Computational Methodology • FP2 and MACCS fingerprints were computed for DOS (Diversity-Oriented Synthesis), NPDB (ZINC Natural Products Database) and focused (antimalarial) chemical libraries. Mamidala, R., et al., Pyrrolidine and piperidine based chiral spiro and fused scaffolds via build/couple/pair approach. RSC Advances, 2014. 4(21): p. 10619. Irwin, J.J., et al., ZINC: a free tool to discover chemistry for biology. J Chem Inf Model, 2012. 52(7): p. 1757-68. Cruz-Monteagudo, M., et al., Computational modeling tools for the design of potent antimalarial bisbenzamidines: overcoming the antimalarial potential of pentamidine. Bioorg Med Chem, 2007. 15(15): p. 5322-39. • Edgelists based on Tanimotosimilarity coefficients (tc) of the fingerprints were generated and transformed to networks in R-studio using the ‘igraph’ package. • Similarity and dissimilarity networks were generated at various thresholds.  Dissimilarity networks from DOS and focussed libraries using FP2 and MACCS fingerprints demonstrate disintegration of small world behaviour.  Hubs were detected in both similarity and dissimilarity networks.  Hubs in dissimilarity networks show dissortative behaviour, whereas hubs in similarity networks show assortative behaviour. Assortative and dissortative behaviour in DOS libraries Combined graph DOS118_M_tc ≤ 0.2 ≥ 0.9 generated from MACCS fingerprints showing dissortative (red edges) and assortative hubs (blue edges)  The diversity of a chemical library assessed by combining similarity and dissimilarity threshold graphs.  High average clustering coefficient, assortativity and high modularity of the network are hallmarks of a high similarity chemical library.  Low average clustering coefficient, dissortativity and low modularity of the network are the signature of a highly diverse chemical library.  The combined graph here exhibits C(G) >> CE(G) and L(G) = LE(G) Dissimilarity Network-MACCS C(G) L(G) C(G) > CE(G) C(G) ≫ CE(G) L(G) < LE(G) Avg. degree No. of edges D(G) DOS118_M_tc < 0.2 0 1.81 N N Y 2 28 0.004 ERN(118, 0.0045) 0 2.11 0.52 31 0.0045 DOS41_M_tc < 0.2 0 1.66 N N N 1.66 5 0.0061 ERN(41, 0.0061) 0 1.4 0.3 6 0.0097 DOS32_M_tc < 0.2 0 1.86 N N N 1.66 5 0.01 ERN(32, 0.01) 0 1.25 0.19 3 0.006 FL41_M_tc < 0.3 0 1.92 N N Y 3.64 31 0.038 ERN(41, 0.038) 0 2.63 1.27 26 0.032 Similarity Networks-MACCS DOS118_M_tc > 0.9 0.67 1.1 Y Y Y 1.6 28 0.004 ERN(118, 4-E03) 0 2.11 0.52 31 0.0045 DOS41_M_tc > 0.8 0.5 1.37 Y Y Y 1.21 14 0.02 ERN(41, 0.02) 0 3.3 0.97 20 0.024 DOS32_M_tc > 0.8 0.75 1.16 Y Y Y 1.54 10 0.02 ERN(32, 0.02) 0 1.38 0.56 9 0.018 FL41_M_tc > 0.98 1 1 Y Y Y 4.9 22 0.026 ERN(41, 0.026) 0 1.77 0.78 16 0.019 Dissimilarity Networks-FP2 Assortativity degree ( r ) Modularity ( Q and QE ) D(G) Dissimilarity Network- MACCS Assortativity degree ( r ) Modularity ( Q and QE ) D(G) DOS118_tc < 0.22 -0.158 0.64 0.0016 tc < 0.2 -0.53 0.47 4.05 E-03 ERN(118, 0.0016) -0.166 0.81 0.001 ERN(118, 4.5-E03) 0.2 0.93 4.5 E-03 DOS41_tc < 0.37 -0.53 0.15 0.017 tc < 0.2 -1 0 6.1-E03 ERN(41, 0.017) -0.41 0.7 0.018 ERN(41, 0.0061) -0.61 0.61 9.7-E03 DOS32_tc < 0.4 -0.58 0.18 0.11 tc < 0.2 -0.74 0.22 0.01 ERN(32, 0.11) -0.23 0.4 0.1 ERN(32, 0.01) -0.5 0.44 0.006 FL41_tc < 0.5 -0.668 0.062 0.013 tc < 0.3 -0.72 0.087 0.038 ERN(41, 0.013) -0.095 0.58 0.017 ERN(41, 0.038) -0.03 0.66 0.032 Similarity Networks-FP2 Similarity Networks-MACCS DOS118_tc > 0.95 0.70 0.95 0.004 tc > 0.9 0.8 0.85 4.0-E03 ERN(118, 0.004) 0.2 0.88 0.003 ERN(118, 4-E03) 0.2 0.88 4.5-E03 DOS41_tc > 0.95 1 0.73 0.012 tc > 0.8 0.69 0.82 0.02 ERN(41, 0.012) 0.3 0.73 0.015 ERN(41, 0.02) 0.14 0.74 0.024 DOS32_tc > 0.8 0.59 0.77 0.06 tc > 0.8 0.35 0.72 0.02 ERN(32, 0.06) -0.09 0.46 0.07 ERN(32, 0.02) -0.53 0.64 0.018 FL41_tc > 0.995 1 0.28 0.022 tc > 0.98 1 0.2 0.026 ERN(41, 0.022) 0.144 0.74 0.04 ERN(41, 0.026) -0.19 0.73 0.019 Fitting cumulative distribution functions of the dissimilarity networks versus degree Similarity and dissimilarity networks Fruchterman Reingold layout (force field directed layout) used to visualize the undirected threshold networks. Red and blue edges represent dissimilarity and similarity edges. Networks with dissimilarity edges (a-b) show the absence of cliquishness, loss of small- world character, presence of hubs (star subnetworks) with limited communities and links of high degree nodes to low degree nodes reflecting dissortativity. Networks with similarity edges (c-d) show the presence of cliques reflecting small-worldness with high modularity. a b c d