What will you get to know? To stop the fire you have to create fire. Why do your friends seem to be more popular than you are? Are we living in a "Small World"? How do we detect epidemics early? Friendship network in BITS. Behavior in online social networking sites. How popular is something on DC++?
Complex Networks: non-trivial real-life networks, observed in most social, biological and computer networks.
The Friendship Paradox: "On average, your friends have more friends than you do." True for all networks (or graphs), with equality only in special cases such as when every node has the same degree. Prominent in real-life networks.
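The paradox can be checked on a toy graph. The sketch below is only an illustration: the star-shaped network and the function name friendship_paradox are made up, and "friends' friends" is taken to mean the average, over all nodes, of the mean degree of each node's neighbors.

```python
from statistics import mean

def friendship_paradox(adj):
    """Return (mean degree, mean over nodes of their friends' average degree)."""
    degree = {v: len(nbrs) for v, nbrs in adj.items()}
    mean_degree = mean(degree.values())
    # For every node that has at least one friend, average its friends' degrees.
    friends_avg = [mean(degree[u] for u in nbrs) for nbrs in adj.values() if nbrs]
    return mean_degree, mean(friends_avg)

# Hypothetical toy network: one hub connected to four leaves.
star = {
    "hub": {"a", "b", "c", "d"},
    "a": {"hub"}, "b": {"hub"}, "c": {"hub"}, "d": {"hub"},
}
mean_deg, mean_friend_deg = friendship_paradox(star)
print(mean_deg, mean_friend_deg)
```

Most nodes here are leaves whose only friend is the popular hub, which is why the friends' average comes out higher than the plain average degree.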
The Small World Phenomenon: any two persons in the world are connected by a chain of roughly six links of acquaintance ("six degrees of separation"). Among mathematicians: Erdős number (Paul Erdős). Among actors: Bacon number (Kevin Bacon).
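Complex Network Analysis. Diameter: the number of links in the shortest path between the two furthest-apart nodes (Small World). Average path length: the mean number of links in the shortest path, taken over all pairs of nodes. Degree: the number of links on a particular node (its number of neighbors).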
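These three measures can be computed with a breadth-first search from every node. The following is a minimal sketch for a small, connected, unweighted graph; the adjacency-dictionary representation, the function names and the four-node path are illustrative assumptions, not part of the original slides.

```python
from collections import deque

def bfs_distances(adj, source):
    """Shortest-path length (in links) from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        v = queue.popleft()
        for u in adj[v]:
            if u not in dist:
                dist[u] = dist[v] + 1
                queue.append(u)
    return dist

def path_stats(adj):
    """Return (diameter, average shortest-path length) of a connected graph."""
    longest, total, pairs = 0, 0, 0
    for s in adj:
        for t, d in bfs_distances(adj, s).items():
            if t != s:
                longest = max(longest, d)
                total += d
                pairs += 1
    return longest, total / pairs

# Hypothetical toy network: the path a - b - c - d.
path = {"a": {"b"}, "b": {"a", "c"}, "c": {"b", "d"}, "d": {"c"}}
degrees = {v: len(nbrs) for v, nbrs in path.items()}
diameter, avg_len = path_stats(path)
print(degrees, diameter, avg_len)
```

Running one BFS per node costs roughly (nodes × links) steps, which is fine for a network of a few hundred people but would need sampling on very large graphs.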
Network Density: the ratio of the number of edges in the network to the maximum possible number of edges. The density of a social network with a large number of nodes is highly unlikely to exceed 0.5.
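As a worked example of the ratio, the sketch below computes the density of a small undirected graph without self-links, where the maximum possible number of edges is n(n-1)/2. The graph and the function name density are hypothetical.

```python
def density(adj):
    """Edges present divided by the n*(n-1)/2 edges an undirected graph could have."""
    n = len(adj)
    edges = sum(len(nbrs) for nbrs in adj.values()) / 2  # each edge is stored twice
    return edges / (n * (n - 1) / 2)

# Hypothetical toy network: a triangle a-b-c with an extra node d attached to c.
graph = {"a": {"b", "c"}, "b": {"a", "c"}, "c": {"a", "b", "d"}, "d": {"c"}}
graph_density = density(graph)
print(graph_density)  # 4 edges out of a possible 6
```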
Clustering Coefficient: the likelihood that two associates of a node are associates themselves. Lies between 0 and 1. (Figure: example nodes A, X and Y.)
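One common way to turn this likelihood into a number is the local clustering coefficient: the fraction of pairs of a node's neighbors that are themselves linked, averaged over all nodes. The sketch below assumes that definition, assigns 0 to nodes with fewer than two neighbors, and uses a made-up four-node graph.

```python
from itertools import combinations

def local_clustering(adj, v):
    """Fraction of pairs of v's neighbors that are linked to each other."""
    nbrs = adj[v]
    k = len(nbrs)
    if k < 2:
        return 0.0  # convention assumed here for nodes with fewer than two neighbors
    links = sum(1 for a, b in combinations(nbrs, 2) if b in adj[a])
    return links / (k * (k - 1) / 2)

def average_clustering(adj):
    return sum(local_clustering(adj, v) for v in adj) / len(adj)

# Hypothetical toy network: a triangle a-b-c with an extra node d attached to c.
graph = {"a": {"b", "c"}, "b": {"a", "c"}, "c": {"a", "b", "d"}, "d": {"c"}}
c_local = local_clustering(graph, "c")
c_avg = average_clustering(graph)
print(c_local, c_avg)
```

Node c has three neighbors but only one of the three possible pairs (a, b) is linked, so its coefficient is 1/3.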
Centrality Measures (Betweenness): the number of shortest paths that pass through a node. Synonymous with importance. Important in the study of the spreading of forest fires, rumors, information, epidemics, etc. Revisit the "Friendship Paradox".
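Betweenness can be sketched with Brandes' algorithm for unweighted, undirected graphs. Note one assumption: when a pair of nodes has several equally short paths, this standard formulation credits each intermediate node with the fraction of those paths it lies on, which equals the plain count when shortest paths are unique. The star network is a made-up example.

```python
from collections import deque

def betweenness(adj):
    """Brandes-style betweenness for an unweighted, undirected graph."""
    score = {v: 0.0 for v in adj}
    for s in adj:
        # BFS from s, counting shortest paths (sigma) and recording predecessors.
        sigma = {v: 0 for v in adj}
        sigma[s] = 1
        dist = {s: 0}
        preds = {v: [] for v in adj}
        order = []
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        # Walk back from the farthest nodes, accumulating dependencies.
        delta = {v: 0.0 for v in adj}
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                score[w] += delta[w]
    # Every unordered pair was counted once from each endpoint.
    return {v: b / 2 for v, b in score.items()}

# Hypothetical toy network: one hub connected to four leaves.
star = {
    "hub": {"a", "b", "c", "d"},
    "a": {"hub"}, "b": {"hub"}, "c": {"hub"}, "d": {"hub"},
}
centrality = betweenness(star)
print(centrality)
```

All six leaf-to-leaf shortest paths run through the hub, so it is the node whose removal (or immunization) would stop anything spreading between the leaves.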
BITSian Friendship Network
BITSian Friendship Network. Network density: 0.37. Diameter: 4. Average path length: 1.99. Average clustering coefficient: 0.51.
Twitter Growth Model: With probability p, a new node (user) enters the network and links with one existing node. With probability q = 1 - p, an existing user gets linked to an existing node. Preferential selection: P(deg i -> deg i+1) is proportional to (i + constant).
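A rough simulation of these growth rules might look like the sketch below. Several details are assumptions, since the slide does not fix them: "degree" is taken to be the number of links a node has received, the node receiving each new link is picked with probability proportional to (degree + c), the seed network is two nodes with one link, and only received-link counts are tracked. The function name simulate_growth and its parameters are hypothetical.

```python
import random

def simulate_growth(steps, p, c=1.0, seed=0):
    """Return the list of received-link counts after `steps` growth events."""
    rng = random.Random(seed)
    in_degree = [0, 1]  # assumed seed network: node 0 links to node 1
    for _ in range(steps):
        # Preferential selection: weight of a node with degree i is (i + c).
        weights = [d + c for d in in_degree]
        target = rng.choices(range(len(in_degree)), weights=weights)[0]
        if rng.random() < p:
            in_degree.append(0)  # a new user enters and links to the target
        # Otherwise an existing user links to the target; the source is not
        # recorded because only received-link counts are tracked here.
        in_degree[target] += 1
    return in_degree

degrees = simulate_growth(steps=2000, p=0.7)
print(len(degrees), max(degrees))
```

Every step adds exactly one link, so the counts always sum to the number of steps plus the one seed link; a histogram of the returned list gives the simulated degree distribution.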
The Twitter growth model The rate equations are:
Formula vs Model Simulation
Model vs Twitter Data
Power Law!!! Degree distribution: n(j) = c · j^(-γ). Straight line in a log-log plot. Scale-free networks. Many networks are conjectured (and many found) to follow a power law, e.g. online social networks, friendship networks, collaboration networks (movie-actor, research-scientist), the World Wide Web, protein-protein interaction networks, airline networks. Pareto Principle: the 80-20 rule.
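Because log n(j) = log c - γ log j, the exponent can be read off as the negative slope of a straight-line fit in log-log coordinates. The sketch below does this with ordinary least squares on synthetic, made-up data; it is a simple illustration and is known to be a statistically crude estimator on noisy real data. The function name fit_power_law is hypothetical.

```python
import math

def fit_power_law(js, counts):
    """Least-squares fit of log n(j) = log c - gamma * log j; returns (c, gamma)."""
    xs = [math.log(j) for j in js]
    ys = [math.log(n) for n in counts]
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    intercept = my - slope * mx
    return math.exp(intercept), -slope

# Synthetic data generated exactly from n(j) = 1000 * j^(-2.5), for illustration only.
js = list(range(1, 51))
counts = [1000 * j ** -2.5 for j in js]
c_fit, gamma_fit = fit_power_law(js, counts)
print(c_fit, gamma_fit)
```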
DC++ Search Spy: a similar approach can be applied to find the number of searches vs. the "rank" of a search query. (Figure label: query keyword.)
Power Law !!!
Rank of a keyword (node) = number of nodes with degree greater than its degree. The inverse function gives the frequency of a keyword ranked r: POWER LAW !!!
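The rank definition above can be applied directly to a search log. In the sketch below, a keyword's "degree" is taken to be the number of times it was searched; the log itself and the function name ranks are made up. Under this definition the most searched keyword has rank 0.

```python
from bisect import bisect_right
from collections import Counter

def ranks(frequency):
    """Map each keyword to the number of keywords searched strictly more often."""
    ordered = sorted(frequency.values())
    n = len(ordered)
    return {k: n - bisect_right(ordered, f) for k, f in frequency.items()}

# Made-up search log, for illustration only.
searches = ["movie", "song", "movie", "game", "movie", "song"]
frequency = Counter(searches)
keyword_rank = ranks(frequency)
print(keyword_rank)
```

Plotting frequency against rank on log-log axes is then the same straight-line check used for the degree distribution.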