SlideShare a Scribd company logo
1 of 43
Download to read offline
Social Network Analysis
Inforte course on Big Social Data Analytics 2017
Dr. Jari Jussila
Twitter: @jjussila
Email: jari.j.jussila@tut.fi
GitHub: https://github.com/jjussila/BigSocialDataAnalytics
WEB
MOBILE AND SOCIAL MEDIA
ERP
CRM
Purchase &
Transaction
Records
Offers and
Quotations
Customer
Engagements
A/B Testing
Dynamic Pricing
Search Engine
Marketing and
Optimization
Target Marketing
Images and Videos
Speech to Text
Sensor Data
Application Log Data
SMS/MMS
Location Data
Social Network Analysis
From transactions to interactions
Social Media Posts
Customer
Segmenting
Network Analysis (NA) &
Social Network Analysis (SNA)
Graph and Matrix
Representation of Networks
Star
Circle
Chain
0 1 1 1 1 1 1
1 0 0 0 0 0 0
1 0 0 0 0 0 0
1 0 0 0 0 0 0
1 0 0 0 0 0 0
1 0 0 0 0 0 0
0 1 0 0 0 0 1
1 0 1 0 0 0 0
0 1 0 1 0 0 0
0 0 0 1 0 1 0
0 0 0 0 1 0 1
1 0 0 0 0 1 0
0 1 1 0 0 0 0
1 0 0 1 0 0 0
1 0 0 0 1 0 0
0 1 0 0 0 1 0
0 0 1 0 0 0 1
0 0 0 1 0 0 0
0 0 0 0 1 0 0
Matrix
Directed and Undirected
Networks
B
A
C
A B C
A 0 0 1
B 1 0 0
C 0 1 0
B
A
C
A B C
A 0 1 1
B 1 0 1
C 1 1 0
Sociomatrix
Jim Bob Alex Tom
Jim - 0 1 0
Bob 1 - 1 1
Alex 1 1 - 1
Tom 0 1 1 -
Relationship: is friend of
Source: Hoffman 2000; Moreno 1953
“the mathematical study of psychological properties of populations, the
experimental technique of and the results obtained by application of quantitative
methods” (Moreno, 1953, pp. 15-16).
Direct and Indirect Paths
(Friends/Connections/etc.)
Nodes and Edges
GephiNodeXL
Anatomy of Networks
Network Metrics:
Prominence
Centrality Prestige
Prominence
Degree
Centrality
Closeness
Centrality
Degree
Prestige
Proximity
Prestige
Betweeness
Centrality
Information
Centrality
Status or Rank
Prestige
Source: Wasserman & Faust 1994
• Degree
• How many direct links a node has to other nodes
• In the case of a directed network it is possible to
calculate both indegree (incoming connections)
and outdegree (outgoing connections)
11
Degree Centrality
Source: Wasserman & Faust 1994
• Closeness is the sum of shortest paths of a node to
other nodes in the network
• dij length of shortest path between i and j
• Closeness centrality indicates how quickly a node can
interact with other nodes
å=
=
n
ij
iji dc
Closeness Centrality
Source: Wasserman & Faust 1994
• Betweennes measures the degree to which a node is
located at the shortest paths between two nodes
• Betweennes centrality indicates the ability of node to
control information between other nodes (gatekeeper)
• A node may not be locally central, but may still have a
high betweenness centrality
13
Betweenness Centrality
Source: Wasserman & Faust 1994
Network Analysis Process in
Practice
• Network Analysis process usually consists of
the following four phases:
1. Interpreting the phenomena under
investigation as a network
2. Collecting data
3. Cleaning and refining the data
4. Network layout and fine-tuning
Source: Huhtamäki & Parviainen 2015
A process for visualization
Source: Card et al. 1999
Visualization Stages
Visual and
Cognitive
Processing
Physical Environment
Social Environment
Data gathering
Data
Preprocessing
and
transformation
Visualization
Tool
Data manipulation
Data exploration
Source: Ware 2004
OSTINATO Process Model for
Visual Network Analysis
Source: Huhtamäki 2016
Entity Recognition?
• Twitter provides natural identifiers for nodes
(however some nodes maybe fake accounts or
bots)
• In some other application areas, such as,
bibliographic data analysis entity recognition is
more problematic
• Entity Recognition can be done in network
visualization tools (e.g. Gephi Data Laboratory)
or using third-party applications (e.g. Open
Refine)
Entity Recognition in Gephi Data
Laboratory
22.5.2017 19
Source Target
Node and Edge Creation
DiGraph – Directed graphs with self loops
Each user mention creates an edge between users. For Twitter Mentions see:
https://support.twitter.com/articles/14023#
Visual Properties Configuration
Node Partition by Modularity Class
Layout Processing: Force-driven
layout
• Layout refers to the act of placing the nodes on
canvas
• Force-driven layout is a straightforward option:
– Nodes repel each other
– Connections act as springs pulling the nodes back
together
– The center of a gravitational field is placed in the
middle of the canvas
– The process is run and configured in iteration until the
visualizer is happy with the result
Source: Huhtamäki 2015
Example
Source: Huhtamäki et al. 2012
The list of startups participating
in the Tekes YIC program was
scraped from Tekes homepage.
The IEN Dataset was used to
gather data on companies,
investors, key individuals, and
acquisitions.
Moreover, the Twitter
usernames of the YIC
companies were compiled in a
spreadsheet in a semi-manual
manner, and a tailored script
was implemented to crawl
Twitter REST API to collect the
list of followers of each YIC
company with a Twitter
account.
Interactive Network Visualization
Source: Aramo-Immonen et al. 2016; Aramo-Immonen et al. 2015
http://www.tut.fi/novi/case/2015-cbh-cmadfi2014-informallearning/twomode/network/
Hashtag Co-Occurrence Matrix
http://www.tut.fi/novi/case/2015-cbh-cmadfi2014-informallearning/hashtags/matrix/
Source: Aramo-Immonen et al. 2016; Aramo-Immonen et al. 2015
Extraction of Twitter data
and Network Visualization
with Gephi
Steps
• Collect the Twitter data
– Download the following script for extracting tweets:
https://github.com/jjussila/BigSocialDataAnalytics/blob/master/sc
ripts/search_trump.py
– Create a Twitter account or borrow from friend, if you do not
already have one
– Create a Twitter App https://apps.twitter.com/
– Create keychain.json file (that includes necessary keys and
tokes for accessing the data)
• Start running Python code online
– https://www.pythonanywhere.com/
• Install the following software
– Gephi https://gephi.org/ (for network visualization)
Original Twitter-api script
Source: https://github.com/jukkahuhtamaki/pcm-demo/tree/master/twitter-api
Modified script of extracting
Twitter đata
Source: https://github.com/jjussila/BigSocialDataAnalytics
Become a Twitter Developer
Create your first Twitter App
Get the keys and tokens needed to
access Twitter data
Create keychain.json using
template file
Copy-paste from Twitter App the necessary keys and tokens
and save the file as keychain.json
Example of extracting tweet data
Modifying the script
Note:
%40 = ‘@’
%23 = ‘#’’
For more details see:
w3schools.com
ASCII Encoding Reference
Network creation with NetworkX
library
Source: NetworkX
Using PythonAnywhere
Upload the following files:
- search_twitter.py
- keychain.json
Running Python code on
PythonAnywhere
Start a new console:
Bash
Execute Python script in Bash
console
22.5.2017 39
Using PythonAnywhere
Download the following files:
- network.gexf
Open gexf (Graph Exchange XML Format)
with Gephi
Calculate the Network Metrics and
Visualize the Network
Modularity Report
(Community Detection Algorithm)
References
• Aramo-Immonen, H., Kärkkäinen, H., Jussila, J. J., Joel-Edgar, S., & Huhtamäki, J. (2016).
Visualizing informal learning behavior from conference participants' Twitter data with the Ostinato
Model. Computers in Human Behavior, 55, 584-595.
• Aramo-Immonen, H., Jussila, J., & Huhtamäki, J. (2015). Exploring co-learning behavior of
conference participants with visual network analysis of Twitter data. Computers in Human
Behavior, 51, 1154-1162.
• Bastian, M., Heymann, S., & Jacomy, M. (2009). Gephi: an open source software for exploring
and manipulating networks. ICWSM, 8, 361-362.
• Card, S. K., Mackinlay, J. D., & Shneiderman, B. (1999). Readings in information visualization:
using vision to think. Morgan Kaufmann.
• Huhtamäki, J. (2016). Ostinato Process Model for Visual Network Analytics: Experiments in
Innovation Ecosystems. (Tampere University of Technology. Publication; Vol. 1425). Tampere
University of Technology.
• Huhtamäki, J., Still, K., Isomursu, M., Russell, M., & Rubens, N. (2012, September). Networks of
Growth: The Case of Young Innovative Companies in Finland. In Proceedings of the 7th European
Conference on Innovation and Entrepreneurship: ECIE (p. 307). Academic Conferences Limited.
• Huhtamäki, J., & Parviainen, O. (2013). Verkostoanalyysi sosiaalisen median tutkimuksessa.
Otteita verkosta-Verkon ja sosiaalisen median tutkimusmenetelmät. Vastapaino, Tampere.
• Jacomy, M., Venturini, T., Heymann, S., & Bastian, M. (2014). ForceAtlas2, a continuous graph
layout algorithm for handy network visualization designed for the Gephi software. PloS one, 9(6),
e98679.
• McSweeney, P. J. (2009). Gephi Network Statistics. Presentado en Google Summer of Code.
Recuperado a partir de http://gephi. org/google-soc/gephi-netalgo. pdf.
• Ware, C. (2013). Information visualization: perception for design (Third ed.): Elsevier.
• Wasserman, S., & Faust, K. (1994). Social network analysis: Methods and applications (Vol. 8).
Cambridge university press.

More Related Content

What's hot

Default Prediction & Analysis on Lending Club Loan Data
Default Prediction & Analysis on Lending Club Loan DataDefault Prediction & Analysis on Lending Club Loan Data
Default Prediction & Analysis on Lending Club Loan DataDeep Borkar
 
Amazon sentimental analysis
Amazon sentimental analysisAmazon sentimental analysis
Amazon sentimental analysisAkhila
 
The Basics of Social Network Analysis
The Basics of Social Network AnalysisThe Basics of Social Network Analysis
The Basics of Social Network AnalysisRory Sie
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningShahar Cohen
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network AnalysisFred Stutzman
 
Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment AnalysisJaganadh Gopinadhan
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSumit Raj
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment AnalysisAnkur Tyagi
 
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...Simplilearn
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment AnalysisAditya Nag
 
Social Recommender Systems
Social Recommender SystemsSocial Recommender Systems
Social Recommender Systemsguest77b0cd12
 
How Sentiment Analysis works
How Sentiment Analysis worksHow Sentiment Analysis works
How Sentiment Analysis worksCJ Jenkins
 
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?Countants
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation SystemAnamta Sayyed
 
Approaches to Sentiment Analysis
Approaches to Sentiment AnalysisApproaches to Sentiment Analysis
Approaches to Sentiment AnalysisNihar Suryawanshi
 
Multi media Data mining
Multi media Data miningMulti media Data mining
Multi media Data mininghome
 
Classification
ClassificationClassification
ClassificationCloudxLab
 

What's hot (20)

Default Prediction & Analysis on Lending Club Loan Data
Default Prediction & Analysis on Lending Club Loan DataDefault Prediction & Analysis on Lending Club Loan Data
Default Prediction & Analysis on Lending Club Loan Data
 
Amazon sentimental analysis
Amazon sentimental analysisAmazon sentimental analysis
Amazon sentimental analysis
 
The Basics of Social Network Analysis
The Basics of Social Network AnalysisThe Basics of Social Network Analysis
The Basics of Social Network Analysis
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment Analysis
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter Data
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
 
Sequential Pattern Mining and GSP
Sequential Pattern Mining and GSPSequential Pattern Mining and GSP
Sequential Pattern Mining and GSP
 
Data Analytics Life Cycle
Data Analytics Life CycleData Analytics Life Cycle
Data Analytics Life Cycle
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Social Recommender Systems
Social Recommender SystemsSocial Recommender Systems
Social Recommender Systems
 
How Sentiment Analysis works
How Sentiment Analysis worksHow Sentiment Analysis works
How Sentiment Analysis works
 
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation System
 
Amazon seniment
Amazon senimentAmazon seniment
Amazon seniment
 
Approaches to Sentiment Analysis
Approaches to Sentiment AnalysisApproaches to Sentiment Analysis
Approaches to Sentiment Analysis
 
Multi media Data mining
Multi media Data miningMulti media Data mining
Multi media Data mining
 
Classification
ClassificationClassification
Classification
 

Similar to Big social data analytics - social network analysis

IRJET- Monitoring Suspicious Discussions on Online Forums using Data Mining
IRJET- Monitoring Suspicious Discussions on Online Forums using Data MiningIRJET- Monitoring Suspicious Discussions on Online Forums using Data Mining
IRJET- Monitoring Suspicious Discussions on Online Forums using Data MiningIRJET Journal
 
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)Avoiding Anonymous Users in Multiple Social Media Networks (SMN)
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)paperpublications3
 
Smashing SIlos: UX is the New SEO
Smashing SIlos: UX is the New SEOSmashing SIlos: UX is the New SEO
Smashing SIlos: UX is the New SEOBrightEdge
 
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHIBig Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHIRuchika Sharma
 
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4jAI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4jIvan Zoratti
 
A Novel Frame Work System Used In Mobile with Cloud Based Environment
A Novel Frame Work System Used In Mobile with Cloud Based EnvironmentA Novel Frame Work System Used In Mobile with Cloud Based Environment
A Novel Frame Work System Used In Mobile with Cloud Based Environmentpaperpublications3
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data scienceMahir Haque
 
Knowledge Extraction from Social Media
Knowledge Extraction from Social MediaKnowledge Extraction from Social Media
Knowledge Extraction from Social MediaSeth Grimes
 
Analysis and Prediction of Sentiments for Cricket Tweets using Hadoop
Analysis and Prediction of Sentiments for Cricket Tweets using HadoopAnalysis and Prediction of Sentiments for Cricket Tweets using Hadoop
Analysis and Prediction of Sentiments for Cricket Tweets using HadoopIRJET Journal
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataIRJET Journal
 
SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfBalasundaramSr
 
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...Editor IJAIEM
 
Overview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and FoundationsOverview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and FoundationsNUS-ISS
 
Terrorism Analysis through Social Media using Data Mining
Terrorism Analysis through Social Media using Data MiningTerrorism Analysis through Social Media using Data Mining
Terrorism Analysis through Social Media using Data MiningIRJET Journal
 
IRJET- Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec
IRJET-  	  Improved Real-Time Twitter Sentiment Analysis using ML & Word2VecIRJET-  	  Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec
IRJET- Improved Real-Time Twitter Sentiment Analysis using ML & Word2VecIRJET Journal
 
IRJET- Twitter Sentimental Analysis for Predicting Election Result using ...
IRJET-  	  Twitter Sentimental Analysis for Predicting Election Result using ...IRJET-  	  Twitter Sentimental Analysis for Predicting Election Result using ...
IRJET- Twitter Sentimental Analysis for Predicting Election Result using ...IRJET Journal
 
Framework for opinion as a service on review data of customer using semantics...
Framework for opinion as a service on review data of customer using semantics...Framework for opinion as a service on review data of customer using semantics...
Framework for opinion as a service on review data of customer using semantics...IJECEIAES
 
Service Rating Prediction by check-in and check-out behavior of user and POI
Service Rating Prediction by check-in and check-out behavior of user and POIService Rating Prediction by check-in and check-out behavior of user and POI
Service Rating Prediction by check-in and check-out behavior of user and POIIRJET Journal
 

Similar to Big social data analytics - social network analysis (20)

IRJET- Monitoring Suspicious Discussions on Online Forums using Data Mining
IRJET- Monitoring Suspicious Discussions on Online Forums using Data MiningIRJET- Monitoring Suspicious Discussions on Online Forums using Data Mining
IRJET- Monitoring Suspicious Discussions on Online Forums using Data Mining
 
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)Avoiding Anonymous Users in Multiple Social Media Networks (SMN)
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)
 
Smashing SIlos: UX is the New SEO
Smashing SIlos: UX is the New SEOSmashing SIlos: UX is the New SEO
Smashing SIlos: UX is the New SEO
 
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHIBig Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
 
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4jAI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
 
A Novel Frame Work System Used In Mobile with Cloud Based Environment
A Novel Frame Work System Used In Mobile with Cloud Based EnvironmentA Novel Frame Work System Used In Mobile with Cloud Based Environment
A Novel Frame Work System Used In Mobile with Cloud Based Environment
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Knowledge Extraction from Social Media
Knowledge Extraction from Social MediaKnowledge Extraction from Social Media
Knowledge Extraction from Social Media
 
Q046049397
Q046049397Q046049397
Q046049397
 
Analysis and Prediction of Sentiments for Cricket Tweets using Hadoop
Analysis and Prediction of Sentiments for Cricket Tweets using HadoopAnalysis and Prediction of Sentiments for Cricket Tweets using Hadoop
Analysis and Prediction of Sentiments for Cricket Tweets using Hadoop
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter Data
 
SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdf
 
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
 
Overview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and FoundationsOverview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and Foundations
 
Terrorism Analysis through Social Media using Data Mining
Terrorism Analysis through Social Media using Data MiningTerrorism Analysis through Social Media using Data Mining
Terrorism Analysis through Social Media using Data Mining
 
IRJET- Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec
IRJET-  	  Improved Real-Time Twitter Sentiment Analysis using ML & Word2VecIRJET-  	  Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec
IRJET- Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec
 
IRJET- Twitter Sentimental Analysis for Predicting Election Result using ...
IRJET-  	  Twitter Sentimental Analysis for Predicting Election Result using ...IRJET-  	  Twitter Sentimental Analysis for Predicting Election Result using ...
IRJET- Twitter Sentimental Analysis for Predicting Election Result using ...
 
Framework for opinion as a service on review data of customer using semantics...
Framework for opinion as a service on review data of customer using semantics...Framework for opinion as a service on review data of customer using semantics...
Framework for opinion as a service on review data of customer using semantics...
 
[IJCT-V3I2P30] Authors: Sunny Sharma
[IJCT-V3I2P30] Authors: Sunny Sharma[IJCT-V3I2P30] Authors: Sunny Sharma
[IJCT-V3I2P30] Authors: Sunny Sharma
 
Service Rating Prediction by check-in and check-out behavior of user and POI
Service Rating Prediction by check-in and check-out behavior of user and POIService Rating Prediction by check-in and check-out behavior of user and POI
Service Rating Prediction by check-in and check-out behavior of user and POI
 

More from Jari Jussila

Open Data and Standard APIs
Open Data and Standard APIsOpen Data and Standard APIs
Open Data and Standard APIsJari Jussila
 
Tutkimuksen tekeminen näkyväksi
Tutkimuksen tekeminen näkyväksiTutkimuksen tekeminen näkyväksi
Tutkimuksen tekeminen näkyväksiJari Jussila
 
HAMK Design Factory and Product Development Project 21
HAMK Design Factory and Product Development Project 21HAMK Design Factory and Product Development Project 21
HAMK Design Factory and Product Development Project 21Jari Jussila
 
Social media and online communities
Social media and online communitiesSocial media and online communities
Social media and online communitiesJari Jussila
 
Hamk Design Factory opening
Hamk Design Factory openingHamk Design Factory opening
Hamk Design Factory openingJari Jussila
 
Cognitive computing approaches for human activity recognition
Cognitive computing approaches for human activity recognitionCognitive computing approaches for human activity recognition
Cognitive computing approaches for human activity recognitionJari Jussila
 
Social media analysis and document based research
Social media analysis and document based researchSocial media analysis and document based research
Social media analysis and document based researchJari Jussila
 
Social media impact after publishing
Social media impact after publishingSocial media impact after publishing
Social media impact after publishingJari Jussila
 
Towards ecosystem for research and development of electrodermal activity appl...
Towards ecosystem for research and development of electrodermal activity appl...Towards ecosystem for research and development of electrodermal activity appl...
Towards ecosystem for research and development of electrodermal activity appl...Jari Jussila
 
Visualising maritime vessel open data for better situational awareness in ice...
Visualising maritime vessel open data for better situational awareness in ice...Visualising maritime vessel open data for better situational awareness in ice...
Visualising maritime vessel open data for better situational awareness in ice...Jari Jussila
 
Literature review on customer emotions in social media
Literature review on customer emotions in social mediaLiterature review on customer emotions in social media
Literature review on customer emotions in social mediaJari Jussila
 
SAPSyke Hackathon Y-Foorumi
SAPSyke Hackathon Y-FoorumiSAPSyke Hackathon Y-Foorumi
SAPSyke Hackathon Y-FoorumiJari Jussila
 
Deeva project introduction
Deeva project introductionDeeva project introduction
Deeva project introductionJari Jussila
 
Digitaalinen arvonluonti - tunneympyräsovellus
Digitaalinen arvonluonti -  tunneympyräsovellusDigitaalinen arvonluonti -  tunneympyräsovellus
Digitaalinen arvonluonti - tunneympyräsovellusJari Jussila
 
Application for pre-processing and visualization of electrodermal activity we...
Application for pre-processing and visualization of electrodermal activity we...Application for pre-processing and visualization of electrodermal activity we...
Application for pre-processing and visualization of electrodermal activity we...Jari Jussila
 
A bibliometric study on authorship trends and research themes
A bibliometric study on authorship trends and research themes A bibliometric study on authorship trends and research themes
A bibliometric study on authorship trends and research themes Jari Jussila
 
Rakennetun ympäristön sähköinen asiointipalvelu kuntien tiedolla johtamis...
Rakennetun ympäristön sähköinen asiointipalvelu kuntien tiedolla johtamis...Rakennetun ympäristön sähköinen asiointipalvelu kuntien tiedolla johtamis...
Rakennetun ympäristön sähköinen asiointipalvelu kuntien tiedolla johtamis...Jari Jussila
 
Sosiaalisen median analysointi ja dokumenttipohjainen tutkimus
Sosiaalisen median analysointi ja dokumenttipohjainen tutkimusSosiaalisen median analysointi ja dokumenttipohjainen tutkimus
Sosiaalisen median analysointi ja dokumenttipohjainen tutkimusJari Jussila
 
Computational methods for intelligent matchmaking for knowledge work
Computational methods for intelligent matchmaking for knowledge workComputational methods for intelligent matchmaking for knowledge work
Computational methods for intelligent matchmaking for knowledge workJari Jussila
 
Someanalytiikan hyödyntäminen liiketoiminnassa
Someanalytiikan hyödyntäminen liiketoiminnassaSomeanalytiikan hyödyntäminen liiketoiminnassa
Someanalytiikan hyödyntäminen liiketoiminnassaJari Jussila
 

More from Jari Jussila (20)

Open Data and Standard APIs
Open Data and Standard APIsOpen Data and Standard APIs
Open Data and Standard APIs
 
Tutkimuksen tekeminen näkyväksi
Tutkimuksen tekeminen näkyväksiTutkimuksen tekeminen näkyväksi
Tutkimuksen tekeminen näkyväksi
 
HAMK Design Factory and Product Development Project 21
HAMK Design Factory and Product Development Project 21HAMK Design Factory and Product Development Project 21
HAMK Design Factory and Product Development Project 21
 
Social media and online communities
Social media and online communitiesSocial media and online communities
Social media and online communities
 
Hamk Design Factory opening
Hamk Design Factory openingHamk Design Factory opening
Hamk Design Factory opening
 
Cognitive computing approaches for human activity recognition
Cognitive computing approaches for human activity recognitionCognitive computing approaches for human activity recognition
Cognitive computing approaches for human activity recognition
 
Social media analysis and document based research
Social media analysis and document based researchSocial media analysis and document based research
Social media analysis and document based research
 
Social media impact after publishing
Social media impact after publishingSocial media impact after publishing
Social media impact after publishing
 
Towards ecosystem for research and development of electrodermal activity appl...
Towards ecosystem for research and development of electrodermal activity appl...Towards ecosystem for research and development of electrodermal activity appl...
Towards ecosystem for research and development of electrodermal activity appl...
 
Visualising maritime vessel open data for better situational awareness in ice...
Visualising maritime vessel open data for better situational awareness in ice...Visualising maritime vessel open data for better situational awareness in ice...
Visualising maritime vessel open data for better situational awareness in ice...
 
Literature review on customer emotions in social media
Literature review on customer emotions in social mediaLiterature review on customer emotions in social media
Literature review on customer emotions in social media
 
SAPSyke Hackathon Y-Foorumi
SAPSyke Hackathon Y-FoorumiSAPSyke Hackathon Y-Foorumi
SAPSyke Hackathon Y-Foorumi
 
Deeva project introduction
Deeva project introductionDeeva project introduction
Deeva project introduction
 
Digitaalinen arvonluonti - tunneympyräsovellus
Digitaalinen arvonluonti -  tunneympyräsovellusDigitaalinen arvonluonti -  tunneympyräsovellus
Digitaalinen arvonluonti - tunneympyräsovellus
 
Application for pre-processing and visualization of electrodermal activity we...
Application for pre-processing and visualization of electrodermal activity we...Application for pre-processing and visualization of electrodermal activity we...
Application for pre-processing and visualization of electrodermal activity we...
 
A bibliometric study on authorship trends and research themes
A bibliometric study on authorship trends and research themes A bibliometric study on authorship trends and research themes
A bibliometric study on authorship trends and research themes
 
Rakennetun ympäristön sähköinen asiointipalvelu kuntien tiedolla johtamis...
Rakennetun ympäristön sähköinen asiointipalvelu kuntien tiedolla johtamis...Rakennetun ympäristön sähköinen asiointipalvelu kuntien tiedolla johtamis...
Rakennetun ympäristön sähköinen asiointipalvelu kuntien tiedolla johtamis...
 
Sosiaalisen median analysointi ja dokumenttipohjainen tutkimus
Sosiaalisen median analysointi ja dokumenttipohjainen tutkimusSosiaalisen median analysointi ja dokumenttipohjainen tutkimus
Sosiaalisen median analysointi ja dokumenttipohjainen tutkimus
 
Computational methods for intelligent matchmaking for knowledge work
Computational methods for intelligent matchmaking for knowledge workComputational methods for intelligent matchmaking for knowledge work
Computational methods for intelligent matchmaking for knowledge work
 
Someanalytiikan hyödyntäminen liiketoiminnassa
Someanalytiikan hyödyntäminen liiketoiminnassaSomeanalytiikan hyödyntäminen liiketoiminnassa
Someanalytiikan hyödyntäminen liiketoiminnassa
 

Recently uploaded

Vip Female Escorts Noida 9711199171 Greater Noida Escorts Service
Vip Female Escorts Noida 9711199171 Greater Noida Escorts ServiceVip Female Escorts Noida 9711199171 Greater Noida Escorts Service
Vip Female Escorts Noida 9711199171 Greater Noida Escorts Serviceankitnayak356677
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Neil Kimberley
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation SlidesKeppelCorporation
 
Insurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageInsurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageMatteo Carbone
 
Intro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdfIntro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdfpollardmorgan
 
RE Capital's Visionary Leadership under Newman Leech
RE Capital's Visionary Leadership under Newman LeechRE Capital's Visionary Leadership under Newman Leech
RE Capital's Visionary Leadership under Newman LeechNewman George Leech
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...lizamodels9
 
A DAY IN THE LIFE OF A SALESMAN / WOMAN
A DAY IN THE LIFE OF A  SALESMAN / WOMANA DAY IN THE LIFE OF A  SALESMAN / WOMAN
A DAY IN THE LIFE OF A SALESMAN / WOMANIlamathiKannappan
 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.Aaiza Hassan
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Dipal Arora
 
The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024christinemoorman
 
Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...
Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...
Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...lizamodels9
 
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...lizamodels9
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst SummitHolger Mueller
 
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...lizamodels9
 
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Tech Startup Growth Hacking 101  - Basics on Growth MarketingTech Startup Growth Hacking 101  - Basics on Growth Marketing
Tech Startup Growth Hacking 101 - Basics on Growth MarketingShawn Pang
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communicationskarancommunications
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Serviceritikaroy0888
 
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsCash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsApsara Of India
 

Recently uploaded (20)

KestrelPro Flyer Japan IT Week 2024 (English)
KestrelPro Flyer Japan IT Week 2024 (English)KestrelPro Flyer Japan IT Week 2024 (English)
KestrelPro Flyer Japan IT Week 2024 (English)
 
Vip Female Escorts Noida 9711199171 Greater Noida Escorts Service
Vip Female Escorts Noida 9711199171 Greater Noida Escorts ServiceVip Female Escorts Noida 9711199171 Greater Noida Escorts Service
Vip Female Escorts Noida 9711199171 Greater Noida Escorts Service
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
 
Insurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageInsurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usage
 
Intro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdfIntro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdf
 
RE Capital's Visionary Leadership under Newman Leech
RE Capital's Visionary Leadership under Newman LeechRE Capital's Visionary Leadership under Newman Leech
RE Capital's Visionary Leadership under Newman Leech
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
 
A DAY IN THE LIFE OF A SALESMAN / WOMAN
A DAY IN THE LIFE OF A  SALESMAN / WOMANA DAY IN THE LIFE OF A  SALESMAN / WOMAN
A DAY IN THE LIFE OF A SALESMAN / WOMAN
 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
 
The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024
 
Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...
Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...
Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...
 
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst Summit
 
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
 
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Tech Startup Growth Hacking 101  - Basics on Growth MarketingTech Startup Growth Hacking 101  - Basics on Growth Marketing
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communications
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Service
 
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsCash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
 

Big social data analytics - social network analysis

  • 1. Social Network Analysis Inforte course on Big Social Data Analytics 2017 Dr. Jari Jussila Twitter: @jjussila Email: jari.j.jussila@tut.fi GitHub: https://github.com/jjussila/BigSocialDataAnalytics
  • 2. WEB MOBILE AND SOCIAL MEDIA ERP CRM Purchase & Transaction Records Offers and Quotations Customer Engagements A/B Testing Dynamic Pricing Search Engine Marketing and Optimization Target Marketing Images and Videos Speech to Text Sensor Data Application Log Data SMS/MMS Location Data Social Network Analysis From transactions to interactions Social Media Posts Customer Segmenting
  • 3. Network Analysis (NA) & Social Network Analysis (SNA)
  • 4. Graph and Matrix Representation of Networks Star Circle Chain 0 1 1 1 1 1 1 1 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 1 1 0 1 0 0 0 0 0 1 0 1 0 0 0 0 0 0 1 0 1 0 0 0 0 0 1 0 1 1 0 0 0 0 1 0 0 1 1 0 0 0 0 1 0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 0 0 0 0 1 0 0 Matrix
  • 5. Directed and Undirected Networks B A C A B C A 0 0 1 B 1 0 0 C 0 1 0 B A C A B C A 0 1 1 B 1 0 1 C 1 1 0
  • 6. Sociomatrix Jim Bob Alex Tom Jim - 0 1 0 Bob 1 - 1 1 Alex 1 1 - 1 Tom 0 1 1 - Relationship: is friend of Source: Hoffman 2000; Moreno 1953 “the mathematical study of psychological properties of populations, the experimental technique of and the results obtained by application of quantitative methods” (Moreno, 1953, pp. 15-16).
  • 7. Direct and Indirect Paths (Friends/Connections/etc.)
  • 11. • Degree • How many direct links a node has to other nodes • In the case of a directed network it is possible to calculate both indegree (incoming connections) and outdegree (outgoing connections) 11 Degree Centrality Source: Wasserman & Faust 1994
  • 12. • Closeness is the sum of shortest paths of a node to other nodes in the network • dij length of shortest path between i and j • Closeness centrality indicates how quickly a node can interact with other nodes å= = n ij iji dc Closeness Centrality Source: Wasserman & Faust 1994
  • 13. • Betweennes measures the degree to which a node is located at the shortest paths between two nodes • Betweennes centrality indicates the ability of node to control information between other nodes (gatekeeper) • A node may not be locally central, but may still have a high betweenness centrality 13 Betweenness Centrality Source: Wasserman & Faust 1994
  • 14. Network Analysis Process in Practice • Network Analysis process usually consists of the following four phases: 1. Interpreting the phenomena under investigation as a network 2. Collecting data 3. Cleaning and refining the data 4. Network layout and fine-tuning Source: Huhtamäki & Parviainen 2015
  • 15. A process for visualization Source: Card et al. 1999
  • 16. Visualization Stages Visual and Cognitive Processing Physical Environment Social Environment Data gathering Data Preprocessing and transformation Visualization Tool Data manipulation Data exploration Source: Ware 2004
  • 17. OSTINATO Process Model for Visual Network Analysis Source: Huhtamäki 2016
  • 18. Entity Recognition? • Twitter provides natural identifiers for nodes (however some nodes maybe fake accounts or bots) • In some other application areas, such as, bibliographic data analysis entity recognition is more problematic • Entity Recognition can be done in network visualization tools (e.g. Gephi Data Laboratory) or using third-party applications (e.g. Open Refine)
  • 19. Entity Recognition in Gephi Data Laboratory 22.5.2017 19 Source Target
  • 20. Node and Edge Creation DiGraph – Directed graphs with self loops Each user mention creates an edge between users. For Twitter Mentions see: https://support.twitter.com/articles/14023#
  • 21. Visual Properties Configuration Node Partition by Modularity Class
  • 22. Layout Processing: Force-driven layout • Layout refers to the act of placing the nodes on canvas • Force-driven layout is a straightforward option: – Nodes repel each other – Connections act as springs pulling the nodes back together – The center of a gravitational field is placed in the middle of the canvas – The process is run and configured in iteration until the visualizer is happy with the result Source: Huhtamäki 2015
  • 23. Example Source: Huhtamäki et al. 2012 The list of startups participating in the Tekes YIC program was scraped from Tekes homepage. The IEN Dataset was used to gather data on companies, investors, key individuals, and acquisitions. Moreover, the Twitter usernames of the YIC companies were compiled in a spreadsheet in a semi-manual manner, and a tailored script was implemented to crawl Twitter REST API to collect the list of followers of each YIC company with a Twitter account.
  • 24. Interactive Network Visualization Source: Aramo-Immonen et al. 2016; Aramo-Immonen et al. 2015 http://www.tut.fi/novi/case/2015-cbh-cmadfi2014-informallearning/twomode/network/
  • 26. Extraction of Twitter data and Network Visualization with Gephi
  • 27. Steps • Collect the Twitter data – Download the following script for extracting tweets: https://github.com/jjussila/BigSocialDataAnalytics/blob/master/sc ripts/search_trump.py – Create a Twitter account or borrow from friend, if you do not already have one – Create a Twitter App https://apps.twitter.com/ – Create keychain.json file (that includes necessary keys and tokes for accessing the data) • Start running Python code online – https://www.pythonanywhere.com/ • Install the following software – Gephi https://gephi.org/ (for network visualization)
  • 28. Original Twitter-api script Source: https://github.com/jukkahuhtamaki/pcm-demo/tree/master/twitter-api
  • 29. Modified script of extracting Twitter đata Source: https://github.com/jjussila/BigSocialDataAnalytics
  • 30. Become a Twitter Developer
  • 31. Create your first Twitter App
  • 32. Get the keys and tokens needed to access Twitter data
  • 33. Create keychain.json using template file Copy-paste from Twitter App the necessary keys and tokens and save the file as keychain.json
  • 34. Example of extracting tweet data
  • 35. Modifying the script Note: %40 = ‘@’ %23 = ‘#’’ For more details see: w3schools.com ASCII Encoding Reference
  • 36. Network creation with NetworkX library Source: NetworkX
  • 37. Using PythonAnywhere Upload the following files: - search_twitter.py - keychain.json
  • 38. Running Python code on PythonAnywhere Start a new console: Bash
  • 39. Execute Python script in Bash console 22.5.2017 39
  • 40. Using PythonAnywhere Download the following files: - network.gexf
  • 41. Open gexf (Graph Exchange XML Format) with Gephi
  • 42. Calculate the Network Metrics and Visualize the Network Modularity Report (Community Detection Algorithm)
  • 43. References • Aramo-Immonen, H., Kärkkäinen, H., Jussila, J. J., Joel-Edgar, S., & Huhtamäki, J. (2016). Visualizing informal learning behavior from conference participants' Twitter data with the Ostinato Model. Computers in Human Behavior, 55, 584-595. • Aramo-Immonen, H., Jussila, J., & Huhtamäki, J. (2015). Exploring co-learning behavior of conference participants with visual network analysis of Twitter data. Computers in Human Behavior, 51, 1154-1162. • Bastian, M., Heymann, S., & Jacomy, M. (2009). Gephi: an open source software for exploring and manipulating networks. ICWSM, 8, 361-362. • Card, S. K., Mackinlay, J. D., & Shneiderman, B. (1999). Readings in information visualization: using vision to think. Morgan Kaufmann. • Huhtamäki, J. (2016). Ostinato Process Model for Visual Network Analytics: Experiments in Innovation Ecosystems. (Tampere University of Technology. Publication; Vol. 1425). Tampere University of Technology. • Huhtamäki, J., Still, K., Isomursu, M., Russell, M., & Rubens, N. (2012, September). Networks of Growth: The Case of Young Innovative Companies in Finland. In Proceedings of the 7th European Conference on Innovation and Entrepreneurship: ECIE (p. 307). Academic Conferences Limited. • Huhtamäki, J., & Parviainen, O. (2013). Verkostoanalyysi sosiaalisen median tutkimuksessa. Otteita verkosta-Verkon ja sosiaalisen median tutkimusmenetelmät. Vastapaino, Tampere. • Jacomy, M., Venturini, T., Heymann, S., & Bastian, M. (2014). ForceAtlas2, a continuous graph layout algorithm for handy network visualization designed for the Gephi software. PloS one, 9(6), e98679. • McSweeney, P. J. (2009). Gephi Network Statistics. Presentado en Google Summer of Code. Recuperado a partir de http://gephi. org/google-soc/gephi-netalgo. pdf. • Ware, C. (2013). Information visualization: perception for design (Third ed.): Elsevier. • Wasserman, S., & Faust, K. (1994). Social network analysis: Methods and applications (Vol. 8). Cambridge university press.