SlideShare a Scribd company logo
BIG	
  DATA	
  AND	
  SOCIAL	
  SCIENCE	
  THEORY	
  
Leveraging	
  Large	
  Scale	
  Data	
  to	
  Discover	
  
New	
  Pa4erns	
  in	
  Society	
  
Monday,	
  April	
  7,	
  2014	
  
CybermoCons	
  @	
  Korea	
  
Yeungnam	
  University	
  
	
  
Ma4hew	
  Weber	
  
Rutgers	
  University	
  
School	
  of	
  CommunicaCon	
  &	
  InformaCon	
  
2
Opportunity:	
  The	
  Internet	
  Archive	
  contains	
  the	
  largest	
  
single	
  record	
  of	
  the	
  history	
  of	
  the	
  World	
  Wide	
  Web	
  from	
  
1995	
  to	
  the	
  present—a	
  wealth	
  of	
  untapped	
  research	
  data.	
  	
  
Challenge:	
  There	
  is	
  a	
  significant	
  lack	
  of	
  research-­‐ready	
  
databases	
  and	
  tools	
  available	
  to	
  the	
  scholarly	
  community	
  
© Internet Archive 2013
©	
  Internet	
  Archive	
  2013	
  
5
6
7
8
9
10
Opportunity:	
  The	
  ArchiveHub	
  project	
  aims	
  to	
  support	
  the	
  
creaCon	
  and	
  disseminaCon	
  of	
  general	
  guidelines	
  &	
  tools	
  for	
  
conducCng	
  theoreCcally	
  and	
  methodologically	
  rigorous	
  
longitudinal	
  research	
  using	
  archival	
  Web	
  data	
  	
  
11
12
13
14
Dataset	
   Research	
  PotenAal	
   Dates	
   Captures	
   Unique	
  URLs	
  
Hurricane	
  Katrina	
   Online	
  networks	
  and	
  organizaConal	
  
resilience	
  (Chewning,	
  Lai	
  and	
  Doerfel,	
  
2012;	
  Perry,	
  Taylor	
  and	
  Doerfel,	
  2003)	
  in	
  
the	
  wake	
  of	
  disasters;	
  informaCon	
  
disseminaCon	
  	
  
2003	
  –	
  2012	
   1,694,236	
   663,740	
  	
  
Superstorm	
  
Sandy	
  
2003	
  –	
  2012	
   41,703,112	
   20,013,455	
  
US	
  Senate	
   Study	
  the	
  growth	
  of	
  poliCcal	
  acCvity	
  in	
  
online	
  environments	
  (Adamic	
  &	
  Glance,	
  
2005;	
  Bruns,	
  2007;	
  Chang	
  &	
  Park,	
  2012);	
  
polarizaCon	
  &	
  media	
  discourse	
  
109th	
  –	
  112th	
  
Congresses	
  
26,965,770	
   	
  8,674,397	
  	
  
US	
  House	
   51,840,777	
   12,410,014	
  
Occupy	
  Wall	
  
Street	
  
Previous	
  research	
  on	
  NGOs	
  in	
  the	
  online	
  
environment	
  (Bach	
  &	
  Stark,	
  2004;	
  
Shumate,	
  2003,	
  2012;	
  Shumate,	
  Fulk,	
  &	
  
Monge,	
  2005);	
  use	
  of	
  hyperlink	
  data	
  to	
  
study	
  the	
  formaCon	
  and	
  role	
  of	
  alliances	
  
between	
  SMOs	
  
2010	
  –	
  2012	
   247,928,272	
   11,3259,655	
  
US	
  Media	
  
Previous	
  studies	
  of	
  news	
  media	
  
organizaCons	
  (Greer	
  &	
  Mensing,	
  2006;	
  
Weber,	
  2012;	
  Weber	
  &	
  Monge,	
  In	
  
Press);	
  focus	
  on	
  evoluConary	
  pa4erns	
  
2008	
  –	
  2012	
   1,315,132,555	
   539,184,823	
  
15
http://archivehub.rutgers.edu
16
Tracing	
  the	
  Emergence	
  of	
  OrganizaConal	
  Forms	
  
17
Environment:	
  	
  
OrganizaCons	
  compete	
  for	
  scare	
  resources;	
  during	
  rapid	
  periods	
  of	
  
disrupCon,	
  new	
  entrants	
  seek	
  “protected”	
  niches	
  (Weber	
  &	
  Monge	
  2014)	

PopulaAon:	
  	
  
In	
  digital	
  spaces,	
  online	
  connecCons	
  provide	
  communicaCve	
  representaCons	
  of	
  
informaCon	
  flows	
  (Weber	
  &	
  Monge,	
  2012)	
  
	
  
FormaCon	
  of	
  Ces	
  (e.g.	
  hyperlinks)	
  can	
  posiCvely	
  impact	
  long-­‐term	
  likelihood	
  of	
  
organizaCon	
  survival	
  (Weber,	
  2012)	
  
	

OrganizaAon:	
  	
  
OrganizaCons	
  adapt	
  internally,	
  reconfiguring	
  team	
  structures	
  and	
  
developing	
  new	
  rouCnes	
  for	
  knowledge	
  sharing	
  	
  
(Ellison,	
  Gibbs	
  &	
  Weber,	
  In	
  Press;	
  Weber	
  &	
  Kim,	
  Under	
  Review)
18
Big Data… Big Theory?	
  
•  Networks	
  are	
  central	
  to	
  social	
  movements	
  in	
  that	
  links	
  between	
  
nodes	
  can	
  be	
  influenCal	
  in	
  collecCve	
  acCon	
  
•  Examples	
  of	
  nodes	
  includes	
  parCcipants,	
  organizaCons,	
  media	
  and	
  
communicaCons	
  technologies	
  	
  
•  Social	
  networks	
  and	
  social	
  movements	
  (Diani,	
  2003)	
  
	
  
•  The	
  interacCon	
  between	
  actors,	
  and	
  between	
  actors	
  and	
  hashtags,	
  
collecCvely	
  represent	
  a	
  networked	
  form	
  of	
  organizaCon	
  	
  
•  Network	
  form	
  of	
  organizaCon	
  (Powell,	
  1990)	
  
Over time, dyadic communication will become prevalent in
an emerging networked organization.H1:	
  
As a social movement develops as an emerging network
form of organization, the organizational structure will be
increasingly clustered.
H2:	
  
Data	
  
•  TriangulaCon	
  of	
  data	
  insulates	
  against	
  false	
  readings	
  from	
  large-­‐scale	
  data	
  
(see	
  Lazer,	
  Kennedy,	
  King	
  and	
  Vespignani,	
  2014)	
  
•  Internet	
  Archive:	
  
–  14	
  websites;	
  4,504	
  hyperlink	
  dyads	
  over	
  a	
  2-­‐month	
  period.	
  
•  Lexis	
  Nexis:	
  
–  Search	
  conducted	
  to	
  assess	
  U.S.	
  newspaper	
  coverage	
  of	
  OWS	
  from	
  the	
  early	
  stages	
  of	
  the	
  
movement	
  in	
  September	
  2011	
  through	
  Sept.	
  2012	
  
–  Search	
  OWS	
  keywords,	
  e.g.	
  “Occupy	
  Wall	
  Street,”	
  “Occupy	
  Oakland”	
  
•  Twi4er	
  
–  Gnip	
  PowerTrack	
  	
  
•  Search	
  by	
  keywords;	
  captures	
  a	
  larger	
  volume	
  of	
  Twi4er	
  data	
  than	
  other	
  opCons	
  	
  
–  Sample	
  includes	
  October	
  17,	
  2011,	
  through	
  January	
  5,	
  2012.	
  IniCal	
  study	
  focused	
  on	
  the	
  
criCcal	
  two-­‐month	
  period	
  from	
  November	
  1	
  through	
  December	
  31,	
  2011,	
  	
  
–  750,816	
  tweets	
  across	
  the	
  two-­‐month	
  period.	
  	
  
21
OWS News Coverage	
  
OWS	
  on	
  the	
  Web	
  
•  335	
  seed	
  organizaCons	
  based	
  on	
  records	
  from	
  #OccupyResearch	
  
•  Data	
  extracted	
  for	
  2011	
  &	
  2012,	
  based	
  on	
  “both	
  matching”	
  
24
0	
  
2	
  
4	
  
6	
  
8	
  
10	
  
12	
  
14	
  
16	
  
18	
  
Millions	
  
Captures	
  per	
  Month	
  
Maximal	
  Cores	
  (k	
  Coreness)	
  
25
Aug.	
  2011	
  
Jan.	
  2012	
  
26
	
  -­‐	
  	
  	
  	
  
	
  10,000.00	
  	
  
	
  20,000.00	
  	
  
	
  30,000.00	
  	
  
	
  40,000.00	
  	
  
	
  50,000.00	
  	
  
	
  60,000.00	
  	
  
	
  70,000.00	
  	
  
	
  80,000.00	
  	
  
Edges	
  
60	
  
80	
  
100	
  
120	
  
140	
  
160	
  
180	
  
VerAces	
  
27
0	
  
0.01	
  
0.02	
  
0.03	
  
0.04	
  
0.05	
  
0.06	
  
0.07	
  
0.08	
  
Density	
  
28
0	
  
10	
  
20	
  
30	
  
40	
  
50	
  
60	
  
70	
  
80	
  
90	
  
100	
  
Clusters	
  
29
ImplicaCons	
  
•  Big	
  Data:	
  
–  Guiding	
  data	
  collecCon	
  with	
  theoreCcally	
  grounded	
  quesCons	
  avoids	
  the	
  
“needle-­‐in-­‐the-­‐haystack”	
  problem	
  
–  Leverage	
  advances	
  in	
  compuCng	
  with	
  exisCng	
  theories	
  to	
  develop	
  robust	
  
studies	
  of	
  social	
  science	
  phenomenon	
  	
  
•  Big	
  Theory:	
  
–  Expanding	
  prior	
  theories	
  on	
  networked	
  organizaConal	
  forms	
  and	
  form	
  
emergence	
  (evoluConary)	
  
–  Building	
  toward	
  a	
  macro	
  theory	
  of	
  organizaConal	
  form	
  emergence	
  based	
  on	
  
resource	
  availability	
  and	
  networks	
  
30
•  Want	
  data?	
  
–  Email	
  me!	
  ma4hew.weber@rutgers.edu	
  
–  ArchiveHub:	
  h4p://archivehub.rutgers.edu	
  
	
  
•  Collaborators	
  
–  Kris	
  Carpenter	
  &	
  Vinay	
  Goel,	
  Internet	
  Archive	
  	
  
–  David	
  Lazer,	
  Northeastern	
  University	
  	
  
	
  
31
Research	
  supported	
  by	
  NSF	
  Award	
  #1244727	
  and	
  the	
  NetSCI	
  Lab	
  @	
  Rutgers	
  

More Related Content

What's hot

Why should semantic technologies pay more attention to privacy... and vice-ve...
Why should semantic technologies pay more attention to privacy... and vice-ve...Why should semantic technologies pay more attention to privacy... and vice-ve...
Why should semantic technologies pay more attention to privacy... and vice-ve...
Mathieu d'Aquin
 
Linked Data: opportunities and challenges
Linked Data: opportunities and challengesLinked Data: opportunities and challenges
Linked Data: opportunities and challenges
Michael Hausenblas
 
Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3SMCFrance
 
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Big Data Spain
 
Search, Exploration and Analytics of Evolving Data
Search, Exploration and Analytics of Evolving DataSearch, Exploration and Analytics of Evolving Data
Search, Exploration and Analytics of Evolving Data
Nattiya Kanhabua
 
Noshir Contractor's view on the future of Linked Data
Noshir Contractor's view on the future of Linked DataNoshir Contractor's view on the future of Linked Data
Noshir Contractor's view on the future of Linked Data
Carlos Pedrinaci
 
Research Life Cycle for GeoData 2014
Research Life Cycle for GeoData 2014Research Life Cycle for GeoData 2014
Research Life Cycle for GeoData 2014
Carly Strasser
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, Potsdam
Platforma Otwartej Nauki
 
Keystone summer school 2015 paolo-missier-provenance
Keystone summer school 2015 paolo-missier-provenanceKeystone summer school 2015 paolo-missier-provenance
Keystone summer school 2015 paolo-missier-provenance
Paolo Missier
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
Enrico Daga
 
Data Science and Urban Science @ UW
Data Science and Urban Science @ UWData Science and Urban Science @ UW
Data Science and Urban Science @ UW
University of Washington
 
Strategic Network Formation in a Location-Based Social Network
Strategic Network Formation in a Location-Based Social NetworkStrategic Network Formation in a Location-Based Social Network
Strategic Network Formation in a Location-Based Social Network
Gene Moo Lee
 
Political Transformations in Network Societies - the fifth estate
Political Transformations in Network Societies - the fifth estatePolitical Transformations in Network Societies - the fifth estate
Political Transformations in Network Societies - the fifth estate
Oxford Martin Centre, OII, and Computer Science at the University of Oxford
 
Science Big, Science Connected
Science Big, Science ConnectedScience Big, Science Connected
Science Big, Science Connected
Deepak Singh
 
Open Government Data on the Web - A Semantic Approach
Open Government Data on the Web - A Semantic ApproachOpen Government Data on the Web - A Semantic Approach
Open Government Data on the Web - A Semantic Approach
Peter Krantz
 
Emerging Institutional Paradigms for the Digital Commons
Emerging Institutional Paradigms for the  Digital CommonsEmerging Institutional Paradigms for the  Digital Commons
Emerging Institutional Paradigms for the Digital Commons
Bob Chao
 
Elements of AI Luxembourg - session 5
Elements of AI Luxembourg - session 5Elements of AI Luxembourg - session 5
Elements of AI Luxembourg - session 5
Jeremie Dauphin
 
Di d dlf_handout
Di d dlf_handoutDi d dlf_handout
Di d dlf_handout
cwilliford
 
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...
Micah Altman
 
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
Anna De Liddo
 

What's hot (20)

Why should semantic technologies pay more attention to privacy... and vice-ve...
Why should semantic technologies pay more attention to privacy... and vice-ve...Why should semantic technologies pay more attention to privacy... and vice-ve...
Why should semantic technologies pay more attention to privacy... and vice-ve...
 
Linked Data: opportunities and challenges
Linked Data: opportunities and challengesLinked Data: opportunities and challenges
Linked Data: opportunities and challenges
 
Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3
 
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
 
Search, Exploration and Analytics of Evolving Data
Search, Exploration and Analytics of Evolving DataSearch, Exploration and Analytics of Evolving Data
Search, Exploration and Analytics of Evolving Data
 
Noshir Contractor's view on the future of Linked Data
Noshir Contractor's view on the future of Linked DataNoshir Contractor's view on the future of Linked Data
Noshir Contractor's view on the future of Linked Data
 
Research Life Cycle for GeoData 2014
Research Life Cycle for GeoData 2014Research Life Cycle for GeoData 2014
Research Life Cycle for GeoData 2014
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, Potsdam
 
Keystone summer school 2015 paolo-missier-provenance
Keystone summer school 2015 paolo-missier-provenanceKeystone summer school 2015 paolo-missier-provenance
Keystone summer school 2015 paolo-missier-provenance
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
 
Data Science and Urban Science @ UW
Data Science and Urban Science @ UWData Science and Urban Science @ UW
Data Science and Urban Science @ UW
 
Strategic Network Formation in a Location-Based Social Network
Strategic Network Formation in a Location-Based Social NetworkStrategic Network Formation in a Location-Based Social Network
Strategic Network Formation in a Location-Based Social Network
 
Political Transformations in Network Societies - the fifth estate
Political Transformations in Network Societies - the fifth estatePolitical Transformations in Network Societies - the fifth estate
Political Transformations in Network Societies - the fifth estate
 
Science Big, Science Connected
Science Big, Science ConnectedScience Big, Science Connected
Science Big, Science Connected
 
Open Government Data on the Web - A Semantic Approach
Open Government Data on the Web - A Semantic ApproachOpen Government Data on the Web - A Semantic Approach
Open Government Data on the Web - A Semantic Approach
 
Emerging Institutional Paradigms for the Digital Commons
Emerging Institutional Paradigms for the  Digital CommonsEmerging Institutional Paradigms for the  Digital Commons
Emerging Institutional Paradigms for the Digital Commons
 
Elements of AI Luxembourg - session 5
Elements of AI Luxembourg - session 5Elements of AI Luxembourg - session 5
Elements of AI Luxembourg - session 5
 
Di d dlf_handout
Di d dlf_handoutDi d dlf_handout
Di d dlf_handout
 
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...
 
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
 

Viewers also liked

Moving from small science to big science: Social and organizational impedimen...
Moving from small science to big science: Social and organizational impedimen...Moving from small science to big science: Social and organizational impedimen...
Moving from small science to big science: Social and organizational impedimen...
Eric Meyer
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Jisc
 
Accessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeAccessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science Knowledge
Josh Cowls
 
Search Tasks, Proactive Search & Digital Assistants
Search Tasks, Proactive Search & Digital AssistantsSearch Tasks, Proactive Search & Digital Assistants
Search Tasks, Proactive Search & Digital Assistants
Rishabh Mehrotra
 
Designing the search experience by tyler tate : twigkit
Designing the search experience by tyler tate : twigkitDesigning the search experience by tyler tate : twigkit
Designing the search experience by tyler tate : twigkitTyler Tate
 
Social impacts information technology
Social impacts information technologySocial impacts information technology
Social impacts information technologyRimple Darra
 

Viewers also liked (6)

Moving from small science to big science: Social and organizational impedimen...
Moving from small science to big science: Social and organizational impedimen...Moving from small science to big science: Social and organizational impedimen...
Moving from small science to big science: Social and organizational impedimen...
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
 
Accessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeAccessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science Knowledge
 
Search Tasks, Proactive Search & Digital Assistants
Search Tasks, Proactive Search & Digital AssistantsSearch Tasks, Proactive Search & Digital Assistants
Search Tasks, Proactive Search & Digital Assistants
 
Designing the search experience by tyler tate : twigkit
Designing the search experience by tyler tate : twigkitDesigning the search experience by tyler tate : twigkit
Designing the search experience by tyler tate : twigkit
 
Social impacts information technology
Social impacts information technologySocial impacts information technology
Social impacts information technology
 

Similar to Internet Archives and Social Science Research - Yeungnam University

Ralph schroeder and eric meyer
Ralph schroeder and eric meyerRalph schroeder and eric meyer
Ralph schroeder and eric meyeroiisdp
 
Mapping big data science
Mapping big data scienceMapping big data science
Mapping big data science
Han Woo PARK
 
Sci 2011 big_data(30_may13)2nd revised _ loet
Sci 2011 big_data(30_may13)2nd revised _ loetSci 2011 big_data(30_may13)2nd revised _ loet
Sci 2011 big_data(30_may13)2nd revised _ loetHan Woo PARK
 
Decomposing Social and Semantic Networks in Emerging “Big Data” Research
Decomposing Social and Semantic Networks in Emerging “Big Data” ResearchDecomposing Social and Semantic Networks in Emerging “Big Data” Research
Decomposing Social and Semantic Networks in Emerging “Big Data” Research
Han Woo PARK
 
The End(s) of e-Research
The End(s) of e-ResearchThe End(s) of e-Research
The End(s) of e-Research
Eric Meyer
 
Data Science in 2016: Moving Up
Data Science in 2016: Moving UpData Science in 2016: Moving Up
Data Science in 2016: Moving Up
Paco Nathan
 
Web Observatories and e-Research
Web Observatories and e-ResearchWeb Observatories and e-Research
Web Observatories and e-Research
David De Roure
 
Webometrics Revisited in Big Data Age_DISC2013
Webometrics Revisited in Big Data Age_DISC2013Webometrics Revisited in Big Data Age_DISC2013
Webometrics Revisited in Big Data Age_DISC2013
Han Woo PARK
 
Data are the new black : Susan Robbins
Data are the new black : Susan RobbinsData are the new black : Susan Robbins
Data are the new black : Susan Robbins
therese nolan-brown
 
DREaM Event 2: Louise Cooke
DREaM Event 2: Louise CookeDREaM Event 2: Louise Cooke
VU University Amsterdam - The Social Web 2016 - Lecture 6
VU University Amsterdam - The Social Web 2016 - Lecture 6VU University Amsterdam - The Social Web 2016 - Lecture 6
VU University Amsterdam - The Social Web 2016 - Lecture 6
Davide Ceolin
 
Scholarship in the Digital Age
Scholarship in the Digital AgeScholarship in the Digital Age
Scholarship in the Digital Age
Eric Meyer
 
Promise of web science
Promise of web sciencePromise of web science
Promise of web science
Aastha Madaan
 
Studying Cybercrime: Raising Awareness of Objectivity & Bias
Studying Cybercrime: Raising Awareness of Objectivity & BiasStudying Cybercrime: Raising Awareness of Objectivity & Bias
Studying Cybercrime: Raising Awareness of Objectivity & Bias
gloriakt
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?Anna Fensel
 
My Dissertation Defense
My Dissertation Defense My Dissertation Defense
My Dissertation Defense
Laura Pasquini
 
The Blossoming of the Semantic Web
The Blossoming of the Semantic WebThe Blossoming of the Semantic Web
The Blossoming of the Semantic Web
American Art Collaborative
 
LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...
LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...
LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...
PrattSILS
 
2014_WWW_BTOR
2014_WWW_BTOR2014_WWW_BTOR
2014_WWW_BTOR
Dongpo Deng
 
Tfsc disc 2014 si proposal (30 june2014)
Tfsc disc 2014 si proposal (30 june2014)Tfsc disc 2014 si proposal (30 june2014)
Tfsc disc 2014 si proposal (30 june2014)
Han Woo PARK
 

Similar to Internet Archives and Social Science Research - Yeungnam University (20)

Ralph schroeder and eric meyer
Ralph schroeder and eric meyerRalph schroeder and eric meyer
Ralph schroeder and eric meyer
 
Mapping big data science
Mapping big data scienceMapping big data science
Mapping big data science
 
Sci 2011 big_data(30_may13)2nd revised _ loet
Sci 2011 big_data(30_may13)2nd revised _ loetSci 2011 big_data(30_may13)2nd revised _ loet
Sci 2011 big_data(30_may13)2nd revised _ loet
 
Decomposing Social and Semantic Networks in Emerging “Big Data” Research
Decomposing Social and Semantic Networks in Emerging “Big Data” ResearchDecomposing Social and Semantic Networks in Emerging “Big Data” Research
Decomposing Social and Semantic Networks in Emerging “Big Data” Research
 
The End(s) of e-Research
The End(s) of e-ResearchThe End(s) of e-Research
The End(s) of e-Research
 
Data Science in 2016: Moving Up
Data Science in 2016: Moving UpData Science in 2016: Moving Up
Data Science in 2016: Moving Up
 
Web Observatories and e-Research
Web Observatories and e-ResearchWeb Observatories and e-Research
Web Observatories and e-Research
 
Webometrics Revisited in Big Data Age_DISC2013
Webometrics Revisited in Big Data Age_DISC2013Webometrics Revisited in Big Data Age_DISC2013
Webometrics Revisited in Big Data Age_DISC2013
 
Data are the new black : Susan Robbins
Data are the new black : Susan RobbinsData are the new black : Susan Robbins
Data are the new black : Susan Robbins
 
DREaM Event 2: Louise Cooke
DREaM Event 2: Louise CookeDREaM Event 2: Louise Cooke
DREaM Event 2: Louise Cooke
 
VU University Amsterdam - The Social Web 2016 - Lecture 6
VU University Amsterdam - The Social Web 2016 - Lecture 6VU University Amsterdam - The Social Web 2016 - Lecture 6
VU University Amsterdam - The Social Web 2016 - Lecture 6
 
Scholarship in the Digital Age
Scholarship in the Digital AgeScholarship in the Digital Age
Scholarship in the Digital Age
 
Promise of web science
Promise of web sciencePromise of web science
Promise of web science
 
Studying Cybercrime: Raising Awareness of Objectivity & Bias
Studying Cybercrime: Raising Awareness of Objectivity & BiasStudying Cybercrime: Raising Awareness of Objectivity & Bias
Studying Cybercrime: Raising Awareness of Objectivity & Bias
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?
 
My Dissertation Defense
My Dissertation Defense My Dissertation Defense
My Dissertation Defense
 
The Blossoming of the Semantic Web
The Blossoming of the Semantic WebThe Blossoming of the Semantic Web
The Blossoming of the Semantic Web
 
LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...
LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...
LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...
 
2014_WWW_BTOR
2014_WWW_BTOR2014_WWW_BTOR
2014_WWW_BTOR
 
Tfsc disc 2014 si proposal (30 june2014)
Tfsc disc 2014 si proposal (30 june2014)Tfsc disc 2014 si proposal (30 june2014)
Tfsc disc 2014 si proposal (30 june2014)
 

More from mwe400

050817 geomedia news networks
050817 geomedia news networks050817 geomedia news networks
050817 geomedia news networks
mwe400
 
022217 ia hackathon presentation
022217 ia  hackathon presentation022217 ia  hackathon presentation
022217 ia hackathon presentation
mwe400
 
062016 jcdl media networks upload
062016 jcdl media networks upload062016 jcdl media networks upload
062016 jcdl media networks upload
mwe400
 
Web Archives and Data Challenges - Archives Unleashed
Web Archives and Data Challenges - Archives UnleashedWeb Archives and Data Challenges - Archives Unleashed
Web Archives and Data Challenges - Archives Unleashed
mwe400
 
Big Data? Big Issues: Degradation in Longitudinal Data and Implications for ...
Big Data? Big Issues:  Degradation in Longitudinal Data and Implications for ...Big Data? Big Issues:  Degradation in Longitudinal Data and Implications for ...
Big Data? Big Issues: Degradation in Longitudinal Data and Implications for ...
mwe400
 
Internet Archives as a Tool for Research: Decay in Large Scale Archival Records
Internet Archives as a Tool for Research: Decay in Large Scale Archival RecordsInternet Archives as a Tool for Research: Decay in Large Scale Archival Records
Internet Archives as a Tool for Research: Decay in Large Scale Archival Records
mwe400
 
Immutable Technology and the Breakdown of Organizational Change.
Immutable Technology and the Breakdown of Organizational Change.Immutable Technology and the Breakdown of Organizational Change.
Immutable Technology and the Breakdown of Organizational Change.
mwe400
 
From Big Data to Big Theory: Lessons Learned from Archival Internet Research.
From Big Data to Big Theory: Lessons Learned from Archival Internet Research.From Big Data to Big Theory: Lessons Learned from Archival Internet Research.
From Big Data to Big Theory: Lessons Learned from Archival Internet Research.
mwe400
 
032415 marketing 101 watershed upload
032415 marketing 101   watershed upload032415 marketing 101   watershed upload
032415 marketing 101 watershed upload
mwe400
 
AEJMC 2014 - Big Data and Education
AEJMC 2014 - Big Data and EducationAEJMC 2014 - Big Data and Education
AEJMC 2014 - Big Data and Education
mwe400
 
AEJMC 2014 - Online News and Linking
AEJMC 2014 - Online News and LinkingAEJMC 2014 - Online News and Linking
AEJMC 2014 - Online News and Linking
mwe400
 

More from mwe400 (11)

050817 geomedia news networks
050817 geomedia news networks050817 geomedia news networks
050817 geomedia news networks
 
022217 ia hackathon presentation
022217 ia  hackathon presentation022217 ia  hackathon presentation
022217 ia hackathon presentation
 
062016 jcdl media networks upload
062016 jcdl media networks upload062016 jcdl media networks upload
062016 jcdl media networks upload
 
Web Archives and Data Challenges - Archives Unleashed
Web Archives and Data Challenges - Archives UnleashedWeb Archives and Data Challenges - Archives Unleashed
Web Archives and Data Challenges - Archives Unleashed
 
Big Data? Big Issues: Degradation in Longitudinal Data and Implications for ...
Big Data? Big Issues:  Degradation in Longitudinal Data and Implications for ...Big Data? Big Issues:  Degradation in Longitudinal Data and Implications for ...
Big Data? Big Issues: Degradation in Longitudinal Data and Implications for ...
 
Internet Archives as a Tool for Research: Decay in Large Scale Archival Records
Internet Archives as a Tool for Research: Decay in Large Scale Archival RecordsInternet Archives as a Tool for Research: Decay in Large Scale Archival Records
Internet Archives as a Tool for Research: Decay in Large Scale Archival Records
 
Immutable Technology and the Breakdown of Organizational Change.
Immutable Technology and the Breakdown of Organizational Change.Immutable Technology and the Breakdown of Organizational Change.
Immutable Technology and the Breakdown of Organizational Change.
 
From Big Data to Big Theory: Lessons Learned from Archival Internet Research.
From Big Data to Big Theory: Lessons Learned from Archival Internet Research.From Big Data to Big Theory: Lessons Learned from Archival Internet Research.
From Big Data to Big Theory: Lessons Learned from Archival Internet Research.
 
032415 marketing 101 watershed upload
032415 marketing 101   watershed upload032415 marketing 101   watershed upload
032415 marketing 101 watershed upload
 
AEJMC 2014 - Big Data and Education
AEJMC 2014 - Big Data and EducationAEJMC 2014 - Big Data and Education
AEJMC 2014 - Big Data and Education
 
AEJMC 2014 - Online News and Linking
AEJMC 2014 - Online News and LinkingAEJMC 2014 - Online News and Linking
AEJMC 2014 - Online News and Linking
 

Recently uploaded

一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
mzpolocfi
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 

Recently uploaded (20)

一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 

Internet Archives and Social Science Research - Yeungnam University

  • 1. BIG  DATA  AND  SOCIAL  SCIENCE  THEORY   Leveraging  Large  Scale  Data  to  Discover   New  Pa4erns  in  Society   Monday,  April  7,  2014   CybermoCons  @  Korea   Yeungnam  University     Ma4hew  Weber   Rutgers  University   School  of  CommunicaCon  &  InformaCon  
  • 2. 2 Opportunity:  The  Internet  Archive  contains  the  largest   single  record  of  the  history  of  the  World  Wide  Web  from   1995  to  the  present—a  wealth  of  untapped  research  data.     Challenge:  There  is  a  significant  lack  of  research-­‐ready   databases  and  tools  available  to  the  scholarly  community  
  • 5. 5
  • 6. 6
  • 7. 7
  • 8. 8
  • 9. 9
  • 10. 10 Opportunity:  The  ArchiveHub  project  aims  to  support  the   creaCon  and  disseminaCon  of  general  guidelines  &  tools  for   conducCng  theoreCcally  and  methodologically  rigorous   longitudinal  research  using  archival  Web  data    
  • 11. 11
  • 12. 12
  • 13. 13
  • 14. 14 Dataset   Research  PotenAal   Dates   Captures   Unique  URLs   Hurricane  Katrina   Online  networks  and  organizaConal   resilience  (Chewning,  Lai  and  Doerfel,   2012;  Perry,  Taylor  and  Doerfel,  2003)  in   the  wake  of  disasters;  informaCon   disseminaCon     2003  –  2012   1,694,236   663,740     Superstorm   Sandy   2003  –  2012   41,703,112   20,013,455   US  Senate   Study  the  growth  of  poliCcal  acCvity  in   online  environments  (Adamic  &  Glance,   2005;  Bruns,  2007;  Chang  &  Park,  2012);   polarizaCon  &  media  discourse   109th  –  112th   Congresses   26,965,770    8,674,397     US  House   51,840,777   12,410,014   Occupy  Wall   Street   Previous  research  on  NGOs  in  the  online   environment  (Bach  &  Stark,  2004;   Shumate,  2003,  2012;  Shumate,  Fulk,  &   Monge,  2005);  use  of  hyperlink  data  to   study  the  formaCon  and  role  of  alliances   between  SMOs   2010  –  2012   247,928,272   11,3259,655   US  Media   Previous  studies  of  news  media   organizaCons  (Greer  &  Mensing,  2006;   Weber,  2012;  Weber  &  Monge,  In   Press);  focus  on  evoluConary  pa4erns   2008  –  2012   1,315,132,555   539,184,823  
  • 16. 16
  • 17. Tracing  the  Emergence  of  OrganizaConal  Forms   17 Environment:     OrganizaCons  compete  for  scare  resources;  during  rapid  periods  of   disrupCon,  new  entrants  seek  “protected”  niches  (Weber  &  Monge  2014) PopulaAon:     In  digital  spaces,  online  connecCons  provide  communicaCve  representaCons  of   informaCon  flows  (Weber  &  Monge,  2012)     FormaCon  of  Ces  (e.g.  hyperlinks)  can  posiCvely  impact  long-­‐term  likelihood  of   organizaCon  survival  (Weber,  2012)   OrganizaAon:     OrganizaCons  adapt  internally,  reconfiguring  team  structures  and   developing  new  rouCnes  for  knowledge  sharing     (Ellison,  Gibbs  &  Weber,  In  Press;  Weber  &  Kim,  Under  Review)
  • 18. 18
  • 19. Big Data… Big Theory?   •  Networks  are  central  to  social  movements  in  that  links  between   nodes  can  be  influenCal  in  collecCve  acCon   •  Examples  of  nodes  includes  parCcipants,  organizaCons,  media  and   communicaCons  technologies     •  Social  networks  and  social  movements  (Diani,  2003)     •  The  interacCon  between  actors,  and  between  actors  and  hashtags,   collecCvely  represent  a  networked  form  of  organizaCon     •  Network  form  of  organizaCon  (Powell,  1990)  
  • 20. Over time, dyadic communication will become prevalent in an emerging networked organization.H1:   As a social movement develops as an emerging network form of organization, the organizational structure will be increasingly clustered. H2:  
  • 21. Data   •  TriangulaCon  of  data  insulates  against  false  readings  from  large-­‐scale  data   (see  Lazer,  Kennedy,  King  and  Vespignani,  2014)   •  Internet  Archive:   –  14  websites;  4,504  hyperlink  dyads  over  a  2-­‐month  period.   •  Lexis  Nexis:   –  Search  conducted  to  assess  U.S.  newspaper  coverage  of  OWS  from  the  early  stages  of  the   movement  in  September  2011  through  Sept.  2012   –  Search  OWS  keywords,  e.g.  “Occupy  Wall  Street,”  “Occupy  Oakland”   •  Twi4er   –  Gnip  PowerTrack     •  Search  by  keywords;  captures  a  larger  volume  of  Twi4er  data  than  other  opCons     –  Sample  includes  October  17,  2011,  through  January  5,  2012.  IniCal  study  focused  on  the   criCcal  two-­‐month  period  from  November  1  through  December  31,  2011,     –  750,816  tweets  across  the  two-­‐month  period.     21
  • 22.
  • 24. OWS  on  the  Web   •  335  seed  organizaCons  based  on  records  from  #OccupyResearch   •  Data  extracted  for  2011  &  2012,  based  on  “both  matching”   24 0   2   4   6   8   10   12   14   16   18   Millions   Captures  per  Month  
  • 25. Maximal  Cores  (k  Coreness)   25 Aug.  2011   Jan.  2012  
  • 26. 26  -­‐          10,000.00      20,000.00      30,000.00      40,000.00      50,000.00      60,000.00      70,000.00      80,000.00     Edges   60   80   100   120   140   160   180   VerAces  
  • 27. 27 0   0.01   0.02   0.03   0.04   0.05   0.06   0.07   0.08   Density  
  • 28. 28 0   10   20   30   40   50   60   70   80   90   100   Clusters  
  • 29. 29
  • 30. ImplicaCons   •  Big  Data:   –  Guiding  data  collecCon  with  theoreCcally  grounded  quesCons  avoids  the   “needle-­‐in-­‐the-­‐haystack”  problem   –  Leverage  advances  in  compuCng  with  exisCng  theories  to  develop  robust   studies  of  social  science  phenomenon     •  Big  Theory:   –  Expanding  prior  theories  on  networked  organizaConal  forms  and  form   emergence  (evoluConary)   –  Building  toward  a  macro  theory  of  organizaConal  form  emergence  based  on   resource  availability  and  networks   30
  • 31. •  Want  data?   –  Email  me!  ma4hew.weber@rutgers.edu   –  ArchiveHub:  h4p://archivehub.rutgers.edu     •  Collaborators   –  Kris  Carpenter  &  Vinay  Goel,  Internet  Archive     –  David  Lazer,  Northeastern  University       31 Research  supported  by  NSF  Award  #1244727  and  the  NetSCI  Lab  @  Rutgers