2013 05 20 field_directors
Upcoming SlideShare
Loading in...5
×
 

2013 05 20 field_directors

on

  • 564 views

 

Statistics

Views

Total Views
564
Views on SlideShare
563
Embed Views
1

Actions

Likes
0
Downloads
2
Comments
0

1 Embed 1

https://twitter.com 1

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

2013 05 20 field_directors 2013 05 20 field_directors Presentation Transcript

  • Computational Social Science:The Pros and Cons of Big Data’Cliff Lampe- School of Information, University of MichiganMay 20, 2013Monday, May 20, 13
  • Cliff LampeSchool of Information- associate professorSocial media“Socio-technical systems”Primarily a social scientistMonday, May 20, 13
  • Samples of my research in this areaEffects of participation on FacebookInformation cascades on TwitterUser collaboration on WikipediaDiscussion patterns in large-scale news sitesCoordination in massive online gamesInformation seeking on search engines vs. social mediaMonday, May 20, 13 View slide
  • Interactions in social media leave communicationtraces we can mine to understand social processes.These compete with insights from surveys.Monday, May 20, 13 View slide
  • Defining “Big Data”“Big Data” is a rough categorization, a marketingterm, and a paradigm shift.Monday, May 20, 13
  • Why “big data” has becomea big deal...More devices collecting dataMore data born digitalEasier/cheaper to storeBetter processorsNew skills / techniquesInsights have proven effectiveMonday, May 20, 13
  • Monday, May 20, 13
  • “Big Data” started in thephysical sciencesMonday, May 20, 13
  • Big Data is increasingly beingapplied to social science questionsMonday, May 20, 13
  • What counts as “big”?LHC: .001% of sensors leadto 25 petabytes annually.Wikipedia: 17 terabytesTwitter: ~ 10 GB/dayHow many observationsneeded to count as “big”?Monday, May 20, 13
  • ‘Big Data’ require multiple,interlinked skills and tools.Monday, May 20, 13
  • Monday, May 20, 13
  • Challenges in “Big Data”CaptureCurationStorageSearchSharingTransferAnalysisVisualizationMonday, May 20, 13
  • Social Media is often linked withBig Data because it is the likeliestsource for human trace data.Monday, May 20, 13
  • What is social media?Monday, May 20, 13
  • Common characteristicsUser generated contentDirect user-to-user interactionBundles of applicationsMore than Facebook and TwitterMonday, May 20, 13
  • Monday, May 20, 13
  • Monday, May 20, 13
  • Monday, May 20, 13
  • Monday, May 20, 13
  • Monday, May 20, 13
  • Monday, May 20, 13
  • Monday, May 20, 13
  • Monday, May 20, 13
  • Social media cover a widevariety of sites.Monday, May 20, 13
  • Trends in social media useMonday, May 20, 13
  • Monday, May 20, 13
  • Monday, May 20, 13
  • Monday, May 20, 13
  • Social media skillNearly 1 million people join Facebook every weekPeople spend on average 16 hours a month onFacebookThere are about 250 million Tweets per dayPeople upload 3000 pictures to Flickr every minuteWikipedia has 17 million articles by 91,000 editorsYouTube has 490 million unique visitors per monthGoogle + reached 10 million users in 16 daysMonday, May 20, 13
  • The social media landscapeis constantly changing.Monday, May 20, 13
  • People are increasinglyenacting their social lives incomputer mediatedchannels.Monday, May 20, 13
  • Examples of social media /big data social insightsMonday, May 20, 13
  • Monday, May 20, 13
  • Monday, May 20, 13
  • Monday, May 20, 13
  • Monday, May 20, 13
  • Monday, May 20, 13
  • Monday, May 20, 13
  • Social media are being usedto capture big social data.Monday, May 20, 13
  • Obtaining social media trace data“Scraping”APIs(rare) public datasetsPartnershipsMonday, May 20, 13
  • Scraping - using software todownload and store datafrom publicly available site.Monday, May 20, 13
  • Application Performance InterfacesMonday, May 20, 13
  • Monday, May 20, 13
  • Available datasetsMonday, May 20, 13
  • PartnershipsMonday, May 20, 13
  • Monday, May 20, 13
  • Issues with social mediatrace dataAccessRepresenting resultsRepresentativenessValidityCross-channel difficultyAppropriate skill setsEthicsMonday, May 20, 13
  • AccessData often owned byprivate corporations.Need special skills toaccess.Monday, May 20, 13
  • Representing resultsProbability testing breaksdown.Visualization is common,but limited.Training audiences for newdata analysis.Monday, May 20, 13
  • Representativeness*How accurately dosocial media usersrepresent the largerpopulation?How do you rigorouslysample from socialmedia?Monday, May 20, 13
  • ValiditySocial media users areperforming (thoughdon’t know scientistsare observing them)Different sites havedifferent purposes.Monday, May 20, 13
  • Cross-channel useHow do you track oneuser over multiplesocial media sites?Monday, May 20, 13
  • Appropriate researcher skillsetsCombination oftechnical and researchskills are required.New generation of“data scientists”coming now.Monday, May 20, 13
  • EthicsHow can a user optout?More on a panel nextsession...Monday, May 20, 13
  • Challenges in “Big Data”CaptureCurationStorageSearchSharingTransferAnalysisVisualizationMonday, May 20, 13
  • “Pros”of big dataIt’s relatively cheapMassive scale coverssome sinsIt’s inevitable(?)In some cases, itworks.Monday, May 20, 13
  • Insensitive Borg says...Big Data will makesurveys obsolete.Monday, May 20, 13
  • Humble SuggestionsMore interdisciplinary work.Propose and fund work to test these issues.Don’t pretend it isn’t coming OR is a panacea.We’re just at the beginning of the journey.Monday, May 20, 13
  • Social media and surveysprojectCan social media data ever replace and/or supplementsocial measurement, especially for official statistics,based on self-reported answers to questions asked ofa representative sample?Fred Conrad, Michael Schober, Josh PasekMonday, May 20, 13
  • Thanks!Cliff Lampecacl@umich.eduTwitter: @clifflampeSlideshare: clifflampeMonday, May 20, 13