Your SlideShare is downloading. ×
Data Mashups -Data Science Summit
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Data Mashups -Data Science Summit

3,045

Published on

As large datasets come together exciting and unexpected things can happen. Human behavior is high dimensional, so combining many diverse datasets is critical to revealing actionable insights.

As large datasets come together exciting and unexpected things can happen. Human behavior is high dimensional, so combining many diverse datasets is critical to revealing actionable insights.

Published in: Technology
0 Comments
5 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
3,045
On Slideshare
0
From Embeds
0
Number of Embeds
3
Actions
Shares
0
Downloads
41
Comments
0
Likes
5
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Data Mashups May 12, 2011 Data Scientist Summit Turning Data Exhaust into Pete Skomoroch LinkedIn Insights @peteskomoroch
  • 2. We have an explosion of data • DataWrangling • InfoChimps • Data.gov • Factual • SimpleGeo
  • 3. And the tools to make sense of it • Hadoop • NoSQL •R • Python • Mechanical Turk
  • 4. Diverse datasets = better signal
  • 5. Find a meaningful problem • Identify pain points • Work on stuff that matters • Focus on underutilized data http://www.flickr.com/photos/aloshbennett/
  • 6. Trendingtopics.org @hourlytrends
  • 7. LinkedIn Skills
  • 8. The best mashups are actionable • Reveal patterns • Enable predictions • Recommendations
  • 9. Mashup: Skills & Cities
  • 10. Yuba City, California: 21.3% Unemployment
  • 11. Ames, Iowa: 4.7% Unemployment
  • 12. Make data mashups work for you • Open Data = powerful mashups • Mashup > sum of its parts • Focus on meaningful problems • Actionable mashups are better

×