BigData Meets the Federal Data Center

1. BigData Meets the Federal Data Center:Practical Solutions for Wicked Problems Abe Usher, CCHP, CISSP – Chief Technology Officer abe.usher@thehumangeo.com

3. Former Google Engineer

4. Former officer USA, USAF

5. Cloudera Certified Hadoop ProfessionalFor the past three years, I’ve been focused on BigData analytics for special operations

7. Challenges

8. Architectures & Patterns

9. Solutions

10. Action plan for decision makers

11. Homework for data engineersFun stuff

12. What is Cloud? Cloud Computing defined: “Delivery of computing as a service”

13. Big Trends Michael Driscoll http://radar.oreilly.com/2011/08/building-data-startups.html

14. More Trends BigData is now cool (not just geeky) There is an explosion of open source technology for BigData Available cloud technologies are significantly changing our society

15. Common Challenges Federal organizations face declining IT budgets Legacy systems not engineered for BigData Creating value from data is hard

17. Where are we? Where do we want to go? Data processing “State of the Art” A Better (elusive) Future “We have great intentions, but It is a big mess.” “Don’t look behind the curtain.” “The right tool for the right problem.” “Outsource/eliminate things outside of core competencies.”

18. Pattern 1: Outsource Infrastructure & Apps “The Enterprise” “The Cloud” Just-in-time Servers Email & Calendar Travel Coordination

19. Pattern 2: Consolidate Data and Analyze It “Future Enterprise” “The Enterprise Today” Redis MongoDB 1. Incrementally adopt BigData tools as you evolve your Enterprise 2. Maintain parallel capabilities if necessary Hadoop

20. Vignette 1 Tame massive streaming data in 5 minutes or less.

22. cURL utility

23. MongoDB

24. ElasticSearch (optional)

26. Recipe1: MongoDB tames Twitter Why MongoDB: Incredibly easy to setup Fast data inserts (> 20,000 per second or 1,728,000,000 per day) Horizontal scaling as data grows Pluggable compression with Snappy http://bit.ly/ggIWWN Get the code! http://bit.ly/humangeo_twitterpipe

27. Vignette 2 Ask Google to solve your problems

28. Google Prediction API

30. cURL utility

31. Raw data in multiple languages

33. Recipe2: Results

35. Spam detection

36. Recommendation system (e.g. Netflix, Amazon)

37. Customer sentiment analysis

38. Document / email classification

39. Suspicious activity identification

40. Purchase predictions

41. Predict driver behavior and optimize vehicle control systems**Between the Google Prediction API and our own research, we are discovering ways to make information work for the driver and help deliver optimal vehicle performance. –Ryan McGee, Technical Expert, Ford Research and Innovation *http://code.google.com/apis/predict/ ** http://www.google.com/enterprise/cloud/index.html

42. Action Plan for Decision Makers Experiment with Cloud-sourcing: http://bit.ly/cuRUCr Inventory your data and your systems Join a Meetup to get informed http://bit.ly/neNCRq Take a risk*

43. Homework for Data Engineers Understand Google MapReduce: http://bit.ly/GZBw Experiment with NoSQL: http://bit.ly/VCpR5 Ask Google to Predict the future: http://bit.ly/dCiOoc Take the cloud for a test drive: http://bit.ly/9c9IYy Try something, fail fast

44. On-line Twitter: @abeusher E-mail: abe.usher@thehumangeo.com Web: http://thehumangeo.com Facebook: http://on.fb.me/nZS87d This presentation: http://bit.ly/humangeo_cloud2011

45. BACKUP

47. Monitoring trends and opportunities

48. Augmenting and enriching data with influence and sentiment indicatorsTier One special operations intelligence and technology experts with Google experience and agility

BigData Meets the Federal Data Center

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (20)

Similar to BigData Meets the Federal Data Center

Similar to BigData Meets the Federal Data Center (20)

Recently uploaded

Recently uploaded (20)

BigData Meets the Federal Data Center