SlideShare a Scribd company logo
BigData Meets the Federal Data Center:Practical Solutions for Wicked Problems Abe Usher, CCHP, CISSP – Chief Technology Officer abe.usher@thehumangeo.com
whoami ,[object Object]
Former Google Engineer
Former officer USA, USAF
Cloudera Certified Hadoop ProfessionalFor the past three years, I’ve been focused on BigData analytics for special operations
Outline Warm up ,[object Object]
Challenges
Architectures & Patterns
Solutions
Action plan for decision makers
Homework for data engineersFun stuff
What is Cloud? Cloud Computing defined: “Delivery of computing as a service”
Big Trends Michael Driscoll http://radar.oreilly.com/2011/08/building-data-startups.html
More Trends BigData is now cool (not just geeky) There is an explosion of open source technology for BigData Available cloud technologies are significantly changing our society
Common Challenges Federal organizations face declining IT budgets Legacy systems not engineered for BigData Creating value from data is hard
Possible Solutions Federal organizations face declining IT budgets ,[object Object],Legacy systems not engineered for BigData ,[object Object],Creating value from data is hard ,[object Object],[object Object]
Where are we?   Where do we want to go? Data processing “State of the Art” A Better (elusive) Future “We have great intentions, but It is a big mess.” “Don’t look behind the curtain.” “The right tool for the right problem.” “Outsource/eliminate things outside  of core competencies.”
Pattern 1: Outsource Infrastructure & Apps “The Enterprise” “The Cloud” Just-in-time Servers Email & Calendar Travel Coordination
Pattern 2:  Consolidate Data and Analyze It “Future Enterprise” “The Enterprise Today” Redis MongoDB 1. Incrementally adopt BigData tools as you evolve your Enterprise 2. Maintain parallel capabilities if necessary Hadoop
Vignette 1 Tame massive streaming data  in 5 minutes or less.
Recipe1: MongoDB tames Twitter Directions Combine ingredients on laptop Run cURL command to grab Twitter sprinkler Pipe data into Mongo & ES Search & Analyze Ingredients ,[object Object]
cURL utility
MongoDB
ElasticSearch (optional)
Laptop,[object Object]
Recipe1: MongoDB tames Twitter Why MongoDB: Incredibly easy to setup Fast data inserts (> 20,000 per second or 1,728,000,000 per day) Horizontal scaling as data grows Pluggable compression with Snappy http://bit.ly/ggIWWN Get the code! http://bit.ly/humangeo_twitterpipe
Vignette 2 Ask Google to  solve your problems

More Related Content

What's hot

Talk in Google fest 2013
Talk in Google fest 2013Talk in Google fest 2013
Talk in Google fest 2013
David Chen
 
Data Engineering Efficiency @ Netflix - Strata 2017
Data Engineering Efficiency @ Netflix - Strata 2017Data Engineering Efficiency @ Netflix - Strata 2017
Data Engineering Efficiency @ Netflix - Strata 2017
Michelle Ufford
 
How to get your engineers to care about the AWS Bill
How to get your engineers to care about the AWS BillHow to get your engineers to care about the AWS Bill
How to get your engineers to care about the AWS Bill
Gil Zellner
 
Is big data dead?
Is big data dead?Is big data dead?
Is big data dead?
Alexander Alten
 
Data Visualization for the Web - How to Get Started
Data Visualization for the Web - How to Get StartedData Visualization for the Web - How to Get Started
Data Visualization for the Web - How to Get Started
Christopher Conlan
 
Report: EDA of TV shows & movies available on Netflix
Report: EDA of TV shows & movies available on NetflixReport: EDA of TV shows & movies available on Netflix
Report: EDA of TV shows & movies available on Netflix
AnkitBirla5
 
Top 5 Deep Learning and AI Stories - November 3, 2017
Top 5 Deep Learning and AI Stories - November 3, 2017Top 5 Deep Learning and AI Stories - November 3, 2017
Top 5 Deep Learning and AI Stories - November 3, 2017
NVIDIA
 
Massive data twitter and semantic analysis in Reador.NET project
Massive data twitter and semantic analysis in Reador.NET projectMassive data twitter and semantic analysis in Reador.NET project
Massive data twitter and semantic analysis in Reador.NET project
descl
 
The IoT Transformation and What it Means to You - Nir Dobovizky
The IoT Transformation and What it Means to You - Nir DobovizkyThe IoT Transformation and What it Means to You - Nir Dobovizky
The IoT Transformation and What it Means to You - Nir Dobovizky
CodeValue
 
Big Data
Big DataBig Data
How I built a ml human hybrid workflow using computer vision - Amir Shitrit
How I built a ml human hybrid workflow using computer vision - Amir ShitritHow I built a ml human hybrid workflow using computer vision - Amir Shitrit
How I built a ml human hybrid workflow using computer vision - Amir Shitrit
CodeValue
 
Big Data LDN 2017: Deep Learning Demystified
Big Data LDN 2017: Deep Learning DemystifiedBig Data LDN 2017: Deep Learning Demystified
Big Data LDN 2017: Deep Learning Demystified
Matt Stubbs
 
Switching From Web Development to Data Science
Switching From Web Development to Data ScienceSwitching From Web Development to Data Science
Switching From Web Development to Data Science
Karlijn Willems
 
Biq query devfest2017_slides
Biq query devfest2017_slidesBiq query devfest2017_slides
Biq query devfest2017_slides
getdinesh
 
The lessons learned from handling billions of events
The lessons learned from handling billions of eventsThe lessons learned from handling billions of events
The lessons learned from handling billions of events
Ning Zhou
 
Manuel Martinez - Structured Data @ Search London SEO Meetup
Manuel Martinez - Structured Data @ Search London SEO MeetupManuel Martinez - Structured Data @ Search London SEO Meetup
Manuel Martinez - Structured Data @ Search London SEO Meetup
Manuel Martinez
 
Making Sense of Graph Databases
Making Sense of Graph DatabasesMaking Sense of Graph Databases
Making Sense of Graph Databases
InfiniteGraph
 
Data Engineering and the Data Science Lifecycle
Data Engineering and the Data Science LifecycleData Engineering and the Data Science Lifecycle
Data Engineering and the Data Science Lifecycle
Adam Doyle
 
Strata Conference NYC 2013
Strata Conference NYC 2013Strata Conference NYC 2013
Strata Conference NYC 2013
Taewook Eom
 
GDG Varna - Hadoop
GDG Varna - HadoopGDG Varna - Hadoop
GDG Varna - Hadoop
Dimitar Danailov
 

What's hot (20)

Talk in Google fest 2013
Talk in Google fest 2013Talk in Google fest 2013
Talk in Google fest 2013
 
Data Engineering Efficiency @ Netflix - Strata 2017
Data Engineering Efficiency @ Netflix - Strata 2017Data Engineering Efficiency @ Netflix - Strata 2017
Data Engineering Efficiency @ Netflix - Strata 2017
 
How to get your engineers to care about the AWS Bill
How to get your engineers to care about the AWS BillHow to get your engineers to care about the AWS Bill
How to get your engineers to care about the AWS Bill
 
Is big data dead?
Is big data dead?Is big data dead?
Is big data dead?
 
Data Visualization for the Web - How to Get Started
Data Visualization for the Web - How to Get StartedData Visualization for the Web - How to Get Started
Data Visualization for the Web - How to Get Started
 
Report: EDA of TV shows & movies available on Netflix
Report: EDA of TV shows & movies available on NetflixReport: EDA of TV shows & movies available on Netflix
Report: EDA of TV shows & movies available on Netflix
 
Top 5 Deep Learning and AI Stories - November 3, 2017
Top 5 Deep Learning and AI Stories - November 3, 2017Top 5 Deep Learning and AI Stories - November 3, 2017
Top 5 Deep Learning and AI Stories - November 3, 2017
 
Massive data twitter and semantic analysis in Reador.NET project
Massive data twitter and semantic analysis in Reador.NET projectMassive data twitter and semantic analysis in Reador.NET project
Massive data twitter and semantic analysis in Reador.NET project
 
The IoT Transformation and What it Means to You - Nir Dobovizky
The IoT Transformation and What it Means to You - Nir DobovizkyThe IoT Transformation and What it Means to You - Nir Dobovizky
The IoT Transformation and What it Means to You - Nir Dobovizky
 
Big Data
Big DataBig Data
Big Data
 
How I built a ml human hybrid workflow using computer vision - Amir Shitrit
How I built a ml human hybrid workflow using computer vision - Amir ShitritHow I built a ml human hybrid workflow using computer vision - Amir Shitrit
How I built a ml human hybrid workflow using computer vision - Amir Shitrit
 
Big Data LDN 2017: Deep Learning Demystified
Big Data LDN 2017: Deep Learning DemystifiedBig Data LDN 2017: Deep Learning Demystified
Big Data LDN 2017: Deep Learning Demystified
 
Switching From Web Development to Data Science
Switching From Web Development to Data ScienceSwitching From Web Development to Data Science
Switching From Web Development to Data Science
 
Biq query devfest2017_slides
Biq query devfest2017_slidesBiq query devfest2017_slides
Biq query devfest2017_slides
 
The lessons learned from handling billions of events
The lessons learned from handling billions of eventsThe lessons learned from handling billions of events
The lessons learned from handling billions of events
 
Manuel Martinez - Structured Data @ Search London SEO Meetup
Manuel Martinez - Structured Data @ Search London SEO MeetupManuel Martinez - Structured Data @ Search London SEO Meetup
Manuel Martinez - Structured Data @ Search London SEO Meetup
 
Making Sense of Graph Databases
Making Sense of Graph DatabasesMaking Sense of Graph Databases
Making Sense of Graph Databases
 
Data Engineering and the Data Science Lifecycle
Data Engineering and the Data Science LifecycleData Engineering and the Data Science Lifecycle
Data Engineering and the Data Science Lifecycle
 
Strata Conference NYC 2013
Strata Conference NYC 2013Strata Conference NYC 2013
Strata Conference NYC 2013
 
GDG Varna - Hadoop
GDG Varna - HadoopGDG Varna - Hadoop
GDG Varna - Hadoop
 

Viewers also liked

Anticipatory Intelligence
Anticipatory IntelligenceAnticipatory Intelligence
Anticipatory Intelligence
Abe Usher
 
Taming Social Media with MongoDB
Taming Social Media with MongoDBTaming Social Media with MongoDB
Taming Social Media with MongoDB
HumanGeo Group
 
2012 humangeo smn
2012 humangeo smn2012 humangeo smn
2012 humangeo smn
Abe Usher
 
Heatmaps are the Heat
Heatmaps are the HeatHeatmaps are the Heat
Heatmaps are the Heat
Abe Usher
 
Crowds, Computers, and Coordinates
Crowds, Computers, and CoordinatesCrowds, Computers, and Coordinates
Crowds, Computers, and Coordinates
Abe Usher
 
Taming Social Media
Taming Social MediaTaming Social Media
Taming Social Media
HumanGeo Group
 
Big Data and the Social Sciences
Big Data and the Social SciencesBig Data and the Social Sciences
Big Data and the Social Sciences
Abe Usher
 
Advanced Web-Based Geospatial Visualization using Leaflet
Advanced Web-Based Geospatial Visualization using Leaflet Advanced Web-Based Geospatial Visualization using Leaflet
Advanced Web-Based Geospatial Visualization using Leaflet
HumanGeo Group
 
NoSQL Type, Bigdata, and Analytics
NoSQL Type, Bigdata, and AnalyticsNoSQL Type, Bigdata, and Analytics
NoSQL Type, Bigdata, and Analytics
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
The Future Of Big Data
The Future Of Big DataThe Future Of Big Data
The Future Of Big Data
Matthew Dennis
 
Daily Newsletter: 10th March, 2011
Daily Newsletter: 10th March, 2011Daily Newsletter: 10th March, 2011
Daily Newsletter: 10th March, 2011
Fullerton Securities
 
Video Marketing Project for ESL students
Video Marketing Project for ESL studentsVideo Marketing Project for ESL students
Video Marketing Project for ESL students
Debbie Anholt
 
Как UX-специалист делился своими инструментами с agile-командами
Как UX-специалист делился своими инструментами с agile-командамиКак UX-специалист делился своими инструментами с agile-командами
Как UX-специалист делился своими инструментами с agile-командами
Nikita Efimov
 
Музыка в красках
Музыка в краскахМузыка в красках
Музыка в красках
VasilevaAl
 
3 d pie chart circular puzzle with hole in center process 9 stages style 2 po...
3 d pie chart circular puzzle with hole in center process 9 stages style 2 po...3 d pie chart circular puzzle with hole in center process 9 stages style 2 po...
3 d pie chart circular puzzle with hole in center process 9 stages style 2 po...
SlideTeam.net
 
Responsive design and Drupal, case Costume.fi
Responsive design and Drupal, case Costume.fiResponsive design and Drupal, case Costume.fi
Responsive design and Drupal, case Costume.fi
Exove
 
La potestad tributaria
La potestad tributariaLa potestad tributaria
La potestad tributaria
César Suárez
 
Google App Engine
Google App EngineGoogle App Engine
The most common pitfalls metrics
The most common pitfalls metricsThe most common pitfalls metrics
The most common pitfalls metrics
AKAMIS
 
007 model
007 model007 model
007 model
Evolve Beyond
 

Viewers also liked (20)

Anticipatory Intelligence
Anticipatory IntelligenceAnticipatory Intelligence
Anticipatory Intelligence
 
Taming Social Media with MongoDB
Taming Social Media with MongoDBTaming Social Media with MongoDB
Taming Social Media with MongoDB
 
2012 humangeo smn
2012 humangeo smn2012 humangeo smn
2012 humangeo smn
 
Heatmaps are the Heat
Heatmaps are the HeatHeatmaps are the Heat
Heatmaps are the Heat
 
Crowds, Computers, and Coordinates
Crowds, Computers, and CoordinatesCrowds, Computers, and Coordinates
Crowds, Computers, and Coordinates
 
Taming Social Media
Taming Social MediaTaming Social Media
Taming Social Media
 
Big Data and the Social Sciences
Big Data and the Social SciencesBig Data and the Social Sciences
Big Data and the Social Sciences
 
Advanced Web-Based Geospatial Visualization using Leaflet
Advanced Web-Based Geospatial Visualization using Leaflet Advanced Web-Based Geospatial Visualization using Leaflet
Advanced Web-Based Geospatial Visualization using Leaflet
 
NoSQL Type, Bigdata, and Analytics
NoSQL Type, Bigdata, and AnalyticsNoSQL Type, Bigdata, and Analytics
NoSQL Type, Bigdata, and Analytics
 
The Future Of Big Data
The Future Of Big DataThe Future Of Big Data
The Future Of Big Data
 
Daily Newsletter: 10th March, 2011
Daily Newsletter: 10th March, 2011Daily Newsletter: 10th March, 2011
Daily Newsletter: 10th March, 2011
 
Video Marketing Project for ESL students
Video Marketing Project for ESL studentsVideo Marketing Project for ESL students
Video Marketing Project for ESL students
 
Как UX-специалист делился своими инструментами с agile-командами
Как UX-специалист делился своими инструментами с agile-командамиКак UX-специалист делился своими инструментами с agile-командами
Как UX-специалист делился своими инструментами с agile-командами
 
Музыка в красках
Музыка в краскахМузыка в красках
Музыка в красках
 
3 d pie chart circular puzzle with hole in center process 9 stages style 2 po...
3 d pie chart circular puzzle with hole in center process 9 stages style 2 po...3 d pie chart circular puzzle with hole in center process 9 stages style 2 po...
3 d pie chart circular puzzle with hole in center process 9 stages style 2 po...
 
Responsive design and Drupal, case Costume.fi
Responsive design and Drupal, case Costume.fiResponsive design and Drupal, case Costume.fi
Responsive design and Drupal, case Costume.fi
 
La potestad tributaria
La potestad tributariaLa potestad tributaria
La potestad tributaria
 
Google App Engine
Google App EngineGoogle App Engine
Google App Engine
 
The most common pitfalls metrics
The most common pitfalls metricsThe most common pitfalls metrics
The most common pitfalls metrics
 
007 model
007 model007 model
007 model
 

Similar to BigData Meets the Federal Data Center

How Cloud is Affecting Data Scientists
How Cloud is Affecting Data Scientists How Cloud is Affecting Data Scientists
How Cloud is Affecting Data Scientists
CCG
 
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Chris Jang
 
Hot Technologies of 2013: Investigative Analytics
Hot Technologies of 2013: Investigative AnalyticsHot Technologies of 2013: Investigative Analytics
Hot Technologies of 2013: Investigative Analytics
Inside Analysis
 
How Google Does Big Data - DevNexus 2014
How Google Does Big Data - DevNexus 2014How Google Does Big Data - DevNexus 2014
How Google Does Big Data - DevNexus 2014
James Chittenden
 
HadoopWorkshopJuly2014
HadoopWorkshopJuly2014HadoopWorkshopJuly2014
HadoopWorkshopJuly2014
Dieter De Witte
 
Big Data LDN 2018: HOW AUTOMATION CAN ACCELERATE THE DELIVERY OF MACHINE LEAR...
Big Data LDN 2018: HOW AUTOMATION CAN ACCELERATE THE DELIVERY OF MACHINE LEAR...Big Data LDN 2018: HOW AUTOMATION CAN ACCELERATE THE DELIVERY OF MACHINE LEAR...
Big Data LDN 2018: HOW AUTOMATION CAN ACCELERATE THE DELIVERY OF MACHINE LEAR...
Matt Stubbs
 
Structured Data & Schema.org - SMX Milan 2014
Structured Data & Schema.org - SMX Milan 2014Structured Data & Schema.org - SMX Milan 2014
Structured Data & Schema.org - SMX Milan 2014
Bastian Grimm
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
Big Data User Group Karlsruhe/Stuttgart
 
Cloud as a Data Platform
Cloud as a Data PlatformCloud as a Data Platform
Cloud as a Data Platform
Andrei Savu
 
RightScale Roadtrip Boston: Accelerate to Cloud
RightScale Roadtrip Boston: Accelerate to CloudRightScale Roadtrip Boston: Accelerate to Cloud
RightScale Roadtrip Boston: Accelerate to Cloud
RightScale
 
Google Developers Summit Tokyo - Google Cloud Platform で知る Google クラウドの「Googl...
Google Developers Summit Tokyo - Google Cloud Platform で知る Google クラウドの「Googl...Google Developers Summit Tokyo - Google Cloud Platform で知る Google クラウドの「Googl...
Google Developers Summit Tokyo - Google Cloud Platform で知る Google クラウドの「Googl...
Google Cloud Platform - Japan
 
What is the future of data strategy?
What is the future of data strategy?What is the future of data strategy?
What is the future of data strategy?
Denodo
 
IoT NY - Google Cloud Services for IoT
IoT NY - Google Cloud Services for IoTIoT NY - Google Cloud Services for IoT
IoT NY - Google Cloud Services for IoT
James Chittenden
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
tsigitnist02
 
Course 8 : How to start your big data project by Eric Rodriguez
Course 8 : How to start your big data project by Eric Rodriguez Course 8 : How to start your big data project by Eric Rodriguez
Course 8 : How to start your big data project by Eric Rodriguez
Betacowork
 
Improve your Tech Quotient
Improve your Tech QuotientImprove your Tech Quotient
Improve your Tech Quotient
Tarence DSouza
 
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalDataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
Harvinder Atwal
 
Building your data driven business with Reactive Marketing Technology
Building your data driven business with Reactive Marketing TechnologyBuilding your data driven business with Reactive Marketing Technology
Building your data driven business with Reactive Marketing Technology
Trieu Nguyen
 
DataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data Architecture
DATAVERSITY
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
mark madsen
 

Similar to BigData Meets the Federal Data Center (20)

How Cloud is Affecting Data Scientists
How Cloud is Affecting Data Scientists How Cloud is Affecting Data Scientists
How Cloud is Affecting Data Scientists
 
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
 
Hot Technologies of 2013: Investigative Analytics
Hot Technologies of 2013: Investigative AnalyticsHot Technologies of 2013: Investigative Analytics
Hot Technologies of 2013: Investigative Analytics
 
How Google Does Big Data - DevNexus 2014
How Google Does Big Data - DevNexus 2014How Google Does Big Data - DevNexus 2014
How Google Does Big Data - DevNexus 2014
 
HadoopWorkshopJuly2014
HadoopWorkshopJuly2014HadoopWorkshopJuly2014
HadoopWorkshopJuly2014
 
Big Data LDN 2018: HOW AUTOMATION CAN ACCELERATE THE DELIVERY OF MACHINE LEAR...
Big Data LDN 2018: HOW AUTOMATION CAN ACCELERATE THE DELIVERY OF MACHINE LEAR...Big Data LDN 2018: HOW AUTOMATION CAN ACCELERATE THE DELIVERY OF MACHINE LEAR...
Big Data LDN 2018: HOW AUTOMATION CAN ACCELERATE THE DELIVERY OF MACHINE LEAR...
 
Structured Data & Schema.org - SMX Milan 2014
Structured Data & Schema.org - SMX Milan 2014Structured Data & Schema.org - SMX Milan 2014
Structured Data & Schema.org - SMX Milan 2014
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
 
Cloud as a Data Platform
Cloud as a Data PlatformCloud as a Data Platform
Cloud as a Data Platform
 
RightScale Roadtrip Boston: Accelerate to Cloud
RightScale Roadtrip Boston: Accelerate to CloudRightScale Roadtrip Boston: Accelerate to Cloud
RightScale Roadtrip Boston: Accelerate to Cloud
 
Google Developers Summit Tokyo - Google Cloud Platform で知る Google クラウドの「Googl...
Google Developers Summit Tokyo - Google Cloud Platform で知る Google クラウドの「Googl...Google Developers Summit Tokyo - Google Cloud Platform で知る Google クラウドの「Googl...
Google Developers Summit Tokyo - Google Cloud Platform で知る Google クラウドの「Googl...
 
What is the future of data strategy?
What is the future of data strategy?What is the future of data strategy?
What is the future of data strategy?
 
IoT NY - Google Cloud Services for IoT
IoT NY - Google Cloud Services for IoTIoT NY - Google Cloud Services for IoT
IoT NY - Google Cloud Services for IoT
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
 
Course 8 : How to start your big data project by Eric Rodriguez
Course 8 : How to start your big data project by Eric Rodriguez Course 8 : How to start your big data project by Eric Rodriguez
Course 8 : How to start your big data project by Eric Rodriguez
 
Improve your Tech Quotient
Improve your Tech QuotientImprove your Tech Quotient
Improve your Tech Quotient
 
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalDataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
 
Building your data driven business with Reactive Marketing Technology
Building your data driven business with Reactive Marketing TechnologyBuilding your data driven business with Reactive Marketing Technology
Building your data driven business with Reactive Marketing Technology
 
DataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data Architecture
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
 

Recently uploaded

June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
AstuteBusiness
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
Neo4j
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
Fwdays
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
BibashShahi
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
Alex Pruden
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
DianaGray10
 

Recently uploaded (20)

June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
 

BigData Meets the Federal Data Center

  • 1. BigData Meets the Federal Data Center:Practical Solutions for Wicked Problems Abe Usher, CCHP, CISSP – Chief Technology Officer abe.usher@thehumangeo.com
  • 2.
  • 5. Cloudera Certified Hadoop ProfessionalFor the past three years, I’ve been focused on BigData analytics for special operations
  • 6.
  • 10. Action plan for decision makers
  • 11. Homework for data engineersFun stuff
  • 12. What is Cloud? Cloud Computing defined: “Delivery of computing as a service”
  • 13. Big Trends Michael Driscoll http://radar.oreilly.com/2011/08/building-data-startups.html
  • 14. More Trends BigData is now cool (not just geeky) There is an explosion of open source technology for BigData Available cloud technologies are significantly changing our society
  • 15. Common Challenges Federal organizations face declining IT budgets Legacy systems not engineered for BigData Creating value from data is hard
  • 16.
  • 17. Where are we? Where do we want to go? Data processing “State of the Art” A Better (elusive) Future “We have great intentions, but It is a big mess.” “Don’t look behind the curtain.” “The right tool for the right problem.” “Outsource/eliminate things outside of core competencies.”
  • 18. Pattern 1: Outsource Infrastructure & Apps “The Enterprise” “The Cloud” Just-in-time Servers Email & Calendar Travel Coordination
  • 19. Pattern 2: Consolidate Data and Analyze It “Future Enterprise” “The Enterprise Today” Redis MongoDB 1. Incrementally adopt BigData tools as you evolve your Enterprise 2. Maintain parallel capabilities if necessary Hadoop
  • 20. Vignette 1 Tame massive streaming data in 5 minutes or less.
  • 21.
  • 25.
  • 26. Recipe1: MongoDB tames Twitter Why MongoDB: Incredibly easy to setup Fast data inserts (> 20,000 per second or 1,728,000,000 per day) Horizontal scaling as data grows Pluggable compression with Snappy http://bit.ly/ggIWWN Get the code! http://bit.ly/humangeo_twitterpipe
  • 27. Vignette 2 Ask Google to solve your problems
  • 29.
  • 31. Raw data in multiple languages
  • 32.
  • 34.
  • 36. Recommendation system (e.g. Netflix, Amazon)
  • 38. Document / email classification
  • 41. Predict driver behavior and optimize vehicle control systems**Between the Google Prediction API and our own research, we are discovering ways to make information work for the driver and help deliver optimal vehicle performance. –Ryan McGee, Technical Expert, Ford Research and Innovation *http://code.google.com/apis/predict/ ** http://www.google.com/enterprise/cloud/index.html
  • 42. Action Plan for Decision Makers Experiment with Cloud-sourcing: http://bit.ly/cuRUCr Inventory your data and your systems Join a Meetup to get informed http://bit.ly/neNCRq Take a risk*
  • 43. Homework for Data Engineers Understand Google MapReduce: http://bit.ly/GZBw Experiment with NoSQL: http://bit.ly/VCpR5 Ask Google to Predict the future: http://bit.ly/dCiOoc Take the cloud for a test drive: http://bit.ly/9c9IYy Try something, fail fast
  • 44. On-line Twitter: @abeusher E-mail: abe.usher@thehumangeo.com Web: http://thehumangeo.com Facebook: http://on.fb.me/nZS87d This presentation: http://bit.ly/humangeo_cloud2011
  • 46.
  • 47. Monitoring trends and opportunities
  • 48. Augmenting and enriching data with influence and sentiment indicatorsTier One special operations intelligence and technology experts with Google experience and agility