© 2015 SpaceCurve, Inc. Confidential. | 1!
Spatial Data
Hadoop Ecosystem
SpaceCurve’s Spatial Data Platform
Integrating with Hadoop
•  Largest datasets are geospatial in nature
– Daily generation of petabytes of data
– Most is not used or simply discarded
•  Proliferation of mobile platforms, sensors and
IoT
– More geospatial data will be generated in real-time
•  Typical big data solutions can scale to ingest
and store vast quantities of data
– But these are not designed for real-time,
geospatial data
Devices > People
In 2008, # of internet devices 
exceeded # of people on earth
20 - 50 Billion
Estimated # of connected devices by 2020
80% of all data
has spatial attributes*
90% of all mobile
data is location aware*
*According to Gartner
ü Mobile Platforms
ü Operational Intelligence
ü Sensored World/Digital Business
ü Context Rich Autonomous Systems 
ü Smart Machines/M2M
Source: Gartner Technology Trends 2015
THE WORLD IS A STATIC MAP
Example: Map coordinates of points of interest are cataloged and
described on the Internet.

CAPTURING THE MOTION OF THINGS
Example: Packages carry passive sensors; we can track them on the web
and know where they passed checkpoints.

REMOTE CONTROL OF THINGS
Example: UAVs are used as remote sensing platforms for emergency
response.

THINGS TALK TO EACH OTHER
Example: Aircraft optimize fuel consumption in real-time using data
from internal and external sensor networks.

THINGS BEHAVE INTELLIGENTLY
Example: Large fleets of autonomous vehicles adapt to weather
conditions and traffic congestion.
•  Hadoop’s open source platform has become synonymous 
with big data processing
•  Core ecosystem:
–  Distributed file system for data storage (HDFS)
–  Distributed processing of data at scale (MapReduce)
–  Batch-oriented job execution
•  Hadoop-based solutions excel at:
–  Ingesting and warehousing data from multiple sources
–  Creating and updating analytical dashboards on a weekly, daily or
hourly basis
–  Providing insights from historical data that apply to future
scenarios
•  Hadoop ecosystem can scale to geospatial storage requirements
•  HDFS is not efficient for organizing and analyzing these data models because:
–  Geospatial data does not have a predictable, uniform distribution
–  Hash functions can transform unpredictable, non-uniform
distributions into uniform ones, but they neither preserve nor expose
geospatial biases and relationships efficiently
•  Results:
–  Reduction in parallelism and efficiency of geospatial analysis
–  Inability to implement computational geometry needed for
geospatial analytics
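The locality problem described above can be illustrated with a minimal sketch. It is not SpaceCurve's or Hadoop's implementation: the function names, the sample points, and the choice of a Z-order (Morton) key as the locality-preserving alternative are all assumptions made for illustration. The sketch compares hash partitioning of a small cluster of nearby points with range partitioning on an interleaved-bit key.

```python
import hashlib

def morton_key(lon, lat, bits=16):
    """Interleave quantized lon/lat bits into a Z-order (Morton) key."""
    # Quantize each coordinate to an integer in [0, 2**bits - 1].
    x = int((lon + 180.0) / 360.0 * ((1 << bits) - 1))
    y = int((lat + 90.0) / 180.0 * ((1 << bits) - 1))
    key = 0
    for i in range(bits):
        key |= ((x >> i) & 1) << (2 * i)        # even bit positions: longitude
        key |= ((y >> i) & 1) << (2 * i + 1)    # odd bit positions: latitude
    return key

def hash_partition(lon, lat, n_partitions):
    """Assign a point to a partition by hashing its coordinates."""
    digest = hashlib.md5(f"{lon:.6f},{lat:.6f}".encode()).digest()
    return int.from_bytes(digest[:8], "big") % n_partitions

def range_partition(lon, lat, n_partitions, bits=16):
    """Assign a point to a partition by contiguous ranges of its Morton key."""
    return morton_key(lon, lat, bits) * n_partitions // (1 << (2 * bits))

# Hypothetical cluster of nearby points (lon, lat), invented for this sketch.
CLUSTER = [
    (-122.33, 47.61), (-122.31, 47.60), (-122.35, 47.62), (-122.30, 47.65),
    (-122.36, 47.58), (-122.28, 47.63), (-122.34, 47.66), (-122.32, 47.59),
]

if __name__ == "__main__":
    n = 16
    print("hash :", [hash_partition(lon, lat, n) for lon, lat in CLUSTER])
    print("range:", [range_partition(lon, lat, n) for lon, lat in CLUSTER])
```

Hashing balances load but scatters neighbors across partitions, so a regional query has to touch many of them. Key ranges keep this cluster together, at the cost of uneven partition sizes when the data is skewed.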
CONTINUOUS HIGH-VELOCITY data ingestion rates are far beyond the
limits of traditional spatial analysis platforms.
SPATIAL ANALYTICS required for high-value Internet of Everything 
applications are not supportable on popular big data platforms.
REAL-TIME operational analysis requirements preclude the use 
of batch-oriented platforms.
DATA VOLUME greatly exceeds capacity of platforms designed for real-time
analysis of human-generated sources.
•  SpaceCurve has created the first purpose-built 
platform from the ground up:
–  Designed for organizing multiple streams of very large scale geospatial
data
–  Optimized for analyzing data in real-time
–  Eliminates limitations on geospatial data inherent in other platforms
•  The SpaceCurve platform makes it possible to:
–  Collect and fuse multiple sources of data in real-time and immediately
stream the result to an application
–  Allow continuous queries and analytics to be run with second and
sub-second responses
–  Provide insights from real-time data that can apply to current,
immediate scenarios
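As a rough illustration of the ideas in the list above (fusing several sources and running a continuous query over the result), here is a small sketch in plain Python. It does not use or represent the SpaceCurve API: the `Event` type, the `fuse` and `continuous_bbox_query` functions, and the sample events are all hypothetical. It assumes each input stream is already ordered by timestamp.

```python
import heapq
from typing import Iterable, Iterator, NamedTuple

class Event(NamedTuple):
    ts: float     # event time in seconds
    source: str   # name of the producing feed
    lon: float
    lat: float

def fuse(*streams: Iterable[Event]) -> Iterator[Event]:
    """Lazily merge time-ordered streams into one time-ordered stream."""
    return heapq.merge(*streams, key=lambda e: e.ts)

def continuous_bbox_query(events: Iterable[Event], min_lon: float,
                          min_lat: float, max_lon: float,
                          max_lat: float) -> Iterator[Event]:
    """Yield each event inside the bounding box as soon as it arrives."""
    for e in events:
        if min_lon <= e.lon <= max_lon and min_lat <= e.lat <= max_lat:
            yield e

# Hypothetical feeds, invented for this sketch.
VEHICLES = [Event(1.0, "vehicle", -122.33, 47.61),
            Event(3.0, "vehicle", -122.30, 47.62)]
SENSORS = [Event(2.0, "sensor", -121.00, 46.00),
           Event(4.0, "sensor", -122.35, 47.60)]

if __name__ == "__main__":
    fused = fuse(VEHICLES, SENSORS)
    for hit in continuous_bbox_query(fused, -122.4, 47.5, -122.2, 47.7):
        print(hit)
```

Both functions are generators, so nothing is buffered: a result is produced as each matching event is read, which is the property that separates a continuous query from a batch job over stored data.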
CONTINUOUS HIGH-VELOCITY INGESTION
COMPLEX SPATIAL DATA TYPES AND OPERATIONS
EXTREME DATA VOLUMES
REAL-TIME QUERY EXECUTION AND ANALYSIS
•  Integration at the HDFS layer
•  Enables all current systems and tools to be
utilized in their normal workflows
•  Leverages existing investments and enables
real-time geospatial use cases
•  Build combined workflows that operate in
parallel or where Hadoop components can
call out queries into SpaceCurve
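One way to picture the Hadoop side of such a combined workflow is a map and reduce step over GeoJSON features exported to HDFS. The sketch below is an assumption-laden illustration, not code from the SpaceCurve/hadoop repository: it assumes one GeoJSON Feature per line, a hypothetical `mag` property, and invented sample records, and it runs in a single process rather than on a cluster.

```python
import json
import math
from collections import defaultdict

def make_feature(lon, lat, mag):
    """Build one line-delimited GeoJSON Feature (coordinates are [lon, lat])."""
    return json.dumps({
        "type": "Feature",
        "geometry": {"type": "Point", "coordinates": [lon, lat]},
        "properties": {"mag": mag},  # hypothetical property name
    })

# Invented sample records, not real earthquake data.
SAMPLE_LINES = [
    make_feature(-118.2, 34.1, 3.1),
    make_feature(-118.7, 34.9, 4.2),
    make_feature(-122.4, 37.8, 2.5),
]

def mapper(lines, cell_deg=1.0):
    """Emit (grid cell, magnitude) for every Point feature."""
    for line in lines:
        feature = json.loads(line)
        geom = feature.get("geometry") or {}
        if geom.get("type") != "Point":
            continue  # skip non-point geometries in this sketch
        lon, lat = geom["coordinates"][:2]
        cell = (math.floor(lon / cell_deg), math.floor(lat / cell_deg))
        yield cell, feature["properties"]["mag"]

def reducer(pairs):
    """Group by cell and report the event count and maximum magnitude."""
    grouped = defaultdict(list)
    for cell, mag in pairs:
        grouped[cell].append(mag)
    return {cell: {"count": len(mags), "max_mag": max(mags)}
            for cell, mags in grouped.items()}

if __name__ == "__main__":
    for cell, stats in sorted(reducer(mapper(SAMPLE_LINES)).items()):
        print(cell, stats)
```

On a real cluster the shuffle phase would do the grouping between the two functions; the in-memory dictionary here only stands in for that step.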
•  Additional resources can be found below:
– GitHub – https://github.com/SpaceCurve/hadoop
•  This resource outlines the mechanics of export/import
between SpaceCurve and Hadoop and includes a
step-by-step tutorial using California earthquake data
– SpaceCurveVM – available upon request
•  This resource lets a user install the SpaceCurve system
loaded with sample data and use SpaceCurve SQL to
query the data
[Architecture diagram. Labels: Hadoop Ecosystem (ESRI Tools, HDFS, MapReduce, Hive); GeoJSON; Mapper; Reducer; Hive SQL; SpaceCurve HTTP/JSON]
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 

SpaceCurve - Integrating with Hadoop

  • 1. © 2015 SpaceCurve, Inc. Confidential.
  • 2.
    •  Spatial Data
    •  Hadoop Ecosystem
    •  SpaceCurve’s Spatial Data Platform
    •  Integrating with Hadoop
  • 4.
    •  Largest datasets are geospatial in nature
       – Daily generation of petabytes of data
       – Most is not used or simply discarded
    •  Proliferation of mobile platforms, sensors and IoT
       – More geospatial data will be generated in real-time
    •  Typical big data solutions can scale to ingest and store vast quantities of data
       – But these are not designed for real-time, geospatial data
  • 5. Devices > People
    •  In 2008, # of internet devices exceeded # of people on earth
    •  20 - 50 Billion: estimated # of connected devices by 2020
    •  80% of all data has spatial attributes*
    •  90% of all mobile data is location aware*
    *According to Gartner
  • 6.
    ✓ Mobile Platforms
    ✓ Operational Intelligence
    ✓ Sensored World/Digital Business
    ✓ Context Rich Autonomous Systems
    ✓ Smart Machines/M2M
    Source: Gartner Technology Trends 2015
  • 7. EXAMPLES
    •  THE WORLD IS A STATIC MAP: Map coordinates of points of interest cataloged and described on the Internet.
    •  CAPTURING THE MOTION OF THINGS: Packages have passive sensors; we can track them on the web and know where they passed checkpoints.
    •  REMOTE CONTROL OF THINGS: UAVs used as remote sensing platforms for emergency response.
    •  THINGS TALK TO EACH OTHER: Aircraft optimize fuel consumption in real-time using data from internal and external sensor networks.
    •  THINGS BEHAVE INTELLIGENTLY: Large fleets of autonomous vehicles adapting to weather conditions and traffic congestion.
  • 9.
    •  Hadoop’s open source platform has become synonymous with big data processing
    •  Core ecosystem:
       – Distributed file system for data storage (HDFS)
       – Distributed processing of data at scale (MapReduce)
       – Batch-oriented job execution
    •  Hadoop-based solutions excel at:
       – Ingesting and warehousing multiple sources of data
       – Creating and updating analytical dashboards on a weekly, daily or hourly basis
       – Providing insights from historical data that apply to future scenarios
  • 10.
    •  Hadoop ecosystem can scale to geospatial storage requirements
    •  HDFS is not efficient for organizing and analyzing these data models because:
       – Geospatial data does not have a predictable, uniform distribution
       – Hash functions can transform unpredictable, non-uniform distributions into uniform ones, but they neither preserve nor efficiently expose geospatial biases and relationships
    •  Results:
       – Reduction in parallelism and efficiency of geospatial analysis
       – Inability to implement computational geometry needed for geospatial analytics
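The point about hash functions can be illustrated with a small sketch. It is not taken from the slides: the point IDs, the four-partition setup, and the one-degree grid are all assumptions chosen for illustration, and the grid is only a stand-in for "some spatially aware key", not a description of how SpaceCurve organizes data.

```python
import hashlib
import math
from collections import Counter


def hash_partition(key: str, num_partitions: int) -> int:
    """Assign a record to a partition by hashing its key (deterministic MD5)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % num_partitions


def grid_cell(lon: float, lat: float, cell_deg: float = 1.0) -> tuple:
    """Map a coordinate to the integer index of a fixed-size grid cell."""
    return (math.floor(lon / cell_deg), math.floor(lat / cell_deg))


def make_cluster(n: int = 100) -> list:
    """Hypothetical tight cluster of points (about 0.1 degree across)."""
    return [(f"pt-{i}", -122.5 + 0.001 * i, 37.5 + 0.001 * i) for i in range(n)]


points = make_cluster()

# Hash partitioning on the record ID: neighbors land on unrelated partitions.
by_hash = Counter(hash_partition(pid, 4) for pid, _, _ in points)

# Spatial key: every point in the cluster shares one grid cell.
by_cell = Counter(grid_cell(lon, lat) for _, lon, lat in points)

print("hash partitions used:", sorted(by_hash))
print("grid cells used:", sorted(by_cell))
```

The hash spreads the cluster evenly, which balances load but means a query about one neighborhood has to touch every partition. The spatial key keeps neighbors together, but a dense cluster then overloads a single cell, which is the non-uniform distribution problem named on the slide.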
  • 12.
    •  CONTINUOUS HIGH-VELOCITY data ingestion rates are far beyond the limits of traditional spatial analysis platforms.
    •  SPATIAL ANALYTICS required for high-value Internet of Everything applications are not supportable on popular big data platforms.
    •  REAL-TIME operational analysis requirements preclude the use of batch-oriented platforms.
    •  DATA VOLUME greatly exceeds capacity of platforms designed for real-time analysis of human-generated sources.
  • 13.
    •  SpaceCurve has created the first platform purpose-built from the ground up:
       – Designed for organizing multiple streams of very large scale geospatial data
       – Optimized for analyzing data in real-time
       – Eliminates limitations on geospatial data inherent in other platforms
    •  The SpaceCurve platform makes it possible to:
       – Collect and fuse multiple sources of data in real-time and immediately stream the result to an application
       – Run continuous queries and analytics with second and sub-second responses
       – Provide insights from real-time data that can apply to current, immediate scenarios
  • 14. CONTINUOUS HIGH-VELOCITY INGESTION COMPLEX SPATIAL DATA TYPES OPERATIONS EXTREME DATA VOLUMES REAL-TIME QUERY EXECUTION ANALYSIS
  • 16.
    •  Integration at the HDFS layer
    •  Enables all current systems and tools to be utilized in their normal workflows
    •  Leverages existing investments and enables real-time geospatial use cases
    •  Build combined workflows that operate in parallel, or in which Hadoop components issue queries to SpaceCurve
  • 17.
    •  Additional resources can be found below:
       – GitHub – https://github.com/SpaceCurve/hadoop
          •  This resource outlines the mechanics of export/import between SpaceCurve and Hadoop and includes a step-by-step tutorial using California earthquake data
       – SpaceCurveVM – available upon request
          •  This resource lets a user install the SpaceCurve system loaded with sample data and use SpaceCurve SQL to query the data
  • 18. [Diagram labels: ESRI Tools; HDFS; MapReduce; Hive; GeoJSON; Mapper; Reducer; Hive SQL; SpaceCurve HTTP/JSON; Hadoop Ecosystem]
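To make the GeoJSON, Mapper and Reducer labels more concrete, here is a minimal local sketch of a map/reduce pass over GeoJSON point features, in the style of a Hadoop Streaming job. It is not the code from the SpaceCurve tutorial: the one-feature-per-line layout, the one-degree cell key, and the sample coordinates are all assumptions made for illustration.

```python
import json
import math
from itertools import groupby


def map_feature(line: str, cell_deg: float = 1.0) -> list:
    """Mapper: emit (cell_key, 1) for a line holding one GeoJSON Point Feature."""
    feature = json.loads(line)
    geometry = feature.get("geometry") or {}
    if geometry.get("type") != "Point":
        return []  # skip null geometries and non-point features
    lon, lat = geometry["coordinates"][:2]  # GeoJSON order is [lon, lat]
    key = f"{math.floor(lon / cell_deg)},{math.floor(lat / cell_deg)}"
    return [(key, 1)]


def reduce_counts(pairs):
    """Reducer: sum counts per key; input must be sorted by key, as after a shuffle."""
    for key, group in groupby(pairs, key=lambda kv: kv[0]):
        yield key, sum(count for _, count in group)


def run_local(lines) -> dict:
    """Run map, sort (standing in for the shuffle) and reduce in one process."""
    mapped = [kv for line in lines if line.strip() for kv in map_feature(line)]
    return dict(reduce_counts(sorted(mapped)))


# Hypothetical sample records; coordinates are made up, not real earthquake data.
sample = [
    '{"type": "Feature", "geometry": {"type": "Point", "coordinates": [-122.4, 37.8]}, "properties": {}}',
    '{"type": "Feature", "geometry": {"type": "Point", "coordinates": [-122.1, 37.2]}, "properties": {}}',
    '{"type": "Feature", "geometry": {"type": "Point", "coordinates": [-118.2, 34.1]}, "properties": {}}',
]

print(run_local(sample))
```

In an actual Hadoop Streaming job the mapper and reducer would be separate scripts reading standard input and writing tab-separated key/value lines; the single-process `run_local` helper exists only so the logic can be checked without a cluster.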