Weathering the Data Storm:
How SnapLogic and Amazon Web Services
Deliver Analytics in the Cloud for Earth Networks
Today’s Agenda
Eddie Dingels
Architect, Earth Networks
Erin Curtis
Product Marketing, SnapLogic
•  Amazon Web Services
•  SnapLogic and Redshift
•  Earth Networks: Moving Data
Analytics IntoThe Cloud
•  Discussion
Kyle Lichtenburg
Solutions Architect,Amazon Web
Services
2011
82
159
2012
280
2013
516
2014
AWS’	
  Rapid	
  Pace	
  of	
  Innova3on	
  
AWS	
  has	
  launched	
  a	
  total	
  of	
  1,515	
  new	
  features	
  and/or	
  services	
  since	
  incep:on	
  in	
  2006.	
  	
  
2015
+342*
* As of July 9, 2015
More	
  Func:onality	
  Than	
  Any	
  Other	
  
Infrastructure	
  Provider	
  
It’s	
  never	
  been	
  easier	
  and	
  less	
  
expensive	
  to	
  collect,	
  store,	
  analyze	
  
&	
  share	
  data	
  
Companies	
  will	
  use	
  data	
  more	
  
expansively	
  than	
  at	
  any	
  other	
  
point	
  in	
  history	
  
Fully	
  Loaded	
  for	
  Big	
  Data	
  
•  Sources	
  of	
  Truth	
   •  High	
  Performance	
  
Databases	
  
•  Analysis	
  PlaKorms	
  
Amazon S3
Amazon Glacier
Amazon EFS
Amazon DynamoDB
Amazon Aurora
Amazon Redshift
Amazon Kinesis
Amazon EMR
Amazon	
  Simple	
  Storage	
  Service	
  (S3)	
  
• Storage	
  for	
  the	
  Internet	
  	
  
• Store	
  and	
  retrieve	
  any	
  amount	
  of	
  data,	
  at	
  any	
  
:me,	
  from	
  anywhere	
  on	
  the	
  web	
  
• Highly	
  scalable,	
  reliable,	
  and	
  secure	
  
• Supports	
  encryp:on	
  
• Pay	
  only	
  for	
  what	
  you	
  use	
  
Amazon	
  DynamoDB	
  
• Fast,	
  fully-­‐managed	
  NoSQL	
  Database	
  Service	
  
• Capable	
  of	
  handling	
  any	
  amount	
  of	
  data	
  
• Durable	
  and	
  Highly	
  Available	
  
• All	
  SSD	
  storage	
  
• Simple	
  and	
  Cost	
  Effec:ve	
  
Amazon	
  RedshiX	
  
• Fast,	
  simple,	
  fully-­‐managed	
  petabyte-­‐scale	
  data	
  
warehousing	
  
• Online	
  and	
  func:onal	
  in	
  minutes	
  
• SQL	
  based	
  
• Con:nuous	
  backup	
  
• Less	
  than	
  $1,000/TB/Year	
  
• ODBC/JDBC	
  Compliant	
  
Connect Faster
Unified Platform for Data,Apps,Things
Our unified platform significantly speeds up enterprise data access
everywhere.
– Gaurav Dhillon, co-founder and CEO, SnapLogic
Why SnapLogic Elastic Integration?
Unified Platform Productive User Experience
Modern Architecture Connected: 300+ Snaps
Productive: UX for Citizen and Advanced Users
We can do more in two hours with
SnapLogic than we could in two days
with traditional solutions.
•  Integration Cloud: Design, Admin, Monitoring
•  Drag, Drop, Connect: HTML5 interface built for speed
Modern Architecture: Hybrid and Elastic
Streams: No data is
stored/cached
Secure: 100%
standards-based
Elastic: Scales out &
handles data and app
integration use cases
Metadata
Data
Databases Enterprise Systems Hadoop
Modern Architecture: Real-Time and Batch
Ultra Pipelines SnapReduce and the Hadooplex
Map Reduce
Certified YARN Execution
Connected: 300+ Snaps
We look at SnapLogic as an opportunity to
think differently about integration.
SnapLogic Integration for Amazon Redshift
Customers:
Free, hosted trial of SnapLogic + Redshift:
www.snaplogic.com/redshift-trial
The Redshift Snap helps customers rapidly transfer data into and out of
Amazon Redshift from multiple sources
•  Rapidly connect Redshift to database services
•  Quickly load data into an Amazon S3 bucket and kick off the Redshift
import process in a single step
•  Easily replicate source tables into their Amazon Redshift clusters and
detect daily changes to keep data synchronized
•  Take advantage of core REST and SOAP connectivity
Edward Dingels
7/22/2015
AWS & SnapLogic
Company
•  Weather Networks
•  Schools/Education
•  Consumer
•  Alerting
•  Environmental Network
•  Energy
7/22/2015
Operational Environment
•  Weather
•  Dynamic
•  Local
•  Users
•  Engaged
•  Proximity
•  Spikes/Peak Periods
7/22/2015 22
Data Center
•  Scale
•  Weather intersecting users
•  Increase rapidly
•  Capacity
•  Hardware
•  Software
7/22/2015 23
Data Center
•  Mobile pushed us to the limit
•  Demanding performance
•  Feature releases delays due to capacity planning
The data center became a limiting factor in a
space where technology should enable
7/22/2015 24
AWS EC2
•  EC2
•  Dynamic capacity
•  Automatic capacity
•  SQL
•  Data tier
•  Horizontal scale
7/22/2015 25
SQL Data Store
API Tier
Ingest
AWS Storage
•  Blob
•  S3
•  Transactional
•  Dynamo
•  RDS
•  Warehouse
•  Redshift
7/22/2015 26
Cloud Storage
API Tier
Ingest
AWS ETL
•  Storage was great
•  ETL limiting
•  SQS
•  Kinesis
•  EMR
•  Data Pipeline
•  How do we move data between different
cloud data stores effectively?
7/22/2015 27
Cloud Integration
•  Needed a new tool set
•  Criteria
•  Data stayed in our VPC
•  Repeatable building blocks
•  Cloud data stores are 1st tier citizens
•  Horizontal scale
•  Performance
•  Price
7/22/2015 28
Project – Data Ingest
•  Network of Networks
•  Challenge – Providers
•  Formats
•  Delivery
•  Standardization
•  Solution – Pipeline Per
Provider
7/22/2015 29
Partner
A
Partner
B
Pipeline
A
Pipeline
B
Cloud Data Stores
Project – Data Analysis
•  Deriving KPIs for BI
•  Challenge – Domains
•  Unique domain
•  Different storage technologies
•  Varied timeliness requirements
•  Solution – Pipeline Per
Domain
7/22/2015 30
Redshift
BI Toolset
Pipeline
Domain
Operational – Automated Database Tasks
•  Storage Limitation
•  Challenge – Automation
•  Redshift
•  MSSQL RDS
•  Solution – Scheduled
pipeline
7/22/2015 31
Scheduled
Pipeline
Cloud Data Stores
AWS + snapLogic
•  AWS is the platform
•  SnapLogic is the glue
Drives faster implementations with repeatable
patterns for more business value
7/22/2015
Thank you
Questions?
See SnapLogic in action:
Contact us: info@snaplogic.com
http://video.snaplogic.com/
@SnapLogic
@awscloud
@EarthNetworks

Weathering the Data Storm – How SnapLogic and AWS Deliver Analytics in the Cloud for Earth Networks

  • 1.
    Weathering the DataStorm: How SnapLogic and Amazon Web Services Deliver Analytics in the Cloud for Earth Networks
  • 2.
    Today’s Agenda Eddie Dingels Architect,Earth Networks Erin Curtis Product Marketing, SnapLogic •  Amazon Web Services •  SnapLogic and Redshift •  Earth Networks: Moving Data Analytics IntoThe Cloud •  Discussion Kyle Lichtenburg Solutions Architect,Amazon Web Services
  • 4.
    2011 82 159 2012 280 2013 516 2014 AWS’  Rapid  Pace  of  Innova3on   AWS  has  launched  a  total  of  1,515  new  features  and/or  services  since  incep:on  in  2006.     2015 +342* * As of July 9, 2015
  • 5.
    More  Func:onality  Than  Any  Other   Infrastructure  Provider  
  • 6.
    It’s  never  been  easier  and  less   expensive  to  collect,  store,  analyze   &  share  data  
  • 7.
    Companies  will  use  data  more   expansively  than  at  any  other   point  in  history  
  • 8.
    Fully  Loaded  for  Big  Data   •  Sources  of  Truth   •  High  Performance   Databases   •  Analysis  PlaKorms   Amazon S3 Amazon Glacier Amazon EFS Amazon DynamoDB Amazon Aurora Amazon Redshift Amazon Kinesis Amazon EMR
  • 9.
    Amazon  Simple  Storage  Service  (S3)   • Storage  for  the  Internet     • Store  and  retrieve  any  amount  of  data,  at  any   :me,  from  anywhere  on  the  web   • Highly  scalable,  reliable,  and  secure   • Supports  encryp:on   • Pay  only  for  what  you  use  
  • 10.
    Amazon  DynamoDB   • Fast,  fully-­‐managed  NoSQL  Database  Service   • Capable  of  handling  any  amount  of  data   • Durable  and  Highly  Available   • All  SSD  storage   • Simple  and  Cost  Effec:ve  
  • 11.
    Amazon  RedshiX   • Fast,  simple,  fully-­‐managed  petabyte-­‐scale  data   warehousing   • Online  and  func:onal  in  minutes   • SQL  based   • Con:nuous  backup   • Less  than  $1,000/TB/Year   • ODBC/JDBC  Compliant  
  • 12.
  • 13.
    Unified Platform forData,Apps,Things Our unified platform significantly speeds up enterprise data access everywhere. – Gaurav Dhillon, co-founder and CEO, SnapLogic
  • 14.
    Why SnapLogic ElasticIntegration? Unified Platform Productive User Experience Modern Architecture Connected: 300+ Snaps
  • 15.
    Productive: UX forCitizen and Advanced Users We can do more in two hours with SnapLogic than we could in two days with traditional solutions. •  Integration Cloud: Design, Admin, Monitoring •  Drag, Drop, Connect: HTML5 interface built for speed
  • 16.
    Modern Architecture: Hybridand Elastic Streams: No data is stored/cached Secure: 100% standards-based Elastic: Scales out & handles data and app integration use cases Metadata Data Databases Enterprise Systems Hadoop
  • 17.
    Modern Architecture: Real-Timeand Batch Ultra Pipelines SnapReduce and the Hadooplex Map Reduce Certified YARN Execution
  • 18.
    Connected: 300+ Snaps Welook at SnapLogic as an opportunity to think differently about integration.
  • 19.
    SnapLogic Integration forAmazon Redshift Customers: Free, hosted trial of SnapLogic + Redshift: www.snaplogic.com/redshift-trial The Redshift Snap helps customers rapidly transfer data into and out of Amazon Redshift from multiple sources •  Rapidly connect Redshift to database services •  Quickly load data into an Amazon S3 bucket and kick off the Redshift import process in a single step •  Easily replicate source tables into their Amazon Redshift clusters and detect daily changes to keep data synchronized •  Take advantage of core REST and SOAP connectivity
  • 20.
  • 21.
    Company •  Weather Networks • Schools/Education •  Consumer •  Alerting •  Environmental Network •  Energy 7/22/2015
  • 22.
    Operational Environment •  Weather • Dynamic •  Local •  Users •  Engaged •  Proximity •  Spikes/Peak Periods 7/22/2015 22
  • 23.
    Data Center •  Scale • Weather intersecting users •  Increase rapidly •  Capacity •  Hardware •  Software 7/22/2015 23
  • 24.
    Data Center •  Mobilepushed us to the limit •  Demanding performance •  Feature releases delays due to capacity planning The data center became a limiting factor in a space where technology should enable 7/22/2015 24
  • 25.
    AWS EC2 •  EC2 • Dynamic capacity •  Automatic capacity •  SQL •  Data tier •  Horizontal scale 7/22/2015 25 SQL Data Store API Tier Ingest
  • 26.
    AWS Storage •  Blob • S3 •  Transactional •  Dynamo •  RDS •  Warehouse •  Redshift 7/22/2015 26 Cloud Storage API Tier Ingest
  • 27.
    AWS ETL •  Storagewas great •  ETL limiting •  SQS •  Kinesis •  EMR •  Data Pipeline •  How do we move data between different cloud data stores effectively? 7/22/2015 27
  • 28.
    Cloud Integration •  Neededa new tool set •  Criteria •  Data stayed in our VPC •  Repeatable building blocks •  Cloud data stores are 1st tier citizens •  Horizontal scale •  Performance •  Price 7/22/2015 28
  • 29.
    Project – DataIngest •  Network of Networks •  Challenge – Providers •  Formats •  Delivery •  Standardization •  Solution – Pipeline Per Provider 7/22/2015 29 Partner A Partner B Pipeline A Pipeline B Cloud Data Stores
  • 30.
    Project – DataAnalysis •  Deriving KPIs for BI •  Challenge – Domains •  Unique domain •  Different storage technologies •  Varied timeliness requirements •  Solution – Pipeline Per Domain 7/22/2015 30 Redshift BI Toolset Pipeline Domain
  • 31.
    Operational – AutomatedDatabase Tasks •  Storage Limitation •  Challenge – Automation •  Redshift •  MSSQL RDS •  Solution – Scheduled pipeline 7/22/2015 31 Scheduled Pipeline Cloud Data Stores
  • 32.
    AWS + snapLogic • AWS is the platform •  SnapLogic is the glue Drives faster implementations with repeatable patterns for more business value 7/22/2015
  • 33.
    Thank you Questions? See SnapLogicin action: Contact us: info@snaplogic.com http://video.snaplogic.com/ @SnapLogic @awscloud @EarthNetworks