BI in the Sky: 
The New Rules of Cloud Analytics
Riding the Cloud Analytics Wave 
A Brief Look at Business Intelligence 
Source: The Waves of BI, Wayne Eckerson 
Source: Data Mart Consulting
Welcome to the New Era of SMACT 
Today’s Agenda 
• Cloud Analytics in Action: Bonobos
• SnapLogic Introduction and Demonstration
• Discussion and Next Steps
David Glueck: Data Science @ Bonobos 
• Founder of the data science and engineering team at Bonobos
– The largest e-commerce-born apparel brand in the US
• Prior to Bonobos, held business intelligence roles at:
– Groupon
– Netflix
– HP
– Knightsbridge Consulting
– Cisco
Bonobos is a clothing brand focused on delivering great fit, excellent customer experience, and a fun approach to menswear. Launched online in 2007 with its signature line of better-fitting men's pants, Bonobos is now the largest apparel brand ever built on the web in the United States.
Bonobos was named “One of America’s Hottest Brands” by Advertising Age, “Best Men’s Pants” by New York Magazine, and one of Inc. Magazine’s “20 Awesome Facebook Pages,” and was awarded Crain’s “Best Places to Work in New York City” in 2011 and 2013.
Background: Our Journey to the Cloud 
• Got to start from scratch
– Company founded in 2007
– Didn’t have to worry about legacy architecture
• Source systems are in the cloud
– Website, email, ticketing service
– Cloud was a natural choice
• Netflix example is a good one
– This is where every company wants to go
Why Cloud Analytics? 
Speed
• Time to get up and running greatly reduced
• 1 month vs. 6 months to 1 year
• Faster return
Flexibility
• The landscape was changing rapidly, and our needs were also changing as we grew
• Didn’t want to buy into a stack and get painted into a corner: switching costs are high
• Started with MySQL, and then came Redshift
Scale
• Integrate new tools and data sources quickly
• Redshift is able to handle the volumes: 10 TB today, expected to grow by an order of magnitude
Data Architecture 
[Architecture diagram: Data Integration Platform, Amazon Redshift, Reporting Platform, Analytics Platform, Real-Time Dashboards]
Results 
• Reduction in headcount to administer and maintain
– Economies of scale: more effective and efficient
• Lowers the barrier to entry for advanced analytics
– Large-scale analytics and advanced algorithms used to require large-scale investment
– Now available at smaller scale using cost-effective shared services
• Insights:
– Visibility into active customer base and lifetime value
– Deeper understanding of return rates and market potential
– Ability to identify relevant, personalized products for marketing messaging
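The insight metrics listed above can be illustrated with a minimal Python sketch. Everything in it is an assumption for illustration: the record layout, the sample data, the 365-day definition of an "active" customer, and the treatment of lifetime value as net historical revenue are not taken from the Bonobos implementation.

```python
from collections import defaultdict
from datetime import date, timedelta

# Hypothetical order records: (customer_id, order_date, amount, returned)
ORDERS = [
    ("c1", date(2014, 1, 5), 120.0, False),
    ("c1", date(2014, 11, 20), 80.0, True),
    ("c2", date(2013, 6, 1), 200.0, False),
    ("c3", date(2014, 10, 15), 50.0, False),
]


def active_customers(orders, as_of, window_days=365):
    """Customers with at least one order in the trailing window (assumed definition)."""
    cutoff = as_of - timedelta(days=window_days)
    return {cid for cid, order_date, _, _ in orders if cutoff < order_date <= as_of}


def lifetime_value(orders):
    """Historical net revenue per customer: sum of amounts for orders not returned."""
    totals = defaultdict(float)
    for cid, _, amount, returned in orders:
        if not returned:
            totals[cid] += amount
    return dict(totals)


def return_rate(orders):
    """Share of orders that were returned (0.0 when there are no orders)."""
    if not orders:
        return 0.0
    return sum(1 for order in orders if order[3]) / len(orders)
```

In a warehouse setting the same aggregations would more likely be expressed as SQL over the order tables; the plain-Python form is only meant to make the definitions concrete.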
A Few Tips 
✔ Pick the high value use case first
• Avoid the big bang
• Pick something that does one thing well and then find a way to adapt
✔ Know what you need to get an A in
• Management reporting and visibility was most important
• Predictive analytics was coming, but not up front
✔ Know what you want to accomplish and why
• Team must align with the business
• Make it clear what each member of the team is doing
• If it’s reporting and BI, know how to do more effective visualization and craft a clear message to the business
• If it’s data science, know how this algorithm is relevant for the business
✔ Think like an investor
• If you’re close to the business, you’ll know what questions to ask
• Become an extended part of the business
Introducing SnapLogic 
“Legacy data integration technologies are slowing down business growth and innovation in the era of social, mobile, analytics, cloud computing and the Internet of Things.”
- Gaurav Dhillon, SnapLogic Founder and CEO
• Experienced Team
– CEO founded and ran Informatica
– Leadership from Informatica, Salesforce, Microsoft
• Board of Directors
– Andreessen Horowitz & Ignition
• CIO Advisory Board
– AstraZeneca, Cisco, Clorox, HP, Yahoo
• Headquarters
– San Mateo, CA
The SnapLogic Elastic Integration Platform 
Single Platform to Connect Data, Apps and APIs 
Today’s Demonstrations 
• SnapLogic for Wave: Salesforce Analytics Cloud
• SnapLogic for Amazon Redshift
• SnapLogic for Big Data Integration
Key Components of the SnapLogic Platform 
1. Integration Cloud: Designer, Manager, Dashboards (multi-tenant cloud service)
2. Snaplex: elastic execution (“Cloudplex”, “Groundplex”, “Hadooplex”)
3. Snaps (buy or build)
SnapLogic Modern Architecture: Elastic Scale 
[Diagram labels: Cloud to On-Prem Snaplex, Hybrid Snaplex, REST, Metadata, Data]
• Streams: No data is stored/cached
• Secure: 100% standards-based
• Elastic: Scales out & handles data and app integration use cases
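The "streams, nothing stored or cached" idea can be sketched with Python generators. This is a generic illustration of streaming record-by-record through a pipeline, not SnapLogic's implementation or API; all function names and the sample records are hypothetical.

```python
from typing import Callable, Dict, Iterable, Iterator, List

Record = Dict[str, str]


def read_source(rows: Iterable[Record]) -> Iterator[Record]:
    """Yield records one at a time instead of materializing the whole source."""
    for row in rows:
        yield row


def transform(records: Iterator[Record], fn: Callable[[Record], Record]) -> Iterator[Record]:
    """Apply fn to each record as it flows through; nothing is buffered here."""
    for record in records:
        yield fn(record)


def write_target(records: Iterator[Record], sink: List[Record]) -> int:
    """Push each record to the target as it arrives and return the count written."""
    count = 0
    for record in records:
        sink.append(record)
        count += 1
    return count


def normalize_email(record: Record) -> Record:
    """Example transformation: trim and lower-case the email field."""
    return {**record, "email": record["email"].strip().lower()}


# Hypothetical source rows and an in-memory list standing in for the target system.
source_rows = [
    {"id": "1", "email": " A@Example.com "},
    {"id": "2", "email": "b@example.COM"},
]
target: List[Record] = []
written = write_target(transform(read_source(source_rows), normalize_email), target)
```

Because each stage pulls one record at a time from the stage before it, the intermediate steps hold at most a single record in memory, which is the property the slide describes.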
SnapReduce and the Hadooplex 
Hadooplex: Native YARN Application
SnapLogic iPaaS + Hadoop
[Diagram labels: MapReduce Generation, YARN, MapReduce, Snaplex Container]
Discussion 
www.SnapLogic.com/Resources 
@SnapLogic 
Facebook/SnapLogic 
Linkedin.com/company/snaplogic_2 
…and subscribe to the blog: 
Snaplogic.com/blog

Webinar: BI in the Sky - The New Rules of Cloud Analytics


Editor's Notes

  • #4 http://www.b-eye-network.com/blogs/eckerson/archives/business_analyt/ http://www.datamart.de/competence/business-intelligence-data-warehouse/Seiten/default.aspx
  • #8 2011: Extended offline, launching Bonobos Guideshops, e-commerce stores that deliver personalized, one-to-one service to those wanting to experience the brand in person. 2012: Expanded distribution partnering with Nordstrom, bringing Bonobos apparel into stores nationwide and to Nordstrom.com. 2013: Launched a second brand, AYR for women.
  • #11 Architecture diagram: SnapLogic pulls data from all of the services, CSVs, APIs and databases and pushes it into all of the BI tools. GoodData is the primary BI tool: easy-to-use web interface (minimal training), easy to maintain. Tableau for deeper business analysts: internal case studies. Real-time reporting: Geckoboard (API calls for quick dashboards). Everything goes into Redshift; GoodData is a subset, more highly aggregated. Use Python to do the data science on Redshift: predictive, product recommendation.
  • #15 The SnapLogic approach is to deliver a single, unified platform that is built for the cloud era and designed to handle multiple styles of integration: application, data and process integration. SnapLogic was founded in 2006 by Gaurav Dhillon, who co-founded Informatica in the early ’90s and ran that company for 12 years. Our management team has deep enterprise and SaaS roots. The board includes representation from our two primary VC partners, Andreessen and Ignition, and we have a world-class CIO advisory board consisting of CIOs from AZ, Cisco, Clorox and Netflix. http://www.snaplogic.com/about-us/leadership
  • #18 Orchestrations, schedules, connections and security details are managed by the cloud-based Designer, Manager and Monitoring Dashboard. The Snaplex streams data between applications and data sources, and can run in the cloud or behind the firewall. We like to say that the SnapLogic Integration Cloud “respects data gravity.” If most of the apps being integrated are in the cloud, why would you want your integration to run behind the firewall? On the other hand, if you’re primarily integrating SaaS apps with on-premises databases and applications like SAP and Oracle, you’ll most likely want to run the Snaplex as close to the data as possible. The Snaplex is a self-upgrading execution grid that streams data between applications, databases, files, social and big data sources. When running in the cloud, the Snaplex is able to scale up and down elastically based on the volume of data being processed or the latency requirements of the integration flow. The Snaplex can also be configured to run behind the firewall for hybrid deployments involving on-premises enterprise applications. The Snaplex allows data and process flows to be triggered based on events or scheduled jobs, called via REST APIs, or invoked programmatically via the SnAPI. No data is stored or cached in the SnapLogic Integration Cloud; it streams data. It is 100% standards-based and elastically scales out to meet your capacity requirements.