The Infochimps Big Data Cloud
Faster and Smarter Decision-Making

30 days from critical business problems to impactful insight. Our managed Big Data Platform-as-a-Service Cloud, with
proven application developer tools and infrastructure, removes risk, accelerates deployment, and streamlines your Big Data
projects, enabling you to quickly start gaining insights, then scale to more data and use cases as you go.

Key Benefits
     Fast
     It only takes a few hours to deploy a complete solution to a public cloud
     or your private enterprise cloud. This means you can achieve immediate
     insights without sacrificing custom development ability.

     Simple
     It shouldn't take a rocket scientist to tap into the insights Big Data can
     provide. We've created analytic services and application developer
     frameworks that make interacting with Big Data systems much easier by
     letting you use languages already familiar to you.

     Flexible
     Our comprehensive architecture means you can combine real-time, ad-hoc,
     and batch analytics depending on your application needs. You can
     also start your system at the size that's right for you, and grow it over time
     to additional data and use cases as your business evolves.

     Enterprise Ready
     We reduce risk with the stability of our managed platform, our firm stance
     on data security, and our compatibility with many public, private, and
     hybrid cloud environments.

     [Diagram: Critical Business Problems + big data cloud = Impactful Business Insights]

Big Data Drivers
     •  The proliferation of data capture and creation technologies
     •  Increased "interconnectedness" drives consumption (creating more data)
     •  Inexpensive storage makes it possible to keep more, longer
     •  Innovative software and analysis tools turn data into information

     [Diagram: More Content, More Devices, More Consumption, New & Better Information]

     Big Data encompasses not only the content itself, but how it's analyzed and consumed.
     •  Every gigabyte of stored content can generate a petabyte or more of transient data*
     •  The information about you is much greater than the information you create

     *Source: IDC 2011

Our Customers & Use Cases
               Customer Segmentation
               Cisco is processing 100s of terabytes of weblog data to segment customers
               downloading software from its support portal by product, geography, and
               industry.

               Social Media Listening
               Infomart built a brand new social media listening platform consuming 100s of millions of
               messages from a variety of social networks in real-time, adding custom influence and
               authority scores, and building a simple front-end on top of Elasticsearch's powerful
               API.

               Mission Critical Data Pipeline
               Spongecell's ad network produces over 10,000 events per second, and lost data
               means lost revenue. They built a robust, loss-free, high-volume data pipeline that
               processes all their events, meaning they never worry about their data again.

               Retail Analytics
               Koupon helps its large retail customers run marketing campaigns around mobile coupons.
               It collects data from mobile devices and adds context around demographics and
               geolocation to give those retailers in-depth insight about their own customers.

Big Data Cloud Services: Overview
     •  Data Integration and Real-Time Analytics
     •  Ad-Hoc Query and Near-Real-Time Analytics
     •  Batch Analytics

Big Data Cloud Services: Data Flow

Social Media Listening Platform
     Analytics
     •  Sentiment Analysis
     •  Authority Scoring
     •  Influencer Ranking
     •  Gender Classifier

     Application

Ironfan™
     Foundation for Your Big Data Services

     Ironfan is a systems provisioning, deployment, and
     updating tool. Ironfan automates not only machine
     configuration, but entire systems configuration to
     enable the complete Big Data stack, including data
     integration, routing, storage, computation, monitoring,
     and more.

     1.  Cycle time goes from weeks to minutes
     2.  Service discovery means your machines auto-wire
         themselves together
     3.  Infrastructure-as-Code provides a simple,
         iterative, testable contract for how your system
         will function
     4.  Leverages a combination of proprietary and open
         source code, including Chef and Fog

Data Delivery Service™
     Data Integration & Real-Time Analytics

     Data Delivery Service™ (DDS) integrates seamlessly with your
     existing environment, provides highly scalable ETL
     (extract-transform-load) capabilities, and enables real-time, streaming
     data analytics.

     DDS™ gives you scalability & flexibility

     •  Tap into virtually any data source
          •  Internal
          •  External
     •  Real-Time Stream Processing
          •  Ingestion
          •  Analytics
     •  Make Well-Informed Business Decisions
          •  On-the-fly queries

Database Management
      Ad-Hoc Query & Analytics
      Whether it's HBase, Cassandra, Elasticsearch, MongoDB,
      MySQL, or others, we ensure the right data storage for the job
      is always right at your fingertips.
      Database management gives you peace of mind
      •  Databases and data storage, as a service. We are your
         outsourced Big Data database administrator (DBA), providing:
              •  Database maintenance
              •  Updates
              •  Support
      •  Database Agnostic
              •  Amazon S3
              •  HBase
              •  Cassandra
              •  Elasticsearch
              •  MongoDB
              •  MySQL
              •  + Many More
      •  Deploy to your internal cloud or to a public cloud

Cloud Hadoop
      Batch Analytics
      Perform large-scale batch analysis as you need it, whether
      on ad-hoc Hadoop clusters or in always-on production
      workflows. Access all the tools you need, with on-demand
      scaling and tuning.
      Cloud Hadoop gives you cloud elasticity & efficiency
      •  Turn clusters on at a moment's notice
      •  Scale and customize on the fly
      •  Leverage tools that make Hadoop easier
           •  Wukong™
           •  Pig
           •  Hive
      •  Leverage tools that extend Hadoop
           •  Azkaban
           •  Sqoop
           •  + more

      Video: Hadoop Cluster in 20 Minutes

Wukong™
      Simplified Scripts for Analytics
      Wukong™ provides a simplified analytics scripting experience.
      Write your analytics in developer-friendly Ruby, run code locally
      for faster development cycles, and leverage existing analytics
      scripts.
      Wukong™ gives you superpowers!
      •  Ruby for Big Data Analytics - You can use a familiar,
         fun programming language to write both Hadoop jobs and DDS™
         algorithms.

      •  Quickly Iterate - Rather than developing and testing everything on
         your production Hadoop and DDS™ clusters, you can develop scripts
         locally on your laptop.

      •  Leverage Familiar Standard-In/Standard-Out Language -
         Wukong™ can run your existing standard-in/standard-out code
         against Big Data.
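
The standard-in/standard-out model described above can be sketched as follows. This is a minimal illustration and not Wukong's own API: it is written in Python rather than Ruby purely for illustration, and the names `map_words` and `run` are hypothetical. It shows the kind of line-oriented script that reads records on standard input and writes tab-separated key/value records on standard output, plus a local dry run on a small in-memory sample of the sort you might do on a laptop before touching a cluster.

```python
import io
import sys


def map_words(lines):
    """Yield one "word<TAB>1" record for every word in the input lines."""
    for line in lines:
        for word in line.split():
            yield f"{word.lower()}\t1"


def run(stdin=sys.stdin, stdout=sys.stdout):
    """Read records from stdin and write the mapped records to stdout."""
    for record in map_words(stdin):
        stdout.write(record + "\n")


# Local dry run on an in-memory sample instead of real cluster input.
sample_input = io.StringIO("Big Data Cloud\nbig data\n")
sample_output = io.StringIO()
run(sample_input, sample_output)
print(sample_output.getvalue(), end="")
```

Calling `run()` with no arguments makes the same code behave as an ordinary filter over `sys.stdin` and `sys.stdout`, which is the shape of script that streaming-style batch tools generally expect.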




Dashpot™
      Reporting & Systems Management
      Dashpot™ is a lightweight analytics and operations
      dashboard for administrators & developers.
      Dashpot™ gives you visibility and control!
      •  Real-Time visualizations from streaming data
      •  Deep Visibility
              •  Individual Machines
              •  Overall Systems
      •  Quickly Start & Stop functional units in your data
         clusters

Platform API
      Custom Applications and Dashboards
      With a unified API, control of the platform and visibility of the data
      within it are just a few web requests away.

      The Platform API gives you fine-grained control!

      •  HTTP-based API
          •  Simple JSON commands
      •  Access data through a simple, unified endpoint
      •  Manage Platform Configuration Settings

Big Data Cloud
      Bringing Big Data Analytics To Your Enterprise Data

      [Diagram label: Analytics]

Traditional vs. DIY vs. Infochimps
      Traditional Data Infrastructure
      •  24 Month Project
      •  $1M for 10TB
      •  Analyzing 15% of Enterprise Data

      Big Data Infrastructure
      •  12 Month Project
      •  $300K for 10TB
      •  Analyzing up to 100% of Enterprise Data

      Big Data Cloud
      •  1 Month Project
      •  $10K / month for 10TB
      •  Analyzing up to 100% of Enterprise Data + 15,000+ external sources
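
To put the three price points side by side, here is a rough arithmetic sketch. It rests on assumptions the slide does not state: that the $1M and $300K figures are one-time up-front costs, that the $10K monthly fee stays flat, and that staffing and operating costs are ignored. The function name `break_even_months` and the "DIY" label for the Big Data Infrastructure column are illustrative choices.

```python
def break_even_months(upfront_cost, monthly_fee):
    """Months of a flat monthly fee needed to equal a one-time up-front cost."""
    return upfront_cost / monthly_fee


MONTHLY_FEE = 10_000  # "$10K / month for 10TB" from the slide

# Assumed to be one-time costs; the slide does not say so explicitly.
upfront_costs = {"Traditional": 1_000_000, "DIY": 300_000}

for name, cost in upfront_costs.items():
    months = break_even_months(cost, MONTHLY_FEE)
    print(f"{name}: monthly fees match the up-front cost after {months:.0f} months")
```

Under those assumptions the monthly fee would take 30 months to add up to the DIY figure and 100 months to add up to the traditional figure.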


Cloud Delivery

  Data Center Infrastructure
  ‣    Lights Out Data Center
  ‣    Global Footprint
  ‣    Co-located with Data
  ‣    99.95 - 99.995% SLA
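
As a quick worked example of what that SLA range means in practice, the sketch below converts an availability percentage into the downtime it permits per year. It assumes a 365-day year and treats the SLA as a simple uptime percentage; the function name `allowed_downtime_minutes` is hypothetical, and the actual contract terms are not given in this deck.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes, ignoring leap years


def allowed_downtime_minutes(sla_percent):
    """Minutes per year a service may be down while still meeting the SLA."""
    return (100.0 - sla_percent) / 100.0 * MINUTES_PER_YEAR


for sla in (99.95, 99.995):
    minutes = allowed_downtime_minutes(sla)
    print(f"{sla}% availability allows about {minutes:.1f} minutes of downtime per year")
```

On these assumptions, 99.95% corresponds to a little under four and a half hours of downtime per year, and 99.995% to under half an hour.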




Cloud Delivery

  Business Intelligence
  ‣    Visualize your data
  ‣    Business Reporting
  ‣    Application Integration
  ‣    Integrated with the Cloud

Cloud Delivery

  Professional Services
  ‣    Big Data Planning
  ‣    Data Modeling
  ‣    Analytics
  ‣    Architecture/Design
  ‣    Implementation

Infochimps Engagement Model
     1.  Identify first use case, create proposal, design workflows, and
         iterate on architecture locally
     2.  Deploy initial design to development & staging cloud,
         iteratively add functionality
     3.  Deploy to production public or private cloud and scale out

Contact Information
       Brian Krpec
       Director of Sales
       512-709-4704 cell
       brian.krpec@infochimps.com
       @bkrpec
