SlideShare a Scribd company logo
Infochimps targets enterprises with
     stream-processing additions to 'big
                 data' PaaS
Analyst: Matt Aslett
14 Nov, 2012



'Big data' PaaS provider Infochimps has updated its Infochimps Platform with the addition of
stream-processing capabilities to the Infochimps Data Delivery Service based on technologies
first developed at Twitter and LinkedIn. With its first paying customer on board, the company is
now seeking partnerships to help support its enterprise-focused PaaS offering.




   The 451 Take

   There's a big difference between offering Hadoop as a service to be configured, deployed and
   managed, and offering a managed service that masks the complexity of configuring and
   deploying Hadoop. We believe the latter will gain traction as more late adopters begin to look
   at adopting the benefits of Hadoop without investing upfront in the expertise and
   infrastructure required to support it. While Infochimps will need to establish the trust of its
   target customers, it is well-positioned with an easy-to-consume managed service for those
   without Hadoop expertise, as well as a stack of technologically interesting projects for the
   'devops' crowd.




Context

We first covered Infochimps earlier this year when the company pivoted from being a data
marketplace provider to releasing the technology that supported its data marketplace, both as
open source projects and as PaaS. The initial focus was on making it easier to deploy the Hadoop


Copyright 2012 - The 451 Group                                                                       1
data-processing framework via a Chef-based systems provisioning, deployment and updating tool
called IronFan. Infochimps has expanded since then with the addition in April of an operations
dashboard called Dashpot, and in August with the addition of the Apache Flume-based Data
Delivery Service (DDS) for integrating with existing data sources, as well as early data-streaming
functionality in DDS via extensions to Wukong, the company's Ruby for Hadoop. The latest addition
to the platform expands its support for stream processing through the integration of open source
stream-processing projects Storm and Kafka.


Initially developed by BackType and released as an open source project by Twitter in August 2011
following its acquisition of the social analytics provider, Storm is a stream-processing engine. Kafka,
meanwhile, is a distributed message queue originally developed by LinkedIn and used by the
company in a number of projects, including feeding all activity events to its data warehouse and
Hadoop, as well as keeping its search engine up to date with network activity in real time. Storm
and Kafka are used by Infochimps as the foundation of DDS, which is used to connect the
company's Hadoop-based PaaS with multiple existing data sources, enabling real-time integration
of relevant data for processing and analysis.


DDS is a key component of the Infochimps Platform that elevates it beyond a platform for Hadoop
deployment to being a potential big data management and analytics platform of choice. It is DDS
that will enable businesses to adopt the Infochimps Platform alongside existing data management
technologies and quickly gain insight from new and existing sources of data.


Infochimps' main selling point is in lowering the barriers to adopting Hadoop. While there is a lot of
complex technology involved – such as IronFan, elastic Hadoop, DDS, elasticsearch, NoSQL and
NewSQL databases, Wukong and Dashpot – the platform is delivered as a service designed to mask
that complexity. The company maintains that it can take customers from nowhere to generating
business insight from the Infochimps Platform in 30 days, without the need to hire specialist
support and analytics staff, or invest in specialist infrastructure.


Infochimps has attracted nine paying customers since its platform went live in the second quarter,
with an average selling price of $200,000. The company charges customers per node per month for
what is currently a public cloud offering hosted on Amazon Web Services or Rackspace Cloud.
Infochimps has established relationships (soon to be announced) to deliver both private cloud and
virtual private cloud offerings supported in its customers' own datacenters or via their trusted
datacenter provider. The company is launching its cloud services across a network of tier four
datacenters in North America and will begin offering its big data cloud services in the first quarter
of 2013. The potential to support private cloud deployments will be aided by the fact that IronFan is

Copyright 2012 - The 451 Group                                                                           2
a key component in VMware's Serengeti project to make it easy to configure and deploy Hadoop on
virtual machines, while the Infochimps Platform also supports the OpenStack API.


The shift toward more enterprise-focused services and partnerships is being led by former Teradata
and StackIQ executive (and Xerox PARC EIR) Jim Kaskade, who joined the company as CEO in
August, replacing cofounder Joe Kelly, who became COO. Kaskade has also been busy lining up a
new major financing round. Infochimps had previously raised a total of $3m from investors
including DFJ Mercury, although that was during its previous incarnation as a data marketplace
provider. The company currently has 23 employees, up from 14 in March.


Competition

There are an increasing number of vendors offering Hadoop as a service, with Amazon and Google
being the biggest players at this point. While they therefore pose a competitive threat to
Infochimps, the value proposition is quite different, since it still requires a degree of expertise to
configure, deploy and manage a cloud-based Hadoop service in comparison to Infochimps'
managed services approach. We've seen limited uptake of cloud-based Hadoop services to date,
with the main use case being development and testing. Indeed, we've noted before that if a
company begins to move toward a larger-scale deployment, the costs can be prohibitive enough to
require on-premises deployment. While Infochimps' service is initially based on the public cloud, it
has designs on supporting deployment choice. The company also believes that with the added
value of IronFan, DDS, Wukong, Dashpot and the rest, along with its managed services approach, it
has enough to justify the additional cost above that of running Hadoop on a public cloud service
with the required expertise.


Other Hadoop service providers include SunGard, Treasure Data, Qubole, Mortar Data and Guavus,
while Infochimps believes its closest competition will come from MetaScale, the Hadoop managed
services subsidiary of Sears Holdings, and tresata, the stealthy data platform provider founded by
former Bank of America managing director for big data and analytics Abhi Mehta. Other vendors are
trying to mask the complexity of configuring and deploying Hadoop by building it into larger
on-premises application stacks, so we might also expect would-be customers to consider the likes
of Drawn to Scale, Splice Machine or Digital Reasoning, depending on the specific application. The
company must also be considered a rival to some extent with Hadoop distributors such as Cloudera,
Hortonworks, MapR, IBM and EMC, although there is also the potential for partnerships here, as
indicated by the fact that Cloudera CEO Mike Olson is an adviser to Infochimps.




Copyright 2012 - The 451 Group                                                                           3
SWOT Analysis

 Strengths                                               Weaknesses
 We were already fans of the Chef-based cluster          Managed services relationships are built on trust.
 platform tuned for the needs of enterprises using       While Infochimps has technological expertise, it
 Hadoop. DDS adds all-important integration with         will need to establish itself before some would-be
 existing tools that will help drive wider adoption.     customers will consider it.
 Opportunities                                           Threats
 We are seeing an increasing need for technologies and   The big services and software providers are
 services that mask the complexity of configuring,       unlikely to sit back and let demand for Hadoop
 deploying and managing Hadoop for late adopters.        managed services go elsewhere. Expect the
 Infochimps has both.                                    competition to increase with demand.




Copyright 2012 - The 451 Group                                                                                4
Reproduced by permission of The 451 Group; © 2012. This report was originally published within 451
 Research’s Market Insight Service. For additional information on 451 Research or to apply for trial access, go
 to: www.451research.com




Copyright 2012 - The 451 Group                                                                                    5

More Related Content

What's hot

Hadoop Platforms - Introduction, Importance, Providers
Hadoop Platforms - Introduction, Importance, ProvidersHadoop Platforms - Introduction, Importance, Providers
Hadoop Platforms - Introduction, Importance, Providers
Mrigendra Sharma
 
Bigger Data For Your Budget
Bigger Data For Your BudgetBigger Data For Your Budget
Bigger Data For Your Budget
Hortonworks
 
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsWP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsJane Roberts
 
Introduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for WindowsIntroduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for Windows
Hortonworks
 
Enterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageEnterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble Storage
Hortonworks
 
Data Discovery, Visualization, and Apache Hadoop
Data Discovery, Visualization, and Apache HadoopData Discovery, Visualization, and Apache Hadoop
Data Discovery, Visualization, and Apache Hadoop
Hortonworks
 
Trucking demo w Spark ML - Paul Hargis - Hortonworks
Trucking demo w Spark ML - Paul Hargis - HortonworksTrucking demo w Spark ML - Paul Hargis - Hortonworks
Trucking demo w Spark ML - Paul Hargis - Hortonworks
Kelly Kohlleffel
 
MapR Data Hub White Paper V2 2014
MapR Data Hub White Paper V2 2014MapR Data Hub White Paper V2 2014
MapR Data Hub White Paper V2 2014Erni Susanti
 
Redington Value Journal - September 2017
Redington Value Journal - September 2017Redington Value Journal - September 2017
Redington Value Journal - September 2017
Redington Value Distribution
 
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Hortonworks
 
Hortonworks and HP Vertica Webinar
Hortonworks and HP Vertica WebinarHortonworks and HP Vertica Webinar
Hortonworks and HP Vertica Webinar
Hortonworks
 
Talend Open Studio and Hortonworks Data Platform
Talend Open Studio and Hortonworks Data PlatformTalend Open Studio and Hortonworks Data Platform
Talend Open Studio and Hortonworks Data Platform
Hortonworks
 
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache HadoopRescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Hortonworks
 
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
Hortonworks
 
Apache Hadoop on the Open Cloud
Apache Hadoop on the Open CloudApache Hadoop on the Open Cloud
Apache Hadoop on the Open Cloud
Hortonworks
 
Data warehouse-optimization-with-hadoop-informatica-cloudera
Data warehouse-optimization-with-hadoop-informatica-clouderaData warehouse-optimization-with-hadoop-informatica-cloudera
Data warehouse-optimization-with-hadoop-informatica-clouderaJyrki Määttä
 
IBM - Transformation digitale et le SI des banques
IBM - Transformation digitale et le SI des banquesIBM - Transformation digitale et le SI des banques
IBM - Transformation digitale et le SI des banques
Rodolphe Lezennec
 
Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...
Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...
Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...
Hortonworks
 

What's hot (18)

Hadoop Platforms - Introduction, Importance, Providers
Hadoop Platforms - Introduction, Importance, ProvidersHadoop Platforms - Introduction, Importance, Providers
Hadoop Platforms - Introduction, Importance, Providers
 
Bigger Data For Your Budget
Bigger Data For Your BudgetBigger Data For Your Budget
Bigger Data For Your Budget
 
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsWP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
 
Introduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for WindowsIntroduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for Windows
 
Enterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageEnterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble Storage
 
Data Discovery, Visualization, and Apache Hadoop
Data Discovery, Visualization, and Apache HadoopData Discovery, Visualization, and Apache Hadoop
Data Discovery, Visualization, and Apache Hadoop
 
Trucking demo w Spark ML - Paul Hargis - Hortonworks
Trucking demo w Spark ML - Paul Hargis - HortonworksTrucking demo w Spark ML - Paul Hargis - Hortonworks
Trucking demo w Spark ML - Paul Hargis - Hortonworks
 
MapR Data Hub White Paper V2 2014
MapR Data Hub White Paper V2 2014MapR Data Hub White Paper V2 2014
MapR Data Hub White Paper V2 2014
 
Redington Value Journal - September 2017
Redington Value Journal - September 2017Redington Value Journal - September 2017
Redington Value Journal - September 2017
 
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
 
Hortonworks and HP Vertica Webinar
Hortonworks and HP Vertica WebinarHortonworks and HP Vertica Webinar
Hortonworks and HP Vertica Webinar
 
Talend Open Studio and Hortonworks Data Platform
Talend Open Studio and Hortonworks Data PlatformTalend Open Studio and Hortonworks Data Platform
Talend Open Studio and Hortonworks Data Platform
 
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache HadoopRescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
 
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
 
Apache Hadoop on the Open Cloud
Apache Hadoop on the Open CloudApache Hadoop on the Open Cloud
Apache Hadoop on the Open Cloud
 
Data warehouse-optimization-with-hadoop-informatica-cloudera
Data warehouse-optimization-with-hadoop-informatica-clouderaData warehouse-optimization-with-hadoop-informatica-cloudera
Data warehouse-optimization-with-hadoop-informatica-cloudera
 
IBM - Transformation digitale et le SI des banques
IBM - Transformation digitale et le SI des banquesIBM - Transformation digitale et le SI des banques
IBM - Transformation digitale et le SI des banques
 
Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...
Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...
Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...
 

Viewers also liked

111302011 iw computing_economics_final_v2
111302011 iw computing_economics_final_v2111302011 iw computing_economics_final_v2
111302011 iw computing_economics_final_v2Accenture
 
Braking china
Braking chinaBraking china
Braking chinaAccenture
 
Festive Cook Book
Festive Cook BookFestive Cook Book
Festive Cook Book
Grazyna
 
Emc big data
Emc big dataEmc big data
Emc big dataAccenture
 
Data ontap 8.x 7 mode cook book v1 1
Data ontap 8.x 7 mode cook book v1 1Data ontap 8.x 7 mode cook book v1 1
Data ontap 8.x 7 mode cook book v1 1Accenture
 
NetApp system installation workbook Spokane
NetApp system installation workbook SpokaneNetApp system installation workbook Spokane
NetApp system installation workbook SpokaneAccenture
 

Viewers also liked (6)

111302011 iw computing_economics_final_v2
111302011 iw computing_economics_final_v2111302011 iw computing_economics_final_v2
111302011 iw computing_economics_final_v2
 
Braking china
Braking chinaBraking china
Braking china
 
Festive Cook Book
Festive Cook BookFestive Cook Book
Festive Cook Book
 
Emc big data
Emc big dataEmc big data
Emc big data
 
Data ontap 8.x 7 mode cook book v1 1
Data ontap 8.x 7 mode cook book v1 1Data ontap 8.x 7 mode cook book v1 1
Data ontap 8.x 7 mode cook book v1 1
 
NetApp system installation workbook Spokane
NetApp system installation workbook SpokaneNetApp system installation workbook Spokane
NetApp system installation workbook Spokane
 

Similar to Infochimps report 451 research impact report

Why Hadoop as a Service?
Why Hadoop as a Service?Why Hadoop as a Service?
Why Hadoop as a Service?
Virtusa Corporation
 
SnapLogic Raises $37.5M to Fuel Big Data Integration Push
SnapLogic Raises $37.5M to Fuel Big Data Integration PushSnapLogic Raises $37.5M to Fuel Big Data Integration Push
SnapLogic Raises $37.5M to Fuel Big Data Integration Push
SnapLogic
 
InterSystems IRIS Data Platform : Machine learning on the way
InterSystems IRIS Data Platform : Machine learning on the wayInterSystems IRIS Data Platform : Machine learning on the way
InterSystems IRIS Data Platform : Machine learning on the way
Robert Bira
 
Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges"
Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges" Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges"
Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges"
Dataconomy Media
 
BIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social MediaBIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social Media
Skillspeed
 
Hadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - JaspersoftHadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - Jaspersoft
Hortonworks
 
Hadoop data-lake-white-paper
Hadoop data-lake-white-paperHadoop data-lake-white-paper
Hadoop data-lake-white-paper
Supratim Ray
 
DevOps and Modern Application Development in the Cloud: Red Hat, T-Systems, a...
DevOps and Modern Application Development in the Cloud: Red Hat, T-Systems, a...DevOps and Modern Application Development in the Cloud: Red Hat, T-Systems, a...
DevOps and Modern Application Development in the Cloud: Red Hat, T-Systems, a...
Stefan Zosel
 
IIA: The Current State of Hadoop in the Enterprise
IIA: The Current State of Hadoop in the EnterpriseIIA: The Current State of Hadoop in the Enterprise
IIA: The Current State of Hadoop in the Enterprise
Coy Dean
 
The Big Picture on Big Data and Cognos
The Big Picture on Big Data and CognosThe Big Picture on Big Data and Cognos
The Big Picture on Big Data and Cognos
Senturus
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunities
Bigdata Meetup Kochi
 
Building a Hybrid Data Pipeline for Salesforce and Hadoop
Building a Hybrid Data Pipeline for Salesforce and HadoopBuilding a Hybrid Data Pipeline for Salesforce and Hadoop
Building a Hybrid Data Pipeline for Salesforce and Hadoop
Sumit Sarkar
 
ds_Pivotal_Big_Data_Suite_Product_Suite
ds_Pivotal_Big_Data_Suite_Product_Suiteds_Pivotal_Big_Data_Suite_Product_Suite
ds_Pivotal_Big_Data_Suite_Product_SuiteRobin Fong 方俊强
 
SnapLogic's Latest Elastic iPaaS Release Adds Hybrid Links for Spark, Cortana...
SnapLogic's Latest Elastic iPaaS Release Adds Hybrid Links for Spark, Cortana...SnapLogic's Latest Elastic iPaaS Release Adds Hybrid Links for Spark, Cortana...
SnapLogic's Latest Elastic iPaaS Release Adds Hybrid Links for Spark, Cortana...
SnapLogic
 
Supporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big DataSupporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big Data
WANdisco Plc
 
Hortonworks Hadoop @ Oslo Hadoop User Group
Hortonworks Hadoop @ Oslo Hadoop User GroupHortonworks Hadoop @ Oslo Hadoop User Group
Hortonworks Hadoop @ Oslo Hadoop User Group
Mats Johansson
 
Meetup oslo hortonworks HDP
Meetup oslo hortonworks HDPMeetup oslo hortonworks HDP
Meetup oslo hortonworks HDP
Alexander Bakos Leirvåg
 
Cascading 2015 User Survey Results
Cascading 2015 User Survey ResultsCascading 2015 User Survey Results
Cascading 2015 User Survey Results
Kim Loughead
 
Cascading 2015 User Survey Results
Cascading 2015 User Survey ResultsCascading 2015 User Survey Results
Cascading 2015 User Survey Results
Cascading
 

Similar to Infochimps report 451 research impact report (20)

Why Hadoop as a Service?
Why Hadoop as a Service?Why Hadoop as a Service?
Why Hadoop as a Service?
 
SnapLogic Raises $37.5M to Fuel Big Data Integration Push
SnapLogic Raises $37.5M to Fuel Big Data Integration PushSnapLogic Raises $37.5M to Fuel Big Data Integration Push
SnapLogic Raises $37.5M to Fuel Big Data Integration Push
 
InterSystems IRIS Data Platform : Machine learning on the way
InterSystems IRIS Data Platform : Machine learning on the wayInterSystems IRIS Data Platform : Machine learning on the way
InterSystems IRIS Data Platform : Machine learning on the way
 
Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges"
Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges" Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges"
Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges"
 
BIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social MediaBIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social Media
 
Hadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - JaspersoftHadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - Jaspersoft
 
Hadoop data-lake-white-paper
Hadoop data-lake-white-paperHadoop data-lake-white-paper
Hadoop data-lake-white-paper
 
DevOps and Modern Application Development in the Cloud: Red Hat, T-Systems, a...
DevOps and Modern Application Development in the Cloud: Red Hat, T-Systems, a...DevOps and Modern Application Development in the Cloud: Red Hat, T-Systems, a...
DevOps and Modern Application Development in the Cloud: Red Hat, T-Systems, a...
 
IIA: The Current State of Hadoop in the Enterprise
IIA: The Current State of Hadoop in the EnterpriseIIA: The Current State of Hadoop in the Enterprise
IIA: The Current State of Hadoop in the Enterprise
 
The Big Picture on Big Data and Cognos
The Big Picture on Big Data and CognosThe Big Picture on Big Data and Cognos
The Big Picture on Big Data and Cognos
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunities
 
Building a Hybrid Data Pipeline for Salesforce and Hadoop
Building a Hybrid Data Pipeline for Salesforce and HadoopBuilding a Hybrid Data Pipeline for Salesforce and Hadoop
Building a Hybrid Data Pipeline for Salesforce and Hadoop
 
ds_Pivotal_Big_Data_Suite_Product_Suite
ds_Pivotal_Big_Data_Suite_Product_Suiteds_Pivotal_Big_Data_Suite_Product_Suite
ds_Pivotal_Big_Data_Suite_Product_Suite
 
Combining hadoop with big data analytics
Combining hadoop with big data analyticsCombining hadoop with big data analytics
Combining hadoop with big data analytics
 
SnapLogic's Latest Elastic iPaaS Release Adds Hybrid Links for Spark, Cortana...
SnapLogic's Latest Elastic iPaaS Release Adds Hybrid Links for Spark, Cortana...SnapLogic's Latest Elastic iPaaS Release Adds Hybrid Links for Spark, Cortana...
SnapLogic's Latest Elastic iPaaS Release Adds Hybrid Links for Spark, Cortana...
 
Supporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big DataSupporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big Data
 
Hortonworks Hadoop @ Oslo Hadoop User Group
Hortonworks Hadoop @ Oslo Hadoop User GroupHortonworks Hadoop @ Oslo Hadoop User Group
Hortonworks Hadoop @ Oslo Hadoop User Group
 
Meetup oslo hortonworks HDP
Meetup oslo hortonworks HDPMeetup oslo hortonworks HDP
Meetup oslo hortonworks HDP
 
Cascading 2015 User Survey Results
Cascading 2015 User Survey ResultsCascading 2015 User Survey Results
Cascading 2015 User Survey Results
 
Cascading 2015 User Survey Results
Cascading 2015 User Survey ResultsCascading 2015 User Survey Results
Cascading 2015 User Survey Results
 

More from Accenture

Certify 2014trends-report
Certify 2014trends-reportCertify 2014trends-report
Certify 2014trends-report
Accenture
 
Calabrio analyze
Calabrio analyzeCalabrio analyze
Calabrio analyze
Accenture
 
Tier 2 net app baseline design standard revised nov 2011
Tier 2 net app baseline design standard   revised nov 2011Tier 2 net app baseline design standard   revised nov 2011
Tier 2 net app baseline design standard revised nov 2011Accenture
 
Perf stat windows
Perf stat windowsPerf stat windows
Perf stat windowsAccenture
 
Performance problems on ethernet networks when the e0m management interface i...
Performance problems on ethernet networks when the e0m management interface i...Performance problems on ethernet networks when the e0m management interface i...
Performance problems on ethernet networks when the e0m management interface i...Accenture
 
Migrate volume in akfiler7
Migrate volume in akfiler7Migrate volume in akfiler7
Migrate volume in akfiler7Accenture
 
Migrate vol in akfiler7
Migrate vol in akfiler7Migrate vol in akfiler7
Migrate vol in akfiler7Accenture
 
Data storage requirements AK
Data storage requirements AKData storage requirements AK
Data storage requirements AKAccenture
 
C mode class
C mode classC mode class
C mode classAccenture
 
Akfiler upgrades providence july 2012
Akfiler upgrades providence july 2012Akfiler upgrades providence july 2012
Akfiler upgrades providence july 2012Accenture
 
Reporting demo
Reporting demoReporting demo
Reporting demoAccenture
 
Net app virtualization preso
Net app virtualization presoNet app virtualization preso
Net app virtualization presoAccenture
 
Providence net app upgrade plan PPMC
Providence net app upgrade plan PPMCProvidence net app upgrade plan PPMC
Providence net app upgrade plan PPMCAccenture
 
WSC Net App storage for windows challenges and solutions
WSC Net App storage for windows challenges and solutionsWSC Net App storage for windows challenges and solutions
WSC Net App storage for windows challenges and solutionsAccenture
 
50,000-seat_VMware_view_deployment
50,000-seat_VMware_view_deployment50,000-seat_VMware_view_deployment
50,000-seat_VMware_view_deploymentAccenture
 
Tr 3998 -deployment_guide_for_hosted_shared_desktops_and_on-demand_applicatio...
Tr 3998 -deployment_guide_for_hosted_shared_desktops_and_on-demand_applicatio...Tr 3998 -deployment_guide_for_hosted_shared_desktops_and_on-demand_applicatio...
Tr 3998 -deployment_guide_for_hosted_shared_desktops_and_on-demand_applicatio...Accenture
 
Tr 3749 -net_app_storage_best_practices_for_v_mware_vsphere,_dec_11
Tr 3749 -net_app_storage_best_practices_for_v_mware_vsphere,_dec_11Tr 3749 -net_app_storage_best_practices_for_v_mware_vsphere,_dec_11
Tr 3749 -net_app_storage_best_practices_for_v_mware_vsphere,_dec_11Accenture
 
Snap mirror source to tape to destination scenario
Snap mirror source to tape to destination scenarioSnap mirror source to tape to destination scenario
Snap mirror source to tape to destination scenarioAccenture
 
Ref arch for ve sg248155
Ref arch for ve sg248155Ref arch for ve sg248155
Ref arch for ve sg248155Accenture
 

More from Accenture (20)

Certify 2014trends-report
Certify 2014trends-reportCertify 2014trends-report
Certify 2014trends-report
 
Calabrio analyze
Calabrio analyzeCalabrio analyze
Calabrio analyze
 
Tier 2 net app baseline design standard revised nov 2011
Tier 2 net app baseline design standard   revised nov 2011Tier 2 net app baseline design standard   revised nov 2011
Tier 2 net app baseline design standard revised nov 2011
 
Perf stat windows
Perf stat windowsPerf stat windows
Perf stat windows
 
Performance problems on ethernet networks when the e0m management interface i...
Performance problems on ethernet networks when the e0m management interface i...Performance problems on ethernet networks when the e0m management interface i...
Performance problems on ethernet networks when the e0m management interface i...
 
Migrate volume in akfiler7
Migrate volume in akfiler7Migrate volume in akfiler7
Migrate volume in akfiler7
 
Migrate vol in akfiler7
Migrate vol in akfiler7Migrate vol in akfiler7
Migrate vol in akfiler7
 
Data storage requirements AK
Data storage requirements AKData storage requirements AK
Data storage requirements AK
 
C mode class
C mode classC mode class
C mode class
 
Akfiler upgrades providence july 2012
Akfiler upgrades providence july 2012Akfiler upgrades providence july 2012
Akfiler upgrades providence july 2012
 
NA notes
NA notesNA notes
NA notes
 
Reporting demo
Reporting demoReporting demo
Reporting demo
 
Net app virtualization preso
Net app virtualization presoNet app virtualization preso
Net app virtualization preso
 
Providence net app upgrade plan PPMC
Providence net app upgrade plan PPMCProvidence net app upgrade plan PPMC
Providence net app upgrade plan PPMC
 
WSC Net App storage for windows challenges and solutions
WSC Net App storage for windows challenges and solutionsWSC Net App storage for windows challenges and solutions
WSC Net App storage for windows challenges and solutions
 
50,000-seat_VMware_view_deployment
50,000-seat_VMware_view_deployment50,000-seat_VMware_view_deployment
50,000-seat_VMware_view_deployment
 
Tr 3998 -deployment_guide_for_hosted_shared_desktops_and_on-demand_applicatio...
Tr 3998 -deployment_guide_for_hosted_shared_desktops_and_on-demand_applicatio...Tr 3998 -deployment_guide_for_hosted_shared_desktops_and_on-demand_applicatio...
Tr 3998 -deployment_guide_for_hosted_shared_desktops_and_on-demand_applicatio...
 
Tr 3749 -net_app_storage_best_practices_for_v_mware_vsphere,_dec_11
Tr 3749 -net_app_storage_best_practices_for_v_mware_vsphere,_dec_11Tr 3749 -net_app_storage_best_practices_for_v_mware_vsphere,_dec_11
Tr 3749 -net_app_storage_best_practices_for_v_mware_vsphere,_dec_11
 
Snap mirror source to tape to destination scenario
Snap mirror source to tape to destination scenarioSnap mirror source to tape to destination scenario
Snap mirror source to tape to destination scenario
 
Ref arch for ve sg248155
Ref arch for ve sg248155Ref arch for ve sg248155
Ref arch for ve sg248155
 

Recently uploaded

By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 

Recently uploaded (20)

By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 

Infochimps report 451 research impact report

  • 1. Infochimps targets enterprises with stream-processing additions to 'big data' PaaS Analyst: Matt Aslett 14 Nov, 2012 'Big data' PaaS provider Infochimps has updated its Infochimps Platform with the addition of stream-processing capabilities to the Infochimps Data Delivery Service based on technologies first developed at Twitter and LinkedIn. With its first paying customer on board, the company is now seeking partnerships to help support its enterprise-focused PaaS offering. The 451 Take There's a big difference between offering Hadoop as a service to be configured, deployed and managed, and offering a managed service that masks the complexity of configuring and deploying Hadoop. We believe the latter will gain traction as more late adopters begin to look at adopting the benefits of Hadoop without investing upfront in the expertise and infrastructure required to support it. While Infochimps will need to establish the trust of its target customers, it is well-positioned with an easy-to-consume managed service for those without Hadoop expertise, as well as a stack of technologically interesting projects for the 'devops' crowd. Context We first covered Infochimps earlier this year when the company pivoted from being a data marketplace provider to releasing the technology that supported its data marketplace, both as open source projects and as PaaS. The initial focus was on making it easier to deploy the Hadoop Copyright 2012 - The 451 Group 1
  • 2. data-processing framework via a Chef-based systems provisioning, deployment and updating tool called IronFan. Infochimps has expanded since then with the addition in April of an operations dashboard called Dashpot, and in August with the addition of the Apache Flume-based Data Delivery Service (DDS) for integrating with existing data sources, as well as early data-streaming functionality in DDS via extensions to Wukong, the company's Ruby for Hadoop. The latest addition to the platform expands its support for stream processing through the integration of open source stream-processing projects Storm and Kafka. Initially developed by BackType and released as an open source project by Twitter in August 2011 following its acquisition of the social analytics provider, Storm is a stream-processing engine. Kafka, meanwhile, is a distributed message queue originally developed by LinkedIn and used by the company in a number of projects, including feeding all activity events to its data warehouse and Hadoop, as well as keeping its search engine up to date with network activity in real time. Storm and Kafka are used by Infochimps as the foundation of DDS, which is used to connect the company's Hadoop-based PaaS with multiple existing data sources, enabling real-time integration of relevant data for processing and analysis. DDS is a key component of the Infochimps Platform that elevates it beyond a platform for Hadoop deployment to being a potential big data management and analytics platform of choice. It is DDS that will enable businesses to adopt the Infochimps Platform alongside existing data management technologies and quickly gain insight from new and existing sources of data. Infochimps' main selling point is in lowering the barriers to adopting Hadoop. While there is a lot of complex technology involved – such as IronFan, elastic Hadoop, DDS, elasticsearch, NoSQL and NewSQL databases, Wukong and Dashpot – the platform is delivered as a service designed to mask that complexity. The company maintains that it can take customers from nowhere to generating business insight from the Infochimps Platform in 30 days, without the need to hire specialist support and analytics staff, or invest in specialist infrastructure. Infochimps has attracted nine paying customers since its platform went live in the second quarter, with an average selling price of $200,000. The company charges customers per node per month for what is currently a public cloud offering hosted on Amazon Web Services or Rackspace Cloud. Infochimps has established relationships (soon to be announced) to deliver both private cloud and virtual private cloud offerings supported in its customers' own datacenters or via their trusted datacenter provider. The company is launching its cloud services across a network of tier four datacenters in North America and will begin offering its big data cloud services in the first quarter of 2013. The potential to support private cloud deployments will be aided by the fact that IronFan is Copyright 2012 - The 451 Group 2
  • 3. a key component in VMware's Serengeti project to make it easy to configure and deploy Hadoop on virtual machines, while the Infochimps Platform also supports the OpenStack API. The shift toward more enterprise-focused services and partnerships is being led by former Teradata and StackIQ executive (and Xerox PARC EIR) Jim Kaskade, who joined the company as CEO in August, replacing cofounder Joe Kelly, who became COO. Kaskade has also been busy lining up a new major financing round. Infochimps had previously raised a total of $3m from investors including DFJ Mercury, although that was during its previous incarnation as a data marketplace provider. The company currently has 23 employees, up from 14 in March. Competition There are an increasing number of vendors offering Hadoop as a service, with Amazon and Google being the biggest players at this point. While they therefore pose a competitive threat to Infochimps, the value proposition is quite different, since it still requires a degree of expertise to configure, deploy and manage a cloud-based Hadoop service in comparison to Infochimps' managed services approach. We've seen limited uptake of cloud-based Hadoop services to date, with the main use case being development and testing. Indeed, we've noted before that if a company begins to move toward a larger-scale deployment, the costs can be prohibitive enough to require on-premises deployment. While Infochimps' service is initially based on the public cloud, it has designs on supporting deployment choice. The company also believes that with the added value of IronFan, DDS, Wukong, Dashpot and the rest, along with its managed services approach, it has enough to justify the additional cost above that of running Hadoop on a public cloud service with the required expertise. Other Hadoop service providers include SunGard, Treasure Data, Qubole, Mortar Data and Guavus, while Infochimps believes its closest competition will come from MetaScale, the Hadoop managed services subsidiary of Sears Holdings, and tresata, the stealthy data platform provider founded by former Bank of America managing director for big data and analytics Abhi Mehta. Other vendors are trying to mask the complexity of configuring and deploying Hadoop by building it into larger on-premises application stacks, so we might also expect would-be customers to consider the likes of Drawn to Scale, Splice Machine or Digital Reasoning, depending on the specific application. The company must also be considered a rival to some extent with Hadoop distributors such as Cloudera, Hortonworks, MapR, IBM and EMC, although there is also the potential for partnerships here, as indicated by the fact that Cloudera CEO Mike Olson is an adviser to Infochimps. Copyright 2012 - The 451 Group 3
  • 4. SWOT Analysis Strengths Weaknesses We were already fans of the Chef-based cluster Managed services relationships are built on trust. platform tuned for the needs of enterprises using While Infochimps has technological expertise, it Hadoop. DDS adds all-important integration with will need to establish itself before some would-be existing tools that will help drive wider adoption. customers will consider it. Opportunities Threats We are seeing an increasing need for technologies and The big services and software providers are services that mask the complexity of configuring, unlikely to sit back and let demand for Hadoop deploying and managing Hadoop for late adopters. managed services go elsewhere. Expect the Infochimps has both. competition to increase with demand. Copyright 2012 - The 451 Group 4
  • 5. Reproduced by permission of The 451 Group; © 2012. This report was originally published within 451 Research’s Market Insight Service. For additional information on 451 Research or to apply for trial access, go to: www.451research.com Copyright 2012 - The 451 Group 5