SlideShare a Scribd company logo
1 of 23
Download to read offline
Presenter:	
  John	
  Johnson	
  
Date:	
  01/01/14	
  
PRESENTATION	
  TITLE	
  
Everything	
  but	
  your	
  code®	
  
Contents	
  
•  What	
  does	
  “Big	
  Data”	
  really	
  mean?	
  
•  Big	
  Data	
  use	
  cases	
  
•  Considera:ons	
  when	
  building	
  your	
  
project/applica:on	
  
•  Hos:ng	
  op:ons	
  and	
  Big	
  Data	
  challenges	
  
•  Opera:ons-­‐as-­‐a-­‐Service	
  
•  Customer	
  close-­‐up	
  
“Big	
  data	
  is	
  a	
  collec.on	
  of	
  [unstructured]	
  
data	
  from	
  tradi.onal	
  and	
  digital	
  sources	
  
inside	
  and	
  outside	
  your	
  company	
  that	
  
represents	
  a	
  source	
  for	
  ongoing	
  discovery	
  
and	
  analysis.”	
  	
  
-­‐	
  Lisa	
  Arthur,	
  Forbes	
  
Big	
  Data	
  use	
  cases	
  
DATA	
  
Make	
  unstructured	
  info	
  transparent	
  
and	
  usable	
  at	
  much	
  higher	
  frequency	
  
Precisely	
  tailor	
  products/services	
  for	
  
beJer	
  analysis	
  and	
  segmenta:on	
  
Improve	
  development	
  of	
  next	
  gen	
  
products/services	
  	
  
Create	
  and	
  store	
  unstructured	
  	
  
transac:onal	
  data	
  
Planning	
  your	
  build	
  
When	
  you’re	
  building	
  big	
  data	
  applica:ons	
  you	
  have	
  to	
  have	
  a	
  view	
  
of	
  the	
  complete	
  Stack	
  
The	
  Stack	
  
Requirements	
  of	
  Big	
  Data	
  ApplicaEons	
  
•  Big	
  Data	
  is	
  power	
  hungry	
  
•  10	
  or	
  40Gbps	
  networks	
  at	
  a	
  
minimum	
  
•  Big	
  Data	
  is	
  distributed	
  	
  
•  Big	
  Data	
  is	
  monitoring	
  intensive	
  	
  
–  Requires	
  accurate,	
  specific	
  and	
  
frequent	
  diagnos:cs	
  to	
  run	
  properly	
  
•  Big	
  Data	
  apps	
  require	
  tons	
  of	
  
memory	
  and	
  storage	
  
•  Applica:on	
  Support	
  Tools	
  
	
  
What	
  do	
  you	
  need	
  for	
  the	
  back	
  end?	
  
Take	
  a	
  big	
  task	
  and	
  divide	
  into	
  smaller,	
  discrete	
  tasks	
  that	
  can	
  be	
  carried	
  
out	
  in	
  parallel	
  
In	
  the	
  cloud,	
  your	
  data	
  could	
  be	
  spread	
  across	
  mul:ple	
  servers	
  
Because	
  of	
  this	
  complexity,	
  the	
  task	
  needs	
  to	
  be	
  divided	
  into	
  smaller	
  tasks	
  
Choosing	
  a	
  hos:ng	
  op:on	
  for	
  your	
  project	
  
In-­‐house	
  
vs.	
  
Cloud	
  
vs.	
  
Coloca:on	
  
vs.	
  
Dedicated	
  Managed	
  Hos:ng	
  
(Opera:ons	
  as	
  a	
  Service)	
  
Management	
  vs.	
  Resources	
  
Shared	
   Dedicated	
  
Fully	
  Managed	
  
Unmanaged	
  
[Management]	
  
[Resources]	
  
Cloud	
  
Coloca:on	
  
DIY	
  
OaaS	
  	
  
(Dedicated	
  Managed	
  Hos:ng)	
  
In-­‐house	
  –	
  What	
  you	
  get	
  
•  Purpose	
  built	
  system	
  (custom	
  design)	
  =	
  Fast!	
  
	
  
•  Minimal	
  Packet	
  Loss,	
  JiJer	
  and	
  Latency	
  
•  Single	
  Tenant	
  
	
  
•  Reduced/No	
  Server	
  or	
  Data	
  Sprawl	
  
	
  
•  Transparent	
  Infrastructure	
  
•  10	
  or	
  40Gbps	
  Network	
  
	
  
✓	
  
✓	
  
✓	
  
✓	
  
✓	
  
✓	
  
Challenges	
  of	
  Big	
  Data	
  w/	
  In-­‐House	
  hos:ng	
  
•  Do	
  you	
  have	
  the	
  experience	
  and	
  knowledge	
  to	
  
design,	
  build	
  and	
  maintain	
  the	
  network?	
  
	
  
§  Have	
  you	
  thought	
  about	
  the	
  total	
  costs?	
  
– Data	
  center	
  costs	
  	
  
– Equipment	
  costs	
  
– Staffing	
  costs	
  
– Applica:on	
  Support	
  costs	
  
	
  
•  Did	
  you	
  factor	
  in	
  applica:on	
  support	
  tools?	
  
	
  
•  Do	
  you	
  want	
  to	
  be	
  an	
  internet	
  plumber?	
  
$	
  
Cloud	
  –	
  what	
  you	
  get	
  
•  Quick	
  spin-­‐up	
  :me	
  
	
  
•  Lower	
  equipment	
  costs	
  
	
  
•  Lower	
  personnel	
  costs	
  for	
  infrastructure	
  
support	
  
✓	
  
✓	
  
✓	
  
Challenges	
  of	
  Big	
  Data	
  in	
  a	
  Cloud	
  environment	
  
•  Would	
  your	
  opera:ons	
  be	
  adversely	
  affected	
  by	
  
packet	
  loss,	
  jiJer	
  and	
  latency?	
  
	
  
•  Do	
  you	
  want	
  to	
  share	
  resources	
  with	
  other	
  
companies	
  on	
  a	
  system	
  that’s	
  designed	
  to	
  be	
  big,	
  
but	
  not	
  fast?	
  
	
  
	
  
•  Does	
  your	
  data	
  need	
  to	
  be	
  “in	
  one	
  place”?	
  
	
  
•  Distributed	
  data	
  puts	
  a	
  stress	
  on	
  the	
  network	
  that	
  
most	
  cloud	
  environments	
  were	
  not	
  designed	
  for	
  
	
  
! !
Challenges	
  of	
  Big	
  Data	
  in	
  a	
  Cloud	
  environment	
  	
  
•  Is	
  the	
  cloud	
  provider	
  capable	
  of	
  providing	
  the	
  intensive	
  
monitoring	
  needed	
  by	
  Big	
  Data	
  applica:ons?	
  
–  Requires	
  accurate,	
  specific	
  and	
  frequent	
  diagnos:cs	
  
to	
  run	
  properly	
  
–  The	
  privacy	
  of	
  the	
  cloud	
  works	
  against	
  efficiency	
  
Coloca:on	
  Hos:ng–	
  what	
  you	
  get	
  
•  Lower	
  equipment	
  costs	
  
•  Control	
  over	
  non-­‐data	
  center	
  
infrastructure	
  (servers,	
  network,	
  etc.)	
  
•  Not	
  responsible	
  for	
  data	
  center	
  design,	
  
build	
  or	
  maintenance	
  
•  No	
  tech	
  support	
  for	
  equipment	
  
•  Single-­‐tenancy	
  
✓	
  
✓	
  
✓	
  
✓	
  
✓	
  
Challenges	
  of	
  Big	
  Data	
  in	
  a	
  Coloca:on	
  Environment	
  
•  Do	
  you	
  want	
  to	
  be	
  responsible	
  for	
  all	
  non-­‐data	
  center	
  
support?	
  
	
  
•  Are	
  you	
  comfortable	
  with	
  having	
  no	
  applica:on	
  
support?	
  
	
  
•  Does	
  the	
  provider	
  custom-­‐design	
  your	
  architecture,	
  
or	
  rely	
  on	
  a	
  ‘one	
  size	
  fits	
  most’	
  deployment?	
  	
  
	
  
•  What	
  hardware	
  is	
  single-­‐tenant,	
  and	
  what	
  is	
  mul:-­‐
tenant/shared,	
  and	
  would	
  the	
  shared	
  elements	
  
impact	
  your	
  opera:ons?	
  
	
  
Opera:ons-­‐as-­‐a-­‐Service	
  (Dedicated	
  Managed	
  Hos:ng)	
  
OperaEons-­‐as-­‐a-­‐Service	
  
In-­‐House	
   Cloud	
   ColocaEon	
   OaaS	
  via	
  Peak	
  
HosEng	
  
Minimal	
  Packet	
  
Loss,	
  JiQer	
  and	
  
Latency	
  
þ	
   ý	
   Maybe	
   þ	
  
Single	
  Tenant	
  
þ	
   ý	
   þ	
   þ	
  
Reduced/No	
  Server	
  
or	
  Data	
  Sprawl	
  
þ	
   ý	
   Maybe	
   þ	
  
DC	
  Techs	
  Supplied	
  
ý	
   þ	
   þ	
   þ	
  
SysAdmin	
  Supplied	
  
ý	
   ý	
   ý	
   þ	
  
Transparent	
  
Infrastructure	
  
þ	
   ý	
   þ	
   þ	
  
Custom	
  Design	
  
þ	
   þ	
   Maybe	
   þ	
  
10	
  or	
  40Gbps	
  
Network	
  
þ	
   ?	
   ?	
   þ	
  
ApplicaEon	
  Support	
  
tools	
   ý	
   ý	
   ý	
   þ	
  
Peak	
  Hos:ng	
  Customer	
  Close-­‐up	
  
Big	
  social	
  data	
  analy:cs	
  company,	
  delivering	
  advanced	
  social	
  
intelligence	
  and	
  real-­‐:me	
  threat	
  detec:on	
  across	
  the	
  consumer	
  
packaged	
  goods,	
  food	
  and	
  beverage,	
  media	
  and	
  entertainment	
  
and	
  pharmaceu:cal	
  industries.	
  	
  
Akuda	
  Labs’	
  Pulsar	
  real-­‐:me	
  streaming	
  classifica:on	
  engine	
  
available,	
  currently	
  processing	
  5	
  Billion	
  SCOPS	
  (was	
  500	
  million	
  
when	
  the	
  came	
  to	
  Peak	
  Hos:ng)	
  for	
  their	
  product,	
  ListenLogic	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  -­‐	
  The	
  search	
  
•  Needs:	
  
–  At	
  least	
  1	
  Billion	
  SCOPS	
  processing	
  power	
  to	
  
run	
  Hadoop-­‐level,	
  deep	
  dive	
  ques:ons	
  
–  Answers	
  in	
  real-­‐:me	
  
Build	
  vs.	
  Buy?	
  
Cloud	
   DIY	
  (Build)	
   Dedicated	
  Managed	
  HosEng	
  
Not	
  an	
  op:on	
  due	
  
to	
  shared	
  and	
  
distributed	
  
infrastructure	
  in	
  a	
  
cloud	
  environment	
  
•  Total	
  control	
  
•  EXPENSIVE	
  $$$	
  
	
  	
  	
  	
  	
  	
  	
  -­‐	
  HW	
  	
  
	
  	
  	
  	
  	
  	
  	
  -­‐	
  Staffing	
  
•  Their	
  best	
  op:on	
  
•  Now,	
  which	
  provider?	
  
-­‐	
  The	
  choice	
  
Best	
  performing	
  hardware	
  
Fast	
  network	
  	
  
Customized	
  Infrastructure	
  –	
  designed	
  
specifically	
  for	
  Akuda	
  Labs	
   !	
  
þ	
  
Technical	
  Support	
  staff	
  
þ	
  
OperaEons-­‐as-­‐a-­‐Service	
  
þ	
  
-­‐	
  What	
  we	
  did	
  
2012:	
  	
  
•  Provided	
  40-­‐50	
  servers	
  –	
  24	
  &	
  34	
  core	
  machines	
  w/	
  128GB	
  RAM	
  
	
  
2013:	
  	
  
•  Akuda	
  upgrades	
  to	
  64-­‐core	
  servers	
  w/	
  512GB	
  RAM	
  
•  S:ll	
  only	
  40-­‐50	
  servers	
  
•  Connected	
  via	
  dual	
  10Gbps	
  networking	
  
	
  Pool	
  servers	
  for	
  customers	
  and	
  simply	
  add	
  more	
  servers	
  to	
  the	
  
pool	
  as	
  needed	
  –	
  rather	
  than	
  deploy	
  a	
  new	
  cluster	
  per	
  customer	
  
New	
  Abili:es	
  
Process	
  100X	
  the	
  data	
  they	
  previously	
  could	
  
Easily	
  process	
  500	
  million	
  SCOPS,	
  with	
  the	
  ability	
  to	
  process	
  50	
  billion	
  if	
  
they	
  had	
  enough	
  data	
  
-­‐	
  The	
  ROI	
  
BeQer	
  Efficiency	
  
BeQer	
  Service	
  BeQer	
  Economics	
  
More	
  ProducEvity	
  
Trim	
  server	
  count	
  by	
  20%	
   Schedule	
  tasks	
  on-­‐demand	
  
instead	
  of	
  wai:ng	
  for	
  
resources	
  
BeJer	
  performance,	
  higher	
  levels	
  of	
  
customiza:on	
  and	
  produc:vity	
  
	
  
All	
  while	
  paying	
  30%	
  less	
  than	
  with	
  
previous	
  provider	
  
Worked	
  together	
  to	
  design,	
  build,	
  
maintain,	
  and	
  support	
  current	
  
infrastructure	
  	
  
In	
  conclusion	
  

More Related Content

What's hot

Don't be Hadooped when looking for Big Data ROI
Don't be Hadooped when looking for Big Data ROIDon't be Hadooped when looking for Big Data ROI
Don't be Hadooped when looking for Big Data ROIDataWorks Summit
 
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenariosThe Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarioskcmallu
 
Key trends in Big Data and new reference architecture from Hewlett Packard En...
Key trends in Big Data and new reference architecture from Hewlett Packard En...Key trends in Big Data and new reference architecture from Hewlett Packard En...
Key trends in Big Data and new reference architecture from Hewlett Packard En...Ontico
 
Give Your Organization Better, Faster Insights & Answers with High Performanc...
Give Your Organization Better, Faster Insights & Answers with High Performanc...Give Your Organization Better, Faster Insights & Answers with High Performanc...
Give Your Organization Better, Faster Insights & Answers with High Performanc...Dell World
 
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
 Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ... Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...Cloudera, Inc.
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data AnalyticsAmazon Web Services
 
Part 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science WorkbenchPart 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science WorkbenchCloudera, Inc.
 
start_your_datacenter_sds_v3
start_your_datacenter_sds_v3start_your_datacenter_sds_v3
start_your_datacenter_sds_v3David Byte
 
Zeta Architecture: The Next Generation Big Data Architecture
Zeta Architecture: The Next Generation Big Data ArchitectureZeta Architecture: The Next Generation Big Data Architecture
Zeta Architecture: The Next Generation Big Data ArchitectureMapR Technologies
 
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the CloudPart 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the CloudCloudera, Inc.
 
Manage easier, deliver faster, innovate more - Top 10 facts on Dell Enterpris...
Manage easier, deliver faster, innovate more - Top 10 facts on Dell Enterpris...Manage easier, deliver faster, innovate more - Top 10 facts on Dell Enterpris...
Manage easier, deliver faster, innovate more - Top 10 facts on Dell Enterpris...Dell World
 
Dell - HPC-29mai2012
Dell - HPC-29mai2012Dell - HPC-29mai2012
Dell - HPC-29mai2012Agora Group
 
File Server and Storage Consolidation in the Cloud
File Server and Storage Consolidation in the CloudFile Server and Storage Consolidation in the Cloud
File Server and Storage Consolidation in the CloudBuurst
 
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud WorldPart 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud WorldCloudera, Inc.
 
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...Cloudera, Inc.
 
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...DataWorks Summit
 
Make a Move to AWS Now
Make a Move to AWS Now Make a Move to AWS Now
Make a Move to AWS Now Buurst
 
Migrate Existing Applications to AWS without Re-engineering
Migrate Existing Applications to AWS without Re-engineeringMigrate Existing Applications to AWS without Re-engineering
Migrate Existing Applications to AWS without Re-engineeringBuurst
 
12 Architectural Requirements for Protecting Business Data in the Cloud
12 Architectural Requirements for Protecting Business Data in the Cloud12 Architectural Requirements for Protecting Business Data in the Cloud
12 Architectural Requirements for Protecting Business Data in the CloudBuurst
 

What's hot (20)

Don't be Hadooped when looking for Big Data ROI
Don't be Hadooped when looking for Big Data ROIDon't be Hadooped when looking for Big Data ROI
Don't be Hadooped when looking for Big Data ROI
 
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenariosThe Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
 
Key trends in Big Data and new reference architecture from Hewlett Packard En...
Key trends in Big Data and new reference architecture from Hewlett Packard En...Key trends in Big Data and new reference architecture from Hewlett Packard En...
Key trends in Big Data and new reference architecture from Hewlett Packard En...
 
Big Data Introduction
Big Data IntroductionBig Data Introduction
Big Data Introduction
 
Give Your Organization Better, Faster Insights & Answers with High Performanc...
Give Your Organization Better, Faster Insights & Answers with High Performanc...Give Your Organization Better, Faster Insights & Answers with High Performanc...
Give Your Organization Better, Faster Insights & Answers with High Performanc...
 
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
 Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ... Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data Analytics
 
Part 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science WorkbenchPart 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science Workbench
 
start_your_datacenter_sds_v3
start_your_datacenter_sds_v3start_your_datacenter_sds_v3
start_your_datacenter_sds_v3
 
Zeta Architecture: The Next Generation Big Data Architecture
Zeta Architecture: The Next Generation Big Data ArchitectureZeta Architecture: The Next Generation Big Data Architecture
Zeta Architecture: The Next Generation Big Data Architecture
 
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the CloudPart 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
 
Manage easier, deliver faster, innovate more - Top 10 facts on Dell Enterpris...
Manage easier, deliver faster, innovate more - Top 10 facts on Dell Enterpris...Manage easier, deliver faster, innovate more - Top 10 facts on Dell Enterpris...
Manage easier, deliver faster, innovate more - Top 10 facts on Dell Enterpris...
 
Dell - HPC-29mai2012
Dell - HPC-29mai2012Dell - HPC-29mai2012
Dell - HPC-29mai2012
 
File Server and Storage Consolidation in the Cloud
File Server and Storage Consolidation in the CloudFile Server and Storage Consolidation in the Cloud
File Server and Storage Consolidation in the Cloud
 
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud WorldPart 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
 
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
 
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
 
Make a Move to AWS Now
Make a Move to AWS Now Make a Move to AWS Now
Make a Move to AWS Now
 
Migrate Existing Applications to AWS without Re-engineering
Migrate Existing Applications to AWS without Re-engineeringMigrate Existing Applications to AWS without Re-engineering
Migrate Existing Applications to AWS without Re-engineering
 
12 Architectural Requirements for Protecting Business Data in the Cloud
12 Architectural Requirements for Protecting Business Data in the Cloud12 Architectural Requirements for Protecting Business Data in the Cloud
12 Architectural Requirements for Protecting Business Data in the Cloud
 

Similar to How to Choose a Host for a Big Data Project

Journey to the Programmable Data Center
Journey to the Programmable Data CenterJourney to the Programmable Data Center
Journey to the Programmable Data CenterToby Weiss
 
Desktop as a service (daas)
Desktop as a service (daas)Desktop as a service (daas)
Desktop as a service (daas)johndorian555
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud ComputingUOS
 
Kognitio overview jan 2013
Kognitio overview jan 2013Kognitio overview jan 2013
Kognitio overview jan 2013Michael Hiskey
 
Kognitio overview jan 2013
Kognitio overview jan 2013Kognitio overview jan 2013
Kognitio overview jan 2013Kognitio
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Can Your Mobile Infrastructure Survive 1 Million Concurrent Users?
Can Your Mobile Infrastructure Survive 1 Million Concurrent Users?Can Your Mobile Infrastructure Survive 1 Million Concurrent Users?
Can Your Mobile Infrastructure Survive 1 Million Concurrent Users?TechWell
 
IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data IBM
 
How To Tell if Your Business Needs NoSQL
How To Tell if Your Business Needs NoSQLHow To Tell if Your Business Needs NoSQL
How To Tell if Your Business Needs NoSQLDataStax
 
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...DataWorks Summit
 
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...DataStax
 
Fueling AI & Machine Learning: Legacy Data as a Competitive Advantage
Fueling AI & Machine Learning: Legacy Data as a Competitive AdvantageFueling AI & Machine Learning: Legacy Data as a Competitive Advantage
Fueling AI & Machine Learning: Legacy Data as a Competitive AdvantagePrecisely
 
Oracle big data appliance and solutions
Oracle big data appliance and solutionsOracle big data appliance and solutions
Oracle big data appliance and solutionssolarisyougood
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database RoundtableEric Kavanagh
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...DATAVERSITY
 
Financial impact of Cloud Computing
Financial impact of Cloud ComputingFinancial impact of Cloud Computing
Financial impact of Cloud Computingkrisbliesner
 
Data Pipelines with Spark & DataStax Enterprise
Data Pipelines with Spark & DataStax EnterpriseData Pipelines with Spark & DataStax Enterprise
Data Pipelines with Spark & DataStax EnterpriseDataStax
 
Managing Performance in the Cloud
Managing Performance in the CloudManaging Performance in the Cloud
Managing Performance in the CloudDevOpsGroup
 
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...Splunk
 
AWS Sydney Summit 2013 - Big Data Analytics
AWS Sydney Summit 2013 - Big Data AnalyticsAWS Sydney Summit 2013 - Big Data Analytics
AWS Sydney Summit 2013 - Big Data AnalyticsAmazon Web Services
 

Similar to How to Choose a Host for a Big Data Project (20)

Journey to the Programmable Data Center
Journey to the Programmable Data CenterJourney to the Programmable Data Center
Journey to the Programmable Data Center
 
Desktop as a service (daas)
Desktop as a service (daas)Desktop as a service (daas)
Desktop as a service (daas)
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computing
 
Kognitio overview jan 2013
Kognitio overview jan 2013Kognitio overview jan 2013
Kognitio overview jan 2013
 
Kognitio overview jan 2013
Kognitio overview jan 2013Kognitio overview jan 2013
Kognitio overview jan 2013
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Can Your Mobile Infrastructure Survive 1 Million Concurrent Users?
Can Your Mobile Infrastructure Survive 1 Million Concurrent Users?Can Your Mobile Infrastructure Survive 1 Million Concurrent Users?
Can Your Mobile Infrastructure Survive 1 Million Concurrent Users?
 
IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data
 
How To Tell if Your Business Needs NoSQL
How To Tell if Your Business Needs NoSQLHow To Tell if Your Business Needs NoSQL
How To Tell if Your Business Needs NoSQL
 
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
 
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
 
Fueling AI & Machine Learning: Legacy Data as a Competitive Advantage
Fueling AI & Machine Learning: Legacy Data as a Competitive AdvantageFueling AI & Machine Learning: Legacy Data as a Competitive Advantage
Fueling AI & Machine Learning: Legacy Data as a Competitive Advantage
 
Oracle big data appliance and solutions
Oracle big data appliance and solutionsOracle big data appliance and solutions
Oracle big data appliance and solutions
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
 
Financial impact of Cloud Computing
Financial impact of Cloud ComputingFinancial impact of Cloud Computing
Financial impact of Cloud Computing
 
Data Pipelines with Spark & DataStax Enterprise
Data Pipelines with Spark & DataStax EnterpriseData Pipelines with Spark & DataStax Enterprise
Data Pipelines with Spark & DataStax Enterprise
 
Managing Performance in the Cloud
Managing Performance in the CloudManaging Performance in the Cloud
Managing Performance in the Cloud
 
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
 
AWS Sydney Summit 2013 - Big Data Analytics
AWS Sydney Summit 2013 - Big Data AnalyticsAWS Sydney Summit 2013 - Big Data Analytics
AWS Sydney Summit 2013 - Big Data Analytics
 

More from Peak Hosting

Peak Hosting Corporate brochure
Peak Hosting Corporate brochurePeak Hosting Corporate brochure
Peak Hosting Corporate brochurePeak Hosting
 
Peak hosting Employee Benefits
Peak hosting Employee BenefitsPeak hosting Employee Benefits
Peak hosting Employee BenefitsPeak Hosting
 
Peak Hosting Differentiators Slides
Peak Hosting Differentiators SlidesPeak Hosting Differentiators Slides
Peak Hosting Differentiators SlidesPeak Hosting
 
Webinar - Order out of Chaos: Avoiding the Migration Migraine
Webinar - Order out of Chaos: Avoiding the Migration MigraineWebinar - Order out of Chaos: Avoiding the Migration Migraine
Webinar - Order out of Chaos: Avoiding the Migration MigrainePeak Hosting
 
Webinar | So You Think You Know the Cloud: Hosting Alternatives You May Not K...
Webinar | So You Think You Know the Cloud: Hosting Alternatives You May Not K...Webinar | So You Think You Know the Cloud: Hosting Alternatives You May Not K...
Webinar | So You Think You Know the Cloud: Hosting Alternatives You May Not K...Peak Hosting
 
5 Reasons to Choose a Custom Managed Architecture v. Cloud
5 Reasons to Choose a Custom Managed Architecture v. Cloud5 Reasons to Choose a Custom Managed Architecture v. Cloud
5 Reasons to Choose a Custom Managed Architecture v. CloudPeak Hosting
 

More from Peak Hosting (6)

Peak Hosting Corporate brochure
Peak Hosting Corporate brochurePeak Hosting Corporate brochure
Peak Hosting Corporate brochure
 
Peak hosting Employee Benefits
Peak hosting Employee BenefitsPeak hosting Employee Benefits
Peak hosting Employee Benefits
 
Peak Hosting Differentiators Slides
Peak Hosting Differentiators SlidesPeak Hosting Differentiators Slides
Peak Hosting Differentiators Slides
 
Webinar - Order out of Chaos: Avoiding the Migration Migraine
Webinar - Order out of Chaos: Avoiding the Migration MigraineWebinar - Order out of Chaos: Avoiding the Migration Migraine
Webinar - Order out of Chaos: Avoiding the Migration Migraine
 
Webinar | So You Think You Know the Cloud: Hosting Alternatives You May Not K...
Webinar | So You Think You Know the Cloud: Hosting Alternatives You May Not K...Webinar | So You Think You Know the Cloud: Hosting Alternatives You May Not K...
Webinar | So You Think You Know the Cloud: Hosting Alternatives You May Not K...
 
5 Reasons to Choose a Custom Managed Architecture v. Cloud
5 Reasons to Choose a Custom Managed Architecture v. Cloud5 Reasons to Choose a Custom Managed Architecture v. Cloud
5 Reasons to Choose a Custom Managed Architecture v. Cloud
 

Recently uploaded

My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 

Recently uploaded (20)

My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 

How to Choose a Host for a Big Data Project

  • 1. Presenter:  John  Johnson   Date:  01/01/14   PRESENTATION  TITLE   Everything  but  your  code®  
  • 2. Contents   •  What  does  “Big  Data”  really  mean?   •  Big  Data  use  cases   •  Considera:ons  when  building  your   project/applica:on   •  Hos:ng  op:ons  and  Big  Data  challenges   •  Opera:ons-­‐as-­‐a-­‐Service   •  Customer  close-­‐up  
  • 3. “Big  data  is  a  collec.on  of  [unstructured]   data  from  tradi.onal  and  digital  sources   inside  and  outside  your  company  that   represents  a  source  for  ongoing  discovery   and  analysis.”     -­‐  Lisa  Arthur,  Forbes  
  • 4. Big  Data  use  cases   DATA   Make  unstructured  info  transparent   and  usable  at  much  higher  frequency   Precisely  tailor  products/services  for   beJer  analysis  and  segmenta:on   Improve  development  of  next  gen   products/services     Create  and  store  unstructured     transac:onal  data  
  • 5. Planning  your  build   When  you’re  building  big  data  applica:ons  you  have  to  have  a  view   of  the  complete  Stack   The  Stack  
  • 6. Requirements  of  Big  Data  ApplicaEons   •  Big  Data  is  power  hungry   •  10  or  40Gbps  networks  at  a   minimum   •  Big  Data  is  distributed     •  Big  Data  is  monitoring  intensive     –  Requires  accurate,  specific  and   frequent  diagnos:cs  to  run  properly   •  Big  Data  apps  require  tons  of   memory  and  storage   •  Applica:on  Support  Tools    
  • 7. What  do  you  need  for  the  back  end?   Take  a  big  task  and  divide  into  smaller,  discrete  tasks  that  can  be  carried   out  in  parallel   In  the  cloud,  your  data  could  be  spread  across  mul:ple  servers   Because  of  this  complexity,  the  task  needs  to  be  divided  into  smaller  tasks  
  • 8. Choosing  a  hos:ng  op:on  for  your  project   In-­‐house   vs.   Cloud   vs.   Coloca:on   vs.   Dedicated  Managed  Hos:ng   (Opera:ons  as  a  Service)  
  • 9. Management  vs.  Resources   Shared   Dedicated   Fully  Managed   Unmanaged   [Management]   [Resources]   Cloud   Coloca:on   DIY   OaaS     (Dedicated  Managed  Hos:ng)  
  • 10. In-­‐house  –  What  you  get   •  Purpose  built  system  (custom  design)  =  Fast!     •  Minimal  Packet  Loss,  JiJer  and  Latency   •  Single  Tenant     •  Reduced/No  Server  or  Data  Sprawl     •  Transparent  Infrastructure   •  10  or  40Gbps  Network     ✓   ✓   ✓   ✓   ✓   ✓  
  • 11. Challenges  of  Big  Data  w/  In-­‐House  hos:ng   •  Do  you  have  the  experience  and  knowledge  to   design,  build  and  maintain  the  network?     §  Have  you  thought  about  the  total  costs?   – Data  center  costs     – Equipment  costs   – Staffing  costs   – Applica:on  Support  costs     •  Did  you  factor  in  applica:on  support  tools?     •  Do  you  want  to  be  an  internet  plumber?   $  
  • 12. Cloud  –  what  you  get   •  Quick  spin-­‐up  :me     •  Lower  equipment  costs     •  Lower  personnel  costs  for  infrastructure   support   ✓   ✓   ✓  
  • 13. Challenges  of  Big  Data  in  a  Cloud  environment   •  Would  your  opera:ons  be  adversely  affected  by   packet  loss,  jiJer  and  latency?     •  Do  you  want  to  share  resources  with  other   companies  on  a  system  that’s  designed  to  be  big,   but  not  fast?       •  Does  your  data  need  to  be  “in  one  place”?     •  Distributed  data  puts  a  stress  on  the  network  that   most  cloud  environments  were  not  designed  for     ! !
  • 14. Challenges  of  Big  Data  in  a  Cloud  environment     •  Is  the  cloud  provider  capable  of  providing  the  intensive   monitoring  needed  by  Big  Data  applica:ons?   –  Requires  accurate,  specific  and  frequent  diagnos:cs   to  run  properly   –  The  privacy  of  the  cloud  works  against  efficiency  
  • 15. Coloca:on  Hos:ng–  what  you  get   •  Lower  equipment  costs   •  Control  over  non-­‐data  center   infrastructure  (servers,  network,  etc.)   •  Not  responsible  for  data  center  design,   build  or  maintenance   •  No  tech  support  for  equipment   •  Single-­‐tenancy   ✓   ✓   ✓   ✓   ✓  
  • 16. Challenges  of  Big  Data  in  a  Coloca:on  Environment   •  Do  you  want  to  be  responsible  for  all  non-­‐data  center   support?     •  Are  you  comfortable  with  having  no  applica:on   support?     •  Does  the  provider  custom-­‐design  your  architecture,   or  rely  on  a  ‘one  size  fits  most’  deployment?       •  What  hardware  is  single-­‐tenant,  and  what  is  mul:-­‐ tenant/shared,  and  would  the  shared  elements   impact  your  opera:ons?    
  • 17. Opera:ons-­‐as-­‐a-­‐Service  (Dedicated  Managed  Hos:ng)   OperaEons-­‐as-­‐a-­‐Service   In-­‐House   Cloud   ColocaEon   OaaS  via  Peak   HosEng   Minimal  Packet   Loss,  JiQer  and   Latency   þ   ý   Maybe   þ   Single  Tenant   þ   ý   þ   þ   Reduced/No  Server   or  Data  Sprawl   þ   ý   Maybe   þ   DC  Techs  Supplied   ý   þ   þ   þ   SysAdmin  Supplied   ý   ý   ý   þ   Transparent   Infrastructure   þ   ý   þ   þ   Custom  Design   þ   þ   Maybe   þ   10  or  40Gbps   Network   þ   ?   ?   þ   ApplicaEon  Support   tools   ý   ý   ý   þ  
  • 18. Peak  Hos:ng  Customer  Close-­‐up   Big  social  data  analy:cs  company,  delivering  advanced  social   intelligence  and  real-­‐:me  threat  detec:on  across  the  consumer   packaged  goods,  food  and  beverage,  media  and  entertainment   and  pharmaceu:cal  industries.     Akuda  Labs’  Pulsar  real-­‐:me  streaming  classifica:on  engine   available,  currently  processing  5  Billion  SCOPS  (was  500  million   when  the  came  to  Peak  Hos:ng)  for  their  product,  ListenLogic                                                                
  • 19.                                                          -­‐  The  search   •  Needs:   –  At  least  1  Billion  SCOPS  processing  power  to   run  Hadoop-­‐level,  deep  dive  ques:ons   –  Answers  in  real-­‐:me   Build  vs.  Buy?   Cloud   DIY  (Build)   Dedicated  Managed  HosEng   Not  an  op:on  due   to  shared  and   distributed   infrastructure  in  a   cloud  environment   •  Total  control   •  EXPENSIVE  $$$                -­‐  HW                  -­‐  Staffing   •  Their  best  op:on   •  Now,  which  provider?  
  • 20. -­‐  The  choice   Best  performing  hardware   Fast  network     Customized  Infrastructure  –  designed   specifically  for  Akuda  Labs   !   þ   Technical  Support  staff   þ   OperaEons-­‐as-­‐a-­‐Service   þ  
  • 21. -­‐  What  we  did   2012:     •  Provided  40-­‐50  servers  –  24  &  34  core  machines  w/  128GB  RAM     2013:     •  Akuda  upgrades  to  64-­‐core  servers  w/  512GB  RAM   •  S:ll  only  40-­‐50  servers   •  Connected  via  dual  10Gbps  networking    Pool  servers  for  customers  and  simply  add  more  servers  to  the   pool  as  needed  –  rather  than  deploy  a  new  cluster  per  customer   New  Abili:es   Process  100X  the  data  they  previously  could   Easily  process  500  million  SCOPS,  with  the  ability  to  process  50  billion  if   they  had  enough  data  
  • 22. -­‐  The  ROI   BeQer  Efficiency   BeQer  Service  BeQer  Economics   More  ProducEvity   Trim  server  count  by  20%   Schedule  tasks  on-­‐demand   instead  of  wai:ng  for   resources   BeJer  performance,  higher  levels  of   customiza:on  and  produc:vity     All  while  paying  30%  less  than  with   previous  provider   Worked  together  to  design,  build,   maintain,  and  support  current   infrastructure