• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Does Big Data Spell Big Costs- Impetus Webinar
 

Does Big Data Spell Big Costs- Impetus Webinar

on

  • 1,100 views

Impetus webinar on ‘Does Big Data Spell Big Costs? ‘

Impetus webinar on ‘Does Big Data Spell Big Costs? ‘

Register at http://bit.ly/zAXFmV

Statistics

Views

Total Views
1,100
Views on SlideShare
1,100
Embed Views
0

Actions

Likes
1
Downloads
0
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • Almost all commercial MPP offer similar capabilities
  • Almost all commercial MPP offer similar capabilities

Does Big Data Spell Big Costs- Impetus Webinar Does Big Data Spell Big Costs- Impetus Webinar Presentation Transcript

  • Does Big Data Spell Big Costs ? March 30 10:00 am PT/ 1:00 pm ET @ impetuscalling Recorded version available athttp://www.impetus.com/webinar_registration?event=archived&eid=56
  • Outline  Big Data – Current Scenario  Cost components in a Big Data Warehouse  Best Practices - Reducing the cost of Big Data solutions  Cost of storage  Technologies- What and Where?  Big Data strategies  Our recommendations to reduce TCO Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 2
  • Big Data - Current Scenario 2.5 Quintillion Bytes produced every day $6 Trillion Big data cost IDC/EMC Cost of wasted productivity because of $650 Billion information overload 1ZB Estimated Internet Traffic by 2015 1800EB Size of the digital universe in 2011 90% of the data in the world today has been 90% created in the last two years alone 18 Months Estimated time for the digital universe to double Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 3
  • Age of Data Age of Software Age of Data Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 4
  • Existing State of Big Data Solutions Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 5
  • Using Commodity H/w for Big Data Commodity Hardware  Pros  Build your own  The promise of innovation  Cons  Building reliable storage – $1 per GB  Add the cost of managing / monitoring / hosting Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 6
  • Using Open Source & Cloud Computing Open Source  Pros  Software is free !! Glory to the Elephant  Cons  Cost of Training – thinking parallel is not intuitive  Cost of Support – support is not free Cloud Computing  Pros  Rent what you need  Cons  $14,000 a month for 100 TB data – storage only Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 7
  • Big Data Warehouse – Cost Components  Initial entry costs- Cost of experimentation  Cost of integration and moving data - Cost of ETL  Query and analytics capability  Manageability  On-going maintenance - Monitoring and tuning  Changing capacity - Additional hardware  Cost of compliance Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 8
  • Lowering TCO of Big Data  Hardware  Lower cost of storage  Lower cost of computation  Software  Make things faster  Do more with less Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 9
  • How to reduce the cost of storage?  Compress – RainStor and similar solutions  Just make sure your „Read Throughput‟ is high  Retain all v/s load & process  Setup “data pipelines” or use ILM Principles  Creation and Receipt  Distribution  Use  Maintenance  Disposition  Focus on Big Data but don‟t forget the “Small Data” Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 10
  • Technologies: What and Where? What?  Open Source vs. Commercial software?  Specialized hardware/appliances vs. commodity hardware?  Vendor lock-in vs. vendor independence?  Cost of latencies?  Cloud? Where?  OLTP - NoSQL v/s OLAP - DW (MapReduce & MPP) Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 11
  • OLAP: Big Data Scenarios Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 12
  • Data Tapping Point, Cost & Latency Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 13
  • Indirect Analytics over Hadoop Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 14
  • Direct Analytics over Hadoop Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 15
  • Analytics over Hadoop with MPP DW Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 16
  • Selecting the Right Technology Key considerations  $ per TB  Business Continuity/ Cost/ Vendor Lock-in  Latency Needs Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 17
  • Choosing MPP $ per TB Driven  EMC Greenplum  Teradata, Aster  HP Vertica  Oracle Exadata  Netezza  ParAccel  Others Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 18
  • Faster Map Reduce & Hadoop Business Continuity/ Cost/ Vendor Lock-in  MapR  HPCC  Hadapt  Pervasive DataRush, HStreaming  Cloud Map Reduce  DataStax  Platform Computing  MARS, GPMR  ParStream Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 19
  • OLTP: NoSQL Solutions Latency Needs  Column stores  HBase, Cassandra  Documents stores  MongoDB, CouchDB  Key stores  Redis, Riak etc.; Kyoto Cabinet/Tokyo Tyrant, Berkley  GraphDB  Neo4j  Cloud stores  SimpleDB Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 20
  • OLTP: New Era RDBMS Versions  Postgres, InfiniDB, Infobright  MySQL Cluster  GridSQL, EnterpriseDB  MS SQL  Sybase IQ  Specialized stores  VoltDB, MarkLogic, Clustrix  Xeround  ParStream  Oracle NoSQL Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 21
  • Recommendations – Cost Components of a Big Data Warehouse  Initial Entry Costs - Cost of Experimentation We recommend – Follow Best Practices , Learn or Hire  Cost of Integration and Moving Data- Cost of ETL We recommend - Remove costly licensed tools, switch to Map Reduce for ETL or ELT  Manageability - Provisioning, management tools We recommend – Opt for multi-vendor management toolsets, e.g. Impetus Ankush  On-Going Maintenance- Monitoring and Tuning We recommend – Automate! Automate! Automate!  Changing Capacity - Additional Hardware Do you know the GPU? Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 22
  • Recommendations – Hardware & Software  Cost of Storage- Compress Data We recommend – Opting for RainStor/ similar solutions  Do More with Less - Faster MR We recommend – MapR/ similar solutions  Acunu and related solutions for NoSQL Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 23
  • About Us  Strategic partners for software product engineering and R&D  Thought leaders in cutting-edge technologies  Mature processes and practices that are methodical, yet flexible  Diverse domain expertise Our services in Big Data and Analytics  Expert consulting  Proof-of-concept & Implementation  Support services Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=56 24
  • Big Data Quick Start ProgramThree Modules Gear up (1 day session) Base Camp (4 day session) Summit (5 day session) Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=56
  • Questions Please send in your questions using the chat panel Recorded version available atImpetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=57 26
  • Thank you For more information, write to us at inquiry@impetus.com @ impetuscalling Recorded version available athttp://www.impetus.com/webinar_registration?event=archived&eid=56