Big Data on Public Cloud

Presentation by Dr. Thanachart Numnonda, IMC Institute, at Thailand Big Data User Group #2, 13 March 2015

Published in: Technology

  1. 1. Big Data on Public Cloud. Assoc. Prof. Dr. Thanachart Numnonda, Executive Director, IMC Institute. 13 March 2015
  2. 2. 2 “By 2015, 20% of Global 1000 organizations will have established a strategic focus on information infrastructure” (Gartner)
  3. 3. 3 Big Data Landscape Source: Big Data in the Enterprise. When to Use What?
  4. 4. 4 Big Data Landscape Source: http://www.vitria.com/
  5. 5. 5
  6. 6. 6 NoSQL
  7. 7. 7 What is Hadoop? A scalable, fault-tolerant distributed system for data storage and processing; written entirely in Java; open source and distributed under the Apache license
  8. 8. 8 Hadoop Environment Source: Hadoop in Practice; Alex Holmes
  9. 9. 9 Major Hadoop Components: Hadoop Distributed File System (HDFS); Map/Reduce System
  10. 10. 10 Hadoop Distribution Microsoft Azure
  11. 11. 11 Big Data Future Architecture: Social Media, Images, E-mails, Crawlers, ERP, CRM, LOB Apps; Unstructured and Structured Data; Parallel Data Warehouse; Hadoop on Cloud; Hadoop on Private Server; Connectors; SSRS; BI Platform; Familiar End User Tools; Spreadsheet; Predictive Analytics; Data Market Place; NoSQL; Petabytes of Data (Unstructured); Hundreds of TB of Data (Structured)
  12. 12. 12 Issue with Big Data Infrastructure: Large investment; Scalability; ROI; Business Cases
  13. 13. 13
  14. 14. 14 Source: http://acloudyplace.com/
  15. 15. 15 Big Data on Cloud: Using IaaS to leverage cloud VMs; Using Big Data as a Service
  16. 16. 16 Big Data Services on Cloud: Amazon Elastic MapReduce; Microsoft Azure Hadoop
  17. 17. 17 Big Data as a Service
  18. 18. 18
  19. 19. 19 Database as a Service: Amazon RDS; IBM SQL Database for Bluemix; Microsoft SQL Database; Google Cloud SQL
  20. 20. 20 NoSQL as a Service: Amazon DynamoDB; Google Cloud Datastore; Microsoft Azure DocumentDB; Cloudant on IBM Bluemix; MongoDB on Heroku
  21. 21. 21 Hadoop as a Service: Amazon Elastic MapReduce; Rackspace Cloud Big Data Platform; Qubole; Google Cloud Platform; IBM Bluemix: Analytic on Hadoop; Microsoft Azure HDInsight
  22. 22. 22
  23. 23. 23
  24. 24. 24 Big Data on Amazon EMR
  25. 25. 25
  26. 26. 26
  27. 27. 27
  28. 28. 28 Big Data on Cloud Roadmap. Step 1: Build the business case. Step 2: Assess your Big Data application workloads. Step 3: Develop a technical approach for deploying and managing Big Data in the cloud. Step 4: Address governance, security, privacy, and risk. Step 5: Deploy, integrate, and operationalize your cloud-based Big Data infrastructure. Source: Deploying Big Data Analytics Applications to the Cloud: Roadmap for Success, CSCC
  29. 29. 29 Assess your application workloads: Big-data storage; Big-data processing; Big-data development. Source: Deploying Big Data Analytics Applications to the Cloud: Roadmap for Success, CSCC
  30. 30. 30 Sample applications: Enterprise applications already hosted in the cloud; High-volume external data sources that require considerable preprocessing; Tactical applications beyond your on-premises Big Data capabilities; Elastic provisioning of very large but short-lived analytic sandboxes. Source: Deploying Big Data Analytics Applications to the Cloud: Roadmap for Success, CSCC
  31. 31. 31 Demo
  32. 32. 32 Amazon DynamoDB
  33. 33. 33 Google BigQuery
  34. 34. 34 Hadoop on Google
  35. 35. 35 Amazon EMR
  36. 36. 36 www.facebook.com/imcinstitute
  37. 37. 37 Thank you thanachart@imcinstitute.com www.facebook.com/imcinstitute www.slideshare.net/imcinstitute www.thanachart.org
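
Slide 9 lists Map/Reduce as one of the two major Hadoop components. As a rough illustration of that programming model only, the sketch below runs a word count in plain Python on a single machine. It does not use Hadoop or any cloud service, and the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical, chosen for this example.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle step: group all emitted values by key.

    In a real cluster this grouping happens across machines; here it is
    a single in-memory dictionary (an assumption made for illustration).
    """
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce step: collapse all values for one key into a single count."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence over the input lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data on cloud", "Big Data as a service"]))
```

The map and reduce functions never share state, which is the property that would let a framework such as Hadoop spread the same two steps across many machines.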
