Bd cloud v3
  • http://qugstart.com/blog/amazon-web-services/how-to-set-up-db-server-on-amazon-ec2-with-data-stored-on-ebs-drive-formatted-with-xfs/ Here’s the procedure I decided on. It involves symlinking MySQL config files and data directories onto the EBS volume. Because I needed to migrate about 20 GiB of data to get started, I used another trick: I initially set up an “X-Large” instance with 10 GiB of RAM to handle the data import. After the data was migrated and imported into my database, I simply terminated the X-Large instance and spun up a small instance connected to the same EBS volume. All the databases were preserved, and I no longer had to pay for an X-Large instance. This exemplifies the value of the “cloud” mindset, where you can spin servers up and down in a matter of seconds. Hope this article helps someone else out there!
  • realm
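The symlinking step in the note above can be sketched roughly as follows. This is a minimal illustration, not the blog author's script: the function name `relocate_to_volume` and the demo paths are hypothetical, and a real MySQL server would need to be stopped before its data directory is moved.

```python
import os
import shutil
import tempfile


def relocate_to_volume(original: str, volume_root: str) -> str:
    """Move `original` onto `volume_root` and leave a symlink at the old path."""
    original = os.path.normpath(original)
    target = os.path.join(volume_root, os.path.basename(original))
    shutil.move(original, target)  # the data now lives on the persistent volume
    os.symlink(target, original)   # the old path still resolves for the database
    return target


if __name__ == "__main__":
    # Hypothetical stand-ins for /var/lib/mysql and the EBS mount point.
    with tempfile.TemporaryDirectory() as root:
        data_dir = os.path.join(root, "mysql_data")
        ebs_mount = os.path.join(root, "vol")
        os.makedirs(data_dir)
        os.makedirs(ebs_mount)
        with open(os.path.join(data_dir, "example.ibd"), "w") as handle:
            handle.write("demo")
        print(relocate_to_volume(data_dir, ebs_mount))
        print(os.path.islink(data_dir))
```

Because the data sits on the volume rather than on the instance, a replacement instance that mounts the same volume and recreates the symlink sees the same files, which is what makes the large-then-small instance swap in the note possible.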

Transcript

  • 1. The Big Data Cloud: Are You Ready for the Zettabyte? Steven C. Markey, MSIS, PMP, CISSP, CIPP, CISM, CISA, STS-EV, CCSK, CompTIA Cloud Essentials; Principal, nControl, LLC; Adjunct Professor; President, Cloud Security Alliance – Delaware Valley Chapter (CSA-DelVal)
  • 2. Big Data Cloud • Presentation Overview – Why Should You Care? – Cloud Overview – Big Data Overview – Cloud-Based Big Data Offerings – Securing Cloud-Based DB Solutions
  • 3. Big Data Cloud • Why Should You Care? – Organizational Cost Reduction Requirements • Justify Investments • Improve Efficiencies (Productivity, Time to Market) – Digital Information: ~60% Annual Growth Rate (AGR) – Data Storage: 15-20% AGR Capital Expense (CapEx) – Categorization, Classification & Retention Magnify • Compliance, Legal & Privacy Regulations – Prevalent & Interconnected Business Ecosystems • Supply Chains • Business Process Outsourcers (BPO) • Information Technology Outsourcers (ITO) • Vendor’s Vendors Source: IDC
  • 4. Source: NIST
  • 5. Service Delivery Models Source: Swain Techs
  • 6. Source: Matthew Gardiner, Computer Associates
  • 7. Big Data Cloud Source: Flickr
  • 8. Big Data Cloud • Big Data Overview – Aggregated Data from the Following Sources • Traditional • Source • Social
  • 9. Big Data Cloud • Traditional Data – Database Management Systems • Relational Database Management Systems (RDBMS) • Object-Oriented Database Management Systems (OODBMS) • Non-Relational, Distributed DB Management Systems (NRDBMS) • Mobile Databases (SQLite, Oracle Lite) – Online Transaction Processing (OLTP) • Real-Time Data Warehousing – Online Analytical Processing (OLAP) • Operational Data Stores (ODS) • Enterprise Data Warehouse (EDW)
  • 10. Big Data Cloud • Traditional Data – OLAP • Business Intelligence (BI) – Data Mining – Reporting – OLAP (Continued) » Relational OLAP (ROLAP) » Multi-Dimensional OLAP (MOLAP) » Hybrid OLAP (HOLAP) • Data Flows: OLTP → ODS → EDW (Data Marts) → BI (Data Mining); OLTP → ODS → EDW (Data Marts) → BI (Reporting); OLTP → ODS → EDW (Data Marts) → BI (OLAP)
  • 11. Big Data Cloud Source: Flickr
  • 12. Big Data Cloud • Source Data – Log Files • Event Logs / Operating System (OS) - Level • Appliance / Peripherals • Analyzers / Sniffers – Multimedia • Image Logs • Video Logs – Web Content Management (WCM) • Web Logs • Search Engine Optimization (SEO) – Web Metadata
  • 13. Big Data Cloud • Big Data Overview – Aggregators • Mostly NRDBMS Implementations – Not only Structured Query Language (NoSQL) • NRDBMS Examples – Column Family Stores: BigTable (Google), Cassandra & HBase (Apache) – Key-Value Stores: App Engine DataStore (Google), DynamoDB & SimpleDB (AWS) – Document Databases: CouchDB, MongoDB – Graph Databases: Neo4j
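To make the four NRDBMS families on slide 13 more concrete, here is a rough sketch that models one record in each style using plain Python structures. The record, the field names, and the helper functions are all hypothetical; none of the products named on the slide is being called, and their real data models are richer than this.

```python
import json

# Key-value store: an opaque value looked up by a single key.
key_value = {"user:42": json.dumps({"name": "Ada", "city": "Dover"})}

# Document database: the value is a structured, queryable document.
document = {"_id": 42, "name": "Ada", "address": {"city": "Dover"}}

# Column-family store: row key -> column family -> column -> value.
column_family = {
    "42": {"profile": {"name": "Ada"}, "location": {"city": "Dover"}},
}

# Graph database: nodes plus typed edges between them.
nodes = {42: {"name": "Ada"}, 7: {"name": "Dover"}}
edges = [(42, "LIVES_IN", 7)]


def city_from_key_value(key: str) -> str:
    """The store cannot see inside the value, so the client decodes it."""
    return json.loads(key_value[key])["city"]


def city_from_graph(user_id: int) -> str:
    """Follow the LIVES_IN edge from a user node to a city node."""
    for source, label, destination in edges:
        if source == user_id and label == "LIVES_IN":
            return nodes[destination]["name"]
    raise KeyError(user_id)
```

The same question ("which city?") is answered by decoding a blob, reading a nested field, reading a column, or walking an edge, which is the practical difference between the families.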
  • 14. Big Data Cloud • Big Data Overview – Serial Processing • Hadoop – Hadoop Distributed File System (HDFS) – Hive (DW) – Pig (Querying Language) • Riak – Parallel Processing • HadoopDB – Analytics • Google MapReduce • Apache MapReduce • Splunk (for Security Information / Event Management [SIEM])
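Slide 14 names MapReduce without showing the model. A minimal in-memory sketch of the map, shuffle, and reduce phases for a word count might look like this; it runs in a single process, the function names are illustrative, and it does not use the Hadoop or Google APIs.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group all emitted values by key, as the framework would between phases."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Collapse the values for one key into a single count."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
```

In a real cluster the map calls run on the nodes that hold each block of input and the reduce calls run per key on other nodes; only the shape of the three phases carries over from this sketch.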
  • 15. Source: Cloudera
  • 16. Source: Wikispaces
  • 17. Source: Google
  • 18. Source: Cloudera
  • 19. Big Data Cloud • Cloud-Based Big Data Solutions – PaaS • DBaaS – Amazon Web Services (AWS) » DynamoDB » SimpleDB » Relational Database Service (RDS): Oracle 11g / MySQL – Google App Engine » Datastore – Microsoft SQL Azure – Oracle Public Cloud: 11g • Processing – AWS Elastic MapReduce (EMR) – Google App Engine MapReduce: Mapper API – Microsoft: Apache Hadoop for Azure – IBM SmartCloud Enterprise on IBM InfoSphere BigInsights Basics
  • 20. Big Data Cloud
  • 21. Big Data Cloud
  • 22. Big Data Cloud
  • 23. Big Data Cloud
  • 24. Big Data Cloud
  • 25. Big Data Cloud • Cloud-Based Database Solutions – IaaS • Basic Components: Compute & Storage Nodes – AWS Elastic Compute Cloud (EC2) – AWS Elastic Block Store (EBS) – OpenStack Compute (Nova) – OpenStack Storage (Swift) • Advanced Components – Apache Hadoop – Apache Hadoop MapReduce • Commercial Applications – Cloudera – DataStax – MapR – Splunk
  • 26. Big Data Cloud [Diagram labels: AWS Cloud, EC2 Availability Zone, EC2 instances, EBS volumes, S3 Storage, EBS Snapshots, Internet] Source: Amazon
  • 27. Big Data Cloud • Big Data in the Cloud Use Cases – Public Cloud • AWS: EC2 Hadoop & S3 • AWS: EC2 Hadoop, DynamoDB & EMR • AWS: EC2 Linux, Apache (w / Tomcat), DynamoDB & EMR • AWS: EC2 Cloudera Hadoop & EMR • AWS: EC2 Splunk – Hybrid • Oracle Big Data Appliance & Connector, Google App Engine • OpenStack Swift, AWS EC2 Cloudera Hadoop & EMR – Private Cloud • OpenStack Nova & Swift, Apache Hadoop • OpenStack Nova & Swift, Cloudera Hadoop
  • 28. Big Data Cloud
  • 29. Source: Flickr
  • 30. Big Data Cloud • Securing Cloud-Based NRDBMS Solutions – General • Focus on Application / Middleware-Level Security – SQL Injections Are Still Possible – Leverage Application IAM for NRDBMS User Rights Mgmt (URM) – Leverage Application & System Logging for Authentication, Authorization & Accounting (AAA) • Segregation of Duties – Read / Write Namespaces – Read-Only Namespaces – Specific • Document – Consistency Assurance • Key / Value – Ensure Referential Integrity
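One way to read the "Segregation of Duties" bullets on slide 30 is as an application-level check in front of the data store, with some roles granted read / write access to a namespace and others read-only access. The sketch below assumes exactly that; the class `NamespacedStore`, the role names, and the grant layout are hypothetical and are not taken from any specific NRDBMS.

```python
from typing import Dict, Set


class NamespacedStore:
    """In-memory key-value store that enforces per-namespace rights in the application layer."""

    def __init__(self, grants: Dict[str, Dict[str, Set[str]]]) -> None:
        # grants[role][namespace] is {"read"} or {"read", "write"}.
        self._grants = grants
        self._data: Dict[str, Dict[str, str]] = {}

    def _check(self, role: str, namespace: str, action: str) -> None:
        allowed = self._grants.get(role, {}).get(namespace, set())
        if action not in allowed:
            raise PermissionError(f"{role} may not {action} {namespace}")

    def put(self, role: str, namespace: str, key: str, value: str) -> None:
        self._check(role, namespace, "write")
        self._data.setdefault(namespace, {})[key] = value

    def get(self, role: str, namespace: str, key: str) -> str:
        self._check(role, namespace, "read")
        return self._data[namespace][key]


# Hypothetical roles: the application writes orders, the auditor only reads them.
store = NamespacedStore({
    "app": {"orders": {"read", "write"}},
    "auditor": {"orders": {"read"}},
})
store.put("app", "orders", "o-1", "3 widgets")
```

Here `store.get("auditor", "orders", "o-1")` succeeds, while `store.put("auditor", "orders", "o-2", "...")` raises `PermissionError`. Each denied call is also a natural place to emit the AAA log entry the slide mentions.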
  • 31. Big Data Cloud
  • 32. Big Data Cloud • Securing Big Data in the Cloud – Identity & Access Management (IAM) • Security Assertion Markup Language (SAML) • Representational State Transfer (REST) – AWS IAM – Windows Azure Access Control Service (ACS) • Web Services Trust Language (WS-Trust)
  • 33. Source: OASIS
  • 34. Source: Intuit
  • 35. Big Data Cloud • Securing Big Data in the Cloud – Identity & Access Management (IAM) • Security Assertion Markup Language (SAML) • Representational State Transfer (REST) – AWS IAM – Windows Azure Access Control Service (ACS) • Web Services Trust Language (WS-Trust)
  • 36. Source: Apache
  • 37. Big Data Cloud
  • 38. Big Data Cloud
  • 39. Big Data Cloud
  • 40. Big Data Cloud • Securing Big Data in the Cloud – Identity & Access Management (IAM) • Security Assertion Markup Language (SAML) • Representational State Transfer (REST) – AWS IAM – Windows Azure Access Control Service (ACS) • Web Services Trust Language (WS-Trust)
  • 41. Big Data Cloud
  • 42. Big Data Cloud • Securing Big Data in the Cloud – Electronic Discovery (eDiscovery) • eDiscovery Reference Model (EDRM) • Legal Holds • Litigation Response – Records & Information Management (RIM) • Generally Accepted Recordkeeping Principles (GARP®) • Information Governance Reference Model (IGRM) • Information Lifecycle Management (ILM) • MIKE2.0
  • 43. Big Data Cloud
  • 44. Big Data Cloud • Privacy & Data Protection for Big Data Clouds – Jurisdictions* • Regional: EU DPA • National: PIPEDA, GLBA, HIPAA / HITECH, COPPA, Safe Harbor • Statutory: Bavarian, CA SB 1386 / 24, MA 201 CMR 17, NV SB 227 – Data Flow & Jurisdictional Adherence • Data Sharing with Third Parties – Pseudonymization / De-Identification • Consent & Notices – Contract Clauses • Model Contracts – Privacy Best Practices • Generally Accepted Privacy Principles (GAPP) * Not all inclusive.
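The "Pseudonymization / De-Identification" bullet on slide 44 does not name a technique. One common approach, assumed here purely for illustration, is to replace direct identifiers with a keyed hash (HMAC-SHA256) before a record is shared with a third party. The function names, the field names, and the choice of HMAC are assumptions, not something the slides specify.

```python
import hashlib
import hmac
from typing import Dict, Iterable


def pseudonymize(identifier: str, secret_key: bytes) -> str:
    """Return a stable pseudonym for `identifier` under `secret_key`."""
    digest = hmac.new(secret_key, identifier.encode("utf-8"), hashlib.sha256)
    return digest.hexdigest()


def prepare_for_sharing(
    record: Dict[str, str],
    secret_key: bytes,
    id_fields: Iterable[str] = ("email",),
) -> Dict[str, str]:
    """Copy `record`, replacing the identifying fields with pseudonyms."""
    fields = set(id_fields)
    return {
        name: pseudonymize(value, secret_key) if name in fields else value
        for name, value in record.items()
    }
```

The same identifier always maps to the same pseudonym under one key, so shared records can still be joined, while a party without the key cannot recompute the mapping from guessed identifiers. The remaining fields may still identify a person in combination, so this covers only the direct identifiers.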
  • 45. Big Data Cloud • Presentation Take-Aways – Big Data in the Cloud is Here to Stay – It Has to be Secure – Segregation of Data – Access Controls – Separation / Segregation of Duties – Federated Identities – Logging
  • 46. Questions? • Contact – Email: steve@ncontrol-llc.com – Twitter: markes1 – LI: http://www.linkedin.com/in/smarkey – CSA-DelVal: http://www.csadelval.org/