Introduction to Amazon Redshift

  • 3,686 views
Uploaded on

An introduction to Amazon Redshift.

An introduction to Amazon Redshift.

  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
No Downloads

Views

Total Views
3,686
On Slideshare
0
From Embeds
0
Number of Embeds
0

Actions

Shares
Downloads
255
Comments
1
Likes
6

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. Introducing Amazon Redshift http://aws.amazon.com/resources/databaseservices/webinarsDavid PearsonBusiness Development Manager
  • 2. What is AWS? Deployment & Administration Application Services Compute Storage Database Networking AWS Global Infrastructure
  • 3. AWS Database Amazon Redshift Services Fast, Powerful, Fully Managed, Petabyte-Scale Data Warehouse Service Amazon DynamoDBScalable High Performance Application Storage in the Fast, Predictable, Highly-Scalable NoSQL Data Store Cloud Amazon RDS Deployment & Administration Managed Relational Database Service for MySQL, Oracle and SQL Server Application Services Amazon ElastiCache In-Memory Caching Service Compute Storage Database Networking AWS Global Infrastructure
  • 4. Why Data Warehousing? Easy to provision and scale up massively No upfront costs, pay as you go Really fast performance at a really low price Open and flexible with support for popular tools
  • 5. AmazonRedshift fast and fully managed petabyte-scale data warehouse service
  • 6. objectives design and build a petabyte-scale data warehouse service A Lot FasterAmazonRedshift A Lot Cheaper A Whole Lot Simpler
  • 7. Redshift Dramatically Reduces I/O• Direct-attached storage Id Age State 123 20 CA• Large data block sizes 345 25 WA• Columnar storage 678 40 FL• Data compression• Zone maps Row storage Column storage
  • 8. Redshift Runs on Optimized Hardware HS1.8XL: 128GB RAM, 16 Cores, 24 Spindles, 16TB Storage, 2GB/sec scan rate HS1.XL: 16GB RAM, 2 Cores, 3 Spindles, 2TB Storage • Optimized for I/O intensive workloads • High disk density • Runs in HPC - fast network • HS1.8XL available on Amazon EC2
  • 9. Redshift Runs on Optimized Hardware HS1.8XL: 128GB RAM, 16 Cores, 24 Spindles, 16TB Storage, 2GB/sec scan rate HS1.XL: 16GB RAM, 2 Cores, 3 Spindles, 2TB Storage Grow Big Start Small 100 x 8XL = 1.6PB 1 x XL = 2TB
  • 10. Redshift Parallelizes andDistributes Everything Client VPC SQL Clients / BI Tools Load Query Leader Node Resize 10 GigE (HPC) Compute Compute Backup Node 16TB Node 16TB Compute Node 16TB Ingestion Restore Backup Restore Amazon S3
  • 11. data generated Gapdata volume data available for analysis Gartner: User Survey Analysis: Key Trends Shaping the Future of Data Center Infrastructure Through 2011 IDC: Worldwide Business Analytics Software 2012–2016 Forecast and 2011 Vendor Shares
  • 12. Redshift is Priced to Analyze All Your Data $0.85 per hour for on-demand (2TB) $999 per TB per year (3-yr reservation)
  • 13. Working withRedshift
  • 14. differentiated effort increases theuniqueness of an application
  • 15. Redshift Simplifies Provisioning• Create a cluster in minutes• Automatically patch your OS and data warehouse software• Scale up to 1.6PB with a few clicks and no downtime
  • 16. IntegrateRedshift withremote data centers
  • 17. 1. Cluster placed in read-only mode 2. New cluster provisioned 3. Data copied across (MPP) SQL Clients / BI Tools Leader Leader Node NodeCompute Compute Compute Compute Compute ComputeNode Node Node Node Node Node2TB 2TB 2TB 2TB 2TB 2TB Amazon S3
  • 18. 1. Cluster placed in read-only mode 2. New cluster provisioned 3. Data copied across (MPP) 4. DNS switched to new cluster (read-write) 5. Source cluster is de-provisioned SQL Clients / BI Tools Leader Leader Node NodeCompute Compute Compute Compute Compute ComputeNode Node Node Node Node Node2TB 2TB 2TB 2TB 2TB 2TB Amazon S3
  • 19. Integrates With ExistingBI Tools JDBC/ODBC Amazon Redshift
  • 20. Amazon RedshiftLive DemonstrationJeremy WintersLead Architect and Database Warehouse Designer
  • 21. GettingStarted
  • 22. Reporting Warehouse OLTP RDBMS Reporting ERP Redshift and BI • Accelerated operational reporting • Support for short-time use cases • Data compression, index redundancy
  • 23. On-Premises Integration OLTP RDBMS Reporting ERP Redshift and BI Data Integration Partners** as of 3/14/2013
  • 24. Live Archive for (Structured) Big Data OLTP DynamoDB Web Apps Reporting Redshift and BI • Direct integration with copy command • High velocity data ages into Redshift • Low cost, high scale option for new apps
  • 25. Cloud ETL for Big Data S3 Reporting Elastic MapReduce Redshift and BI • Maintain online SQL access to historical logs • Transformation and enrichment with EMR • Longer history ensures better insight
  • 26. Redshift “up to 50 times faster than our current OLAP solution” “exponential gains in performance”FastLow Cost less than $1 / hour to get started less than $1K / TB to run Redshift for a yearEasy To Get Started Please visit: http://aws.amazon.com/redshift/
  • 27. http://aws.amazon.com/resources/databaseservices/webinars Questions?