Introduction to Amazon Redshift
 

An introduction to Amazon Redshift.


    Introduction to Amazon Redshift: Presentation Transcript

    • Introducing Amazon Redshift. David Pearson, Business Development Manager. http://aws.amazon.com/resources/databaseservices/webinars
    • What is AWS? Deployment & Administration; Application Services; Compute, Storage, Database and Networking; AWS Global Infrastructure
    • AWS Database Services: Scalable High Performance Application Storage in the Cloud
      • Amazon Redshift: Fast, Powerful, Fully Managed, Petabyte-Scale Data Warehouse Service
      • Amazon DynamoDB: Fast, Predictable, Highly-Scalable NoSQL Data Store
      • Amazon RDS: Managed Relational Database Service for MySQL, Oracle and SQL Server
      • Amazon ElastiCache: In-Memory Caching Service
      • Stack diagram: Deployment & Administration; Application Services; Compute, Storage, Database and Networking; AWS Global Infrastructure
    • Why Data Warehousing?
      • Easy to provision and scale up massively
      • No upfront costs, pay as you go
      • Really fast performance at a really low price
      • Open and flexible with support for popular tools
    • Amazon Redshift: a fast and fully managed petabyte-scale data warehouse service
    • Objectives: design and build a petabyte-scale data warehouse service. Amazon Redshift is:
      • A Lot Faster
      • A Lot Cheaper
      • A Whole Lot Simpler
    • Redshift Dramatically Reduces I/O
      • Direct-attached storage
      • Large data block sizes
      • Columnar storage
      • Data compression
      • Zone maps
      Example table, shown on the slide in both row storage and column storage:
        Id   Age  State
        123  20   CA
        345  25   WA
        678  40   FL
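To make the columnar storage and zone map bullets concrete, here is a minimal Python sketch built on the slide's three-row example table. It is a toy model, not Redshift's implementation: the block size of two values and the helper names `build_zone_map` and `blocks_to_scan` are assumptions chosen for illustration.

```python
# Illustrative only: a toy model of column layout and zone maps.
rows = [
    (123, 20, "CA"),
    (345, 25, "WA"),
    (678, 40, "FL"),
]

# Column storage: one list per column, so a query on "age" reads only ages.
columns = {
    "id": [r[0] for r in rows],
    "age": [r[1] for r in rows],
    "state": [r[2] for r in rows],
}

BLOCK_SIZE = 2  # hypothetical; real storage blocks hold far more values


def build_zone_map(values, block_size=BLOCK_SIZE):
    """Record the (min, max) of each fixed-size block of a column."""
    blocks = [values[i:i + block_size] for i in range(0, len(values), block_size)]
    return [(min(b), max(b)) for b in blocks]


def blocks_to_scan(zone_map, low, high):
    """Return indexes of blocks whose [min, max] range overlaps [low, high]."""
    return [i for i, (lo, hi) in enumerate(zone_map) if hi >= low and lo <= high]


age_zone_map = build_zone_map(columns["age"])
# Query "age between 30 and 50": blocks that cannot match are skipped unread.
candidate_blocks = blocks_to_scan(age_zone_map, 30, 50)
print(age_zone_map, candidate_blocks)
```

In this sketch the first block of ages (20 and 25) is never read for the query, which is the kind of I/O saving the slide attributes to zone maps.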
    • Redshift Runs on Optimized Hardware
      • HS1.8XL: 128GB RAM, 16 Cores, 24 Spindles, 16TB Storage, 2GB/sec scan rate
      • HS1.XL: 16GB RAM, 2 Cores, 3 Spindles, 2TB Storage
      • Optimized for I/O intensive workloads
      • High disk density
      • Runs in HPC - fast network
      • HS1.8XL available on Amazon EC2
    • Redshift Runs on Optimized Hardware
      • HS1.8XL: 128GB RAM, 16 Cores, 24 Spindles, 16TB Storage, 2GB/sec scan rate
      • HS1.XL: 16GB RAM, 2 Cores, 3 Spindles, 2TB Storage
      • Start Small: 1 x XL = 2TB
      • Grow Big: 100 x 8XL = 1.6PB
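The "start small, grow big" figures follow directly from the per-node storage numbers on the slide. A short Python check, assuming decimal units (1 PB = 1000 TB), which is what the slide's 1.6PB figure implies; the function name `cluster_capacity_tb` is hypothetical:

```python
# Per-node storage figures taken from the slide, in terabytes.
NODE_STORAGE_TB = {"HS1.XL": 2, "HS1.8XL": 16}


def cluster_capacity_tb(node_type, node_count):
    """Raw storage of a cluster made of identical nodes."""
    return NODE_STORAGE_TB[node_type] * node_count


smallest_tb = cluster_capacity_tb("HS1.XL", 1)    # 1 x XL
largest_tb = cluster_capacity_tb("HS1.8XL", 100)  # 100 x 8XL
largest_pb = largest_tb / 1000                    # assumes 1 PB = 1000 TB
print(smallest_tb, largest_tb, largest_pb)
```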
    • Redshift Parallelizes and Distributes Everything
      • Load
      • Query
      • Resize
      • Backup
      • Restore
      • Architecture diagram: SQL Clients / BI Tools in the Client VPC connect to the Leader Node; the Leader Node connects over 10 GigE (HPC) to three 16TB Compute Nodes; Ingestion, Backup and Restore run between the Compute Nodes and Amazon S3
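The leader node and compute node arrangement in the diagram can be pictured with a small Python sketch. This is only an illustration of the general scatter and gather idea, under assumed details: rows are spread across nodes by a simple modulo on an integer key (a stand-in for a real distribution function), threads stand in for compute nodes, and the names `distribute`, `partial_sum` and `leader_query` are hypothetical.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

NUM_COMPUTE_NODES = 3  # the slide's diagram shows three compute nodes


def distribute(rows, num_nodes=NUM_COMPUTE_NODES):
    """Assign each (order_id, amount) row to a node by its integer key."""
    slices = defaultdict(list)
    for order_id, amount in rows:
        slices[order_id % num_nodes].append((order_id, amount))
    return slices


def partial_sum(node_rows):
    """Work done on one compute node: aggregate its local rows only."""
    return sum(amount for _, amount in node_rows)


def leader_query(slices):
    """Leader node: fan the query out, then merge the partial results."""
    with ThreadPoolExecutor(max_workers=len(slices)) as pool:
        partials = list(pool.map(partial_sum, slices.values()))
    return sum(partials)


orders = [(1, 10.0), (2, 20.0), (3, 30.0), (4, 40.0), (5, 50.0), (6, 60.0)]
slices = distribute(orders)
total = leader_query(slices)
print(total)
```

Each simulated node sums only its own two rows, and the leader adds three partial sums rather than scanning all six rows itself.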
    • Chart of data volume: data generated versus data available for analysis, with the gap between them. Sources: Gartner, User Survey Analysis: Key Trends Shaping the Future of Data Center Infrastructure Through 2011; IDC, Worldwide Business Analytics Software 2012–2016 Forecast and 2011 Vendor Shares
    • Redshift is Priced to Analyze All Your Data
      • $0.85 per hour for on-demand (2TB)
      • $999 per TB per year (3-yr reservation)
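The two prices are quoted in different units, so a quick conversion helps compare them. The Python sketch below uses only the slide's figures and assumes a 365-day year with the on-demand node running continuously; the variable names are illustrative.

```python
HOURS_PER_YEAR = 24 * 365  # 8760, ignoring leap years

ON_DEMAND_PER_HOUR = 0.85   # USD per hour for one 2TB node (from the slide)
ON_DEMAND_NODE_TB = 2
RESERVED_PER_TB_YEAR = 999  # USD per TB per year, 3-year reservation (from the slide)

# Cost of keeping one on-demand node running all year, expressed per TB.
on_demand_per_tb_year = ON_DEMAND_PER_HOUR * HOURS_PER_YEAR / ON_DEMAND_NODE_TB

# How many times more the on-demand rate costs per TB per year.
ratio = on_demand_per_tb_year / RESERVED_PER_TB_YEAR
print(round(on_demand_per_tb_year), round(ratio, 2))
```

Under these assumptions, on-demand works out to roughly $3,723 per TB per year, a little under four times the reserved rate.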
    • Working with Redshift
    • Differentiated effort increases the uniqueness of an application
    • Redshift Simplifies Provisioning
      • Create a cluster in minutes
      • Automatically patch your OS and data warehouse software
      • Scale up to 1.6PB with a few clicks and no downtime
    • Integrate Redshift with remote data centers
    • Resize steps: 1. Cluster placed in read-only mode. 2. New cluster provisioned. 3. Data copied across (MPP). Diagram: SQL Clients / BI Tools; two Leader Nodes and six 2TB Compute Nodes across the source and new clusters; Amazon S3
    • Resize steps: 1. Cluster placed in read-only mode. 2. New cluster provisioned. 3. Data copied across (MPP). 4. DNS switched to new cluster (read-write). 5. Source cluster is de-provisioned. Diagram: SQL Clients / BI Tools; two Leader Nodes and six 2TB Compute Nodes across the source and new clusters; Amazon S3
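The five steps above can be walked through with a small in-memory simulation. This Python sketch models only the order of operations on the slide; the `Cluster` class, the `resize` function and the `dns` dictionary are hypothetical stand-ins, not an AWS API.

```python
from dataclasses import dataclass, field


@dataclass
class Cluster:
    name: str
    node_count: int
    mode: str = "read-write"
    rows: list = field(default_factory=list)


def resize(source, new_node_count, dns):
    """Apply the slide's five resize steps and return the new cluster."""
    # 1. Source cluster is placed in read-only mode.
    source.mode = "read-only"
    # 2. New cluster is provisioned with the requested node count.
    target = Cluster(f"{source.name}-resized", new_node_count, mode="read-only")
    # 3. Data is copied across (here a plain list copy).
    target.rows = list(source.rows)
    # 4. DNS is switched to the new cluster, which becomes read-write.
    dns["endpoint"] = target.name
    target.mode = "read-write"
    # 5. Source cluster is de-provisioned.
    source.mode = "de-provisioned"
    source.rows = []
    return target


dns = {"endpoint": "dw"}
old_cluster = Cluster("dw", node_count=2, rows=[1, 2, 3])
new_cluster = resize(old_cluster, new_node_count=4, dns=dns)
print(dns["endpoint"], new_cluster.mode, old_cluster.mode)
```

Queries can still read from the source cluster throughout steps 1 to 3, and clients only see the new cluster once the endpoint changes in step 4.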
    • Integrates With Existing BI Tools: JDBC/ODBC connections to Amazon Redshift
    • Amazon Redshift Live Demonstration. Jeremy Winters, Lead Architect and Database Warehouse Designer
    • Getting Started
    • Reporting Warehouse
      • Diagram labels: OLTP, ERP, RDBMS, Redshift, Reporting and BI
      • Accelerated operational reporting
      • Support for short-time use cases
      • Data compression, index redundancy
    • On-Premises Integration
      • Diagram labels: OLTP, ERP, RDBMS, Redshift, Reporting and BI
      • Data Integration Partners (as of 3/14/2013)
    • Live Archive for (Structured) Big Data
      • Diagram labels: OLTP, Web Apps, DynamoDB, Redshift, Reporting and BI
      • Direct integration with copy command
      • High velocity data ages into Redshift
      • Low cost, high scale option for new apps
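As a rough picture of the copy command integration, the Python helper below assembles a Redshift `COPY` statement that reads from a DynamoDB table. The table names and the role ARN are placeholders, and the helper `dynamodb_copy_sql` is hypothetical. The statement uses the `IAM_ROLE` form of authorization; the talk itself does not show which form was used.

```python
def dynamodb_copy_sql(target_table, dynamodb_table, iam_role_arn, read_ratio=50):
    """Build a COPY statement that loads a Redshift table from DynamoDB.

    read_ratio is the percentage of the DynamoDB table's provisioned read
    throughput the load may consume (the READRATIO parameter).
    Plain string formatting is used here, so pass trusted values only.
    """
    if read_ratio < 1:
        raise ValueError("read_ratio must be a positive percentage")
    return (
        f"COPY {target_table}\n"
        f"FROM 'dynamodb://{dynamodb_table}'\n"
        f"IAM_ROLE '{iam_role_arn}'\n"
        f"READRATIO {read_ratio};"
    )


# Placeholder names for illustration only.
sql = dynamodb_copy_sql(
    target_table="web_events",
    dynamodb_table="WebEvents",
    iam_role_arn="arn:aws:iam::123456789012:role/ExampleRedshiftRole",
)
print(sql)
```

The resulting statement would be run through any SQL client connected to the cluster; a lower `read_ratio` leaves more DynamoDB read capacity for the live application.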
    • Cloud ETL for Big Data
      • Diagram labels: S3, Elastic MapReduce, Redshift, Reporting and BI
      • Maintain online SQL access to historical logs
      • Transformation and enrichment with EMR
      • Longer history ensures better insight
    • Redshift
      • Fast: “up to 50 times faster than our current OLAP solution”; “exponential gains in performance”
      • Low Cost: less than $1 / hour to get started; less than $1K / TB to run Redshift for a year
      • Easy To Get Started: please visit http://aws.amazon.com/redshift/
    • Questions? http://aws.amazon.com/resources/databaseservices/webinars