Your SlideShare is downloading. ×
0
introducingAMAZON REDSHIFT                  forBUSINESS INTELLIGENCE            a presentation at      MICROSTRATEGY WORLD...
Hello.
Thank you.
IData, dataeverywhere
I            IIData, data   Collection &everywhere     storage
I            II          IIIData, data   Collection &     Dataeverywhere     storage      security
I            II          III         IVData, data   Collection &     Data       Dataeverywhere     storage      security  ...
I            II      0.    III         IV                  Amazon webData, data   Collectionervices Data                  ...
Building blocks.
Compute, storage & databases.
Retail   Merchant     Web         services   services
Blinding flash of the obvious.
Available.
Low cost.
Flexible.
Every day, AWS adds enough servercapacity to power amazon.com in 2003,when it was a $5B enterprise
IData, data everywhere
Data for competitive advantage.
Customer segmentation,financial modeling,system analysis,line of sight,business intelligence...
Generation  Collection & storageAnalytics & computationCollaboration & sharing
Cost of data generation is falling.
devicesKindle Fire HD, Kindle Fire, KindlePaperwhite and Kindle hold the top fourspots on the Amazon world wide best selle...
Amazon Appstore selection tripled in 2012.                                apps and games
Amazon customers purchased more than  one toy per second on mobile devices.commerce
most giftedkindle book
lower cost,increased throughput                             Generation                         Collection & storage       ...
Generation                            highly                          constrained  Collection & storageAnalytics & computa...
Gap.
Data volume                               The Data Analysis Gap                                                           ...
Enter AWS.
Utility.
Remove constraints.
Generation                            highly                          constrained  Collection & storageAnalytics & computa...
Generation  Collection & storageAnalytics & computationCollaboration & sharing
Full value.
Close the gap.
Reduced time to market.
Identify and meet new business         opportunities.
Lower costs.
IICollection & Storage
One schema to rule them all.
One schema to rule them all.
Lots of data. Lots of users.  Lots of uses.Lots of locations.
Cost.
Multipliers.
Object storage.
99.999999999%     durability
Relational databases.
NoSQL data stores.
HDFS based stores.
Undifferentiated heavy lifting.
Lower costs. Ease of use.
only pay for what you useno capital investment          Lower costs. Ease of use.pay as you go         no subscriptions
programmable     integrate with                        existing toolsLower costs. Ease of use.                           e...
Data warehousing.
Expensive. Complicated.
Enterprises average between3 and 4 DBAs per datawarehouse.  Source: Gartner. Critical factors in calculating the data ware...
Source: Oracle technology global price list 11/1/2012
Expensive. Complicated.
Unobtainable.
Amazon Redshift.
Fast. Powerful. Petabyte scale.
Managed service.
Automated deployment   & configuration.
SQL access and BI tool integration.
Parallel execution.
Leader Node
Leader           NodeCompute   Compute   Compute Node      Node      Node
Leader           NodeCompute   Compute   Compute Node      Node      Node
10gigE full bisection network.
Leader           NodeCompute   Compute   Compute Node      Node      Node
Common BI Tools          JDBC/ODBC                Leader                 NodeCompute        Compute        Compute Node   ...
Certified for use with  Microstrategy.
Data compression.
Automated backup to S3.
Data encrypted in transit       & at rest.
Streaming recovery.
Common BI Tools          JDBC/ODBC                Leader                 NodeCompute        Compute        Compute Node   ...
Common BI Tools          JDBC/ODBC                Leader                 NodeCompute        Compute        Compute Node   ...
Common BI Tools          JDBC/ODBC                Leader                 NodeCompute        Compute        Compute Node   ...
Elastic.
Common BI Tools          JDBC/ODBC                Leader                 NodeCompute        Compute        Compute Node   ...
Common BI Tools                    JDBC/ODBC                          Leader                           NodeCompute   Compu...
Common BI Tools          JDBC/ODBC                Leader                 NodeCompute        Compute        Compute Node   ...
Data warehouse node types.
High Storage Extra Large (XL)15GB RAM2TB local attached storage3 drives2 virtual cores
High Storage Extra Large (XL)   8 High Storage Extra Large (8XL)15GB RAM                        120GB RAM2TB local attache...
Pay as you go.
Hourly Prices              2 TB nodes           16 TB nodesOn-demand       $0.850                   $6.801 Year           ...
Hourly Prices              2 TB nodes           16 TB nodesOn-demand       $0.850                   $6.801 Year           ...
$999 per TB
Don’t pay for the leader node.
No additional storage charge for  backups of active clusters.
VPC ready.
Low cost. Easy to use.
Focus on analysis.
Private beta today.
Available early this year.
aws.amazon.com/redshift
2 billion row dataset. 6 representative queries.
Amazon Redshift: 2 instance clusterCompared to 32 nodes. 128 CPUs. 4.2 TB RAM. 1.6 PB storage. 2 billion row data set.    ...
29 minutes 58 seconds        down to     12 seconds
IIIData security.
Security is our number one priority.
Shared responsibility.
Choose your region.
Availability zones.
SOC 2       ISAE 3402 FISMA Moderate   PCI DSS   FIPS 140-2ISO 27001        ITAR          HIPAA             MPAA
“You basically turn yourself into apolymorphic surface to which the attack guyhas a much tougher time getting at. That,ult...
Virtual Private Cloud.
Network isolated environment.
Public and private subnets.
Redshift, relational databases, Hadoop      can run inside the VPC.
Extend your VPN.
Identity and access federation.
Identity and access management.
IVData movement.
“How do I get my data  into the cloud?”
Generated and stored in the AWS cloud.
Inbound transfer if free.
Multipart upload.
Aspera, IRODS.
Physical media.
AWS Direct Connect.
1Gbps or 10Gbps
Built in AZ replication.
Regional replication.
“How do I integrate my data?”
Amazon S3           Amazon RDSAmazon DynamoDB     Amazon RedshiftHDFS (Amazon EMR)   On Premise
AWS Data Pipeline
Data-intensive orchestration       & automation.
Reliable, scheduleddata movement and analytics.
aws.amazon.com/datapipeline
aws.amazon.com
IData, dataeverywhere
I            IIData, data   Collection &everywhere     storage
I            II          IIIData, data   Collection &     Dataeverywhere     storage      security
I            II          III         IVData, data   Collection &     Data       Dataeverywhere     storage      security  ...
Thank you.
get in touch        introducing                        MATTHEW@AMAZON.COMAMAZON REDSHIFT                  or              ...
Amazon Redshift for Business Intelligence
Amazon Redshift for Business Intelligence
Upcoming SlideShare
Loading in...5
×

Amazon Redshift for Business Intelligence

12,503

Published on

An introduction to Amazon Redshift for business intelligence applications. Presented at Microstrategy World 2013.

Published in: Technology

Transcript of "Amazon Redshift for Business Intelligence"

  1. 1. introducingAMAZON REDSHIFT forBUSINESS INTELLIGENCE a presentation at MICROSTRATEGY WORLD 2013 by DR. MATT WOOD
  2. 2. Hello.
  3. 3. Thank you.
  4. 4. IData, dataeverywhere
  5. 5. I IIData, data Collection &everywhere storage
  6. 6. I II IIIData, data Collection & Dataeverywhere storage security
  7. 7. I II III IVData, data Collection & Data Dataeverywhere storage security movement
  8. 8. I II 0. III IV Amazon webData, data Collectionervices Data S & Dataeverywhere storage security movement
  9. 9. Building blocks.
  10. 10. Compute, storage & databases.
  11. 11. Retail Merchant Web services services
  12. 12. Blinding flash of the obvious.
  13. 13. Available.
  14. 14. Low cost.
  15. 15. Flexible.
  16. 16. Every day, AWS adds enough servercapacity to power amazon.com in 2003,when it was a $5B enterprise
  17. 17. IData, data everywhere
  18. 18. Data for competitive advantage.
  19. 19. Customer segmentation,financial modeling,system analysis,line of sight,business intelligence...
  20. 20. Generation Collection & storageAnalytics & computationCollaboration & sharing
  21. 21. Cost of data generation is falling.
  22. 22. devicesKindle Fire HD, Kindle Fire, KindlePaperwhite and Kindle hold the top fourspots on the Amazon world wide best sellerchart since launch.
  23. 23. Amazon Appstore selection tripled in 2012. apps and games
  24. 24. Amazon customers purchased more than one toy per second on mobile devices.commerce
  25. 25. most giftedkindle book
  26. 26. lower cost,increased throughput Generation Collection & storage Analytics & computation Collaboration & sharing
  27. 27. Generation highly constrained Collection & storageAnalytics & computationCollaboration & sharing
  28. 28. Gap.
  29. 29. Data volume The Data Analysis Gap Generated data Available for analysis 1990 2000 2010 2020 Enterprise Data Data in Warehouse Gartner: User Survey Analysis: Key Trends Shaping the Future of Data Center Infrastructure Through 2011 IDC: Worldwide Business Analytics Software 2012–2016 Forecast and 2011 Vendor Shares
  30. 30. Enter AWS.
  31. 31. Utility.
  32. 32. Remove constraints.
  33. 33. Generation highly constrained Collection & storageAnalytics & computationCollaboration & sharing
  34. 34. Generation Collection & storageAnalytics & computationCollaboration & sharing
  35. 35. Full value.
  36. 36. Close the gap.
  37. 37. Reduced time to market.
  38. 38. Identify and meet new business opportunities.
  39. 39. Lower costs.
  40. 40. IICollection & Storage
  41. 41. One schema to rule them all.
  42. 42. One schema to rule them all.
  43. 43. Lots of data. Lots of users. Lots of uses.Lots of locations.
  44. 44. Cost.
  45. 45. Multipliers.
  46. 46. Object storage.
  47. 47. 99.999999999% durability
  48. 48. Relational databases.
  49. 49. NoSQL data stores.
  50. 50. HDFS based stores.
  51. 51. Undifferentiated heavy lifting.
  52. 52. Lower costs. Ease of use.
  53. 53. only pay for what you useno capital investment Lower costs. Ease of use.pay as you go no subscriptions
  54. 54. programmable integrate with existing toolsLower costs. Ease of use. easy to zero admin configure
  55. 55. Data warehousing.
  56. 56. Expensive. Complicated.
  57. 57. Enterprises average between3 and 4 DBAs per datawarehouse. Source: Gartner. Critical factors in calculating the data warehouse TCO, July 2009
  58. 58. Source: Oracle technology global price list 11/1/2012
  59. 59. Expensive. Complicated.
  60. 60. Unobtainable.
  61. 61. Amazon Redshift.
  62. 62. Fast. Powerful. Petabyte scale.
  63. 63. Managed service.
  64. 64. Automated deployment & configuration.
  65. 65. SQL access and BI tool integration.
  66. 66. Parallel execution.
  67. 67. Leader Node
  68. 68. Leader NodeCompute Compute Compute Node Node Node
  69. 69. Leader NodeCompute Compute Compute Node Node Node
  70. 70. 10gigE full bisection network.
  71. 71. Leader NodeCompute Compute Compute Node Node Node
  72. 72. Common BI Tools JDBC/ODBC Leader NodeCompute Compute Compute Node Node Node
  73. 73. Certified for use with Microstrategy.
  74. 74. Data compression.
  75. 75. Automated backup to S3.
  76. 76. Data encrypted in transit & at rest.
  77. 77. Streaming recovery.
  78. 78. Common BI Tools JDBC/ODBC Leader NodeCompute Compute Compute Node Node Node
  79. 79. Common BI Tools JDBC/ODBC Leader NodeCompute Compute Compute Node Node Node
  80. 80. Common BI Tools JDBC/ODBC Leader NodeCompute Compute Compute Node Node Node
  81. 81. Elastic.
  82. 82. Common BI Tools JDBC/ODBC Leader NodeCompute Compute Compute Node Node Node
  83. 83. Common BI Tools JDBC/ODBC Leader NodeCompute Compute Compute Compute Compute Node Node Node Node Node
  84. 84. Common BI Tools JDBC/ODBC Leader NodeCompute Compute Compute Node Node Node
  85. 85. Data warehouse node types.
  86. 86. High Storage Extra Large (XL)15GB RAM2TB local attached storage3 drives2 virtual cores
  87. 87. High Storage Extra Large (XL) 8 High Storage Extra Large (8XL)15GB RAM 120GB RAM2TB local attached storage 16TB local attached storage3 drives 24 drives2 virtual cores 16 virtual cores
  88. 88. Pay as you go.
  89. 89. Hourly Prices 2 TB nodes 16 TB nodesOn-demand $0.850 $6.801 Year $0.50 $4.00Reservation3 Year $0.228 $1.824Reservation
  90. 90. Hourly Prices 2 TB nodes 16 TB nodesOn-demand $0.850 $6.801 Year $0.50 $4.00Reservation3 Year $0.228 $1.824Reservation
  91. 91. $999 per TB
  92. 92. Don’t pay for the leader node.
  93. 93. No additional storage charge for backups of active clusters.
  94. 94. VPC ready.
  95. 95. Low cost. Easy to use.
  96. 96. Focus on analysis.
  97. 97. Private beta today.
  98. 98. Available early this year.
  99. 99. aws.amazon.com/redshift
  100. 100. 2 billion row dataset. 6 representative queries.
  101. 101. Amazon Redshift: 2 instance clusterCompared to 32 nodes. 128 CPUs. 4.2 TB RAM. 1.6 PB storage. 2 billion row data set. 12x to 150x faster
  102. 102. 29 minutes 58 seconds down to 12 seconds
  103. 103. IIIData security.
  104. 104. Security is our number one priority.
  105. 105. Shared responsibility.
  106. 106. Choose your region.
  107. 107. Availability zones.
  108. 108. SOC 2 ISAE 3402 FISMA Moderate PCI DSS FIPS 140-2ISO 27001 ITAR HIPAA MPAA
  109. 109. “You basically turn yourself into apolymorphic surface to which the attack guyhas a much tougher time getting at. That,ultimately, is the real key advantage to drivesecurity and make things much better for usacross the board.”Gus Hunt, CTOCentral Intelligence Agency
  110. 110. Virtual Private Cloud.
  111. 111. Network isolated environment.
  112. 112. Public and private subnets.
  113. 113. Redshift, relational databases, Hadoop can run inside the VPC.
  114. 114. Extend your VPN.
  115. 115. Identity and access federation.
  116. 116. Identity and access management.
  117. 117. IVData movement.
  118. 118. “How do I get my data into the cloud?”
  119. 119. Generated and stored in the AWS cloud.
  120. 120. Inbound transfer if free.
  121. 121. Multipart upload.
  122. 122. Aspera, IRODS.
  123. 123. Physical media.
  124. 124. AWS Direct Connect.
  125. 125. 1Gbps or 10Gbps
  126. 126. Built in AZ replication.
  127. 127. Regional replication.
  128. 128. “How do I integrate my data?”
  129. 129. Amazon S3 Amazon RDSAmazon DynamoDB Amazon RedshiftHDFS (Amazon EMR) On Premise
  130. 130. AWS Data Pipeline
  131. 131. Data-intensive orchestration & automation.
  132. 132. Reliable, scheduleddata movement and analytics.
  133. 133. aws.amazon.com/datapipeline
  134. 134. aws.amazon.com
  135. 135. IData, dataeverywhere
  136. 136. I IIData, data Collection &everywhere storage
  137. 137. I II IIIData, data Collection & Dataeverywhere storage security
  138. 138. I II III IVData, data Collection & Data Dataeverywhere storage security movement
  139. 139. Thank you.
  140. 140. get in touch introducing MATTHEW@AMAZON.COMAMAZON REDSHIFT or @MZA forBUSINESS INTELLIGENCE AWS.AMAZON.COM
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×