35. Oracle Certification, Support and Licensing
All products certified on the
Oracle Virtual Machine are now
Certified & Certified on Amazon EC2
supported
managed OVM
Full Support from Oracle and
AWS
Standard Licensing Policies
Apply
AMIs exist today for many
Oracle products
36. Biomarker
Warehouse
pre-clinical, clinical, 3rd party data and publications
Estimated cost: 10 TB warehouse over 3
years
53. Customers are using for …
Targeted advertising / Clickstream analysis
Data warehousing applications
Bioinformatics
Financial modeling
File processing
Web indexing
Data mining and BI
54. CLICKSTREAM ANALYSIS – RAZORFISH AND BEST
BUY
Best Buy came to Razorfish
3.5 billion records, 71 million unique cookies, 1.7 million targeted ads
required per day
User recently
purchased a
home theater Targeted Ad
system and is
searching for (1.7 Million per day)
video games
• Leveraged AWS and Elastic MapReduce
– 100 node cluster on demand
– Processing time dropped from 2+ days to 8 hours
– Increased ROAS by 500%
59. Cascading.Jruby
source 'raw_log_data’
assembly 'raw_log_data' do
# Helper method to parse fields from raw web logs: session_id, url parameters, etc.
parse_log :query_fields => ['affiliate_code’,'sale_confirmation_id’]
branch 'affiliate_events' do
where 'affiliate_code:string != null’
rename 'created_at' =>; 'affiliate_timestamp’
end
branch 'sales_events' do
where 'sale_confirmation_id:string != null’
rename 'created_at' => 'sale_timestamp’
end
end
63. RESIZE RUNNING JOB FLOWS
Use Case: Increase speed of running job flows
Speed up job flow execution in response to changing requirements
Dynamically balance cost versus performance without restarting a job
Job Flow
Job Flow
Job Flow
Allocate Expand to Expand to
4 instances 9 instances 25 instances
Time remaining:
Time remaining:
14 Hours
7 Hours
Time remaining:
3 Hours
64. Dynamically Resize Job Flows
Use Case: Agile Data Warehouse Cluster
Customize cluster size to support varying resource needs (e.g., query
support during the day versus batch processing overnight)
Leverage flexibility to reduce costs and increase cluster utilization
Data Warehouse
(Batch Processing)
Data Warehouse Data Warehouse
(Steady State) (Steady State)
Allocate Expand to Shrink to
9 instances 25 instances 9 instances
65. Ecosystem And Tools
Business Intelligence
MicroStrategy, Pentaho
Analytics
Datameer, Karmasphere
Open source
Beeswax
…
83. deesingh@amazon.com
Twitter:@mndoci
Inspiration and material from
Matt Wood, Peter Sirota & Larry Lessig
Credit” Oberazzi under a CC-BY-NC-SA license