© 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
John Newton, Founder & CTO Alfresco
@johnnewton
October 2015
DAT309
Scaling Massive Content
Stores with Amazon Aurora
Alfresco on AWS
What to expect from the session
• Challenges of scaling to billions of documents
• Architectural approaches of managing data, search, and storage
with Amazon Aurora, Solr, Amazon EBS, and Amazon S3
• The breadth of use cases of content at scale
• How to support user applications that require sub-second response
times
• Moving from large data centers to cost-effective management with
AWS and Amazon Aurora
Alfresco in action
Government
Finance
Manufacturing
Transportation & Utilities
Healthcare Other
Content in dynamic
context
Consumerized search &
discovery
Secure & mobile
collaboration
Invisible Information
Governance
SIMPLE
SMART
Powerful metadata,
rules & relationships
Easy process
creation & analysis
CONTENT PROCESS
Modern & agile
architecture
OPEN
Cloud integration,
sync & scale
Open source, open
APIs
Difference
@johnnewton alfresco.com/awsreinvent
Somewhere deep in the Nevada Desert
in a secret underground location
someone is trying to store…
One Billion
Documents!!!
http://www.warnerbros.com/austin-powers-international-man-mystery
Some have tried before … and failed!
We’ll
configure
1 Million
SharePoint
Servers!!!
Digital transformation is driving huge flows of content
Gartner Nexus
PWC 6th Annual Digital IQ Survey, 2014
Digital
Business
Cloud Social
Big Data Mobile
Content use cases at scale
Enterprise
Document Library
Loans &
Policies
Claims & Case
Processing
Transaction &
Logistics Records
Research &
Analysis
Real-time
Video
Internet of
Things
Medical &
Personnel Records
Government
Records & Archives
Discovery &
Litigation
Content management applications
Document
Library
Image
Management
File Sync &
Share
Search &
Retrieval
Business
Process
Management
Records
Management
Case
Management
Media
Management
Information
Archiving
What is this
“Content”?!!!
Content vs. data vs. files vs. EFSS
Data Files EFSS Content and ECM
Content architecture as a big data problem
Files /
Renditions
Metadata
Directory Categories Relationships
Indexes
Search
Activities
Security People
APIs
Processes /
Tasks
Rules
Semantics
Types
Content
Object
Access Create – Manage – Distribute – Use
Context
Database
Distributed FS
Database
Solr /
Elasticsearch
Content at scale in the enterprise
Users at Scale
Concurrency Content Count
Read/Write
Throughput
Geographic
Distribution
Volume Size
The problem with traditional approaches
Provisioning and
Administration
Geographic Distribution Lack of Agility
Lack of Redundancy Lack of Elasticity
It’s enough to get one “fired”
Unique challenges of very large repositories
Scaling Up
• Clustered Servers
• Clustered Database
• Clustered Indexes
• Read Replicas
Scaling Out
• Sharding
• Federation
• Replication
• Shared Nothing
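Of the scale-out options listed, sharding is the easiest to sketch in a few lines. The snippet below is a minimal illustration only, not how Alfresco or Solr actually route documents: it assumes each document has a string id and that a stable hash modulo a fixed shard count picks the shard. The names `shard_for` and `distribute` and the `node-N` ids are hypothetical.

```python
import hashlib
from typing import Dict, Iterable, List


def shard_for(node_id: str, shard_count: int) -> int:
    """Map a node id to a shard index using a stable hash (hypothetical helper)."""
    digest = hashlib.sha256(node_id.encode("utf-8")).digest()
    # Use the first 8 bytes as an unsigned integer, then reduce modulo the shard count.
    return int.from_bytes(digest[:8], "big") % shard_count


def distribute(node_ids: Iterable[str], shard_count: int) -> Dict[int, List[str]]:
    """Group node ids by the shard they hash to."""
    shards: Dict[int, List[str]] = {i: [] for i in range(shard_count)}
    for node_id in node_ids:
        shards[shard_for(node_id, shard_count)].append(node_id)
    return shards


if __name__ == "__main__":
    ids = [f"node-{i}" for i in range(10_000)]  # made-up ids for illustration
    sizes = [len(v) for v in distribute(ids, 20).values()]
    print("smallest shard:", min(sizes), "largest shard:", max(sizes))
```

A fixed modulo keeps the example short. Changing the shard count remaps most ids, which is one reason real systems often prefer consistent hashing or range-based routing.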
Content management architecture
Alfresco Share
Alfresco Repository
Alfresco SOLR
Activiti Workflow
Engine
Database
FS Content
Store
Indexes
S3
Amazon RDS
Amazon Elastic Block
Store or Ephemeral
PIOPS EBS
(or Amazon Glacier)
EC2
Content management architecture
Alfresco Share
Alfresco
Repository
Alfresco SOLR
Activiti
Workflow
Protocols
APIs (CMIS)
Media
Mgmt
Desktop and
Mail Client
Mobile App Cloud Sync
Database
FS Content
Store
Indexes
Records
Management
Reports &
Analytics
Reports and
Analytics Server
Media Transform
Services
Transforms
Authentication
Auditing
Rules/Policies
Web Scripts
Scheduled Jobs
Process management architecture
Database Elasticsearch
Files
Amazon EBS
Process
Mining
Activiti Engine
Tomcat / Jetty
Process
Virtual
Machine
Tasks Processes Jobs
Activiti REST App REST Admin REST
MS Office
Protocol
Activiti
Analysis
AngularJS
Activiti
App
AngularJS
Activiti
Admin
AngularJS
MS
Office
Activiti
Mobile
iOS / Android
Activiti
Designer
Eclipse
Scaling in tiers
Alfresco
Transformation Server
Alfresco
Transformation Server
Alfresco Solr
Alfresco Local Repo
(Index Tracking)
Alfresco Solr
Alfresco Local Repo
(Index Tracking)
Alfresco Repository Alfresco Repository
Alfresco Share Alfresco Share
Alfresco Activiti Suite
Alfresco Activiti Suite
Multi-tenant Cloud Service on AWS
[Diagram labels: RDS, Activities, Route53 (DNS), S3, ELB, Layer7, haproxy, varnish, /share, /alfresco, web nodes, alfresco nodes, solr, trans]
You can scale
my server any
day, Baby!!!
Data meta-model
A
B
C
D
Folder
Folder
Doc
Doc
rendition
Class
Type Aspect
Property
Association
Constraint
Child
Association
Folder
Document
contains
name
name
content
Auditable who by
when
rendition
Type
Child Association
Type
Association
Property
Property
Property
Aspect
Model Metadata Organization
1 Billion 15 Billion
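The meta-model terms on this slide (types, aspects, properties, child associations) can be made concrete with a small sketch. This is an illustrative data structure under stated assumptions, not Alfresco's actual classes or API: `Node`, `ChildAssociation`, `add_child`, and `apply_aspect` are hypothetical names, and the sample values are made up.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List, Set


@dataclass
class Node:
    """A typed node carrying properties, aspects, and child associations."""
    type_name: str                                   # e.g. "Folder" or "Document"
    properties: Dict[str, Any] = field(default_factory=dict)
    aspects: Set[str] = field(default_factory=set)
    children: List["ChildAssociation"] = field(default_factory=list)


@dataclass
class ChildAssociation:
    """A typed parent-to-child link, e.g. "contains" or "rendition"."""
    assoc_type: str
    child: Node


def add_child(parent: Node, assoc_type: str, child: Node) -> Node:
    """Attach child to parent through a typed child association."""
    parent.children.append(ChildAssociation(assoc_type, child))
    return child


def apply_aspect(node: Node, aspect: str, **props: Any) -> None:
    """Mark the node with an aspect and set the properties that come with it."""
    node.aspects.add(aspect)
    node.properties.update(props)


if __name__ == "__main__":
    folder = Node("Folder", {"name": "A"})
    doc = add_child(folder, "contains", Node("Document", {"name": "report.pdf"}))
    apply_aspect(doc, "Auditable", who="jdoe", when="2015-10-01")
    add_child(doc, "rendition", Node("Document", {"name": "thumbnail"}))
    print(doc.aspects, [c.assoc_type for c in doc.children])
```

Keeping aspects as a set separate from the type mirrors the slide's split between Type and Aspect: a node has one type but may pick up any number of aspects.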
Database schema
• Tables
• Indexes
• Concurrency
• Throughput Techniques
• Striping
• Parallelism
• Sharding
Table Size (GB)
alf_child_assoc 448
alf_content_data 149
alf_content_url 202
alf_node 711
alf_node_aspects 217
alf_node_properties 1,524
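To put the table in proportion, the sizes can be totalled and ranked. The sketch below only does arithmetic on the six figures listed above; the dictionary and function names are illustrative.

```python
from typing import Dict

# Table sizes in GB, copied from the slide.
TABLE_SIZES_GB: Dict[str, int] = {
    "alf_child_assoc": 448,
    "alf_content_data": 149,
    "alf_content_url": 202,
    "alf_node": 711,
    "alf_node_aspects": 217,
    "alf_node_properties": 1524,
}


def total_gb(sizes: Dict[str, int]) -> int:
    """Sum of all listed table sizes."""
    return sum(sizes.values())


def shares(sizes: Dict[str, int]) -> Dict[str, float]:
    """Fraction of the listed total that each table accounts for."""
    total = total_gb(sizes)
    return {name: size / total for name, size in sizes.items()}


if __name__ == "__main__":
    print("total GB:", total_gb(TABLE_SIZES_GB))
    for name, share in sorted(shares(TABLE_SIZES_GB).items(), key=lambda kv: -kv[1]):
        print(f"{name:22s} {share:6.1%}")
```

The six tables add up to 3,251 GB, and alf_node_properties alone accounts for a little under half of that.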
Amazon Aurora difference
AZ 1 AZ 2
EBS
mirror
EBS
mirror
Amazon S3
EBS
Standby
Instance
Primary
Instance
AZ 1 AZ 3
Amazon S3
Primary
Instance
AZ 2
Replica
Instance
• Highly available — synchronous vs. asynchronous replication
• Significantly more efficient use of network I/O
• Self-healing, fault-tolerant, instant crash recovery
MySQL with standby Amazon Aurora
async
4/6 quorum
PiTR
Sequential
write
Sequential
write
Distributed
writes
Amazon Elastic
Block Store (EBS)
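The "4/6 quorum" label can be read as: a write is acknowledged once four of six storage copies confirm it. The sketch below models only that counting rule, under the assumption (not stated on the slide) that the six copies sit two per Availability Zone across the three zones shown. It is not Aurora code, and `healthy_copies` and `can_write` are hypothetical names.

```python
from typing import Dict

COPIES_PER_AZ = 2   # assumption: two copies in each of three AZs (six in total)
WRITE_QUORUM = 4    # from the slide: 4/6 quorum


def healthy_copies(az_up: Dict[str, bool], copies_per_az: int = COPIES_PER_AZ) -> int:
    """Count storage copies that sit in reachable Availability Zones."""
    return sum(copies_per_az for up in az_up.values() if up)


def can_write(az_up: Dict[str, bool], quorum: int = WRITE_QUORUM) -> bool:
    """A write succeeds when at least `quorum` copies can acknowledge it."""
    return healthy_copies(az_up) >= quorum


if __name__ == "__main__":
    scenarios = {
        "all zones up": {"AZ1": True, "AZ2": True, "AZ3": True},
        "one zone down": {"AZ1": True, "AZ2": False, "AZ3": True},
        "two zones down": {"AZ1": True, "AZ2": False, "AZ3": False},
    }
    for label, az_up in scenarios.items():
        print(label, "->", "writes accepted" if can_write(az_up) else "writes blocked")
```

Under these assumptions, losing a whole zone still leaves four copies, so writes continue; losing two zones does not.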
Modifications required for Aurora
Index and search architecture
Full-Text Query
Metadata Query
Facets & Buckets
Security Filters
Results Processing
Credit: Ryan Tobora
ThinkBig, Teradata
http://thinkbig.teradata.com/solrcloud-terminology/
Text Extraction
Metadata Injection
& Path Processing
Shingles
ACL Processing
Results Process
Term-hit Highlighting
x 20 instances
Storage Layer
File storage architecture
In Place: AWS Import/Export Snowball
Direct
Streaming
Aurora EBS
Metadata Content
Metadata
Content
Archive Layer
S3 Amazon Glacier
Metadata Content
File
System
Protocols
APIs
Security architecture
Web Filtering
Enterprise
Authentication
Access Control
Lists and Caveats
Query Filters and
Object Policies
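The "Access Control Lists" and "Query Filters" layers can be illustrated with a toy filter that drops search hits the caller may not read. This is a sketch under simple assumptions (each hit lists its permitted readers, and a user's authorities are the user plus their groups), not Alfresco's permission model. All names and sample data are hypothetical.

```python
from typing import Dict, List, Set


def authorities_for(user: str, groups: Dict[str, Set[str]]) -> Set[str]:
    """A user's authorities: the user plus every group that lists them."""
    return {user} | {group for group, members in groups.items() if user in members}


def filter_hits(hits: List[dict], authorities: Set[str]) -> List[dict]:
    """Keep only hits whose reader list shares at least one authority with the caller."""
    return [hit for hit in hits if authorities & set(hit["readers"])]


if __name__ == "__main__":
    groups = {"GROUP_finance": {"alice"}, "GROUP_legal": {"bob"}}
    hits = [
        {"name": "budget.xlsx", "readers": ["GROUP_finance"]},
        {"name": "contract.pdf", "readers": ["GROUP_legal"]},
        {"name": "notes.txt", "readers": ["alice", "bob"]},
    ]
    visible = filter_hits(hits, authorities_for("alice", groups))
    print([hit["name"] for hit in visible])
```

Filtering after the query is the simplest form to show. At large scale the same check is usually pushed into the index as a query filter so that unreadable hits are never fetched.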
Do you suppose we
can put it together
with some string and
Scotch Tape?!!
Provisioning VLDB repos
new architecture allows room to scale the environment to support 2013 and 2014 roadmap
plans while still supporting an environment that will be reliable and robust.
Additionally, this environment supports disaster recovery, guaranteeing that in case of a
severe outage, backups are stored and the environment can be restored with a quick turnaround.
Below is a screenshot of the Customer Deployment Portal in which Stanford will be able to scale
the Alfresco environment seamlessly within a web-based UI.
Flexibility
This new architecture will also utilize new Cloud Ops tools that will allow increased flexibility in
the administration of the Alfresco environment. This gives Stanford the flexibility to grow or
shrink the different environments based on demand, pricing, or performance. While the need for
flexibility of the environment might be minimal in production, this will be especially advantageous
as Stanford develops on the Alfresco service, and needs to rapidly spin up and down
test/development environments.
Self-Service
The Customer Deployment Portal will be one of the benefits of moving to the proposed
environment. The Customer Deployment Portal is a web-based administration tool that allows
Stanford to self-service their environment. Stanford will be able to set up, deploy, change, and
monitor the different AWS environments through a user-friendly and intuitive web interface.
This includes control over the number of virtual machines, size of the virtual machines, load
balancers, databases, storage sizes and types, and more.
Containers
DevOps
Data as a Service
Indexing and
Search as a Service
Files as a Service
Rolling Deployment
• Nginx
• HA Proxy
• Varnish
• Alfresco Share
• Alfresco Repo
• Alfresco
Analytics
• Alfresco Media
• Activiti
• Solr
• ActiveMQ
• Transform
• Database
• Storage
• LDAP
• Email Server
• Logs
• Monitoring
Security
Large-scale benchmarking
BM01 User scenarios
BM02 User concurrency on single
node
BM03 Solr Performance
BM04 Concurrent Load and Access
– multi-user
BM05 User Invite and Tenant
Provisioning
BM06 Workflow service
performance
BM07 Workflow API performance
BM08 High concurrency in Multi-
Tenancy
https://wiki.alfresco.com/wiki/Benchmark_Testing_with_Alfresco
https://github.com/AlfrescoBenchmark/alfresco-benchmark
Benchmark Server
Tomcat 7
Rest API
MongoDB
Config Data
Services
MongoDB
Test Data
UI
Benchmark Driver (xN)
Benchmark Driver (xN)
Benchmark Driver
Tomcat 7 Extras
(Selenium)
Servers / APIs Servers / APIs
Load Balancer
Servers / APIs
Test
Services
Rest API
BM4 test execution environment: 1.2B docs
UI Test x 20 m3.2xlarge
Simulate 500 Users
• Selenium / Firefox
• 1 hour constant load
• 10 sec think time
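The 500-user, 10-second think-time scenario implies a request rate that depends on how fast the system answers. A rough estimate comes from the standard closed-loop relation: throughput = users / (think time + response time). The sketch below applies it; the 1 s and 3 s response times are borrowed from the figures quoted later in the deck, and the function name is illustrative.

```python
def offered_load(users: int, think_time_s: float, response_time_s: float) -> float:
    """Requests per second generated by a closed loop of users: N / (Z + R)."""
    return users / (think_time_s + response_time_s)


if __name__ == "__main__":
    users, think_time_s, test_duration_s = 500, 10.0, 3600
    for response_time_s in (1.0, 3.0):
        rate = offered_load(users, think_time_s, response_time_s)
        total = rate * test_duration_s
        print(f"response {response_time_s:.0f}s: {rate:.1f} req/s, "
              f"about {total:,.0f} requests over the hour")
```

With 10 seconds of think time dominating each cycle, the rate stays in the range of roughly 38 to 45 requests per second for these response times.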
Alfresco x 10 c3.2xlarge (Alfresco with Share and Repo)
Solr x 20 c3.8xlarge (Sharded Solr Cloud)
Aurora x 1 db.r3.4xlarge
ELB
sites    folders     files           transactions   dbSize (GB)
10,804   1,168,206   1,168,206,000   15,475,064     3,185
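A few ratios make the shape of this dataset easier to see. The sketch below derives them from the row above; the bytes-per-file figure assumes dbSize is in decimal gigabytes (10^9 bytes), which the slide does not state. Names are illustrative.

```python
from typing import Dict

# Dataset row copied from the slide.
DATASET: Dict[str, int] = {
    "sites": 10_804,
    "folders": 1_168_206,
    "files": 1_168_206_000,
    "transactions": 15_475_064,
    "db_size_gb": 3_185,
}


def ratios(d: Dict[str, int]) -> Dict[str, float]:
    """Derived ratios describing how the test repository is laid out."""
    return {
        "folders_per_site": d["folders"] / d["sites"],
        "files_per_folder": d["files"] / d["folders"],
        "files_per_transaction": d["files"] / d["transactions"],
        # Assumption: 1 GB = 10**9 bytes.
        "db_bytes_per_file": d["db_size_gb"] * 1e9 / d["files"],
    }


if __name__ == "__main__":
    for name, value in ratios(DATASET).items():
        print(f"{name:24s} {value:,.2f}")
```

The numbers work out to exactly 1,000 files per folder, about 108 folders per site, about 75 files per transaction, and roughly 2.7 KB of database per file.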
Simulate AWS
Import/Export
(in place)
Benchmark results
• Document load rate 1200 per sec
• 4.3 Million per Hour on 10 nodes!
• Load rate consistent beyond 1B
• CPU loads:
• Database: 70-80% in Bulk Load
• Alfresco: ~50%
• Solr: <60%
• CMIS API Calls (OASIS Standard)
• Aurora indexes efficient at 3.2TB
• NAME (=, LIKE) ~ 20ms
• IN_FOLDER (sorted, limited) ~ 160ms
• Sub-second login times and good,
linear responses for other actions
• Open Library: ~3s
• Page Results: <1s
• Navigate to Site: ~1s
• Individual search ~1s
• 500 concurrent search: ~3s response
• CPU loads:
• Database: <10%
• Alfresco: 25-30%
• Solr: 25-30%
No size-related bottlenecks with 1.2 billion documents
Bulk Operations User Operations
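The bulk-load figures can be cross-checked with simple arithmetic. The sketch below converts the quoted 1,200 documents per second into an hourly rate and a per-node rate, and estimates how long 1,168,206,000 files would take if that rate were sustained throughout. The deck does not state the actual load duration, so the last figure is only what the rate implies.

```python
def docs_per_hour(rate_per_sec: float) -> float:
    """Convert a per-second load rate to documents per hour."""
    return rate_per_sec * 3600


def hours_to_load(total_docs: int, rate_per_sec: float) -> float:
    """Hours needed to load total_docs at a constant rate."""
    return total_docs / rate_per_sec / 3600


if __name__ == "__main__":
    rate, nodes, total_docs = 1200, 10, 1_168_206_000
    hours = hours_to_load(total_docs, rate)
    print(f"{docs_per_hour(rate):,.0f} docs/hour in total")
    print(f"{rate / nodes:.0f} docs/sec per node")
    print(f"{hours:.1f} hours (about {hours / 24:.1f} days) at a constant rate")
```

1,200 per second is 4.32 million per hour, consistent with the "4.3 Million per Hour" bullet, and works out to 120 documents per second per node across 10 nodes.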
AWS
What a difference
3-6 Months
Questionable Scale
Little Redundancy
Lots of $$$
< 30 mins
10x Faster
Elastic, Fault-Tolerant
Open, Cost Effective
ECM ECM ECM
Search Search Search
FS FS FS
Hardware Hardware Hardware
Load Balancer
DR Plan
HSM HSM HSM
ECM ECM ECM
ELB
Alfresco Alfresco Alfresco
Solr Solr Solr
S3
EC2 EC2 EC2
AZ1 AZ2 AZ3
Aurora
EBS
Well, what am I
supposed to do
with all this
frickin’
hardware?!!
Future: globally distributed environment
Distributed FS Distributed FS
Sharding Sharding
Thank you!
Vegas, Baby!
Thank you!
What the world
needs now,
is evaluation sheets!
No, not just for some,
But for @johnnewton!
alfresco.com/awsreinvent

More Related Content

What's hot

Best Practices for running the Oracle Database on EC2 webinar
Best Practices for running the Oracle Database on EC2 webinarBest Practices for running the Oracle Database on EC2 webinar
Best Practices for running the Oracle Database on EC2 webinarTom Laszewski
 
Ceate a Scalable Cloud Architecture
Ceate a Scalable Cloud ArchitectureCeate a Scalable Cloud Architecture
Ceate a Scalable Cloud ArchitectureAmazon Web Services
 
AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...
AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...
AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...Amazon Web Services
 
Optimize MySQL Workloads with Amazon Elastic Block Store - February 2017 AWS ...
Optimize MySQL Workloads with Amazon Elastic Block Store - February 2017 AWS ...Optimize MySQL Workloads with Amazon Elastic Block Store - February 2017 AWS ...
Optimize MySQL Workloads with Amazon Elastic Block Store - February 2017 AWS ...Amazon Web Services
 
Simple, Scalable and Highly Durable NAS in the Cloud – Amazon EFS
Simple, Scalable and Highly Durable NAS in the Cloud – Amazon EFSSimple, Scalable and Highly Durable NAS in the Cloud – Amazon EFS
Simple, Scalable and Highly Durable NAS in the Cloud – Amazon EFSAmazon Web Services
 
Amazon RDS & Amazon Aurora: Relational Databases on AWS - SRV206 - Atlanta AW...
Amazon RDS & Amazon Aurora: Relational Databases on AWS - SRV206 - Atlanta AW...Amazon RDS & Amazon Aurora: Relational Databases on AWS - SRV206 - Atlanta AW...
Amazon RDS & Amazon Aurora: Relational Databases on AWS - SRV206 - Atlanta AW...Amazon Web Services
 
PASS 17 SQL Server on AWS Best Practices
PASS 17 SQL Server on AWS Best PracticesPASS 17 SQL Server on AWS Best Practices
PASS 17 SQL Server on AWS Best PracticesAmazon Web Services
 
(DAT303) Oracle on AWS and Amazon RDS: Secure, Fast, and Scalable
(DAT303) Oracle on AWS and Amazon RDS: Secure, Fast, and Scalable(DAT303) Oracle on AWS and Amazon RDS: Secure, Fast, and Scalable
(DAT303) Oracle on AWS and Amazon RDS: Secure, Fast, and ScalableAmazon Web Services
 
Making (Almost) Any Database Faster and Cheaper with Caching
Making (Almost) Any Database Faster and Cheaper with CachingMaking (Almost) Any Database Faster and Cheaper with Caching
Making (Almost) Any Database Faster and Cheaper with CachingAmazon Web Services
 
(DAT204) NoSQL? No Worries: Build Scalable Apps on AWS NoSQL Services
(DAT204) NoSQL? No Worries: Build Scalable Apps on AWS NoSQL Services(DAT204) NoSQL? No Worries: Build Scalable Apps on AWS NoSQL Services
(DAT204) NoSQL? No Worries: Build Scalable Apps on AWS NoSQL ServicesAmazon Web Services
 
SRV407 Deep Dive on Amazon Aurora
SRV407 Deep Dive on Amazon AuroraSRV407 Deep Dive on Amazon Aurora
SRV407 Deep Dive on Amazon AuroraAmazon Web Services
 
What’s New in Amazon Aurora for MySQL and PostgreSQL
What’s New in Amazon Aurora for MySQL and PostgreSQLWhat’s New in Amazon Aurora for MySQL and PostgreSQL
What’s New in Amazon Aurora for MySQL and PostgreSQLAmazon Web Services
 
Enterprise Grade SQL Server Deployments in AWS
Enterprise Grade SQL Server Deployments in AWSEnterprise Grade SQL Server Deployments in AWS
Enterprise Grade SQL Server Deployments in AWSAmazon Web Services
 
Introduction to Block and File storage on AWS
Introduction to Block and File storage on AWSIntroduction to Block and File storage on AWS
Introduction to Block and File storage on AWSAmazon Web Services
 
Making (Almost) Any Database Faster and Cheaper with Caching
Making (Almost) Any Database Faster and Cheaper with CachingMaking (Almost) Any Database Faster and Cheaper with Caching
Making (Almost) Any Database Faster and Cheaper with CachingAmazon Web Services
 
Best Practices running SQL Server on AWS
Best Practices running SQL Server on AWSBest Practices running SQL Server on AWS
Best Practices running SQL Server on AWSAmazon Web Services
 
Amazon RDS: Deep dive with Oracle
Amazon RDS: Deep dive with OracleAmazon RDS: Deep dive with Oracle
Amazon RDS: Deep dive with OracleAmazon Web Services
 

What's hot (20)

Best Practices for running the Oracle Database on EC2 webinar
Best Practices for running the Oracle Database on EC2 webinarBest Practices for running the Oracle Database on EC2 webinar
Best Practices for running the Oracle Database on EC2 webinar
 
Ceate a Scalable Cloud Architecture
Ceate a Scalable Cloud ArchitectureCeate a Scalable Cloud Architecture
Ceate a Scalable Cloud Architecture
 
What’s New in Amazon Aurora
What’s New in Amazon AuroraWhat’s New in Amazon Aurora
What’s New in Amazon Aurora
 
Deep Dive on Amazon Aurora
Deep Dive on Amazon AuroraDeep Dive on Amazon Aurora
Deep Dive on Amazon Aurora
 
AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...
AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...
AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...
 
Optimize MySQL Workloads with Amazon Elastic Block Store - February 2017 AWS ...
Optimize MySQL Workloads with Amazon Elastic Block Store - February 2017 AWS ...Optimize MySQL Workloads with Amazon Elastic Block Store - February 2017 AWS ...
Optimize MySQL Workloads with Amazon Elastic Block Store - February 2017 AWS ...
 
Simple, Scalable and Highly Durable NAS in the Cloud – Amazon EFS
Simple, Scalable and Highly Durable NAS in the Cloud – Amazon EFSSimple, Scalable and Highly Durable NAS in the Cloud – Amazon EFS
Simple, Scalable and Highly Durable NAS in the Cloud – Amazon EFS
 
Amazon RDS & Amazon Aurora: Relational Databases on AWS - SRV206 - Atlanta AW...
Amazon RDS & Amazon Aurora: Relational Databases on AWS - SRV206 - Atlanta AW...Amazon RDS & Amazon Aurora: Relational Databases on AWS - SRV206 - Atlanta AW...
Amazon RDS & Amazon Aurora: Relational Databases on AWS - SRV206 - Atlanta AW...
 
PASS 17 SQL Server on AWS Best Practices
PASS 17 SQL Server on AWS Best PracticesPASS 17 SQL Server on AWS Best Practices
PASS 17 SQL Server on AWS Best Practices
 
(DAT303) Oracle on AWS and Amazon RDS: Secure, Fast, and Scalable
(DAT303) Oracle on AWS and Amazon RDS: Secure, Fast, and Scalable(DAT303) Oracle on AWS and Amazon RDS: Secure, Fast, and Scalable
(DAT303) Oracle on AWS and Amazon RDS: Secure, Fast, and Scalable
 
Making (Almost) Any Database Faster and Cheaper with Caching
Making (Almost) Any Database Faster and Cheaper with CachingMaking (Almost) Any Database Faster and Cheaper with Caching
Making (Almost) Any Database Faster and Cheaper with Caching
 
(DAT204) NoSQL? No Worries: Build Scalable Apps on AWS NoSQL Services
(DAT204) NoSQL? No Worries: Build Scalable Apps on AWS NoSQL Services(DAT204) NoSQL? No Worries: Build Scalable Apps on AWS NoSQL Services
(DAT204) NoSQL? No Worries: Build Scalable Apps on AWS NoSQL Services
 
SRV407 Deep Dive on Amazon Aurora
SRV407 Deep Dive on Amazon AuroraSRV407 Deep Dive on Amazon Aurora
SRV407 Deep Dive on Amazon Aurora
 
What’s New in Amazon Aurora for MySQL and PostgreSQL
What’s New in Amazon Aurora for MySQL and PostgreSQLWhat’s New in Amazon Aurora for MySQL and PostgreSQL
What’s New in Amazon Aurora for MySQL and PostgreSQL
 
Enterprise Grade SQL Server Deployments in AWS
Enterprise Grade SQL Server Deployments in AWSEnterprise Grade SQL Server Deployments in AWS
Enterprise Grade SQL Server Deployments in AWS
 
Introduction to Block and File storage on AWS
Introduction to Block and File storage on AWSIntroduction to Block and File storage on AWS
Introduction to Block and File storage on AWS
 
Making (Almost) Any Database Faster and Cheaper with Caching
Making (Almost) Any Database Faster and Cheaper with CachingMaking (Almost) Any Database Faster and Cheaper with Caching
Making (Almost) Any Database Faster and Cheaper with Caching
 
Best Practices running SQL Server on AWS
Best Practices running SQL Server on AWSBest Practices running SQL Server on AWS
Best Practices running SQL Server on AWS
 
Amazon RDS Deep Dive
Amazon RDS Deep DiveAmazon RDS Deep Dive
Amazon RDS Deep Dive
 
Amazon RDS: Deep dive with Oracle
Amazon RDS: Deep dive with OracleAmazon RDS: Deep dive with Oracle
Amazon RDS: Deep dive with Oracle
 

Viewers also liked

The Alfresco ECM 1 Billion Document Benchmark on AWS and Aurora - Benchmark ...
The Alfresco ECM 1 Billion Document Benchmark on AWS and Aurora  - Benchmark ...The Alfresco ECM 1 Billion Document Benchmark on AWS and Aurora  - Benchmark ...
The Alfresco ECM 1 Billion Document Benchmark on AWS and Aurora - Benchmark ...Symphony Software Foundation
 
AWS re:Invent 2016: Workshop: Stretching Scalability: Doing more with Amazon ...
AWS re:Invent 2016: Workshop: Stretching Scalability: Doing more with Amazon ...AWS re:Invent 2016: Workshop: Stretching Scalability: Doing more with Amazon ...
AWS re:Invent 2016: Workshop: Stretching Scalability: Doing more with Amazon ...Amazon Web Services
 
Alfresco Day Roma 2015: Alfresco Activiti
Alfresco Day Roma 2015: Alfresco ActivitiAlfresco Day Roma 2015: Alfresco Activiti
Alfresco Day Roma 2015: Alfresco ActivitiAlfresco Software
 
Guide to alfresco monitoring
Guide to alfresco monitoringGuide to alfresco monitoring
Guide to alfresco monitoringMiguel Rodriguez
 
Alfresco Day Amsterdam 2015 - Alfresco One Product Suite Update + Demo
Alfresco Day Amsterdam 2015 - Alfresco One Product Suite Update + DemoAlfresco Day Amsterdam 2015 - Alfresco One Product Suite Update + Demo
Alfresco Day Amsterdam 2015 - Alfresco One Product Suite Update + DemoAlfresco Software
 
Alfresco security best practices CHECK LIST ONLY
Alfresco security best practices CHECK LIST ONLYAlfresco security best practices CHECK LIST ONLY
Alfresco security best practices CHECK LIST ONLYToni de la Fuente
 
Alfresco Day Brussels 2016 - Alfresco One Product Suite Update + Demo
Alfresco Day Brussels 2016 - Alfresco One Product Suite Update + DemoAlfresco Day Brussels 2016 - Alfresco One Product Suite Update + Demo
Alfresco Day Brussels 2016 - Alfresco One Product Suite Update + DemoAlfresco Software
 
Alfresco Security Best Practices Guide
Alfresco Security Best Practices GuideAlfresco Security Best Practices Guide
Alfresco Security Best Practices GuideToni de la Fuente
 
Alfresco Security Best Practices 2014
Alfresco Security Best Practices 2014Alfresco Security Best Practices 2014
Alfresco Security Best Practices 2014Toni de la Fuente
 
AWS re:Invent 2016: Amazon Aurora Best Practices: Getting the Best Out of You...
AWS re:Invent 2016: Amazon Aurora Best Practices: Getting the Best Out of You...AWS re:Invent 2016: Amazon Aurora Best Practices: Getting the Best Out of You...
AWS re:Invent 2016: Amazon Aurora Best Practices: Getting the Best Out of You...Amazon Web Services
 
Sizing your alfresco platform
Sizing your alfresco platformSizing your alfresco platform
Sizing your alfresco platformLuis Cabaceira
 
Amazon Machine Learning Case Study: Predicting Customer Churn
Amazon Machine Learning Case Study: Predicting Customer ChurnAmazon Machine Learning Case Study: Predicting Customer Churn
Amazon Machine Learning Case Study: Predicting Customer ChurnAmazon Web Services
 
Build a Recommendation Engine using Amazon Machine Learning in Real-time
Build a Recommendation Engine using Amazon Machine Learning in Real-timeBuild a Recommendation Engine using Amazon Machine Learning in Real-time
Build a Recommendation Engine using Amazon Machine Learning in Real-timeAmazon Web Services
 
AWS re:Invent 2016: IAM Best Practices to Live By (SAC317)
AWS re:Invent 2016: IAM Best Practices to Live By (SAC317)AWS re:Invent 2016: IAM Best Practices to Live By (SAC317)
AWS re:Invent 2016: IAM Best Practices to Live By (SAC317)Amazon Web Services
 
Cloud Academy & AWS: how we use Amazon Web Services for machine learning and ...
Cloud Academy & AWS: how we use Amazon Web Services for machine learning and ...Cloud Academy & AWS: how we use Amazon Web Services for machine learning and ...
Cloud Academy & AWS: how we use Amazon Web Services for machine learning and ...Alex Casalboni
 
Giuseppe Vaciago, Cybercrime, Digital Investigation e Digital Forensics
Giuseppe Vaciago, Cybercrime, Digital Investigation e Digital ForensicsGiuseppe Vaciago, Cybercrime, Digital Investigation e Digital Forensics
Giuseppe Vaciago, Cybercrime, Digital Investigation e Digital ForensicsAndrea Rossetti
 
2015-06 Roberto Boselli, Dal dato non strutturato alle ontologie
2015-06 Roberto Boselli, Dal dato non strutturato alle ontologie2015-06 Roberto Boselli, Dal dato non strutturato alle ontologie
2015-06 Roberto Boselli, Dal dato non strutturato alle ontologieAndrea Rossetti
 

Viewers also liked (20)

The Alfresco ECM 1 Billion Document Benchmark on AWS and Aurora - Benchmark ...
The Alfresco ECM 1 Billion Document Benchmark on AWS and Aurora  - Benchmark ...The Alfresco ECM 1 Billion Document Benchmark on AWS and Aurora  - Benchmark ...
The Alfresco ECM 1 Billion Document Benchmark on AWS and Aurora - Benchmark ...
 
AWS re:Invent 2016: Workshop: Stretching Scalability: Doing more with Amazon ...
AWS re:Invent 2016: Workshop: Stretching Scalability: Doing more with Amazon ...AWS re:Invent 2016: Workshop: Stretching Scalability: Doing more with Amazon ...
AWS re:Invent 2016: Workshop: Stretching Scalability: Doing more with Amazon ...
 
Alfresco Day Roma 2015: Alfresco Activiti
Alfresco Day Roma 2015: Alfresco ActivitiAlfresco Day Roma 2015: Alfresco Activiti
Alfresco Day Roma 2015: Alfresco Activiti
 
Guide to alfresco monitoring
Guide to alfresco monitoringGuide to alfresco monitoring
Guide to alfresco monitoring
 
Alfresco Day Amsterdam 2015 - Alfresco One Product Suite Update + Demo
Alfresco Day Amsterdam 2015 - Alfresco One Product Suite Update + DemoAlfresco Day Amsterdam 2015 - Alfresco One Product Suite Update + Demo
Alfresco Day Amsterdam 2015 - Alfresco One Product Suite Update + Demo
 
Alfresco security best practices CHECK LIST ONLY
Alfresco security best practices CHECK LIST ONLYAlfresco security best practices CHECK LIST ONLY
Alfresco security best practices CHECK LIST ONLY
 
Alfresco Day Brussels 2016 - Alfresco One Product Suite Update + Demo
Alfresco Day Brussels 2016 - Alfresco One Product Suite Update + DemoAlfresco Day Brussels 2016 - Alfresco One Product Suite Update + Demo
Alfresco Day Brussels 2016 - Alfresco One Product Suite Update + Demo
 
Alfresco Security Best Practices Guide
Alfresco Security Best Practices GuideAlfresco Security Best Practices Guide
Alfresco Security Best Practices Guide
 
Storage and Alfresco
Storage and AlfrescoStorage and Alfresco
Storage and Alfresco
 
Alfresco Security Best Practices 2014
Alfresco Security Best Practices 2014Alfresco Security Best Practices 2014
Alfresco Security Best Practices 2014
 
AWS re:Invent 2016: Amazon Aurora Best Practices: Getting the Best Out of You...
AWS re:Invent 2016: Amazon Aurora Best Practices: Getting the Best Out of You...AWS re:Invent 2016: Amazon Aurora Best Practices: Getting the Best Out of You...
AWS re:Invent 2016: Amazon Aurora Best Practices: Getting the Best Out of You...
 
Alfresco tuning part2
Alfresco tuning part2Alfresco tuning part2
Alfresco tuning part2
 
Sizing your alfresco platform
Sizing your alfresco platformSizing your alfresco platform
Sizing your alfresco platform
 
Alfresco tuning part1
Alfresco tuning part1Alfresco tuning part1
Alfresco tuning part1
 
Amazon Machine Learning Case Study: Predicting Customer Churn
Amazon Machine Learning Case Study: Predicting Customer ChurnAmazon Machine Learning Case Study: Predicting Customer Churn
Amazon Machine Learning Case Study: Predicting Customer Churn
 
Build a Recommendation Engine using Amazon Machine Learning in Real-time
Build a Recommendation Engine using Amazon Machine Learning in Real-timeBuild a Recommendation Engine using Amazon Machine Learning in Real-time
Build a Recommendation Engine using Amazon Machine Learning in Real-time
 
AWS re:Invent 2016: IAM Best Practices to Live By (SAC317)
AWS re:Invent 2016: IAM Best Practices to Live By (SAC317)AWS re:Invent 2016: IAM Best Practices to Live By (SAC317)
AWS re:Invent 2016: IAM Best Practices to Live By (SAC317)
 
Cloud Academy & AWS: how we use Amazon Web Services for machine learning and ...
Cloud Academy & AWS: how we use Amazon Web Services for machine learning and ...Cloud Academy & AWS: how we use Amazon Web Services for machine learning and ...
Cloud Academy & AWS: how we use Amazon Web Services for machine learning and ...
 
Giuseppe Vaciago, Cybercrime, Digital Investigation e Digital Forensics
Giuseppe Vaciago, Cybercrime, Digital Investigation e Digital ForensicsGiuseppe Vaciago, Cybercrime, Digital Investigation e Digital Forensics
Giuseppe Vaciago, Cybercrime, Digital Investigation e Digital Forensics
 
2015-06 Roberto Boselli, Dal dato non strutturato alle ontologie
2015-06 Roberto Boselli, Dal dato non strutturato alle ontologie2015-06 Roberto Boselli, Dal dato non strutturato alle ontologie
2015-06 Roberto Boselli, Dal dato non strutturato alle ontologie
 

Similar to (DAT309) Scaling Massive Content Stores with Amazon Aurora

201511 - Alfresco Day - Platform Update and Roadmap - Gabriele Columbro - Bo...
201511 -  Alfresco Day - Platform Update and Roadmap - Gabriele Columbro - Bo...201511 -  Alfresco Day - Platform Update and Roadmap - Gabriele Columbro - Bo...
201511 - Alfresco Day - Platform Update and Roadmap - Gabriele Columbro - Bo...Symphony Software Foundation
 
Alfresco Day Roma 2015: Platform Update
Alfresco Day Roma 2015: Platform UpdateAlfresco Day Roma 2015: Platform Update
Alfresco Day Roma 2015: Platform UpdateAlfresco Software
 
Developer’s intro to the alfresco platform
Developer’s intro to the alfresco platformDeveloper’s intro to the alfresco platform
Developer’s intro to the alfresco platformAlfresco Software
 
AWS Cloud Kata | Manila - Getting to Scale on AWS
AWS Cloud Kata | Manila - Getting to Scale on AWSAWS Cloud Kata | Manila - Getting to Scale on AWS
AWS Cloud Kata | Manila - Getting to Scale on AWSAmazon Web Services
 
AWS re:Invent 2016: How to Launch a 100K-User Corporate Back Office with Micr...
AWS re:Invent 2016: How to Launch a 100K-User Corporate Back Office with Micr...AWS re:Invent 2016: How to Launch a 100K-User Corporate Back Office with Micr...
AWS re:Invent 2016: How to Launch a 100K-User Corporate Back Office with Micr...Amazon Web Services
 
Running Enterprise Workloads on AWS
Running Enterprise Workloads on AWSRunning Enterprise Workloads on AWS
Running Enterprise Workloads on AWSAmazon Web Services
 
10 Pro Tips for Scaling Your Startup from 0-10M Users
10 Pro Tips for Scaling Your Startup from 0-10M Users10 Pro Tips for Scaling Your Startup from 0-10M Users
10 Pro Tips for Scaling Your Startup from 0-10M UsersAmazon Web Services
 
Oracle Peoplesoft on AWS: A quick introduction
Oracle Peoplesoft on AWS: A quick introductionOracle Peoplesoft on AWS: A quick introduction
(DAT309) Scaling Massive Content Stores with Amazon Aurora

  • 1. © 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. John Newton, Founder & CTO Alfresco @johnnewton October 2015 DAT309 Scaling Massive Content Stores with Amazon Aurora Alfresco on AWS
  • 2. What to expect from the session • Challenges of scaling to billions of documents • Architectural approaches of managing data, search, and storage with Amazon Aurora, Solr, Amazon EBS, and Amazon S3 • The breadth of use cases of content at scale • How to support user applications that require sub-second response times • Moving from large data centers to cost-effective management with AWS and Amazon Aurora
  • 3.
  • 5. Content in dynamic context Consumerized search & discovery Secure & mobile collaboration Invisible Information Governance SIMPLE SMART Powerful metadata, rules & relationships Easy process creation & analysis CONTENT PROCESS Modern & agile architecture OPEN Cloud integration, sync & scale Open source, open APIs Difference @johnnewton alfresco.com/awsreinvent
  • 6. Somewhere deep in the Nevada Desert
  • 7. in a secret underground location
  • 8. someone is trying to store… One Billion Documents!!! http://www.warnerbros.com/austin-powers-international-man-mystery
  • 9. Some have tried before … and failed! We’ll configure 1 Million SharePoint Servers!!!
  • 10. Digital transformation is driving huge flows of content Gartner Nexus PWC 6th Annual Digital IQ Survey, 2014 Digital Business Cloud Social Big Data Mobile
  • 11. Content use cases at scale Enterprise Document Library Loans & Policies Claims & Case Processing Transaction & Logistics Records Research & Analysis Real-time Video Internet of Things Medical & Personnel Records Government Records & Archives Discovery & Litigation
  • 12. Content management applications Document Library Image Management File Sync & Share Search & Retrieval Business Process Management Records Management Case Management Media Management Information Archiving
  • 14. Content vs. data vs. files vs. EFSS Data Files EFSS Content and ECM
  • 15. Content architecture as a big data problem Files / Renditions Metadata Directory Categories Relationships Indexes Search Activities Security People APIs Processes / Tasks Rules Semantics Types Content Object Access Create – Manage – Distribute – Use Context Database Distributed FS Database Solr / Elasticsearch
  • 16. Content at scale in the enterprise Users at Scale Concurrency Content Count Read/Write Throughput Geographic Distribution Volume Size
  • 17. The problem with traditional approaches Provisioning and Administration Geographic Distribution Lack of Agility Lack of Redundancy Lack of Elasticity
  • 18. It’s enough to get one “fired”
  • 19. Unique challenges of very large repositories. Scaling Up: Clustered Servers, Clustered Database, Clustered Indexes, Read Replicas. Scaling Out: Sharding, Federation, Replication, Shared Nothing
  • 20. Content management architecture Alfresco Share Alfresco Repository Alfresco SOLR Activiti Workflow Engine Database FS Content Store Indexes S3 Amazon RDS Amazon Elastic Block Store or Ephemeral PIOPS EBS (or Amazon Glacier) EC2
  • 21. Content management architecture Alfresco Share Alfresco Repository Alfresco SOLR Activiti Workflow Protocols APIs (CMIS) Media Mgmt Desktop and Mail Client Mobile App Cloud Sync Database FS Content Store Indexes Records Management Reports & Analytics Reports and Analytics Server Media Transform Services Transforms Authentication Auditing Rules/Policies Web Scripts Scheduled Jobs
  • 22. Process management architecture Database Elasticsearch Files Amazon EBS Process Mining Activiti Engine Tomcat / Jetty Process Virtual Machine Tasks Processes Jobs Activiti REST App REST Admin REST MS Office Protocol Activiti Analysis AngularJS Activiti App AngularJS Activiti Admin AngularJS MS Office Activiti Mobile iOS / Android Activiti Designer Eclipse
  • 23. Scaling in tiers Alfresco Transformation Server Alfresco Transformation Server Alfresco Solr Alfresco Local Repo (Index Tracking) Alfresco Solr Alfresco Local Repo (Index Tracking) Alfresco Repository Alfresco Repository Alfresco Share Alfresco Share Alfresco Activiti Suite Alfresco Activiti Suite
  • 24. Multi-tenant Cloud Service on AWS RDS Activities Route53 (DNS) S3 ELB Layer7 solr trans /share /alfresco haproxy haproxy varnish /share /alfresco haproxy /share /alfresco haproxy haproxy varnish haproxy varnish web nodes alfresco nodes solr solr trans trans
  • 25. You can scale my server any day, Baby!!!
  • 26. Data meta-model A B C D Folder Folder Doc Doc rendition Class Type Aspect Property Association Constraint Child Association Folder Document contains name name content Auditable who by when rendition Type Child Association Type Association Property Property Property Aspect Model Metadata Organization 1 Billion 15 Billion
  • 27. Database schema
      • Tables
      • Indexes
      • Concurrency
      • Throughput
      Techniques
      • Striping
      • Parallelism
      • Sharding

      Table                  Size (GB)
      alf_child_assoc              448
      alf_content_data             149
      alf_content_url              202
      alf_node                     711
      alf_node_aspects             217
      alf_node_properties        1,524
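As a quick check on the table sizes listed on this slide, the short Python sketch below totals them and reports each table's share of the whole. The sizes are taken from the slide; the function and variable names are illustrative only and are not part of Alfresco.

```python
# Table sizes in GB, as listed on the "Database schema" slide.
TABLE_SIZES_GB = {
    "alf_child_assoc": 448,
    "alf_content_data": 149,
    "alf_content_url": 202,
    "alf_node": 711,
    "alf_node_aspects": 217,
    "alf_node_properties": 1524,
}


def total_size_gb(sizes):
    """Sum of all listed table sizes, in GB."""
    return sum(sizes.values())


def share_by_table(sizes):
    """Fraction of the listed total that each table accounts for."""
    total = total_size_gb(sizes)
    return {name: size / total for name, size in sizes.items()}


if __name__ == "__main__":
    print(f"total: {total_size_gb(TABLE_SIZES_GB)} GB")
    shares = share_by_table(TABLE_SIZES_GB)
    # Largest tables first.
    for name, share in sorted(shares.items(), key=lambda kv: -kv[1]):
        print(f"{name:22s} {share:6.1%}")
```

The listed tables add up to 3,251 GB, which is in line with the "3.2TB" figure quoted on the benchmark results slide, and alf_node_properties alone accounts for a little under half of that total.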
  • 28. Amazon Aurora difference AZ 1 AZ 2 EBS mirror EBS mirror Amazon S3 EBS Standby Instance Primary Instance AZ 1 AZ 3 Amazon S3 Primary Instance AZ 2 Replica Instance • Highly available — synchronous vs. asynchronous replication • Significantly more efficient use of network I/O • Self-healing, fault-tolerant, instant crash recovery MySQL with standby Amazon Aurora async 4/6 quorum PiTR Sequential write Sequential write Distributed writes Amazon Elastic Block Store (EBS)
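The "4/6 quorum" label on this slide can be illustrated with a toy model. The sketch below assumes six storage copies, two in each of the three Availability Zones shown on the slide, and treats a write as durable once any four copies acknowledge it. All names are hypothetical, and this is a simplified illustration of the quorum arithmetic rather than Aurora's actual implementation.

```python
# Assumed layout: six storage copies, two in each of three Availability Zones.
COPIES = [
    ("AZ1", 0), ("AZ1", 1),
    ("AZ2", 0), ("AZ2", 1),
    ("AZ3", 0), ("AZ3", 1),
]
WRITE_QUORUM = 4  # the "4/6" on the slide


def write_succeeds(failed_copies):
    """A write is accepted once at least WRITE_QUORUM healthy copies remain."""
    healthy = [copy for copy in COPIES if copy not in failed_copies]
    return len(healthy) >= WRITE_QUORUM


def az_failure(az):
    """Set of copies lost when a whole Availability Zone is unavailable."""
    return {copy for copy in COPIES if copy[0] == az}


if __name__ == "__main__":
    for az in ("AZ1", "AZ2", "AZ3"):
        print(az, "down, write accepted:", write_succeeds(az_failure(az)))
    # Losing an AZ plus one more copy leaves only three healthy copies.
    lost = az_failure("AZ1") | {("AZ2", 0)}
    print("AZ1 and one more copy down, write accepted:", write_succeeds(lost))
```

Under these assumptions, losing any single Availability Zone still leaves four copies, so writes continue; losing one further copy drops below the write quorum.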
  • 30. Index and search architecture Full-Text Query Metadata Query Facets & Buckets Security Filters Results Processing Credit: Ryan Tobora ThinkBig, Teradata http://thinkbig.teradata.com/solrcloud-terminology/ Text Extraction Metadata Injection & Path Processing Shingles ACL Processing Results Process Term-hit Highlighting x 20 instances
  • 31. Storage Layer File storage architecture In Place: AWS Import/Export Snowball Direct Streaming Aurora EBS Metadata Content Metadata Content Archive Layer S3 Amazon Glacier Metadata Content File System Protocols APIs
  • 32. Security architecture Web Filtering Enterprise Authentication Access Control Lists and Caveats Query Filters and Object Policies
  • 33. Do you suppose we can put it together with some string and Scotch Tape?!!
  • 34. Provisioning VLDB repos
      10 new architecture allows room to scale the environment to support 2013 and 2014 roadmap plans while still supporting an environment that will be reliable and robust. Additionally, this environment supports disaster recovery, guaranteeing that in case of a severe outage backups are stored and the environment can be restored with a quick turnaround. Below is a screenshot of the Customer Deployment Portal, in which Stanford will be able to scale the Alfresco environment seamlessly within a web-based UI.
      Flexibility: This new architecture will also utilize new Cloud Ops tools that allow increased flexibility in the administration of the Alfresco environment. This gives Stanford the flexibility to grow or shrink the different environments based on demand, pricing, or performance. While the need for flexibility might be minimal in production, it will be especially advantageous as Stanford develops on the Alfresco service and needs to rapidly spin test/development environments up and down.
      Self-Service: The Customer Deployment Portal will be one of the benefits of moving to the proposed environment. It is a web-based administration tool that allows Stanford to self-service its environment. Stanford will be able to set up, deploy, change, and monitor the different AWS environments through a user-friendly and intuitive web interface. This includes control over the number of virtual machines, size of the virtual machines, load balancers, databases, storage sizes and types, and more.
      Containers, DevOps, Data as a Service, Indexing and Search as a Service, Files as a Service, Rolling Deployment
      • Nginx • HA Proxy • Varnish • Alfresco Share • Alfresco Repo • Alfresco Analytics • Alfresco Media • Activiti • Solr • ActiveMQ • Transform • Database • Storage • LDAP • Email Server • Logs • Monitoring
      Security
  • 35. Large-scale benchmarking • BM01 User scenarios • BM02 User concurrency on single node • BM03 Solr Performance • BM04 Concurrent Load and Access – multi-user • BM05 User Invite and Tenant Provisioning • BM06 Workflow service performance • BM07 Workflow API performance • BM08 High concurrency in Multi-Tenancy • https://wiki.alfresco.com/wiki/Benchmark_Testing_with_Alfresco • https://github.com/AlfrescoBenchmark/alfresco-benchmark • Diagram labels: Benchmark Server Tomcat 7 Rest API MongoDB Config Data Services MongoDB Test Data UI Benchmark Driver (xN) Benchmark Driver (xN) Benchmark Driver Tomcat 7 Extras (Selenium) Servers / APIs Servers / APIs Load Balancer Servers / APIs Test Services Rest API
  • 36. BM4 test execution environment: 1.2B docs • UI Test x 20 (m3.2xlarge): simulate 500 users, Selenium / Firefox, 1 hour constant load, 10 sec think time • ELB • Alfresco x 10 (c3.2xlarge): Alfresco with Share and Repo • Solr x 20 (c3.8xlarge): sharded Solr Cloud • Aurora x 1 (db.r3.4xlarge) • Data set: sites 10,804; folders 1,168,206; files 1,168,206,000; transactions 15,475,064; dbSize 3,185 GB • Simulate AWS Import/Export (in place)
  • 37. Benchmark results • Document load rate 1200 per sec • 4.3 Million per Hour on 10 nodes! • Load rate consistent beyond 1B • CPU loads: • Database: 70-80% in Bulk Load • Alfresco: ~50% • Solr: <60% • CMIS API Calls (OASIS Standard) • Aurora indexes efficient at 3.2TB • NAME (=, LIKE) ~ 20ms • IN_FOLDER (sorted, limited) ~ 160ms • Sub-second login times and good, linear responses for other actions • Open Library: ~3s • Page Results: <1s • Navigate to Site: ~1s • Individual search ~1s • 500 concurrent search: ~3s response • CPU loads: • Database: <10% • Alfresco: 25-30% • Solr: 25-30% • No size-related bottlenecks with 1.2 billion documents • Panel titles: Bulk Operations, User Operations
  • 38. AWS What a difference 3-6 Months Questionable Scale Little Redundancy Lots of $$$ < 30 mins 10x Faster Elastic, Fault-Tolerant Open, Cost Effective ECM ECM ECM Search Search Search FS FS FS Hardware Hardware Hardware Load Balancer DR Plan HSM HSM HSM ECM ECM ECM ELB Alfresco Alfresco Alfresco Solr Solr Solr S3 EC2 EC2 EC2 AZ1 AZ2 AZ3 Aurora EBS
  • 39. What a difference • 3-6 Months • Questionable Scale • Little Redundancy • Lots of $$$ • < 30 mins • 10x Faster • Elastic, Fault-Tolerant • Open, Cost Effective
  • 40. Well, what am I supposed to do with all this frickin’ hardware?!!
  • 41. Future: globally distributed environment • Distributed FS • Distributed FS • Sharding • Sharding
  • 43. What the world needs now, is evaluation sheets! No, not just for some, But for @johnnewton! alfresco.com/awsreinvent
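
The benchmark figures on slides 36 and 37 can be cross-checked with a little arithmetic. The sketch below is not part of the talk: it only recombines numbers quoted in the transcript, and it assumes that 1 GB means 10**9 bytes, that the "10 nodes" are the ten Alfresco instances, and that simulated users spend no time waiting for responses (so the action rate is an upper bound). All variable names are illustrative.

```python
# Figures copied from the slide 36 and slide 37 transcripts.
LOAD_RATE_PER_SEC = 1200        # "Document load rate 1200 per sec"
ALFRESCO_NODES = 10             # "Alfresco x 10" (assumed to be the "10 nodes")
SITES = 10_804
FOLDERS = 1_168_206
FILES = 1_168_206_000
DB_SIZE_GB = 3_185
SIM_USERS = 500                 # "Simulate 500 Users"
THINK_TIME_SEC = 10             # "10 sec think time"

# Bulk load throughput, overall and per node.
docs_per_hour = LOAD_RATE_PER_SEC * 3600
docs_per_sec_per_node = LOAD_RATE_PER_SEC / ALFRESCO_NODES

# Time to load the whole data set at the quoted constant rate.
days_to_load = FILES / LOAD_RATE_PER_SEC / 86_400

# Shape of the generated data set.
files_per_folder = FILES / FOLDERS
folders_per_site = FOLDERS / SITES

# Database footprint per file (assumption: 1 GB = 10**9 bytes).
db_bytes_per_file = DB_SIZE_GB * 10**9 / FILES

# Upper bound on user actions per second: each user acts at most once
# per think-time interval (response time is ignored here).
max_actions_per_sec = SIM_USERS / THINK_TIME_SEC

print(f"docs/hour:            {docs_per_hour:,}")
print(f"docs/sec per node:    {docs_per_sec_per_node:.0f}")
print(f"days to load all:     {days_to_load:.1f}")
print(f"files per folder:     {files_per_folder:.0f}")
print(f"folders per site:     {folders_per_site:.1f}")
print(f"DB bytes per file:    {db_bytes_per_file:.0f}")
print(f"max user actions/sec: {max_actions_per_sec:.0f}")
```

The hourly figure works out to 4,320,000 documents, which agrees with the "4.3 Million per Hour" on slide 37, and the data set comes to exactly 1,000 files per folder.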