DATA VIRTUALIZATION IN THE
CLOUD – ACCELERATING TIME-TO-
VALUE
2
Avinash Deshpande
Principal, Big data and Advanced Analytics
3
Journey to Cloud
Cloud empowers IT organizations to redefine the way data
services are produced and delivered for Analytics.
▪ more scalable … can reconfigure larger cluster in an hour
▪ more efficient
‒ can turn off over the weekend
‒ can clone prod for UAT and drop when done
▪ more reliable
‒ AWS automatically does 90% of what our DBAs did
4
Challenges of traditional warehouse
On Premises Data Warehouse could no longer be extended
to effectively address our evolving business needs:
▪ Growing too fast for Exadata
‒ smallest increase in any resource is a quarter rack
▪ Difficult to set up and tune performance
▪ Difficult to manage usage
‒ Resources usage over time
‒ Queries … impact of each team, process
5
Solution Architecture
AWS Glacier
ERP
POS
JSON
AWS
S3
AWS
Redshift
Tableau
Pentaho BA
Data Interfaces
Web Services
Denodo
Pentaho DI
MDM
Pentaho Operations Mart
RDS
Mysql
postgres
AWS - EC2DRM
SFDC GitHub
Cloudwatch
SNS
IAM Cloudtrail
6
DATA VIRTUALIZATION
▪ Business Layer
‒ Keep the business from trolling through the backend database
‒ Data Consistency through single object, multiple consumers
▪ Security thru Data Virtualization rather than every tool
‒ Hard to keep security in synch across multiple analytic tools
▪ Rapid Prototyping
‒ Add new data source in DV layer first
‒ Move to Redshift / Pentaho after virtual analytics are validated
3/13/2016 LOGITECH CONFIDENTIAL: NOT FOR DISTRIBUTION 7
DATA VIRTUALIZATION - ROADMAP
8
BENEFITS
▪ Proactive – IT has embraced cloud as a model for achieving
innovation through increased efficiency, reliability and agility
▪ Reusability and template development
▪ Rapid innovation within governance structure, balanced
costs, risks and service levels
▪ Greater efficiency and reliability, enabling broader audience
to consume IT services via self-service
9
NEXT STEPS
Want to learn more?
▪Amazon AWS - https://aws.amazon.com/
▪Data Virtualization
https://en.wikipedia.org/wiki/Data_virtualization
http://www.denodo.com/en
▪Columnar databases - https://aws.amazon.com/redshift/
▪Pentaho DI and BA - http://community.pentaho.com/
THANK YOU
<Avi - @Avinash49799752 >

Data virtualization in the cloud – accelerating time to-value

  • 2.
    DATA VIRTUALIZATION INTHE CLOUD – ACCELERATING TIME-TO- VALUE 2 Avinash Deshpande Principal, Big data and Advanced Analytics
  • 3.
    3 Journey to Cloud Cloudempowers IT organizations to redefine the way data services are produced and delivered for Analytics. ▪ more scalable … can reconfigure larger cluster in an hour ▪ more efficient ‒ can turn off over the weekend ‒ can clone prod for UAT and drop when done ▪ more reliable ‒ AWS automatically does 90% of what our DBAs did
  • 4.
    4 Challenges of traditionalwarehouse On Premises Data Warehouse could no longer be extended to effectively address our evolving business needs: ▪ Growing too fast for Exadata ‒ smallest increase in any resource is a quarter rack ▪ Difficult to set up and tune performance ▪ Difficult to manage usage ‒ Resources usage over time ‒ Queries … impact of each team, process
  • 5.
    5 Solution Architecture AWS Glacier ERP POS JSON AWS S3 AWS Redshift Tableau PentahoBA Data Interfaces Web Services Denodo Pentaho DI MDM Pentaho Operations Mart RDS Mysql postgres AWS - EC2DRM SFDC GitHub Cloudwatch SNS IAM Cloudtrail
  • 6.
    6 DATA VIRTUALIZATION ▪ BusinessLayer ‒ Keep the business from trolling through the backend database ‒ Data Consistency through single object, multiple consumers ▪ Security thru Data Virtualization rather than every tool ‒ Hard to keep security in synch across multiple analytic tools ▪ Rapid Prototyping ‒ Add new data source in DV layer first ‒ Move to Redshift / Pentaho after virtual analytics are validated
  • 7.
    3/13/2016 LOGITECH CONFIDENTIAL:NOT FOR DISTRIBUTION 7 DATA VIRTUALIZATION - ROADMAP
  • 8.
    8 BENEFITS ▪ Proactive –IT has embraced cloud as a model for achieving innovation through increased efficiency, reliability and agility ▪ Reusability and template development ▪ Rapid innovation within governance structure, balanced costs, risks and service levels ▪ Greater efficiency and reliability, enabling broader audience to consume IT services via self-service
  • 9.
    9 NEXT STEPS Want tolearn more? ▪Amazon AWS - https://aws.amazon.com/ ▪Data Virtualization https://en.wikipedia.org/wiki/Data_virtualization http://www.denodo.com/en ▪Columnar databases - https://aws.amazon.com/redshift/ ▪Pentaho DI and BA - http://community.pentaho.com/
  • 10.
    THANK YOU <Avi -@Avinash49799752 >