This session shares insights from two recent implementations of cloud-based Big Data clusters. The first solution targets DWH offloading and machine learning in the telecommunications industry. We will cover how we established data transfer between on-premises servers and cloud services, then discuss Spark jobs on an EMR cluster, Hive with the Glue Data Catalog for querying data stored in S3, quick analytics with Athena, hosting and testing Exasol on EC2 instances, and provisioning the cloud infrastructure with CloudFormation. The second case looks at an earlier phase of the AWS adoption lifecycle: an insurance company finding its way into the AWS cloud. Its goal is to complement the existing enterprise DWH with more agile, data-science-oriented tools from the cloud, using machine learning and artificial intelligence to support the claims workflow. In this part we will cover the security setup in IAM and the connectivity configuration for EC2 and EMR, with S3 covering the storage needs.
Speakers: Roland Wammers, Matthias Diekstall, Manuel Marowski, Opitz Consulting Deutschland GmbH