A talk for the Breda Dev meetup in which I showed what challenges microservices architectures bring for data analysis and how you can tackle these challenges with Apache Spark on Azure.
3. Event bus
Micro services
• Multiple smaller services that scale independedely
• Each service his own data store
• Data flows between services through the event bus
5. Data analytics challenges with microservices
• A complete picture is there, but spread over a vast landscape
• Most data doesn’t come in a database
• Data changes rapidly
10. Scenario 2: Detect anomalies
• The goal is to detect anomalies on the website and prevent abuse
• Machine learning needed to detect the anomalies
• Data based on the data lake
29. Tips for going in production
• When using streams, always have n+1 worker nodes
• More partitions = more speed
• Longer intervals is slower, but sometimes better