19. •
•
•
• Kafka provides seamless integration between information of producers and consumers without blocking the producers of the
information, and without letting producers know who the final consumers are.
• Each consumer keeps control of its own offset (read)
• On demand topic creation
SPARK STREAMING OVERVIEW
20. • ETL and ELT, wide catalog of sources and sinks
• Flexible design of topologies and agent deployment strategies.
• Data transformation, thanks to interceptors.
•
•
SPARK STREAMING OVERVIEW
63. • Stateful transformations (updateStateByKey,
reduceByKeyAndWindow).
• As fault-tolerance mechanism, when driver crashes.
HDFS is mandatory if you are going to use operations that requires checkpointing.
SPARK STREAMING OVERVIEW