- Presentations
- Documents
- Infographics
Open Source Reliability for Data Lake with Apache Spark by Michael Armbrust
Data Con LA
•
4 years ago
Strata NY 2017 Parquet Arrow roadmap
Julien Le Dem
•
6 years ago
LLAP: long-lived execution in Hive
DataWorks Summit
•
9 years ago
Faster Batch Processing with Cloudera 5.7: Hive-on-Spark is ready for production
Cloudera, Inc.
•
8 years ago
Handling Data Skew Adaptively In Spark Using Dynamic Repartitioning
Spark Summit
•
8 years ago