kaidata Lee

Paris ML meetup

Yves Raimond • 8 years ago

Streaming Event Time Partitioning with Apache Flink and Apache Iceberg - Julia Bennett, Netflix

Flink Forward • 4 years ago

What's new in 1.9.0 blink planner - Kurt Young, Alibaba

Flink Forward • 4 years ago

Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache

Dremio Corporation • 6 years ago

Apache Arrow: In Theory, In Practice

Dremio Corporation • 6 years ago

Improving Apache Spark's Reliability with DataSourceV2

Databricks • 4 years ago

Fast and Reliable Apache Spark SQL Engine

Databricks • 4 years ago

Dynamic Partition Pruning in Apache Spark

Databricks • 4 years ago

Building Reliable Data Lakes at Scale with Delta Lake

Databricks • 4 years ago

Designing ETL Pipelines with Structured Streaming and Delta Lake—How to Architect Things Right

Databricks • 4 years ago

Cowboy Dating with Big Data or DWH Evolution in Action, Борис Трофимов

Sigma Software • 4 years ago

Apache Spark Core – Practical Optimization

Databricks • 4 years ago

The Parquet Format and Performance Optimization Opportunities

Databricks • 4 years ago

Driver Location Intelligence at Scale using Apache Spark, Delta Lake, and MLflow on Databricks

Databricks • 4 years ago

Petabytes, Exabytes, and Beyond: Managing Delta Lakes for Interactive Queries at Scale

Databricks • 4 years ago

Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang

Databricks • 5 years ago

Deep learning at scale in Azure

Microsoft Tech Community • 5 years ago

Large Scale Deep Learning with TensorFlow

Jen Aman • 7 years ago

Deep Learning at Scale

Herman Wu • 5 years ago

How to Become a Data Scientist

ryanorban • 9 years ago