Personal Information
Organization / Workplace
Hangzhou, Zhejiang, China
Occupation
Hikvision Research Institute, Big Data Architecture Engineer; big data for security, Safe City, Smart City
Industry
Technology / Software / Internet
Website
kaidata.github.io
About
Big data processing
Paris ML meetup
Yves Raimond
•
8 years ago
Streaming Event Time Partitioning with Apache Flink and Apache Iceberg - Julia Bennett, Netflix
Flink Forward
•
4 years ago
What's new in 1.9.0 blink planner - Kurt Young, Alibaba
Flink Forward
•
4 years ago
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Dremio Corporation
•
6 years ago
Apache Arrow: In Theory, In Practice
Dremio Corporation
•
6 years ago
Improving Apache Spark's Reliability with DataSourceV2
Databricks
•
4 years ago
Fast and Reliable Apache Spark SQL Engine
Databricks
•
4 years ago
Dynamic Partition Pruning in Apache Spark
Databricks
•
4 years ago
Building Reliable Data Lakes at Scale with Delta Lake
Databricks
•
4 years ago
Designing ETL Pipelines with Structured Streaming and Delta Lake—How to Architect Things Right
Databricks
•
4 years ago
Cowboy Dating with Big Data or DWH Evolution in Action, Борис Трофимов
Sigma Software
•
4 years ago
Apache Spark Core – Practical Optimization
Databricks
•
4 years ago
The Parquet Format and Performance Optimization Opportunities
Databricks
•
4 years ago
Driver Location Intelligence at Scale using Apache Spark, Delta Lake, and MLflow on Databricks
Databricks
•
4 years ago
Petabytes, Exabytes, and Beyond: Managing Delta Lakes for Interactive Queries at Scale
Databricks
•
4 years ago
Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
Databricks
•
5 years ago
Deep learning at scale in Azure
Microsoft Tech Community
•
5 years ago
Large Scale Deep Learning with TensorFlow
Jen Aman
•
7 years ago
Deep Learning at Scale
Herman Wu
•
5 years ago
How to Become a Data Scientist
ryanorban
•
9 years ago