- Presentations
- Documents
- Infographics
Optimizing Apache Spark SQL Joins
Databricks
•
7 years ago
Building Robust ETL Pipelines with Apache Spark
Databricks
•
6 years ago
Modern Data Architecture
Alexey Grishchenko
•
8 years ago
sizeof(Object): how much memory objects take on JVMs and when this may matter
Dawid Weiss
•
11 years ago
cstore_fdw: Columnar Storage for PostgreSQL
Citus Data
•
9 years ago
2016 Spark Summit East Keynote: Matei Zaharia
Databricks
•
8 years ago
Deep Learning and the state of AI / 2016
Grigory Sapunov
•
8 years ago
Social network analysis & Big Data - Telecommunications and more
Wael Elrifai
•
10 years ago
Thrift vs Protocol Buffers vs Avro - Biased Comparison
Igor Anishchenko
•
11 years ago
Parquet Hadoop Summit 2013
Julien Le Dem
•
10 years ago
Choosing an HDFS data storage format- Avro vs. Parquet and more - StampedeCon 2015
StampedeCon
•
8 years ago
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
DataWorks Summit
•
8 years ago
Apache Storm 0.9 basic training - Verisign
Michael Noll
•
9 years ago
Scala dreaded underscore
RUDDER
•
13 years ago
New Security Features in Apache HBase 0.98: An Operator's Guide
HBaseCon
•
9 years ago
Near-realtime analytics with Kafka and HBase
dave_revell
•
11 years ago
Security needs in Hadoop’s Current and Future – How Apache Ranger can help?
DataWorks Summit
•
9 years ago
STORM as an ETL Engine to HADOOP
DataWorks Summit
•
9 years ago
Realtime Analytics with Hadoop and HBase
larsgeorge
•
12 years ago
Indexed Hive
NikhilDeshpande
•
13 years ago