Holden Karau, profile picture

Holden Karau

Sort by
Validating big data pipelines - FOSDEM 2019
Powering tensor flow with big data using apache beam, flink, and spark cern 2019 (3)
Validating spark ml jobs stopping failures before production on Apache Spark @ Spark Summit 2019
PySpark on Kubernetes @ Python Barcelona March Meetup
Contributing to Apache Spark 3
Validating big data pipelines - Scala eXchange 2018
Validating Big Data Pipelines - Big Data Spain 2018
Big data with Python on kubernetes (pyspark on k8s) - Big Data Spain 2018
Building Recoverable (and optionally async) Pipelines with Apache Spark (+ small revisions)
Keynote Open Source Diversity - Festival del Software Libre
Intro - End to end ML with Kubeflow @ SignalConf 2018
The magic of (data parallel) distributed systems and where it all breaks - Reversim
Using Spark ML on Spark Errors - What do the clusters tell us?
Validating big data jobs - Spark AI Summit EU
Spark Autotuning - Strata EU 2018