Holden Karau

Sort by

Latest

Most popular

Validating big data pipelines - FOSDEM 2019

Powering tensor flow with big data using apache beam, flink, and spark cern 2019 (3)

Validating spark ml jobs stopping failures before production on Apache Spark @ Spark Summit 2019

PySpark on Kubernetes @ Python Barcelona March Meetup

Contributing to Apache Spark 3

Validating big data pipelines - Scala eXchange 2018

Validating Big Data Pipelines - Big Data Spain 2018

Big data with Python on kubernetes (pyspark on k8s) - Big Data Spain 2018

Building Recoverable (and optionally async) Pipelines with Apache Spark (+ small revisions)

Keynote Open Source Diversity - Festival del Software Libre

Intro - End to end ML with Kubeflow @ SignalConf 2018

The magic of (data parallel) distributed systems and where it all breaks - Reversim

Using Spark ML on Spark Errors - What do the clusters tell us?

Validating big data jobs - Spark AI Summit EU

Spark Autotuning - Strata EU 2018