Building a Data Ingestion & Processing Pipeline with Spark & Airflow

Freelance Big Data & Machine Learning Software Engineer
Feb. 11, 2017
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
1 of 18

More Related Content

What's hot

Hadoop summit-ams-2014-04-03Hadoop summit-ams-2014-04-03
Hadoop summit-ams-2014-04-03SDanzanvilliersCriteo
Streaming Sensor Data with Grafana and InfluxDB | Ryan Mckinley | GrafanaStreaming Sensor Data with Grafana and InfluxDB | Ryan Mckinley | Grafana
Streaming Sensor Data with Grafana and InfluxDB | Ryan Mckinley | GrafanaInfluxData
Workflow Engines for HadoopWorkflow Engines for Hadoop
Workflow Engines for HadoopJoe Crobak
Managing data workflows with LuigiManaging data workflows with Luigi
Managing data workflows with LuigiTeemu Kurppa
The IDI Digital Transformation - OpenStack Day Israel 2016The IDI Digital Transformation - OpenStack Day Israel 2016
The IDI Digital Transformation - OpenStack Day Israel 2016Cloud Native Day Tel Aviv
Scalable Eventing Over Apache MesosScalable Eventing Over Apache Mesos
Scalable Eventing Over Apache MesosOlivier Paugam

What's hot(20)

Viewers also liked

How to write maintainable codeHow to write maintainable code
How to write maintainable codePeter Hilton
Data Driven Action : A Primer on Data ScienceData Driven Action : A Primer on Data Science
Data Driven Action : A Primer on Data ScienceSrivatsan Ramanujam
Transforming Data to Unlock Its Latent ValueTransforming Data to Unlock Its Latent Value
Transforming Data to Unlock Its Latent ValueTony Ojeda
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...Ilkay Altintas, Ph.D.
Big datalabBig datalab
Big datalabDavid Chen
Gartner Predictions for HadoopGartner Predictions for Hadoop
Gartner Predictions for HadoopBruno Aziza

Similar to Building a Data Ingestion & Processing Pipeline with Spark & Airflow

RubiOne: Apache Spark as the Backbone of a Retail Analytics Development Envir...RubiOne: Apache Spark as the Backbone of a Retail Analytics Development Envir...
RubiOne: Apache Spark as the Backbone of a Retail Analytics Development Envir...Databricks
Transforming to OpenStack: a sample roadmap to DevOpsTransforming to OpenStack: a sample roadmap to DevOps
Transforming to OpenStack: a sample roadmap to DevOpsNicolas (Nick) Barcet
The 12 Factor AppThe 12 Factor App
The 12 Factor Apprudiyardley
Scaling AI in production using PyTorchScaling AI in production using PyTorch
Scaling AI in production using PyTorchgeetachauhan
OSDC 2018 | Puppet and the Road to Pervasive Automation by Walter GildersleeveOSDC 2018 | Puppet and the Road to Pervasive Automation by Walter Gildersleeve
OSDC 2018 | Puppet and the Road to Pervasive Automation by Walter GildersleeveNETWAYS
200,000 Lines Later: Our Journey to Manageable Puppet Code200,000 Lines Later: Our Journey to Manageable Puppet Code
200,000 Lines Later: Our Journey to Manageable Puppet CodeDavid Danzilio

Similar to Building a Data Ingestion & Processing Pipeline with Spark & Airflow(20)

Recently uploaded

Grumman TBF-1/TBM-1 Avenger I Pilot's Handbook.pdfGrumman TBF-1/TBM-1 Avenger I Pilot's Handbook.pdf
Grumman TBF-1/TBM-1 Avenger I Pilot's Handbook.pdfTahirSadikovi
PORTFOLIO-Pascual-Khristine-Lerie.pdfPORTFOLIO-Pascual-Khristine-Lerie.pdf
PORTFOLIO-Pascual-Khristine-Lerie.pdfKhristinePascual1
Vintage Computing Festival Midwest 18 2023-09-09 What's In A Terminal.pdfVintage Computing Festival Midwest 18 2023-09-09 What's In A Terminal.pdf
Vintage Computing Festival Midwest 18 2023-09-09 What's In A Terminal.pdfRichard Thomson
DCES_Classroom_Building_100_CDs.pdfDCES_Classroom_Building_100_CDs.pdf
DCES_Classroom_Building_100_CDs.pdfRoshelleStahl1
Generative AI for the rest of usGenerative AI for the rest of us
Generative AI for the rest of usMassimo Ferre'
Operations and Supply Chain management.pdfOperations and Supply Chain management.pdf
Operations and Supply Chain management.pdfParmitSihag1

Recently uploaded(20)

Building a Data Ingestion & Processing Pipeline with Spark & Airflow