3.
Extract Transform Load
[Diagram labels: foo, bar, baz]
4.
Extract Transform Load
[Diagram labels: foo, bar, baz; Extract; Transform; Load; Transform; more; Join; DB; HTTP POST; log?]
5.
Bonobo ETL
• It’s just Plain Old Python (3) Objects
• Streaming data through them
• Write / Configure / Execute / Monitor
Code first. Eventually a GUI will come.
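A minimal sketch of the idea on this slide, in plain Python only: ordinary callables, with rows streamed through them one at a time. It deliberately does not use Bonobo itself; `run_chain` and the three-step layout are assumptions made for illustration, not Bonobo's API.

```python
from typing import Callable, Iterable, Iterator, List


def extract() -> Iterator[str]:
    """Yield the sample rows one at a time (a generator, so nothing is buffered)."""
    yield "foo"
    yield "bar"
    yield "baz"


def transform(row: str) -> str:
    """Transform a single row; here, just upper-case it."""
    return row.upper()


def run_chain(
    source: Callable[[], Iterable[str]],
    step: Callable[[str], str],
    sink: Callable[[str], None],
) -> None:
    """Hypothetical helper: push each extracted row through step, then into sink."""
    for row in source():
        sink(step(row))


# "Load" is any callable that accepts a row; a list's append is enough here.
loaded: List[str] = []
run_chain(extract, transform, loaded.append)
print(loaded)
```

Each stage is an ordinary function or generator, so it can be tested on its own before being chained.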
6.
Bonobo is not…
• Data Science & Analytics (see pandas)
• Workflow Scheduler & Monitor (see Airflow)
• Big Data (see Hadoop, BigQuery, Dataflow, Spark, …)
• A monkey (spoiler: it’s an ape)
16.
The simplest cases can be run from the CLI, without any code
$ bonobo convert orders.csv orders.json
- CsvReader in=1 out=29 [done]
- JsonWriter in=29 out=29 [done]
17.
Even with custom transformations in the middle of the chain
$ bonobo convert orders.csv -t transform:append_isbn orders.json
- CsvReader in=1 out=29 [done]
- append_isbn in=29 out=29 [done]
- JsonWriter in=29 out=29 [done]
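The `-t transform:append_isbn` argument above names a module (`transform`) and a callable inside it (`append_isbn`). The slides do not show that function, so the following is only a guess at its shape: it assumes each row arrives as a dict with a `title` key, which may not match how Bonobo actually passes rows, and the lookup table is made up.

```python
# transform.py (hypothetical contents; the real module is not shown in the slides)

# Made-up lookup table with a placeholder value, for illustration only.
ISBN_BY_TITLE = {"Example Book": "000-0-00-000000-0"}


def append_isbn(row: dict) -> dict:
    """Return a copy of the row with an added 'isbn' field.

    Assumes the row is a dict with a 'title' key; unknown titles get an
    empty string rather than raising.
    """
    return {**row, "isbn": ISBN_BY_TITLE.get(row.get("title"), "")}


print(append_isbn({"title": "Example Book", "qty": "2"}))
```

Returning a new dict instead of mutating the input keeps the step free of side effects, which makes it easy to test outside the chain.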