Talk at Paris.Py, with captures from demo

Bonobo ETL at Paris.Py / Algolia


  1. Bonobo: Lightweight Extract Transform Load for Python 3.5+
  2. Introduction
  3. [Diagram: Extract, Transform, Load, with sample rows foo, bar, baz]
  4. [Diagram: Extract, Transform, Load, with sample rows foo, bar, baz, extended with more Transform steps, a Join, and outputs to DB, HTTP POST, log?]
  5. Bonobo ETL
     • It’s just Plain Old Python (3) Objects
     • Streaming data through them
     • Write / Configure / Execute / Monitor
     Code first. Eventually a GUI will come.
  6. Bonobo is not…
     • Data Science & Analytics (see pandas)
     • Workflow Scheduler & Monitor (see Airflow)
     • Big Data (see Hadoop, BigQuery, Dataflow, Spark, …)
     • A monkey (spoiler: it’s an ape)
  7. Action
  8. Quick Demo or brown fox?
  9. pip install bonobo, then import it.
  10. Let’s show what we can do with simple callables (generators, here).
  11. Adding them to a graph shows the data flow.
  12. Run them and see what happens in real time.
  13. Use external services; implementations are resolved at runtime.
  14. Provide implementations at runtime, which makes it easy to be 12-factor compliant.
  15. Reading from and writing to files is also easy.
  16. Simplest cases can be run without any code, from the CLI:
        $ bonobo convert orders.csv orders.json
         - CsvReader in=1 out=29 [done]
         - JsonWriter in=29 out=29 [done]
  17. Even with custom transformations in the middle of the chain:
        $ bonobo convert orders.csv -t transform:append_isbn orders.json
         - CsvReader in=1 out=29 [done]
         - append_isbn in=29 out=29 [done]
         - JsonWriter in=29 out=29 [done]
  18. Epilogue
  19. Young
     • First commit: December 2016
     • 23 releases, ~420 commits, 4 contributors
     • 25 releases, ~565 commits, 12 contributors
     • 34 releases, ~774 commits, 21 contributors
     • Current: 0.4.3, 0.5.1, 0.6.1
  20. …
     • Open-Source.
     • Light & Focused.
     • Very few dependencies.
     • Comprehensive standard library.
  21. Integrations
     • Django
     • SQLAlchemy
     • Jupyter
     • Docker
     • Google APIs, OpenDataSoft, Cheese Shop :)
  22. www.bonobo-project.org
  23. Thank you! @bonobo_etl @rdorgueil https://goo.gl/e25eoa
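
Slides 10 to 12 describe simple callables (generators) chained into a graph, with rows streaming through them. The sketch below illustrates that idea in plain Python only. It does not use or reproduce Bonobo's API: `run_chain`, `_apply`, `extract`, `transform`, and `load` are hypothetical names chosen for illustration. Binding `output` at run time is a loose analogy for the runtime-provided implementations mentioned in slides 13 and 14, not how Bonobo resolves services.

```python
from functools import partial
from typing import Any, Callable, Iterable, Iterator


def extract() -> Iterator[str]:
    """Source node (hypothetical): yields rows one at a time."""
    yield "foo"
    yield "bar"
    yield "baz"


def transform(row: str) -> Iterator[str]:
    """Transform node (hypothetical): a generator may yield zero, one, or many rows."""
    yield row.upper()


def load(row: str, *, output: list) -> Iterator[str]:
    """Sink node (hypothetical): `output` is supplied by the caller at run time."""
    output.append(row)
    yield row


def _apply(node: Callable[[Any], Iterable[Any]], rows: Iterable[Any]) -> Iterator[Any]:
    """Feed every incoming row to `node` and stream whatever it yields."""
    for row in rows:
        yield from node(row)


def run_chain(source: Callable[[], Iterable[Any]], *nodes: Callable) -> Iterator[Any]:
    """Link the nodes into a linear chain; rows flow lazily, one at a time."""
    stream: Iterable[Any] = source()
    for node in nodes:
        stream = _apply(node, stream)
    return iter(stream)


# The concrete sink is chosen here, at run time, rather than inside `load`.
collected: list = []
result = list(run_chain(extract, transform, partial(load, output=collected)))
print(result)
```

Because every node is a generator, nothing is buffered between steps: each row passes through the whole chain before the next one is extracted. Swapping `collected` for another list-like object changes where rows end up without touching `load`.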
