How to track and organize your experimentation process

•

0 likes•60 views

Presentation from PyData Munich '19. You will learn how with some simple steps you can have your work organized around creative iterations, reproducible and easy to share with anyone. You will see how to easily track the code, metrics, hyperparameters, learning curves, data versions and more. Bonus point: we will speak with a bot that knows a lot about the experiments.

Technology

Jakub Czakon
Senior Data Scientist
@neptune.ml
● Worked at a data science consulting firm
deepsense.ai
● Joined the team that developed an internal
tool for tracking and managing experiments
● We spinned-off this tool as neptune.ml
● Working on open-source/community side of
things neptune-contrib

● Worked on my machine, when I ran that notebook
● They got 75% on that problem… idk which data version or metric
● I don’t understand this approach... what do you mean this person is
long gone… the confluence page is not really helping
● Mid-work interruptions are not exactly what I like the most
.

● We are missing tracking/organization standards
● Knowledge is scattered across many tools
● People are not really working together

● Time spent fixing/re-doing >> time spent discovering
● “bus factor” goes way up
● Visit to the “alone in the dark” land
● and...

● Magic numbers -> hyperparameters
● Make sure your notebook works
jupyter nbconvert --to script nb.ipynb
python nb.py

● Everything goes into config
● If passed via command -> automagically goes to
config
● If passed via script -> automagically goes into
config
Bonus -> hyper parameter optimization for free(ish)

● Good validation >> insert smth
● Always be (c)logging
● The more metrics the better

● Good validation >> insert smth
● Always be (c)logging
● The more metrics the better score = evaluate(model)
exp.log(‘score’, score)

● Storage is cheap(ish) >> keep old versions
● Log data path
● Log data hash train = pd.read_csv(TRAIN_PATH)
exp.log(‘data_path’, TRAIN_PATH)
md5 = md5_from_file(TRAIN_PATH)
exp.log(‘data_version’, md5)

● You get a better picture, and keep it longer
● Someone may actually be able to understand your problem
● You get a clear picture -> clear head -> better ideas

● Confusion matrix heatmap
● Predictions distributions
● Best/worst predictions
dist_fig = plot_dist(predictions)
exp.log(’figure’, dist_fig)

● Confusion matrix heatmap
● Predictions distributions
● Best/worst predictions

● Insights changelog -> Wiki
● High level tags
exp.tags.append(‘resnet’)
exp.tags.append(‘heavy augmentations’)

Feel free to contact me:
jakub@neptune.ml
https://neptune.ml/
https://neptune-contrib.readthedocs.io
https://medium.com/neptune-ml

Similar to How to track and organize your experimentation process

San Francisco Hacker News - Machine Learning for HackersAdam Gibson

Machine learningMike Martinez

OSDC 2012 | Devops and Open Source by Kris BuyaertNETWAYS

OSDC 2012 | Devops and Open Source by Kris BuytaertNETWAYS

Data Science Challenge presentation given to the CinBITools Meetup GroupDoug Needham

Cloudera Data Science ChallengeMark Nichols, P.E.

How to apply deep learning to 3 d objectsOgushi Masaya

Puppet@Citygrid - Julien Rottenberg - PuppetCamp LA '12Puppet

supervised.pptxMohamedSaied316569

Developer-friendly taskqueues: What you should ask yourself before choosing oneSylvain Zimmer

Developer-friendly task queues: what we learned building MRQ, Sylvain ZimmerPôle Systematic Paris-Region

Fuzzing: The New Unit TestingDmitry Vyukov

DN18 | The Data Janitor Returns | Daniel Molnar | Oberlo/Shopify Dataconomy Media

The Data Janitor Returns | Daniel Molnar | DN18DataconomyGmbH

Beyond unit tests: Deployment and testing for Hadoop/Spark workflowsDataWorks Summit

PPT6: Neuron Demoakira-ai

Hacker vs company, Cloud Cyber Security Automated with Kubernetes - Demi Ben-...Demi Ben-Ari

Practical deep learning for computer visionEran Shlomo

Automating MySQL operations with PuppetKris Buytaert

First adventure within a shell - Andrea Telatin at Quadram InstituteAndrea Telatin

Similar to How to track and organize your experimentation process (20)

San Francisco Hacker News - Machine Learning for Hackers

Machine learning

OSDC 2012 | Devops and Open Source by Kris Buyaert

OSDC 2012 | Devops and Open Source by Kris Buytaert

Data Science Challenge presentation given to the CinBITools Meetup Group

Cloudera Data Science Challenge

How to apply deep learning to 3 d objects

Puppet@Citygrid - Julien Rottenberg - PuppetCamp LA '12

supervised.pptx

Developer-friendly taskqueues: What you should ask yourself before choosing one

Developer-friendly task queues: what we learned building MRQ, Sylvain Zimmer

Fuzzing: The New Unit Testing

DN18 | The Data Janitor Returns | Daniel Molnar | Oberlo/Shopify

The Data Janitor Returns | Daniel Molnar | DN18

Beyond unit tests: Deployment and testing for Hadoop/Spark workflows

PPT6: Neuron Demo

Hacker vs company, Cloud Cyber Security Automated with Kubernetes - Demi Ben-...

Practical deep learning for computer vision

Automating MySQL operations with Puppet

First adventure within a shell - Andrea Telatin at Quadram Institute

Recently uploaded

Partners Life - Insurer Innovation Award 2024The Digital Insurer

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93

Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1

Automating Google Workspace (GWS) & more with Apps Scriptwesley chun

CNv6 Instructor Chapter 6 Quality of Servicegiselly40

08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls

08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science

GenAI Risks & Security Meetup 01052024.pdflior mazor

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous

Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services

From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia

Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun

08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls

Recently uploaded (20)

Partners Life - Insurer Innovation Award 2024

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

Boost Fertility New Invention Ups Success Rates.pdf

Automating Google Workspace (GWS) & more with Apps Script

CNv6 Instructor Chapter 6 Quality of Service

08448380779 Call Girls In Greater Kailash - I Women Seeking Men

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men

2024: Domino Containers - The Next Step. News from the Domino Container commu...

Boost PC performance: How more available memory can improve productivity

How to Troubleshoot Apps for the Modern Connected Worker

Driving Behavioral Change for Information Management through Data-Driven Gree...

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx

GenAI Risks & Security Meetup 01052024.pdf

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke

Strategies for Landing an Oracle DBA Job as a Fresher

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...

Data Cloud, More than a CDP by Matt Robison

08448380779 Call Girls In Civil Lines Women Seeking Men

How to track and organize your experimentation process

2. Jakub Czakon Senior Data Scientist @neptune.ml ● Worked at a data science consulting firm deepsense.ai ● Joined the team that developed an internal tool for tracking and managing experiments ● We spinned-off this tool as neptune.ml ● Working on open-source/community side of things neptune-contrib

3. is not

4. ● Worked on my machine, when I ran that notebook ● They got 75% on that problem… idk which data version or metric ● I don’t understand this approach... what do you mean this person is long gone… the confluence page is not really helping ● Mid-work interruptions are not exactly what I like the most .

5. ● We are missing tracking/organization standards ● Knowledge is scattered across many tools ● People are not really working together

8. ● Time spent fixing/re-doing >> time spent discovering ● “bus factor” goes way up ● Visit to the “alone in the dark” land ● and...

10. ● Magic numbers -> hyperparameters ● Make sure your notebook works jupyter nbconvert --to script nb.ipynb python nb.py

11. ● Everything goes into config ● If passed via command -> automagically goes to config ● If passed via script -> automagically goes into config Bonus -> hyper parameter optimization for free(ish)

12. ● Good validation >> insert smth ● Always be (c)logging ● The more metrics the better

13. ● Good validation >> insert smth ● Always be (c)logging ● The more metrics the better score = evaluate(model) exp.log(‘score’, score)

14. ● Storage is cheap(ish) >> keep old versions ● Log data path ● Log data hash train = pd.read_csv(TRAIN_PATH) exp.log(‘data_path’, TRAIN_PATH) md5 = md5_from_file(TRAIN_PATH) exp.log(‘data_version’, md5)

15. ● You get a better picture, and keep it longer ● Someone may actually be able to understand your problem ● You get a clear picture -> clear head -> better ideas

16. ● Confusion matrix heatmap ● Predictions distributions ● Best/worst predictions dist_fig = plot_dist(predictions) exp.log(’figure’, dist_fig)

17. ● Confusion matrix heatmap ● Predictions distributions ● Best/worst predictions

18. ● Confusion matrix heatmap ● Predictions distributions ● Best/worst predictions

19. ● Insights changelog -> Wiki ● High level tags exp.tags.append(‘resnet’) exp.tags.append(‘heavy augmentations’)

20.

21.

22.

23.

24.

25. Feel free to contact me: jakub@neptune.ml https://neptune.ml/ https://neptune-contrib.readthedocs.io https://medium.com/neptune-ml

How to track and organize your experimentation process

Recommended

Recommended

More Related Content

Similar to How to track and organize your experimentation process

Similar to How to track and organize your experimentation process (20)

Recently uploaded

Recently uploaded (20)

How to track and organize your experimentation process