We've updated our privacy policy. Click here to review the details. Tap here to review the details.
Activate your 30 day free trial to unlock unlimited reading.
Activate your 30 day free trial to continue reading.
Download to read offline
Headstart takes reproducibility very seriously.
Our system needs to be fully auditable: our “match score” is a crucial element for candidate selection. At any point in time we need to be able to:
- Access the models that were being used in production when the match score was computed;
- Examine their code (including all upstream ETL/preprocessing pipelines);
- Examine the data they were trained on;
- Be able to deserialize the models and run diagnostics/tests on them.
To support our requirements, we developed our own internal model versioning system using Git, Docker, CircleCI, AWS S3 and Pipenv.
This presentation will share the design, implementation and functionalities of our versioning system, with a detailed walkthrough using our skill recommendation engine as a streamlined running example.
Headstart takes reproducibility very seriously.
Our system needs to be fully auditable: our “match score” is a crucial element for candidate selection. At any point in time we need to be able to:
- Access the models that were being used in production when the match score was computed;
- Examine their code (including all upstream ETL/preprocessing pipelines);
- Examine the data they were trained on;
- Be able to deserialize the models and run diagnostics/tests on them.
To support our requirements, we developed our own internal model versioning system using Git, Docker, CircleCI, AWS S3 and Pipenv.
This presentation will share the design, implementation and functionalities of our versioning system, with a detailed walkthrough using our skill recommendation engine as a streamlined running example.
You just clipped your first slide!
Clipping is a handy way to collect important slides you want to go back to later. Now customize the name of a clipboard to store your clips.The SlideShare family just got bigger. Enjoy access to millions of ebooks, audiobooks, magazines, and more from Scribd.
Cancel anytime.Unlimited Reading
Learn faster and smarter from top experts
Unlimited Downloading
Download to take your learnings offline and on the go
You also get free access to Scribd!
Instant access to millions of ebooks, audiobooks, magazines, podcasts and more.
Read and listen offline with any device.
Free access to premium services like Tuneln, Mubi and more.
We’ve updated our privacy policy so that we are compliant with changing global privacy regulations and to provide you with insight into the limited ways in which we use your data.
You can read the details below. By accepting, you agree to the updated privacy policy.
Thank you!