SlideShare a Scribd company logo
Welcome to Sanitas
04.09.2017 Zürich R User meetup1
822’500customers
750employees
#1in customer satisfaction1
1.4 mncustomer contacts
2.8 bn assets under management
61 mnCHF operating profit
Founded in 1958
2.7 bn CHF total revenues
2.5 bn CHF total paid-out claims
1 K-Tipp-Survey no. 15/2016
Source: annual report 2016
Zürich R User meetup
R at Sanitas – Workflow, Problems and
Solutions
Patrik Lengacher
Zürich, 04. September 2017
Source: Photoshopped adoption from : http://www.ecns.cn/cns-wire/2015/02-26/155833.shtml/
Who am I?
11/6/2017Zürich R – R at Sanitas3
Patrik Lengacher
Data Manager
Analytics
MSc, ETH, Mathematics
BSc, ETH, Mathematics
Sanitas – Health Insurance
Layzapp AG – Start up
Accenture – Consulting
Paul Scherrer Institute
github.com/plengacher
linkedin.com/in/plengacher
patrik.lengacher@{sanitas,gmail}.com
Agenda
11/6/2017Zürich R – R at Sanitas7
Zürich R – R at Sanitas
The data part of the data science workflow
Data Preparation
Modeling
Operationalize
Reproducibility
corporate identity & corporate design (CI/CD)
The data part of the data science workflow
11/6/2017Zürich R – R at Sanitas9
Data Sources
Data Prep
Optimize Data
Data blending
Data cleansing
Feature engineering
…
Modeling
Apply statistics / machine learning
Prototyping
Train & testing
Validation
Visualization
….
Operationalize
Deploy models
Dashboards
Reports
Presentations
…
Agenda
11/6/2017Zürich R – R at Sanitas10
Zürich R – R at Sanitas
The data part of the data science workflow
Data Preparation
Modeling
Operationalize
Reproducibility
corporate identity & corporate design (CI/CD)
Old New
Data Preparation
Firewalls and tool incompatibilities prohibit a clean workflow. Changes in the IT Infrastructure and the
use of our Rstudio Server lets R interact directly with the data sources.
11/6/2017Zürich R – R at Sanitas11
Agenda
11/6/2017Zürich R – R at Sanitas12
Zürich R – R at Sanitas
The data part of the data science workflow
Data Preparation
Modeling
Operationalize
Reproducibility
corporate identity & corporate design (CI/CD)
Old New
Modeling
If the local machine runs out of resources, isolated solutions were created. Due to the nature of the
solutions, they were not integrated in our infrastructure.
Zürich R – R at Sanitas13
Agenda
11/6/2017Zürich R – R at Sanitas14
Zürich R – R at Sanitas
The data part of the data science workflow
Data Preparation
Modeling
Operationalize
Reproducibility
corporate identity & corporate design (CI/CD)
Old New
Operationalize (I/II)
Results of Ad-Hoc Requests were written as comments into the source files. Furthermore emailing the
results became a time consuming repetitive task.
Zürich R – R at Sanitas15
Old New
Operationalize (II/II)
Results of Ad-Hoc Requests were written as comments into the source files. Furthermore emailing the
results became a time consuming repetitive task.
Zürich R – R at Sanitas16
Agenda
11/6/2017Zürich R – R at Sanitas17
Zürich R – R at Sanitas
The data part of the data science workflow
Data Preparation
Modeling
Operationalize
Reproducibility
corporate identity & corporate design (CI/CD)
Old New
Reproducibility (I/III)
Using markdown, shiny, MicroStrategy helps us to distribute results, presentations and reports. The
tools help to keep the documents up to date.
Zürich R – R at Sanitas18
Old New
Reproducibility (II/III)
Version Control and a SanitasR Package with the commonly used functions help us to reproduce our
results.
Zürich R – R at Sanitas19
Old New
Reproducibility (III/III)
Version Control and a SanitasR Package with the commonly used functions help us to reproduce our
results.
Zürich R – R at Sanitas20
Agenda
11/6/2017Zürich R – R at Sanitas21
Zürich R – R at Sanitas
The data part of the data science workflow
Data Preparation
Modeling
Operationalize
Reproducibility
corporate identity & corporate design (CI/CD)
Old New
CI/CD
Defining colors and themes help to get our plots CI/CD ready.
Zürich R – R at Sanitas22
06.11.2017Zürich R – R at Sanitas23

More Related Content

Similar to R at Sanitas - Workflow, Problems and Solutions

Philippe Borremans - How To Automate Boring Tasks & Increase Productivity In PR
Philippe Borremans - How To Automate Boring Tasks & Increase Productivity In PRPhilippe Borremans - How To Automate Boring Tasks & Increase Productivity In PR
Philippe Borremans - How To Automate Boring Tasks & Increase Productivity In PR
Norsk kommunikasjonsforening
 
Qbt nlp en_2014
Qbt nlp en_2014Qbt nlp en_2014
Qbt nlp en_2014
Qbtsagl3
 
Sap fico presentation full class Demo.pptx
Sap fico presentation full class Demo.pptxSap fico presentation full class Demo.pptx
Sap fico presentation full class Demo.pptx
ravipabbolla1
 
Digitalization and SME: Vision vs. Reality
Digitalization and SME: Vision vs. RealityDigitalization and SME: Vision vs. Reality
Digitalization and SME: Vision vs. Reality
Hans-Dieter Zimmermann
 
Présentation Séminaire Alfabet Décembre 2014
Présentation Séminaire Alfabet Décembre 2014Présentation Séminaire Alfabet Décembre 2014
Présentation Séminaire Alfabet Décembre 2014
Software AG France Community
 
Enterprise Applications of Text Intelligence - Lecture slides
Enterprise Applications of Text Intelligence - Lecture slidesEnterprise Applications of Text Intelligence - Lecture slides
Enterprise Applications of Text Intelligence - Lecture slides
University St. Gallen
 
Second SMART SERVICES SUMMIT 2019 - Decision Making Services for Capital Equi...
Second SMART SERVICES SUMMIT 2019 - Decision Making Services for Capital Equi...Second SMART SERVICES SUMMIT 2019 - Decision Making Services for Capital Equi...
Second SMART SERVICES SUMMIT 2019 - Decision Making Services for Capital Equi...
Shaun West
 
Asia Start-Up InsurTech Award 2017 - Digital Fineprint
Asia Start-Up InsurTech Award 2017 - Digital FineprintAsia Start-Up InsurTech Award 2017 - Digital Fineprint
Asia Start-Up InsurTech Award 2017 - Digital Fineprint
The Digital Insurer
 
PowerBi Event with Machine Learning
PowerBi Event with Machine LearningPowerBi Event with Machine Learning
PowerBi Event with Machine Learning
Trivadis
 
Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...
Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...
Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...
Lora Cecere
 
Data Quality and Governance in a Data Obsessed World
Data Quality and Governance in a Data Obsessed WorldData Quality and Governance in a Data Obsessed World
Data Quality and Governance in a Data Obsessed World
ibi
 
Data-driven leadership culture
Data-driven leadership cultureData-driven leadership culture
Data-driven leadership culture
Juuso Parkkinen
 
Avantium Tibco Study Case
Avantium Tibco Study CaseAvantium Tibco Study Case
Avantium Tibco Study CaseDavid Jim
 
Replex - NOAH17 London
Replex - NOAH17 LondonReplex - NOAH17 London
Replex - NOAH17 London
NOAH Advisors
 
SAP agile proof of concept
SAP agile proof of conceptSAP agile proof of concept
SAP agile proof of concept
Costantino Petrosellini
 
European Decision Support Days 2015 agenda
European Decision Support Days 2015 agendaEuropean Decision Support Days 2015 agenda
European Decision Support Days 2015 agenda
Copperberg
 
IC-SDV 2018 The International Conference on Search, Data and Text Mining and ...
IC-SDV 2018 The International Conference on Search, Data and Text Mining and ...IC-SDV 2018 The International Conference on Search, Data and Text Mining and ...
IC-SDV 2018 The International Conference on Search, Data and Text Mining and ...
Dr. Haxel Consult
 
Data Science & Data Products at Neue Zürcher Zeitung
Data Science & Data Products at Neue Zürcher ZeitungData Science & Data Products at Neue Zürcher Zeitung
Data Science & Data Products at Neue Zürcher Zeitung
René Pfitzner
 
Servitization of office lighting in the context of the Internet of Things
Servitization of office lighting in the context of the Internet of ThingsServitization of office lighting in the context of the Internet of Things
Servitization of office lighting in the context of the Internet of Things
Shaun West
 

Similar to R at Sanitas - Workflow, Problems and Solutions (20)

Philippe Borremans - How To Automate Boring Tasks & Increase Productivity In PR
Philippe Borremans - How To Automate Boring Tasks & Increase Productivity In PRPhilippe Borremans - How To Automate Boring Tasks & Increase Productivity In PR
Philippe Borremans - How To Automate Boring Tasks & Increase Productivity In PR
 
Qbt nlp en_2014
Qbt nlp en_2014Qbt nlp en_2014
Qbt nlp en_2014
 
Sap fico presentation full class Demo.pptx
Sap fico presentation full class Demo.pptxSap fico presentation full class Demo.pptx
Sap fico presentation full class Demo.pptx
 
Digitalization and SME: Vision vs. Reality
Digitalization and SME: Vision vs. RealityDigitalization and SME: Vision vs. Reality
Digitalization and SME: Vision vs. Reality
 
Présentation Séminaire Alfabet Décembre 2014
Présentation Séminaire Alfabet Décembre 2014Présentation Séminaire Alfabet Décembre 2014
Présentation Séminaire Alfabet Décembre 2014
 
Enterprise Applications of Text Intelligence - Lecture slides
Enterprise Applications of Text Intelligence - Lecture slidesEnterprise Applications of Text Intelligence - Lecture slides
Enterprise Applications of Text Intelligence - Lecture slides
 
Infosys Information Platform - Translating data into action
Infosys Information Platform - Translating data into actionInfosys Information Platform - Translating data into action
Infosys Information Platform - Translating data into action
 
Second SMART SERVICES SUMMIT 2019 - Decision Making Services for Capital Equi...
Second SMART SERVICES SUMMIT 2019 - Decision Making Services for Capital Equi...Second SMART SERVICES SUMMIT 2019 - Decision Making Services for Capital Equi...
Second SMART SERVICES SUMMIT 2019 - Decision Making Services for Capital Equi...
 
Asia Start-Up InsurTech Award 2017 - Digital Fineprint
Asia Start-Up InsurTech Award 2017 - Digital FineprintAsia Start-Up InsurTech Award 2017 - Digital Fineprint
Asia Start-Up InsurTech Award 2017 - Digital Fineprint
 
PowerBi Event with Machine Learning
PowerBi Event with Machine LearningPowerBi Event with Machine Learning
PowerBi Event with Machine Learning
 
Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...
Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...
Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...
 
Data Quality and Governance in a Data Obsessed World
Data Quality and Governance in a Data Obsessed WorldData Quality and Governance in a Data Obsessed World
Data Quality and Governance in a Data Obsessed World
 
Data-driven leadership culture
Data-driven leadership cultureData-driven leadership culture
Data-driven leadership culture
 
Avantium Tibco Study Case
Avantium Tibco Study CaseAvantium Tibco Study Case
Avantium Tibco Study Case
 
Replex - NOAH17 London
Replex - NOAH17 LondonReplex - NOAH17 London
Replex - NOAH17 London
 
SAP agile proof of concept
SAP agile proof of conceptSAP agile proof of concept
SAP agile proof of concept
 
European Decision Support Days 2015 agenda
European Decision Support Days 2015 agendaEuropean Decision Support Days 2015 agenda
European Decision Support Days 2015 agenda
 
IC-SDV 2018 The International Conference on Search, Data and Text Mining and ...
IC-SDV 2018 The International Conference on Search, Data and Text Mining and ...IC-SDV 2018 The International Conference on Search, Data and Text Mining and ...
IC-SDV 2018 The International Conference on Search, Data and Text Mining and ...
 
Data Science & Data Products at Neue Zürcher Zeitung
Data Science & Data Products at Neue Zürcher ZeitungData Science & Data Products at Neue Zürcher Zeitung
Data Science & Data Products at Neue Zürcher Zeitung
 
Servitization of office lighting in the context of the Internet of Things
Servitization of office lighting in the context of the Internet of ThingsServitization of office lighting in the context of the Internet of Things
Servitization of office lighting in the context of the Internet of Things
 

More from Zurich_R_User_Group

Anomaly detection - database integrated
Anomaly detection - database integratedAnomaly detection - database integrated
Anomaly detection - database integrated
Zurich_R_User_Group
 
Modeling Bus Bunching
Modeling Bus BunchingModeling Bus Bunching
Modeling Bus Bunching
Zurich_R_User_Group
 
Visualizing the frequency of transit delays using QGIS and the Leaflet javasc...
Visualizing the frequency of transit delays using QGIS and the Leaflet javasc...Visualizing the frequency of transit delays using QGIS and the Leaflet javasc...
Visualizing the frequency of transit delays using QGIS and the Leaflet javasc...
Zurich_R_User_Group
 
Introduction to Renjin, the alternative engine for R
Introduction to Renjin, the alternative engine for R Introduction to Renjin, the alternative engine for R
Introduction to Renjin, the alternative engine for R
Zurich_R_User_Group
 
How to use R in different professions: R for Car Insurance Product (Speaker: ...
How to use R in different professions: R for Car Insurance Product (Speaker: ...How to use R in different professions: R for Car Insurance Product (Speaker: ...
How to use R in different professions: R for Car Insurance Product (Speaker: ...
Zurich_R_User_Group
 
How to use R in different professions: R In Finance (Speaker: Gabriel Foix, M...
How to use R in different professions: R In Finance (Speaker: Gabriel Foix, M...How to use R in different professions: R In Finance (Speaker: Gabriel Foix, M...
How to use R in different professions: R In Finance (Speaker: Gabriel Foix, M...
Zurich_R_User_Group
 
Where South America is Swinging to the Right: An R-Driven Data Journalism Pr...
Where South America is Swinging to the Right:  An R-Driven Data Journalism Pr...Where South America is Swinging to the Right:  An R-Driven Data Journalism Pr...
Where South America is Swinging to the Right: An R-Driven Data Journalism Pr...
Zurich_R_User_Group
 
Visualization Challenge: Mapping Health During Travel
Visualization Challenge: Mapping Health During TravelVisualization Challenge: Mapping Health During Travel
Visualization Challenge: Mapping Health During Travel
Zurich_R_User_Group
 
Zurich R User group: Desc tools
Zurich R User group: Desc tools Zurich R User group: Desc tools
Zurich R User group: Desc tools
Zurich_R_User_Group
 
January 2016 Meetup: Speeding up (big) data manipulation with data.table package
January 2016 Meetup: Speeding up (big) data manipulation with data.table packageJanuary 2016 Meetup: Speeding up (big) data manipulation with data.table package
January 2016 Meetup: Speeding up (big) data manipulation with data.table package
Zurich_R_User_Group
 
December 2015 Meetup - Shiny: Make Your R Code Interactive - Craig Wang
December 2015 Meetup - Shiny: Make Your R Code Interactive - Craig WangDecember 2015 Meetup - Shiny: Make Your R Code Interactive - Craig Wang
December 2015 Meetup - Shiny: Make Your R Code Interactive - Craig Wang
Zurich_R_User_Group
 

More from Zurich_R_User_Group (11)

Anomaly detection - database integrated
Anomaly detection - database integratedAnomaly detection - database integrated
Anomaly detection - database integrated
 
Modeling Bus Bunching
Modeling Bus BunchingModeling Bus Bunching
Modeling Bus Bunching
 
Visualizing the frequency of transit delays using QGIS and the Leaflet javasc...
Visualizing the frequency of transit delays using QGIS and the Leaflet javasc...Visualizing the frequency of transit delays using QGIS and the Leaflet javasc...
Visualizing the frequency of transit delays using QGIS and the Leaflet javasc...
 
Introduction to Renjin, the alternative engine for R
Introduction to Renjin, the alternative engine for R Introduction to Renjin, the alternative engine for R
Introduction to Renjin, the alternative engine for R
 
How to use R in different professions: R for Car Insurance Product (Speaker: ...
How to use R in different professions: R for Car Insurance Product (Speaker: ...How to use R in different professions: R for Car Insurance Product (Speaker: ...
How to use R in different professions: R for Car Insurance Product (Speaker: ...
 
How to use R in different professions: R In Finance (Speaker: Gabriel Foix, M...
How to use R in different professions: R In Finance (Speaker: Gabriel Foix, M...How to use R in different professions: R In Finance (Speaker: Gabriel Foix, M...
How to use R in different professions: R In Finance (Speaker: Gabriel Foix, M...
 
Where South America is Swinging to the Right: An R-Driven Data Journalism Pr...
Where South America is Swinging to the Right:  An R-Driven Data Journalism Pr...Where South America is Swinging to the Right:  An R-Driven Data Journalism Pr...
Where South America is Swinging to the Right: An R-Driven Data Journalism Pr...
 
Visualization Challenge: Mapping Health During Travel
Visualization Challenge: Mapping Health During TravelVisualization Challenge: Mapping Health During Travel
Visualization Challenge: Mapping Health During Travel
 
Zurich R User group: Desc tools
Zurich R User group: Desc tools Zurich R User group: Desc tools
Zurich R User group: Desc tools
 
January 2016 Meetup: Speeding up (big) data manipulation with data.table package
January 2016 Meetup: Speeding up (big) data manipulation with data.table packageJanuary 2016 Meetup: Speeding up (big) data manipulation with data.table package
January 2016 Meetup: Speeding up (big) data manipulation with data.table package
 
December 2015 Meetup - Shiny: Make Your R Code Interactive - Craig Wang
December 2015 Meetup - Shiny: Make Your R Code Interactive - Craig WangDecember 2015 Meetup - Shiny: Make Your R Code Interactive - Craig Wang
December 2015 Meetup - Shiny: Make Your R Code Interactive - Craig Wang
 

Recently uploaded

Quarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden ExtensionsQuarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden Extensions
Max Andersen
 
May Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdfMay Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdf
Adele Miller
 
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing SuiteAI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
Google
 
GraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph TechnologyGraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph Technology
Neo4j
 
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata
 
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI AppAI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
Google
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024
Globus
 
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
A Study of Variable-Role-based Feature Enrichment in Neural Models of CodeA Study of Variable-Role-based Feature Enrichment in Neural Models of Code
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
Aftab Hussain
 
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Mind IT Systems
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
Globus
 
Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024
Paco van Beckhoven
 
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Łukasz Chruściel
 
Introduction to Pygame (Lecture 7 Python Game Development)
Introduction to Pygame (Lecture 7 Python Game Development)Introduction to Pygame (Lecture 7 Python Game Development)
Introduction to Pygame (Lecture 7 Python Game Development)
abdulrafaychaudhry
 
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data AnalysisProviding Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Globus
 
Pro Unity Game Development with C-sharp Book
Pro Unity Game Development with C-sharp BookPro Unity Game Development with C-sharp Book
Pro Unity Game Development with C-sharp Book
abdulrafaychaudhry
 
AI Genie Review: World’s First Open AI WordPress Website Creator
AI Genie Review: World’s First Open AI WordPress Website CreatorAI Genie Review: World’s First Open AI WordPress Website Creator
AI Genie Review: World’s First Open AI WordPress Website Creator
Google
 
Empowering Growth with Best Software Development Company in Noida - Deuglo
Empowering Growth with Best Software  Development Company in Noida - DeugloEmpowering Growth with Best Software  Development Company in Noida - Deuglo
Empowering Growth with Best Software Development Company in Noida - Deuglo
Deuglo Infosystem Pvt Ltd
 
LORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOMLORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOM
lorraineandreiamcidl
 
First Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User EndpointsFirst Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User Endpoints
Globus
 

Recently uploaded (20)

Quarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden ExtensionsQuarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden Extensions
 
May Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdfMay Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdf
 
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing SuiteAI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
 
GraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph TechnologyGraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph Technology
 
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024
 
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI AppAI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024
 
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
A Study of Variable-Role-based Feature Enrichment in Neural Models of CodeA Study of Variable-Role-based Feature Enrichment in Neural Models of Code
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
 
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
 
Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024
 
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
 
Introduction to Pygame (Lecture 7 Python Game Development)
Introduction to Pygame (Lecture 7 Python Game Development)Introduction to Pygame (Lecture 7 Python Game Development)
Introduction to Pygame (Lecture 7 Python Game Development)
 
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data AnalysisProviding Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
 
Pro Unity Game Development with C-sharp Book
Pro Unity Game Development with C-sharp BookPro Unity Game Development with C-sharp Book
Pro Unity Game Development with C-sharp Book
 
AI Genie Review: World’s First Open AI WordPress Website Creator
AI Genie Review: World’s First Open AI WordPress Website CreatorAI Genie Review: World’s First Open AI WordPress Website Creator
AI Genie Review: World’s First Open AI WordPress Website Creator
 
Empowering Growth with Best Software Development Company in Noida - Deuglo
Empowering Growth with Best Software  Development Company in Noida - DeugloEmpowering Growth with Best Software  Development Company in Noida - Deuglo
Empowering Growth with Best Software Development Company in Noida - Deuglo
 
LORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOMLORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOM
 
First Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User EndpointsFirst Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User Endpoints
 

R at Sanitas - Workflow, Problems and Solutions

  • 1. Welcome to Sanitas 04.09.2017 Zürich R User meetup1 822’500customers 750employees #1in customer satisfaction1 1.4 mncustomer contacts 2.8 bn assets under management 61 mnCHF operating profit Founded in 1958 2.7 bn CHF total revenues 2.5 bn CHF total paid-out claims 1 K-Tipp-Survey no. 15/2016 Source: annual report 2016
  • 2. Zürich R User meetup R at Sanitas – Workflow, Problems and Solutions Patrik Lengacher Zürich, 04. September 2017 Source: Photoshopped adoption from : http://www.ecns.cn/cns-wire/2015/02-26/155833.shtml/
  • 3. Who am I? 11/6/2017Zürich R – R at Sanitas3 Patrik Lengacher Data Manager Analytics MSc, ETH, Mathematics BSc, ETH, Mathematics Sanitas – Health Insurance Layzapp AG – Start up Accenture – Consulting Paul Scherrer Institute github.com/plengacher linkedin.com/in/plengacher patrik.lengacher@{sanitas,gmail}.com
  • 4. Agenda 11/6/2017Zürich R – R at Sanitas7 Zürich R – R at Sanitas The data part of the data science workflow Data Preparation Modeling Operationalize Reproducibility corporate identity & corporate design (CI/CD)
  • 5. The data part of the data science workflow 11/6/2017Zürich R – R at Sanitas9 Data Sources Data Prep Optimize Data Data blending Data cleansing Feature engineering … Modeling Apply statistics / machine learning Prototyping Train & testing Validation Visualization …. Operationalize Deploy models Dashboards Reports Presentations …
  • 6. Agenda 11/6/2017Zürich R – R at Sanitas10 Zürich R – R at Sanitas The data part of the data science workflow Data Preparation Modeling Operationalize Reproducibility corporate identity & corporate design (CI/CD)
  • 7. Old New Data Preparation Firewalls and tool incompatibilities prohibit a clean workflow. Changes in the IT Infrastructure and the use of our Rstudio Server lets R interact directly with the data sources. 11/6/2017Zürich R – R at Sanitas11
  • 8. Agenda 11/6/2017Zürich R – R at Sanitas12 Zürich R – R at Sanitas The data part of the data science workflow Data Preparation Modeling Operationalize Reproducibility corporate identity & corporate design (CI/CD)
  • 9. Old New Modeling If the local machine runs out of resources, isolated solutions were created. Due to the nature of the solutions, they were not integrated in our infrastructure. Zürich R – R at Sanitas13
  • 10. Agenda 11/6/2017Zürich R – R at Sanitas14 Zürich R – R at Sanitas The data part of the data science workflow Data Preparation Modeling Operationalize Reproducibility corporate identity & corporate design (CI/CD)
  • 11. Old New Operationalize (I/II) Results of Ad-Hoc Requests were written as comments into the source files. Furthermore emailing the results became a time consuming repetitive task. Zürich R – R at Sanitas15
  • 12. Old New Operationalize (II/II) Results of Ad-Hoc Requests were written as comments into the source files. Furthermore emailing the results became a time consuming repetitive task. Zürich R – R at Sanitas16
  • 13. Agenda 11/6/2017Zürich R – R at Sanitas17 Zürich R – R at Sanitas The data part of the data science workflow Data Preparation Modeling Operationalize Reproducibility corporate identity & corporate design (CI/CD)
  • 14. Old New Reproducibility (I/III) Using markdown, shiny, MicroStrategy helps us to distribute results, presentations and reports. The tools help to keep the documents up to date. Zürich R – R at Sanitas18
  • 15. Old New Reproducibility (II/III) Version Control and a SanitasR Package with the commonly used functions help us to reproduce our results. Zürich R – R at Sanitas19
  • 16. Old New Reproducibility (III/III) Version Control and a SanitasR Package with the commonly used functions help us to reproduce our results. Zürich R – R at Sanitas20
  • 17. Agenda 11/6/2017Zürich R – R at Sanitas21 Zürich R – R at Sanitas The data part of the data science workflow Data Preparation Modeling Operationalize Reproducibility corporate identity & corporate design (CI/CD)
  • 18. Old New CI/CD Defining colors and themes help to get our plots CI/CD ready. Zürich R – R at Sanitas22
  • 19. 06.11.2017Zürich R – R at Sanitas23