SlideShare a Scribd company logo
Machine Learning
Ram Seshadri
March 2020 Slide 1
AutoViz and Auto_ViML
Faster Time to Insights
using Automated
Visualization and Machine
Learning
“Machine learning teams are still struggling to take advantage of ML
due to challenges with inflexible frameworks, lack of reproducibility,
collaboration issues, and immature software tools”
Cecelia Shao
Comet.ml
“Why is my Data Science team taking sooo
long to complete a simple project?”
-- A Frustrated CIO
Slide 2
Machine learning teams are still struggling to take advantage of ML due to
challenges with inflexible frameworks, lack of reproducibility, collaboration
issues, and immature software tools.
The Answer?
7/24/2019
Faster
Visualization
Automatic
Feature
Selection
• Auto_ViML
Automatic
Model Selection
and Tuning
• Auto_ViML
One Click Model
Serving and
Production
Auto_ViML was designed along with AutoViz to Build Variant
Interpretable Machine Learning Models Fast!
__
● They are proprietary and expensive (lock-in)
● Black Boxes which are too complex to interpret
● Very little reproducibility outside of tool
HOWEVER CURRENT TOOLS ARE LIMITED BECAUSE...
• AutoViz
INTRODUCING A SIMPLER APPROACH TO AUTO-ML
Slide 3
How can we make DATA SCIENTISTS more productive?
●Open Source Tools for Faster Time to Insights with Design Goals as:
○ Simple: Invoke them with a single Line of Code (each)
○ Flexible: Suited to any kind of structured data set with no Prep required
○ Incremental: Can be used by anyone from beginners to experts alike
○ Experimental: Compare multiple visualization methods and models step by step
○ Interpretable: get clear explanation of steps taken with validation graphs
○ Reproducible: No Black Box. Reproducible model pipelines and outputs
○ Extensible: Open Source with contributions from Python and DS community
I Built AutoViz and Auto_ViML to make my own life easier.
Hope it will do the same for you.
Slide 4
What is Auto_Viz and Auto_ViML?
AutoViz and Auto_ViML do not completely eliminate the need for data scientists. But they speed up some
steps in the ML workflow which makes data scientists more productive!
Slide 5
Where do they fit in ML Workflow?
AutoViz Auto_ViML
What is AutoViz?
Slide 6
AutoViz enables you to automatically
visualize any data set with a Single Line of
Code. It automatically:
1. Selects a Random Sample from the Data
Set (if the Data Set is very large)
2. Selects most important features using
ML (if Number of Variables is very large)
3. Selects Best Methods to Visualize Data
for a given problem
4. Provides Charts to be saved in PNG,
JPG, and SVG Formats
OVERVIEW
Why AutoViz?
Slide 7
Help explain your hypotheses and variable selection better to others
BENEFITS
Systematic Look for insights systematically rather than through “gut instinct” or
domain knowledge
Simple Reduce features to the most important ones to deliver simple yet
powerful insights
Explainable
How AutoViz Works
Slide 8
Variable
Classification
Problem
Identification
Complex
Interactions
AutoViz classifies features into
highly granular data types to
determine how best to
represent them in Charts
AutoViz can visualize any
dataset for a given target:
Regression, Classification, Time
Series, Clustering and more
Most charts involve more than
one variable helping to deliver
powerful insights with minimal
effort
Select the Most
Important Features
Select the Best
Charts
Deliver them Fast!
AutoViz uses the powerful ML
algorithm, XGBoost, to select
important features given the
target variable
AutoViz selects the best ways
to visualize your data to extract
insights from your data
AutoViz selects statistically
valid sample data to visualize
(in case data set is very large)
Design Goals
Implementation
AutoViz PROCESS
https://github.com/AutoViML/AutoViz
AutoViz
from autoviz.AutoViz_Class import AutoViz_Class
AV = AutoViz_Class()
2. Next Import...
3. Run AutoViz.
dft = AV.AutoViz('', sep, target, df)
See Results...
Slide 9
Thanks to UCI Machine Learning Repository for all data sets in this presentation:
Dua, D. and Graff, C. (2019). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA:
University of California, School of Information and Computer Science.
1. First, Install...
pip install autoviz
Installing and Running
AutoViz is as easy as 1, 2, 3!
Source Code:
pip install autoviz+
AutoViz Downloads
Slide 10
AutoViz has now
been downloaded
more than 17K times+
AutoViz Downloads
* As of August 23, 2019
Chart Source Courtesy:
https://packaging.python.org/guides/analyzing-pypi-package-downloads/
+ Stats Source Courtesy:
PePy org
AutoViz: Housing*
● Number of Rooms and Median Value of
Homes seem to be highly correlated
● As Age of Building increases, Median
Value decreases albeit slowly
INSIGHTS
● NOX and DIS seem to be highly
correlated though they seem
to have a polynomial or
non-linear relationship
Slide 11* Thanks to UCI Machine Learning Repository
https://archive.ics.uci.edu/ml/machine-learning-databases/housing/
AutoViz: Housing*
● Both CRIM and ZN are highly skewed
● Both may require a transformation
INSIGHTS
● PTRATIO and DIS seem to be
somewhat skewed as well but
don’t require transformations
Slide 12* Thanks to UCI Machine Learning Repository
https://archive.ics.uci.edu/ml/machine-learning-databases/housing/
AutoViz: Housing*
RM, LSTAT, TAX, INDUS, AGE, and CRIM seem to be
decently correlated with Target. May be worth
exploring if they come up as Important Features.
INSIGHTS
● Average Median Value of
homes varies widely by CHAS
and RAD. Hence would be
important features in any
model.
Slide 13* Thanks to UCI Machine Learning Repository
https://archive.ics.uci.edu/ml/machine-learning-databases/housing/
Build your first baseline model using:
1. Features selected by AutoViz
2. Iterate through models and visualization until
satisfied with result
How AutoViz enables better Modeling
Slide 14
AutoViz
Look at Charts and Graphs generated by AutoViz:
1. Sometimes, AV can generate over 1000 charts!
2. Save Charts you like best by right click and save
3. You also have the option of getting charts in
SVG or PNG or JPEG formats
Call AutoViz from Python Jupyter Notebooks:
1. Give directory path+filename, separator in
data, target variable name (can be empty
string)
2. If DataFrame available instead of filename,
give name of DataFrame in above
3. Run AutoViz
Select Variables:
1. That appear most promising to Modeling goal
2. Rerun AutoViz by removing variables or adding
new variables that provide more insights
1. Call AutoViz with your Filename and target
2. Look at Charts to derive Insights into Data3. Add/Remove Variables and re-run AutoViz
4. Build your baseline model with AutoViz features
How to Build a better model?
Slide 15
Remove Low Information
and Redundant Features
Add Polynomial and
Interaction, Other
Features
Select Models from
Simple to Complex and
Perform Tuning
Add Entropy Binning,
Stacking to K-Means
Featurizers to model
Add Imbalanced sampling
and training
Perform Ensembling of
Multiple Types of models
BUILD A ViML Model!
(VARIANT INTERPRETABLE MACHINE LEARNING MODEL, Step by Step)
PROCESS
What is Auto_ViML?
Slide 16
INSIGHTFUL
INTERPRETA
BLE
VARIANT
REPRODUCI
BLE
ITERATIVE
ViML helps you
try as many as
15 different
models with one
API
ViML reduces
features to the
bare minimum
(as much 10-90%
reduction in
features)
ViML is fully
reproducible by
explaining its
steps (full
transparency)
Delivers insights
into which
models and
techniques will
work best with
your data
ViML helps you
build you more
complex models
after trying out
simpler options
VARIANT INTERPRETABLE MACHINE LEARNING MODELS with one API
Why Auto_ViML?
Slide 17
MULTIPLE MODELS
TRANSPARENCY
FEATURE
ENGINEERING
AUTOMATIC
FEATURE
SELECTION
SYSTEMATIC Auto ViML was designed from the ground-up to mimic how a Data
Scientist would approach a Modeling Problem.
Enables selective model complexity by adding features and complexity
step by step
Provides Deep Insights into the Data Set with Full Transparency
Models with Fewer Features result in Simpler Models. Auto_ViML
Produces models with 10-99% Fewer Features than Regular Models
without Significant Loss of Predictive Power*
* Based on my experience. Your results may vary.
Build and test multiple models thru’ Hyper Tuning and Cross Validation
BENEFITS
Now with
CatBoost!!
Auto_ViML LETS YOU TRY MULTIPLE APPROACHES
Slide 18
You can access all the powerful features of with one line of Python Code after you import.
You can turn on and turn off features and flags to see how they impact Model.
TRY
MULTIPLE
APPROACHES
TO GET THE
BEST MODEL
INTERACTIONS
vs. NO
INTERACTIONS
BOOSTING
vs. BAGGING
ENSEMBLING
vs. STACKING
IMBALANCED
vs.
BALANCED
GRIDSEARCH
vs. RANDOM
FEATURE
IMPORTANCES
Keep Upgrading
Auto_ViML version
since it is updated
monthly!
Predictions
from 12
models
Downsampling
supported
HyperOpt
coming
SHAP included
0 = No Intxns
1 = Pairwise Intxns
2 = Squared Vars
model, features, trainm, testm = Auto_ViML(train, target, test, sample_submission='',
hyper_param='GS', scoring_parameter='rmse',
feature_reduction=True,
Boosting_Flag=None, KMeans_Featurizer=False, Add_Poly=0,
Stacking_Flag=False, Binning_Flag=False, Imbalanced_Flag=True,
verbose=0)
Github: https://github.com/AutoViML/Auto_ViML
Auto_ViML
from autoviml.Auto_ViML import Auto_ViML
2. Next, Import...
3. Run Auto_VIML.
Slide 19
Get a fully trained Model, best Features and transformed Train and
Test data...
Thanks to UCI Machine Learning Repository for all data sets in this presentation:
Dua, D. and Graff, C. (2019). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA:
University of California, School of Information and Computer Science.
Installing and Running
Auto_ViML is as easy as
1, 2, 3!
pip install autoviml
1. First Install...
pip install autoviml
Auto_ViML Downloads
Slide 20
Chart Source Courtesy:
https://packaging.python.org/guides/analyzing-pypi-package-downloads/
Auto_ViML has now
nearly 50K
downloads+
Auto_ViML Downloads
* As of March 20, 2020
+ Stats Source Courtesy:
PePy org
Here is an example of a Regression data set: Boston
Housing*. There are 13 predictors in the dataset.
But Auto_ViML finds that only 10 variables are needed
to get the job done. For example:
['RM', 'LSTAT', 'NOX', 'PTRATIO', 'CRIM', 'TAX',
'CHAS', 'B', 'RAD', 'ZN']
Auto_ViML: Boston Housing*
Slide 21
DATA SET SIZE 506 x
14
TIME TAKEN
6 secs
Variables Selected
10
FEATURE REDUCTION
24%
Results:
Start with Linear Model
* Thanks to UCI Machine Learning Repository
https://archive.ics.uci.edu/ml/machine-learning-databases/housing/
Auto_ViML: Boston Housing
Slide 22
Results:
Move to Random Forests
Time Taken = 30 seconds
Thanks to UCI Machine Learning Repository for all data sets in this presentation:
Dua, D. and Graff, C. (2019). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA:
University of California, School of Information and Computer Science.
Auto_ViML: Boston Housing
Slide 23
Results:
Close with XGBoost
* Thanks to UCI Machine Learning Repository
https://archive.ics.uci.edu/ml/machine-learning-databases/housing/
Slide 24
Auto_ViML: Boston Housing
Slide 25
Linear Model with Interaction Variables
Ensemble Model with Binning
Forests Model with Binning Numerics
XGBoost Model with Stacking
Multiple Models
* Thanks to UCI Machine Learning Repository
https://archive.ics.uci.edu/ml/machine-learning-databases/housing/
Auto_ViML: Wisconsin Breast Cancer
Slide 26
DATA SET SIZE
512 x 32
TIME TAKEN
12 Secs
The Wisconsin Breast Cancer* data set is a classic
Data Set: Auto_ViML took 12 Seconds to find the
best features and best model with Weighted F1
score of 100% on validation set using Linear model
Wisconsin Breast Cancer Data Set
FEATURE REDUCTION
52%
Macro Average ROC AUC
100%
Results:
Compare the results
to another model
using Deep Learning
and Keras
Link
“Hyperparameter
Optimization with
Keras” by Mikko
* Thanks to UCI Machine Learning Repository
https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic)
● What’s Missing / Could be Improved:
○ No Feature Engineering: You can create your own or use kits like featuretools, etc.
○ No Image/Video/NLP Support: At the moment, it removes these features from model considerations
○ No Time Series modeling: Auto_TimeSeries is in the works. Stay Tuned.
○ No Neural Networks or Deep Learning: You can add your own modules or use tools like Ludwig
○ Model serving: Adding a module for test data transformation necessary
Slide 27
Next Steps for AutoViz and Auto_ViML...
● What’s Missing / Could be Improved:
○ Build it into Existing Tools such that structured data can be Visualized Fast!
○ Build it into Educational tools to make it easy for Students and Colleges (where small, structured
datasets are the Norm) to help Visualize data (as writing code is still very hard for Students)
○ Add additional Visualizations such as Pie Charts, Mosaic Charts, etc.
○ Build it into Industrial Instruments such as IoT tools so that large data sets can be visualized
Auto_ViML
AutoViz
AutoViz + Auto_ViML = POWERFUL INSIGHTS
Slide 28
LOAD DATA SET
RUN AUTOVIZ TO VISUALIZE
ENGINEER FEATURES
RUN AUTO_ViML AGAIN
ADD / REMOVE FEATURES
RUN AUTO_ViML
BEST MODEL and FEATURES
SELECTED
BUILD PIPELINE
SERVE MODEL
Not in Scope
THANK YOU
Slide 29
Ram Seshadri rsesha2001@yahoo.com

More Related Content

What's hot

Model Drift Monitoring using Tensorflow Model Analysis
Model Drift Monitoring using Tensorflow Model AnalysisModel Drift Monitoring using Tensorflow Model Analysis
Model Drift Monitoring using Tensorflow Model Analysis
Vivek Raja P S
 
GDG PDX - An Intro to Google Cloud AutoML Vision
GDG PDX - An Intro to Google Cloud AutoML VisionGDG PDX - An Intro to Google Cloud AutoML Vision
GDG PDX - An Intro to Google Cloud AutoML Vision
jerryhargrove
 
Leverage the power of machine learning on windows
Leverage the power of machine learning on windowsLeverage the power of machine learning on windows
Leverage the power of machine learning on windows
Mia Chang
 
Build, train, and deploy ML models at scale.pdf
Build, train, and deploy ML models at scale.pdfBuild, train, and deploy ML models at scale.pdf
Build, train, and deploy ML models at scale.pdf
Amazon Web Services
 
Build, Train & Deploy Machine Learning Models at Scale
Build, Train & Deploy Machine Learning Models at ScaleBuild, Train & Deploy Machine Learning Models at Scale
Build, Train & Deploy Machine Learning Models at Scale
Amazon Web Services
 
10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems
Xavier Amatriain
 
Machine Learning In Production
Machine Learning In ProductionMachine Learning In Production
Machine Learning In Production
Samir Bessalah
 
An Introduction to Generative Adversarial Networks (April 2020)
An Introduction to Generative Adversarial Networks (April 2020)An Introduction to Generative Adversarial Networks (April 2020)
An Introduction to Generative Adversarial Networks (April 2020)
Julien SIMON
 
Automatic machine learning (AutoML) 101
Automatic machine learning (AutoML) 101Automatic machine learning (AutoML) 101
Automatic machine learning (AutoML) 101
QuantUniversity
 
Managing and Versioning Machine Learning Models in Python
Managing and Versioning Machine Learning Models in PythonManaging and Versioning Machine Learning Models in Python
Managing and Versioning Machine Learning Models in Python
Simon Frid
 
On-device machine learning: TensorFlow on Android
On-device machine learning: TensorFlow on AndroidOn-device machine learning: TensorFlow on Android
On-device machine learning: TensorFlow on Android
Yufeng Guo
 
Kyrylo Perevozchykov "Continuous delivery for Machine Learning, the future of...
Kyrylo Perevozchykov "Continuous delivery for Machine Learning, the future of...Kyrylo Perevozchykov "Continuous delivery for Machine Learning, the future of...
Kyrylo Perevozchykov "Continuous delivery for Machine Learning, the future of...
Fwdays
 
Seamless MLOps with Seldon and MLflow
Seamless MLOps with Seldon and MLflowSeamless MLOps with Seldon and MLflow
Seamless MLOps with Seldon and MLflow
Databricks
 
DataSciencePT #27 - Fifty Shades of Automated Machine Learning
DataSciencePT #27 - Fifty Shades of Automated Machine LearningDataSciencePT #27 - Fifty Shades of Automated Machine Learning
DataSciencePT #27 - Fifty Shades of Automated Machine Learning
Rui Quintino
 
TFX: A tensor flow-based production-scale machine learning platform
TFX: A tensor flow-based production-scale machine learning platformTFX: A tensor flow-based production-scale machine learning platform
TFX: A tensor flow-based production-scale machine learning platform
Shunya Ueta
 
Machine Learning Infrastructure
Machine Learning InfrastructureMachine Learning Infrastructure
Machine Learning Infrastructure
SigOpt
 
Serverless Deep Learning
Serverless Deep LearningServerless Deep Learning
Serverless Deep Learning
Alexey Grigorev
 
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
MLconf
 
Deep Learning in Production: Serving Image Models at Scale
Deep Learning in Production: Serving Image Models at ScaleDeep Learning in Production: Serving Image Models at Scale
Deep Learning in Production: Serving Image Models at Scale
Alexey Grigorev
 
Tensorflow Ecosystem
Tensorflow EcosystemTensorflow Ecosystem
Tensorflow Ecosystem
Vivek Raja P S
 

What's hot (20)

Model Drift Monitoring using Tensorflow Model Analysis
Model Drift Monitoring using Tensorflow Model AnalysisModel Drift Monitoring using Tensorflow Model Analysis
Model Drift Monitoring using Tensorflow Model Analysis
 
GDG PDX - An Intro to Google Cloud AutoML Vision
GDG PDX - An Intro to Google Cloud AutoML VisionGDG PDX - An Intro to Google Cloud AutoML Vision
GDG PDX - An Intro to Google Cloud AutoML Vision
 
Leverage the power of machine learning on windows
Leverage the power of machine learning on windowsLeverage the power of machine learning on windows
Leverage the power of machine learning on windows
 
Build, train, and deploy ML models at scale.pdf
Build, train, and deploy ML models at scale.pdfBuild, train, and deploy ML models at scale.pdf
Build, train, and deploy ML models at scale.pdf
 
Build, Train & Deploy Machine Learning Models at Scale
Build, Train & Deploy Machine Learning Models at ScaleBuild, Train & Deploy Machine Learning Models at Scale
Build, Train & Deploy Machine Learning Models at Scale
 
10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems
 
Machine Learning In Production
Machine Learning In ProductionMachine Learning In Production
Machine Learning In Production
 
An Introduction to Generative Adversarial Networks (April 2020)
An Introduction to Generative Adversarial Networks (April 2020)An Introduction to Generative Adversarial Networks (April 2020)
An Introduction to Generative Adversarial Networks (April 2020)
 
Automatic machine learning (AutoML) 101
Automatic machine learning (AutoML) 101Automatic machine learning (AutoML) 101
Automatic machine learning (AutoML) 101
 
Managing and Versioning Machine Learning Models in Python
Managing and Versioning Machine Learning Models in PythonManaging and Versioning Machine Learning Models in Python
Managing and Versioning Machine Learning Models in Python
 
On-device machine learning: TensorFlow on Android
On-device machine learning: TensorFlow on AndroidOn-device machine learning: TensorFlow on Android
On-device machine learning: TensorFlow on Android
 
Kyrylo Perevozchykov "Continuous delivery for Machine Learning, the future of...
Kyrylo Perevozchykov "Continuous delivery for Machine Learning, the future of...Kyrylo Perevozchykov "Continuous delivery for Machine Learning, the future of...
Kyrylo Perevozchykov "Continuous delivery for Machine Learning, the future of...
 
Seamless MLOps with Seldon and MLflow
Seamless MLOps with Seldon and MLflowSeamless MLOps with Seldon and MLflow
Seamless MLOps with Seldon and MLflow
 
DataSciencePT #27 - Fifty Shades of Automated Machine Learning
DataSciencePT #27 - Fifty Shades of Automated Machine LearningDataSciencePT #27 - Fifty Shades of Automated Machine Learning
DataSciencePT #27 - Fifty Shades of Automated Machine Learning
 
TFX: A tensor flow-based production-scale machine learning platform
TFX: A tensor flow-based production-scale machine learning platformTFX: A tensor flow-based production-scale machine learning platform
TFX: A tensor flow-based production-scale machine learning platform
 
Machine Learning Infrastructure
Machine Learning InfrastructureMachine Learning Infrastructure
Machine Learning Infrastructure
 
Serverless Deep Learning
Serverless Deep LearningServerless Deep Learning
Serverless Deep Learning
 
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
 
Deep Learning in Production: Serving Image Models at Scale
Deep Learning in Production: Serving Image Models at ScaleDeep Learning in Production: Serving Image Models at Scale
Deep Learning in Production: Serving Image Models at Scale
 
Tensorflow Ecosystem
Tensorflow EcosystemTensorflow Ecosystem
Tensorflow Ecosystem
 

Similar to Weekly #105: AutoViz and Auto_ViML Visualization and Machine Learning

Auto visualization and viml
Auto visualization and vimlAuto visualization and viml
Auto visualization and viml
Bill Liu
 
IBM Cognos 10 Framework Manager Metadata Modeling: Tips and Tricks
IBM Cognos 10 Framework Manager Metadata Modeling: Tips and TricksIBM Cognos 10 Framework Manager Metadata Modeling: Tips and Tricks
IBM Cognos 10 Framework Manager Metadata Modeling: Tips and Tricks
Senturus
 
A case study in using ibm watson studio machine learning services ibm devel...
A case study in using ibm watson studio machine learning services   ibm devel...A case study in using ibm watson studio machine learning services   ibm devel...
A case study in using ibm watson studio machine learning services ibm devel...
Einar Karlsen
 
201909 Automated ML for Developers
201909 Automated ML for Developers201909 Automated ML for Developers
201909 Automated ML for Developers
Mark Tabladillo
 
Strata CA 2019: From Jupyter to Production Manu Mukerji
Strata CA 2019: From Jupyter to Production Manu MukerjiStrata CA 2019: From Jupyter to Production Manu Mukerji
Strata CA 2019: From Jupyter to Production Manu Mukerji
Manu Mukerji
 
2021 02 23 MVP Fusion Getting Started with Machine Learning.Net and AutoML
2021 02 23 MVP Fusion Getting Started with Machine Learning.Net and AutoML2021 02 23 MVP Fusion Getting Started with Machine Learning.Net and AutoML
2021 02 23 MVP Fusion Getting Started with Machine Learning.Net and AutoML
Bruno Capuano
 
2021 06 19 ms student ambassadors nigeria ml net 01 slide-share
2021 06 19 ms student ambassadors nigeria ml net 01   slide-share2021 06 19 ms student ambassadors nigeria ml net 01   slide-share
2021 06 19 ms student ambassadors nigeria ml net 01 slide-share
Bruno Capuano
 
Serverless projects at Myplanet
Serverless projects at MyplanetServerless projects at Myplanet
Serverless projects at Myplanet
Daniel Zivkovic
 
InTTrust -IBM Artificial Intelligence Event
InTTrust -IBM Artificial Intelligence  EventInTTrust -IBM Artificial Intelligence  Event
InTTrust -IBM Artificial Intelligence Event
Michail Pagiatakis
 
Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...
Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...
Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...
Amazon Web Services Korea
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
VictorSzoltysek
 
Introduction to ML.NET
Introduction to ML.NETIntroduction to ML.NET
Introduction to ML.NET
Gianni Rosa Gallina
 
2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx
gdgsurrey
 
Machine learning at scale - Webinar By zekeLabs
Machine learning at scale - Webinar By zekeLabsMachine learning at scale - Webinar By zekeLabs
Machine learning at scale - Webinar By zekeLabs
zekeLabs Technologies
 
Building Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine LearningBuilding Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine Learning
David Walker, CSM,CSD,MCP,MCAD,MCSD,MVP
 
vodQA Pune (2019) - Testing AI,ML applications
vodQA Pune (2019) - Testing AI,ML applicationsvodQA Pune (2019) - Testing AI,ML applications
vodQA Pune (2019) - Testing AI,ML applications
vodQA
 
Experiment Management for the Enterprise
Experiment Management for the EnterpriseExperiment Management for the Enterprise
Experiment Management for the Enterprise
SigOpt
 
Pitfalls of machine learning in production
Pitfalls of machine learning in productionPitfalls of machine learning in production
Pitfalls of machine learning in production
Antoine Sauray
 
Visualizing Model Selection with Scikit-Yellowbrick: An Introduction to Devel...
Visualizing Model Selection with Scikit-Yellowbrick: An Introduction to Devel...Visualizing Model Selection with Scikit-Yellowbrick: An Introduction to Devel...
Visualizing Model Selection with Scikit-Yellowbrick: An Introduction to Devel...
Benjamin Bengfort
 
Making Netflix Machine Learning Algorithms Reliable
Making Netflix Machine Learning Algorithms ReliableMaking Netflix Machine Learning Algorithms Reliable
Making Netflix Machine Learning Algorithms Reliable
Justin Basilico
 

Similar to Weekly #105: AutoViz and Auto_ViML Visualization and Machine Learning (20)

Auto visualization and viml
Auto visualization and vimlAuto visualization and viml
Auto visualization and viml
 
IBM Cognos 10 Framework Manager Metadata Modeling: Tips and Tricks
IBM Cognos 10 Framework Manager Metadata Modeling: Tips and TricksIBM Cognos 10 Framework Manager Metadata Modeling: Tips and Tricks
IBM Cognos 10 Framework Manager Metadata Modeling: Tips and Tricks
 
A case study in using ibm watson studio machine learning services ibm devel...
A case study in using ibm watson studio machine learning services   ibm devel...A case study in using ibm watson studio machine learning services   ibm devel...
A case study in using ibm watson studio machine learning services ibm devel...
 
201909 Automated ML for Developers
201909 Automated ML for Developers201909 Automated ML for Developers
201909 Automated ML for Developers
 
Strata CA 2019: From Jupyter to Production Manu Mukerji
Strata CA 2019: From Jupyter to Production Manu MukerjiStrata CA 2019: From Jupyter to Production Manu Mukerji
Strata CA 2019: From Jupyter to Production Manu Mukerji
 
2021 02 23 MVP Fusion Getting Started with Machine Learning.Net and AutoML
2021 02 23 MVP Fusion Getting Started with Machine Learning.Net and AutoML2021 02 23 MVP Fusion Getting Started with Machine Learning.Net and AutoML
2021 02 23 MVP Fusion Getting Started with Machine Learning.Net and AutoML
 
2021 06 19 ms student ambassadors nigeria ml net 01 slide-share
2021 06 19 ms student ambassadors nigeria ml net 01   slide-share2021 06 19 ms student ambassadors nigeria ml net 01   slide-share
2021 06 19 ms student ambassadors nigeria ml net 01 slide-share
 
Serverless projects at Myplanet
Serverless projects at MyplanetServerless projects at Myplanet
Serverless projects at Myplanet
 
InTTrust -IBM Artificial Intelligence Event
InTTrust -IBM Artificial Intelligence  EventInTTrust -IBM Artificial Intelligence  Event
InTTrust -IBM Artificial Intelligence Event
 
Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...
Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...
Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Introduction to ML.NET
Introduction to ML.NETIntroduction to ML.NET
Introduction to ML.NET
 
2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx
 
Machine learning at scale - Webinar By zekeLabs
Machine learning at scale - Webinar By zekeLabsMachine learning at scale - Webinar By zekeLabs
Machine learning at scale - Webinar By zekeLabs
 
Building Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine LearningBuilding Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine Learning
 
vodQA Pune (2019) - Testing AI,ML applications
vodQA Pune (2019) - Testing AI,ML applicationsvodQA Pune (2019) - Testing AI,ML applications
vodQA Pune (2019) - Testing AI,ML applications
 
Experiment Management for the Enterprise
Experiment Management for the EnterpriseExperiment Management for the Enterprise
Experiment Management for the Enterprise
 
Pitfalls of machine learning in production
Pitfalls of machine learning in productionPitfalls of machine learning in production
Pitfalls of machine learning in production
 
Visualizing Model Selection with Scikit-Yellowbrick: An Introduction to Devel...
Visualizing Model Selection with Scikit-Yellowbrick: An Introduction to Devel...Visualizing Model Selection with Scikit-Yellowbrick: An Introduction to Devel...
Visualizing Model Selection with Scikit-Yellowbrick: An Introduction to Devel...
 
Making Netflix Machine Learning Algorithms Reliable
Making Netflix Machine Learning Algorithms ReliableMaking Netflix Machine Learning Algorithms Reliable
Making Netflix Machine Learning Algorithms Reliable
 

More from Bill Liu

Walk Through a Real World ML Production Project
Walk Through a Real World ML Production ProjectWalk Through a Real World ML Production Project
Walk Through a Real World ML Production Project
Bill Liu
 
Redefining MLOps with Model Deployment, Management and Observability in Produ...
Redefining MLOps with Model Deployment, Management and Observability in Produ...Redefining MLOps with Model Deployment, Management and Observability in Produ...
Redefining MLOps with Model Deployment, Management and Observability in Produ...
Bill Liu
 
Transformers in Vision: From Zero to Hero
Transformers in Vision: From Zero to HeroTransformers in Vision: From Zero to Hero
Transformers in Vision: From Zero to Hero
Bill Liu
 
Metaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at NetflixMetaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at Netflix
Bill Liu
 
Practical Crowdsourcing for ML at Scale
Practical Crowdsourcing for ML at ScalePractical Crowdsourcing for ML at Scale
Practical Crowdsourcing for ML at Scale
Bill Liu
 
Building large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudiBuilding large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudi
Bill Liu
 
Deep Reinforcement Learning and Its Applications
Deep Reinforcement Learning and Its ApplicationsDeep Reinforcement Learning and Its Applications
Deep Reinforcement Learning and Its Applications
Bill Liu
 
Big Data and AI in Fighting Against COVID-19
Big Data and AI in Fighting Against COVID-19Big Data and AI in Fighting Against COVID-19
Big Data and AI in Fighting Against COVID-19
Bill Liu
 
Build computer vision models to perform object detection and classification w...
Build computer vision models to perform object detection and classification w...Build computer vision models to perform object detection and classification w...
Build computer vision models to perform object detection and classification w...
Bill Liu
 
Causal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningCausal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine Learning
Bill Liu
 
AISF19 - On Blending Machine Learning with Microeconomics
AISF19 - On Blending Machine Learning with MicroeconomicsAISF19 - On Blending Machine Learning with Microeconomics
AISF19 - On Blending Machine Learning with Microeconomics
Bill Liu
 
AISF19 - Travel in the AI-First World
AISF19 - Travel in the AI-First WorldAISF19 - Travel in the AI-First World
AISF19 - Travel in the AI-First World
Bill Liu
 
Toronto meetup 20190917
Toronto meetup 20190917Toronto meetup 20190917
Toronto meetup 20190917
Bill Liu
 
Feature Engineering for NLP
Feature Engineering for NLPFeature Engineering for NLP
Feature Engineering for NLP
Bill Liu
 
Explainability and bias in AI
Explainability and bias in AIExplainability and bias in AI
Explainability and bias in AI
Bill Liu
 
An Introduction to Neural Architecture Search
An Introduction to Neural Architecture SearchAn Introduction to Neural Architecture Search
An Introduction to Neural Architecture Search
Bill Liu
 
weekly AI tech talk #85 ml-agents Enabling Learned Behaviors with Reinforceme...
weekly AI tech talk #85 ml-agents Enabling Learned Behaviors with Reinforceme...weekly AI tech talk #85 ml-agents Enabling Learned Behaviors with Reinforceme...
weekly AI tech talk #85 ml-agents Enabling Learned Behaviors with Reinforceme...
Bill Liu
 
Data stream with cruise control
Data stream with cruise controlData stream with cruise control
Data stream with cruise control
Bill Liu
 
AI in linkedin
AI in linkedinAI in linkedin
AI in linkedin
Bill Liu
 
Deep natural language processing in search systems
Deep natural language processing in search systemsDeep natural language processing in search systems
Deep natural language processing in search systems
Bill Liu
 

More from Bill Liu (20)

Walk Through a Real World ML Production Project
Walk Through a Real World ML Production ProjectWalk Through a Real World ML Production Project
Walk Through a Real World ML Production Project
 
Redefining MLOps with Model Deployment, Management and Observability in Produ...
Redefining MLOps with Model Deployment, Management and Observability in Produ...Redefining MLOps with Model Deployment, Management and Observability in Produ...
Redefining MLOps with Model Deployment, Management and Observability in Produ...
 
Transformers in Vision: From Zero to Hero
Transformers in Vision: From Zero to HeroTransformers in Vision: From Zero to Hero
Transformers in Vision: From Zero to Hero
 
Metaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at NetflixMetaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at Netflix
 
Practical Crowdsourcing for ML at Scale
Practical Crowdsourcing for ML at ScalePractical Crowdsourcing for ML at Scale
Practical Crowdsourcing for ML at Scale
 
Building large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudiBuilding large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudi
 
Deep Reinforcement Learning and Its Applications
Deep Reinforcement Learning and Its ApplicationsDeep Reinforcement Learning and Its Applications
Deep Reinforcement Learning and Its Applications
 
Big Data and AI in Fighting Against COVID-19
Big Data and AI in Fighting Against COVID-19Big Data and AI in Fighting Against COVID-19
Big Data and AI in Fighting Against COVID-19
 
Build computer vision models to perform object detection and classification w...
Build computer vision models to perform object detection and classification w...Build computer vision models to perform object detection and classification w...
Build computer vision models to perform object detection and classification w...
 
Causal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningCausal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine Learning
 
AISF19 - On Blending Machine Learning with Microeconomics
AISF19 - On Blending Machine Learning with MicroeconomicsAISF19 - On Blending Machine Learning with Microeconomics
AISF19 - On Blending Machine Learning with Microeconomics
 
AISF19 - Travel in the AI-First World
AISF19 - Travel in the AI-First WorldAISF19 - Travel in the AI-First World
AISF19 - Travel in the AI-First World
 
Toronto meetup 20190917
Toronto meetup 20190917Toronto meetup 20190917
Toronto meetup 20190917
 
Feature Engineering for NLP
Feature Engineering for NLPFeature Engineering for NLP
Feature Engineering for NLP
 
Explainability and bias in AI
Explainability and bias in AIExplainability and bias in AI
Explainability and bias in AI
 
An Introduction to Neural Architecture Search
An Introduction to Neural Architecture SearchAn Introduction to Neural Architecture Search
An Introduction to Neural Architecture Search
 
weekly AI tech talk #85 ml-agents Enabling Learned Behaviors with Reinforceme...
weekly AI tech talk #85 ml-agents Enabling Learned Behaviors with Reinforceme...weekly AI tech talk #85 ml-agents Enabling Learned Behaviors with Reinforceme...
weekly AI tech talk #85 ml-agents Enabling Learned Behaviors with Reinforceme...
 
Data stream with cruise control
Data stream with cruise controlData stream with cruise control
Data stream with cruise control
 
AI in linkedin
AI in linkedinAI in linkedin
AI in linkedin
 
Deep natural language processing in search systems
Deep natural language processing in search systemsDeep natural language processing in search systems
Deep natural language processing in search systems
 

Recently uploaded

Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 

Recently uploaded (20)

Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 

Weekly #105: AutoViz and Auto_ViML Visualization and Machine Learning

  • 1. Machine Learning Ram Seshadri March 2020 Slide 1 AutoViz and Auto_ViML Faster Time to Insights using Automated Visualization and Machine Learning
  • 2. “Machine learning teams are still struggling to take advantage of ML due to challenges with inflexible frameworks, lack of reproducibility, collaboration issues, and immature software tools” Cecelia Shao Comet.ml “Why is my Data Science team taking sooo long to complete a simple project?” -- A Frustrated CIO Slide 2 Machine learning teams are still struggling to take advantage of ML due to challenges with inflexible frameworks, lack of reproducibility, collaboration issues, and immature software tools. The Answer?
  • 3. 7/24/2019 Faster Visualization Automatic Feature Selection • Auto_ViML Automatic Model Selection and Tuning • Auto_ViML One Click Model Serving and Production Auto_ViML was designed along with AutoViz to Build Variant Interpretable Machine Learning Models Fast! __ ● They are proprietary and expensive (lock-in) ● Black Boxes which are too complex to interpret ● Very little reproducibility outside of tool HOWEVER CURRENT TOOLS ARE LIMITED BECAUSE... • AutoViz INTRODUCING A SIMPLER APPROACH TO AUTO-ML Slide 3 How can we make DATA SCIENTISTS more productive?
  • 4. ●Open Source Tools for Faster Time to Insights with Design Goals as: ○ Simple: Invoke them with a single Line of Code (each) ○ Flexible: Suited to any kind of structured data set with no Prep required ○ Incremental: Can be used by anyone from beginners to experts alike ○ Experimental: Compare multiple visualization methods and models step by step ○ Interpretable: get clear explanation of steps taken with validation graphs ○ Reproducible: No Black Box. Reproducible model pipelines and outputs ○ Extensible: Open Source with contributions from Python and DS community I Built AutoViz and Auto_ViML to make my own life easier. Hope it will do the same for you. Slide 4 What is Auto_Viz and Auto_ViML?
  • 5. AutoViz and Auto_ViML do not completely eliminate the need for data scientists. But they speed up some steps in the ML workflow which makes data scientists more productive! Slide 5 Where do they fit in ML Workflow? AutoViz Auto_ViML
  • 6. What is AutoViz? Slide 6 AutoViz enables you to automatically visualize any data set with a Single Line of Code. It automatically: 1. Selects a Random Sample from the Data Set (if the Data Set is very large) 2. Selects most important features using ML (if Number of Variables is very large) 3. Selects Best Methods to Visualize Data for a given problem 4. Provides Charts to be saved in PNG, JPG, and SVG Formats OVERVIEW
  • 7. Why AutoViz? Slide 7 Help explain your hypotheses and variable selection better to others BENEFITS Systematic Look for insights systematically rather than through “gut instinct” or domain knowledge Simple Reduce features to the most important ones to deliver simple yet powerful insights Explainable
  • 8. How AutoViz Works Slide 8 Variable Classification Problem Identification Complex Interactions AutoViz classifies features into highly granular data types to determine how best to represent them in Charts AutoViz can visualize any dataset for a given target: Regression, Classification, Time Series, Clustering and more Most charts involve more than one variable helping to deliver powerful insights with minimal effort Select the Most Important Features Select the Best Charts Deliver them Fast! AutoViz uses the powerful ML algorithm, XGBoost, to select important features given the target variable AutoViz selects the best ways to visualize your data to extract insights from your data AutoViz selects statistically valid sample data to visualize (in case data set is very large) Design Goals Implementation AutoViz PROCESS
  • 9. https://github.com/AutoViML/AutoViz AutoViz from autoviz.AutoViz_Class import AutoViz_Class AV = AutoViz_Class() 2. Next Import... 3. Run AutoViz. dft = AV.AutoViz('', sep, target, df) See Results... Slide 9 Thanks to UCI Machine Learning Repository for all data sets in this presentation: Dua, D. and Graff, C. (2019). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science. 1. First, Install... pip install autoviz Installing and Running AutoViz is as easy as 1, 2, 3! Source Code:
  • 10. pip install autoviz+ AutoViz Downloads Slide 10 AutoViz has now been downloaded more than 17K times+ AutoViz Downloads * As of August 23, 2019 Chart Source Courtesy: https://packaging.python.org/guides/analyzing-pypi-package-downloads/ + Stats Source Courtesy: PePy org
  • 11. AutoViz: Housing* ● Number of Rooms and Median Value of Homes seem to be highly correlated ● As Age of Building increases, Median Value decreases albeit slowly INSIGHTS ● NOX and DIS seem to be highly correlated though they seem to have a polynomial or non-linear relationship Slide 11* Thanks to UCI Machine Learning Repository https://archive.ics.uci.edu/ml/machine-learning-databases/housing/
  • 12. AutoViz: Housing* ● Both CRIM and ZN are highly skewed ● Both may require a transformation INSIGHTS ● PTRATIO and DIS seem to be somewhat skewed as well but don’t require transformations Slide 12* Thanks to UCI Machine Learning Repository https://archive.ics.uci.edu/ml/machine-learning-databases/housing/
  • 13. AutoViz: Housing* RM, LSTAT, TAX, INDUS, AGE, and CRIM seem to be decently correlated with Target. May be worth exploring if they come up as Important Features. INSIGHTS ● Average Median Value of homes varies widely by CHAS and RAD. Hence would be important features in any model. Slide 13* Thanks to UCI Machine Learning Repository https://archive.ics.uci.edu/ml/machine-learning-databases/housing/
  • 14. Build your first baseline model using: 1. Features selected by AutoViz 2. Iterate through models and visualization until satisfied with result How AutoViz enables better Modeling Slide 14 AutoViz Look at Charts and Graphs generated by AutoViz: 1. Sometimes, AV can generate over 1000 charts! 2. Save Charts you like best by right click and save 3. You also have the option of getting charts in SVG or PNG or JPEG formats Call AutoViz from Python Jupyter Notebooks: 1. Give directory path+filename, separator in data, target variable name (can be empty string) 2. If DataFrame available instead of filename, give name of DataFrame in above 3. Run AutoViz Select Variables: 1. That appear most promising to Modeling goal 2. Rerun AutoViz by removing variables or adding new variables that provide more insights 1. Call AutoViz with your Filename and target 2. Look at Charts to derive Insights into Data3. Add/Remove Variables and re-run AutoViz 4. Build your baseline model with AutoViz features
  • 15. How to Build a better model? Slide 15 Remove Low Information and Redundant Features Add Polynomial and Interaction, Other Features Select Models from Simple to Complex and Perform Tuning Add Entropy Binning, Stacking to K-Means Featurizers to model Add Imbalanced sampling and training Perform Ensembling of Multiple Types of models BUILD A ViML Model! (VARIANT INTERPRETABLE MACHINE LEARNING MODEL, Step by Step) PROCESS
  • 16. What is Auto_ViML? Slide 16 INSIGHTFUL INTERPRETA BLE VARIANT REPRODUCI BLE ITERATIVE ViML helps you try as many as 15 different models with one API ViML reduces features to the bare minimum (as much 10-90% reduction in features) ViML is fully reproducible by explaining its steps (full transparency) Delivers insights into which models and techniques will work best with your data ViML helps you build you more complex models after trying out simpler options VARIANT INTERPRETABLE MACHINE LEARNING MODELS with one API
  • 17. Why Auto_ViML? Slide 17 MULTIPLE MODELS TRANSPARENCY FEATURE ENGINEERING AUTOMATIC FEATURE SELECTION SYSTEMATIC Auto ViML was designed from the ground-up to mimic how a Data Scientist would approach a Modeling Problem. Enables selective model complexity by adding features and complexity step by step Provides Deep Insights into the Data Set with Full Transparency Models with Fewer Features result in Simpler Models. Auto_ViML Produces models with 10-99% Fewer Features than Regular Models without Significant Loss of Predictive Power* * Based on my experience. Your results may vary. Build and test multiple models thru’ Hyper Tuning and Cross Validation BENEFITS
  • 18. Now with CatBoost!! Auto_ViML LETS YOU TRY MULTIPLE APPROACHES Slide 18 You can access all the powerful features of with one line of Python Code after you import. You can turn on and turn off features and flags to see how they impact Model. TRY MULTIPLE APPROACHES TO GET THE BEST MODEL INTERACTIONS vs. NO INTERACTIONS BOOSTING vs. BAGGING ENSEMBLING vs. STACKING IMBALANCED vs. BALANCED GRIDSEARCH vs. RANDOM FEATURE IMPORTANCES Keep Upgrading Auto_ViML version since it is updated monthly! Predictions from 12 models Downsampling supported HyperOpt coming SHAP included 0 = No Intxns 1 = Pairwise Intxns 2 = Squared Vars
  • 19. model, features, trainm, testm = Auto_ViML(train, target, test, sample_submission='', hyper_param='GS', scoring_parameter='rmse', feature_reduction=True, Boosting_Flag=None, KMeans_Featurizer=False, Add_Poly=0, Stacking_Flag=False, Binning_Flag=False, Imbalanced_Flag=True, verbose=0) Github: https://github.com/AutoViML/Auto_ViML Auto_ViML from autoviml.Auto_ViML import Auto_ViML 2. Next, Import... 3. Run Auto_VIML. Slide 19 Get a fully trained Model, best Features and transformed Train and Test data... Thanks to UCI Machine Learning Repository for all data sets in this presentation: Dua, D. and Graff, C. (2019). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science. Installing and Running Auto_ViML is as easy as 1, 2, 3! pip install autoviml 1. First Install...
  • 20. pip install autoviml Auto_ViML Downloads Slide 20 Chart Source Courtesy: https://packaging.python.org/guides/analyzing-pypi-package-downloads/ Auto_ViML has now nearly 50K downloads+ Auto_ViML Downloads * As of March 20, 2020 + Stats Source Courtesy: PePy org
  • 21. Here is an example of a Regression data set: Boston Housing*. There are 13 predictors in the dataset. But Auto_ViML finds that only 10 variables are needed to get the job done. For example: ['RM', 'LSTAT', 'NOX', 'PTRATIO', 'CRIM', 'TAX', 'CHAS', 'B', 'RAD', 'ZN'] Auto_ViML: Boston Housing* Slide 21 DATA SET SIZE 506 x 14 TIME TAKEN 6 secs Variables Selected 10 FEATURE REDUCTION 24% Results: Start with Linear Model * Thanks to UCI Machine Learning Repository https://archive.ics.uci.edu/ml/machine-learning-databases/housing/
  • 22. Auto_ViML: Boston Housing Slide 22 Results: Move to Random Forests Time Taken = 30 seconds Thanks to UCI Machine Learning Repository for all data sets in this presentation: Dua, D. and Graff, C. (2019). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.
  • 23. Auto_ViML: Boston Housing Slide 23 Results: Close with XGBoost * Thanks to UCI Machine Learning Repository https://archive.ics.uci.edu/ml/machine-learning-databases/housing/
  • 25. Auto_ViML: Boston Housing Slide 25 Linear Model with Interaction Variables Ensemble Model with Binning Forests Model with Binning Numerics XGBoost Model with Stacking Multiple Models * Thanks to UCI Machine Learning Repository https://archive.ics.uci.edu/ml/machine-learning-databases/housing/
  • 26. Auto_ViML: Wisconsin Breast Cancer Slide 26 DATA SET SIZE 512 x 32 TIME TAKEN 12 Secs The Wisconsin Breast Cancer* data set is a classic Data Set: Auto_ViML took 12 Seconds to find the best features and best model with Weighted F1 score of 100% on validation set using Linear model Wisconsin Breast Cancer Data Set FEATURE REDUCTION 52% Macro Average ROC AUC 100% Results: Compare the results to another model using Deep Learning and Keras Link “Hyperparameter Optimization with Keras” by Mikko * Thanks to UCI Machine Learning Repository https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic)
  • 27. ● What’s Missing / Could be Improved: ○ No Feature Engineering: You can create your own or use kits like featuretools, etc. ○ No Image/Video/NLP Support: At the moment, it removes these features from model considerations ○ No Time Series modeling: Auto_TimeSeries is in the works. Stay Tuned. ○ No Neural Networks or Deep Learning: You can add your own modules or use tools like Ludwig ○ Model serving: Adding a module for test data transformation necessary Slide 27 Next Steps for AutoViz and Auto_ViML... ● What’s Missing / Could be Improved: ○ Build it into Existing Tools such that structured data can be Visualized Fast! ○ Build it into Educational tools to make it easy for Students and Colleges (where small, structured datasets are the Norm) to help Visualize data (as writing code is still very hard for Students) ○ Add additional Visualizations such as Pie Charts, Mosaic Charts, etc. ○ Build it into Industrial Instruments such as IoT tools so that large data sets can be visualized Auto_ViML AutoViz
  • 28. AutoViz + Auto_ViML = POWERFUL INSIGHTS Slide 28 LOAD DATA SET RUN AUTOVIZ TO VISUALIZE ENGINEER FEATURES RUN AUTO_ViML AGAIN ADD / REMOVE FEATURES RUN AUTO_ViML BEST MODEL and FEATURES SELECTED BUILD PIPELINE SERVE MODEL Not in Scope
  • 29. THANK YOU Slide 29 Ram Seshadri rsesha2001@yahoo.com