Introduction to Machine Learning
From DBAs to Data Scientists
ILOUG
Sandesh Rao
VP AIOps for the Autonomous Database
@sandeshr
https://www.linkedin.com/in/raosandesh/
https://www.slideshare.net/SandeshRao4
Safe harbor statement
The following is intended to outline our general product direction. It is intended for information
purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any
material, code, or functionality, and should not be relied upon in making purchasing decisions. The
development, release, timing, and pricing of any features or functionality described for Oracle’s
products may change and remains at the sole discretion of Oracle Corporation.
Statements in this presentation relating to Oracle’s future plans, expectations, beliefs, intentions and
prospects are “forward-looking statements” and are subject to material risks and uncertainties. A
detailed discussion of these factors and other risks that affect our business is contained in Oracle’s
Securities and Exchange Commission (SEC) filings, including our most recent reports on Form 10-K and
Form 10-Q under the heading “Risk Factors.” These filings are available on the SEC’s website or on
Oracle’s website at http://www.oracle.com/investor. All information in this presentation is current as of
September 2019 and Oracle undertakes no duty to update any statement in light of new information or
future events.
Agenda
1. Overview of ML and the Autonomous Database
2. Regression
3. Classification
4. Clustering
5. Anomaly detection
6. Workload prediction
7. Dynamic maintenance windows
8. Oracle Machine Learning examples
Overview of ML and
Autonomous Database
Traditionally DBAs are Responsible for:
Tasks Specific to Business and Innovation
• Architecture, planning, data modeling
• Data security and lifecycle management
• Application related tuning
• End-to-End service level management
Maintenance Tasks
• Configuration and tuning of systems, network, storage
• Database provisioning, patching
• Database backups, H/A, disaster recovery
• Database optimization
Value Scale
Innovation
Maintenance
Autonomous Database Removes Generic Tasks
Freedom from Drudgery for DBAs: More Time to Innovate and Improve the Business
Tasks Specific to Business and Innovation
• Architecture, planning, data modeling
• Data security and lifecycle management
• Application related tuning
• End-to-End service level management
Maintenance Tasks
• Configuration and tuning of systems, network, storage
• Database provisioning, patching
• Database backups, H/A, disaster recovery
• Database optimization
Value Scale
Innovation
Maintenance
Machine Learning
Solving data-driven
problems
Discovering insights
Making predictions
Data Security
Data classification,
Data life-cycle mgmt
Application Tuning
SQL tuning,
connection mgmt
The Evolution of the DBA/Database Developer Role
Data Engineer
Architecture,
“data wrangler”
Data extraction
Data wrangling
Deriving new attributes
(“feature engineering”)
…
…
…
Import predictions & insights
Translate and deploy ML models
Automate
You Are Probably Already Doing Most of This Work!
Database Developer to Data Scientist Journey
1 - https://www.infoworld.com/article/3228245/data-science/the-80-20-data-science-dilemma.html
Typically 80% of the work
Most data scientists spend only 20 percent of their time
on actual data analysis and 80 percent of their time
finding, cleaning, and reorganizing huge amounts of
data, which is an inefficient data strategy [1]
Eliminated or minimized with Oracle
The data management platform becomes a
combined/hybrid data management + machine learning platform
Albert Einstein
“If I had an hour to solve a
problem I'd spend 55
minutes thinking about the
problem and 5 minutes
thinking about solutions.”
Why Machine Learning for us and why now?
Lots of data needs to be crunched
• No time to manually sift through the data
Machine Learning has become accessible
• Anyone can be a Data Scientist
• Software and algorithms are available
• Frameworks allow for massive training with no coding
• CI/CD available for MLOps
Business use cases
• How to find value from the data
Analytics Value vs. Maturity
Reports &
Dashboards
Data
Information
Predictions & Insights Appls with ML
Analytical Maturity
Value of Analytics
Diagnostic
Analysis &
Reports
Predictive /
Machine
Learning
“ML Enabled”
Applications
What Happened?
Why it Happened?
What WILL happen?
Automated ML Appls
Database Developer to Data Scientist Journey
ML Project Workflow
Set the business objectives
Gather, compare, and clean data
Identify and extract features (important columns) from the imported data
This helps us identify the efficiency of the algorithm
Take the input data, also called the training data, and apply the algorithm to it
For the algorithm to function efficiently, it is important to pick the right values for the
hyperparameters (the algorithm's input parameters)
Once the training data and the algorithm are combined, we get a model
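The workflow above (training data plus an algorithm plus hyperparameters gives a model, which is then checked for how well it performs) can be sketched in a few lines. This is a minimal illustration, not the deck's method: the data values, the function names, and the choice of a one-feature ridge regression with `l2_penalty` as the hyperparameter are all assumptions made for the example.

```python
from statistics import mean

def fit_ridge_line(xs, ys, l2_penalty=0.0):
    """Fit y = slope * x + intercept; l2_penalty is the hyperparameter."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / (sxx + l2_penalty)   # a larger penalty shrinks the slope
    intercept = y_bar - slope * x_bar
    return slope, intercept

def mean_absolute_error(model, xs, ys):
    """Average absolute prediction error of the model on the given data."""
    slope, intercept = model
    return mean(abs(slope * x + intercept - y) for x, y in zip(xs, ys))

# Hypothetical feature and target values, for illustration only.
train_x, train_y = [1.0, 2.0, 3.0, 4.0], [3.0, 5.0, 7.0, 9.0]
test_x, test_y = [5.0, 6.0], [11.0, 13.0]

model = fit_ridge_line(train_x, train_y, l2_penalty=0.0)  # training data + algorithm -> model
test_error = mean_absolute_error(model, test_x, test_y)   # check the model on unseen data
```

Re-running `fit_ridge_line` with different `l2_penalty` values and comparing `test_error` is the simplest form of hyperparameter tuning.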
Types of Machine Learning
Supervised Learning
Predict future outcomes with the help of
training data provided by human experts
Semi-Supervised Learning
Discover patterns within raw data and make
predictions, which are then reviewed by
human experts, who provide feedback which
is used to improve the model accuracy
Unsupervised Learning
Find patterns without any external input other
than the raw data
Reinforcement Learning
Make decisions based on the rewards received for
similar actions in the past
Regression
REGRESSION
Predicting numbers
Customer lifetime value
Estimate optimal pricing
House price estimates
Classification
CLASSIFICATION
Membership of a
known class
Identify likely high
value customers
Find customer
likely to churn
Fraud detection
Clustering
CLUSTERING
Membership of an
inferred class
Customer segmentation
Credit risk evaluation
Document similarity
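Clustering assigns each record to an inferred group rather than a known class. As a rough sketch of the idea, the block below runs plain k-means (Lloyd's algorithm) on one numeric attribute. It is not the hierarchical k-means variant listed later in the deck, and the spend values and starting centers are made up for illustration.

```python
def k_means_1d(values, centers, iterations=10):
    """Plain k-means on a single numeric attribute."""
    centers = list(centers)
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for v in values:
            # Assign each value to its nearest center.
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Move each center to the mean of its members (keep it if the cluster is empty).
        centers = [sum(c) / len(c) if c else centers[i] for i, c in enumerate(clusters)]
    return centers, clusters

# Hypothetical annual spend per customer, for illustration only.
annual_spend = [100, 120, 110, 900, 950, 1000]
segment_centers, segments = k_means_1d(annual_spend, centers=[100, 1000])
```

Each resulting segment can then be treated as an inferred class, for example "low spend" and "high spend" customers.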
Anomaly Detection
ANOMALY
DETECTION
Outliers
Dentist billing
85 fillings / hour
Employees with high
claims / grade
One variable moving
out of sync
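An outlier such as "85 fillings / hour" stands out because it sits far from the typical value. One simple way to flag such values, shown here as an assumption-laden sketch rather than the technique used in the product, is the modified z-score based on the median absolute deviation. The billing figures and the 3.5 cutoff are illustrative choices.

```python
from statistics import median

def mad_outliers(values, cutoff=3.5):
    """Return values whose modified z-score exceeds the cutoff."""
    center = median(values)
    mad = median(abs(v - center) for v in values)
    if mad == 0:
        return []  # no spread to measure against
    return [v for v in values if abs(0.6745 * (v - center) / mad) > cutoff]

# Hypothetical fillings billed per hour by different dentists.
fillings_per_hour = [3, 4, 2, 5, 3, 4, 85]
suspicious = mad_outliers(fillings_per_hour)
```

The median-based score is used instead of mean and standard deviation because a single extreme value would otherwise inflate the very statistics used to detect it.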
ASSOCIATION
RULES
Finding like-
minded people
You might be
interested in...
Root cause analysis
Identify “harbingers
of failure”
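Association rules ("you might be interested in...") are usually ranked by support and confidence. The sketch below computes both for a single rule. The baskets and item names are hypothetical, and this is only the scoring step, not a full rule-mining algorithm.

```python
def rule_metrics(transactions, antecedent, consequent):
    """Support and confidence for the rule antecedent -> consequent."""
    antecedent, consequent = set(antecedent), set(consequent)
    with_antecedent = sum(1 for t in transactions if antecedent <= t)
    with_both = sum(1 for t in transactions if (antecedent | consequent) <= t)
    support = with_both / len(transactions)
    confidence = with_both / with_antecedent if with_antecedent else 0.0
    return support, confidence

# Hypothetical shopping baskets, for illustration only.
baskets = [
    {"coffee", "milk"},
    {"coffee", "milk", "sugar"},
    {"coffee"},
    {"tea", "milk"},
]
support, confidence = rule_metrics(baskets, {"coffee"}, {"milk"})
```

Here support is the share of all baskets containing both items, and confidence is the share of coffee baskets that also contain milk.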
TIME SERIES
Temporal Aspect
Hitting a threshold
Forecasting energy use
Seasonality of data
GRAPH
ANALYTICS
Supplement ML
algorithm
Customer churn
Network outage
Fraud detection
NEURAL
NETWORKS
Learn (More) Like A
Human
Classification
Regression
Deep learning
http://neuralnetworksanddeeplearning.com/chap1.html
Machine Learning Algorithms
Regression
• Multiple Regression, Support Vector Machine, Linear Model, LASSO, Random
Forest, Ridge Regression, Generalized Linear Model, Stepwise Linear Regression
Association & Collaborative Filtering
Reinforcement Learning - brute force,
Monte Carlo, temporal difference....
• Many different use cases
Neural network & deep Learning with
Deep Neural Network
Clustering
• Hierarchical k-means, Orthogonal Partitioning Clustering,
Expectation-Maximization
Feature Extraction/Attribute
Importance / Component Analysis
Classification
• Decision Tree, Naive Bayes, Random Forest, Logistic Regression,
Support Vector Machine
ML To determine workload
and deviation from it
What is Workload
Automatically
check
workload for
past x mins
Decide if
workload is
abnormally
high
Highlight any
abnormal
workload
issues
Optionally run on
demand
Optionally snooze
checking of a
component
Calculated via machine learning
Adaptive Learning
Workload Process
1. CAPTURE (every 5 minutes)
Captures metrics for key performance dimensions across 5 x 1-minute time windows
2. TRAIN (every week)
Using semi-supervised learning via SME threshold rules, the following models are retrained:
• Isolation Forest
• One-Class Support Vector Machine
• Local Outlier Factor
Model with highest confidence becomes the primary, if confidence is high enough
3. PREDICT (every 5 minutes)
Straight after capture, the primary model is used to predict anomalies.
Where anomalies are identified, metrics are compared to SME threshold rules to
identify the type of anomaly
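The selection of a primary model after the weekly retraining can be sketched as below. The function name, the model keys, and the confidence scores are hypothetical. The two cutoffs follow the deck's prediction slide, which says there is no primary model with less than 7 days of data or with model confidence of 95% or lower.

```python
def choose_primary_model(confidences, days_of_data, min_days=7, min_confidence=0.95):
    """Return the name of the primary model, or None to fall back to SME rules."""
    if days_of_data < min_days or not confidences:
        return None
    best = max(confidences, key=confidences.get)
    # Confidence must be strictly above the cutoff to qualify.
    return best if confidences[best] > min_confidence else None

# Hypothetical confidence scores from one weekly retraining run.
weekly_scores = {
    "isolation_forest": 0.97,
    "one_class_svm": 0.93,
    "local_outlier_factor": 0.96,
}
primary = choose_primary_model(weekly_scores, days_of_data=14)
```

Returning `None` keeps the fallback explicit, so the caller knows to use the SME threshold rules on their own.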
Prediction (Every 5 minutes)
5 X 1 min metrics captured for
each dimension & ASH report
captured for later analysis
Metrics evaluated by the primary model to
determine if there are anomalies
If there is no primary model
(i.e. <7 days of data or <=95% model confidence)
then SME rules are used for anomaly detection
Each anomaly is compared against
the SME rules to determine which
dimension it applies to
Any anomalies are raised
along with recently
captured ASH report
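The five-minute prediction flow above can be summarized in code. This is a sketch under stated assumptions: the metric names, the threshold values in `SME_RULES`, and the idea of passing the primary model as a callable are all hypothetical, chosen only to show the fallback logic.

```python
# Hypothetical SME threshold rules: performance dimension -> upper limit.
SME_RULES = {"cpu_pct": 90.0, "io_wait_ms": 50.0}

def breached_dimensions(metrics, rules):
    """Dimensions whose captured metric exceeds its SME threshold."""
    return sorted(name for name, limit in rules.items() if metrics.get(name, 0.0) > limit)

def detect_anomaly(metrics, rules, primary_model=None):
    """Use the primary model if there is one, otherwise the SME rules alone."""
    if primary_model is None:
        dimensions = breached_dimensions(metrics, rules)
        return {"anomaly": bool(dimensions), "dimensions": dimensions}
    if not primary_model(metrics):
        return {"anomaly": False, "dimensions": []}
    # The model found an anomaly; the rules say which dimension it applies to.
    return {"anomaly": True, "dimensions": breached_dimensions(metrics, rules)}

captured = {"cpu_pct": 97.0, "io_wait_ms": 12.0}
rules_only = detect_anomaly(captured, SME_RULES)
with_model = detect_anomaly(captured, SME_RULES, primary_model=lambda m: m["cpu_pct"] > 95)
```

In a fuller version the result would be raised together with the recently captured ASH report, as the slide describes.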
Resource usage prediction
Configurable threshold
boundary – notify Admin of
forecasts above here
Actual values
(Black)
Forecast values
(Blue line)
Upper & lower
forecast range
(light blue area)
Unusual values
(anomalies)
Future forecast
values
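The chart described above combines two checks: actual values that fall outside the forecast range are marked as unusual, and future forecast values above a configurable threshold trigger a notification. A minimal sketch of both checks follows. The numbers and function names are made up for illustration, and producing the forecast itself is out of scope here.

```python
def unusual_values(actuals, lower, upper):
    """Indexes of actual values outside the forecast range."""
    return [i for i, (a, lo, hi) in enumerate(zip(actuals, lower, upper)) if not lo <= a <= hi]

def forecasts_above_threshold(future_forecast, threshold):
    """(step, value) pairs for future forecasts above the notification threshold."""
    return [(step, value) for step, value in enumerate(future_forecast, start=1) if value > threshold]

# Hypothetical CPU utilization (%) with its forecast range.
actual = [40, 42, 95, 41]
range_low = [35, 36, 37, 36]
range_high = [50, 51, 52, 51]
anomalies = unusual_values(actual, range_low, range_high)
alerts = forecasts_above_threshold([60, 78, 85, 91], threshold=80)
```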
ML To determine dynamic
maintenance windows
Identify Relevant Workload Metrics
• Ex: Average Active Sessions, CPU/Mem/IO Utilization
Time Series Decomposition
• Trend
• Seasonality
• Residual
Workload Seasonality Determination: Locating Minima
Optimum Window Identification and Validation
Model Generation and Training Flow
Maintenance Slot Identification
Maintenance window identification
START_TIME             CNT    LN(CNT)    SEASONALITY
2018-04-11 15:00:00     290   5.669881   -0.226098
2018-04-11 16:00:00   31120  10.345606   -0.069821
2018-04-11 17:00:00   21530   9.977203   -0.350088
2018-04-11 18:00:00   26240  10.175040   -0.187483
2018-04-11 19:00:00   40520  10.609551   -0.513240
2018-04-11 20:00:00   54270  10.901727    0.019737
2018-04-11 21:00:00   51460  10.848560    0.059213
2018-04-11 22:00:00   44310  10.698966   -0.011312
2018-04-11 23:00:00   25690  10.153857   -0.179156
Current Date : 2018-05-12 15:00:00
Current Position in Seasonality : -0.22609829742533585
Best Maintenance Period in next Cycle : 2018-05-12 19:00:00
Worst Maintenance Period in next Cycle : 2018-05-13 08:00:00
1. Original observation data
2. Convolution filter & average
3. Calculate seasonality
4. Use seasonality to predict best maintenance window
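The four steps above can be approximated with a small additive decomposition. The sketch below is an illustration under assumptions, not the product's implementation: it log-transforms the counts (the third column of values on the slide equals the natural log of CNT, for example ln 290 ≈ 5.669881), estimates the trend with a centered moving average, averages the detrended values per position in the cycle, and picks the lowest point as the best window. The synthetic workload and all function names are hypothetical.

```python
import math

def seasonal_profile(counts, period=24):
    """Average detrended log-count for each position in the cycle."""
    logs = [math.log(c) for c in counts]
    half = period // 2
    detrended = {}
    for i in range(half, len(logs) - half):
        window = logs[i - half:i + half + 1]
        # Centered moving average for an even period: half weight on both ends.
        trend = (sum(window) - 0.5 * (window[0] + window[-1])) / period
        detrended.setdefault(i % period, []).append(logs[i] - trend)
    return {pos: sum(v) / len(v) for pos, v in detrended.items()}

def best_and_worst_slot(profile):
    """Positions with the lowest and highest seasonal load."""
    return min(profile, key=profile.get), max(profile, key=profile.get)

# Synthetic workload: 5 days of hourly counts starting at hour 0 (illustration only).
one_day = [300 if hour == 4 else 1000 + 100 * hour for hour in range(24)]
hourly_counts = one_day * 5
profile = seasonal_profile(hourly_counts)
best_hour, worst_hour = best_and_worst_slot(profile)
```

Because the series is assumed to start at hour 0, the position in the cycle is the hour of day. With real data the start offset would have to be carried along.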
OML examples
Simple SQL Syntax—Statistical Comparisons (t-tests)
Compare average purchase amounts, men vs. women, grouped by INCOME_LEVEL
Statistical Functions
SELECT SUBSTR(cust_income_level, 1, 22) income_level,
       AVG(DECODE(cust_gender, 'M', amount_sold, null)) sold_to_men,
       AVG(DECODE(cust_gender, 'F', amount_sold, null)) sold_to_women,
       STATS_T_TEST_INDEPU(cust_gender, amount_sold, 'STATISTIC', 'F') t_observed,
       STATS_T_TEST_INDEPU(cust_gender, amount_sold) two_sided_p_value
  FROM customers c, sales s
 WHERE c.cust_id = s.cust_id
 GROUP BY ROLLUP(cust_income_level)
 ORDER BY income_level, sold_to_men, sold_to_women, t_observed;
STATS_T_TEST_INDEPU (SQL) example:
P-values < 0.05 show statistically
significant differences in the amounts
purchased by men vs. women
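To make the statistic behind the query less of a black box, the sketch below computes an independent two-sample t statistic with unpooled variances (Welch's t-test), which is the kind of test STATS_T_TEST_INDEPU performs. It is a hand-rolled illustration with made-up purchase amounts, not a reimplementation of the Oracle function, and it stops at the statistic and degrees of freedom because the p-value needs a t-distribution table or library.

```python
from statistics import mean, variance

def welch_t(sample_a, sample_b):
    """t statistic and Welch-Satterthwaite degrees of freedom for two samples."""
    na, nb = len(sample_a), len(sample_b)
    va, vb = variance(sample_a) / na, variance(sample_b) / nb
    t = (mean(sample_a) - mean(sample_b)) / (va + vb) ** 0.5
    df = (va + vb) ** 2 / (va ** 2 / (na - 1) + vb ** 2 / (nb - 1))
    return t, df

# Hypothetical purchase amounts for two groups, for illustration only.
sold_to_men = [10.0, 12.0, 14.0]
sold_to_women = [20.0, 22.0, 24.0]
t_observed, degrees_of_freedom = welch_t(sold_to_men, sold_to_women)
```

A large absolute t value relative to the degrees of freedom corresponds to a small two-sided p-value.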
Simple SQL Syntax—Attribute Importance - ML Model Build (PL/SQL)
OAA Model Build and Real-time SQL Apply Prediction
BEGIN
  DBMS_DATA_MINING.CREATE_MODEL(
    model_name          => 'BUY_INSURANCE_AI',
    mining_function     => DBMS_DATA_MINING.ATTRIBUTE_IMPORTANCE,
    data_table_name     => 'CUST_INSUR_LTV',
    case_id_column_name => 'cust_id',
    target_column_name  => 'BUY_INSURANCE',
    settings_table_name => 'Att_Import_Mode_Settings');
END;
/
SELECT attribute_name, rank, attribute_value
  FROM BUY_INSURANCE_AI
 ORDER BY rank, attribute_name;
Model Results (SQL query)
ATTRIBUTE_NAME            RANK  ATTRIBUTE_VALUE
BANK_FUNDS                   1  0.2161
MONEY_MONTLY_OVERDRAWN       2  0.1489
N_TRANS_ATM                  3  0.1463
N_TRANS_TELLER               4  0.1156
T_AMOUNT_AUTOM_PAYMENTS      5  0.1095
Key Features
Collaborative UI for data scientists
• Packaged with Autonomous Data
Warehouse Cloud (V1)
• Easy access to shared notebooks,
templates, permissions, scheduler, etc.
• SQL ML algorithms API (V1)
• Supports deployment of ML analytics
Machine Learning Notebook for Autonomous Data Warehouse Cloud
Oracle Machine Learning
Multiple Languages UIs Supported for End Users & Apps Development
Oracle Machine Learning
Application Developers, DBAs
R & Python Data Scientists, “Citizen” Data Scientists, Notebook Users & DS Teams
Target “best” customers who have GOOD CREDIT and make payments
Business Usecase
Define Problem Statement
Poorly Defined Better
Data Mining
Technique
Predict employees that leave
• Based on past employees that voluntarily left:
• Create New Attribute EmplTurnover → 0/1
Predict customers that churn
• Based on past customers that have churned:
• Create New Attribute Churn → YES/NO
Target “best” customers
• Recency, Frequency Monetary (RFM) Analysis
• Specific Dollar Amount over Time Window:
• Who has spent $500+ in most recent 18 months
How can I make more $$? • What helps me sell soft drinks & coffee?
Which customers are likely to buy? • How much is each customer likely to spend?
Who are my “best customers”? • What descriptive “rules” describe “best customers”?
How can I combat fraud?
• Which transactions are the most anomalous?
• Then roll-up to physician, claimant, employee…
Data loading and Review
Target “best” customers who have GOOD CREDIT and make payments
Data Scoping
Create New Derived Attributes or “Engineered Features”
Feature Engineering
Source Attribute            New Attribute / “Engineered Feature”
Date of Birth               AGE
Address                     DISTANCE_TO_DESTINATION, COMMUTE_TIME
Call detail records (CDRs)  #_DROPPED_CALLS, PERCENT_INTERNATIONAL
Salary                      PERCENT_VS_PEERS
Purchases                   TOTALS_PER_CATEGORY (e.g. Food, Clothing)
Create new derived attributes to tease more
information out of the data. For example:
• RECENCY, FREQUENCY, MONETARY
(RFM Analysis)
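As a concrete example of deriving new attributes, the sketch below turns raw purchase rows into RECENCY, FREQUENCY, and MONETARY values per customer. The input layout, customer IDs, amounts, and dates are hypothetical. In practice the same aggregation would usually be written as a SQL GROUP BY next to the data.

```python
from datetime import date

def rfm(purchases, as_of):
    """purchases: iterable of (customer_id, purchase_date, amount) tuples."""
    features = {}
    for customer_id, purchase_date, amount in purchases:
        row = features.setdefault(
            customer_id, {"recency_days": None, "frequency": 0, "monetary": 0.0}
        )
        age = (as_of - purchase_date).days
        # Recency is the age of the most recent purchase.
        row["recency_days"] = age if row["recency_days"] is None else min(row["recency_days"], age)
        row["frequency"] += 1
        row["monetary"] += amount
    return features

# Hypothetical purchase history, for illustration only.
history = [
    ("c1", date(2020, 1, 2), 200.0),
    ("c1", date(2019, 12, 1), 350.0),
    ("c2", date(2018, 1, 1), 900.0),
]
rfm_features = rfm(history, as_of=date(2020, 2, 1))
```

A rule such as "spent $500+ in the most recent 18 months" then becomes a simple filter on these derived columns, once the history is limited to that time window.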
Data remains in Database
• Model building and scoring occur in-
database
• Leverage investment in Oracle IT
• Eliminate data duplication
- Eliminate separate analytical servers
Deliver enterprise-wide
“predictive” applications
Don’t move the Data
Traditional ML
Hours, Days or Weeks
Data Extraction
Data Prep &
Transformation
Data Mining
Model Building
Data Mining
Model “Scoring”
Data Prep. &
Transformation
Data Import
Savings
Model “Scoring”
Embedded Data Prep
Data Preparation
Model Building
Oracle’s in-DB Machine Learning
Secs, Mins or Hours
ORACLE
AUTONOMOUS
DATABASE
Increasing sources of relevant data can
boost model accuracy
More Data Variety—Better Predictive Models
Model with 20 variables
Model with “Big Data” and
hundreds to thousands of
input variables including:
• Demographic data
• Purchase POS transactional
data
• “Unstructured data”, text &
comments
• Spatial location data
• Long term vs. recent
historical behavior
• Web visits
• Sensor data
• etc.
Naïve Guess
or Random
100%
0% Population Size
Responders
Model with 75
variables
Model with 250
variables
100%
Engineered Features: derived attributes/variables
that reflect domain knowledge, key to the best models
First, Identify the Key Attributes That Most Influence the Target Attribute
Modeling and Machine Learning
Attribute Importance Model
Next, Build Predictive Models to Predict Customers who are Likely to Have Good_Credit
Modeling and Machine Learning
Split Data into Train and Test
Build and Test Classification Model
Test the ML model’s accuracy
• Randomly selected “hold out” sample
of data that was not used to train the ML
model
• Compute Cumulative Gains, Lift,
Accuracy, etc.
• Review the attributes used in the model
and model coefficients
• Make sure the model makes sense
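The evaluation measures named above can be computed directly from hold-out labels and model scores. The sketch below shows accuracy, cumulative gains, and lift. The labels, scores, and the 0.5 decision cutoff are made-up values for illustration, and the function names are not from any Oracle API.

```python
def accuracy(actual, predicted):
    """Share of hold-out cases where the predicted class matches the actual class."""
    return sum(a == p for a, p in zip(actual, predicted)) / len(actual)

def top_slice(actual, scores, fraction):
    """Actual labels of the highest-scored fraction of cases."""
    ranked = sorted(zip(scores, actual), key=lambda pair: pair[0], reverse=True)
    size = max(1, round(len(ranked) * fraction))
    return [label for _, label in ranked[:size]]

def cumulative_gain(actual, scores, fraction):
    """Share of all positives captured in the top-scored fraction."""
    return sum(top_slice(actual, scores, fraction)) / sum(actual)

def lift(actual, scores, fraction):
    """Positive rate in the top-scored fraction relative to the overall rate."""
    top = top_slice(actual, scores, fraction)
    return (sum(top) / len(top)) / (sum(actual) / len(actual))

# Hypothetical hold-out sample: 1 = good credit, with the model's scores.
actual = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
predicted = [1 if s >= 0.5 else 0 for s in scores]

holdout_accuracy = accuracy(actual, predicted)
gain_top_20 = cumulative_gain(actual, scores, 0.2)
lift_top_20 = lift(actual, scores, 0.2)
```

A lift above 1 in the top slice means the model ranks likely responders ahead of a random selection of the same size.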
Next, Build Predictive Models to Predict Customers who are Likely to Have Good_Credit
Model Evaluation (Machine Learning)
Model Evaluation
Simple SQL Apply scripts run 100% inside the
Database for immediate ML model
deployment
Apply the Models to Predict “Best Customers”
Deployment
Model Apply/”Scoring”
Manage and Analyze All Your Data
Big Data SQL / R
SQL / R / Python
Object
Store
“Engineered Features”
– Derived attributes
that reflect domain
knowledge—key to
best models e.g.:
• Counts
• Totals
• Changes
over time
Boil down the Data Lake
Architecturally,
lots of options
and flexibility
In-Database Machine Learning
More Models
Better Models
Faster, More Secure
Less Cost
Ready to Deploy!
No Need To Extract and
Move Data
Data stays in Database
Zero time required.
No production impact.
Data Preparation and
Transformation
Accelerated with
Automatic Data Prep
No separate environment
required. Much faster data prep.
Data stays protected and secured.
Data Mining and
Model Building
SQL, R, Python
Oracle Data Miner UI
OML Notebooks
Oracle Data Miner and AutoML
greatly speed model building.
Less skill required. No coding.
No Need to Transform
Production Data
Embedded Data
Preparation
No need for second
production instance.
Model Scoring
Accelerated Via
Exadata Database Machine
Faster model validation
Easy to repeat model building as often as needed
• OAA (Oracle Data Mining + Oracle R Enterprise) and ORAAH combined
• OAA includes support for Partitioned Models, Transactional, Unstructured, Geo-spatial, Graph data, etc.
Oracle’s Machine Learning & Adv. Analytics Algorithms
CLASSIFICATION
• Naïve Bayes
• Logistic Regression (GLM)
• Decision Tree
• Random Forest
• Neural Network
• Support Vector Machine
• Explicit Semantic Analysis
CLUSTERING
• Hierarchical K-Means
• Hierarchical O-Cluster
• Expectation Maximization (EM)
ANOMALY DETECTION
• One-Class SVM
TIME SERIES
• State of the art forecasting using
Exponential Smoothing
• Includes all popular models
e.g. Holt-Winters with trends,
seasons, irregularity, missing data
REGRESSION
• Linear Model
• Generalized Linear Model
• Support Vector Machine (SVM)
• Stepwise Linear regression
• Neural Network
• LASSO *
ATTRIBUTE IMPORTANCE
• Minimum Description Length
• Principal Comp Analysis (PCA)
• Unsupervised Pair-wise KL Div
• CUR decomposition for row & AI
ASSOCIATION RULES
• Apriori / market basket
PREDICTIVE QUERIES
• Predict, cluster, detect, features
SQL ANALYTICS
• SQL Windows, SQL Patterns,
SQL Aggregates
FEATURE EXTRACTION
• Principal Comp Analysis (PCA)
• Non-negative Matrix Factorization
• Singular Value Decomposition (SVD)
• Explicit Semantic Analysis (ESA)
TEXT MINING SUPPORT
• Algorithms support text
• Tokenization and theme extraction
• Explicit Semantic Analysis (ESA) for
document similarity
STATISTICAL FUNCTIONS
• Basic statistics: min, max,
median, stdev, t-test, F-test,
Pearson’s, Chi-Sq, ANOVA, etc.
R PACKAGES
• CRAN R Algorithm Packages
through Embedded R Execution
• Spark MLlib algorithm integration
EXPORTABLE ML MODELS
• REST APIs for deployment
ANALYTICAL SQL
• SQL Windows
• SQL Aggregate functions
• LAG/LEAD functions
• SQL for Pattern Matching
• Additional approximate query processing: APPROX_COUNT, APPROX_SUM, APPROX_RANK
• Regular Expressions
• Linear regression
• ANOVA (Analysis of variance)
• Test Distribution fit (e.g. Normal distribution test, Binomial test, Weibull test,
Uniform test, Exponential test, Poisson test, etc.)
• Statistical Aggregates (min, max, mean, median, stdev, mode, quantiles, plus x sigma,
minus x sigma, top n outliers, bottom n outliers)
STATISTICAL FUNCTIONS
• Descriptive statistics (e.g. median, stdev, mode, sum, etc.)
• Hypothesis testing (t-test, F-test, Kolmogorov-Smirnov test, Mann Whitney test,
Wilcoxon Signed Ranks test)
• Correlations analysis (parametric and nonparametric, e.g. Pearson’s test for
correlation, Spearman's rho coefficient, Kendall's tau-b correlation coefficient)
• Ranking functions
• Cross Tabulations with Chi-square statistics
Oracle’s Machine Learning & Adv. Analytics Algorithms
Algorithms Operate on Data
ML and AI are just “Algorithms”
Move the Algorithms; Not the Data!;
It Changes Everything!
Thank You
Any Questions?

Machine Learning Approach for Quality Assessment and Prediction in Large Soft...RAKESH RANA
 
From Model-based to Model and Simulation-based Systems Architectures
From Model-based to Model and Simulation-based Systems ArchitecturesFrom Model-based to Model and Simulation-based Systems Architectures
From Model-based to Model and Simulation-based Systems ArchitecturesObeo
 
Ibm test data_management_v0.4
Ibm test data_management_v0.4Ibm test data_management_v0.4
Ibm test data_management_v0.4Rosario Cunha
 

Introduction to Machine Learning and Data Science using Autonomous Database ILOUG Feb 2020

  • 1. Introduction to Machine Learning: From DBAs to Data Scientists. ILOUG. Sandesh Rao, VP AIOps for the Autonomous Database. @sandeshr https://www.linkedin.com/in/raosandesh/ https://www.slideshare.net/SandeshRao4
  • 2. Safe harbor statement: The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, timing, and pricing of any features or functionality described for Oracle’s products may change and remains at the sole discretion of Oracle Corporation. Statements in this presentation relating to Oracle’s future plans, expectations, beliefs, intentions and prospects are “forward-looking statements” and are subject to material risks and uncertainties. A detailed discussion of these factors and other risks that affect our business is contained in Oracle’s Securities and Exchange Commission (SEC) filings, including our most recent reports on Form 10-K and Form 10-Q under the heading “Risk Factors.” These filings are available on the SEC’s website or on Oracle’s website at http://www.oracle.com/investor. All information in this presentation is current as of September 2019 and Oracle undertakes no duty to update any statement in light of new information or future events.
  • 3. Agenda: 1. Overview of ML and the Autonomous Database; 2. Regression; 3. Classification; 4. Clustering; 5. Anomaly detection; 6. Workload prediction; 7. Dynamic maintenance windows; 8. Oracle Machine Learning examples.
  • 4. Overview of ML and Autonomous Database
  • 5. Traditionally DBAs are Responsible for: Tasks Specific to Business and Innovation: architecture, planning, data modeling; data security and lifecycle management; application-related tuning; end-to-end service level management. Maintenance Tasks: configuration and tuning of systems, network, storage; database provisioning, patching; database backups, H/A, disaster recovery; database optimization. (Figure labels: Value, Scale, Innovation, Maintenance.)
  • 6. Autonomous Database Removes Generic Tasks. Freedom from Drudgery for DBA: More Time to Innovate and Improve the Business. Tasks Specific to Business and Innovation: architecture, planning, data modeling; data security and lifecycle management; application-related tuning; end-to-end service level management. Maintenance Tasks: configuration and tuning of systems, network, storage; database provisioning, patching; database backups, H/A, disaster recovery; database optimization. (Figure labels: Value, Scale, Innovation, Maintenance.)
  • 7. The Evolution of the DBA/Database Developer Role: Data Engineer (architecture, “data wrangler”); Application Tuning (SQL tuning, connection mgmt); Data Security (data classification, data life-cycle mgmt); Machine Learning (solving data-driven problems, discovering insights, making predictions).
  • 8. Database Developer to Data Scientist Journey: You Are Probably Already Doing Most of This Work! Steps: data extraction; data wrangling; deriving new attributes (“feature engineering”); … … …; import predictions & insights; translate and deploy ML models; automate. Data preparation is typically 80% of the work: “Most data scientists spend only 20 percent of their time on actual data analysis and 80 percent of their time finding, cleaning, and reorganizing huge amounts of data, which is an inefficient data strategy” [1]. This is eliminated or minimized with Oracle: the data management platform becomes a combined/hybrid DM + machine learning platform. [1] https://www.infoworld.com/article/3228245/data-science/the-80-20-data-science-dilemma.html
  • 9. “If I had an hour to solve a problem I'd spend 55 minutes thinking about the problem and 5 minutes thinking about solutions.” (Albert Einstein)
  • 10. Why Machine Learning for us and why now? Lots of data needs to be crunched: no time to manually sift through the data. Machine Learning has become accessible: anyone can be a Data Scientist; software and algorithms are available; frameworks allow for massive training with no coding; CI/CD is available for MLOps. Business use cases: how to find value from the data.
  • 11. Analytics Value vs. Maturity (figure; axes: Analytical Maturity and Value of Analytics). Stages in order of increasing maturity: Data (Reports & Dashboards, “What happened?”); Information (Diagnostic Analysis & Reports, “Why did it happen?”); Predictions & Insights (Predictive / Machine Learning, “What WILL happen?”); Applications with ML (“ML Enabled” Applications, automated ML applications).
  • 12. Database Developer to Data Scientist Journey
  • 13. ML Project Workflow: (1) Set the business objectives. (2) Gather, compare, and clean the data. (3) Identify and extract features (important columns) from the imported data; this helps determine how efficient the algorithm will be. (4) Take the input data, also called the training data, and apply the algorithm to it; for the algorithm to function efficiently, it is important to pick the right values for the hyperparameters (the algorithm's input parameters). (5) Once the training data and the algorithm are combined, we get a model.
  • 14. Types of Machine Learning. Supervised Learning: predict future outcomes with the help of training data provided by human experts. Semi-Supervised Learning: discover patterns within raw data and make predictions, which are then reviewed by human experts who provide feedback used to improve model accuracy. Unsupervised Learning: find patterns without any external input other than the raw data. Reinforcement Learning: make decisions based on past rewards for this type of action.
  • 16. REGRESSION Predicting numbers Customer lifetime value Estimate optimal pricing House price estimates
  • 18. CLASSIFICATION Membership of a known class Identify likely high value customers Find customer likely to churn Fraud detection
  • 20. CLUSTERING Membership of an inferred class Customer segmentation Credit risk evaluation Document similarity
  • 22. ANOMALY DETECTION Outliers Dentist billing 85 fillings / hour Employees with high claims / grade One variable moving out of sync
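As a rough illustration of the "dentist billing 85 fillings / hour" kind of outlier, the sketch below flags values using a robust z-score based on the median and the median absolute deviation (MAD). The technique, the 3.5 threshold, and the sample numbers are assumptions chosen for the example; the slides do not say which method was used.

```python
import statistics


def robust_outliers(values, threshold=3.5):
    """Return values whose modified z-score exceeds the threshold.

    The modified z-score uses the median and the median absolute
    deviation (MAD), so one extreme value cannot hide itself by
    inflating the mean and standard deviation.
    """
    med = statistics.median(values)
    mad = statistics.median([abs(v - med) for v in values])
    if mad == 0:
        return []  # No spread at all: nothing can be scored.
    return [v for v in values if 0.6745 * abs(v - med) / mad > threshold]


# Hypothetical billing data: fillings per hour for ten dentists.
fillings_per_hour = [2, 3, 2, 4, 3, 2, 3, 85, 3, 2]
outliers = robust_outliers(fillings_per_hour)
```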
  • 23. ASSOCIATION RULES Finding like-minded people You might be interested in... Root cause analysis Identify “harbingers of failure”
  • 24. TIME SERIES Temporal Aspect Hitting a threshold Forecasting energy use Seasonality of data
  • 26. NEURAL NETWORKS Learn (More) Like A Human Classification Regression Deep learning http://neuralnetworksanddeeplearning.com/chap1.html
  • 27. Machine Learning Algorithms • Multiple Regression, Support Vector Machine, Linear Model, LASSO, Random Forest, Ridge Regression, Generalized Linear Model, Stepwise Linear Regression Regression Association & Collaborative Filtering Reinforcement Learning - brute force, Monte Carlo, temporal difference.... • Many different use cases Neural network & deep Learning with Deep Neural Network • Hierarchical k-means, Orthogonal Partitioning Clustering, Expectation-Maximization Clustering Feature Extraction/Attribute Importance / Component Analysis • Decision Tree, Naive Bayes, Random Forest, Logistic Regression, Support Vector Machine Classification
  • 28. ML To determine workload and deviation from it
  • 29. What is Workload Automatically check workload for past x mins Decide if workload is abnormally high Highlight any abnormal workload issues Optionally run on demand Optionally snooze checking of a component Calculated via machine learning
  • 30. Adaptive Learning Workload Process 1. CAPTURE (every 5 mins): Captures metrics for key performance dimensions across 5 x 1-minute time windows. 2. TRAIN (every week): Using semi-supervised learning via SME threshold rules, the following models are retrained: • Isolation Forest • One-Class Support Vector Machine • Local Outlier Factor. The model with the highest confidence becomes the primary, if confidence is high enough. 3. PREDICT (every 5 mins): Straight after capture, the primary model is used to predict anomalies. Where anomalies are identified, metrics are compared to SME threshold rules to identify the type of anomaly.
  • 31. Prediction (Every 5 minutes) 5 X 1 min metrics captured for each dimension & ASH report captured for later analysis Metrics evaluated by the primary model to determine if there are anomalies If there is no primary model (i.e. <7 days of data or <=95% model confidence) then SME rules are used for anomaly detection Each anomaly is compared against the SME rules to determine which dimension it applies to Any anomalies are raised along with recently captured ASH report
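The selection and fallback rule described here (use the primary model, or fall back to SME rules when there is less than 7 days of data or confidence is at or below 95%) can be sketched as follows. All class, function, and variable names are hypothetical, and the lambdas stand in for real trained models such as Isolation Forest; only the two thresholds come from the slide.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence

MIN_DAYS_OF_DATA = 7    # from the slide: fewer than 7 days means no primary model
MIN_CONFIDENCE = 0.95   # from the slide: confidence <= 95% means no primary model


@dataclass
class TrainedModel:
    name: str
    confidence: float
    is_anomaly: Callable[[Sequence[float]], bool]


def choose_primary(models, days_of_data) -> Optional[TrainedModel]:
    """Pick the highest-confidence model, or None if nothing qualifies."""
    if days_of_data < MIN_DAYS_OF_DATA or not models:
        return None
    best = max(models, key=lambda m: m.confidence)
    return best if best.confidence > MIN_CONFIDENCE else None


def detect(metrics, primary, sme_rule):
    """Use the primary model when there is one, else the SME threshold rule."""
    if primary is None:
        return sme_rule(metrics)
    return primary.is_anomaly(metrics)


def sme_cpu_rule(cpu_samples):
    # Hypothetical SME rule: any CPU reading above 90% is abnormal.
    return any(sample > 90.0 for sample in cpu_samples)


# Stand-ins for retrained models; real ones would wrap fitted estimators.
candidates = [
    TrainedModel("isolation_forest", 0.97, lambda w: max(w) - min(w) > 30),
    TrainedModel("local_outlier_factor", 0.90, lambda w: max(w) > 95),
]
primary = choose_primary(candidates, days_of_data=10)
window = [41.0, 44.5, 97.2, 43.0, 40.8]  # 5 x 1-minute CPU readings
flagged = detect(window, primary, sme_cpu_rule)
```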
  • 32. Resource usage prediction Configurable threshold boundary – notify Admin of forecasts above here Actual values (Black) Forecast values (Blue line) Upper & lower forecast range (light blue area) Unusual values (anomalies) Future forecast values
  • 33. ML To determine dynamic maintenance windows
  • 34. Model Generation and Training Flow: Identify Relevant Workload Metrics (e.g. Average Active Sessions, CPU/Mem/IO Utilization); Time Series Decomposition (Trend, Seasonality, Residual); Workload Seasonality Determination. Maintenance Slot Identification: Locating Minima; Optimum Window Identification and Validation
  • 35. Maintenance window identification. Steps: 1. Original observation data; 2. Convolution filter & average; 3. Calculate seasonality; 4. Use seasonality to predict best maintenance window.
    START_TIME           CNT
    2018-04-11 15:00:00  290
    2018-04-11 16:00:00  31120
    2018-04-11 17:00:00  21530
    2018-04-11 18:00:00  26240
    2018-04-11 19:00:00  40520
    2018-04-11 20:00:00  54270
    2018-04-11 21:00:00  51460
    2018-04-11 22:00:00  44310
    2018-04-11 23:00:00  25690
    START_TIME
    2018-04-11 15:00:00  -0.226098
    2018-04-11 16:00:00  -0.069821
    2018-04-11 17:00:00  -0.350088
    2018-04-11 18:00:00  -0.187483
    2018-04-11 19:00:00  -0.513240
    2018-04-11 20:00:00  0.019737
    2018-04-11 21:00:00  0.059213
    2018-04-11 22:00:00  -0.011312
    2018-04-11 23:00:00  -0.179156
    START_TIME
    2018-04-11 15:00:00  5.669881
    2018-04-11 16:00:00  10.345606
    2018-04-11 17:00:00  9.977203
    2018-04-11 18:00:00  10.175040
    2018-04-11 19:00:00  10.609551
    2018-04-11 20:00:00  10.901727
    2018-04-11 21:00:00  10.848560
    2018-04-11 22:00:00  10.698966
    2018-04-11 23:00:00  10.153857
    Current Date : 2018-05-12 15:00:00
    Current Position in Seasonality : -0.22609829742533585
    Best Maintenance Period in next Cycle : 2018-05-12 19:00:00
    Worst Maintenance Period in next Cycle : 2018-05-13 08:00:00
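A minimal sketch of the four steps on this slide, assuming a classical additive decomposition with a 24-hour period: log-transform the counts, estimate the trend with a centered moving average, average the detrended values per hour to get the seasonal profile, and take its minimum as the best maintenance hour. The log transform is an inference from the data shown (the third listing matches the natural log of CNT, for example ln 290 ≈ 5.669881); the synthetic workload and all function names are assumptions for illustration.

```python
import math

PERIOD = 24  # hours in one seasonal cycle (assumed daily seasonality)


def centered_moving_average(series, period=PERIOD):
    """2 x period centered moving average; None where the window does not fit."""
    half = period // 2
    trend = [None] * len(series)
    for i in range(half, len(series) - half):
        window = series[i - half:i + half + 1]
        # End points get half weight so an even period stays centered.
        total = 0.5 * window[0] + sum(window[1:-1]) + 0.5 * window[-1]
        trend[i] = total / period
    return trend


def seasonal_profile(series, period=PERIOD):
    """Average detrended value per position in the cycle, centered on zero."""
    trend = centered_moving_average(series, period)
    buckets = [[] for _ in range(period)]
    for i, (value, t) in enumerate(zip(series, trend)):
        if t is not None:
            buckets[i % period].append(value - t)
    means = [sum(b) / len(b) for b in buckets]
    offset = sum(means) / period
    return [m - offset for m in means]


# Synthetic hourly workload over 4 days: quietest at 04:00, busiest at 16:00.
counts = [
    30000 + 25000 * math.cos(2 * math.pi * ((h % PERIOD) - 16) / PERIOD)
    for h in range(4 * PERIOD)
]
log_counts = [math.log(c) for c in counts]

profile = seasonal_profile(log_counts)
best_hour = min(range(PERIOD), key=lambda h: profile[h])
worst_hour = max(range(PERIOD), key=lambda h: profile[h])
```

The series needs to span more than two full periods so that every hour of the day has at least one detrended observation.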
  • 37. Simple SQL Syntax: Statistical Comparisons (t-tests). Statistical Functions. Compare average purchase amounts, men vs. women, grouped by INCOME_LEVEL. STATS_T_TEST_INDEPU (SQL) example: SELECT SUBSTR(cust_income_level, 1, 22) income_level, AVG(DECODE(cust_gender, 'M', amount_sold, null)) sold_to_men, AVG(DECODE(cust_gender, 'F', amount_sold, null)) sold_to_women, STATS_T_TEST_INDEPU(cust_gender, amount_sold, 'STATISTIC', 'F') t_observed, STATS_T_TEST_INDEPU(cust_gender, amount_sold) two_sided_p_value FROM customers c, sales s WHERE c.cust_id = s.cust_id GROUP BY ROLLUP(cust_income_level) ORDER BY income_level, sold_to_men, sold_to_women, t_observed; P-values < 0.05 show statistically significant differences in the amounts purchased by men vs. women
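For readers who want to see what the `t_observed` column computes, here is a small sketch of an independent-samples t statistic with unpooled variances (Welch's t-test), which is what STATS_T_TEST_INDEPU is understood to implement. The function name and the sample amounts are hypothetical; the p-value is left out because the standard library has no t-distribution.

```python
import math
import statistics


def welch_t(sample_a, sample_b):
    """Return (t statistic, degrees of freedom) for two independent samples.

    Variances are not pooled, so the samples may have different spreads.
    Degrees of freedom use the Welch-Satterthwaite approximation.
    """
    n_a, n_b = len(sample_a), len(sample_b)
    # Squared standard errors of each sample mean.
    se2_a = statistics.variance(sample_a) / n_a
    se2_b = statistics.variance(sample_b) / n_b
    t = (statistics.mean(sample_a) - statistics.mean(sample_b)) / math.sqrt(se2_a + se2_b)
    df = (se2_a + se2_b) ** 2 / (se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1))
    return t, df


# Hypothetical purchase amounts for one income level.
sold_to_men = [120.0, 95.5, 130.25, 110.0, 101.75]
sold_to_women = [98.0, 102.5, 87.25, 93.0, 105.0, 99.5]
t_observed, degrees_of_freedom = welch_t(sold_to_men, sold_to_women)
```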
  • 38. Simple SQL Syntax: Attribute Importance. OAA Model Build and Real-time SQL Apply Prediction. ML Model Build (PL/SQL): BEGIN DBMS_DATA_MINING.CREATE_MODEL( model_name => 'BUY_INSURANCE_AI', mining_function => DBMS_DATA_MINING.ATTRIBUTE_IMPORTANCE, data_table_name => 'CUST_INSUR_LTV', case_id_column_name => 'cust_id', target_column_name => 'BUY_INSURANCE', settings_table_name => 'Att_Import_Mode_Settings'); END; / Model Results (SQL query): SELECT attribute_name, rank, attribute_value FROM BUY_INSURANCE_AI ORDER BY rank, attribute_name; Output (ATTRIBUTE_NAME, RANK, ATTRIBUTE_VALUE): BANK_FUNDS 1 0.2161; MONEY_MONTLY_OVERDRAWN 2 0.1489; N_TRANS_ATM 3 0.1463; N_TRANS_TELLER 4 0.1156; T_AMOUNT_AUTOM_PAYMENTS 5 0.1095
  • 39. Key Features Collaborative UI for data scientists • Packaged with Autonomous Data Warehouse Cloud (V1) • Easy access to shared notebooks, templates, permissions, scheduler, etc. • SQL ML algorithms API (V1) • Supports deployment of ML analytics Machine Learning Notebook for Autonomous Data Warehouse Cloud Oracle Machine Learning
  • 40. Multiple Languages UIs Supported for End Users & Apps Development. Oracle Machine Learning: Application Developers, DBAs, R & Python Data Scientists, “Citizen” Data Scientists, Notebook Users & DS Teams
  • 41. Target “best” customers who have GOOD CREDIT and make payments Business Usecase
  • 42. Define Problem Statement Poorly Defined Better Data Mining Technique Predict employees that leave • Based on past employees that voluntarily left: • Create New Attribute EmplTurnover → 0/1 Predict customers that churn • Based on past customers that have churned: • Create New Attribute Churn → YES/NO Target “best” customers • Recency, Frequency, Monetary (RFM) Analysis • Specific Dollar Amount over Time Window: • Who has spent $500+ in most recent 18 months How can I make more $$? • What helps me sell soft drinks & coffee? Which customers are likely to buy? • How much is each customer likely to spend? Who are my “best customers”? • What descriptive “rules” describe “best customers”? How can I combat fraud? • Which transactions are the most anomalous? • Then roll-up to physician, claimant, employee…
  • 44. Target “best” customers who have GOOD CREDIT and make payments Data Scoping
  • 45. Feature Engineering: Create New Derived Attributes or “Engineered Features”. Source Attribute → New Attribute / “Engineered Feature”: Date of Birth → AGE; Address → DISTANCE_TO_DESTINATION, COMMUTE_TIME; Call detail records (CDRs) → #_DROPPED_CALLS, PERCENT_INTERNATIONAL; Salary → PERCENT_VS_PEERS; Purchases → TOTALS_PER_CATEGORY (e.g. Food, Clothing)
  • 46. Feature Engineering: Create New Derived Attributes or “Engineered Features”. Create new derived attributes to tease more information out of the data. For example: • RECENCY, FREQUENCY, MONETARY (RFM Analysis)
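As an illustration of RFM feature engineering, the sketch below derives recency, frequency, and monetary values per customer from raw transactions. The record layout `(customer_id, purchase_date, amount)`, the function name, and the sample rows are assumptions made for the example.

```python
from datetime import date


def rfm(transactions, as_of):
    """Derive RFM features per customer.

    transactions: iterable of (customer_id, purchase_date, amount).
    Returns {customer_id: {"recency_days", "frequency", "monetary"}}.
    """
    summary = {}
    for customer_id, purchased_on, amount in transactions:
        entry = summary.setdefault(
            customer_id, {"last": purchased_on, "frequency": 0, "monetary": 0.0}
        )
        entry["last"] = max(entry["last"], purchased_on)  # most recent purchase
        entry["frequency"] += 1                           # number of purchases
        entry["monetary"] += amount                       # total amount spent
    return {
        cid: {
            "recency_days": (as_of - e["last"]).days,
            "frequency": e["frequency"],
            "monetary": e["monetary"],
        }
        for cid, e in summary.items()
    }


# Hypothetical transactions.
sales = [
    (1, date(2019, 9, 20), 100.0),
    (1, date(2019, 6, 1), 50.0),
    (2, date(2018, 12, 31), 600.0),
]
features = rfm(sales, as_of=date(2019, 9, 30))
```

A rule such as "spent $500+ in the most recent 18 months" then becomes a simple filter on `monetary` and `recency_days`.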
  • 47. Don’t move the Data. Data remains in Database • Model building and scoring occur in-database • Leverage investment in Oracle IT • Eliminate data duplication - Eliminate separate analytical servers. Deliver enterprise-wide “predictive” applications. Traditional ML (Hours, Days or Weeks): Data Extraction, Data Prep & Transformation, Data Mining Model Building, Data Mining Model “Scoring”, Data Prep. & Transformation, Data Import. Oracle’s in-DB Machine Learning (Secs, Mins or Hours): Model “Scoring”, Embedded Data Prep, Data Preparation, Model Building. Savings. ORACLE AUTONOMOUS DATABASE
  • 48. More Data Variety, Better Predictive Models. Increasing sources of relevant data can boost model accuracy. Chart (Responders vs. Population Size, 0% to 100%): Naïve Guess or Random; Model with 20 variables; Model with 75 variables; Model with 250 variables; Model with “Big Data” and hundreds to thousands of input variables including: • Demographic data • Purchase POS transactional data • “Unstructured data”, text & comments • Spatial location data • Long term vs. recent historical behavior • Web visits • Sensor data • etc. Engineered Features: derived attributes/variables that reflect domain knowledge, key to best models
  • 49. First, Identify the Key Attributes That Most Influence the Target Attribute Modeling and Machine Learning Attribute Importance Model
  • 50. Next, Build Predictive Models to Predict Customers who are Likely to Have Good_Credit Modeling and Machine Learning Split Data into Train and Test Build and Test Classification Model
  • 51. Model Evaluation (Machine Learning): Next, Build Predictive Models to Predict Customers who are Likely to Have Good_Credit. Test the ML model’s accuracy • Use a randomly selected “hold out” sample, set aside from the data and not used to train the ML model • Compute Cumulative Gains, Lift, Accuracy, etc. • Review the attributes used in the model and model coefficients • Make sure the model makes sense
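The evaluation steps can be sketched as below: set aside a random hold-out sample, then compute accuracy and lift on it. The function names, the 30% hold-out fraction, and the top-20% lift cut-off are assumptions for the example, not values from the slides.

```python
import random


def train_test_split(rows, test_fraction=0.3, seed=42):
    """Randomly set aside a hold-out sample that is never used for training."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def accuracy(actual, predicted):
    """Share of hold-out rows where the predicted label matches the actual one."""
    hits = sum(1 for a, p in zip(actual, predicted) if a == p)
    return hits / len(actual)


def lift_at(actual, scores, fraction=0.2):
    """Response rate in the top-scored fraction divided by the overall rate.

    actual holds 0/1 outcomes; scores holds the model's predicted
    probabilities for the same rows.
    """
    ranked = sorted(zip(scores, actual), key=lambda pair: pair[0], reverse=True)
    top_n = max(1, round(len(ranked) * fraction))
    top_rate = sum(a for _, a in ranked[:top_n]) / top_n
    base_rate = sum(actual) / len(actual)
    return top_rate / base_rate


# Hypothetical hold-out outcomes and model scores.
train_rows, test_rows = train_test_split(range(10))
holdout_actual = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
holdout_scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.0]
top_lift = lift_at(holdout_actual, holdout_scores, fraction=0.2)
```

A lift above 1 in the top-scored segment means the model ranks likely responders ahead of a random selection.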
  • 52. Deployment: Apply the Models to Predict “Best Customers”. Simple SQL Apply scripts run 100% inside the Database for immediate ML model deployment. Model Apply/“Scoring”
  • 54. Manage and Analyze All Your Data Big Data SQL / R SQL / R / Python Object Store “Engineered Features” – Derived attributes that reflect domain knowledge—key to best models e.g.: • Counts • Totals • Changes over time Boil down the Data Lake Architecturally, lots of options and flexibility
  • 55. In-Database Machine Learning More Models Better Models Faster, More Secure Less Cost Ready to Deploy! No Need To Extract and Move Data Data stays in Database Zero time required. No production impact. Data Preparation and Transformation Accelerated with Automatic Data Prep No separate environment required. Much faster data prep. Data stays protected and secured. Data Mining and Model Building SQL, R, Python Oracle Data Miner UI OML Notebooks Oracle Data Miner and AutoML greatly speed model building. Less skill required. No coding. No Need to Transform Production Data Embedded Data Preparation No need for second production instance. Model Scoring Accelerated Via Exadata Database Machine Faster model validation Easy to repeat model building as often as needed
  • 56. Oracle’s Machine Learning & Adv. Analytics Algorithms • OAA (Oracle Data Mining + Oracle R Enterprise) and ORAAH combined • OAA includes support for Partitioned Models, Transactional, Unstructured, Geo-spatial, Graph data, etc. CLASSIFICATION • Naïve Bayes • Logistic Regression (GLM) • Decision Tree • Random Forest • Neural Network • Support Vector Machine • Explicit Semantic Analysis CLUSTERING • Hierarchical K-Means • Hierarchical O-Cluster • Expectation Maximization (EM) ANOMALY DETECTION • One-Class SVM TIME SERIES • State of the art forecasting using Exponential Smoothing • Includes all popular models e.g. Holt-Winters with trends, seasons, irregularity, missing data REGRESSION • Linear Model • Generalized Linear Model • Support Vector Machine (SVM) • Stepwise Linear regression • Neural Network • LASSO * ATTRIBUTE IMPORTANCE • Minimum Description Length • Principal Comp Analysis (PCA) • Unsupervised Pair-wise KL Div • CUR decomposition for row & AI ASSOCIATION RULES • Apriori / market basket PREDICTIVE QUERIES • Predict, cluster, detect, features SQL ANALYTICS • SQL Windows, SQL Patterns, SQL Aggregates FEATURE EXTRACTION • Principal Comp Analysis (PCA) • Non-negative Matrix Factorization • Singular Value Decomposition (SVD) • Explicit Semantic Analysis (ESA) TEXT MINING SUPPORT • Algorithms support text • Tokenization and theme extraction • Explicit Semantic Analysis (ESA) for document similarity STATISTICAL FUNCTIONS • Basic statistics: min, max, median, stdev, t-test, F-test, Pearson’s, Chi-Sq, ANOVA, etc. R PACKAGES • CRAN R Algorithm Packages through Embedded R Execution • Spark MLlib algorithm integration EXPORTABLE ML MODELS • REST APIs for deployment
  • 57. ANALYTICAL SQL • SQL Windows • SQL Aggregate functions • LAG/LEAD functions • SQL for Pattern Matching • Additional approximate query processing: APPROX_COUNT, APPROX_SUM, APPROX_RANK • Regular Expressions • Linear regression • ANOVA (Analysis of variance) • Test Distribution fit (e.g. Normal distribution test, Binomial test, Weibull test, Uniform test, Exponential test, Poisson test, etc.) • Statistical Aggregates (min, max, mean, median, stdev, mode, quantiles, plus x sigma, minus x sigma, top n outliers, bottom n outliers) STATISTICAL FUNCTIONS • Descriptive statistics (e.g. median, stdev, mode, sum, etc.) • Hypothesis testing (t-test, F-test, Kolmogorov-Smirnov test, Mann Whitney test, Wilcoxon Signed Ranks test) • Correlations analysis (parametric and nonparametric e.g. Pearson’s test for correlation, Spearman's rho coefficient, Kendall's tau-b correlation coefficient) • Ranking functions • Cross Tabulations with Chi-square statistics Oracle’s Machine Learning & Adv. Analytics Algorithms
  • 58. Algorithms Operate on Data. ML and AI are just “Algorithms”. Move the Algorithms, Not the Data! It Changes Everything!
  • 59. Thank You. Any Questions? Sandesh Rao VP AIOps for the Autonomous Database @sandeshr https://www.linkedin.com/in/raosandesh/ https://www.slideshare.net/SandeshRao4