SlideShare a Scribd company logo
1 of 1
Download to read offline
The number of known solutions is very large; roughly five thousand.
How informative is each error code? An error code is informative if its presence in an incoming incident
narrows down the incident’s possible solutions, i.e., reduces our uncertainty of the incident’s solution.
We can quantify the notion of solution “uncertainty” using entropy, a metric from information theory.
Entropy is a quantitative measure of uncertainty and disorder. Solution entropy measures the uncertainty
of an incoming incident’s solution.
𝑠𝑜𝑙𝑢𝑡𝑖𝑜𝑛 𝑒𝑛𝑡𝑟𝑜𝑝𝑦 ≔ − ෍
𝑖
𝑝𝑖 log 𝑝𝑖 , where 𝑝𝑖 is the proportion of incidents that have solution 𝑖.
Definitions:
• Baseline solution entropy (6.35): Solution entropy of all incidents (see top graph).
• Error type solution entropy: Solution entropy of incidents containing that error type.
• Error number solution entropy: Solution entropy of incidents containing that error type and error
number.
• Nearly all error type solution entropies are significantly lower than the baseline solution entropy,
indicating the presence of these error types significantly reduces solution uncertainty.
• Most error number solution entropies are significantly lower than their respective error type solution
entropies, indicating these error numbers carry additional information.
Alert descriptions tend to be messy and are in a format that does not allow
easy discovery and extraction of error codes:
<EventMessage>Apr 29 16:04:19 cmic CMICCore[9327]: DEGRADED: CMICCore: 11 #Failed over
for CMIC 1, 132, 1, 11 (39.84.32.11). This CMIC has taken over its
duties.</EventMessage><Subsystem>ServerMgmt</Subsystem><TrackingID>EBAY18-
CMICFailover</TrackingID><P
1. Parse out monograms/bigrams preceding numbers from alerts.
2. Manually determine which monograms/bigrams are potential error codes.
Stop words and noise are omitted.
3. Create and test regular expression patterns that extract these error codes.
4. Use the regular expression patterns to extract error codes from alerts.
• Identified 26 summary alert error types and 33 MPP alert error types from
~30,000 incidents from ~2,000 customer sites from July 2017 to July 2019.
Teradata's Global Support Organization (GSO)
strives to improve customer experience by quickly
and effectively resolving customers' technical issues
(i.e. incidents).
SAM is short for Service Analytics Machine, GSO's
initiative to automate certain aspects of customer
support using AI and machine learning. SAM
analyzes telemetry and currently serves as a
recommendation engine that recommends solutions
to known customer issues.
SAM’s Impact on Time to Resolutionof Known Issues
An incident occurs when a customer runs into
something wrong or unexpected with one of
Teradata's products.
For example, a customer may not be able to use a
product because the product crashes, is
unresponsive, or is slow.
AIC (Automatic Incident Creation) proactively
monitors telemetry and automatically creates an
incident when the AIC system detects a significant
cluster of anomalies/errors.
Alerts contain useful telemetry for SAM to analyze,
such as:
• Customer ID
• Timestamps
• Type of incident/alert
• Alert description
• Version numbers
• Backtraces
• Error codes
Feature engineering: Identify, extract, and profile
informative error codes for SAM to use as features
to learn on, hence improving SAM’s predictive
power.
• Identified, extracted, and profiled informative
error codes for SAM to use as features to
learn on, leading to improved predictive
power.
• Increased insight around error codes.
• Potentially increase the number of incidents
fingerprinted if error codes appear in incident
not covered by SAM’s existing models.
Teradata can leverage analytics and machine
learning to identify and extract informative
features, e.g. error codes, from noisy
telemetry.
• Incorporate error code features into SAM.
• Communicate findings with the broader
product support community to share insight
around error codes.
• Fully automate the process of identifying and
extracting informative error codes from
telemetry.
• Python (pandas, scikit-learn, NumPy, SciPy,
matplotlib, seaborn, pytest, re)
• Teradata Vantage
• SQL
• Jupyter Notebooks
• Git/GitHub
• Jenkins
• JIRA
• Getting accustomed to the software-heavy
aspect of building and maintaining a
machine learning model in production.
• Deriving value from messy, cryptic text data.
• This internship verified my passion in
pursuing data science as a career.
• Data science is a team effort; real data
science problems are complex and require
the collaboration of a cross-functional team.
• Manager: Brandon Quach
• Mentor: Chris Smith
• Director: Jenny Wang
• Product Owner: Brian Hutchins
• SAM data scientists: Brandon Quach, Chris
Smith, Jiacong Li, Andrew Washington
• The entire SAM team
Recommended
Solution
SAM
PSR Rules
Is a group of MPP
alerts
problematic?
MPP Alert
Rules
Is the telemetry
anomalous?
AIC Admin
Rules
Is a summary
alert serious
enough to warrant
an incident?
Telemetry
MPP
Alerts
Summary
Alerts
AIC
Incident
MPPalertsbundleintoasummaryalert.Summaryalertscomposeanincident.
Error Type Error Number
Error Code
CMICCore: 11
August 19 2019
Example of an informative error type with uninformative error numbers (the error
numbers tend to be unique, suggestingthe error numbers are arbitraryprocess IDs):
Example of an informative error type with uninformative error numbers (the error
number entropies are scarcely lower than the error type entropy):
Examples of an informativeerror type with informative error numbers:
Noisy Telemetry
Analytics-Driven Feature Extraction
Informative Features
Predictive Power

More Related Content

What's hot

Amazon Product Review Sentiment Analysis with Machine Learning
Amazon Product Review Sentiment Analysis with Machine LearningAmazon Product Review Sentiment Analysis with Machine Learning
Amazon Product Review Sentiment Analysis with Machine Learning
ijtsrd
 

What's hot (17)

Anomaly detection workshop
Anomaly detection workshopAnomaly detection workshop
Anomaly detection workshop
 
Anomaly Detection
Anomaly DetectionAnomaly Detection
Anomaly Detection
 
Anomaly Detection in Seasonal Time Series
Anomaly Detection in Seasonal Time SeriesAnomaly Detection in Seasonal Time Series
Anomaly Detection in Seasonal Time Series
 
Anomaly detection Workshop slides
Anomaly detection Workshop slidesAnomaly detection Workshop slides
Anomaly detection Workshop slides
 
Musings of kaggler
Musings of kagglerMusings of kaggler
Musings of kaggler
 
How ml can improve purchase conversions
How ml can improve purchase conversionsHow ml can improve purchase conversions
How ml can improve purchase conversions
 
Amazon Product Review Sentiment Analysis with Machine Learning
Amazon Product Review Sentiment Analysis with Machine LearningAmazon Product Review Sentiment Analysis with Machine Learning
Amazon Product Review Sentiment Analysis with Machine Learning
 
TURF Analysis
TURF Analysis TURF Analysis
TURF Analysis
 
PR-190: A Baseline For Detecting Misclassified and Out-of-Distribution Examp...
PR-190: A Baseline For Detecting Misclassified and Out-of-Distribution  Examp...PR-190: A Baseline For Detecting Misclassified and Out-of-Distribution  Examp...
PR-190: A Baseline For Detecting Misclassified and Out-of-Distribution Examp...
 
Unsupervised Anomaly Detection with Isolation Forest - Elena Sharova
Unsupervised Anomaly Detection with Isolation Forest - Elena SharovaUnsupervised Anomaly Detection with Isolation Forest - Elena Sharova
Unsupervised Anomaly Detection with Isolation Forest - Elena Sharova
 
Anomaly detection
Anomaly detectionAnomaly detection
Anomaly detection
 
Anomaly Detection for Real-World Systems
Anomaly Detection for Real-World SystemsAnomaly Detection for Real-World Systems
Anomaly Detection for Real-World Systems
 
Anomaly detection, part 1
Anomaly detection, part 1Anomaly detection, part 1
Anomaly detection, part 1
 
Anomaly Detection and Spark Implementation - Meetup Presentation.pptx
Anomaly Detection and Spark Implementation - Meetup Presentation.pptxAnomaly Detection and Spark Implementation - Meetup Presentation.pptx
Anomaly Detection and Spark Implementation - Meetup Presentation.pptx
 
Anomaly detection
Anomaly detectionAnomaly detection
Anomaly detection
 
Optimization
OptimizationOptimization
Optimization
 
Model Risk Management : Best Practices
Model Risk Management : Best PracticesModel Risk Management : Best Practices
Model Risk Management : Best Practices
 

Similar to Intern Poster Presentation

BsidesLVPresso2016_JZeditsv6
BsidesLVPresso2016_JZeditsv6BsidesLVPresso2016_JZeditsv6
BsidesLVPresso2016_JZeditsv6
Rod Soto
 

Similar to Intern Poster Presentation (20)

Datascience101presentation4
Datascience101presentation4Datascience101presentation4
Datascience101presentation4
 
Analytics demystified
Analytics demystifiedAnalytics demystified
Analytics demystified
 
Comparative Study of Machine Learning Algorithms for Sentiment Analysis with ...
Comparative Study of Machine Learning Algorithms for Sentiment Analysis with ...Comparative Study of Machine Learning Algorithms for Sentiment Analysis with ...
Comparative Study of Machine Learning Algorithms for Sentiment Analysis with ...
 
AI cybersecurity
AI cybersecurityAI cybersecurity
AI cybersecurity
 
The Automation Firehose: Be Strategic and Tactical by Thomas Haver
The Automation Firehose: Be Strategic and Tactical by Thomas HaverThe Automation Firehose: Be Strategic and Tactical by Thomas Haver
The Automation Firehose: Be Strategic and Tactical by Thomas Haver
 
Artificial Intelligence Primer
Artificial Intelligence PrimerArtificial Intelligence Primer
Artificial Intelligence Primer
 
Permission Driven Malware Detection using Machine Learning
Permission Driven Malware Detection using Machine LearningPermission Driven Malware Detection using Machine Learning
Permission Driven Malware Detection using Machine Learning
 
Analytics for large-scale time series and event data
Analytics for large-scale time series and event dataAnalytics for large-scale time series and event data
Analytics for large-scale time series and event data
 
Can You Really Automate Yourself Secure
Can You Really Automate Yourself SecureCan You Really Automate Yourself Secure
Can You Really Automate Yourself Secure
 
Imtiaz khan data_science_analytics
Imtiaz khan data_science_analyticsImtiaz khan data_science_analytics
Imtiaz khan data_science_analytics
 
Foutse_Khomh.pptx
Foutse_Khomh.pptxFoutse_Khomh.pptx
Foutse_Khomh.pptx
 
The Machine Learning Audit
The Machine Learning AuditThe Machine Learning Audit
The Machine Learning Audit
 
BsidesLVPresso2016_JZeditsv6
BsidesLVPresso2016_JZeditsv6BsidesLVPresso2016_JZeditsv6
BsidesLVPresso2016_JZeditsv6
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine Learning: Addressing the Disillusionment to Bring Actual Business Ben...
Machine Learning: Addressing the Disillusionment to Bring Actual Business Ben...Machine Learning: Addressing the Disillusionment to Bring Actual Business Ben...
Machine Learning: Addressing the Disillusionment to Bring Actual Business Ben...
 
Machine Learning AND Deep Learning for OpenPOWER
Machine Learning AND Deep Learning for OpenPOWERMachine Learning AND Deep Learning for OpenPOWER
Machine Learning AND Deep Learning for OpenPOWER
 
AI-900 Slides.pptx
AI-900 Slides.pptxAI-900 Slides.pptx
AI-900 Slides.pptx
 
Big Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloBig Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao Paulo
 
Intro 2 Machine Learning
Intro 2 Machine LearningIntro 2 Machine Learning
Intro 2 Machine Learning
 
AI for Software Engineering
AI for Software EngineeringAI for Software Engineering
AI for Software Engineering
 

Recently uploaded

Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 

Recently uploaded (20)

Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 

Intern Poster Presentation

  • 1. The number of known solutions is very large; roughly five thousand. How informative is each error code? An error code is informative if its presence in an incoming incident narrows down the incident’s possible solutions, i.e., reduces our uncertainty of the incident’s solution. We can quantify the notion of solution “uncertainty” using entropy, a metric from information theory. Entropy is a quantitative measure of uncertainty and disorder. Solution entropy measures the uncertainty of an incoming incident’s solution. 𝑠𝑜𝑙𝑢𝑡𝑖𝑜𝑛 𝑒𝑛𝑡𝑟𝑜𝑝𝑦 ≔ − ෍ 𝑖 𝑝𝑖 log 𝑝𝑖 , where 𝑝𝑖 is the proportion of incidents that have solution 𝑖. Definitions: • Baseline solution entropy (6.35): Solution entropy of all incidents (see top graph). • Error type solution entropy: Solution entropy of incidents containing that error type. • Error number solution entropy: Solution entropy of incidents containing that error type and error number. • Nearly all error type solution entropies are significantly lower than the baseline solution entropy, indicating the presence of these error types significantly reduces solution uncertainty. • Most error number solution entropies are significantly lower than their respective error type solution entropies, indicating these error numbers carry additional information. Alert descriptions tend to be messy and are in a format that does not allow easy discovery and extraction of error codes: <EventMessage>Apr 29 16:04:19 cmic CMICCore[9327]: DEGRADED: CMICCore: 11 #Failed over for CMIC 1, 132, 1, 11 (39.84.32.11). This CMIC has taken over its duties.</EventMessage><Subsystem>ServerMgmt</Subsystem><TrackingID>EBAY18- CMICFailover</TrackingID><P 1. Parse out monograms/bigrams preceding numbers from alerts. 2. Manually determine which monograms/bigrams are potential error codes. Stop words and noise are omitted. 3. Create and test regular expression patterns that extract these error codes. 4. Use the regular expression patterns to extract error codes from alerts. • Identified 26 summary alert error types and 33 MPP alert error types from ~30,000 incidents from ~2,000 customer sites from July 2017 to July 2019. Teradata's Global Support Organization (GSO) strives to improve customer experience by quickly and effectively resolving customers' technical issues (i.e. incidents). SAM is short for Service Analytics Machine, GSO's initiative to automate certain aspects of customer support using AI and machine learning. SAM analyzes telemetry and currently serves as a recommendation engine that recommends solutions to known customer issues. SAM’s Impact on Time to Resolutionof Known Issues An incident occurs when a customer runs into something wrong or unexpected with one of Teradata's products. For example, a customer may not be able to use a product because the product crashes, is unresponsive, or is slow. AIC (Automatic Incident Creation) proactively monitors telemetry and automatically creates an incident when the AIC system detects a significant cluster of anomalies/errors. Alerts contain useful telemetry for SAM to analyze, such as: • Customer ID • Timestamps • Type of incident/alert • Alert description • Version numbers • Backtraces • Error codes Feature engineering: Identify, extract, and profile informative error codes for SAM to use as features to learn on, hence improving SAM’s predictive power. • Identified, extracted, and profiled informative error codes for SAM to use as features to learn on, leading to improved predictive power. • Increased insight around error codes. • Potentially increase the number of incidents fingerprinted if error codes appear in incident not covered by SAM’s existing models. Teradata can leverage analytics and machine learning to identify and extract informative features, e.g. error codes, from noisy telemetry. • Incorporate error code features into SAM. • Communicate findings with the broader product support community to share insight around error codes. • Fully automate the process of identifying and extracting informative error codes from telemetry. • Python (pandas, scikit-learn, NumPy, SciPy, matplotlib, seaborn, pytest, re) • Teradata Vantage • SQL • Jupyter Notebooks • Git/GitHub • Jenkins • JIRA • Getting accustomed to the software-heavy aspect of building and maintaining a machine learning model in production. • Deriving value from messy, cryptic text data. • This internship verified my passion in pursuing data science as a career. • Data science is a team effort; real data science problems are complex and require the collaboration of a cross-functional team. • Manager: Brandon Quach • Mentor: Chris Smith • Director: Jenny Wang • Product Owner: Brian Hutchins • SAM data scientists: Brandon Quach, Chris Smith, Jiacong Li, Andrew Washington • The entire SAM team Recommended Solution SAM PSR Rules Is a group of MPP alerts problematic? MPP Alert Rules Is the telemetry anomalous? AIC Admin Rules Is a summary alert serious enough to warrant an incident? Telemetry MPP Alerts Summary Alerts AIC Incident MPPalertsbundleintoasummaryalert.Summaryalertscomposeanincident. Error Type Error Number Error Code CMICCore: 11 August 19 2019 Example of an informative error type with uninformative error numbers (the error numbers tend to be unique, suggestingthe error numbers are arbitraryprocess IDs): Example of an informative error type with uninformative error numbers (the error number entropies are scarcely lower than the error type entropy): Examples of an informativeerror type with informative error numbers: Noisy Telemetry Analytics-Driven Feature Extraction Informative Features Predictive Power