The number of known solutions is very large: roughly five thousand.
How informative is each error code? An error code is informative if its presence in an incoming incident
narrows down the incident’s possible solutions, i.e., reduces our uncertainty of the incident’s solution.
We can quantify the notion of solution “uncertainty” using entropy, a metric from information theory.
Entropy is a quantitative measure of uncertainty and disorder. Solution entropy measures the uncertainty
of an incoming incident’s solution.
$$\text{solution entropy} := -\sum_i p_i \log p_i,$$
where $p_i$ is the proportion of incidents that have solution $i$.
Definitions:
• Baseline solution entropy (6.35): Solution entropy of all incidents (see top graph).
• Error type solution entropy: Solution entropy of incidents containing that error type.
• Error number solution entropy: Solution entropy of incidents containing that error type and error
number.
• Nearly all error type solution entropies are significantly lower than the baseline solution entropy,
indicating the presence of these error types significantly reduces solution uncertainty.
• Most error number solution entropies are significantly lower than their respective error type solution
entropies, indicating these error numbers carry additional information.
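The three entropies defined above can be compared with a few lines of Python. This is a minimal sketch, not the code used for the poster: the function name `solution_entropy`, the toy incident tuples, and the choice of log base 2 are all assumptions made for illustration (the poster does not state which log base produced the 6.35 baseline).

```python
import math
from collections import Counter


def solution_entropy(solutions, base=2):
    """Entropy of the empirical solution distribution.

    Computes sum_i p_i * log(1 / p_i), which equals -sum_i p_i * log(p_i).
    The log base is an assumption; the poster does not specify one.
    """
    counts = Counter(solutions)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return sum(
        (c / total) * math.log(total / c) / math.log(base)
        for c in counts.values()
    )


# Hypothetical toy incidents: (error type, error number, solution ID).
incidents = [
    ("CMICCore", "11", "S1"),
    ("CMICCore", "11", "S1"),
    ("CMICCore", "12", "S2"),
    ("CMICCore", "12", "S2"),
    ("OtherType", "7", "S3"),
    ("OtherType", "7", "S4"),
    ("OtherType", "8", "S5"),
    ("OtherType", "8", "S6"),
]

# Baseline: all incidents.
baseline = solution_entropy(s for _, _, s in incidents)

# Error type entropy: incidents containing error type "CMICCore".
type_entropy = solution_entropy(
    s for t, _, s in incidents if t == "CMICCore"
)

# Error number entropy: incidents containing "CMICCore" with number "11".
number_entropy = solution_entropy(
    s for t, n, s in incidents if t == "CMICCore" and n == "11"
)
```

On this toy data the baseline is 2.5 bits, the error type entropy is 1 bit, and the error number entropy is 0 bits, which mirrors the pattern described above: each additional piece of the error code narrows down the possible solutions.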
Alert descriptions tend to be messy and are in a format that does not allow
easy discovery and extraction of error codes:
<EventMessage>Apr 29 16:04:19 cmic CMICCore[9327]: DEGRADED: CMICCore: 11 #Failed over
for CMIC 1, 132, 1, 11 (39.84.32.11). This CMIC has taken over its
duties.</EventMessage><Subsystem>ServerMgmt</Subsystem><TrackingID>EBAY18-
CMICFailover</TrackingID><P
1. Parse out monograms/bigrams preceding numbers from alerts.
2. Manually determine which monograms/bigrams are potential error codes.
Stop words and noise are omitted.
3. Create and test regular expression patterns that extract these error codes.
4. Use the regular expression patterns to extract error codes from alerts.
• Identified 26 summary alert error types and 33 MPP alert error types from
~30,000 incidents from ~2,000 customer sites from July 2017 to July 2019.
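Steps 1, 3, and 4 above might look roughly like the following with Python's `re` module. This is a sketch under stated assumptions, not the patterns actually used: the names `find_candidates` and `extract_error_codes` are hypothetical, only monograms (not bigrams) are handled, and the `Type: number` layout is inferred from the single `CMICCore: 11` example shown on the poster. Step 2, the manual review of candidates, is represented only by the hand-picked `error_types` argument.

```python
import re
from collections import Counter

# Message text taken from the example alert above (XML tags omitted).
ALERT = (
    "Apr 29 16:04:19 cmic CMICCore[9327]: DEGRADED: CMICCore: 11 "
    "#Failed over for CMIC 1, 132, 1, 11 (39.84.32.11). "
    "This CMIC has taken over its duties."
)

# Step 1 (assumed form): a word followed, after at most two
# non-word characters, by a number.
CANDIDATE = re.compile(r"\b([A-Za-z][A-Za-z_]*)\W{0,2}(\d+)\b")


def find_candidates(alert):
    """Count the words that immediately precede a number in an alert."""
    return Counter(word for word, _ in CANDIDATE.findall(alert))


def extract_error_codes(alert, error_types):
    """Steps 3-4: extract (error type, error number) pairs.

    `error_types` is the manually curated list from step 2. The
    "Type: number" layout is an assumption based on "CMICCore: 11".
    """
    alternation = "|".join(re.escape(t) for t in error_types)
    pattern = re.compile(rf"\b({alternation}):\s*(\d+)\b")
    return pattern.findall(alert)


candidates = find_candidates(ALERT)
codes = extract_error_codes(ALERT, ["CMICCore"])
```

On the example alert, the candidate pass surfaces `Apr`, `CMICCore`, and `CMIC`, which shows why the manual review in step 2 is needed: `Apr` is date noise, and `CMICCore[9327]` pairs the error type with what looks like a process ID rather than an error number. The curated pattern then returns only `("CMICCore", "11")`.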
Teradata's Global Support Organization (GSO)
strives to improve customer experience by quickly
and effectively resolving customers' technical issues
(i.e. incidents).
SAM is short for Service Analytics Machine, GSO's
initiative to automate certain aspects of customer
support using AI and machine learning. SAM
analyzes telemetry and currently serves as a
recommendation engine that recommends solutions
to known customer issues.
SAM’s Impact on Time to Resolution of Known Issues
An incident occurs when a customer runs into
something wrong or unexpected with one of
Teradata's products.
For example, a customer may not be able to use a
product because the product crashes, is
unresponsive, or is slow.
AIC (Automatic Incident Creation) proactively
monitors telemetry and automatically creates an
incident when the AIC system detects a significant
cluster of anomalies/errors.
Alerts contain useful telemetry for SAM to analyze,
such as:
• Customer ID
• Timestamps
• Type of incident/alert
• Alert description
• Version numbers
• Backtraces
• Error codes
Feature engineering: Identify, extract, and profile
informative error codes for SAM to use as features
to learn on, hence improving SAM’s predictive
power.
• Identified, extracted, and profiled informative
error codes for SAM to use as features to
learn on, leading to improved predictive
power.
• Increased insight around error codes.
• Potentially increase the number of incidents
fingerprinted if error codes appear in incidents
not covered by SAM’s existing models.
Teradata can leverage analytics and machine
learning to identify and extract informative
features, e.g. error codes, from noisy
telemetry.
• Incorporate error code features into SAM.
• Communicate findings with the broader
product support community to share insight
around error codes.
• Fully automate the process of identifying and
extracting informative error codes from
telemetry.
• Python (pandas, scikit-learn, NumPy, SciPy,
matplotlib, seaborn, pytest, re)
• Teradata Vantage
• SQL
• Jupyter Notebooks
• Git/GitHub
• Jenkins
• JIRA
• Getting accustomed to the software-heavy
aspect of building and maintaining a
machine learning model in production.
• Deriving value from messy, cryptic text data.
• This internship confirmed my passion for
pursuing data science as a career.
• Data science is a team effort; real data
science problems are complex and require
the collaboration of a cross-functional team.
• Manager: Brandon Quach
• Mentor: Chris Smith
• Director: Jenny Wang
• Product Owner: Brian Hutchins
• SAM data scientists: Brandon Quach, Chris
Smith, Jiacong Li, Andrew Washington
• The entire SAM team
[Figure: alert pipeline. Telemetry -> MPP Alert Rules (Is the telemetry anomalous?) -> MPP Alerts -> PSR Rules (Is a group of MPP alerts problematic?) -> Summary Alerts -> AIC Admin Rules (Is a summary alert serious enough to warrant an incident?) -> AIC Incident -> SAM -> Recommended Solution]
MPP alerts bundle into a summary alert. Summary alerts compose an incident.
Error Code = Error Type + Error Number, e.g., CMICCore: 11 (error type CMICCore, error number 11)
August 19 2019
Example of an informative error type with uninformative error numbers (the error
numbers tend to be unique, suggesting the error numbers are arbitrary process IDs):
Example of an informative error type with uninformative error numbers (the error
number entropies are scarcely lower than the error type entropy):
Examples of an informative error type with informative error numbers:
Noisy Telemetry -> Analytics-Driven Feature Extraction -> Informative Features -> Predictive Power

Intern Poster Presentation
