6. @integrateai | @humekathryn
Jeffrey Friedman & Richard Zeckhauser
President Obama reportedly complained that the discussion
offered ‘not more certainty but more confusion,’ and that his
advisors were offering ‘probabilities that disguised uncertainty as
opposed to actually providing you with useful information.’
10. “Over the past decades computers have broadly
automated tasks that programmers could describe with
clear rules and algorithms. Modern machine learning
techniques now allow us to do the same for tasks where
describing the precise rules is much harder.”
Bezos 2017 Letter to Shareholders
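The contrast in the quote can be made concrete: a rule-based classifier has its decision boundary written by hand, while a learned one infers it from labeled examples. A minimal sketch in Python (the spam example, data, and function names are all hypothetical):

```python
# Hand-coded rule: a programmer writes the threshold explicitly.
def rule_based_is_spam(num_links: int) -> bool:
    return num_links > 5  # threshold chosen by a human

# Learned rule: the threshold is inferred from labeled examples.
def learn_threshold(examples):
    """Pick the integer threshold that minimizes training error."""
    best_t, best_errors = 0, len(examples)
    for t in range(0, max(x for x, _ in examples) + 1):
        errors = sum((x > t) != label for x, label in examples)
        if errors < best_errors:
            best_t, best_errors = t, errors
    return best_t

# (num_links, is_spam) pairs -- synthetic training data
data = [(1, False), (2, False), (3, False), (8, True), (9, True), (12, True)]
threshold = learn_threshold(data)  # the "rule" now comes from the data
```

The same shape scales up: when the precise rules are too hard to write down, the model extracts them from examples instead.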
19. The Raw Ingredients of an AI Product
• Deep understanding of a business problem
• Data, data, data
• Algorithmic capability
• Integrations and interfaces
20. Machine Learning Product Lifecycle
• Design: What problem are we solving?
• Data Generation: How do tools capture information?
• Data Collection: Who is well represented? Where are the blind spots?
• Data Exploration: What does the data look like?
• Data Processing: Is our data ready for use?
• Model Prototyping: Will this work? Should we pivot? Which is best?
• Evaluation/Interpretation: How do we know it’s working?
• Production: Can we scale the model?
• Maintenance: How to update as data changes?
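One way to make the lifecycle concrete is a small data structure that pairs each stage with its guiding question; the stage names and questions come from the slide, while the code around them is only an illustrative sketch:

```python
# Lifecycle stages paired with the guiding question from the slide.
LIFECYCLE = [
    ("Design", "What problem are we solving?"),
    ("Data Generation", "How do tools capture information?"),
    ("Data Collection", "Who is well represented? Where are the blind spots?"),
    ("Data Exploration", "What does the data look like?"),
    ("Data Processing", "Is our data ready for use?"),
    ("Model Prototyping", "Will this work? Should we pivot? Which is best?"),
    ("Evaluation/Interpretation", "How do we know it's working?"),
    ("Production", "Can we scale the model?"),
    ("Maintenance", "How to update as data changes?"),
]

def next_stage(current: str) -> str:
    """Return the stage after `current`; the cycle wraps back to Design."""
    names = [name for name, _ in LIFECYCLE]
    return names[(names.index(current) + 1) % len(names)]
```

The wrap-around in `next_stage` reflects that maintenance feeds back into design: data drift eventually reopens the question of what problem is being solved.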
21. Case Study #1
• Partially automate checklist completion
• 20,000 examples of completed checklists
• First party keeps the data & model
23. Design
What level of certainty does the problem demand?
24. Design
Do you have a motivated subject matter expert?
25. Model Type | Amount of Data | Structure of Data | Subject Matter Expertise | Explainability
Regression | Small ok | Less complex features | Selecting the features | Medium-High
Bayesian | Small ok | Hierarchical features | Defining the priors | Medium-High
Deep Learning | Large only | Hierarchical features | Little (labeling) | Low
Reinforcement Learning | Explore/Exploit | Depends (Deep vs. Classical) | Simulate the environment | Depends (Deep vs. Classical)
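The trade-offs in this table can be encoded as a rough decision heuristic. This is only a sketch of the table's logic, not a recommendation engine; the data-size cutoff and function name are invented for illustration:

```python
def suggest_model_family(n_examples: int, hierarchical_features: bool,
                         sme_available: bool) -> str:
    """Rough heuristic mirroring the model-type trade-off table."""
    if n_examples >= 100_000 and hierarchical_features:
        # Deep learning needs large data but little SME input beyond labeling.
        return "deep learning"
    if hierarchical_features and sme_available:
        # Bayesian models handle small data if an expert can define the priors.
        return "bayesian"
    if sme_available:
        # Regression handles small data if an expert selects the features.
        return "regression"
    return "collect more data or find an expert"
```

In practice the decision also weighs explainability requirements, which the table captures but this sketch omits.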
29. Lessons learned
• Key risk is wasted cost & time
• Seek
  • Clearly defined input-output with a large labeled training set
  • A motivated SME to collaborate with the technical team
• Avoid
  • Tasks that require certainty
  • Tasks with little output variety
• Beware
  • Hype leading to misunderstanding of what’s possible
• Tactics
  • Scope the PoC tightly
  • Try to disprove the hypothesis early to minimize sunk cost
30. Case Study #2
• Increase conversion rates
• Design infrastructure to measure outcomes
• Third party gets the data & owns the model
32. “There is often a difference between the quantity you want to measure and the one you can measure. When these differ, your model will become good at predicting the quantity you measured, not the quantity for which it was meant to be a proxy.”
https://medium.com/@yonatanzunger/asking-the-right-questions-about-ai-7ed2d9820c48
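The proxy problem in the quote can be demonstrated numerically: a model optimized against a proxy label can score perfectly on the proxy while being mediocre on the quantity it was meant to stand in for. A synthetic sketch (all data here is invented; the 70% agreement rate is an arbitrary assumption):

```python
import random

random.seed(0)

# The true quantity we care about vs. a noisy proxy we can actually measure.
# Here the proxy agrees with the truth only 70% of the time.
truth = [random.random() < 0.5 for _ in range(1000)]
proxy = [t if random.random() < 0.7 else not t for t in truth]

# A "model" that has learned to reproduce the proxy perfectly.
predictions = list(proxy)

proxy_accuracy = sum(p == q for p, q in zip(predictions, proxy)) / len(proxy)
true_accuracy = sum(p == t for p, t in zip(predictions, truth)) / len(truth)
# proxy_accuracy is 1.0, while true_accuracy sits near 0.7:
# the model is only as good as the proxy's relationship to the truth.
```

This is exactly the COMPAS failure mode discussed next: optimizing "who gets convicted" as a proxy for "who will commit a serious crime" bakes the gap between the two into the model.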
33. COMPAS risk assessment tool
• What we think we’re asking: “what is the
probability that this person will commit a
serious crime in the future, as a function of the
sentence you give them now?”
• What we’re actually asking: “who is more likely
to be convicted?”
Design
What problem are we solving?
38. Lessons learned
• Risks around sunk cost, data privacy & security, and ethics
• Seek
  • Problems where a good guess can provide a measurable boost
  • Collaborative and flexible partners
• Beware
  • Understand what your proxies actually optimize for
  • Be prepared to problem-solve with imbalanced data sets
• Be pragmatic but principled
  • Explainability is not an absolute; it’s context-dependent
  • Go the extra mile to evaluate fairness
• Tactics
  • Performance-based fee structure to align incentives
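Problem solving with imbalanced data sets often starts with the metric: on a heavily skewed class split, plain accuracy rewards a model that never predicts the rare class. A small illustration with synthetic labels (the 99/1 split is an invented example):

```python
# Severely imbalanced labels: 990 negatives, 10 positives.
labels = [0] * 990 + [1] * 10

# A useless model that always predicts the majority class.
predictions = [0] * 1000

accuracy = sum(p == y for p, y in zip(predictions, labels)) / len(labels)

# Balanced accuracy averages per-class recall, exposing the failure.
recall_pos = sum(p == 1 for p, y in zip(predictions, labels) if y == 1) / 10
recall_neg = sum(p == 0 for p, y in zip(predictions, labels) if y == 0) / 990
balanced_accuracy = (recall_pos + recall_neg) / 2
# accuracy is 0.99, but balanced_accuracy is 0.5 -- no better than chance.
```

Choosing a metric that reflects the minority class is usually the first step before reaching for resampling or class weighting.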