SlideShare a Scribd company logo
A Fireside Chat moderated by
David Loshin
Affiliate Research Director, TDWI
President, Knowledge Integrity, Inc.
Senior Lecturer, University of Maryland
November 2, 2023
Supercharging AI with Data Enrichment
SPONSOR
2
President, Knowledge Integrity, Inc.
Senior Lecturer and Lead for External Relations,
University of Maryland
What we will talk
about today
• Setting the stage:
• Generative AI as a business imperative
• Data imperatives for AI model quality
• Discussion: The pivotal role of data
enrichment in training and fine-tuning
Generative AI models
Emergence of Generative AI
What is Generative AI?
• A subset of artificial intelligence that includes systems designed
to generate outputs such as images, music, text, or other forms of
media, based on its training data
• Learns from existing and generate new data that is consistent
with the original data set
• Generative AI systems that have been trained on billions of
parameters use prediction to create new instances of data in
response to provided prompts
• Large Language Models (LLMs) are a type of Generative AI that
have been trained on massive amounts of content
Ensuring Trustworthy & Appropriate Results
Data
volumes
Data
quality
Data
access
• Issues include:
– Bias
– Privacy
– Ethical concerns
– Legal concerns
– Hallucinations
Data Enrichment & LLM Training
• Improving the utility of data through appending and integration of
relevant content from additional sources
• Enrichment is used for
– Refining contextual nuances
– Improving fidelity of prompt responses
– Improve pattern recognition to reduce probability of hallucinations
– Improve interpretability of results
The leader in data integrity
Our software, data enrichment products and
strategic services deliver accuracy, consistency, and
context in your data, powering confident decisions.
of the Fortune 100
99
countries
100 2,500
employees
customers
12,000
Brands you trust, trust us
Data leaders partner with us
10
AI initiatives succeed with trusted data
of leading
businesses have
ongoing investments
in artificial
intelligence
91%
From Noise to Brilliance: Supercharge AI with Data Enrichment
Algorithms
Data
Modeling
Large
Language
Models
Deep
Learning
Hyperparameter
Tuning
Training
Data
Retrieval
Augmented
Generation
Supervised
Learning
Natural
Language
Processing
Bias
and
Fairness
Artificial
Intelligence
Feature
Engineering
Neural
Networks
Chatbots
Machine
Learning
Data
Mining
11
Source: NewVantage
For trusted data,
you need data integrity
Data integrity is data with maximum
accuracy, consistency, and context for
confident business decision-making
Data
Integrity
From Noise to Brilliance: Supercharge AI with Data Enrichment
12
What is data
enrichment, exactly?
13
It’s the process of enhancing your data by
appending relevant context from additional
sources – improving its overall value,
accuracy, and usability.
From Noise to Brilliance: Supercharge AI with Data Enrichment
Trusted third-party data at a global scale
Addresses &
Property
Verified and validated address and
property data for map display and
analytics
Boundaries
Administrative, community, and
industry-specific boundaries for data
enrichment and territory analysis
Demographics
Demographic and consumer context
data for better understanding people
and behavior
Points of
Interest
Detailed business, leisure, and
geographic features for location
and competitive intelligence
Streets
Robust street-level data for mapping,
analysis, routing, and geocoding
Risk
Natural hazard boundaries related to
flood, fire, earthquakes, and weather
14
Expertly curated datasets containing thousands of attributes for faster, confident decisions
From Noise to Brilliance: Supercharge AI with Data Enrichment
15 From Noise to Brilliance: Supercharge AI with Data Enrichment
Purchases &
Shopping
Building & Parcel
Boundaries
Lifestyles
PreciselyID
School Rankings
Points of Interest
Addresses Population
Property Attributes
Weather
Natural & Manmade
Hazards
Travel Time
Administrative
Boundaries
Land & Property Consumer Environment
Data enrichment can be easy with the right tools
A unique identifier for every address that doesn’t change, and other methods for appending data
Addressing AI limitations with enrichment
Inaccurate training data
leads to poor model
accuracy and
performance, yielding
low-quality results
Clean data reduces the
need for extensive data
prep, simplifying the
overall AI pipeline and
improving efficiency
High-integrity data
reduces the time and
computational resources
required for model
development
Practitioners can rely on
consistent data to
extract meaningful
features that contribute
to model performance
Transparent, accurate
data aids in the
understanding of model
decisions, builds trust,
and identifies biases
Data with integrity
avoids introducing noise
that contributes to
overfitting, resulting in
more robust models
Models trained on high-
integrity data are easier
to maintain, as changes
are less likely to cause
unexpected issues
Easier model
maintenance
Reduced
Preprocessin
g Overhead
Effective
Feature
Engineering
Enhanced
Model
Interpretability
Reduced
Overfitting
Faster model
training
Model
Accuracy and
Performance
When AI models are built
on reliable data, they are
more likely to perform
consistently and
dependably
Reliable
Model
Deployment
17 From Noise to Brilliance: Supercharge AI with Data Enrichment
• Financial crimes
and compliance
• Customer insight
• Branch location analytics
• Fraud analytics
• Risk analysis
• Customer insight
• Fraud analytics
• Pricing
• Network and coverage
planning
• Customer insight
• Location-based
marketing & advertising
• Asset management
FINANCIAL SERVICES INSURANCE TELECOMMUNICATIONS
• Customer insight
• Retail location analysis
• Location-based
marketing & advertising
• Home search
• Appraisal analysis
• Valuation modeling
RETAIL
• Service optimization
and delivery
• Planning
• Compliance and safety
• Emergency response
and management
• Economic development
• Site selection
• Market analysis
• Lifestyle modeling
GOVERNMENT REAL ESTATE
• Customer insight
• Checkout analytics
• Logistics and delivery
• Location-based
marketing & advertising
eCOMMERCE
Solve complex, real-world challenges
Key takeaways
Appending relevant context from
additional sources
What is data enrichment?
Accuracy, performance, and utility
across various applications
How does it improve your AI?
Improves business outcomes, saves
money, and user trust
How does it benefit you?
Fireside chat
19
Copyright © 2023 TDWI
QUESTIONS?
CONTACT INFORMATION
If you have further questions or comments:
David Loshin, Knowledge Integrity, Inc.
loshin@knowledge-integrity.com
Antonio Cotroneo, Precisely
antonio.cotroneo@precisely.com
Thanks to Our Sponsor
2
THANK YOU!
Copyright TDWI

More Related Content

Similar to Supercharging AI with Data Enrichment

Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...
Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...
Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...
Precisely
 
Towards the Industrialization of AI
Towards the Industrialization of AITowards the Industrialization of AI
Towards the Industrialization of AI
Hui Lei
 
Journey to a Modern Data Architecture
Journey to a Modern Data ArchitectureJourney to a Modern Data Architecture
Journey to a Modern Data Architecture
Precisely
 
Mastering Data Governance in Modern Era for Holistic Business Success
Mastering Data Governance in Modern Era for Holistic Business SuccessMastering Data Governance in Modern Era for Holistic Business Success
Mastering Data Governance in Modern Era for Holistic Business Success
Precisely
 
Overview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and FoundationsOverview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and Foundations
NUS-ISS
 
braincavesoft-com-big-data-analytics.pdf
braincavesoft-com-big-data-analytics.pdfbraincavesoft-com-big-data-analytics.pdf
braincavesoft-com-big-data-analytics.pdf
Braincave Software Private Limited
 
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...
Enterprise Knowledge
 
braincavesoft-com-data-analytics (1).pdf
braincavesoft-com-data-analytics (1).pdfbraincavesoft-com-data-analytics (1).pdf
braincavesoft-com-data-analytics (1).pdf
Braincave Software Private Limited
 
braincavesoft-com-data-analytics.pdf
braincavesoft-com-data-analytics.pdfbraincavesoft-com-data-analytics.pdf
braincavesoft-com-data-analytics.pdf
Braincave Software Private Limited
 
Just ask Watson Seminar
Just ask Watson SeminarJust ask Watson Seminar
Just ask Watson Seminar
Certus Solutions
 
Cloud and business agility
Cloud and business agilityCloud and business agility
Cloud and business agility
Mike ORourke
 
Sage People: Secure Employee Data in a Cloud-based (HCM) system
Sage People: Secure Employee Data in a Cloud-based (HCM) systemSage People: Secure Employee Data in a Cloud-based (HCM) system
Sage People: Secure Employee Data in a Cloud-based (HCM) system
Net at Work
 
Data quality + data governance: the formula for bigger, better decisions
Data quality + data governance: the formula for bigger, better decisionsData quality + data governance: the formula for bigger, better decisions
Data quality + data governance: the formula for bigger, better decisions
Precisely
 
Big Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptxBig Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptx
PrabhaJoshi4
 
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big HaystackBig Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Precisely
 
Data Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdfData Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdf
Hendri Karisma
 
Operationalizing a Vision for the Monetization of Telco Consumer Data
Operationalizing a Vision for the Monetization of Telco Consumer DataOperationalizing a Vision for the Monetization of Telco Consumer Data
Operationalizing a Vision for the Monetization of Telco Consumer Data
Precisely
 
Brainstorm:KC 2016
Brainstorm:KC 2016Brainstorm:KC 2016
Brainstorm:KC 2016
Scott Cameron
 
How to classify documents automatically using NLP
How to classify documents automatically using NLPHow to classify documents automatically using NLP
How to classify documents automatically using NLP
Skyl.ai
 
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, LucidworksBio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Lucidworks
 

Similar to Supercharging AI with Data Enrichment (20)

Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...
Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...
Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...
 
Towards the Industrialization of AI
Towards the Industrialization of AITowards the Industrialization of AI
Towards the Industrialization of AI
 
Journey to a Modern Data Architecture
Journey to a Modern Data ArchitectureJourney to a Modern Data Architecture
Journey to a Modern Data Architecture
 
Mastering Data Governance in Modern Era for Holistic Business Success
Mastering Data Governance in Modern Era for Holistic Business SuccessMastering Data Governance in Modern Era for Holistic Business Success
Mastering Data Governance in Modern Era for Holistic Business Success
 
Overview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and FoundationsOverview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and Foundations
 
braincavesoft-com-big-data-analytics.pdf
braincavesoft-com-big-data-analytics.pdfbraincavesoft-com-big-data-analytics.pdf
braincavesoft-com-big-data-analytics.pdf
 
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...
 
braincavesoft-com-data-analytics (1).pdf
braincavesoft-com-data-analytics (1).pdfbraincavesoft-com-data-analytics (1).pdf
braincavesoft-com-data-analytics (1).pdf
 
braincavesoft-com-data-analytics.pdf
braincavesoft-com-data-analytics.pdfbraincavesoft-com-data-analytics.pdf
braincavesoft-com-data-analytics.pdf
 
Just ask Watson Seminar
Just ask Watson SeminarJust ask Watson Seminar
Just ask Watson Seminar
 
Cloud and business agility
Cloud and business agilityCloud and business agility
Cloud and business agility
 
Sage People: Secure Employee Data in a Cloud-based (HCM) system
Sage People: Secure Employee Data in a Cloud-based (HCM) systemSage People: Secure Employee Data in a Cloud-based (HCM) system
Sage People: Secure Employee Data in a Cloud-based (HCM) system
 
Data quality + data governance: the formula for bigger, better decisions
Data quality + data governance: the formula for bigger, better decisionsData quality + data governance: the formula for bigger, better decisions
Data quality + data governance: the formula for bigger, better decisions
 
Big Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptxBig Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptx
 
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big HaystackBig Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
 
Data Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdfData Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdf
 
Operationalizing a Vision for the Monetization of Telco Consumer Data
Operationalizing a Vision for the Monetization of Telco Consumer DataOperationalizing a Vision for the Monetization of Telco Consumer Data
Operationalizing a Vision for the Monetization of Telco Consumer Data
 
Brainstorm:KC 2016
Brainstorm:KC 2016Brainstorm:KC 2016
Brainstorm:KC 2016
 
How to classify documents automatically using NLP
How to classify documents automatically using NLPHow to classify documents automatically using NLP
How to classify documents automatically using NLP
 
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, LucidworksBio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
 

More from Precisely

AI-Ready Data - The Key to Transforming Projects into Production.pptx
AI-Ready Data - The Key to Transforming Projects into Production.pptxAI-Ready Data - The Key to Transforming Projects into Production.pptx
AI-Ready Data - The Key to Transforming Projects into Production.pptx
Precisely
 
Building a Multi-Layered Defense for Your IBM i Security
Building a Multi-Layered Defense for Your IBM i SecurityBuilding a Multi-Layered Defense for Your IBM i Security
Building a Multi-Layered Defense for Your IBM i Security
Precisely
 
Optimierte Daten und Prozesse mit KI / ML + SAP Fiori.pdf
Optimierte Daten und Prozesse mit KI / ML + SAP Fiori.pdfOptimierte Daten und Prozesse mit KI / ML + SAP Fiori.pdf
Optimierte Daten und Prozesse mit KI / ML + SAP Fiori.pdf
Precisely
 
Chaining, Looping, and Long Text for Script Development and Automation.pdf
Chaining, Looping, and Long Text for Script Development and Automation.pdfChaining, Looping, and Long Text for Script Development and Automation.pdf
Chaining, Looping, and Long Text for Script Development and Automation.pdf
Precisely
 
Revolutionizing SAP® Processes with Automation and Artificial Intelligence
Revolutionizing SAP® Processes with Automation and Artificial IntelligenceRevolutionizing SAP® Processes with Automation and Artificial Intelligence
Revolutionizing SAP® Processes with Automation and Artificial Intelligence
Precisely
 
Navigating the Cloud: Best Practices for Successful Migration
Navigating the Cloud: Best Practices for Successful MigrationNavigating the Cloud: Best Practices for Successful Migration
Navigating the Cloud: Best Practices for Successful Migration
Precisely
 
Unlocking the Power of Your IBM i and Z Security Data with Google Chronicle
Unlocking the Power of Your IBM i and Z Security Data with Google ChronicleUnlocking the Power of Your IBM i and Z Security Data with Google Chronicle
Unlocking the Power of Your IBM i and Z Security Data with Google Chronicle
Precisely
 
How to Build Data Governance Programs That Last - A Business-First Approach.pdf
How to Build Data Governance Programs That Last - A Business-First Approach.pdfHow to Build Data Governance Programs That Last - A Business-First Approach.pdf
How to Build Data Governance Programs That Last - A Business-First Approach.pdf
Precisely
 
Zukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter MassendatenZukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter Massendaten
Precisely
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
Precisely
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Precisely
 
Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10
Precisely
 
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Precisely
 
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Precisely
 
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3fTestjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Precisely
 
Data Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity TrendsData Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity Trends
Precisely
 
Optimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAPOptimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAP
Precisely
 
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige InvestitionenSAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
Precisely
 
Automatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIsAutomatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIs
Precisely
 
Moving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and PreciselyMoving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and Precisely
Precisely
 

More from Precisely (20)

AI-Ready Data - The Key to Transforming Projects into Production.pptx
AI-Ready Data - The Key to Transforming Projects into Production.pptxAI-Ready Data - The Key to Transforming Projects into Production.pptx
AI-Ready Data - The Key to Transforming Projects into Production.pptx
 
Building a Multi-Layered Defense for Your IBM i Security
Building a Multi-Layered Defense for Your IBM i SecurityBuilding a Multi-Layered Defense for Your IBM i Security
Building a Multi-Layered Defense for Your IBM i Security
 
Optimierte Daten und Prozesse mit KI / ML + SAP Fiori.pdf
Optimierte Daten und Prozesse mit KI / ML + SAP Fiori.pdfOptimierte Daten und Prozesse mit KI / ML + SAP Fiori.pdf
Optimierte Daten und Prozesse mit KI / ML + SAP Fiori.pdf
 
Chaining, Looping, and Long Text for Script Development and Automation.pdf
Chaining, Looping, and Long Text for Script Development and Automation.pdfChaining, Looping, and Long Text for Script Development and Automation.pdf
Chaining, Looping, and Long Text for Script Development and Automation.pdf
 
Revolutionizing SAP® Processes with Automation and Artificial Intelligence
Revolutionizing SAP® Processes with Automation and Artificial IntelligenceRevolutionizing SAP® Processes with Automation and Artificial Intelligence
Revolutionizing SAP® Processes with Automation and Artificial Intelligence
 
Navigating the Cloud: Best Practices for Successful Migration
Navigating the Cloud: Best Practices for Successful MigrationNavigating the Cloud: Best Practices for Successful Migration
Navigating the Cloud: Best Practices for Successful Migration
 
Unlocking the Power of Your IBM i and Z Security Data with Google Chronicle
Unlocking the Power of Your IBM i and Z Security Data with Google ChronicleUnlocking the Power of Your IBM i and Z Security Data with Google Chronicle
Unlocking the Power of Your IBM i and Z Security Data with Google Chronicle
 
How to Build Data Governance Programs That Last - A Business-First Approach.pdf
How to Build Data Governance Programs That Last - A Business-First Approach.pdfHow to Build Data Governance Programs That Last - A Business-First Approach.pdf
How to Build Data Governance Programs That Last - A Business-First Approach.pdf
 
Zukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter MassendatenZukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter Massendaten
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10
 
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
 
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
 
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3fTestjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
 
Data Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity TrendsData Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity Trends
 
Optimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAPOptimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAP
 
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige InvestitionenSAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
 
Automatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIsAutomatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIs
 
Moving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and PreciselyMoving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and Precisely
 

Recently uploaded

Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
Vlad Stirbu
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
UiPathCommunity
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 

Recently uploaded (20)

Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 

Supercharging AI with Data Enrichment

  • 1. A Fireside Chat moderated by David Loshin Affiliate Research Director, TDWI President, Knowledge Integrity, Inc. Senior Lecturer, University of Maryland November 2, 2023 Supercharging AI with Data Enrichment
  • 3. President, Knowledge Integrity, Inc. Senior Lecturer and Lead for External Relations, University of Maryland
  • 4. What we will talk about today • Setting the stage: • Generative AI as a business imperative • Data imperatives for AI model quality • Discussion: The pivotal role of data enrichment in training and fine-tuning Generative AI models
  • 6. What is Generative AI? • A subset of artificial intelligence that includes systems designed to generate outputs such as images, music, text, or other forms of media, based on its training data • Learns from existing and generate new data that is consistent with the original data set • Generative AI systems that have been trained on billions of parameters use prediction to create new instances of data in response to provided prompts • Large Language Models (LLMs) are a type of Generative AI that have been trained on massive amounts of content
  • 7. Ensuring Trustworthy & Appropriate Results Data volumes Data quality Data access • Issues include: – Bias – Privacy – Ethical concerns – Legal concerns – Hallucinations
  • 8. Data Enrichment & LLM Training • Improving the utility of data through appending and integration of relevant content from additional sources • Enrichment is used for – Refining contextual nuances – Improving fidelity of prompt responses – Improve pattern recognition to reduce probability of hallucinations – Improve interpretability of results
  • 9.
  • 10. The leader in data integrity Our software, data enrichment products and strategic services deliver accuracy, consistency, and context in your data, powering confident decisions. of the Fortune 100 99 countries 100 2,500 employees customers 12,000 Brands you trust, trust us Data leaders partner with us 10
  • 11. AI initiatives succeed with trusted data of leading businesses have ongoing investments in artificial intelligence 91% From Noise to Brilliance: Supercharge AI with Data Enrichment Algorithms Data Modeling Large Language Models Deep Learning Hyperparameter Tuning Training Data Retrieval Augmented Generation Supervised Learning Natural Language Processing Bias and Fairness Artificial Intelligence Feature Engineering Neural Networks Chatbots Machine Learning Data Mining 11 Source: NewVantage
  • 12. For trusted data, you need data integrity Data integrity is data with maximum accuracy, consistency, and context for confident business decision-making Data Integrity From Noise to Brilliance: Supercharge AI with Data Enrichment 12
  • 13. What is data enrichment, exactly? 13 It’s the process of enhancing your data by appending relevant context from additional sources – improving its overall value, accuracy, and usability. From Noise to Brilliance: Supercharge AI with Data Enrichment
  • 14. Trusted third-party data at a global scale Addresses & Property Verified and validated address and property data for map display and analytics Boundaries Administrative, community, and industry-specific boundaries for data enrichment and territory analysis Demographics Demographic and consumer context data for better understanding people and behavior Points of Interest Detailed business, leisure, and geographic features for location and competitive intelligence Streets Robust street-level data for mapping, analysis, routing, and geocoding Risk Natural hazard boundaries related to flood, fire, earthquakes, and weather 14 Expertly curated datasets containing thousands of attributes for faster, confident decisions From Noise to Brilliance: Supercharge AI with Data Enrichment
  • 15. 15 From Noise to Brilliance: Supercharge AI with Data Enrichment Purchases & Shopping Building & Parcel Boundaries Lifestyles PreciselyID School Rankings Points of Interest Addresses Population Property Attributes Weather Natural & Manmade Hazards Travel Time Administrative Boundaries Land & Property Consumer Environment Data enrichment can be easy with the right tools A unique identifier for every address that doesn’t change, and other methods for appending data
  • 16. Addressing AI limitations with enrichment Inaccurate training data leads to poor model accuracy and performance, yielding low-quality results Clean data reduces the need for extensive data prep, simplifying the overall AI pipeline and improving efficiency High-integrity data reduces the time and computational resources required for model development Practitioners can rely on consistent data to extract meaningful features that contribute to model performance Transparent, accurate data aids in the understanding of model decisions, builds trust, and identifies biases Data with integrity avoids introducing noise that contributes to overfitting, resulting in more robust models Models trained on high- integrity data are easier to maintain, as changes are less likely to cause unexpected issues Easier model maintenance Reduced Preprocessin g Overhead Effective Feature Engineering Enhanced Model Interpretability Reduced Overfitting Faster model training Model Accuracy and Performance When AI models are built on reliable data, they are more likely to perform consistently and dependably Reliable Model Deployment
  • 17. 17 From Noise to Brilliance: Supercharge AI with Data Enrichment • Financial crimes and compliance • Customer insight • Branch location analytics • Fraud analytics • Risk analysis • Customer insight • Fraud analytics • Pricing • Network and coverage planning • Customer insight • Location-based marketing & advertising • Asset management FINANCIAL SERVICES INSURANCE TELECOMMUNICATIONS • Customer insight • Retail location analysis • Location-based marketing & advertising • Home search • Appraisal analysis • Valuation modeling RETAIL • Service optimization and delivery • Planning • Compliance and safety • Emergency response and management • Economic development • Site selection • Market analysis • Lifestyle modeling GOVERNMENT REAL ESTATE • Customer insight • Checkout analytics • Logistics and delivery • Location-based marketing & advertising eCOMMERCE Solve complex, real-world challenges
  • 18. Key takeaways Appending relevant context from additional sources What is data enrichment? Accuracy, performance, and utility across various applications How does it improve your AI? Improves business outcomes, saves money, and user trust How does it benefit you?
  • 21. CONTACT INFORMATION If you have further questions or comments: David Loshin, Knowledge Integrity, Inc. loshin@knowledge-integrity.com Antonio Cotroneo, Precisely antonio.cotroneo@precisely.com
  • 22. Thanks to Our Sponsor 2