SlideShare a Scribd company logo
1 of 23
Download to read offline
A Fireside Chat moderated by
David Loshin
Affiliate Research Director, TDWI
President, Knowledge Integrity, Inc.
Senior Lecturer, University of Maryland
November 2, 2023
Supercharging AI with Data Enrichment
SPONSOR
2
President, Knowledge Integrity, Inc.
Senior Lecturer and Lead for External Relations,
University of Maryland
What we will talk
about today
• Setting the stage:
• Generative AI as a business imperative
• Data imperatives for AI model quality
• Discussion: The pivotal role of data
enrichment in training and fine-tuning
Generative AI models
Emergence of Generative AI
What is Generative AI?
• A subset of artificial intelligence that includes systems designed
to generate outputs such as images, music, text, or other forms of
media, based on its training data
• Learns from existing and generate new data that is consistent
with the original data set
• Generative AI systems that have been trained on billions of
parameters use prediction to create new instances of data in
response to provided prompts
• Large Language Models (LLMs) are a type of Generative AI that
have been trained on massive amounts of content
Ensuring Trustworthy & Appropriate Results
Data
volumes
Data
quality
Data
access
• Issues include:
– Bias
– Privacy
– Ethical concerns
– Legal concerns
– Hallucinations
Data Enrichment & LLM Training
• Improving the utility of data through appending and integration of
relevant content from additional sources
• Enrichment is used for
– Refining contextual nuances
– Improving fidelity of prompt responses
– Improve pattern recognition to reduce probability of hallucinations
– Improve interpretability of results
The leader in data integrity
Our software, data enrichment products and
strategic services deliver accuracy, consistency, and
context in your data, powering confident decisions.
of the Fortune 100
99
countries
100 2,500
employees
customers
12,000
Brands you trust, trust us
Data leaders partner with us
10
AI initiatives succeed with trusted data
of leading
businesses have
ongoing investments
in artificial
intelligence
91%
From Noise to Brilliance: Supercharge AI with Data Enrichment
Algorithms
Data
Modeling
Large
Language
Models
Deep
Learning
Hyperparameter
Tuning
Training
Data
Retrieval
Augmented
Generation
Supervised
Learning
Natural
Language
Processing
Bias
and
Fairness
Artificial
Intelligence
Feature
Engineering
Neural
Networks
Chatbots
Machine
Learning
Data
Mining
11
Source: NewVantage
For trusted data,
you need data integrity
Data integrity is data with maximum
accuracy, consistency, and context for
confident business decision-making
Data
Integrity
From Noise to Brilliance: Supercharge AI with Data Enrichment
12
What is data
enrichment, exactly?
13
It’s the process of enhancing your data by
appending relevant context from additional
sources – improving its overall value,
accuracy, and usability.
From Noise to Brilliance: Supercharge AI with Data Enrichment
Trusted third-party data at a global scale
Addresses &
Property
Verified and validated address and
property data for map display and
analytics
Boundaries
Administrative, community, and
industry-specific boundaries for data
enrichment and territory analysis
Demographics
Demographic and consumer context
data for better understanding people
and behavior
Points of
Interest
Detailed business, leisure, and
geographic features for location
and competitive intelligence
Streets
Robust street-level data for mapping,
analysis, routing, and geocoding
Risk
Natural hazard boundaries related to
flood, fire, earthquakes, and weather
14
Expertly curated datasets containing thousands of attributes for faster, confident decisions
From Noise to Brilliance: Supercharge AI with Data Enrichment
15 From Noise to Brilliance: Supercharge AI with Data Enrichment
Purchases &
Shopping
Building & Parcel
Boundaries
Lifestyles
PreciselyID
School Rankings
Points of Interest
Addresses Population
Property Attributes
Weather
Natural & Manmade
Hazards
Travel Time
Administrative
Boundaries
Land & Property Consumer Environment
Data enrichment can be easy with the right tools
A unique identifier for every address that doesn’t change, and other methods for appending data
Addressing AI limitations with enrichment
Inaccurate training data
leads to poor model
accuracy and
performance, yielding
low-quality results
Clean data reduces the
need for extensive data
prep, simplifying the
overall AI pipeline and
improving efficiency
High-integrity data
reduces the time and
computational resources
required for model
development
Practitioners can rely on
consistent data to
extract meaningful
features that contribute
to model performance
Transparent, accurate
data aids in the
understanding of model
decisions, builds trust,
and identifies biases
Data with integrity
avoids introducing noise
that contributes to
overfitting, resulting in
more robust models
Models trained on high-
integrity data are easier
to maintain, as changes
are less likely to cause
unexpected issues
Easier model
maintenance
Reduced
Preprocessin
g Overhead
Effective
Feature
Engineering
Enhanced
Model
Interpretability
Reduced
Overfitting
Faster model
training
Model
Accuracy and
Performance
When AI models are built
on reliable data, they are
more likely to perform
consistently and
dependably
Reliable
Model
Deployment
17 From Noise to Brilliance: Supercharge AI with Data Enrichment
• Financial crimes
and compliance
• Customer insight
• Branch location analytics
• Fraud analytics
• Risk analysis
• Customer insight
• Fraud analytics
• Pricing
• Network and coverage
planning
• Customer insight
• Location-based
marketing & advertising
• Asset management
FINANCIAL SERVICES INSURANCE TELECOMMUNICATIONS
• Customer insight
• Retail location analysis
• Location-based
marketing & advertising
• Home search
• Appraisal analysis
• Valuation modeling
RETAIL
• Service optimization
and delivery
• Planning
• Compliance and safety
• Emergency response
and management
• Economic development
• Site selection
• Market analysis
• Lifestyle modeling
GOVERNMENT REAL ESTATE
• Customer insight
• Checkout analytics
• Logistics and delivery
• Location-based
marketing & advertising
eCOMMERCE
Solve complex, real-world challenges
Key takeaways
Appending relevant context from
additional sources
What is data enrichment?
Accuracy, performance, and utility
across various applications
How does it improve your AI?
Improves business outcomes, saves
money, and user trust
How does it benefit you?
Fireside chat
19
Copyright © 2023 TDWI
QUESTIONS?
CONTACT INFORMATION
If you have further questions or comments:
David Loshin, Knowledge Integrity, Inc.
loshin@knowledge-integrity.com
Antonio Cotroneo, Precisely
antonio.cotroneo@precisely.com
Thanks to Our Sponsor
2
THANK YOU!
Copyright TDWI

More Related Content

Similar to Supercharging AI with Data Enrichment

Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...
Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...
Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...Precisely
 
Towards the Industrialization of AI
Towards the Industrialization of AITowards the Industrialization of AI
Towards the Industrialization of AIHui Lei
 
Journey to a Modern Data Architecture
Journey to a Modern Data ArchitectureJourney to a Modern Data Architecture
Journey to a Modern Data ArchitecturePrecisely
 
Mastering Data Governance in Modern Era for Holistic Business Success
Mastering Data Governance in Modern Era for Holistic Business SuccessMastering Data Governance in Modern Era for Holistic Business Success
Mastering Data Governance in Modern Era for Holistic Business SuccessPrecisely
 
Overview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and FoundationsOverview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and FoundationsNUS-ISS
 
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...Enterprise Knowledge
 
Cloud and business agility
Cloud and business agilityCloud and business agility
Cloud and business agilityMike ORourke
 
Sage People: Secure Employee Data in a Cloud-based (HCM) system
Sage People: Secure Employee Data in a Cloud-based (HCM) systemSage People: Secure Employee Data in a Cloud-based (HCM) system
Sage People: Secure Employee Data in a Cloud-based (HCM) systemNet at Work
 
Data quality + data governance: the formula for bigger, better decisions
Data quality + data governance: the formula for bigger, better decisionsData quality + data governance: the formula for bigger, better decisions
Data quality + data governance: the formula for bigger, better decisionsPrecisely
 
Big Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptxBig Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptxPrabhaJoshi4
 
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big HaystackBig Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big HaystackPrecisely
 
Data Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdfData Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdfHendri Karisma
 
Operationalizing a Vision for the Monetization of Telco Consumer Data
Operationalizing a Vision for the Monetization of Telco Consumer DataOperationalizing a Vision for the Monetization of Telco Consumer Data
Operationalizing a Vision for the Monetization of Telco Consumer DataPrecisely
 
How to classify documents automatically using NLP
How to classify documents automatically using NLPHow to classify documents automatically using NLP
How to classify documents automatically using NLPSkyl.ai
 
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, LucidworksBio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, LucidworksLucidworks
 

Similar to Supercharging AI with Data Enrichment (20)

Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...
Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...
Decision Confidence: Using Modern Approaches to Data Quality to Improve Trust...
 
Towards the Industrialization of AI
Towards the Industrialization of AITowards the Industrialization of AI
Towards the Industrialization of AI
 
Journey to a Modern Data Architecture
Journey to a Modern Data ArchitectureJourney to a Modern Data Architecture
Journey to a Modern Data Architecture
 
Mastering Data Governance in Modern Era for Holistic Business Success
Mastering Data Governance in Modern Era for Holistic Business SuccessMastering Data Governance in Modern Era for Holistic Business Success
Mastering Data Governance in Modern Era for Holistic Business Success
 
Overview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and FoundationsOverview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and Foundations
 
braincavesoft-com-big-data-analytics.pdf
braincavesoft-com-big-data-analytics.pdfbraincavesoft-com-big-data-analytics.pdf
braincavesoft-com-big-data-analytics.pdf
 
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...
 
braincavesoft-com-data-analytics (1).pdf
braincavesoft-com-data-analytics (1).pdfbraincavesoft-com-data-analytics (1).pdf
braincavesoft-com-data-analytics (1).pdf
 
braincavesoft-com-data-analytics.pdf
braincavesoft-com-data-analytics.pdfbraincavesoft-com-data-analytics.pdf
braincavesoft-com-data-analytics.pdf
 
Just ask Watson Seminar
Just ask Watson SeminarJust ask Watson Seminar
Just ask Watson Seminar
 
Cloud and business agility
Cloud and business agilityCloud and business agility
Cloud and business agility
 
Sage People: Secure Employee Data in a Cloud-based (HCM) system
Sage People: Secure Employee Data in a Cloud-based (HCM) systemSage People: Secure Employee Data in a Cloud-based (HCM) system
Sage People: Secure Employee Data in a Cloud-based (HCM) system
 
Data quality + data governance: the formula for bigger, better decisions
Data quality + data governance: the formula for bigger, better decisionsData quality + data governance: the formula for bigger, better decisions
Data quality + data governance: the formula for bigger, better decisions
 
Big Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptxBig Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptx
 
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big HaystackBig Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
 
Data Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdfData Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdf
 
Operationalizing a Vision for the Monetization of Telco Consumer Data
Operationalizing a Vision for the Monetization of Telco Consumer DataOperationalizing a Vision for the Monetization of Telco Consumer Data
Operationalizing a Vision for the Monetization of Telco Consumer Data
 
Brainstorm:KC 2016
Brainstorm:KC 2016Brainstorm:KC 2016
Brainstorm:KC 2016
 
How to classify documents automatically using NLP
How to classify documents automatically using NLPHow to classify documents automatically using NLP
How to classify documents automatically using NLP
 
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, LucidworksBio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
 

More from Precisely

Zukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter MassendatenZukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter MassendatenPrecisely
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10Precisely
 
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...Precisely
 
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...Precisely
 
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3fTestjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3fPrecisely
 
Data Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity TrendsData Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity TrendsPrecisely
 
Optimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAPOptimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAPPrecisely
 
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige InvestitionenSAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige InvestitionenPrecisely
 
Automatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIsAutomatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIsPrecisely
 
Moving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and PreciselyMoving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and PreciselyPrecisely
 
Effective Security Monitoring for IBM i: What You Need to Know
Effective Security Monitoring for IBM i: What You Need to KnowEffective Security Monitoring for IBM i: What You Need to Know
Effective Security Monitoring for IBM i: What You Need to KnowPrecisely
 
Automate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center ExcellenceAutomate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center ExcellencePrecisely
 
5 Keys to Improved IT Operation Management
5 Keys to Improved IT Operation Management5 Keys to Improved IT Operation Management
5 Keys to Improved IT Operation ManagementPrecisely
 
Unlock Efficiency With Your Address Data Today For a Smarter Tomorrow
Unlock Efficiency With Your Address Data Today For a Smarter TomorrowUnlock Efficiency With Your Address Data Today For a Smarter Tomorrow
Unlock Efficiency With Your Address Data Today For a Smarter TomorrowPrecisely
 
Navigating Cloud Trends in 2024 Webinar Deck
Navigating Cloud Trends in 2024 Webinar DeckNavigating Cloud Trends in 2024 Webinar Deck
Navigating Cloud Trends in 2024 Webinar DeckPrecisely
 
Mainframe Sort Operations: Gaining the Insights You Need for Peak Performance
Mainframe Sort Operations: Gaining the Insights You Need for Peak PerformanceMainframe Sort Operations: Gaining the Insights You Need for Peak Performance
Mainframe Sort Operations: Gaining the Insights You Need for Peak PerformancePrecisely
 
Preventing Downtime with Better IT Operations Management
Preventing Downtime with Better IT Operations ManagementPreventing Downtime with Better IT Operations Management
Preventing Downtime with Better IT Operations ManagementPrecisely
 
Migrating IBM i Systems to the Cloud: Exploring the Pros and Cons
Migrating IBM i Systems to the Cloud: Exploring the Pros and ConsMigrating IBM i Systems to the Cloud: Exploring the Pros and Cons
Migrating IBM i Systems to the Cloud: Exploring the Pros and ConsPrecisely
 

More from Precisely (20)

Zukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter MassendatenZukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter Massendaten
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10
 
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
 
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
 
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3fTestjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
 
Data Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity TrendsData Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity Trends
 
Optimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAPOptimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAP
 
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige InvestitionenSAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
 
Automatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIsAutomatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIs
 
Moving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and PreciselyMoving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and Precisely
 
Effective Security Monitoring for IBM i: What You Need to Know
Effective Security Monitoring for IBM i: What You Need to KnowEffective Security Monitoring for IBM i: What You Need to Know
Effective Security Monitoring for IBM i: What You Need to Know
 
Automate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center ExcellenceAutomate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center Excellence
 
5 Keys to Improved IT Operation Management
5 Keys to Improved IT Operation Management5 Keys to Improved IT Operation Management
5 Keys to Improved IT Operation Management
 
Unlock Efficiency With Your Address Data Today For a Smarter Tomorrow
Unlock Efficiency With Your Address Data Today For a Smarter TomorrowUnlock Efficiency With Your Address Data Today For a Smarter Tomorrow
Unlock Efficiency With Your Address Data Today For a Smarter Tomorrow
 
Navigating Cloud Trends in 2024 Webinar Deck
Navigating Cloud Trends in 2024 Webinar DeckNavigating Cloud Trends in 2024 Webinar Deck
Navigating Cloud Trends in 2024 Webinar Deck
 
Mainframe Sort Operations: Gaining the Insights You Need for Peak Performance
Mainframe Sort Operations: Gaining the Insights You Need for Peak PerformanceMainframe Sort Operations: Gaining the Insights You Need for Peak Performance
Mainframe Sort Operations: Gaining the Insights You Need for Peak Performance
 
Preventing Downtime with Better IT Operations Management
Preventing Downtime with Better IT Operations ManagementPreventing Downtime with Better IT Operations Management
Preventing Downtime with Better IT Operations Management
 
Migrating IBM i Systems to the Cloud: Exploring the Pros and Cons
Migrating IBM i Systems to the Cloud: Exploring the Pros and ConsMigrating IBM i Systems to the Cloud: Exploring the Pros and Cons
Migrating IBM i Systems to the Cloud: Exploring the Pros and Cons
 

Recently uploaded

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 

Recently uploaded (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 

Supercharging AI with Data Enrichment

  • 1. A Fireside Chat moderated by David Loshin Affiliate Research Director, TDWI President, Knowledge Integrity, Inc. Senior Lecturer, University of Maryland November 2, 2023 Supercharging AI with Data Enrichment
  • 3. President, Knowledge Integrity, Inc. Senior Lecturer and Lead for External Relations, University of Maryland
  • 4. What we will talk about today • Setting the stage: • Generative AI as a business imperative • Data imperatives for AI model quality • Discussion: The pivotal role of data enrichment in training and fine-tuning Generative AI models
  • 6. What is Generative AI? • A subset of artificial intelligence that includes systems designed to generate outputs such as images, music, text, or other forms of media, based on its training data • Learns from existing and generate new data that is consistent with the original data set • Generative AI systems that have been trained on billions of parameters use prediction to create new instances of data in response to provided prompts • Large Language Models (LLMs) are a type of Generative AI that have been trained on massive amounts of content
  • 7. Ensuring Trustworthy & Appropriate Results Data volumes Data quality Data access • Issues include: – Bias – Privacy – Ethical concerns – Legal concerns – Hallucinations
  • 8. Data Enrichment & LLM Training • Improving the utility of data through appending and integration of relevant content from additional sources • Enrichment is used for – Refining contextual nuances – Improving fidelity of prompt responses – Improve pattern recognition to reduce probability of hallucinations – Improve interpretability of results
  • 9.
  • 10. The leader in data integrity Our software, data enrichment products and strategic services deliver accuracy, consistency, and context in your data, powering confident decisions. of the Fortune 100 99 countries 100 2,500 employees customers 12,000 Brands you trust, trust us Data leaders partner with us 10
  • 11. AI initiatives succeed with trusted data of leading businesses have ongoing investments in artificial intelligence 91% From Noise to Brilliance: Supercharge AI with Data Enrichment Algorithms Data Modeling Large Language Models Deep Learning Hyperparameter Tuning Training Data Retrieval Augmented Generation Supervised Learning Natural Language Processing Bias and Fairness Artificial Intelligence Feature Engineering Neural Networks Chatbots Machine Learning Data Mining 11 Source: NewVantage
  • 12. For trusted data, you need data integrity Data integrity is data with maximum accuracy, consistency, and context for confident business decision-making Data Integrity From Noise to Brilliance: Supercharge AI with Data Enrichment 12
  • 13. What is data enrichment, exactly? 13 It’s the process of enhancing your data by appending relevant context from additional sources – improving its overall value, accuracy, and usability. From Noise to Brilliance: Supercharge AI with Data Enrichment
  • 14. Trusted third-party data at a global scale Addresses & Property Verified and validated address and property data for map display and analytics Boundaries Administrative, community, and industry-specific boundaries for data enrichment and territory analysis Demographics Demographic and consumer context data for better understanding people and behavior Points of Interest Detailed business, leisure, and geographic features for location and competitive intelligence Streets Robust street-level data for mapping, analysis, routing, and geocoding Risk Natural hazard boundaries related to flood, fire, earthquakes, and weather 14 Expertly curated datasets containing thousands of attributes for faster, confident decisions From Noise to Brilliance: Supercharge AI with Data Enrichment
  • 15. 15 From Noise to Brilliance: Supercharge AI with Data Enrichment Purchases & Shopping Building & Parcel Boundaries Lifestyles PreciselyID School Rankings Points of Interest Addresses Population Property Attributes Weather Natural & Manmade Hazards Travel Time Administrative Boundaries Land & Property Consumer Environment Data enrichment can be easy with the right tools A unique identifier for every address that doesn’t change, and other methods for appending data
  • 16. Addressing AI limitations with enrichment Inaccurate training data leads to poor model accuracy and performance, yielding low-quality results Clean data reduces the need for extensive data prep, simplifying the overall AI pipeline and improving efficiency High-integrity data reduces the time and computational resources required for model development Practitioners can rely on consistent data to extract meaningful features that contribute to model performance Transparent, accurate data aids in the understanding of model decisions, builds trust, and identifies biases Data with integrity avoids introducing noise that contributes to overfitting, resulting in more robust models Models trained on high- integrity data are easier to maintain, as changes are less likely to cause unexpected issues Easier model maintenance Reduced Preprocessin g Overhead Effective Feature Engineering Enhanced Model Interpretability Reduced Overfitting Faster model training Model Accuracy and Performance When AI models are built on reliable data, they are more likely to perform consistently and dependably Reliable Model Deployment
  • 17. 17 From Noise to Brilliance: Supercharge AI with Data Enrichment • Financial crimes and compliance • Customer insight • Branch location analytics • Fraud analytics • Risk analysis • Customer insight • Fraud analytics • Pricing • Network and coverage planning • Customer insight • Location-based marketing & advertising • Asset management FINANCIAL SERVICES INSURANCE TELECOMMUNICATIONS • Customer insight • Retail location analysis • Location-based marketing & advertising • Home search • Appraisal analysis • Valuation modeling RETAIL • Service optimization and delivery • Planning • Compliance and safety • Emergency response and management • Economic development • Site selection • Market analysis • Lifestyle modeling GOVERNMENT REAL ESTATE • Customer insight • Checkout analytics • Logistics and delivery • Location-based marketing & advertising eCOMMERCE Solve complex, real-world challenges
  • 18. Key takeaways Appending relevant context from additional sources What is data enrichment? Accuracy, performance, and utility across various applications How does it improve your AI? Improves business outcomes, saves money, and user trust How does it benefit you?
  • 21. CONTACT INFORMATION If you have further questions or comments: David Loshin, Knowledge Integrity, Inc. loshin@knowledge-integrity.com Antonio Cotroneo, Precisely antonio.cotroneo@precisely.com
  • 22. Thanks to Our Sponsor 2