Consume your data for AI
The role of modern Asset Catalogs
Jay Limburn
Distinguished Engineer and Director of Offering Management
IBM Watson Data & AI
@jaylimburn
“Two guys in a Starbucks can have access to the same computing power as a Fortune 500 company.”
Jim Deters, Co-Founder and CEO, Galvanize
Digital businesses are disrupting ALL industries and professions
72% are
vulnerable to
disruption within 3
years
OVER 90% of Business
Execs see the need for
this transformation
ONLY 14% believe they
have the ability to be able
to ACT on data quickly
Source: FROM DATA TO DISRUPTION: INNOVATION THROUGH DIGITAL INTELLIGENCE
IBM-sponsored report by Harvard Business Review Analytic Services, 2016
Tools & Infrastructure
• Need an environment
that enables a “fail
fast” approach
• Discrete tools
present barriers to
productivity
Governance
• If the data isn’t
secure, self-service
isn’t a reality
• Challenge
understanding data
lineage and getting to
a system of truth
Skills
• Data Science skills
are in low supply and
high demand
• Nurturing new data
professionals is
challenging
Data
• Data resides in silos
& difficult to access
• Unstructured and
external data wasn’t
considered
Why are enterprises struggling to embrace AI?
IBM Cloud / Watson and Cloud Platform / © 2018 IBM Corporation
“85% of CIOs
will be piloting AI programs by 2020”
“In 2018, blended AI will disrupt your
customer service and sales strategy”
75% of
commercial enterprise apps will use AI by 2021
Enterprises generate TONS OF DATA
Data
Data that requires
governance
Which must be cleaned
and shaped for training
…then models must be designed
…that must be hosted and
monitored
…and trained on
high performance compute
SPSS
To select an
optimal model...
Watson is AI for Smarter Business
Continuous Learning
Compliance
Assist
Customer
Care
Expert
Assist
Watson
Assistant for
Industry
Watson
Cybersecurity
Compare &
Comply
Voice of the
Customer
Watson Applications
Watson Business Solution
ISV & third party apps
Search & Find
Relevant Data
Connect & Access
Data
Prepare Data
(Ingest, Curate,
& Enrich)
Build & Train
AI Models
Deploy
AI Models
Monitor, Analyze,
Manage
Watson Machine Learning and Deep Learning as a Service
Watson Knowledge Catalog
Active Policy
Enforcement
AI Powered Catalog Search
and Social Collaboration
Data/Knowledge Kits
Intelligent AI Asset Catalog
Model Governance,
Traceability & Lineage
Watson Studio
Watson APIs
Watson Studio & Watson Knowledge Catalog
Supporting the end-to-end AI workflow
Prepare Data for Analysis
Build and Train ML/DL Models
Deploy Models
Monitor, Analyze and Manage
Search and Find Relevant Data
Connect & Access Data
Connect and
discover content
from multiple data
sources in the cloud
or on premises.
Bring structured
and unstructured
data to one toolkit.
Clean and prepare your
data with Data
Refinery, a tool to
create data
preparation pipelines
visually.
Use popular open
source libraries to
prepare unstructured
data.
Democratize the
creation of ML and DL
models. Design your AI
models
programmatically or
visually with the most
popular open source
and IBM ML/DL
frameworks or
leverage transfer
learning on pre-
trained models using
Watson tools to adapt
to your business
domain. Train at scale
on GPUs and
distributed compute
Deploy your models
easily and have them
scale automatically for
online, batch or
streaming use cases
Monitor the
performance of the
models in production
and trigger automatic
retraining and
redeployment of
models. Build
Enterprise Trust with
Bias Detection,
Mitigation Model
Robustness and
Testing Service Model
Security.
Find data (structured,
unstructured) and AI
assets (e.g., ML/DL
models, notebooks,
Watson Data Kits) in
the Knowledge
Catalog with
intelligent search and
giving the right access
to the right users.
80% are unable to collaborate on common data
84% say fragmented data gets in the way
More than 9 out of 10 require faster data and AI and analytics to compete
Source: FROM DATA TO DISRUPTION: INNOVATION THROUGH DIGITAL INTELLIGENCE
IBM-sponsored report by Harvard Business Review Analytic Services, 2016
Knowledge Workers have data issues
80% of their time is spent searching for data
The Data Lake fallacy
Enterprise Data Lakes are not
delivering on their promise
•  Inability to easily find data
•  Lack of trust in how the data will be
used
•  Obstacles to finding and sharing
•  Too difficult to ingest sources
•  Ever increasing cost
% Time spent working with data (Finding Data / Using Data / Sharing Data)
LOB Knowledge Workers: 60 / 30 / 10
Business Analysts: 75 / 15 / 10
Data Scientists: 80 / 17 / 3
Users spend significantly more time finding the correct data than extracting value from it.
“Through 2018, 90% of deployed data lakes will be useless
as they are overwhelmed with information assets captured
for uncertain use cases.” - Gartner
A 10% increase in data accessibility will result in more
than $65 million additional net income for the typical
Fortune 1000 company. - Baseline
Organizations are racing to unleash the power of data and apply AI
Information Architecture
Machine Learning
Deep Learning & AI
Data Lakes
Metadata Management Asset Catalogs
Data Catalogs
Collect, Understand and Govern Share, Activate and Curate
Discover | Catalog | Activate
Powered by Watson Data & AI
An intelligent asset catalog for a 360 degree view of your data and AI
Intelligent discovery of data and AI assets with advanced classification and profiling to provide context
• Intelligent data classification and profiling that determines what the data is and how it should be used
• Quickly build a 360 view of all assets and provide them for AI and Analytics
• Crawlers to auto discover usage information of data to understand how data is used
A rich metadata index of all data and AI assets with social collaboration and enhanced findability
• A business friendly shopping portal for your enterprise data
• Integrated with other platform solutions to facilitate self service analytics and AI
• Access controls and security
• Seamlessly integrated for productive use with Data prep, movement, dashboarding, Machine learning and Data science
Powerful governance tools to control and protect access to data with visibility to data use
• Business Glossary to define business terms and map them to technical assets
• Active Policy Engine to author, activate and enforce business policies and rules
• Governance and Insights dashboards to understand how data is used and how the governance program is impacting it
Watson Knowledge Catalog
Unlock tribal knowledge and unleash your data science projects
Value Proposition
Value enabled by an intelligent asset Catalog, and who benefits
Improve your data science practice by ensuring you discover data and AI assets more quickly than before, allowing teams to spend more time building AI applications and data science.
Who benefits: Data Scientists, Business Analysts, Developers
Improve the accuracy of your machine learning and deep learning models by ensuring that the most relevant data is always available to the data science community.
Who benefits: Data Scientists
Foster a culture of data collaboration allowing models to be shared and automatically classified across the enterprise, jump starting future data science projects and overall improving the data science practice.
Who benefits: Data Scientists, CDO, CMO
Understand where all your core business entities exist across your ecosystem and who is using them. Where is all my PII data?
Who benefits: CDO
Revitalize your data lake initiatives by augmenting it with an intelligent view encompassing all data, structured and unstructured and open it up for new business opportunities.
Who benefits: Data Scientists, Business Analysts, CDO
A 360 view of all information to fuel data innovation and AI
Cloud data | On prem data | Data we own | Data we don’t | Structured | Unstructured
Deep Learning
Machine Learning
Business Analytics
& Real time Analytics
Make your data workers more effective, super charging your data science and AI
Machine Learning
Metadata Index
UIs Collaboration
Enforcement Monitoring
Knowledge Catalog
APIs
Classification & NLU
Data lake
Data warehouse
Cloud Sources
On-premise sources
Open data
Social data
Sensor data
Dark data
Watson Knowledge Catalog
Differentiating Capabilities
Your data, Your way
• IBM is the only partner that does not require data movement & storage in our cloud.
• Providing choice & flexibility for data location gives the opportunity for highly regulated industries to benefit from IBM Cloud and Watson.
Focus on Enabling AI
• Watson Knowledge Catalog supports all assets… not just data. Think dashboards, data science & machine learning models, connections, notebooks, etc. – All available in one easy to find self service experience.
Seamless integration for Productive Use
• Watson Knowledge Catalog is fully integrated with Watson Studio meaning that once assets have been discovered users can easily drive productive use through integrated tools for data shaping, data movement, data science, AI and machine learning.
Modern Policy Activation
• Watson Knowledge Catalog contains an advanced policy enforcement engine to establish categories, policies & rules on data assets.
• Activation of these policies means data is masked or protected on the fly at the point of access
Structured & Unstructured
• To be able to truly ensure your Data Science teams have a full view of data it has to include structured and unstructured data.
• Watson Knowledge Catalog uses Watson Natural Language processing to extract and understand the value locked away in unstructured and structured data and presents it uniformly for consumption.
AI powered recommendations
• ‘Watson Recommends’ is our AI powered recommendation engine that analyzes the digital exhaust of the system and relationships of the assets to provide recommendations on assets for use.
• Integrated with the social collaboration capabilities of the catalog to further fuel the recommendation engine.
Watson Studio
Tools for supporting the end-to-end AI workflow
Model Lifecycle Management
Machine Learning Runtimes Deep Learning Runtimes
Authoring Tools
Cloud Infrastructure as a Service
• Most popular open source frameworks
• IBM best-in-class frameworks
• Create, collaborate, deploy, and monitor
• Best of breed open source & IBM tools
• Code (R, Python or Scala) and no-code/visual
modeling tools
• Fully managed service
• Container-based resource management
• Elastic pay as you go cpu/gpu power
Data Refinery
Making data fit for use
Self-service data refinement and cleaning Comprehensive profiling
Interactive visualization Scheduling and monitoring
Data Refinery
Self service data prep
Watson Recommends – AI powered suggestions
Use AI to improve your AI
• Catalog extended to
support all AI assets
not just data assets.
• AI based engine to
suggest the best
assets for each user
based upon digital
exhaust.
• Uses AI to self learn
and improve over time
to guide users to
previous dark data that
would have been lost 

• Search results automatically push the most relevant items to the top of the list
• New AI based Recommended assets list
• New most popular assets list
Coming Soon
• Understand and track model lineage – Know which data was used to train which models and when
Social Collaboration Features
Curation for everyone! Crowd-sourced model training
• Empower all users to
determine the value of
all assets across the
business
• Social interactions with
data feed into Watson
Recommends
algorithms to improve
over time

•  Users of the system
improve it over time
• Provide comments and
further enrich the
metadata that
describes your key
assets
• Most liked assets available to drive people to the most popular assets
• Comment and collaborate on datasets to train the Watson Recommends service and aid findability and understanding of assets
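One way to picture how social interactions could feed a recommendation list is a weighted popularity count. The sketch below is an illustration only and is not the Watson Recommends algorithm: the action names, the weights and the `rank_assets` helper are all assumptions made for this example.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Assumed weights: a like counts more than a comment, a comment more than a view.
WEIGHTS: Dict[str, float] = {"view": 1.0, "comment": 2.0, "like": 3.0}

# Each event is (user, asset, action); this shape is hypothetical.
Event = Tuple[str, str, str]


def rank_assets(events: Iterable[Event], top_n: int = 3) -> List[Tuple[str, float]]:
    """Score each asset by its weighted interactions and return the top ones."""
    scores: Dict[str, float] = defaultdict(float)
    for _user, asset, action in events:
        # Unknown actions contribute nothing to the score.
        scores[asset] += WEIGHTS.get(action, 0.0)
    # Highest score first; ties are broken alphabetically for stable output.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]


events = [
    ("ana", "sales_2018", "view"),
    ("ben", "sales_2018", "like"),
    ("ana", "churn_model", "like"),
    ("cy", "churn_model", "comment"),
    ("cy", "churn_model", "view"),
    ("ben", "raw_logs", "view"),
]
ranking = rank_assets(events)
```

A real engine would likely personalize results per user and learn its weights over time; the fixed weights here only show how likes and comments can move an asset up the list.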
Support for Unstructured Data
Tearing down the barriers between structured and unstructured content to power AI
•  Extract key entities
from unstructured
content and catalog
alongside structured
content opening up all
assets for self service
• Leverage Watson NLU to
identify the content and
context of unstructured
data assets to fuel your
Data Science and AI
practices
• Integrated document
previewers
• Categorize and
understand where your
key business entities
exist across all assets
• Integrated unstructured document viewers
• Natural Language Understanding extracts the key concepts from the document to be used in analytics and ML.
Intelligent Data Masking
Automatically mask sensitive data elements through policy activation
• Extensions to the policy
activation engine to
mask or anonymize data
at the point of
consumption
• Provides a personalized
extract of data based
upon company and
regulatory policies
allowing previously
unavailable assets to be
available for AI, Data
Science and Analytics
• Embedded within
Catalog and Refinery to
ensure extracts contain
anonymized data
• Author data anonymization rules natively within the Policy Manager
• Select from 2 options to anonymize data:
  • Substitute
  • Mask
• Apply data anonymization at the point of use within the system ensuring that users get value from the information they are permitted to see
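The two anonymization options named above, applied at the point of use, can be sketched in a few lines. This is a minimal illustration under stated assumptions and not the Policy Manager's API: the `Rule` class, the role names, the `anon_` prefix and the choice of a SHA-256 digest for substitution are all hypothetical.

```python
import hashlib
from dataclasses import dataclass
from typing import Callable, Dict, FrozenSet, List


def mask_value(value: str) -> str:
    """Mask: hide every character but keep the original length."""
    return "X" * len(value)


def substitute_value(value: str) -> str:
    """Substitute: swap in a consistent pseudonym so equal values still match."""
    digest = hashlib.sha256(value.encode("utf-8")).hexdigest()
    return "anon_" + digest[:8]


METHODS: Dict[str, Callable[[str], str]] = {
    "mask": mask_value,
    "substitute": substitute_value,
}


@dataclass(frozen=True)
class Rule:
    """Hypothetical anonymization rule: which column, how, and who is exempt."""
    column: str
    method: str
    exempt_roles: FrozenSet[str]


def apply_rules(rows: List[dict], rules: List[Rule], role: str) -> List[dict]:
    """Return a per-role copy of the rows; the stored data is never modified."""
    protected_rows = []
    for row in rows:
        protected = dict(row)
        for rule in rules:
            if rule.column in protected and role not in rule.exempt_roles:
                protected[rule.column] = METHODS[rule.method](str(protected[rule.column]))
        protected_rows.append(protected)
    return protected_rows


rules = [
    Rule("email", "substitute", frozenset({"compliance_officer"})),
    Rule("ssn", "mask", frozenset()),
]
rows = [{"name": "Ada", "email": "ada@example.com", "ssn": "123-45-6789"}]
analyst_view = apply_rules(rows, rules, role="data_scientist")
officer_view = apply_rules(rows, rules, role="compliance_officer")
```

Because the rules run when data is read rather than when it is stored, each role receives its own extract of the same underlying asset.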
Data Quality
Build a trusted currency for data 
• ML based data quality to
inform users quickly if an
asset is relevant for their
purpose.

• Optimized for large data
sets to calculate 8
different quality
dimensions
• Aggregate score used to
provide trust in quality of
data in a standardized
form across the
enterprise
• Provides drift analysis to
capture and track how
quality of the assets
evolves over time.
• Detailed breakdown to further inform the user which aspects contributed to overall score.
• Auto classification confidence scores and frequency analysis provide further detailed information on asset relevance
• Trusted currency calculated to inform users at a glance if this asset is useful.
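The idea of an aggregate score built from per-dimension scores, tracked over time, can be shown with a small sketch. The slides do not name the eight dimensions or say how they are combined, so everything here is an assumption for illustration: only two dimensions are computed (completeness and uniqueness), the aggregate is a plain average, and drift is the change in that aggregate between two snapshots.

```python
from statistics import mean
from typing import Dict, List, Optional

Row = Dict[str, Optional[str]]


def completeness(rows: List[Row], column: str) -> float:
    """Share of rows where the column has a non-empty value."""
    if not rows:
        return 0.0
    present = [r for r in rows if r.get(column) not in (None, "")]
    return len(present) / len(rows)


def uniqueness(rows: List[Row], column: str) -> float:
    """Share of non-empty values that are distinct."""
    values = [r[column] for r in rows if r.get(column) not in (None, "")]
    if not values:
        return 0.0
    return len(set(values)) / len(values)


def dimension_scores(rows: List[Row], column: str) -> Dict[str, float]:
    """Detailed breakdown: one score per (assumed) quality dimension."""
    return {
        "completeness": completeness(rows, column),
        "uniqueness": uniqueness(rows, column),
    }


def aggregate_score(scores: Dict[str, float]) -> float:
    """Single at-a-glance score; an unweighted mean is an assumption."""
    return mean(scores.values())


# Two snapshots of the same hypothetical asset.
last_week = [{"id": v} for v in ["a", "b", "c", "d"]]
this_week = [{"id": v} for v in ["a", "a", None, "d"]]

previous = aggregate_score(dimension_scores(last_week, "id"))
current = aggregate_score(dimension_scores(this_week, "id"))
drift = current - previous  # negative means quality has dropped
```

Keeping the per-dimension breakdown alongside the aggregate lets a user see which aspect caused a drop instead of only seeing that the score fell.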
Cloud Private for Data & Watson Studio enables the end to
end AI lifecycle.
ML Models, Metadata, Data
IBM Cloud
Private
for Data
Enterprise Data & Apps
IBM
Watson
Studio
AI Business Processes
Collect, Understand & Govern Share, Activate & Curate
IBM’s Hybrid capabilities
IBM Watson Studio
IBM Data Refinery
Data Preparation, Integration
IBM Watson 
Knowledge Catalog
Jupyter Notebooks, RStudio, Watson ML,
Visual Recognition
IBM Unified
Governance
Information Governance Catalog
IGC DataSets Compatibility
Business & technical metadata
IGC-Catalog Sync
Bi-directional
1
2
Enterprise Data, External
Data & Feeds
Enterprise Data, External
Data & Feeds
Data
Engineer
Compliance
Officer
CDO
 Data Scientist
 Citizen Data
Scientist/Analyst
App
Developer
IBM Industry Models
3rd Party metadata systems
Collect, Understand and Govern Share, Activate, Curate
IBM Cloud Private IBM Cloud
IBM DSX
Summary
Have you
found what
you are
looking for
yet?
Digital disruption means AI is paramount to
remaining competitive
AI requires understanding and context of ever
growing volumes of data and models
Self Service is key to unleashing the data
science and analytics teams to innovate and
power the AI needed to compete
An intelligent catalog provides the first step
towards the AI journey
And it's really easy to get started today…
All Information | All Users | Everywhere
Thanks
https://www.ibm.com/cloud/watson-knowledge-catalog
Watson Knowledge Catalog

More Related Content

What's hot

Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...DATAVERSITY
 
Slides: Accelerate and Assure the Adoption of Cloud Data Platforms Using Inte...
Slides: Accelerate and Assure the Adoption of Cloud Data Platforms Using Inte...Slides: Accelerate and Assure the Adoption of Cloud Data Platforms Using Inte...
Slides: Accelerate and Assure the Adoption of Cloud Data Platforms Using Inte...DATAVERSITY
 
Smart Data Webinar: Knowledge as a Service
Smart Data Webinar: Knowledge as a ServiceSmart Data Webinar: Knowledge as a Service
Smart Data Webinar: Knowledge as a ServiceDATAVERSITY
 
Building Effective Data Visualizations
Building Effective Data VisualizationsBuilding Effective Data Visualizations
Building Effective Data VisualizationsDATAVERSITY
 
DataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDATAVERSITY
 
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesDATAVERSITY
 
The Missing Link in Enterprise Data Governance - Automated Metadata Management
The Missing Link in Enterprise Data Governance - Automated Metadata ManagementThe Missing Link in Enterprise Data Governance - Automated Metadata Management
The Missing Link in Enterprise Data Governance - Automated Metadata ManagementDATAVERSITY
 
Big Data Analytics Architecture PowerPoint Presentation Slides
Big Data Analytics Architecture PowerPoint Presentation SlidesBig Data Analytics Architecture PowerPoint Presentation Slides
Big Data Analytics Architecture PowerPoint Presentation SlidesSlideTeam
 
Advanced Analytics Governance - Effective Model Management and Stewardship
Advanced Analytics Governance - Effective Model Management and StewardshipAdvanced Analytics Governance - Effective Model Management and Stewardship
Advanced Analytics Governance - Effective Model Management and StewardshipDATAVERSITY
 
Accelerate Your Move to the Cloud with Data Catalogs and Governance
Accelerate Your Move to the Cloud with Data Catalogs and GovernanceAccelerate Your Move to the Cloud with Data Catalogs and Governance
Accelerate Your Move to the Cloud with Data Catalogs and GovernanceDATAVERSITY
 
Data Insights and Analytics Webinar: CDO vs. CAO - What’s the Difference?
Data Insights and Analytics Webinar: CDO vs. CAO - What’s the Difference?Data Insights and Analytics Webinar: CDO vs. CAO - What’s the Difference?
Data Insights and Analytics Webinar: CDO vs. CAO - What’s the Difference?DATAVERSITY
 
Predictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal BallPredictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal BallDATAVERSITY
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects FailSense Corp
 
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...North Texas Chapter of the ISSA
 
Death of the Dashboard
Death of the DashboardDeath of the Dashboard
Death of the DashboardDATAVERSITY
 
Data Governance and Data Science to Improve Data Quality
Data Governance and Data Science to Improve Data QualityData Governance and Data Science to Improve Data Quality
Data Governance and Data Science to Improve Data QualityDATAVERSITY
 
Modern Metadata Strategies
Modern Metadata StrategiesModern Metadata Strategies
Modern Metadata StrategiesDATAVERSITY
 
Business Value Metrics for Data Governance
Business Value Metrics for Data GovernanceBusiness Value Metrics for Data Governance
Business Value Metrics for Data GovernanceDATAVERSITY
 
Are You Your Company's Chief Data Officer?
Are You Your Company's Chief Data Officer?Are You Your Company's Chief Data Officer?
Are You Your Company's Chief Data Officer?Brendan Aldrich
 

What's hot (20)

Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
 
Slides: Accelerate and Assure the Adoption of Cloud Data Platforms Using Inte...
Slides: Accelerate and Assure the Adoption of Cloud Data Platforms Using Inte...Slides: Accelerate and Assure the Adoption of Cloud Data Platforms Using Inte...
Slides: Accelerate and Assure the Adoption of Cloud Data Platforms Using Inte...
 
Smart Data Webinar: Knowledge as a Service
Smart Data Webinar: Knowledge as a ServiceSmart Data Webinar: Knowledge as a Service
Smart Data Webinar: Knowledge as a Service
 
Building Effective Data Visualizations
Building Effective Data VisualizationsBuilding Effective Data Visualizations
Building Effective Data Visualizations
 
DataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data Architecture
 
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
 
The Missing Link in Enterprise Data Governance - Automated Metadata Management
The Missing Link in Enterprise Data Governance - Automated Metadata ManagementThe Missing Link in Enterprise Data Governance - Automated Metadata Management
The Missing Link in Enterprise Data Governance - Automated Metadata Management
 
Big Data Analytics Architecture PowerPoint Presentation Slides
Big Data Analytics Architecture PowerPoint Presentation SlidesBig Data Analytics Architecture PowerPoint Presentation Slides
Big Data Analytics Architecture PowerPoint Presentation Slides
 
Advanced Analytics Governance - Effective Model Management and Stewardship
Advanced Analytics Governance - Effective Model Management and StewardshipAdvanced Analytics Governance - Effective Model Management and Stewardship
Advanced Analytics Governance - Effective Model Management and Stewardship
 
Accelerate Your Move to the Cloud with Data Catalogs and Governance
Accelerate Your Move to the Cloud with Data Catalogs and GovernanceAccelerate Your Move to the Cloud with Data Catalogs and Governance
Accelerate Your Move to the Cloud with Data Catalogs and Governance
 
Data Insights and Analytics Webinar: CDO vs. CAO - What’s the Difference?
Data Insights and Analytics Webinar: CDO vs. CAO - What’s the Difference?Data Insights and Analytics Webinar: CDO vs. CAO - What’s the Difference?
Data Insights and Analytics Webinar: CDO vs. CAO - What’s the Difference?
 
Predictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal BallPredictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal Ball
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects Fail
 
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
 
Death of the Dashboard
Death of the DashboardDeath of the Dashboard
Death of the Dashboard
 
Data Governance and Data Science to Improve Data Quality
Data Governance and Data Science to Improve Data QualityData Governance and Data Science to Improve Data Quality
Data Governance and Data Science to Improve Data Quality
 
Modern Metadata Strategies
Modern Metadata StrategiesModern Metadata Strategies
Modern Metadata Strategies
 
Business Value Metrics for Data Governance
Business Value Metrics for Data GovernanceBusiness Value Metrics for Data Governance
Business Value Metrics for Data Governance
 
Are You Your Company's Chief Data Officer?
Are You Your Company's Chief Data Officer?Are You Your Company's Chief Data Officer?
Are You Your Company's Chief Data Officer?
 

Similar to How to Consume Your Data for AI

How IBM is Creating a Foundation for Cloud Innovation
How IBM is Creating a Foundation for Cloud InnovationHow IBM is Creating a Foundation for Cloud Innovation
How IBM is Creating a Foundation for Cloud InnovationCCG
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationDatabricks
 
An AI Maturity Roadmap for Becoming a Data-Driven Organization
An AI Maturity Roadmap for Becoming a Data-Driven OrganizationAn AI Maturity Roadmap for Becoming a Data-Driven Organization
An AI Maturity Roadmap for Becoming a Data-Driven OrganizationDavid Solomon
 
Discovering Big Data in the Fog: Why Catalogs Matter
 Discovering Big Data in the Fog: Why Catalogs Matter Discovering Big Data in the Fog: Why Catalogs Matter
Discovering Big Data in the Fog: Why Catalogs MatterEric Kavanagh
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsCaserta
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationDenodo
 
Big Data Analytics with Microsoft
Big Data Analytics with MicrosoftBig Data Analytics with Microsoft
Big Data Analytics with MicrosoftCaserta
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricNathan Bijnens
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...DATAVERSITY
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIDenodo
 
AWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AIAWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AIAmazon Web Services
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneySai Paravastu
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseDatabricks
 
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSION
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSIONCisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSION
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSIONRenee Yao
 
What Data Do You Have and Where is It?
What Data Do You Have and Where is It? What Data Do You Have and Where is It?
What Data Do You Have and Where is It? Caserta
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DATAVERSITY
 
Tusker Corporate Profile
Tusker Corporate ProfileTusker Corporate Profile
Tusker Corporate ProfilePrashant Kumar
 
Breed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptxBreed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptxGautamPopli1
 

Similar to How to Consume Your Data for AI (20)

IBM Cloud pak for data brochure
IBM Cloud pak for data   brochureIBM Cloud pak for data   brochure
IBM Cloud pak for data brochure
 
How IBM is Creating a Foundation for Cloud Innovation





How to Consume Your Data for AI

  • 1. Consume your data for AI: The role of modern Asset Catalogs. Jay Limburn, Distinguished Engineer and Director of Offering Management, IBM Watson Data & AI (@jaylimburn)
  • 2. “Two guys in a Starbucks can have access to the same computing power as a Fortune 500 company.” (Jim Deters, Co-Founder and CEO, Galvanize.) Digital businesses are disrupting ALL industries and professions: 72% are vulnerable to disruption within 3 years; OVER 90% of business execs see the need for this transformation; ONLY 14% believe they have the ability to ACT on data quickly. Source: "From Data to Disruption: Innovation Through Digital Intelligence", IBM-sponsored report by Harvard Business Review Analytic Services, 2016.
  • 3. Why are enterprises struggling to embrace AI? Tools & Infrastructure: need an environment that enables a “fail fast” approach; discrete tools present barriers to productivity. Governance: if the data isn’t secure, self-service isn’t a reality; it is a challenge to understand data lineage and get to a system of truth. Skills: data science skills are in low supply and high demand; nurturing new data professionals is challenging. Data: data resides in silos and is difficult to access; unstructured and external data wasn’t considered. “85% of CIOs will be piloting AI programs by 2020.” “In 2018, blended AI will disrupt your customer service and sales strategy.” 75% of commercial enterprise apps will use AI by 2021. IBM Cloud / Watson and Cloud Platform / © 2018 IBM Corporation
  • 4. Enterprises generate TONS OF DATA: data that requires governance, which must be cleaned and shaped for training; …then models must be designed; …that must be hosted and monitored; …and trained on high performance compute; to select an optimal model... (SPSS)
  • 5. Watson is AI for Smarter Business. Continuous Learning; Compliance Assist; Customer Care; Expert Assist; Watson Assistant for Industry; Watson Cybersecurity; Compare & Comply; Voice of the Customer; Watson Applications; Watson Business Solution; ISV & third party apps. Search & Find Relevant Data; Connect & Access Data; Prepare Data (Ingest, Curate, & Enrich); Build & Train AI Models; Deploy AI Models; Monitor, Analyze, Manage. Watson Machine Learning and Deep Learning as a Service; Watson Knowledge Catalog; Active Policy Enforcement; AI Powered Catalog Search and Social Collaboration; Data/Knowledge Kits; Intelligent AI Asset Catalog; Model Governance, Traceability & Lineage; Watson Studio; Watson APIs.
  • 6. Watson Studio & Watson Knowledge Catalog: supporting the end-to-end AI workflow. Search and Find Relevant Data: find data (structured, unstructured) and AI assets (e.g., ML/DL models, notebooks, Watson Data Kits) in the Knowledge Catalog with intelligent search, and give the right access to the right users. Connect & Access Data: connect and discover content from multiple data sources in the cloud or on premises; bring structured and unstructured data to one toolkit. Prepare Data for Analysis: clean and prepare your data with Data Refinery, a tool to create data preparation pipelines visually; use popular open source libraries to prepare unstructured data. Build and Train ML/DL Models: democratize the creation of ML and DL models; design your AI models programmatically or visually with the most popular open source and IBM ML/DL frameworks, or leverage transfer learning on pre-trained models using Watson tools to adapt to your business domain; train at scale on GPUs and distributed compute. Deploy Models: deploy your models easily and have them scale automatically for online, batch or streaming use cases. Monitor, Analyze and Manage: monitor the performance of the models in production and trigger automatic retraining and redeployment of models; build enterprise trust with Bias Detection, Mitigation, Model Robustness and Testing Service, Model Security.
  • 7. Knowledge Workers have data issues: 80% are unable to collaborate on common data; 84% say fragmented data gets in the way; more than 9 out of 10 require faster data and AI and analytics to compete; 80% of their time is spent searching for data! Source: "From Data to Disruption: Innovation Through Digital Intelligence", IBM-sponsored report by Harvard Business Review Analytic Services, 2016.
  • 8. The Data Lake fallacy: Enterprise Data Lakes are not delivering on their promise. Inability to easily find data; lack of trust in how the data will be used; obstacles to finding and sharing; too difficult to ingest sources; ever increasing cost. % time spent working with data (finding / using / sharing): LOB Knowledge Workers 60 / 30 / 10; Business Analysts 75 / 15 / 10; Data Scientists 80 / 17 / 3. Users spent significantly more time finding the correct data, rather than extracting value from it. “Through 2018, 90% of deployed data lakes will be useless as they are overwhelmed with information assets captured for uncertain use cases.” (Gartner) A 10% increase in data accessibility will result in more than $65 million additional net income for the typical Fortune 1000 company. (Baseline)
  • 9. Organizations are racing to unleash the power of data and apply AI. Information Architecture; Machine Learning; Deep Learning & AI; Data Lakes; Metadata Management; Asset Catalogs; Data Catalogs. Collect, Understand and Govern; Share, Activate and Curate.
  • 10. Watson Knowledge Catalog: unlock tribal knowledge and unleash your data science projects. An intelligent asset catalog for a 360 degree view of your data and AI. Discover | Catalog | Activate. Powered by Watson Data & AI. Intelligent discovery of data and AI assets with advanced classification and profiling to provide context: intelligent data classification and profiling that determines what the data is and how it should be used; quickly build a 360 view of all assets and provide them for AI and analytics; crawlers to auto-discover usage information of data to understand how data is used. A rich metadata index of all data and AI assets with social collaboration and enhanced findability. Powerful governance tools to control and protect access to data with visibility to data use. Business Glossary to define business terms and map them to technical assets; Active Policy Engine to author, activate and enforce business policies and rules; Governance and Insights dashboards to understand how data is used and how the governance program is impacting it; a business friendly shopping portal for your enterprise data; integrated with other platform solutions to facilitate self-service analytics and AI; access controls and security; seamlessly integrated for productive use with data prep, movement, dashboarding, machine learning and data science.
  • 11. Value Proposition: value enabled by an intelligent asset catalog, and who benefits. Improve your data science practice by ensuring you discover data and AI assets more quickly than before, allowing teams to spend more time building AI applications and data science (Data Scientists, Business Analysts, Developers). Improve the accuracy of your machine learning and deep learning models by ensuring that the most relevant data is always available to the data science community (Data Scientists). Foster a culture of data collaboration allowing models to be shared and automatically classified across the enterprise, jump starting future data science projects and overall improving the data science practice (Data Scientists, CDO, CMO). Understand where all your core business entities exist across your ecosystem and who is using them: where is all my PII data? (CDO). Revitalize your data lake initiatives by augmenting it with an intelligent view encompassing all data, structured and unstructured, and open it up for new business opportunities (Data Scientists, Business Analysts, CDO).
  • 12. A 360 view of all information to fuel data innovation and AI. Make your data workers more effective, super charging your data science and AI. Cloud data | On prem data | Data we own | Data we don’t | Structured | Unstructured. Deep Learning; Machine Learning; Business Analytics & Real time Analytics. Knowledge Catalog: Machine Learning; Metadata Index; UIs; Collaboration; Enforcement; Monitoring; APIs; Classification & NLU. Data lake; Data warehouse; Cloud sources; On-premise sources; Open data; Social data; Sensor data; Dark data.
  • 13. Watson Knowledge Catalog: Differentiating Capabilities. Your data, your way: IBM is the only partner that does not require data movement & storage in our cloud; providing choice & flexibility for data location gives highly regulated industries the opportunity to benefit from IBM Cloud and Watson. Focus on enabling AI: Watson Knowledge Catalog supports all assets, not just data (think dashboards, data science & machine learning models, connections, notebooks, etc.), all available in one easy to find self-service experience. Seamless integration for productive use: Watson Knowledge Catalog is fully integrated with Watson Studio, meaning that once assets have been discovered users can easily drive productive use through integrated tools for data shaping, data movement, data science, AI and machine learning. Modern policy activation: Watson Knowledge Catalog contains an advanced policy enforcement engine to establish categories, policies & rules on data assets; activation of these policies means data is masked or protected on the fly at the point of access. Structured & unstructured: to truly ensure your data science teams have a full view of data, it has to include structured and unstructured data; Watson Knowledge Catalog uses Watson Natural Language processing to extract and understand the value locked away in unstructured and structured data and presents it uniformly for consumption. AI powered recommendations: ‘Watson Recommends’ is our AI powered recommendation engine that analyzes the digital exhaust of the system and relationships of the assets to provide recommendations on assets for use; it is integrated with the social collaboration capabilities of the catalog to further fuel the recommendation engine.
  • 14. Watson Studio: tools for supporting the end-to-end AI workflow. Model Lifecycle Management; Machine Learning Runtimes; Deep Learning Runtimes; Authoring Tools; Cloud Infrastructure as a Service. Most popular open source frameworks; IBM best-in-class frameworks; create, collaborate, deploy, and monitor; best of breed open source & IBM tools; code (R, Python or Scala) and no-code/visual modeling tools; fully managed service; container-based resource management; elastic pay-as-you-go CPU/GPU power.
  • 15. Data Refinery: making data fit for use. Self-service data refinement and cleaning; comprehensive profiling; interactive visualization; scheduling and monitoring. Data Refinery: self-service data prep.
  • 16. Watson Recommends, AI powered suggestions: use AI to improve your AI. Catalog extended to support all AI assets, not just data assets. AI based engine to suggest the best assets for each user based upon digital exhaust. Uses AI to self-learn and improve over time to guide users to previously dark data that would have been lost. Search results automatically push the most relevant items to the top of the list; new AI based Recommended assets list; new most popular assets list. Coming soon: understand and track model lineage (know which data was used to train which models and when).
  • 17. Social Collaboration Features: curation for everyone, crowd sourced model training. Empower all users to determine the value of all assets across the business. Social interactions with data feed into Watson Recommends algorithms to improve over time; users of the system improve it over time. Provide comments and further enrich the metadata that describes your key assets. Most liked assets available to drive people to the most popular assets. Comment and collaborate on datasets to train the Watson Recommends service and aid findability and understanding of assets.
  • 18. Support for Unstructured Data: tearing down the barriers between structured and unstructured content to power AI. Extract key entities from unstructured content and catalog them alongside structured content, opening up all assets for self-service. Leverage Watson NLU to identify the content and context of unstructured data assets to fuel your data science and AI practices. Integrated unstructured document viewers. Categorize and understand where your key business entities exist across all assets. Natural Language Understanding extracts the key concepts from the document to be used in analytics and ML.
  • 19. Intelligent Data Masking: automatically mask sensitive data elements through policy activation. Extensions to the policy activation engine to mask or anonymize data at the point of consumption. Provides a personalized extract of data based upon company and regulatory policies, allowing previously unavailable assets to be available for AI, data science and analytics. Embedded within Catalog and Refinery to ensure extracts contain anonymized data. Author data anonymization rules natively within the Policy Manager. Select from 2 options to anonymize data: Substitute or Mask. Apply data anonymization at the point of use within the system, ensuring that users get value from the information they are permitted to see.
  • 20. Data Quality Build a trusted currency for data • ML-based data quality to inform users quickly if an asset is relevant for their purpose. • Optimized for large data sets to calculate 8 different quality dimensions • Aggregate score used to provide trust in the quality of data in a standardized form across the enterprise • Provides drift analysis to capture and track how the quality of assets evolves over time. • Detailed breakdown to further inform the user which aspects contributed to the overall score. • Auto-classification confidence scores and frequency analysis provide further detailed information on asset relevance • Trusted currency calculated to inform users at a glance if this asset is useful.
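The aggregate score and drift analysis on slide 20 can be pictured with a small Python sketch. The slide does not name the 8 quality dimensions or say how they are combined, so the dimension names, the weighted-average formula, and the function names below are assumptions for illustration rather than the product's actual method.

```python
from typing import Dict, List, Optional


def aggregate_score(dimensions: Dict[str, float],
                    weights: Optional[Dict[str, float]] = None) -> float:
    """Combine per-dimension scores (each in 0..1) into a weighted average."""
    if not dimensions:
        raise ValueError("at least one quality dimension is required")
    if weights is None:
        # Assumption: equal weighting when no weights are supplied.
        weights = {name: 1.0 for name in dimensions}
    total_weight = sum(weights[name] for name in dimensions)
    weighted_sum = sum(score * weights[name]
                       for name, score in dimensions.items())
    return weighted_sum / total_weight


def drift(history: List[float]) -> List[float]:
    """Change in aggregate score between consecutive measurements."""
    return [round(later - earlier, 6)
            for earlier, later in zip(history, history[1:])]


# Hypothetical dimension names; the slide does not list the real ones.
snapshot = {"completeness": 0.9, "validity": 0.7}
score = aggregate_score(snapshot)
changes = drift([0.8, 0.75, 0.85])
```

Keeping the per-dimension scores alongside the aggregate is what makes the "detailed breakdown" on the slide possible: the single number gives the at-a-glance signal, and the dictionary shows which dimensions pulled it up or down.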
  • 21. Cloud Private for Data & Watson Studio enable the end-to-end AI lifecycle. Diagram labels: ML Models, Metadata, Data • IBM Cloud Private for Data • Enterprise Data & Apps • IBM Watson Studio • AI Business Processes • Collect, Understand & Govern • Share, Activate & Curate
  • 22. IBM’s Hybrid capabilities Diagram labels: IBM Watson Studio • IBM Data Refinery • Data Preparation, Integration • IBM Watson Knowledge Catalog • Jupyter Notebooks, RStudio, Watson ML, Visual Recognition • IBM Unified Governance • Information Governance Catalog • IGC DataSets Compatibility: Business & technical metadata • IGC-Catalog Sync: Bi-directional • Enterprise Data, External Data & Feeds • Data Engineer • Compliance Officer • CDO • Data Scientist • Citizen Data Scientist/Analyst • App Developer • IBM Industry Models • 3rd Party metadata systems • Collect, Understand and Govern • Share, Activate, Curate • IBM Cloud Private • IBM Cloud • IBM DSX
  • 23. Summary Have you found what you are looking for yet? • Digital disruption means AI is paramount to remaining competitive • AI requires understanding and context of ever-growing volumes of data and models • Self-service is key to unleashing the data science and analytics teams to innovate and power the AI needed to compete • An intelligent catalog provides the first step towards the AI journey • And it's really easy to get started today…
  • 24. All Information | All Users | Everywhere Thanks Watson Knowledge Catalog https://www.ibm.com/cloud/watson-knowledge-catalog