SlideShare a Scribd company logo
Empowering Data Scientists with the Alation Data Catalog
11/6/2019
Ming Yuan
Zurich North America
©Zurich
INTERNAL USE ONLY
Zurich Insurance Group is a leading multi-line insurer that serves its customers in global and local markets.
With approximately 54,000 employees, Zurich provides a wide range of property and casualty, and life
insurance products and services in more than 210 countries and territories. Zurich’s customers include
individuals, small businesses, and mid-sized and large companies, as well as multinational corporations. The
Group is headquartered in Zurich, Switzerland, where it was founded in 1872. The holding company, Zurich
Insurance Group Ltd (ZURN), is listed on the SIX Swiss Exchange and has a level I American Depositary
Receipt (ZURVY) program, which is traded over-the-counter on OTCQX.
Zurich North America (ZNA), part of the Zurich Insurance group, is one of the largest providers of insurance
solutions and services to businesses and individuals in the North America. Our customers represent industries
ranging from agriculture to construction and include more than 90 percent of the Fortune 500. We’ve
backed the building of some of the most recognizable structures in North America — from the Hoover Dam
to Madison Square Garden to the Confederation Bridge.
Introduction of Zurich Insurance
2
©Zurich
INTERNAL USE ONLY
 Data warehouse system serves as the single source of truth in the enterprise
 Metadata is collected and stored in Information Governance Catalog system
 Governance processes on data access and utilization are established
 Data is used in day-to-day decision makings in key business domains
 A strong data science team delivers predictive models and business insights
 We are an early adopter of big-data analytics and cloud analytics
Zurich has leveraged data for decades
3
Multiple Databases
On-premises Data Warehouse
Hadoop Data Lake
Cloud Data
Lake
©Zurich
INTERNAL USE ONLY
Customer-led
Improve service quality and
customer experience
Innovative
Better products, services and
customer care
Easy to work with
More agile and responsive
organization
Zurich is a data-driven innovative company
4
Zurich
Strategy
Data Driven
Business
Impact
Personalized offers
Predictable behaviors
Prescriptive insights
Optimized business performance
Improved operational efficiency
New flexible propositions
Tailored pricing
Real-time service delivery
Drive customer and business impact through Data & Analytics solutions
Data Driven Customer Experience
Generate actionable customer insights to
improve customer experience
Data Driven Automation
Improve data flows and
intelligent processing
Data Driven Products
and Services
Use data insights to create and upgrade
products and services
Data Driven Decision Making
Embed critical business data in decision making processes and systems
Data
Strategy
Core
Pillars
©Zurich
INTERNAL USE ONLY
What is the focus of Data and Analytics at Zurich?
5
We build and deliver new data and insights that help our customers and
Zurich make better decisions and drive business impact
Customer ClaimsUnderwriting
 Share data and insights with our
customers to help our customers better
understand and manage their risk
 Improve outcomes in the claim handling
process – based on potential for severity
 Enhance risk selection, program structure
and pricing decisions
 Support portfolio management decisions
Data Foundation
 Invest in data (internal and external) and enhance accessibility and usability
 Invest in building a more data-enabled organization
©Zurich
INTERNAL USE ONLY
 Legacy metadata management system is less
user-friendly, not allowing for dashboard view for
objects and search
 Metadata is not fully reviewed and certified as part
of an automated stewardship workflow
 Metadata system doesn’t provide data-utilization
patterns from machine learning
and crowdsourcing
 Data samples and logs not available to
improve usability
 Metadata import not automated within UI, requires
API or manual entry
Finding the right data took a lot of time and effort
6
©Zurich
INTERNAL USE ONLY
Key features that data stewards look for from a data catalog
7
Maintain business glossary for data domains that are owned by function or business unit
Import technical metadata and catalog it as data assets
Curate technical metadata relating them to logical business terms
Maintain data-flows mapping transformations
Specify data quality measures relating to the logical data model
Specify data quality reports & measures to be applied to concrete technical data assets
©Zurich
INTERNAL USE ONLY
Alation automates time-consuming steps in metadata ingestion
8
Database
Data
Warehouse
Cloud Data
Lake
JSON
Streams
Ingest and refresh schema, table, and column definitions
Build data lineage, popularity, common queries, and more
Profile and store sample data sets
Collect user information and usage metrics
Open APIs to programmatically import business glossaries
2,053,632
©Zurich
INTERNAL USE ONLY
Key features that data consumers look for from a data catalog
9
Search, explore and discover data assets
explore assets by policy level, owner, business terms, metrics, systems, physical structures, etc.
Interpret data with correct meaning and context
find terms, scope, granularity, relations to other data sets, processes & use cases, etc.
Retrieve unique, validated golden record of master- & reference data
look at actual data values available of managed master & reference-data and understand change process
Navigate data flows to analyse processes and assess change impact
visually trace data lineage on detail level or get an overview of data-flows on an system interface level
Understand quality of data by looking at quality rules
analyse how suitable the available information is by seeing what rules it adheres to on various dimensions
Evaluate data quality reports and drive improvement actions
visualize quality status on management and detail level and make improvement actions trackable
©Zurich
INTERNAL USE ONLY
Natural-language search makes it easy to discover unknowns
10
English in, answers out.
No need to translate into technical labels, jargon or
coded data
Find tables, files, queries, articles, experts and
BI workbooks in one place
Keep results fresh with scheduled metadata and query
log extractions
Different objects…
Article
Published Query
Table (Description)
©Zurich
INTERNAL USE ONLY
Users and Stewards actively curate Alation Catalog pages
11
Endorsements, deprecations,
and warnings propagate
best practices
Sample entries for better
understanding the data
Query sharing to
encourage reuse
Find subject matter experts by
table usage
View popularity rankings
to determine the most
used asset
©Zurich
INTERNAL USE ONLY
Everyone collaborates with Articles and Conversations
Seamlessly communicate
directly in Alation
Reference past conversations
Increase transparency
by tracking Contributors
©Zurich
INTERNAL USE ONLY
Query intelligently with the Compose feature
13
Download and examine query
results
Help develop SQL statements
©Zurich
INTERNAL USE ONLY
What did the users say about the tool?
14
Functionality Legacy Metadata Tool Points Alation Points
Functional
Fuzzy Search It does not have fuzzy
search.
0 It has fuzzy search 1
Search Algorithm Not very efficient 0 Very efficient 1
Advanced search Advanced search is
available
1 Advanced search is available 1
Data Lineage Very complex 0 Less complex 1
Data Profiling Not readily available 0 Its available 1
Sample Data Not readily available 0 Its available 1
SQL query capability Not available 0 Its available 1
SQL query log access Not available 0 Its available 1
Auto-completion suggestions in SQL Not available 0 Its available with @ 1
Connect to DataIKU Not possible 0 Connected to dataIKU 1
Query on Metadata Its available 1 Not available 0
Search multiple words on single search Its available through query 1 Not available 0
Machine Learning capability Not available 0 Its available 1
REST API Its available 1 Its available 1
Download the search result Its available 1 Not available 0
Add reference and comments with easy
steps
Not available 0 Its available 1
Glossary terms Its available 1 Its available 1
Saving and Publishing the Data Not available 0 Its available 1
How often a particular table or schema
accessed
It is not possible to track 0 It is possible to track 1
Who accessed the data It is not possible to track 0 It is possible to track 1
Metadata and Data side by side Not possible 0 It is possible 1
Non Functional
Collaboration with teams No 0 Yes 1
Need a team for metadata management Separate team needed 0 Users are the contributors as
well.
1
Improve productivity No 0 Yes 1
Search performance Slow 0 Fast 1
6 22
Alation seems very easy to use and was easy
to follow.”
-- Enterprise Data Specialist
“
 Ability to endorse, warn, deprecate almost any item (source,
table, column, query) and redirect users from a deprecated
item to an endorsed item - depth of tool is very impressive
 Has potential to dramatically improve quality of metadata
through crowd-sourcing and maintenance
(see Dependencies)
 Integration with Dataiku - ability to add a table in Alation to a
Dataiku project, either starting from Alation or from Dataiku
 Traceability of most frequent users and popularity of Alation
items
 Use of text analytics for data discovery and smart query
composition
 Ability to publish queries for other users to view and leverage
-- Data Scientist
©Zurich
INTERNAL USE ONLY
Data catalog is a critical component in self-service analytics
15
We are integrating Alation with data engineering and data processing tools for data scientists.
Easy imports
Easy searches
©Zurich
INTERNAL USE ONLY
 Modern data catalogs increase the business value of data and analytics investments
 Listen to the data consumers
 Choose one data catalog that can naturally fit into your data and metadata management
processes
 Automate metadata ingestion and validation processes
 Collaboration, contribution, and self-service enablement are key functions
 Work with a small group of experts to start the adoption
Data catalog fulfills the promise of building data-driven culture
16
Search QueryCurate Collaborate Reuse
©Zurich
INTERNAL USE ONLY
17
Thank you
ming.yuan@zurichna.com

More Related Content

What's hot

Building the enterprise data architecture
Building the enterprise data architectureBuilding the enterprise data architecture
Building the enterprise data architecture
Costa Pissaris
 
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
Concept Searching, Inc
 

What's hot (20)

Consumer Data Management
Consumer Data ManagementConsumer Data Management
Consumer Data Management
 
Unlocking New Insights with Information Discovery
Unlocking New Insights with Information DiscoveryUnlocking New Insights with Information Discovery
Unlocking New Insights with Information Discovery
 
Data Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingData Virtualization Modernizes Biobanking
Data Virtualization Modernizes Biobanking
 
AWS Summit Singapore - Accelerate Digital Transformation through AI-powered C...
AWS Summit Singapore - Accelerate Digital Transformation through AI-powered C...AWS Summit Singapore - Accelerate Digital Transformation through AI-powered C...
AWS Summit Singapore - Accelerate Digital Transformation through AI-powered C...
 
The Economic Value of Data: A New Revenue Stream for Global Custodians
The Economic Value of Data: A New Revenue Stream for Global CustodiansThe Economic Value of Data: A New Revenue Stream for Global Custodians
The Economic Value of Data: A New Revenue Stream for Global Custodians
 
Optimize and Organize Your Content with conceptClassifier for File Shares
Optimize and Organize Your Content with conceptClassifier for File Shares Optimize and Organize Your Content with conceptClassifier for File Shares
Optimize and Organize Your Content with conceptClassifier for File Shares
 
The Emerging Data Lake IT Strategy
The Emerging Data Lake IT StrategyThe Emerging Data Lake IT Strategy
The Emerging Data Lake IT Strategy
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 
Data-Ed Online Webinar: Metadata Strategies
Data-Ed Online Webinar: Metadata StrategiesData-Ed Online Webinar: Metadata Strategies
Data-Ed Online Webinar: Metadata Strategies
 
Working With Different Kinds of Data
Working With Different Kinds of DataWorking With Different Kinds of Data
Working With Different Kinds of Data
 
Building the enterprise data architecture
Building the enterprise data architectureBuilding the enterprise data architecture
Building the enterprise data architecture
 
Setting Up the Data Lake
Setting Up the Data LakeSetting Up the Data Lake
Setting Up the Data Lake
 
Analytics in a day
Analytics in a day Analytics in a day
Analytics in a day
 
GDPR Compliance Made Easy with Data Virtualization
GDPR Compliance Made Easy with Data VirtualizationGDPR Compliance Made Easy with Data Virtualization
GDPR Compliance Made Easy with Data Virtualization
 
Reduce Your Taxonomy Deployment Time from Months to Weeks Webinar
Reduce Your Taxonomy Deployment Time from Months to Weeks WebinarReduce Your Taxonomy Deployment Time from Months to Weeks Webinar
Reduce Your Taxonomy Deployment Time from Months to Weeks Webinar
 
A Study on 21st Century Business Intelligence
A Study on 21st Century Business Intelligence A Study on 21st Century Business Intelligence
A Study on 21st Century Business Intelligence
 
Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!
 
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
 
Data Marketplace - Rethink the Data
Data Marketplace - Rethink the DataData Marketplace - Rethink the Data
Data Marketplace - Rethink the Data
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
 

Similar to Forrester2019

Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
Julian Tong
 
Enterprise Data Management
Enterprise Data ManagementEnterprise Data Management
Enterprise Data Management
Bhavendra Chavan
 
Data-Ed: Trends in Data Modeling
Data-Ed: Trends in Data ModelingData-Ed: Trends in Data Modeling
Data-Ed: Trends in Data Modeling
Data Blueprint
 
Optimizing_Customer_Lifecycle_with_Big_Data_Analytics_4079WP
Optimizing_Customer_Lifecycle_with_Big_Data_Analytics_4079WPOptimizing_Customer_Lifecycle_with_Big_Data_Analytics_4079WP
Optimizing_Customer_Lifecycle_with_Big_Data_Analytics_4079WP
Radium Communications
 

Similar to Forrester2019 (20)

Five Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data Governance
 
Guided Analytics vs. Self-Service BI: Choose Your Path to Data-driven Success!
Guided Analytics vs. Self-Service BI: Choose Your Path to Data-driven Success!Guided Analytics vs. Self-Service BI: Choose Your Path to Data-driven Success!
Guided Analytics vs. Self-Service BI: Choose Your Path to Data-driven Success!
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
Sgcp14dunlea
Sgcp14dunleaSgcp14dunlea
Sgcp14dunlea
 
Improving practitioner decision making capabilities with data and analytics v1
Improving practitioner decision making capabilities with data and analytics v1Improving practitioner decision making capabilities with data and analytics v1
Improving practitioner decision making capabilities with data and analytics v1
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with Alation
 
RWE & Patient Analytics Leveraging Databricks – A Use Case
RWE & Patient Analytics Leveraging Databricks – A Use CaseRWE & Patient Analytics Leveraging Databricks – A Use Case
RWE & Patient Analytics Leveraging Databricks – A Use Case
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
Enterprise Data Management
Enterprise Data ManagementEnterprise Data Management
Enterprise Data Management
 
ExpertsLive NL 2022 - Microsoft Purview - What's in it for my organization?
ExpertsLive NL 2022 - Microsoft Purview - What's in it for my organization?ExpertsLive NL 2022 - Microsoft Purview - What's in it for my organization?
ExpertsLive NL 2022 - Microsoft Purview - What's in it for my organization?
 
KIT601 Unit I.pptx
KIT601 Unit I.pptxKIT601 Unit I.pptx
KIT601 Unit I.pptx
 
Oracle canvas 140604 2
Oracle canvas 140604 2Oracle canvas 140604 2
Oracle canvas 140604 2
 
Data-Ed Online: Trends in Data Modeling
Data-Ed Online: Trends in Data ModelingData-Ed Online: Trends in Data Modeling
Data-Ed Online: Trends in Data Modeling
 
Data-Ed: Trends in Data Modeling
Data-Ed: Trends in Data ModelingData-Ed: Trends in Data Modeling
Data-Ed: Trends in Data Modeling
 
ERP technology Areas.pptx
ERP technology Areas.pptxERP technology Areas.pptx
ERP technology Areas.pptx
 
DGIQ - Case Studies_ Applications of Data Governance in the Enterprise (Final...
DGIQ - Case Studies_ Applications of Data Governance in the Enterprise (Final...DGIQ - Case Studies_ Applications of Data Governance in the Enterprise (Final...
DGIQ - Case Studies_ Applications of Data Governance in the Enterprise (Final...
 
The Analytic Trifecta: Abstraction, the Cloud, and Visualization
The Analytic Trifecta: Abstraction, the Cloud, and VisualizationThe Analytic Trifecta: Abstraction, the Cloud, and Visualization
The Analytic Trifecta: Abstraction, the Cloud, and Visualization
 
The Path to Data and Analytics Modernization
The Path to Data and Analytics ModernizationThe Path to Data and Analytics Modernization
The Path to Data and Analytics Modernization
 
Optimizing_Customer_Lifecycle_with_Big_Data_Analytics_4079WP
Optimizing_Customer_Lifecycle_with_Big_Data_Analytics_4079WPOptimizing_Customer_Lifecycle_with_Big_Data_Analytics_4079WP
Optimizing_Customer_Lifecycle_with_Big_Data_Analytics_4079WP
 

More from Ming Yuan (7)

Cloud and Analytics -- 2020 sparksummit
Cloud and Analytics -- 2020 sparksummitCloud and Analytics -- 2020 sparksummit
Cloud and Analytics -- 2020 sparksummit
 
R & Python on Hadoop
R & Python on HadoopR & Python on Hadoop
R & Python on Hadoop
 
SSO with sfdc
SSO with sfdcSSO with sfdc
SSO with sfdc
 
Singleton
SingletonSingleton
Singleton
 
Rest and beyond
Rest and beyondRest and beyond
Rest and beyond
 
Simplifying Apache Cascading
Simplifying Apache CascadingSimplifying Apache Cascading
Simplifying Apache Cascading
 
Building calloutswithoutwsdl2apex
Building calloutswithoutwsdl2apexBuilding calloutswithoutwsdl2apex
Building calloutswithoutwsdl2apex
 

Recently uploaded

standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Domenico Conte
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 

Recently uploaded (20)

standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDB
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
Uber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis ReportUber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis Report
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 

Forrester2019

  • 1. Empowering Data Scientists with the Alation Data Catalog 11/6/2019 Ming Yuan Zurich North America
  • 2. ©Zurich INTERNAL USE ONLY Zurich Insurance Group is a leading multi-line insurer that serves its customers in global and local markets. With approximately 54,000 employees, Zurich provides a wide range of property and casualty, and life insurance products and services in more than 210 countries and territories. Zurich’s customers include individuals, small businesses, and mid-sized and large companies, as well as multinational corporations. The Group is headquartered in Zurich, Switzerland, where it was founded in 1872. The holding company, Zurich Insurance Group Ltd (ZURN), is listed on the SIX Swiss Exchange and has a level I American Depositary Receipt (ZURVY) program, which is traded over-the-counter on OTCQX. Zurich North America (ZNA), part of the Zurich Insurance group, is one of the largest providers of insurance solutions and services to businesses and individuals in the North America. Our customers represent industries ranging from agriculture to construction and include more than 90 percent of the Fortune 500. We’ve backed the building of some of the most recognizable structures in North America — from the Hoover Dam to Madison Square Garden to the Confederation Bridge. Introduction of Zurich Insurance 2
  • 3. ©Zurich INTERNAL USE ONLY  Data warehouse system serves as the single source of truth in the enterprise  Metadata is collected and stored in Information Governance Catalog system  Governance processes on data access and utilization are established  Data is used in day-to-day decision makings in key business domains  A strong data science team delivers predictive models and business insights  We are an early adopter of big-data analytics and cloud analytics Zurich has leveraged data for decades 3 Multiple Databases On-premises Data Warehouse Hadoop Data Lake Cloud Data Lake
  • 4. ©Zurich INTERNAL USE ONLY Customer-led Improve service quality and customer experience Innovative Better products, services and customer care Easy to work with More agile and responsive organization Zurich is a data-driven innovative company 4 Zurich Strategy Data Driven Business Impact Personalized offers Predictable behaviors Prescriptive insights Optimized business performance Improved operational efficiency New flexible propositions Tailored pricing Real-time service delivery Drive customer and business impact through Data & Analytics solutions Data Driven Customer Experience Generate actionable customer insights to improve customer experience Data Driven Automation Improve data flows and intelligent processing Data Driven Products and Services Use data insights to create and upgrade products and services Data Driven Decision Making Embed critical business data in decision making processes and systems Data Strategy Core Pillars
  • 5. ©Zurich INTERNAL USE ONLY What is the focus of Data and Analytics at Zurich? 5 We build and deliver new data and insights that help our customers and Zurich make better decisions and drive business impact Customer ClaimsUnderwriting  Share data and insights with our customers to help our customers better understand and manage their risk  Improve outcomes in the claim handling process – based on potential for severity  Enhance risk selection, program structure and pricing decisions  Support portfolio management decisions Data Foundation  Invest in data (internal and external) and enhance accessibility and usability  Invest in building a more data-enabled organization
  • 6. ©Zurich INTERNAL USE ONLY  Legacy metadata management system is less user-friendly, not allowing for dashboard view for objects and search  Metadata is not fully reviewed and certified as part of an automated stewardship workflow  Metadata system doesn’t provide data-utilization patterns from machine learning and crowdsourcing  Data samples and logs not available to improve usability  Metadata import not automated within UI, requires API or manual entry Finding the right data took a lot of time and effort 6
  • 7. ©Zurich INTERNAL USE ONLY Key features that data stewards look for from a data catalog 7 Maintain business glossary for data domains that are owned by function or business unit Import technical metadata and catalog it as data assets Curate technical metadata relating them to logical business terms Maintain data-flows mapping transformations Specify data quality measures relating to the logical data model Specify data quality reports & measures to be applied to concrete technical data assets
  • 8. ©Zurich INTERNAL USE ONLY Alation automates time-consuming steps in metadata ingestion 8 Database Data Warehouse Cloud Data Lake JSON Streams Ingest and refresh schema, table, and column definitions Build data lineage, popularity, common queries, and more Profile and store sample data sets Collect user information and usage metrics Open APIs to programmatically import business glossaries 2,053,632
  • 9. ©Zurich INTERNAL USE ONLY Key features that data consumers look for from a data catalog 9 Search, explore and discover data assets explore assets by policy level, owner, business terms, metrics, systems, physical structures, etc. Interpret data with correct meaning and context find terms, scope, granularity, relations to other data sets, processes & use cases, etc. Retrieve unique, validated golden record of master- & reference data look at actual data values available of managed master & reference-data and understand change process Navigate data flows to analyse processes and assess change impact visually trace data lineage on detail level or get an overview of data-flows on an system interface level Understand quality of data by looking at quality rules analyse how suitable the available information is by seeing what rules it adheres to on various dimensions Evaluate data quality reports and drive improvement actions visualize quality status on management and detail level and make improvement actions trackable
  • 10. ©Zurich INTERNAL USE ONLY Natural-language search makes it easy to discover unknowns 10 English in, answers out. No need to translate into technical labels, jargon or coded data Find tables, files, queries, articles, experts and BI workbooks in one place Keep results fresh with scheduled metadata and query log extractions Different objects… Article Published Query Table (Description)
  • 11. ©Zurich INTERNAL USE ONLY Users and Stewards actively curate Alation Catalog pages 11 Endorsements, deprecations, and warnings propagate best practices Sample entries for better understanding the data Query sharing to encourage reuse Find subject matter experts by table usage View popularity rankings to determine the most used asset
  • 12. ©Zurich INTERNAL USE ONLY Everyone collaborates with Articles and Conversations Seamlessly communicate directly in Alation Reference past conversations Increase transparency by tracking Contributors
  • 13. ©Zurich INTERNAL USE ONLY Query intelligently with the Compose feature 13 Download and examine query results Help develop SQL statements
  • 14. ©Zurich INTERNAL USE ONLY What did the users say about the tool? 14 Functionality Legacy Metadata Tool Points Alation Points Functional Fuzzy Search It does not have fuzzy search. 0 It has fuzzy search 1 Search Algorithm Not very efficient 0 Very efficient 1 Advanced search Advanced search is available 1 Advanced search is available 1 Data Lineage Very complex 0 Less complex 1 Data Profiling Not readily available 0 Its available 1 Sample Data Not readily available 0 Its available 1 SQL query capability Not available 0 Its available 1 SQL query log access Not available 0 Its available 1 Auto-completion suggestions in SQL Not available 0 Its available with @ 1 Connect to DataIKU Not possible 0 Connected to dataIKU 1 Query on Metadata Its available 1 Not available 0 Search multiple words on single search Its available through query 1 Not available 0 Machine Learning capability Not available 0 Its available 1 REST API Its available 1 Its available 1 Download the search result Its available 1 Not available 0 Add reference and comments with easy steps Not available 0 Its available 1 Glossary terms Its available 1 Its available 1 Saving and Publishing the Data Not available 0 Its available 1 How often a particular table or schema accessed It is not possible to track 0 It is possible to track 1 Who accessed the data It is not possible to track 0 It is possible to track 1 Metadata and Data side by side Not possible 0 It is possible 1 Non Functional Collaboration with teams No 0 Yes 1 Need a team for metadata management Separate team needed 0 Users are the contributors as well. 1 Improve productivity No 0 Yes 1 Search performance Slow 0 Fast 1 6 22 Alation seems very easy to use and was easy to follow.” -- Enterprise Data Specialist “  Ability to endorse, warn, deprecate almost any item (source, table, column, query) and redirect users from a deprecated item to an endorsed item - depth of tool is very impressive  Has potential to dramatically improve quality of metadata through crowd-sourcing and maintenance (see Dependencies)  Integration with Dataiku - ability to add a table in Alation to a Dataiku project, either starting from Alation or from Dataiku  Traceability of most frequent users and popularity of Alation items  Use of text analytics for data discovery and smart query composition  Ability to publish queries for other users to view and leverage -- Data Scientist
  • 15. ©Zurich INTERNAL USE ONLY Data catalog is a critical component in self-service analytics 15 We are integrating Alation with data engineering and data processing tools for data scientists. Easy imports Easy searches
  • 16. ©Zurich INTERNAL USE ONLY  Modern data catalogs increase the business value of data and analytics investments  Listen to the data consumers  Choose one data catalog that can naturally fit into your data and metadata management processes  Automate metadata ingestion and validation processes  Collaboration, contribution, and self-service enablement are key functions  Work with a small group of experts to start the adoption Data catalog fulfills the promise of building data-driven culture 16 Search QueryCurate Collaborate Reuse
  • 17. ©Zurich INTERNAL USE ONLY 17 Thank you ming.yuan@zurichna.com