18. ATACCAMA ONE | Platform Components
Data Discovery
& Profiling
Data Quality
Management
Master Data
Management
Big Data
Processing &
Data
Integration
19. SaaS
Hadoop
On Premise
Cloud
ROBUST DATA PROCESSING ENGINE
Any Data / Any Domain – Integration – Performance –
Scalability
ENTERPRISE-PROVEN CAPABILITIES
High Availability – Auditing – Identity Management –
Data Lineage
COLLABORATIVE DATA STEWARDSHIP UI
AI & Machine Learning – Self-Service – Collaboration
– UX/UI
›
ATACCAMA ONE | Platform Features
20. ATACCAMA ONE | High Level Architecture
BUSINESS
VALUE
VALUE
CREATION
DATA
CURATION
DATA
DELIVERY
DATA
ACQUISITION
Process
Improvement
New Campaign
Regulatory
New Product
New App
Analytics
SMART
ALGORITHMS
DATA
STEWARDS
›
› ›
CURATED DATA
CATALOG
DATA APIs
METADATA APIs
Provide
Discover | Profile | Catalog | Cleanse
Govern | Consolidate | Master
21. Our collaboration – high level summar
› Our integration allows for metadata produced by Ataccama ONE DQM/MDM platform offerings to be
cataloged by the Collibra Data Catalog platform
› Collibra’s Data Catalog will define and describe business rules
› Ataccama ONE will execute business rules and feed back through to Collibra for client facing visualization
and logging.
22. Ataccama Data Quality Engine
Framework application originally designed to solve various DQ use cases.
Allows user to create custom ETL-based jobs via Eclipse based GUI.
Contains built-in application server and workflow allowing scheduling and triggering jobs via HTTP.
All jobs may be deployed as an online service.
›
›
›
›
23. How is it implemented?
Collibra Reader
Utilizing Collibra Rest Api v2 to read Assets of a
certain type.
It can read meta-information of an Asset as well as its
Attributes and Relations.
Each Asset is represented by one row in the DQE
processing.
Collibra Writer
Utilizing Collibra Import api to create or update Assets
in Collibra.
It creates Collibra import jobs from rows on the step
input.
It can write Assets, Attributes and Relations.
›
›
›
›
›
›
24. Why Data Quality Engine?
PROS
Already contains support for HTTP calls.
It is easily extendable.
It serves as an integration tool or backend processor
for all Ataccama installations.
It can work without any front-end.
CONS
The configuration is not business friendly.
Very complex use-cases might be hard to implement.
›
›
›
›
›
›
25. DQ Issue
Tracker
DQC Engine
DQ
Dashboard
Exports
Retrieve data
for DQ processing
1
Summarized DQ Results
pulled by DGC
2
3 Summary
DQ Reports
4 DQ Issues for Manual
Resolution
5 Data Corrections & Extensions
of the DQ rules
Cleansed & Merged data
(exports)
6
Collibra DGC
Reference
Data Manager
7
Why Data Quality Engine?
26. DQM Use Case
DQC Engine Collibra DGC
Ataccama
DQM
Retrieving metadata
From Collibra. 1
3 Sending the DQM
Results back to Collibra.
2
Retrieving Data
for DQM processing.
27. Issue Tracking Use Use
DQC Engine Collibra DGC
Send Recorded
DQ Issues for resolution. 1
2 Update the resolution
status
DQ Issue
Tracker