Forces and Threats in a Data Warehouse (and why metadata and architecture is important)

Stefan Urbanek
Stefan Urbaneksoftware engineer at Facebook
FORCES AND THREATS
or Why Architecture and Metadata are Important?
Štefan Urbánek
stefan.urbanek@gmail.com
@Stiivi DataNatives, November 2019
OUTLINE
▪︎ Forces and Threats
▪︎ Traditional Suffering
▪︎ Damping the Forces and Threats
▪︎ How to start?
▪︎ Conclusions
Data warehouse – a concept,
not a technology.
X Y Z
FORCES AND THREATS
Data Warehouse Forces and Threats:
interactions that, when unopposed,
will change the motion of
technical, economical, social,
organisational and process structures
with high potential of causing suffering
CHANGE
force
Data Warehouse
Technology
* you wish!
?
?
?
?
?
**
CHANGE
force
Data Warehouse
Technology
?
?
?
?
?
1995
2000
2005
2010
2015
GROWTH
force
+
+
+
+
+
+
+
+
structure
volume
+
+
COMPLEXITY
force
…
potential relationship
ownership definition
production
COMPLEXITY
force
label/definition
consumer, owner
n2 potential relationships* *only between tables,
not even mentioning columns …
What is relevant?
THREATS*
Mistakes in data
might lead not only to wrong business decisions,
but might also have
legal, financial or existential implications.
*serious and real
TRADITIONAL SUFFERING
usually chronic
SUFFERING
▪︎ Bad consistency and no transparency
No definitions, too many definitions, obscure definitions. Vague opinions in production.
▪︎ Slow time-to-market
Time from a requirement or from observing a change to deployment in the production takes too
much time.
▪︎ Low performance
… despite having the best hardware, systems, algorithms.
(some of it)
How do we know, the data we are looking at
is the data we think we are looking at?
PERFORMANCE
We solved “CPU starvation” problem!!!
Why are our [internal] clients getting data 48-72 hours later?
! 20-30x "
⨝
⨝
∑
⨝ ⨝ …⨝
quite big quite a lot
⨝ ⨝
⨝ ⨝
⨝
⨝⨝
⨝
⨝
⨝
stand-alone ETL process/script
probably the same, who knows?
IS ∑
IS ∑
uncontrolled growth
DAMPING THE FORCES
AND THREATS
ARCHITECTURE    +    METADATA
separation of concerns
and reduction of complexity potential
reduction and annotation
of problem space
and facilitation of reasoning
∑ A→B
ARCHITECTURE
Data Warehouse
“Agreed-upon Analytical Truth”
Metadata
Sandbox/Playground
Staging ~1:1 “Cleaned Augmented
Operational Reality”
Sources
Quality Assurance
Humans
Cubes, Cuboids and Aggregates
Machines
External Data
Platform Data
Regulated Data
∑
∑
proof-of-concepts, ad-hoc analysis
business rules
“typed tables”
External API
decision making
automation
analytics-augmented
application
3rd
NF, , ❄, …
data scientists
decision makers
financial
datamart(s)
quality indicators
src tgt
ownershipdata models transformations
∑ A→B ?
…
DataWarehouse
“Agreed-uponAnalyticalTruth”
Metadata
Sandbox/Playground
Staging~1:1“CleanedAugmented
OperationalReality”
Sources
QualityAssurance
Humans
Cubes,CuboidsandAggregates
Machines
ExternalData
PlatformData
RegulatedData
∑
∑
proof-of-concepts,ad-hocanalysis
businessrules
“typedtables”
ExternalAPI
decisionmaking
automation
analytics-augmented
application
3rd
NF,,❄,…
datascientists
decisionmakers
financial
datamart(s)
qualityindicators
srctgt
ownershipdatamodelstransformations
∑A→B?
…
METADATA
METADATA
▪︎ Data Warehouse Assets
concepts, entities, attributes, definitions,
business rules, quality indicators,
concept ownerships, …
▪︎ Many Perspectives
conceptual, logical, physical, multi-dimensional, security, …
▪︎ Formalised, stored, shared, used
revenue??
?
visits
customers
logical → physical
multidimensional hierarchical → 3rd normal form logical
3rd normal form logical → physical
query → precomputed + computed
denormalisation → joins
physical → logical
… → …
Metadata Processing
Metadata
data models
transformations
∑ A→B
Compose Compile SQL
physical schema
dialect
reality
semantics
realisation
relational algebra
Execute
A,B,C
parameters
.cob
.java
.py
HOW?
STARTING WITH ARCHITECTURE
1. Pick one:
If in doubt – any known to work. Any separation of concerns is better than none.
2. Make it formal and documented.
Otherwise our effort will be dissolved and the content swampified.
3. Stick with it for a while and observe.
4. Adjust as necessary.
STARTING WITH METADATA
1. Pick a problem
2. Use a spreadsheet
Software at hand, no installation needed; universal, readable and editable by non-engineers.
3. Suffer through the spreadsheet-exchange drill phase
Mirror of our processes – seeing the genuine pain points will be useful later.
4. Use functional approach to metadata composition and application
… from those spreadsheets. Example: relational algebra library in the language of our ecosystem.
99.(later) Move spreadsheets into a metadata repository
“HELLO METADATA” PROBLEMS
▪︎ Data quality indicators1
▪︎ Structural (model ↔ schema) consistency check1
▪︎ Automation of common patterns
denormalisation, aggregation, pivot
▪︎ Automate “relationalization” of freely-structured data
JSON → relational
▪︎ Browsability
1non-invasive, non-destructive
Doing Things To Data
Doing More Things
To Data
…
Doing Things To Data
Doing More Things
To Data
…
Pipelines without metadata
Pipelines with metadata
metadata
data
DATA QUALITY INDICATORS
Doing Things To Data
Doing More Things
To Data
…
metadata
data quality
measurementsdata quality
indicators
data
metadata
definition, computation, warning/error thresholds,
ownership, affected business entity, …
COMMON PATTERNS
Automatically Generated Artefacts
Metadata
Manually Crafted Artefacts
IS ∑
denormalize
aggregate
pivot
patterns
∑
controlled growth
probably the same, who knows?
IS ∑
IS ∑
uncontrolled growth
VISUALISATION AND EXPLORATION
Browse-ability: How can we explore a metric? How can we drill down?
User Interface
Metadata
Physical Data
Region
…
name
Sales
Revenue
Visits
…
…
3
2
1
id
Cubes
Geography
…
name
Date
2
…
id …
1
Dimensions
Europe
Germany
Berlin
regions
Country
City
Levels
2 region_code
country_name
…
2
Country
3
country_iso
1
key
Region
…nameid
City
2
dim label
2
region_name
city_namecity_code
…
countries
cities
generated
which column?
concept-to-user propagation
GET /cube/sales/aggregate? cut=date:2010
& split=status:1&drilldown=date|region
& page=10 page_size=100&
SQL
→
Metadata
Logical Model Physical
Physical Data Store
Query ContextInput
Output
Cube
all attributes
base attributes
⨝ joins
database
metadata
Store
Mapper
locale
parameters
create schema
collect and sort
dependencies
map attributesmappings
mappings of
base attributes
fact table
naming
convention
hierarchies
Star Schema
/❄
compile attributes
base attributesdependant
attributes
columns
make star
(topological sort)
query attributes
SQL Query Contextcreate context
base columns
column expressions for attributes
SELECT, GROUP BY
“star” join statement
FROM
conditions
WHERE
Cubes 1.1 – SQL Query Construction
A,B,C?
SQL
TRANSPARENT REPRESENTATIONS
Physical Data Store(s)
Pre-Aggregated
3
rd
Normal Form
source of truth
derived and managed artefacts
Metadata
∑
∑
∑
∑
Multi-Dimensional
Query Server
∑ Aggregator
metadata repository
past 12 months
?
⨝s are expensive
Alternative artefacts: a multi-dimensional data store
CONCLUSIONS
SHIELD AGAINST FORCES AND THREATS
▪︎ Change
▪︎ Growth (structural)
▪︎ Complexity
▪︎ Threats
financial, legal, existential
Force/Threat Architecture Metadata
Change separation of concerns abstraction, generalisation
Growth (structural) separation of concerns, modularity
optimisation through better
reasoning
Complexity separation of concerns, destroy-ability
reduction of problem-space,
coping with heterogeneity
Threats transparency, separation of quality
data accounting,
verifiable data quality,
provable consistency,
source of truth
There is path out of the suffering caused by the data warehouse
forces and threats:
The “shield” of architecture and metadata.
THANK YOU
Štefan Urbánek
stefan.urbanek@gmail.com
@Stiivi
1 of 40

Recommended

Data Lakehouse, Data Mesh, and Data Fabric (r1) by
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)James Serra
5.5K views27 slides
The delta architecture by
The delta architectureThe delta architecture
The delta architecturePrakash Chockalingam
540 views41 slides
Data Mesh by
Data MeshData Mesh
Data MeshPiethein Strengholt
3.2K views50 slides
Spark (Structured) Streaming vs. Kafka Streams by
Spark (Structured) Streaming vs. Kafka StreamsSpark (Structured) Streaming vs. Kafka Streams
Spark (Structured) Streaming vs. Kafka StreamsGuido Schmutz
5K views41 slides
Kafka Tutorial - introduction to the Kafka streaming platform by
Kafka Tutorial - introduction to the Kafka streaming platformKafka Tutorial - introduction to the Kafka streaming platform
Kafka Tutorial - introduction to the Kafka streaming platformJean-Paul Azar
1.9K views85 slides
Democratizing Data Quality Through a Centralized Platform by
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDatabricks
1.4K views36 slides

More Related Content

What's hot

Enterprise Data Architecture Deliverables by
Enterprise Data Architecture DeliverablesEnterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesLars E Martinsson
28.6K views16 slides
Modern Data Architecture by
Modern Data Architecture Modern Data Architecture
Modern Data Architecture Mark Hewitt
244 views15 slides
Architect’s Open-Source Guide for a Data Mesh Architecture by
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureDatabricks
3.1K views48 slides
Making Apache Spark Better with Delta Lake by
Making Apache Spark Better with Delta LakeMaking Apache Spark Better with Delta Lake
Making Apache Spark Better with Delta LakeDatabricks
5.4K views40 slides
Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ... by
Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ...Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ...
Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ...DevGAMM Conference
97 views44 slides
Big Data Architecture and Design Patterns by
Big Data Architecture and Design PatternsBig Data Architecture and Design Patterns
Big Data Architecture and Design PatternsJohn Yeung
849 views60 slides

What's hot(20)

Enterprise Data Architecture Deliverables by Lars E Martinsson
Enterprise Data Architecture DeliverablesEnterprise Data Architecture Deliverables
Enterprise Data Architecture Deliverables
Lars E Martinsson28.6K views
Modern Data Architecture by Mark Hewitt
Modern Data Architecture Modern Data Architecture
Modern Data Architecture
Mark Hewitt244 views
Architect’s Open-Source Guide for a Data Mesh Architecture by Databricks
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
Databricks3.1K views
Making Apache Spark Better with Delta Lake by Databricks
Making Apache Spark Better with Delta LakeMaking Apache Spark Better with Delta Lake
Making Apache Spark Better with Delta Lake
Databricks5.4K views
Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ... by DevGAMM Conference
Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ...Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ...
Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ...
Big Data Architecture and Design Patterns by John Yeung
Big Data Architecture and Design PatternsBig Data Architecture and Design Patterns
Big Data Architecture and Design Patterns
John Yeung849 views
Unpacking TOGAF's 'Phase B': Business Transformation, Business Architecture a... by Tetradian Consulting
Unpacking TOGAF's 'Phase B': Business Transformation, Business Architecture a...Unpacking TOGAF's 'Phase B': Business Transformation, Business Architecture a...
Unpacking TOGAF's 'Phase B': Business Transformation, Business Architecture a...
Tetradian Consulting13.9K views
Data Lakehouse Symposium | Day 4 by Databricks
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
Databricks1.8K views
Adopting a Process-Driven Approach to Master Data Management by Software AG
Adopting a Process-Driven Approach to Master Data ManagementAdopting a Process-Driven Approach to Master Data Management
Adopting a Process-Driven Approach to Master Data Management
Software AG7.6K views
Improving Data Literacy Around Data Architecture by DATAVERSITY
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data Architecture
DATAVERSITY973 views
Snowflake: Your Data. No Limits (Session sponsored by Snowflake) - AWS Summit... by Amazon Web Services
Snowflake: Your Data. No Limits (Session sponsored by Snowflake) - AWS Summit...Snowflake: Your Data. No Limits (Session sponsored by Snowflake) - AWS Summit...
Snowflake: Your Data. No Limits (Session sponsored by Snowflake) - AWS Summit...
Amazon Web Services2.1K views
The ABCs of Treating Data as Product by DATAVERSITY
The ABCs of Treating Data as ProductThe ABCs of Treating Data as Product
The ABCs of Treating Data as Product
DATAVERSITY966 views
How to Use a Semantic Layer to Deliver Actionable Insights at Scale by DATAVERSITY
How to Use a Semantic Layer to Deliver Actionable Insights at ScaleHow to Use a Semantic Layer to Deliver Actionable Insights at Scale
How to Use a Semantic Layer to Deliver Actionable Insights at Scale
DATAVERSITY406 views
Five Things to Consider About Data Mesh and Data Governance by DATAVERSITY
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data Governance
DATAVERSITY1.8K views
Enabling a Data Mesh Architecture with Data Virtualization by Denodo
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data Virtualization
Denodo 536 views
Open core summit: Observability for data pipelines with OpenLineage by Julien Le Dem
Open core summit: Observability for data pipelines with OpenLineageOpen core summit: Observability for data pipelines with OpenLineage
Open core summit: Observability for data pipelines with OpenLineage
Julien Le Dem1.2K views
Data Warehouse or Data Lake, Which Do I Choose? by DATAVERSITY
Data Warehouse or Data Lake, Which Do I Choose?Data Warehouse or Data Lake, Which Do I Choose?
Data Warehouse or Data Lake, Which Do I Choose?
DATAVERSITY807 views
Free Training: How to Build a Lakehouse by Databricks
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a Lakehouse
Databricks3.4K views
Building an open data platform with apache iceberg by Alluxio, Inc.
Building an open data platform with apache icebergBuilding an open data platform with apache iceberg
Building an open data platform with apache iceberg
Alluxio, Inc.564 views

Similar to Forces and Threats in a Data Warehouse (and why metadata and architecture is important)

How Can Analytics Improve Business? by
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?Inside Analysis
617 views50 slides
Come diventare data scientist - Paolo Pellegrini by
Come diventare data scientist - Paolo PellegriniCome diventare data scientist - Paolo Pellegrini
Come diventare data scientist - Paolo PellegriniDonatella Cambosu
2.9K views50 slides
(Big) Data infographic - EnjoyDigitAll by BNP Paribas by
(Big) Data infographic - EnjoyDigitAll by BNP Paribas(Big) Data infographic - EnjoyDigitAll by BNP Paribas
(Big) Data infographic - EnjoyDigitAll by BNP ParibasEnjoyDigitAll by BNP Paribas
47.6K views1 slide
Building the Artificially Intelligent Enterprise by
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseDatabricks
254 views59 slides
Accelerate Self-Service Analytics with Data Virtualization and Visualization by
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationDenodo
340 views56 slides
Day 2 aziz apj aziz_big_datakeynote_press by
Day 2 aziz apj aziz_big_datakeynote_pressDay 2 aziz apj aziz_big_datakeynote_press
Day 2 aziz apj aziz_big_datakeynote_pressIntelAPAC
1.6K views22 slides

Similar to Forces and Threats in a Data Warehouse (and why metadata and architecture is important)(20)

How Can Analytics Improve Business? by Inside Analysis
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?
Inside Analysis617 views
Come diventare data scientist - Paolo Pellegrini by Donatella Cambosu
Come diventare data scientist - Paolo PellegriniCome diventare data scientist - Paolo Pellegrini
Come diventare data scientist - Paolo Pellegrini
Donatella Cambosu2.9K views
Building the Artificially Intelligent Enterprise by Databricks
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
Databricks254 views
Accelerate Self-Service Analytics with Data Virtualization and Visualization by Denodo
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Denodo 340 views
Day 2 aziz apj aziz_big_datakeynote_press by IntelAPAC
Day 2 aziz apj aziz_big_datakeynote_pressDay 2 aziz apj aziz_big_datakeynote_press
Day 2 aziz apj aziz_big_datakeynote_press
IntelAPAC1.6K views
Building a New Platform for Customer Analytics by Caserta
Building a New Platform for Customer Analytics Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics
Caserta 956 views
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN) by Denodo
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Denodo 51 views
Knowledge Graphs Webinar- 11/7/2017 by Neo4j
Knowledge Graphs Webinar- 11/7/2017Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017
Neo4j2.6K views
Building Resiliency and Agility with Data Virtualization for the New Normal by Denodo
Building Resiliency and Agility with Data Virtualization for the New NormalBuilding Resiliency and Agility with Data Virtualization for the New Normal
Building Resiliency and Agility with Data Virtualization for the New Normal
Denodo 65 views
Introduction to AutoML and Data Science using the Oracle Autonomous Database ... by Sandesh Rao
Introduction to AutoML and Data Science using the Oracle Autonomous Database ...Introduction to AutoML and Data Science using the Oracle Autonomous Database ...
Introduction to AutoML and Data Science using the Oracle Autonomous Database ...
Sandesh Rao371 views
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio... by Cambridge Semantics
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Cambridge Semantics3.7K views
The 3 Key Barriers Keeping Companies from Deploying Data Products by Dataiku
The 3 Key Barriers Keeping Companies from Deploying Data Products The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products
Dataiku1.4K views
Big data in marketing at harvard business club nick1 june 15 2013 by nkabra
Big data in marketing at harvard business club nick1 june 15 2013Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013
nkabra1.1K views
Building a Data Strategy – Practical Steps for Aligning with Business Goals by DATAVERSITY
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
DATAVERSITY1K views

More from Stefan Urbanek

StepTalk Introduction by
StepTalk IntroductionStepTalk Introduction
StepTalk IntroductionStefan Urbanek
433 views16 slides
Sepro - introduction by
Sepro - introductionSepro - introduction
Sepro - introductionStefan Urbanek
2.1K views41 slides
New york data brewery meetup #1 – introduction by
New york data brewery meetup #1 – introductionNew york data brewery meetup #1 – introduction
New york data brewery meetup #1 – introductionStefan Urbanek
2.5K views42 slides
Cubes 1.0 Overview by
Cubes 1.0 OverviewCubes 1.0 Overview
Cubes 1.0 OverviewStefan Urbanek
4.5K views79 slides
Cubes – pluggable model explained by
Cubes – pluggable model explainedCubes – pluggable model explained
Cubes – pluggable model explainedStefan Urbanek
17.1K views35 slides
Cubes – ways of deployment by
Cubes – ways of deploymentCubes – ways of deployment
Cubes – ways of deploymentStefan Urbanek
3.2K views9 slides

More from Stefan Urbanek(20)

New york data brewery meetup #1 – introduction by Stefan Urbanek
New york data brewery meetup #1 – introductionNew york data brewery meetup #1 – introduction
New york data brewery meetup #1 – introduction
Stefan Urbanek2.5K views
Cubes – pluggable model explained by Stefan Urbanek
Cubes – pluggable model explainedCubes – pluggable model explained
Cubes – pluggable model explained
Stefan Urbanek17.1K views
Cubes – ways of deployment by Stefan Urbanek
Cubes – ways of deploymentCubes – ways of deployment
Cubes – ways of deployment
Stefan Urbanek3.2K views
Knowledge Management Lecture 4: Models by Stefan Urbanek
Knowledge Management Lecture 4: ModelsKnowledge Management Lecture 4: Models
Knowledge Management Lecture 4: Models
Stefan Urbanek36.6K views
Dallas Data Brewery Meetup #2: Data Quality Perception by Stefan Urbanek
Dallas Data Brewery Meetup #2: Data Quality PerceptionDallas Data Brewery Meetup #2: Data Quality Perception
Dallas Data Brewery Meetup #2: Data Quality Perception
Stefan Urbanek1.1K views
Dallas Data Brewery - introduction by Stefan Urbanek
Dallas Data Brewery - introductionDallas Data Brewery - introduction
Dallas Data Brewery - introduction
Stefan Urbanek953 views
Bubbles – Virtual Data Objects by Stefan Urbanek
Bubbles – Virtual Data ObjectsBubbles – Virtual Data Objects
Bubbles – Virtual Data Objects
Stefan Urbanek76.4K views
Python business intelligence (PyData 2012 talk) by Stefan Urbanek
Python business intelligence (PyData 2012 talk)Python business intelligence (PyData 2012 talk)
Python business intelligence (PyData 2012 talk)
Stefan Urbanek22.4K views
Cubes - Lightweight Python OLAP (EuroPython 2012 talk) by Stefan Urbanek
Cubes - Lightweight Python OLAP (EuroPython 2012 talk)Cubes - Lightweight Python OLAP (EuroPython 2012 talk)
Cubes - Lightweight Python OLAP (EuroPython 2012 talk)
Stefan Urbanek12.9K views
Knowledge Management Lecture 3: Cycle by Stefan Urbanek
Knowledge Management Lecture 3: CycleKnowledge Management Lecture 3: Cycle
Knowledge Management Lecture 3: Cycle
Stefan Urbanek34K views
Knowledge Management Lecture 2: Individuals, communities and organizations by Stefan Urbanek
Knowledge Management Lecture 2: Individuals, communities and organizationsKnowledge Management Lecture 2: Individuals, communities and organizations
Knowledge Management Lecture 2: Individuals, communities and organizations
Stefan Urbanek5.2K views
Knowledge Management Lecture 1: definition, history and presence by Stefan Urbanek
Knowledge Management Lecture 1: definition, history and presenceKnowledge Management Lecture 1: definition, history and presence
Knowledge Management Lecture 1: definition, history and presence
Stefan Urbanek25.5K views
Cubes - Lightweight OLAP Framework by Stefan Urbanek
Cubes - Lightweight OLAP FrameworkCubes - Lightweight OLAP Framework
Cubes - Lightweight OLAP Framework
Stefan Urbanek5.8K views
Data Cleansing introduction (for BigClean Prague 2011) by Stefan Urbanek
Data Cleansing introduction (for BigClean Prague 2011)Data Cleansing introduction (for BigClean Prague 2011)
Data Cleansing introduction (for BigClean Prague 2011)
Stefan Urbanek4.9K views
Knowledge Management Introduction by Stefan Urbanek
Knowledge Management IntroductionKnowledge Management Introduction
Knowledge Management Introduction
Stefan Urbanek1.9K views

Recently uploaded

Listed Instruments Survey 2022.pptx by
Listed Instruments Survey  2022.pptxListed Instruments Survey  2022.pptx
Listed Instruments Survey 2022.pptxsecretariat4
121 views12 slides
Custom Tag Manager Templates by
Custom Tag Manager TemplatesCustom Tag Manager Templates
Custom Tag Manager TemplatesMarkus Baersch
30 views17 slides
Shreyas hospital statistics.pdf by
Shreyas hospital statistics.pdfShreyas hospital statistics.pdf
Shreyas hospital statistics.pdfsamithavinal
5 views9 slides
Dr. Ousmane Badiane-2023 ReSAKSS Conference by
Dr. Ousmane Badiane-2023 ReSAKSS ConferenceDr. Ousmane Badiane-2023 ReSAKSS Conference
Dr. Ousmane Badiane-2023 ReSAKSS ConferenceAKADEMIYA2063
5 views34 slides
apple.pptx by
apple.pptxapple.pptx
apple.pptxhoneybeeqwe
6 views15 slides
Data about the sector workshop by
Data about the sector workshopData about the sector workshop
Data about the sector workshopinfo828217
29 views27 slides

Recently uploaded(20)

Listed Instruments Survey 2022.pptx by secretariat4
Listed Instruments Survey  2022.pptxListed Instruments Survey  2022.pptx
Listed Instruments Survey 2022.pptx
secretariat4121 views
Shreyas hospital statistics.pdf by samithavinal
Shreyas hospital statistics.pdfShreyas hospital statistics.pdf
Shreyas hospital statistics.pdf
samithavinal5 views
Dr. Ousmane Badiane-2023 ReSAKSS Conference by AKADEMIYA2063
Dr. Ousmane Badiane-2023 ReSAKSS ConferenceDr. Ousmane Badiane-2023 ReSAKSS Conference
Dr. Ousmane Badiane-2023 ReSAKSS Conference
AKADEMIYA20635 views
Data about the sector workshop by info828217
Data about the sector workshopData about the sector workshop
Data about the sector workshop
info82821729 views
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion by Bertram Ludäscher
Games, Queries, and Argumentation Frameworks: Time for a Family ReunionGames, Queries, and Argumentation Frameworks: Time for a Family Reunion
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion
LIVE OAK MEMORIAL PARK.pptx by ms2332always
LIVE OAK MEMORIAL PARK.pptxLIVE OAK MEMORIAL PARK.pptx
LIVE OAK MEMORIAL PARK.pptx
ms2332always7 views
Customer Data Cleansing Project.pptx by Nat O
Customer Data Cleansing Project.pptxCustomer Data Cleansing Project.pptx
Customer Data Cleansing Project.pptx
Nat O6 views
4_4_WP_4_06_ND_Model.pptx by d6fmc6kwd4
4_4_WP_4_06_ND_Model.pptx4_4_WP_4_06_ND_Model.pptx
4_4_WP_4_06_ND_Model.pptx
d6fmc6kwd47 views
Product Research sample.pdf by AllenSingson
Product Research sample.pdfProduct Research sample.pdf
Product Research sample.pdf
AllenSingson33 views
Data Journeys Hard Talk workshop final.pptx by info828217
Data Journeys Hard Talk workshop final.pptxData Journeys Hard Talk workshop final.pptx
Data Journeys Hard Talk workshop final.pptx
info82821711 views
Best Home Security Systems.pptx by mogalang
Best Home Security Systems.pptxBest Home Security Systems.pptx
Best Home Security Systems.pptx
mogalang9 views
DGST Methodology Presentation.pdf by maddierlegum
DGST Methodology Presentation.pdfDGST Methodology Presentation.pdf
DGST Methodology Presentation.pdf
maddierlegum7 views

Forces and Threats in a Data Warehouse (and why metadata and architecture is important)