NoSQL and Data Modeling for Data Modelers

Karen Lopez, Sr. Project Manager and Architect at InfoAdvisors, www.datamodel.com
Big Data, NoSQL & Data Modeling
10 Tips for Data Modeling Success on Modern Data Projects
Karen Lopez, InfoAdvisors
www.datamodel.com
Data Models – Traditional Process
[Diagram: Conceptual (Data) Model, Logical Data Model, Physical Data Model(s), with multiple OLTP and MART targets]
Aug 2014©InfoAdvisors - infoadvisors.com
Relational
Data models started with relational modeling, so they look like relational database structures.
But….
That doesn’t mean they can’t be used to model data that goes into a non-relational format.
All that formatting happens at build OR consumption time, not requirements time.
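A minimal sketch of that idea, not taken from the slides: one logical entity is defined once, and the target format is chosen only at build time. The `Entity` and `Attribute` classes, the `Customer` example, and the type mapping are all hypothetical names invented for illustration.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class Attribute:
    name: str
    logical_type: str  # hypothetical logical types: "text" or "integer"


@dataclass(frozen=True)
class Entity:
    name: str
    attributes: tuple


# Assumed mapping from logical types to one relational dialect's types.
SQL_TYPES = {"text": "VARCHAR(100)", "integer": "INTEGER"}


def to_relational_ddl(entity):
    """Render the logical entity as a relational CREATE TABLE statement."""
    cols = ",\n".join(
        f"  {a.name} {SQL_TYPES[a.logical_type]}" for a in entity.attributes
    )
    return f"CREATE TABLE {entity.name} (\n{cols}\n);"


def to_document(entity, values):
    """Render one instance of the same entity as a JSON document."""
    body = {a.name: values[a.name] for a in entity.attributes}
    return json.dumps({"_type": entity.name, **body})


customer = Entity(
    "Customer",
    (Attribute("customer_id", "integer"), Attribute("full_name", "text")),
)

ddl = to_relational_ddl(customer)
doc = to_document(customer, {"customer_id": 1, "full_name": "Ada Example"})
print(ddl)
print(doc)
```

The requirements (entity, attributes, logical types) are captured once; the relational and document renderings are separate build-time decisions.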
The Big Data Story
Lots of data
Coming at us fast
Lots of variety in format & quality
We want all the data
Highly available
“It’s web scale”
What do we really mean by scale?
Bringing computing to the data
Massively parallel processing
Cheap, commodity hardware, but lots of it
Optimized for Query/Reads/Questions/Telling stories
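As a rough illustration of "bringing computing to the data", the sketch below counts values inside each partition and moves only the small partial results. The partitions and the `count_local` and `count_all` functions are hypothetical; a thread pool stands in for a cluster of commodity nodes.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Hypothetical partitions: each list stands in for data stored on one node.
partitions = [
    ["ON", "NY", "ON"],
    ["NY", "TX"],
    ["ON", "TX", "TX"],
]


def count_local(rows):
    """Runs next to one partition and returns only a small summary."""
    return Counter(rows)


def count_all(parts):
    """Run the local step on every partition in parallel, then merge."""
    with ThreadPoolExecutor() as pool:
        partials = list(pool.map(count_local, parts))
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


totals = count_all(partitions)
print(dict(totals))
```

Only the per-partition counts cross the "network" here, not the raw rows, which is the point of the pattern.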
We’ve been down this road before…
Traditional transactional applications
Reporting-optimized tables/structures
Data Warehouse / Dimensional Modeling
(Spectrum: highly normalized to highly denormalized)
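To make the two ends of that spectrum concrete, here is a small sketch with made-up data: normalized customer and order structures on one side, and a flattened, reporting-optimized structure derived from them on the other. All names and values are hypothetical.

```python
# Normalized: each fact is stored once, and orders reference customers by key.
customers = {
    1: {"name": "Ada", "region": "East"},
    2: {"name": "Grace", "region": "West"},
}
orders = [
    {"order_id": 10, "customer_id": 1, "amount": 25.0},
    {"order_id": 11, "customer_id": 2, "amount": 40.0},
    {"order_id": 12, "customer_id": 1, "amount": 15.0},
]


def denormalize(order_rows, customer_lookup):
    """Copy customer attributes onto every order row (read-optimized)."""
    flat_rows = []
    for order in order_rows:
        customer = customer_lookup[order["customer_id"]]
        flat_rows.append(
            {**order, "customer_name": customer["name"], "region": customer["region"]}
        )
    return flat_rows


def total_by_region(rows):
    """A reporting question answered without any join at read time."""
    totals = {}
    for row in rows:
        totals[row["region"]] = totals.get(row["region"], 0.0) + row["amount"]
    return totals


flat = denormalize(orders, customers)
print(total_by_region(flat))
```

The flat rows repeat customer data on purpose: the trade is extra storage and update effort for simpler, faster reads.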
[Diagram: ETL, EDW, Data Mart, Data Mart]
[Diagram: Hadoop, ETL, EDW, Analytics Mart, Data Mart]
NoSQL, Not Only SQL
Databases:
Relational
Graph
Columnar/Column Family
Key Value
Document
Others
Sample Hive Statement
CREATE EXTERNAL TABLE TaxRebateUsage (
  state string,
  zipcode string,
  agi_class int,
  n1 int,
  mars2 int,
  prep int,
  n2 int
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
STORED AS TEXTFILE;
Sample JSON/MongoDB Notation
Sample FoundationDB Statement
Sample Cassandra Statement
Sample Vertica Statement
Sample Neo4j Statement
Those weren’t SCHEMALESS….
They had data facts, which had meanings, and sometimes expected formats, precisions, and types.
In the NoSQL world, we don’t necessarily apply those at write time, but at read time.
SCHEMALESS really is MULTIPLE SCHEMAs (Polyschematic) or VARYING SCHEMAs.
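A minimal sketch of schema on read, assuming three stored documents whose shapes vary. The field names borrow from the Hive sample above, but the documents and the `read_with_schema` function are invented for illustration. Nothing is enforced at write time; the reader applies the expected names and types.

```python
import json

# Hypothetical stored documents: written without enforcement, so shapes vary.
raw = [
    '{"zipcode": "10001", "agi_class": 3}',
    '{"zip": "94105", "agi_class": "2", "state": "CA"}',
    '{"zipcode": 60601}',
]


def read_with_schema(line):
    """Apply the expected names and types when the data is read."""
    doc = json.loads(line)
    zipcode = doc.get("zipcode", doc.get("zip"))  # tolerate a varying field name
    agi_class = doc.get("agi_class")
    return {
        "zipcode": str(zipcode) if zipcode is not None else None,
        "agi_class": int(agi_class) if agi_class is not None else None,
        "state": doc.get("state"),
    }


records = [read_with_schema(line) for line in raw]
for record in records:
    print(record)
```

Each document still has a schema; there are simply several of them, and the reader reconciles them.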
The Big Data Big Lies
Schemaless
• Schema on Read, not Schema on Write
• Polyschematic
Big
• New data stories
• New technologies
• Not just volume
10 Tips for Modeling in a Hybrid World
1. Models require a modeler.
2. Data modeling tools are essential.
3. There are many types of data models: know which ones you need.
4. Modeling does not have to happen at the same time in every project. It should happen at the right time.
5. Modeling is not just schema design. Think outside the boxes and lines.
6. A data model is much more than a diagram.
7. You will need training.
8. Team members may not understand modeling. They will need training.
9. NoSQL is not one thing. Learn many patterns.
10. Modern data architectures are likely hybrid solutions. You can’t just support one part.
What does this mean for data modelers?
There will be jobs for traditional, ERD, relational modelers….
….just like there are still jobs for RPG and COBOL programmers.
All data has a data story. Many data stories.
A good modeler is an architect at heart, finding the right solution for the data story.
Business Intelligence Journal
Look for the September 2014 issue: article on Modern Data Architectures
Thank You!
www.infoadvisors.com
www.datamodel.com
www.dataversity.net
community.embarcadero.com
#TEAMDATA