SlideShare a Scribd company logo
1 of 29
Enterprise Architect, MongoDB
Brian D. Goodman
Creating a single view:
Overview and data analysis
What Is He Going To Talk About?
What are we trying to do?
How do approach doing it?
How do you navigate to success?
Overview &
Data Analysis
Data Design &
Loading
Strategies
Securing Your
Deployment
Creating A Single
View
Part
1
Part
2
Part
3
A data platform is common process, tooling
and management from ingestion to
presentation
• Integrating technologies
• Streamlining ingestion
• Centralizing access
• Supporting analytics
I have sat in your chair looking for answers
Source: Elephant by James Fujii @ http://jamesfujiistoryartist.blogspot.com/2010/05/dancing-elephant-april-2010.html
Source: Gorilla by Luigi Lucarelli @ http://loaduniverse.blogspot.com/2012/04/gorilla-sketch.html
13+ years 1.5 years Present
ID, name, address, phone …
ID, name, description, qty
…
ID, ID_PRODUCT,
ID_CUSTOMER, QTY, PRICE …
127.0.0.1 - user
[10/Oct/2000:13:55:36 -0700]
"GET /product.html HTTP/1.0"
200 2326
10577, 41.041731, -
73.70972, Purchase, New
York, 3454, 183686, 4
Apple, 0.938748, mixed,
[ Company, Brand, … ],
[ dbpedia, freebase, yago, website
]
China, 0.519803, mixed,
[ Country, Location, Admin_Div. … ],
[ dbpedia, freebase … ]
Single view use cases abound
Financial Services
Creating a single view
of a product, trader,
client
Generating
recommendations to
sales people
Identifying anomalous
patterns
Data sources: client
profile, trade data,
market data, service
catalog
Retail
Creating a single view
of a product or customer
Generating
recommendations to
customers
Proactive, personal
marketing to customers
Data sources: customer
profile, loyalty program,
inventory, product
catalog
Healthcare
Creating 360-degree
patient view
Managing electronic
health-care records for
patients
Population management
for at-risk demographics
Data sources: patient
records, sensor data,
care center metrics
High-level architecture for a Single View
Zooming in to Single View backend
Traditional mental model
MongoDB mental model
Getting to a Single View of the customer
Calculation using the
aggregation framework
MapReduce
Analytics
Embedding data
in context
Generic
container to
manage all the
contact points
and analytical
observations
Full-text search
for ad hoc query
Geo-indexing for
extending insight
Questions we can now ask …
• What kinds of products has this person purchased?
e.g. distinct( “orders.category”, { “id” : 12345678 } )
• How close are they from a point of service? Now.
e.g. find("location" : { $near : [40.8768, -73.787] } )
• Who is most dissatisfiedwithour service?
e.g. find().sort({ sentiment : 1 } ).limit( 100 )
• What shouldmy customer representative mention in the next conversation?
e.g. find( { “action.topic”: “talkingpoint” } ).sort( createdOn : -1 )
Start focused and iterate quickly
1. Focus on a single effort, a couple of data sources, and a
familiar ‘actor” to avoid being overwhelmed
2. Consider what the data is and a few questions you might ask
of it – this informs the initial data model
3. Prototype, prove then iterate
RDBMs terminology has an equivalent
Source: RDBMS to MongoDB migration guide: http://info.mongodb.com/rs/mongodb/images/RDBMStoMongoDBMigration.pdf
Normalization acts as a decision point to
either embed or continue to link
Source: RDBMS to MongoDB migration guide: http://info.mongodb.com/rs/mongodb/images/RDBMStoMongoDBMigration.pdf
Embedding vs. linking is all about navigating the
compromise between performance & data
duplication
Relationshi
p
Embed Link Example
1 : 1 If atomic ✔ ≈ person to twitter handle
1 : n ✖ ✔ person to click paths
1 : few ✔ ✖ person to content of interest
n : n
For
performance
✔
✔ person to content read
Dynamic schemas enables flexibility and
agility
• Data is coming in many forms
• New data sources surface constantly
• Single-views facilitate analytics and breed more data
• Flexible schemas make it easy to evolve vs. reinvent
• Schemas reflect the access patterns of the data
Source: Sogeti: bigdata-introduction-sogeti-consultingservices-businesstechnology-20130628v5-130723151309-phpapp02.ppt
Modeling a single view of the customer begins
with the types of queries we think we will want
to make
• What is the customer (individual or company)doing that is not normal?
• What are the customer’s peers doing, that they would do, but haven’t?
• What productshas the customer purchased and what is likelihood that they will
buy something new in the next week | thirty days?
• What willthe customer do this week?
• What are customers tellingus about them, us, the world?
• What content / products are effectiveat generating sales?
• Who is influential – both us and customers?
• What does the world look like from our proprietarypointof view vs. the what is
happening in the public?
• Which customers should I focus on, what should I show them and why is it smart?
What do all these crazy formats look like?
Twitter
timeline
Product
catalog
Comments
Content
Binary
Centering around customers enables single
access pattern for fast rich queries and
iterative extension
• Using arrays to allow for multiple relationships
(organizations & locations)
• Tags as a means to flexibly profile
e.g. top_10_percent, badge_of_honor, flight_risk
• Actions array to store analytical outputs that
describe the customer to sales channels and
people e.g. anomalies, recommendations, predictions
• Products array to store products they
customer has purchased and if they publicly
engaged e.g. rating, commenting, tweeting returned etc.
• Products collection to store details
• Product comments reference Customers
A single-view of the customer enables fast,
rich queries and iterative extension
{ "_id" : ObjectId("5063114bd386d8fadbd6b004"),
"name" : "Brian D. Goodman",
"organizations" : ["MongoDB”],
"locations" : [
{ "type" : "work",
"address" : "229 W 34 St., 5th floor",
"city" : "New York",
"state" : "NY",
"zipcode" : "10036"
}
],
"tags" : ["blogger”, "photographer”, "technology”],
"lastAccess" : ISODate("2012-12-19T06:15:33.035Z"),
"actions" : [
{ "type" : "recommendation",
"date" : "…",
"tags" : [
”photography",
”video"
],
"description" : "Zack Arias released new
video!",
"action" : "http://…"
}
],
"products" : [
{ "_id" : ObjectId("5063114bd333d8fadbd6b444"),
"purchased" : ISODate("2012-12-
19T06:15:33.035Z"),
"rated" : 1,
"commented" : 1,
"tweeted" : 0,
"returned" : 0
}
]
Adding streaming customer sentiment to base
model enables new analytics and customer
insight
• Sentiment analytics could be performed
– Across all customer expressions
– Towards the company
– Towards specific products
• An aggregate sentiment score might place the
customer on a positive / negative scale as a
means to identify predisposition and can be
contrasted with sentiment towards the
company
• Sentiment towards products might be stored
in an array, similar to comments, enabling
context specific dispositions
Adding streaming customer sentiment to base
model enables new analytics and customer
insight
{ "_id" : ObjectId("5063114bd386d8fadbd6b004"),
"name" : "Brian D. Goodman",
"organizations" : ["MongoDB”],
"locations" : [
{ "type" : "work",
"address" : "229 W 34 St., 5th floor",
"city" : "New York",
"state" : "NY",
"zipcode" : "10036"
}
],
”sentiment” : {
”personal” : 1,
„towardsCompany” : 0
},
"tags" : ["blogger”, "photographer”, "technology”],
"lastAccess" : ISODate("2012-12-19T06:15:33.035Z"),
"actions" : [
. . .
],
"products" : [
{ "_id" : ObjectId("5063114bd333d8fadbd6b444"),
"purchased" : ISODate("2012-12-19T06:15:33.035Z"),
"rated" : 1,
"commented" : 1,
"tweeted" : 0,
"returned" : 0
}
]
}
So, we have a model, but what if I need more
than one way to look at my data?
• Aggregateinstead of recalculate
Examples: Creating an activity score based on a few variables on the fly
or… Storing a variety of counters, but not the individual records
• Store the original granularity for a detailed query and embed
aggregations
Example: Total number of reviews vs. counting the reviews themselves
• Embedding and linking gives you the best of performance and
management
Example: a master collection for addresses, but an embedded copy for
performance
MongoDB shifts responsibilities to developers
• Enforcing security, auditing, redaction, schema
• Assuming change
• Managing the data holistically
• Balancing the decision to embed vs. link and
the extra management of data that may be required
What does your single-view look like?
• What is at the center of your single-view?
• What questions are you asking of your data?
• How do you plan to get the data in there?
• How do you make sure it doesn’t exit out the backdoor?
• MongoDB Seattle : September 16, 2014
• MongoDB Boston : October 1, 2014
• MongoDB DC : October 14, 2014
• MongoDB SF : December 3, 2014
Coming to you soon…
Register Now: www.mongodb.com/mongodb-coming-you
Use code 25webinar for a 25% discount.
Questions?
Stay tuned after the webinar and take our
survey for your chance to win MongoDB
swag.
Enterprise Architect, MongoDB
Brian D. Goodman
Thank You

More Related Content

What's hot

Best Practices for MongoDB in Today's Telecommunications Market
Best Practices for MongoDB in Today's Telecommunications MarketBest Practices for MongoDB in Today's Telecommunications Market
Best Practices for MongoDB in Today's Telecommunications MarketMongoDB
 
Webinar: How Banks Use MongoDB as a Tick Database
Webinar: How Banks Use MongoDB as a Tick DatabaseWebinar: How Banks Use MongoDB as a Tick Database
Webinar: How Banks Use MongoDB as a Tick DatabaseMongoDB
 
The Connected Consumer – Real-time Customer 360
The Connected Consumer – Real-time Customer 360The Connected Consumer – Real-time Customer 360
The Connected Consumer – Real-time Customer 360Capgemini
 
Customer Experience Digital Data Layer 1.0
Customer Experience Digital Data Layer 1.0 Customer Experience Digital Data Layer 1.0
Customer Experience Digital Data Layer 1.0 Amin Shawki
 
Big Data and MDM altogether: the winning association
Big Data and MDM altogether: the winning associationBig Data and MDM altogether: the winning association
Big Data and MDM altogether: the winning associationJean-Michel Franco
 
Callcenter HPE IDOL overview
Callcenter HPE IDOL overviewCallcenter HPE IDOL overview
Callcenter HPE IDOL overviewTania Akinina
 
Data Modeling and Relational to NoSQL
 Data Modeling and Relational to NoSQL  Data Modeling and Relational to NoSQL
Data Modeling and Relational to NoSQL DATAVERSITY
 
JSON Data Modeling - July 2018 - Tulsa Techfest
JSON Data Modeling - July 2018 - Tulsa TechfestJSON Data Modeling - July 2018 - Tulsa Techfest
JSON Data Modeling - July 2018 - Tulsa TechfestMatthew Groves
 
Why an AI-Powered Data Catalog Tool is Critical to Business Success
Why an AI-Powered Data Catalog Tool is Critical to Business SuccessWhy an AI-Powered Data Catalog Tool is Critical to Business Success
Why an AI-Powered Data Catalog Tool is Critical to Business SuccessInformatica
 
Henninger_MakingReferenceDataMoreMeaningful-Final
Henninger_MakingReferenceDataMoreMeaningful-FinalHenninger_MakingReferenceDataMoreMeaningful-Final
Henninger_MakingReferenceDataMoreMeaningful-FinalScott Henninger
 
What Watson Explorer is and How it works
What Watson Explorer is and How it worksWhat Watson Explorer is and How it works
What Watson Explorer is and How it worksVirginia Fernandez
 
Choosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your BusinessChoosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your BusinessChicago Hadoop Users Group
 
What Does Web 2.0 Mean for Enterprise Search
What Does Web 2.0 Mean for Enterprise SearchWhat Does Web 2.0 Mean for Enterprise Search
What Does Web 2.0 Mean for Enterprise SearchMichael Sampson
 
Thwart Fraud Using Graph-Enhanced Machine Learning and AI
Thwart Fraud Using Graph-Enhanced Machine Learning and AIThwart Fraud Using Graph-Enhanced Machine Learning and AI
Thwart Fraud Using Graph-Enhanced Machine Learning and AINeo4j
 
Webinar: How to Drive Business Value in Financial Services with MongoDB
Webinar: How to Drive Business Value in Financial Services with MongoDBWebinar: How to Drive Business Value in Financial Services with MongoDB
Webinar: How to Drive Business Value in Financial Services with MongoDBMongoDB
 
Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Neo4j
 
Graphs in the Real World
Graphs in the Real WorldGraphs in the Real World
Graphs in the Real WorldNeo4j
 
Slides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
Slides: NoSQL Data Modeling Using JSON Documents – A Practical ApproachSlides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
Slides: NoSQL Data Modeling Using JSON Documents – A Practical ApproachDATAVERSITY
 

What's hot (20)

Best Practices for MongoDB in Today's Telecommunications Market
Best Practices for MongoDB in Today's Telecommunications MarketBest Practices for MongoDB in Today's Telecommunications Market
Best Practices for MongoDB in Today's Telecommunications Market
 
Webinar: How Banks Use MongoDB as a Tick Database
Webinar: How Banks Use MongoDB as a Tick DatabaseWebinar: How Banks Use MongoDB as a Tick Database
Webinar: How Banks Use MongoDB as a Tick Database
 
The Connected Consumer – Real-time Customer 360
The Connected Consumer – Real-time Customer 360The Connected Consumer – Real-time Customer 360
The Connected Consumer – Real-time Customer 360
 
Customer Experience Digital Data Layer 1.0
Customer Experience Digital Data Layer 1.0 Customer Experience Digital Data Layer 1.0
Customer Experience Digital Data Layer 1.0
 
Big Data and MDM altogether: the winning association
Big Data and MDM altogether: the winning associationBig Data and MDM altogether: the winning association
Big Data and MDM altogether: the winning association
 
Callcenter HPE IDOL overview
Callcenter HPE IDOL overviewCallcenter HPE IDOL overview
Callcenter HPE IDOL overview
 
Data Modeling and Relational to NoSQL
 Data Modeling and Relational to NoSQL  Data Modeling and Relational to NoSQL
Data Modeling and Relational to NoSQL
 
JSON Data Modeling - July 2018 - Tulsa Techfest
JSON Data Modeling - July 2018 - Tulsa TechfestJSON Data Modeling - July 2018 - Tulsa Techfest
JSON Data Modeling - July 2018 - Tulsa Techfest
 
Why an AI-Powered Data Catalog Tool is Critical to Business Success
Why an AI-Powered Data Catalog Tool is Critical to Business SuccessWhy an AI-Powered Data Catalog Tool is Critical to Business Success
Why an AI-Powered Data Catalog Tool is Critical to Business Success
 
Henninger_MakingReferenceDataMoreMeaningful-Final
Henninger_MakingReferenceDataMoreMeaningful-FinalHenninger_MakingReferenceDataMoreMeaningful-Final
Henninger_MakingReferenceDataMoreMeaningful-Final
 
What Watson Explorer is and How it works
What Watson Explorer is and How it worksWhat Watson Explorer is and How it works
What Watson Explorer is and How it works
 
Choosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your BusinessChoosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your Business
 
What Does Web 2.0 Mean for Enterprise Search
What Does Web 2.0 Mean for Enterprise SearchWhat Does Web 2.0 Mean for Enterprise Search
What Does Web 2.0 Mean for Enterprise Search
 
Thwart Fraud Using Graph-Enhanced Machine Learning and AI
Thwart Fraud Using Graph-Enhanced Machine Learning and AIThwart Fraud Using Graph-Enhanced Machine Learning and AI
Thwart Fraud Using Graph-Enhanced Machine Learning and AI
 
Webinar: How to Drive Business Value in Financial Services with MongoDB
Webinar: How to Drive Business Value in Financial Services with MongoDBWebinar: How to Drive Business Value in Financial Services with MongoDB
Webinar: How to Drive Business Value in Financial Services with MongoDB
 
Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017
 
JDV Big Data Webinar v2
JDV Big Data Webinar v2JDV Big Data Webinar v2
JDV Big Data Webinar v2
 
SOA for Data Management
SOA for Data ManagementSOA for Data Management
SOA for Data Management
 
Graphs in the Real World
Graphs in the Real WorldGraphs in the Real World
Graphs in the Real World
 
Slides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
Slides: NoSQL Data Modeling Using JSON Documents – A Practical ApproachSlides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
Slides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
 

Viewers also liked

360° View of Your Customers
360° View of Your Customers360° View of Your Customers
360° View of Your CustomersOSF Commerce
 
Using Big Data to Drive Customer 360
Using Big Data to Drive Customer 360Using Big Data to Drive Customer 360
Using Big Data to Drive Customer 360Cloudera, Inc.
 
Webinar: Working with Graph Data in MongoDB
Webinar: Working with Graph Data in MongoDBWebinar: Working with Graph Data in MongoDB
Webinar: Working with Graph Data in MongoDBMongoDB
 
Webinar: Creating a Single View: Securing Your Deployment
Webinar: Creating a Single View: Securing Your DeploymentWebinar: Creating a Single View: Securing Your Deployment
Webinar: Creating a Single View: Securing Your DeploymentMongoDB
 
Creating a Single View Part 3: Securing Your Deployment
Creating a Single View Part 3: Securing Your DeploymentCreating a Single View Part 3: Securing Your Deployment
Creating a Single View Part 3: Securing Your DeploymentMongoDB
 
MongoDB - How to model and extract your data
MongoDB - How to model and extract your dataMongoDB - How to model and extract your data
MongoDB - How to model and extract your dataFrancesco Lo Franco
 
Creating a Single View Part 2: Loading Disparate Source Data and Creating a S...
Creating a Single View Part 2: Loading Disparate Source Data and Creating a S...Creating a Single View Part 2: Loading Disparate Source Data and Creating a S...
Creating a Single View Part 2: Loading Disparate Source Data and Creating a S...MongoDB
 
Single View of Customer in Banking
Single View of Customer in BankingSingle View of Customer in Banking
Single View of Customer in BankingRajeev Krishnan
 
How Insurance Companies Use MongoDB
How Insurance Companies Use MongoDB How Insurance Companies Use MongoDB
How Insurance Companies Use MongoDB MongoDB
 
Extended 360 degree view of customer
Extended 360 degree view of customerExtended 360 degree view of customer
Extended 360 degree view of customerTrisha Dutta
 
An Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDBAn Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDBMongoDB
 
Notes On Single View Of The Customer
Notes On Single View Of The CustomerNotes On Single View Of The Customer
Notes On Single View Of The CustomerAlan McSweeney
 
Business Track: MetLife Leapfrogs Insurance Industry with MongoDB-Powered Big...
Business Track: MetLife Leapfrogs Insurance Industry with MongoDB-Powered Big...Business Track: MetLife Leapfrogs Insurance Industry with MongoDB-Powered Big...
Business Track: MetLife Leapfrogs Insurance Industry with MongoDB-Powered Big...MongoDB
 
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...The role of Big Data and Modern Data Management in Driving a Customer 360 fro...
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...Cloudera, Inc.
 

Viewers also liked (18)

Big Data : a 360° Overview
Big Data : a 360° Overview Big Data : a 360° Overview
Big Data : a 360° Overview
 
360° View of Your Customers
360° View of Your Customers360° View of Your Customers
360° View of Your Customers
 
Solution Blueprint - Customer 360
Solution Blueprint - Customer 360Solution Blueprint - Customer 360
Solution Blueprint - Customer 360
 
Using Big Data to Drive Customer 360
Using Big Data to Drive Customer 360Using Big Data to Drive Customer 360
Using Big Data to Drive Customer 360
 
Webinar: Working with Graph Data in MongoDB
Webinar: Working with Graph Data in MongoDBWebinar: Working with Graph Data in MongoDB
Webinar: Working with Graph Data in MongoDB
 
CET DQ Tool Selection - Executive
CET DQ Tool Selection - ExecutiveCET DQ Tool Selection - Executive
CET DQ Tool Selection - Executive
 
Webinar: Creating a Single View: Securing Your Deployment
Webinar: Creating a Single View: Securing Your DeploymentWebinar: Creating a Single View: Securing Your Deployment
Webinar: Creating a Single View: Securing Your Deployment
 
Creating a Single View Part 3: Securing Your Deployment
Creating a Single View Part 3: Securing Your DeploymentCreating a Single View Part 3: Securing Your Deployment
Creating a Single View Part 3: Securing Your Deployment
 
MongoDB - How to model and extract your data
MongoDB - How to model and extract your dataMongoDB - How to model and extract your data
MongoDB - How to model and extract your data
 
Creating a Single View Part 2: Loading Disparate Source Data and Creating a S...
Creating a Single View Part 2: Loading Disparate Source Data and Creating a S...Creating a Single View Part 2: Loading Disparate Source Data and Creating a S...
Creating a Single View Part 2: Loading Disparate Source Data and Creating a S...
 
Single View of Customer in Banking
Single View of Customer in BankingSingle View of Customer in Banking
Single View of Customer in Banking
 
How Insurance Companies Use MongoDB
How Insurance Companies Use MongoDB How Insurance Companies Use MongoDB
How Insurance Companies Use MongoDB
 
Extended 360 degree view of customer
Extended 360 degree view of customerExtended 360 degree view of customer
Extended 360 degree view of customer
 
An Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDBAn Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDB
 
Notes On Single View Of The Customer
Notes On Single View Of The CustomerNotes On Single View Of The Customer
Notes On Single View Of The Customer
 
Business Track: MetLife Leapfrogs Insurance Industry with MongoDB-Powered Big...
Business Track: MetLife Leapfrogs Insurance Industry with MongoDB-Powered Big...Business Track: MetLife Leapfrogs Insurance Industry with MongoDB-Powered Big...
Business Track: MetLife Leapfrogs Insurance Industry with MongoDB-Powered Big...
 
Customer 360
Customer 360Customer 360
Customer 360
 
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...The role of Big Data and Modern Data Management in Driving a Customer 360 fro...
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...
 

Similar to Creating a Single View: Overview and Analysis

"How To Build and Lead a Winning Data Team" by Cahyo Listyanto (Bizzy.co.id)
"How To Build and Lead a Winning Data Team" by Cahyo Listyanto (Bizzy.co.id)"How To Build and Lead a Winning Data Team" by Cahyo Listyanto (Bizzy.co.id)
"How To Build and Lead a Winning Data Team" by Cahyo Listyanto (Bizzy.co.id)Tech in Asia ID
 
Functional requirements: Thinking Like A Pirate
Functional requirements: Thinking Like A PirateFunctional requirements: Thinking Like A Pirate
Functional requirements: Thinking Like A PirateAmye Scavarda
 
5 Essential Practices of the Data Driven Organization
5 Essential Practices of the Data Driven Organization5 Essential Practices of the Data Driven Organization
5 Essential Practices of the Data Driven OrganizationVivastream
 
Big Data Meetup by Chad Richeson
Big Data Meetup by Chad RichesonBig Data Meetup by Chad Richeson
Big Data Meetup by Chad RichesonSocietyConsulting
 
danmcclary-pspresentation-katieboyle-171030115522.pdf
danmcclary-pspresentation-katieboyle-171030115522.pdfdanmcclary-pspresentation-katieboyle-171030115522.pdf
danmcclary-pspresentation-katieboyle-171030115522.pdfssuser3ee399
 
Why Big and Small Data Is Important by Google's Product Manager
Why Big and Small Data Is Important by Google's Product ManagerWhy Big and Small Data Is Important by Google's Product Manager
Why Big and Small Data Is Important by Google's Product ManagerProduct School
 
Age of Exploration: How to Achieve Enterprise-Wide Discovery
Age of Exploration: How to Achieve Enterprise-Wide DiscoveryAge of Exploration: How to Achieve Enterprise-Wide Discovery
Age of Exploration: How to Achieve Enterprise-Wide DiscoveryInside Analysis
 
The 7-Step SEO Keyword Research Process By Jon Rognerud
The 7-Step SEO Keyword Research Process By Jon RognerudThe 7-Step SEO Keyword Research Process By Jon Rognerud
The 7-Step SEO Keyword Research Process By Jon RognerudJon Rognerud Chaosmap Digital
 
SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365
SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365
SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365Brian Culver
 
How Celtra Optimizes its Advertising Platform with Databricks
How Celtra Optimizes its Advertising Platformwith DatabricksHow Celtra Optimizes its Advertising Platformwith Databricks
How Celtra Optimizes its Advertising Platform with DatabricksGrega Kespret
 
SPT 104 Unlock your big data with analytics and BI on Office 365
SPT 104 Unlock your big data with analytics and BI on Office 365SPT 104 Unlock your big data with analytics and BI on Office 365
SPT 104 Unlock your big data with analytics and BI on Office 365Brian Culver
 
Charles Rygula: Value Beyond Words
Charles Rygula: Value Beyond WordsCharles Rygula: Value Beyond Words
Charles Rygula: Value Beyond WordsJack Molisani
 
Webinar: Scaling MongoDB
Webinar: Scaling MongoDBWebinar: Scaling MongoDB
Webinar: Scaling MongoDBMongoDB
 
Great Data Delivery: A model-based approach
Great Data Delivery: A model-based approachGreat Data Delivery: A model-based approach
Great Data Delivery: A model-based approachZach Taylor
 
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...Joe Lamantia
 
UX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
UX STRAT USA Presentation: Joe Lamantia, Bottomline TechnologiesUX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
UX STRAT USA Presentation: Joe Lamantia, Bottomline TechnologiesUX STRAT
 
Understanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data QuicklyUnderstanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data QuicklyInside Analysis
 
Data-Driven Design for User Experience
Data-Driven Design for User Experience Data-Driven Design for User Experience
Data-Driven Design for User Experience Emi Kwon
 
Training in Analytics and Data Science
Training in Analytics and Data ScienceTraining in Analytics and Data Science
Training in Analytics and Data ScienceAjay Ohri
 
Unlock your Big Data with Analytics and BI on Office 365
Unlock your Big Data with Analytics and BI on Office 365Unlock your Big Data with Analytics and BI on Office 365
Unlock your Big Data with Analytics and BI on Office 365Brian Culver
 

Similar to Creating a Single View: Overview and Analysis (20)

"How To Build and Lead a Winning Data Team" by Cahyo Listyanto (Bizzy.co.id)
"How To Build and Lead a Winning Data Team" by Cahyo Listyanto (Bizzy.co.id)"How To Build and Lead a Winning Data Team" by Cahyo Listyanto (Bizzy.co.id)
"How To Build and Lead a Winning Data Team" by Cahyo Listyanto (Bizzy.co.id)
 
Functional requirements: Thinking Like A Pirate
Functional requirements: Thinking Like A PirateFunctional requirements: Thinking Like A Pirate
Functional requirements: Thinking Like A Pirate
 
5 Essential Practices of the Data Driven Organization
5 Essential Practices of the Data Driven Organization5 Essential Practices of the Data Driven Organization
5 Essential Practices of the Data Driven Organization
 
Big Data Meetup by Chad Richeson
Big Data Meetup by Chad RichesonBig Data Meetup by Chad Richeson
Big Data Meetup by Chad Richeson
 
danmcclary-pspresentation-katieboyle-171030115522.pdf
danmcclary-pspresentation-katieboyle-171030115522.pdfdanmcclary-pspresentation-katieboyle-171030115522.pdf
danmcclary-pspresentation-katieboyle-171030115522.pdf
 
Why Big and Small Data Is Important by Google's Product Manager
Why Big and Small Data Is Important by Google's Product ManagerWhy Big and Small Data Is Important by Google's Product Manager
Why Big and Small Data Is Important by Google's Product Manager
 
Age of Exploration: How to Achieve Enterprise-Wide Discovery
Age of Exploration: How to Achieve Enterprise-Wide DiscoveryAge of Exploration: How to Achieve Enterprise-Wide Discovery
Age of Exploration: How to Achieve Enterprise-Wide Discovery
 
The 7-Step SEO Keyword Research Process By Jon Rognerud
The 7-Step SEO Keyword Research Process By Jon RognerudThe 7-Step SEO Keyword Research Process By Jon Rognerud
The 7-Step SEO Keyword Research Process By Jon Rognerud
 
SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365
SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365
SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365
 
How Celtra Optimizes its Advertising Platform with Databricks
How Celtra Optimizes its Advertising Platformwith DatabricksHow Celtra Optimizes its Advertising Platformwith Databricks
How Celtra Optimizes its Advertising Platform with Databricks
 
SPT 104 Unlock your big data with analytics and BI on Office 365
SPT 104 Unlock your big data with analytics and BI on Office 365SPT 104 Unlock your big data with analytics and BI on Office 365
SPT 104 Unlock your big data with analytics and BI on Office 365
 
Charles Rygula: Value Beyond Words
Charles Rygula: Value Beyond WordsCharles Rygula: Value Beyond Words
Charles Rygula: Value Beyond Words
 
Webinar: Scaling MongoDB
Webinar: Scaling MongoDBWebinar: Scaling MongoDB
Webinar: Scaling MongoDB
 
Great Data Delivery: A model-based approach
Great Data Delivery: A model-based approachGreat Data Delivery: A model-based approach
Great Data Delivery: A model-based approach
 
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
 
UX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
UX STRAT USA Presentation: Joe Lamantia, Bottomline TechnologiesUX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
UX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
 
Understanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data QuicklyUnderstanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data Quickly
 
Data-Driven Design for User Experience
Data-Driven Design for User Experience Data-Driven Design for User Experience
Data-Driven Design for User Experience
 
Training in Analytics and Data Science
Training in Analytics and Data ScienceTraining in Analytics and Data Science
Training in Analytics and Data Science
 
Unlock your Big Data with Analytics and BI on Office 365
Unlock your Big Data with Analytics and BI on Office 365Unlock your Big Data with Analytics and BI on Office 365
Unlock your Big Data with Analytics and BI on Office 365
 

More from MongoDB

MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Migrate Anything* to MongoDB AtlasMongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Migrate Anything* to MongoDB AtlasMongoDB
 
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!MongoDB
 
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...MongoDB
 
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDBMongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDBMongoDB
 
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...MongoDB
 
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series DataMongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series DataMongoDB
 
MongoDB SoCal 2020: MongoDB Atlas Jump Start
 MongoDB SoCal 2020: MongoDB Atlas Jump Start MongoDB SoCal 2020: MongoDB Atlas Jump Start
MongoDB SoCal 2020: MongoDB Atlas Jump StartMongoDB
 
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]MongoDB
 
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2MongoDB
 
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...MongoDB
 
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!MongoDB
 
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your MindsetMongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your MindsetMongoDB
 
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: MongoDB Atlas JumpstartMongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: MongoDB Atlas JumpstartMongoDB
 
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...MongoDB
 
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++MongoDB
 
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...MongoDB
 
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep DiveMongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep DiveMongoDB
 
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & GolangMongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & GolangMongoDB
 
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...MongoDB
 
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...MongoDB
 

More from MongoDB (20)

MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Migrate Anything* to MongoDB AtlasMongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
 
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
 
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
 
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDBMongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
 
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
 
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series DataMongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
 
MongoDB SoCal 2020: MongoDB Atlas Jump Start
 MongoDB SoCal 2020: MongoDB Atlas Jump Start MongoDB SoCal 2020: MongoDB Atlas Jump Start
MongoDB SoCal 2020: MongoDB Atlas Jump Start
 
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
 
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
 
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
 
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
 
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your MindsetMongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
 
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: MongoDB Atlas JumpstartMongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
 
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
 
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
 
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
 
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep DiveMongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
 
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & GolangMongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
 
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
 
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
 

Recently uploaded

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 

Recently uploaded (20)

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 

Creating a Single View: Overview and Analysis

  • 1. Enterprise Architect, MongoDB Brian D. Goodman Creating a single view: Overview and data analysis
  • 2. What Is He Going To Talk About? What are we trying to do? How do approach doing it? How do you navigate to success? Overview & Data Analysis Data Design & Loading Strategies Securing Your Deployment Creating A Single View Part 1 Part 2 Part 3
  • 3. A data platform is common process, tooling and management from ingestion to presentation • Integrating technologies • Streamlining ingestion • Centralizing access • Supporting analytics
  • 4. I have sat in your chair looking for answers Source: Elephant by James Fujii @ http://jamesfujiistoryartist.blogspot.com/2010/05/dancing-elephant-april-2010.html Source: Gorilla by Luigi Lucarelli @ http://loaduniverse.blogspot.com/2012/04/gorilla-sketch.html 13+ years 1.5 years Present
  • 5. ID, name, address, phone … ID, name, description, qty … ID, ID_PRODUCT, ID_CUSTOMER, QTY, PRICE … 127.0.0.1 - user [10/Oct/2000:13:55:36 -0700] "GET /product.html HTTP/1.0" 200 2326 10577, 41.041731, - 73.70972, Purchase, New York, 3454, 183686, 4 Apple, 0.938748, mixed, [ Company, Brand, … ], [ dbpedia, freebase, yago, website ] China, 0.519803, mixed, [ Country, Location, Admin_Div. … ], [ dbpedia, freebase … ]
  • 6. Single view use cases abound Financial Services Creating a single view of a product, trader, client Generating recommendations to sales people Identifying anomalous patterns Data sources: client profile, trade data, market data, service catalog Retail Creating a single view of a product or customer Generating recommendations to customers Proactive, personal marketing to customers Data sources: customer profile, loyalty program, inventory, product catalog Healthcare Creating 360-degree patient view Managing electronic health-care records for patients Population management for at-risk demographics Data sources: patient records, sensor data, care center metrics
  • 8. Zooming in to Single View backend
  • 11. Getting to a Single View of the customer Calculation using the aggregation framework MapReduce Analytics Embedding data in context Generic container to manage all the contact points and analytical observations Full-text search for ad hoc query Geo-indexing for extending insight
  • 12. Questions we can now ask … • What kinds of products has this person purchased? e.g. distinct( “orders.category”, { “id” : 12345678 } ) • How close are they from a point of service? Now. e.g. find("location" : { $near : [40.8768, -73.787] } ) • Who is most dissatisfiedwithour service? e.g. find().sort({ sentiment : 1 } ).limit( 100 ) • What shouldmy customer representative mention in the next conversation? e.g. find( { “action.topic”: “talkingpoint” } ).sort( createdOn : -1 )
  • 13. Start focused and iterate quickly 1. Focus on a single effort, a couple of data sources, and a familiar ‘actor” to avoid being overwhelmed 2. Consider what the data is and a few questions you might ask of it – this informs the initial data model 3. Prototype, prove then iterate
  • 14. RDBMs terminology has an equivalent Source: RDBMS to MongoDB migration guide: http://info.mongodb.com/rs/mongodb/images/RDBMStoMongoDBMigration.pdf
  • 15. Normalization acts as a decision point to either embed or continue to link Source: RDBMS to MongoDB migration guide: http://info.mongodb.com/rs/mongodb/images/RDBMStoMongoDBMigration.pdf
  • 16. Embedding vs. linking is all about navigating the compromise between performance & data duplication Relationshi p Embed Link Example 1 : 1 If atomic ✔ ≈ person to twitter handle 1 : n ✖ ✔ person to click paths 1 : few ✔ ✖ person to content of interest n : n For performance ✔ ✔ person to content read
  • 17. Dynamic schemas enables flexibility and agility • Data is coming in many forms • New data sources surface constantly • Single-views facilitate analytics and breed more data • Flexible schemas make it easy to evolve vs. reinvent • Schemas reflect the access patterns of the data Source: Sogeti: bigdata-introduction-sogeti-consultingservices-businesstechnology-20130628v5-130723151309-phpapp02.ppt
  • 18. Modeling a single view of the customer begins with the types of queries we think we will want to make • What is the customer (individual or company)doing that is not normal? • What are the customer’s peers doing, that they would do, but haven’t? • What productshas the customer purchased and what is likelihood that they will buy something new in the next week | thirty days? • What willthe customer do this week? • What are customers tellingus about them, us, the world? • What content / products are effectiveat generating sales? • Who is influential – both us and customers? • What does the world look like from our proprietarypointof view vs. the what is happening in the public? • Which customers should I focus on, what should I show them and why is it smart?
  • 19. What do all these crazy formats look like? Twitter timeline Product catalog Comments Content Binary
  • 20. Centering around customers enables single access pattern for fast rich queries and iterative extension • Using arrays to allow for multiple relationships (organizations & locations) • Tags as a means to flexibly profile e.g. top_10_percent, badge_of_honor, flight_risk • Actions array to store analytical outputs that describe the customer to sales channels and people e.g. anomalies, recommendations, predictions • Products array to store products they customer has purchased and if they publicly engaged e.g. rating, commenting, tweeting returned etc. • Products collection to store details • Product comments reference Customers
  • 21. A single-view of the customer enables fast, rich queries and iterative extension { "_id" : ObjectId("5063114bd386d8fadbd6b004"), "name" : "Brian D. Goodman", "organizations" : ["MongoDB”], "locations" : [ { "type" : "work", "address" : "229 W 34 St., 5th floor", "city" : "New York", "state" : "NY", "zipcode" : "10036" } ], "tags" : ["blogger”, "photographer”, "technology”], "lastAccess" : ISODate("2012-12-19T06:15:33.035Z"), "actions" : [ { "type" : "recommendation", "date" : "…", "tags" : [ ”photography", ”video" ], "description" : "Zack Arias released new video!", "action" : "http://…" } ], "products" : [ { "_id" : ObjectId("5063114bd333d8fadbd6b444"), "purchased" : ISODate("2012-12- 19T06:15:33.035Z"), "rated" : 1, "commented" : 1, "tweeted" : 0, "returned" : 0 } ]
  • 22. Adding streaming customer sentiment to base model enables new analytics and customer insight • Sentiment analytics could be performed – Across all customer expressions – Towards the company – Towards specific products • An aggregate sentiment score might place the customer on a positive / negative scale as a means to identify predisposition and can be contrasted with sentiment towards the company • Sentiment towards products might be stored in an array, similar to comments, enabling context specific dispositions
  • 23. Adding streaming customer sentiment to base model enables new analytics and customer insight { "_id" : ObjectId("5063114bd386d8fadbd6b004"), "name" : "Brian D. Goodman", "organizations" : ["MongoDB”], "locations" : [ { "type" : "work", "address" : "229 W 34 St., 5th floor", "city" : "New York", "state" : "NY", "zipcode" : "10036" } ], ”sentiment” : { ”personal” : 1, „towardsCompany” : 0 }, "tags" : ["blogger”, "photographer”, "technology”], "lastAccess" : ISODate("2012-12-19T06:15:33.035Z"), "actions" : [ . . . ], "products" : [ { "_id" : ObjectId("5063114bd333d8fadbd6b444"), "purchased" : ISODate("2012-12-19T06:15:33.035Z"), "rated" : 1, "commented" : 1, "tweeted" : 0, "returned" : 0 } ] }
  • 24. So, we have a model, but what if I need more than one way to look at my data? • Aggregateinstead of recalculate Examples: Creating an activity score based on a few variables on the fly or… Storing a variety of counters, but not the individual records • Store the original granularity for a detailed query and embed aggregations Example: Total number of reviews vs. counting the reviews themselves • Embedding and linking gives you the best of performance and management Example: a master collection for addresses, but an embedded copy for performance
  • 25. MongoDB shifts responsibilities to developers • Enforcing security, auditing, redaction, schema • Assuming change • Managing the data holistically • Balancing the decision to embed vs. link and the extra management of data that may be required
  • 26. What does your single-view look like? • What is at the center of your single-view? • What questions are you asking of your data? • How do you plan to get the data in there? • How do you make sure it doesn’t exit out the backdoor?
  • 27. • MongoDB Seattle : September 16, 2014 • MongoDB Boston : October 1, 2014 • MongoDB DC : October 14, 2014 • MongoDB SF : December 3, 2014 Coming to you soon… Register Now: www.mongodb.com/mongodb-coming-you Use code 25webinar for a 25% discount.
  • 28. Questions? Stay tuned after the webinar and take our survey for your chance to win MongoDB swag.
  • 29. Enterprise Architect, MongoDB Brian D. Goodman Thank You

Editor's Notes

  1. Good afternoon, we are here to talk about one of the most coveted and yet seemingly intractable challenges companies face, the single view of their data.
  2. This is a three part talk, so if you want to hear the full story this is what your afternoon looks like.
  3. We all have one of these. This kind of diagram cracks me up. For the last decade we have been building these high-level diagrams describing our data platforms. They have historically been documented as technology agnostic and included everything like security, business continuity, governance, standards, all in a neat set of boxes with arrows. The good news is that we have done the hard work of building skill around our data, formed the standards and processes that support managing the data space. The introduction of new technology doesn’t change that.
  4. I come from large businesses, leading a team at JPMorgan and as a distinguished engineer at IBM. Everyone has islands of data … intentionally and intelligently defined, but disjointed and fragile.
  5. So, lets get right into it. We are trying to get from multiple systems … in this case some we control and others we do not … in all their gnarly forms … and create richer document models. The use case is going to be customer centric – so a single view of the customer – and we are going to look how we bring together these sources into a more useful fashion and how that might evolve over time.
  6. So, even though we plan to cover a customer centric use case, these single views exist in many industries. These are three examples. Single view of a trader Single view of a product Single patient view
  7. Lets take a look at the high-level architecture for a single view. This should look a lot like what you document for your solutions. On the left we have the internet – our users, their devices, our applications – increasingly wearable devices. In the middle we have the web stack and hopefully the expression of some web services.
  8. Let’s zoom in on the heart of this system… On the right we have a series of data sources from orders, content management, web logs, profiles etc. Some of that data is magically flowing across the ETL arrows … maybe it flows to Hadoop, maybe it doesn’t. You might have real-time event data, like a twitter feed or news, flowing to a stream analytics engine like Storm. What we do know is that the life of data in MongoDB is not static. It exists as and evolving data set where real-time and offline analytics continuously enhances the profile.
  9. This approximates the relationships between some of the data sources we have been considering. The green elements are showing external systems – where we might simply be loading data (i.e. we do not control them) The other systems are ones we own and the colors are discreet systems. This is just a portion of what this might look like and it is useful when comparing it with a MongoDB mental model.
  10. There are a few things happening here, so lets just take a moment with this diagram. First, we have two major views or documents being created – customers and products. A bunch of the data that was once external is not embedded into the document. Other elements are linked, but you can see some elements are now being aggregated and stored into the respective documents.
  11. For example, the web traffic is being aggregated to produce the top pages for a give user … you could imagine linking that to the topics that might be of interest through categories or tags. You might use MapReduce to crunch a sentiment score … etc. etc.
  12. Which highly rated reviews have “health” in the user supplied text e.g. find( { $text: { $search: “health" } } ) How often has this person reviewed a product? e.g. reviews.count()
  13. Here is the thing … this process is iterative and going to require the repetition to hone So focus on a single effort, stop overwhelming yourself. What would you ask if you could … this links you back to the business which justifies the whole activity in the first place. Prototype.
  14. Need to add … when do I embed vs when do I reference … and if I reference I need to be clear about the multiple queries or cache and of course the client side join
  15. The good news is that MongoDB’s dynamic schemas make this easy. Whatever we think are today’s requirements, they will change. The data types we are dealing with certainly have changed. The key is that schemas reflect the access patterns – the use – of the data.
  16. The simplest way to figure out which single view you are trying to create is to list out all the questions your business has interest in asking and see what the main nouns are. Items in RED are other nouns, but the prevailing theme is the customer.
  17. So what do al these formats look like … Everything from opaque binaries with a little meta data, twitter feeds, product catalogs, social commenting, rating services, content management systems etc. This stuff is structure for the most part, but its understandable not linked and would be valuable if they were! ---- Content: http://developer.nytimes.com/docs/article_search_api#h2-examples Comments: http://developers.gigya.com/037_API_reference/030_Comments/comments.getComments Product catalog (ebay): https://developer.ebay.com/DevZone/Product/CallRef/findProducts.html Binary is from wikipedia Twitter: https://dev.twitter.com/docs/api/1.1/get/statuses/user_timeline
  18. So lets actually look at what this might look like … Customer is modeled with metadata + a generic container called actions and more defined array for products. Notice how products are managed externally. This is because some of the content does not make sense to embed. What is being embedded is the key and whatever summary or metadata might be use. For example, did the customer return the item.
  19. Let’s take a look at what the document that represents this structure might look like. At the top we have the typical metadata about the customer, then actions and then products. [explain what is in actions]
  20. So, great! We have had success and just as would happen in real life, we need to add a feature. Not 24 hours ago we didn’t even have a single view and now all of a sudden we need to extend it. In this case we are being asked to include sentiment analytics in our system to help customer representatives and our offer systems be aware and target customers more effectively. Customer: Personal sentiment and sentiment to the company Product: Sentiment around a specific product / specific version of a product
  21. So, again here is the document equivalent. It looks trivial because this part of the magic is trivial. Adding data to an existing model is fluid. The highlighted content in orange / red is describing that I am overall positive but neutral to the company.
  22. So we have the starts of a model, but what if I need to look at the data in more than one way. Well, we likely need to be storing it in a few forms. Original, aggregates etc. The system should really be moving towards a view of incrementing counters with the original data to potential support a detailed view. ----- Meeting Notes (5/19/14 12:48) ----- How do you get it in there and how do you make sure it doesnt escape (wrong door).
  23. One thing everyone should be on the same page about … Developers have more responsibility than they have had in the past.
  24. So many companies are facing change as a result of innovations… What questions do I want to answer… Now how do you get data in there? How do you keep it from walking out the back door? Stay tuned… Often I get questions about: the role of a dba… all the things that a software developer just got empowered to do … how long does any of this take?