The document provides an overview of Couchbase's NoSQL document database capabilities for interactive applications. It discusses Couchbase's leadership in the NoSQL market, common use cases where NoSQL is a good fit like mobile apps and caching, and how Couchbase allows for JSON data everywhere from devices to servers. Relational databases are compared to Couchbase, with Couchbase able to scale out more easily to very large datasets and workloads.
Webinar: Enterprise Data Management in the Era of MongoDB and Data LakesMongoDB
With so much talk of how Big Data is revolutionizing the world and how a data lake with Hadoop and/or Spark will solve all your data problems, it is hard to tell what is hype, reality, or somewhere in-between.
In working with dozens of enterprises in varying stages of their enterprise data management (EDM) strategy, MongoDB enterprise architect, Matt Kalan, sees the same challenges and misunderstandings arise again and again.
In this session, he will explain common challenges in data management, what capabilities are necessary, and what the future state of architecture looks like. MongoDB is uniquely capable of filling common gaps in the data lake strategy.
This session also includes a live Q&A portion during which you are encouraged to ask questions of our team.
Move Fast with MongoDB Cloud Database - Atlas.
The workshop covered:
Deploying a MongoDB cluster in minutes
Query and manage data in MongoDB
Executing continuous backups and point-in-time restores, ensuring that you can meet any restore point objectives
View historical metrics in optimized dashboards, see what’s happening in your database live, configure alerts, and receive automated index suggestions to improve the performance of your cluster
Using MongoDB Charts and create visual representations of your data
Marquez: A Metadata Service for Data Abstraction, Data Lineage, and Event-bas...Willy Lulciuc
At WeWork, it's critical that we understand the complete context for all datasets. We also want to be able to explore dependencies between jobs and the datasets they produce and consume. To do this, WeWork needs metadata. In this talk I will focus on Marquez, a core service for the collection, aggregation and visualization of a data ecosystems metadata. Marquez maintains the provenance of how datasets are consumed and produced while providing global visibility into job runtime.
Slides: NoSQL Data Modeling Using JSON Documents – A Practical ApproachDATAVERSITY
After three decades of relational data modeling, everyone’s pretty comfortable with schemas, tables, and entity-relationships. As more and more Global 2000 companies choose NoSQL databases to power their Digital Economy applications, they need to think about how to best model their data. How do they move from a constrained, table-driven model to an agile, flexible data model based on JSON documents?
This webinar is intended for architects and application developers who want to learn about new JSON document data modeling approaches, techniques, and best practices. This webinar will show you how to get started building a JSON document data model, how to migrate a table-based data model to JSON documents, and how to optimize your design to enable fast query performance.
This webinar will provide practical, experience-based advice and best practices for modeling JSON documents, including:
- When to embed or not embed objects in your JSON document
- Data modeling using a practical data access pattern approach
- Indexing your JSON documents
- Querying your data using N1QL (SQL for JSON)
MongoDB Europe 2016 - Using MongoDB to Build a Fast and Scalable Content Repo...MongoDB
MongoDB can be used in the Nuxeo Platform as a replacement for more traditional SQL databases. Nuxeo's content repository, which is the cornerstone of this open source enterprise content management platform, integrates completely with MongoDB for data storage. This presentation will explain the motivation for using MongoDB and will emphasize the different implementation choices driven by the very nature of a NoSQL datastore like MongoDB. Learn how Nuxeo integrated MongoDB into the platform which resulted in increased performance (including actual benchmarks) and better response to some use cases.
MongoDB Days Silicon Valley: Jumpstart: The Right and Wrong Use Cases for Mon...MongoDB
Presented by Sigfrido Narvaez, Senior Solutions Architect, MongoDB
Experience level: Introductory
When it comes time to select database software for your project, there are a bewildering number of choices. How do you know if your project is a good fit for a relational database, or whether one of the many NoSQL options is a better choice? In this session you will learn when to use MongoDB and how to evaluate if MongoDB is a fit for your project. You will see how MongoDB's flexible document model is solving business problems in ways that were not previously possible, and how MongoDB's built-in features allow running at scale.
Webinar: Enterprise Data Management in the Era of MongoDB and Data LakesMongoDB
With so much talk of how Big Data is revolutionizing the world and how a data lake with Hadoop and/or Spark will solve all your data problems, it is hard to tell what is hype, reality, or somewhere in-between.
In working with dozens of enterprises in varying stages of their enterprise data management (EDM) strategy, MongoDB enterprise architect, Matt Kalan, sees the same challenges and misunderstandings arise again and again.
In this session, he will explain common challenges in data management, what capabilities are necessary, and what the future state of architecture looks like. MongoDB is uniquely capable of filling common gaps in the data lake strategy.
This session also includes a live Q&A portion during which you are encouraged to ask questions of our team.
Move Fast with MongoDB Cloud Database - Atlas.
The workshop covered:
Deploying a MongoDB cluster in minutes
Query and manage data in MongoDB
Executing continuous backups and point-in-time restores, ensuring that you can meet any restore point objectives
View historical metrics in optimized dashboards, see what’s happening in your database live, configure alerts, and receive automated index suggestions to improve the performance of your cluster
Using MongoDB Charts and create visual representations of your data
Marquez: A Metadata Service for Data Abstraction, Data Lineage, and Event-bas...Willy Lulciuc
At WeWork, it's critical that we understand the complete context for all datasets. We also want to be able to explore dependencies between jobs and the datasets they produce and consume. To do this, WeWork needs metadata. In this talk I will focus on Marquez, a core service for the collection, aggregation and visualization of a data ecosystems metadata. Marquez maintains the provenance of how datasets are consumed and produced while providing global visibility into job runtime.
Slides: NoSQL Data Modeling Using JSON Documents – A Practical ApproachDATAVERSITY
After three decades of relational data modeling, everyone’s pretty comfortable with schemas, tables, and entity-relationships. As more and more Global 2000 companies choose NoSQL databases to power their Digital Economy applications, they need to think about how to best model their data. How do they move from a constrained, table-driven model to an agile, flexible data model based on JSON documents?
This webinar is intended for architects and application developers who want to learn about new JSON document data modeling approaches, techniques, and best practices. This webinar will show you how to get started building a JSON document data model, how to migrate a table-based data model to JSON documents, and how to optimize your design to enable fast query performance.
This webinar will provide practical, experience-based advice and best practices for modeling JSON documents, including:
- When to embed or not embed objects in your JSON document
- Data modeling using a practical data access pattern approach
- Indexing your JSON documents
- Querying your data using N1QL (SQL for JSON)
MongoDB Europe 2016 - Using MongoDB to Build a Fast and Scalable Content Repo...MongoDB
MongoDB can be used in the Nuxeo Platform as a replacement for more traditional SQL databases. Nuxeo's content repository, which is the cornerstone of this open source enterprise content management platform, integrates completely with MongoDB for data storage. This presentation will explain the motivation for using MongoDB and will emphasize the different implementation choices driven by the very nature of a NoSQL datastore like MongoDB. Learn how Nuxeo integrated MongoDB into the platform which resulted in increased performance (including actual benchmarks) and better response to some use cases.
MongoDB Days Silicon Valley: Jumpstart: The Right and Wrong Use Cases for Mon...MongoDB
Presented by Sigfrido Narvaez, Senior Solutions Architect, MongoDB
Experience level: Introductory
When it comes time to select database software for your project, there are a bewildering number of choices. How do you know if your project is a good fit for a relational database, or whether one of the many NoSQL options is a better choice? In this session you will learn when to use MongoDB and how to evaluate if MongoDB is a fit for your project. You will see how MongoDB's flexible document model is solving business problems in ways that were not previously possible, and how MongoDB's built-in features allow running at scale.
Rapid Development and Performance By Transitioning from RDBMSs to MongoDB
Modern day application requirements demand rich & dynamic data structures, fast response times, easy scaling, and low TCO to match the rapidly changing customer & business requirements plus the powerful programming languages used in today's software landscape.
Traditional approaches to solutions development with RDBMSs increasingly expose the gap between the modern development languages and the relational data model, and between scaling up vs. scaling horizontally on commodity hardware. Development time is wasted as the bulk of the work has shifted from adding business features to struggling with the RDBMSs.
MongoDB, the premier NoSQL database, offers a flexible and scalable solution to focus on quickly adding business value again.
In this session, we will provide:
- Overview of MongoDB's capabilities
- Code-level exploration of the MongoDB programming model and APIs and how they transform the way developers interact with a database
- Update of the exciting features in MongoDB 3.0
An unprecedented amount of data is being created and is accessible. This presentation will instruct on using the new NoSQL technologies to make sense of all this data.
ГАННА КАПЛУН «noSQL vs SQL: порівняння використання реляційних та нереляційни...QADay
Kyiv Quality Assurance Day 2021
ГАННА КАПЛУН
«noSQL vs SQL: порівняння використання реляційних та нереляційних СУБД в комерційних програмах»
Слідкуйте за нами:
Телеграм: t.me/goqameetup
Linkedin: linkedin.com/company/goqa
Fb: https://www.facebook.com/goqaevent
Fb: https://www.facebook.com/qaday.org
Youtube: https://www.youtube.com/channel/UCOyIrmpVbtqUZyRLxd9BKzA
Modern architectures are moving away from a "one size fits all" approach. We are well aware that we need to use the best tools for the job. Given the large selection of options available today, chances are that you will end up managing data in MongoDB for your operational workload and with Spark for your high speed data processing needs.
Description: When we model documents or data structures there are some key aspects that need to be examined not only for functional and architectural purposes but also to take into consideration the distribution of data nodes, streaming capabilities, aggregation and queryability options and how we can integrate the different data processing software, like Spark, that can benefit from subtle but substantial model changes. A clear example is when embedding or referencing documents and their implications on high speed processing.
Over the course of this talk we will detail the benefits of a good document model for the operational workload. As well as what type of transformations we should incorporate in our document model to adjust for the high speed processing capabilities of Spark.
We will look into the different options that we have to connect these two different systems, how to model according to different workloads, what kind of operators we need to be aware of for top performance and what kind of design and architectures we should put in place to make sure that all of these systems work well together.
Over the course of the talk we will showcase different libraries that enable the integration between spark and MongoDB, such as MongoDB Hadoop Connector, Stratio Connector and MongoDB Spark Native Connector.
By the end of the talk I expect the attendees to have an understanding of:
How they connect their MongoDB clusters with Spark
Which use cases show a net benefit for connecting these two systems
What kind of architecture design should be considered for making the most of Spark + MongoDB
How documents can be modeled for better performance and operational process, while processing these data sets stored in MongoDB.
The talk is suitable for:
Developers that want to understand how to leverage Spark
Architects that want to integrate their existing MongoDB cluster and have real time high speed processing needs
Data scientists that know about Spark, are playing with Spark and want to integrate with MongoDB for their persistency layer
Webinar: Introducing the MongoDB Connector for BI 2.0 with TableauMongoDB
Pairing your real-time operational data stored in a modern database like MongoDB with first-class business intelligence platforms like Tableau enables new insights to be discovered faster than ever before.
Many leading organizations already use MongoDB in conjunction with Tableau including a top American investment bank and the world’s largest airline. With the Connector for BI 2.0, it’s never been easier to streamline the connection process between these two systems.
In this webinar, we will create a live connection from Tableau Desktop to a MongoDB cluster using the Connector for BI. Once we have Tableau Desktop and MongoDB connected, we will demonstrate the visual power of Tableau to explore the agile data storage of MongoDB.
You’ll walk away knowing:
- How to configure MongoDB with Tableau using the updated connector
- Best practices for working with documents in a BI environment
- How leading companies are using big data visualization strategies to transform their businesses
In this hands-on webinar we will cover how to leverage the Treasure Data Javascript SDK library to ensure user stitching of web data into the Treasure Data Customer Data Platform to provide a holistic view of prospects and customers.
We will demo the native SDK, as well as deploying the SDK inside of Adobe DTM and Google Tag Manager.
Webinar: Live Data Visualisation with Tableau and MongoDBMongoDB
MongoDB 3.2 introduces a new way for familiar Business Intelligence (BI) tools to access your real-time operational data – opening it up to data analysts and data scientist, enabling new insights to be discovered faster than ever before. Tableau accesses the JSON document data stored in MongoDB via this new BI connector. We will cover how the BI connector works by creating a relational view definition of a JSON data set that is then used to present a tabular SQL/ODBC interface to Tableau. Then we will set-up a live connection from Tableau Desktop to the MongoDB Connector for BI. Once we have Tableau Desktop and MongoDB connected, we will demonstrate the visual power of Tableau to explore the agile data storage of MongoDB. This webinar will cover:
What is the MongoDB BI Connector?
Setting up a connection from Tableau to the MongoDB BI Connector.
How to perform data discovery Tableau connected to MongoDB live data.
Publishing a Tableau Dashboard for sharing insights.
Rapid Development and Performance By Transitioning from RDBMSs to MongoDB
Modern day application requirements demand rich & dynamic data structures, fast response times, easy scaling, and low TCO to match the rapidly changing customer & business requirements plus the powerful programming languages used in today's software landscape.
Traditional approaches to solutions development with RDBMSs increasingly expose the gap between the modern development languages and the relational data model, and between scaling up vs. scaling horizontally on commodity hardware. Development time is wasted as the bulk of the work has shifted from adding business features to struggling with the RDBMSs.
MongoDB, the premier NoSQL database, offers a flexible and scalable solution to focus on quickly adding business value again.
In this session, we will provide:
- Overview of MongoDB's capabilities
- Code-level exploration of the MongoDB programming model and APIs and how they transform the way developers interact with a database
- Update of the exciting features in MongoDB 3.0
An unprecedented amount of data is being created and is accessible. This presentation will instruct on using the new NoSQL technologies to make sense of all this data.
ГАННА КАПЛУН «noSQL vs SQL: порівняння використання реляційних та нереляційни...QADay
Kyiv Quality Assurance Day 2021
ГАННА КАПЛУН
«noSQL vs SQL: порівняння використання реляційних та нереляційних СУБД в комерційних програмах»
Слідкуйте за нами:
Телеграм: t.me/goqameetup
Linkedin: linkedin.com/company/goqa
Fb: https://www.facebook.com/goqaevent
Fb: https://www.facebook.com/qaday.org
Youtube: https://www.youtube.com/channel/UCOyIrmpVbtqUZyRLxd9BKzA
Modern architectures are moving away from a "one size fits all" approach. We are well aware that we need to use the best tools for the job. Given the large selection of options available today, chances are that you will end up managing data in MongoDB for your operational workload and with Spark for your high speed data processing needs.
Description: When we model documents or data structures there are some key aspects that need to be examined not only for functional and architectural purposes but also to take into consideration the distribution of data nodes, streaming capabilities, aggregation and queryability options and how we can integrate the different data processing software, like Spark, that can benefit from subtle but substantial model changes. A clear example is when embedding or referencing documents and their implications on high speed processing.
Over the course of this talk we will detail the benefits of a good document model for the operational workload. As well as what type of transformations we should incorporate in our document model to adjust for the high speed processing capabilities of Spark.
We will look into the different options that we have to connect these two different systems, how to model according to different workloads, what kind of operators we need to be aware of for top performance and what kind of design and architectures we should put in place to make sure that all of these systems work well together.
Over the course of the talk we will showcase different libraries that enable the integration between spark and MongoDB, such as MongoDB Hadoop Connector, Stratio Connector and MongoDB Spark Native Connector.
By the end of the talk I expect the attendees to have an understanding of:
How they connect their MongoDB clusters with Spark
Which use cases show a net benefit for connecting these two systems
What kind of architecture design should be considered for making the most of Spark + MongoDB
How documents can be modeled for better performance and operational process, while processing these data sets stored in MongoDB.
The talk is suitable for:
Developers that want to understand how to leverage Spark
Architects that want to integrate their existing MongoDB cluster and have real time high speed processing needs
Data scientists that know about Spark, are playing with Spark and want to integrate with MongoDB for their persistency layer
Webinar: Introducing the MongoDB Connector for BI 2.0 with TableauMongoDB
Pairing your real-time operational data stored in a modern database like MongoDB with first-class business intelligence platforms like Tableau enables new insights to be discovered faster than ever before.
Many leading organizations already use MongoDB in conjunction with Tableau including a top American investment bank and the world’s largest airline. With the Connector for BI 2.0, it’s never been easier to streamline the connection process between these two systems.
In this webinar, we will create a live connection from Tableau Desktop to a MongoDB cluster using the Connector for BI. Once we have Tableau Desktop and MongoDB connected, we will demonstrate the visual power of Tableau to explore the agile data storage of MongoDB.
You’ll walk away knowing:
- How to configure MongoDB with Tableau using the updated connector
- Best practices for working with documents in a BI environment
- How leading companies are using big data visualization strategies to transform their businesses
In this hands-on webinar we will cover how to leverage the Treasure Data Javascript SDK library to ensure user stitching of web data into the Treasure Data Customer Data Platform to provide a holistic view of prospects and customers.
We will demo the native SDK, as well as deploying the SDK inside of Adobe DTM and Google Tag Manager.
Webinar: Live Data Visualisation with Tableau and MongoDBMongoDB
MongoDB 3.2 introduces a new way for familiar Business Intelligence (BI) tools to access your real-time operational data – opening it up to data analysts and data scientist, enabling new insights to be discovered faster than ever before. Tableau accesses the JSON document data stored in MongoDB via this new BI connector. We will cover how the BI connector works by creating a relational view definition of a JSON data set that is then used to present a tabular SQL/ODBC interface to Tableau. Then we will set-up a live connection from Tableau Desktop to the MongoDB Connector for BI. Once we have Tableau Desktop and MongoDB connected, we will demonstrate the visual power of Tableau to explore the agile data storage of MongoDB. This webinar will cover:
What is the MongoDB BI Connector?
Setting up a connection from Tableau to the MongoDB BI Connector.
How to perform data discovery Tableau connected to MongoDB live data.
Publishing a Tableau Dashboard for sharing insights.
Tableau & MongoDB: Visual Analytics at the Speed of ThoughtMongoDB
Tableau enables people to ask questions of their data by bringing analysis and visualization together with revolutionary technology. In this session, you’ll learn how to leverage Tableau and MongoDB for visual analytics of rich JSON data at the speed of thought, dramatically reducing the time-to-insight for users. The talk will include interactive demos and best practices to drive smart and fast business insights.
This presentation was given by David Maier @magicable @munichnosql may 2014. The code can be found https://github.com/dmaier-couchbase/cbl-android-tasklist
Learn what you need to consider when moving from the world of relational databases to a NoSQL document store.
Hear from Developer Advocate Glynn Bird as he explains the key differences between relational databases and JSON document stores like Cloudant, as well as how to dodge the pitfalls of migrating from a relational database to NoSQL.
Hear Ryan Millay, IBM Cloudant software development manager, discuss what you need to consider when moving from world of relational databases to a NoSQL document store.
You'll learn about the key differences between relational databases and JSON document stores like Cloudant, as well as how to dodge the pitfalls of migrating from a relational database to NoSQL.
Add Redis to Postgres to Make Your Microservices Go Boom!Dave Nielsen
Slides for talk delivered at PostgresOpen 2018 in San Francisco https://postgresql.us/events/pgopen2018/schedule/session/538-add-redis-to-postgres-to-make-your-microservice-go-boom/
Webinar: How to Drive Business Value in Financial Services with MongoDBMongoDB
Huge upheaval in the finance industry has led to a major strain on existing IT infrastructure and systems. New finance industry regulation has meant increased volume, velocity and variability of data. This coupled with cost pressures from the business has led these institutions to seek alternatives. Top tier institutions like MetLife have turned to MongoDB because of the enormous business value it enables.
In this session, hear how MongoDB enabled these successful real world examples:
Single View of a Customer - 3 months and $2M for a single view of a customer across 50 source systems
Reference Data Management - $40M in cost savings from migrating to MongoDB for reference data management
Private cloud - MongoDB as a PaaS across a tier 1 bank for enabling agility for operations, not just the developer
The use cases are specific to financial services but the patterns of usage - agility, scale, global distribution - will be applicable across many industries.
Apache CarbonData+Spark to realize data convergence and Unified high performa...Tech Triveni
Challenges in Data Analytics:
Different application scenarios need different storage solutions: HBASE is ideal for point query scenarios but unsuitable for multi-dimensional queries. MPP is suitable for data warehouse scenarios but engine and data are coupled together which hampers scalability. OLAP stores used in BI applications perform best for Aggregate queries but full scan queries perform at a sub-optimal performance. Moreover, they are not suitable for real-time analysis. These distinct systems lead to low resource sharing and need different pipelines for data and application management.
Building Operational Data Lake using Spark and SequoiaDB with Yang PengDatabricks
This topic describes the use of Spark and SequoiaDB in the Operational Data Lake of China’s financial industry, including how to use SequoiaDB to provide online high concurrent services and how to use Spark for data processing and machine learning. China has the world’s largest population, and also the world’s second largest economy. Many of the best technologies used in the United States and Europe are difficult to play effectively in China. This topic will show you how Spark and SequoiaDB are able to provide online financial services to billions of population.
From SQL to NoSQL: Structured Querying for JSONKeshav Murthy
Can SQL be used to query JSON? SQL is the universally known structured query language, used for well defined, uniformly structured data; while JSON is the lingua franca of flexible data management, used to define complex, variably structured data objects.
Yes! SQL can most-definitely be used to query JSON with Couchbase's SQL query language for JSON called N1QL (verbalized as Nickel.)
In this session, we will explore how N1QL extends SQL to provide the flexibility and agility inherent in JSON while leveraging the universality of SQL as a query language.
We will discuss utilizing SQL to query complex JSON objects that include arrays, sets and nested objects.
You will learn about the powerful query expressiveness of N1QL, including the latest features that have been added to the language. We will cover how using N1QL can solve your real-world application challenges, based on the actual queries of Couchbase end-users.
Embracing Database Diversity with Kafka and DebeziumFrank Lyaruu
There was a time not long ago when we used relational databases for everything. Even if the data wasn’t particularly relational, we shoehorned it into relational tables, often because that was the only database we had. Thank god these dark times are over and now we have many different kinds of NoSQL databases: Document, realtime, graph, column, but that does not solve the problem that the same data might be a graph from one perspective, but a collection of documents from another.
It would be really nice if we can access that same data in many different ways, depending on the context of what we want to achieve in our current task.
As software architects this is not easy to solve but definitely possible: We can design an architecture using Event Sourcing: Capture the data with Debezium, post it to a Kafka queue, use Kafka Streams to model the data the way we like, and store the data in various different data sources, so we can synchronize data between data sources.
NoSQL Analytics: JSON Data Analysis and Acceleration in MongoDB WorldAjay Gupte
In analytics world, when you need to process many millions or billions of documents to generate a single report. Novel techniques have been developed for exploiting modern processor architecture (larger on-chip cache, SIMD processing, compression, vector processing, columnar approach). Now, this technology is available to process your large JSON data. This talk will discuss analysis of JSON data using advanced data warehousing techniques and make it simple and seamless for the application/tool developer.
Webinar: General Technical Overview of MongoDB for Ops TeamsMongoDB
MongoDB is the leading open-source, document database. In this webinar we'll dive into the technical details of MongoDB by first focusing on what makes it different from traditional relational database management systems. We'll review data storage, high availability and scaling for MongoDB. Next we'll discuss what's involved in deploying MongoDB in production. Finally, we'll delve into some of the operational challenges including performance tuning, capacity planning and what it takes to deploy robust highly-available cluster topology.
Postgres, the leading open source relational database, is positioned as the centerpiece of a pivot from traditional architectures to a micro-services based approach that is in full support of a DevOps motion.
Presented by Marc Linster, Senior Vice President of Product Development at EnterpriseDB, this explores how Postgres meets the key requirements for DevOps. Lister explains how Postgres is developer friendly, supporting the process with a versatile data model using JSONB, integrating other data sources using Foreign Data Wrappers, and how Postgres supports rapid deployment in the cloud and on premises.
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Couchbase Overview Nov 2013
1. Data
Management
for
Interac6ve
Applica6ons
NoSQL
Document
Database
Jeff
Harris
Business
Development
1
2. Agenda
• Company
Overview
• Kinds
of
Database
Management
System
• Common
Use
cases,
When
is
NoSQL
a
good
fit?
• NoSQL
for
Mobile
Devices,
JSON
Anywhere
• Rela6onal
vs
Non-‐Rela6onal/NoSQL
• Live
Demo
• Couchbase
Technical
Overview
(under
the
hood
and
features)
2
3. Couchbase
NoSQL
Leadership
Leading
NoSQL
database
company
Open
Source
development
&
business
model
Document-‐oriented
NoSQL
database
Focused
on
interacBve
internet
and
mobile
applicaBons
Provide
more
flexible,
higher
performance,
more
scalable
database
than
rela6onal
alterna6ve
Most
mature,
reliable
and
widely
deployed
solu6on
>7,500
paid
producBon
deployments
worldwide
Headquarters
in
Silicon
Valley
(Mountain
View,
CA)
~120
employees
including
>70
in
engineering/product
>80%
of
commits
to
Couchbase,
memcached,
Apache
CouchDB
3
4. Market
Adop6on
–
Customers
Internet
Companies
Enterprises
More
than
350
customers
–
7,500
produc6on
deployments
worldwide
4
5. Market
Adop6on
Internet
Companies
•
•
•
•
•
•
•
•
Enterprises
• CommunicaBons
• Retail
• Financial
Services
• Health
Care
• AutomoBve/Airline
• Agriculture
• Consumer
Electronics
• Business
Systems
Social
Gaming
Ad
Networks
Social
Networks
Online
Business
Services
E-‐Commerce
Online
Media
Content
Management
Cloud
Services
5
6. Two
kinds
of
Database
Management
System
OLTP
/
OLTP
like
Opera6onal
Stores
Data
warehouse
or
Analy6cs
system
6
7. Adding
a
few
more
components
InteracBve
OLTP
/
OLTP
like
Opera6onal
Stores
ETL
Streaming
Data
RDBMS
Other
legacy
systems
BI
&
ReporBng
Analy6cs
Data
warehouse
7
RDBMS
Column
stores
8. NoSQL
+
Big
Data
Opera6onal
database
for
web
and
mobile
apps
with
high
performance
at
scale
Map-‐reduce
against
huge
datasets
to
analyze
and
find
insights
and
answers
8
9. Common
Use
Cases
Social
Gaming
Ad
Targe6ng
• Couchbase
stores
player
and
game
data
• Examples
customers
include:
Zynga
• Tapjoy,
Ubiso`,
Tencent
• Couchbase
stores
user
informaBon
for
fast
access
• Examples
customers
include:
AOL,
Mediamind,
Convertro
Mobile
Apps
• Couchbase
stores
user
info
and
app
content
• Examples
customers
include:
Kobo,
PlayBka
Session
store
• Couchbase
Server
as
a
key-‐
value
store
• Examples
customers
include:
Concur,
Sabre
User
Profile
Store
• Couchbase
Server
as
a
key-‐value
store
• Examples
customers
include:
Tunewiki
High
availability
cache
Content
&
Metadata
Store
• Couchbase
document
store
with
ElasBc
Search
• Examples
customers
include:
McGraw
Hill
3rd
party
data
aggrega6on
• Couchbase
stores
social
media
and
data
feeds
• Couchbase
Server
used
as
a
cache
Ber
replacement
• Examples
customers
include:
LivePerson
• Examples
customers
include:
Orbitz
9
12. JSON
Anywhere
JS
N
Couchbase
Server
Sync Gateway
Lite
Server
JS
N
JS
N
• JSON
on
the
device
• JSON
on
the
wire
Developers
increasingly
prefer
NoSQL
database
No
need
for
data
transformaBon
12
• JSON
in
the
cloud
Flexible
data
model
High
performance
Easy
scalability
14. Rela6onal
Technology
Scales
Up
Applica6on
Scales
Out
Just
add
more
commodity
web
servers
System
Cost
ApplicaBon
Performance
Web/App
Server
Tier
Users
RDBMS
Scales
Up
Get
a
bigger,
more
complex
server
System
Cost
ApplicaBon
Performance
Won’t
scale
beyond
this
point
Rela6onal
Database
Users
Expensive
and
disrup6ve
sharding,
doesn’t
perform
at
web
scale
14
15. Couchbase
Server
Scales
Out
Like
App
Tier
Applica6on
Scales
Out
Just
add
more
commodity
web
servers
System
Cost
ApplicaBon
Performance
Web/App
Server
Tier
Users
NoSQL
Database
Scales
Out
Cost
and
performance
mirrors
app
6er
System
Cost
ApplicaBon
Performance
Couchbase
Distributed
Data
Store
Users
Scaling
out
flabens
the
cost
and
performance
curves
15
16. If
Cars
Were
Stored
in
an
RDBMS…
hhp://iedei.files.wordpress.com/2012/04/mercedes-‐f1-‐car-‐disassembled.jpeg
16
17. If
Cars
Were
Stored
in
a
Document
Database…
hhp://images.conceptcarz.com/imgxra/Mercedes-‐Benz/Mercedes-‐W03-‐F1-‐Spanish-‐GP_012-‐1680.jpg
17
18. Rela6onal
vs
Document
Data
Model
C1
C2
C3
C4
{
}
JSON
JSON
JSON
Rela6onal
data
model
Document
data
model
Highly-‐structured
table
organizaBon
with
rigidly-‐defined
data
formats
and
record
structure.
CollecBon
of
complex
documents
with
arbitrary,
nested
data
formats
and
varying
“record”
format.
18
19. RDBMS
Example:
User
Profile
User
Info
Address
Info
KEY
First
Last
ZIP_id
ZIP_id
CITY
STATE
ZIP
1
Frank
Weigel
2
1
DEN
CO
30303
2
Ali
Dodson
2
2
MV
CA
94040
3
Mark
Azad
2
3
CHI
IL
60609
4
Steve
Yen
3
4
NY
NY
10010
To
get
info
about
specific
user,
you
perform
a
join
across
two
tables
19
20. Document
Example:
User
Profile
{
“ID”:
1,
“FIRST”:
“Frank”,
“LAST”:
“Weigel”,
“ZIP”:
“94040”,
“CITY”:
“MV”,
“STATE”:
“CA”
}
+
=
JSON
All
data
in
a
single
document
20
21. Making
the
Same
Change
With
a
Document
DB
{
“ID”:
1,
“FIRST”:
“Frank”,
“LAST”:
“Weigel”,
“ZIP”:
“94040”,
“CITY”:
“MV”,
“STATE”:
“CA”,
“STATUS”:
{
“TEXT”:
“At
Conf”
}
,
}
“GEO_LOC”:
“134”
},
“COUNTRY”:
”USA”
}
JSON
Just
add
informa6on
to
a
document
22. Rela6onal
vs
Document
Performance
User
Table
Photo
Table
User
ID
First
Last
Zip
1
Frank
Wiegel
Weigel
94040
2
Joe
Smith
94040
3
Ali
Dodson
94040
4
Sarah
Gorin
Bob
Young
30303
6
Nancy
Baker
10010
7
Ray
Jones
31311
8
Lee
Chen
V5V3
Photo
ID
Comment
2
d043
NYC
2
b054
Bday
5
c036
Miami
7
d072
Sunset
5002
e086
Spain
NW1
5
User
ID
Status
Table
Status
ID
Text
1
a42
At
conf
4
5
b26
c032
excited
5
4
c32
b26
hockey
12
•
•
•
User
ID
d83
Go
A’s
5000
e34
sailing
Affilia6ons
Table
5000
Doug
Moore
04252
5001
Mary
White
41694
User
ID
Affilia6ons
Affilia6ons
ID
Name
Lisa
Lisa
Clark
12425
a42
Cal
4
b96
USC
7
5002
5002
2
c14
UW
8
e22
{
{
{
{
{
{
}
}
}
JSON
JSON
}
}
JSON
JSON
}
JSON
JSON
Oxford
Faster
response
6mes
and
higher
throughput
23. NoSQL
Database
Considera6ons
Easy
Scalability
PE
RF O R M A N C E
Grow
cluster
without
applicaBon
changes,
without
downBme
when
needed
Consistent
High
Performance
Always
awesome
experience
for
your
applicaBon
users.
Always
On
24x7x365
JSON
JSON JSO
JSON N
JSON
Flexible
Data
Model
Keep
developers
producBve
and
allow
fast
and
easy
addiBon
of
new
features
The
sun
never
sets
on
the
Internet,
your
applicaBon
needs
the
database
to
always
serve
data.
23
24. Couchbase
Server
Is
The
Complete
Solu6on
✔
Easy
Scalability
Consistent
High
✔
Performance
One
click
scalability
and
no
app
changes.
Sub
millisecond
latency
with
high
throughput
for
reads
and
writes.
Always
On
✔
24x7x365
Maintenance,
upgrades
and
cluster
resizing
all
online
without
applicaBon
downBme
✔
Flexible
Data
Model
JSON
document
model
with
no
fixed
schema.
24
25. Couchbase
Server
Overview
Clustered
NoSQL
Document
Database
for
interacBve
applicaBons
• Easy
scalability
with
efficient
auto-‐sharding
ApplicaBon
stays
unchanged
as
cluster
size
changes
Data
replicaBon
and
node
auto-‐failover
• Consistent
high
performance
Sub
millisecond
latency
with
high
throughput
• Always-‐On
Maintenance,
upgrades
and
cluster
resizing
all
online
without
applicaBon
downBme
25
26. Couchbase
Server
Features
• Easy
scalability
• Built-‐in
clustering
• All
nodes
equal
• One
click
to
scale
horizontally
• Consistent
High
Performance
• Built-‐in
managed
cache
• High
performance
for
both
reads
and
writes
• Always
on
•
•
•
•
Data
replication
with
auto-‐failover
Cross
Datacenter
Replication
Zero-‐downtime
maintenance
Monitoring
and
administration
APIs
and
GUI
26
27. New
Couchbase
Server
Features
• Flexible
JSON
document
model
No
fixed
schema
for
higher
producBvity
and
less
obstacles
for
developers
• Data
indexing
and
querying
Secondary
indexes
on
JSON
documents
Simple
real-‐Bme
analyBcs
via
incremental
map-‐reduce
Connector
to
full-‐text
search
indexer
• Cross
datacenter
replica6on
Scale
your
data
beyond
a
single
data
center
27
29. Single
node
-‐
Couchbase
Write
Opera6on
App
Server
Doc
1
3
2
Managed
Cache
ReplicaBon
Queue
Doc
1
Disk
Queue
To
other
node
3
Disk
Couchbase
Server
Node
29
30. Single
node
-‐
Couchbase
Update
Opera6on
App
Server
Doc
1’
3
2
Managed
Cache
ReplicaBon
Queue
Doc
1
Doc
1’
Disk
Queue
To
other
node
3
Disk
Doc
1
Couchbase
Server
Node
30
31. App
Server
GET
Doc
1
Single
node
-‐
Couchbase
Read
Opera6on
3
2
Managed
Cache
ReplicaBon
Queue
Doc
1
Disk
Queue
To
other
node
3
Disk
Doc
1
Couchbase
Server
Node
31
32. Basic
Opera6on
APP
SERVER
1
APP
SERVER
2
COUCHBASE
Client
Library
CLUSTER
MAP
COUCHBASE
Client
Library
CLUSTER
MAP
READ/WRITE/UPDATE
SERVER
1
ACTIVE
SERVER
2
ACTIVE
• Docs
distributed
evenly
across
servers
SERVER
3
ACTIVE
• Each
server
stores
both
ac6ve
and
replica
docs
Doc
5
Doc
Doc
4
Doc
Doc
1
Doc
Doc
2
Doc
Doc
7
Doc
Doc
3
Doc
Only
one
server
acBve
at
a
Bme
Doc
9
Doc
Doc
8
Doc
Doc
6
Doc
• Client
library
provides
app
with
simple
interface
to
database
REPLICA
REPLICA
REPLICA
• Cluster
map
provides
map
to
which
server
doc
is
on
Doc
4
Doc
Doc
6
Doc
Doc
7
Doc
Doc
1
Doc
Doc
3
Doc
Doc
9
Doc
• App
reads,
writes,
updates
docs
Doc
8
Doc
Doc
2
Doc
Doc
5
Doc
• Mul6ple
app
servers
can
access
same
document
at
same
6me
App
never
needs
to
know
COUCHBASE
SERVER
CLUSTER
User
Configured
Replica
Count
=
1
32
33. Add
Nodes
to
Cluster
APP
SERVER
1
APP
SERVER
2
COUCHBASE
Client
Library
CLUSTER
MAP
COUCHBASE
Client
Library
CLUSTER
MAP
READ/WRITE/UPDATE
SERVER
1
ACTIVE
SERVER
2
ACTIVE
READ/WRITE/UPDATE
SERVER
3
ACTIVE
Doc
5
Doc
Doc
4
Doc
Doc
1
Doc
Doc
7
Doc
Doc
3
Doc
Doc
9
Doc
Doc
8
Doc
Doc
6
SERVER
5
ACTIVE
Doc
Doc
2
SERVER
4
ACTIVE
Doc
REPLICA
REPLICA
REPLICA
Doc
4
Doc
Doc
6
Doc
Doc
7
Doc
Doc
3
Doc
Doc
9
Doc
Doc
2
Doc
Doc
5
Even
distribuBon
of
docs
Minimum
doc
movement
• Cluster
map
updated
REPLICA
Doc
Doc
8
• Docs
automa6cally
rebalance
across
cluster
Doc
Doc
1
Doc
COUCHBASE
SERVER
CLUSTER
User
Configured
Replica
Count
=
1
• Two
servers
added
with
one-‐click
opera6on
33
REPLICA
• App
database
calls
now
distributed
over
larger
number
of
servers
34. Fail
Over
Node
APP
SERVER
1
APP
SERVER
2
COUCHBASE
Client
Library
CLUSTER
MAP
COUCHBASE
Client
Library
CLUSTER
MAP
SERVER
1
ACTIVE
SERVER
2
ACTIVE
SERVER
3
ACTIVE
SERVER
4
ACTIVE
Doc
5
Doc
Doc
4
Doc
Doc
1
Doc
Doc
9
Doc
Doc
2
Doc
Doc
7
Doc
Doc
3
Doc
Doc
8
SERVER
5
ACTIVE
Doc
Doc
1
Doc
6
Doc
Doc
Doc
3
REPLICA
REPLICA
REPLICA
REPLICA
Doc
4
Doc
Doc
6
Doc
Doc
7
Doc
Doc
5
Doc
1
Doc
Doc
3
Doc
Doc
9
Doc
Doc
Doc
2
Doc
8
Doc
Doc
COUCHBASE
SERVER
CLUSTER
User
Configured
Replica
Count
=
1
REPLICA
34
• App
servers
accessing
docs
• Requests
to
Server
3
fail
• Cluster
detects
server
failed
– Promotes
replicas
of
docs
to
acBve
– Updates
cluster
map
• Requests
for
docs
now
go
to
appropriate
server
• Typically
rebalance
would
follow
35. XDCR:
Cross
Data
Center
Replica6on
US
DATA
CENTER
EUROPE
DATA
CENTER
ASIA
DATA
CENTER
hhp://blog.groosy.com/wp-‐content/uploads/2011/10/internet-‐map.jpg
35
36. Cross
Data
Center
Replica6on
(XDCR)
SERVER
2
ACTIVE
SERVER
1
ACTIVE
SERVER
3
ACTIVE
Doc
Doc
Doc
Doc
2
RAM
Doc
Doc
COUCHBASE
SERVER
CLUSTER
NYC
DATA
CENTER
Doc
Doc
9
Doc
RAM
Doc
DISK
Doc
Doc
Doc
RAM
Doc
Doc
Doc
DISK
Doc
Doc
DISK
SERVER
2
ACTIVE
SERVER
1
ACTIVE
SERVER
3
ACTIVE
Doc
COUCHBASE
SERVER
CLUSTER
SF
DATA
CENTER
Doc
Doc
Doc
2
RAM
Doc
Doc
Doc
Doc
9
Doc
DISK
RAM
Doc
36
Doc
Doc
Doc
DISK
RAM
Doc
Doc
Doc
Doc
DISK
Doc
37. Indexing
and
Querying
Features
• Index
and
Query
Distributed
indexing
and
querying
Secondary
indexes
of
JSON
document
content
Flexible
querying
of
indexes
• Incremental
Map-‐Reduce
Distributed
simple
real-‐Bme
analyBcs
Only
considers
changes
due
to
updated
data
• Full
Text
Search
Robust
integraBon
with
ElasBcSearch
cluster
Flexible
full
text
search
and
faceted
search
37
38. Indexing
and
Querying
APP
SERVER
1
APP
SERVER
2
COUCHBASE
Client
Library
CLUSTER
MAP
COUCHBASE
Client
Library
CLUSTER
MAP
Query
SERVER
1
ACTIVE
SERVER
2
ACTIVE
SERVER
3
ACTIVE
• Indexing
work
is
distributed
amongst
nodes
Doc
5
Doc
Doc
4
Doc
Doc
1
Doc
• Large
data
set
possible
Doc
2
Doc
Doc
7
Doc
Doc
3
Doc
• Parallelize
the
effort
Doc
9
Doc
Doc
8
Doc
Doc
6
Doc
REPLICA
REPLICA
REPLICA
Doc
4
Doc
Doc
6
Doc
Doc
7
Doc
Doc
1
Doc
Doc
3
Doc
Doc
9
Doc
Doc
8
Doc
Doc
2
Doc
Doc
5
Doc
COUCHBASE
SERVER
CLUSTER
User
Configured
Replica
Count
=
1
38
• Each
node
has
index
for
data
stored
on
it
• Queries
combine
the
results
from
required
nodes
40. Couchbase
Demonstra6on
• Star6ng
with
4
database
nodes
under
load
• Dynamically
scaling
to
6
database
nodes
ApplicaBon
user
• Easy
management
and
monitoring
Web
applicaBon
server
• JSON
indexing
and
querying
• Easy
configura6on
of
cross-‐
datacenter
replica6on
Couchbase
Servers
In
the
EC2
or
Datacenter
• Not
possible
any
other
database
technology
40
XDCR
42. Couchbase:
The
Complete
NoSQL
Solu6on
Easy
Scalability
PE
Grow
cluster
without
applicaBon
changes,
without
downBme
when
needed
Consistent
High
Performance
RF O R M A N C E
Always
awesome
experience
for
your
applicaBon
users.
Always
On
24x7x365
JSON
JSON JSO
JSON N
JSON
Flexible
Data
Model
Keep
developers
producBve
and
allow
fast
and
easy
addiBon
of
new
features
The
sun
never
sets
on
the
Internet,
your
applicaBon
needs
the
database
to
always
serve
data.
42
43. Consistent
High
Performance
• Consistent,
predictable
sub
millisecond
latency
Apps
need
fast,
predictable
access
to
data,
it’s
not
good
enough
to
be
fast
some
of
the
Bme
• Consistent,
predictable
throughput
Throughput
capacity
of
your
data
layer
should
be
independent
of
the
mix
of
reads
and
writes
• Linear
throughput
scalability
Double
the
number
of
servers,
get
twice
the
maximum
throughput
and
double
the
data
capacity
43
44. YCSB
Benchmark
Details
• Web
applica6on
simula6on
Simulates
changing
set
of
users
using
the
app/accessing
data
Document
size
of
1.5-‐2K
with
15
million
acBve
documents
• Data
doesn’t
enBrely
fit
into
RAM
Workload
simulaBng
realisBc
mix
of
reads
and
writes
(60/40)
• System
details
4
node
cluster
with
seperate
node
to
run
client
workload
AWS
Extra
Large
instances
with
striped
EBS
1
replica
setup
(For
mongoDB
-‐
no
write
concern,
no
journaling)
Each
test
run
3
Bmes
with
varying
throughputs
with
95%
latency
measured
• YCSB
test
workload
source
code
hhps://github.com/Altoros/YCSB
44
48. NoSQL
Database
Considera6ons
Easy
Scalability
PE
Grow
cluster
without
applicaBon
changes,
without
downBme
when
needed
Consistent
High
Performance
RF O R M A N C E
Always
awesome
experience
for
your
applicaBon
users.
Always
On
24x7x365
JSON
JSON JSO
JSON N
JSON
Flexible
Data
Model
Keep
developers
producBve
and
allow
fast
and
easy
addiBon
of
new
features
The
sun
never
sets
on
the
Internet,
your
applicaBon
needs
the
database
to
always
serve
data.
48
49. Couchbase:
High
throughput
that
scales
linearly
High
throughput
with
1.4
GB/sec
data
transfer
rate
using
4
servers
Linear
throughput
scalability
hhp://www.cisco.com/en/US/prod/collateral/switches/ps9441/ps9670/white_paper_c11-‐708169.pdf
49
50. Draw
Something
“Goes
Viral”
3
Weeks
Azer
Launch
Draw
Something
by
OMGPOP
Daily
Ac)ve
Users
(millions)
Feb
2012
50
March
2012
51. As
Usage
Grew,
Game
Data
Went
Non-‐
Draw
Something
by
OMGPOP
Linear
Daily
Ac)ve
Users
(millions)
Feb
2012
51
March
2012
52. NoSQL
Database
Considera6ons
Easy
Scalability
PE
Grow
cluster
without
applicaBon
changes,
without
downBme
when
needed
Consistent
High
Performance
RF O R M A N C E
Always
awesome
experience
for
your
applicaBon
users.
Always
On
24x7x365
JSON
JSON JSO
JSON N
JSON
Flexible
Data
Model
Keep
developers
producBve
and
allow
fast
and
easy
addiBon
of
new
features
The
sun
never
sets
on
the
Internet,
your
applicaBon
needs
the
database
to
always
serve
data.
52
53. The
Sun
Never
Sets
on
the
Internet
hhp://personal.bgsu.edu/~tede/020522_briBshempire1360.jpg
53
55. Couchbase
Server
Is
The
Complete
Solu6on
✔
Easy
Scalability
Consistent
High
✔
Performance
One
click
scalability
and
no
app
changes.
Sub
millisecond
latency
with
high
throughput
for
reads
and
writes.
Always
On
✔
24x7x365
Maintenance,
upgrades
and
cluster
resizing
all
online
without
applicaBon
downBme
✔
Flexible
Data
Model
JSON
document
model
with
no
fixed
schema.
55
56. Couchbase
Server
vs.
MongoDB
Easy
Scalability
Consistent,
High
Performance
Flexible
Data
Model
Always
On
24x7x365
With
1-‐click,
horizontally
✔
grow
cluster,
even
scale
across
datacenters
Consistent
sub
millisecond
✔
reads/writes;
throughput
Consistent
high
✔
Schemaless
data
model
for
rapid
development
✔
No
downBme
for
so`ware
upgrades,
hardware
maintenance,
etc.
56
✖
Complex
mulB-‐step
scaling,
no
write
scaling
across
data
centers
✖
High
&
Inconsistent
latency;
Lower
throughput
✔
Schemaless
data
model
for
rapid
development
✖
✔
Difficult
online
upgrade;
Not
all
maintenance
is
online
57. Consistent
Lower
Latencies
and
Higher
Throughput
• Couchbase
High
read
and
write
throughput
and
consistent
low
latencies
Write
performance
advantage
due
to
low
granularity
of
Couchbase
memory
locking
mechanism
and
minimal
contenBon
Read
operaBons
happen
concurrently
with,
and
independently
of
writes
• MongoDB
Severely
limited
write
throughput
due
to
very
coarse
write
locks
that
limit
concurrency
at
the
node
level
Inconsistent
latencies
and
throughput
as
all
reads
need
to
wait
for
a
write
to
finish.
• Cost
Considera6ons
57
58. Couchbase
Server
vs.
Cassandra
Easy
Scalability
✔
With
1-‐click,
horizontally
grow
cluster,
even
scale
across
datacenters
Consistent,
High
✔
Performance
Consistent
sub-‐millisecond
reads/writes
and
high
throughput
Flexible
Data
Model
✔
Schemaless
data
model
for
rapid
development
✔
No
downBme
for
so`ware
upgrades,
hardware
maintenance,
etc.
Always
On
24x7x365
58
✖
Complex
mulB-‐step
scaling,
coarse
grain
growth
recommended
✖
High
and
inconsistent
latency;
medium
throughput
✖
Very
complex
columnar
data
model
✔
Online
upgrades
and
online
maintenance
60. Enterprise
vs.
Community
Edi6on
Enterprise
Edi6on
Cost
Community
Edi6on
• Free
for
pre-‐producBon
• StarBng
at
$2,499/node/yr
• Free
Produc6on
Readiness
ProducBon
Ready
Unknown
ProducBon
Readiness
Technical
Support
Professional
Support
Community/Forum
Support
Best
Prac6ce
Exper6se
Broad
Best
PracBce
ExperBse
ExperBse
of
community
members
Release
Support
Long-‐Tail
Release
Support
No
Support
for
Old
Releases
License
Type
Commercial
License
Simple
“As
is”
license
Intended
Use
ProducBon
Deployments
Non-‐ProducBon
Use,
Simple
Use
Cases
• Take
open
source
snapshots
• Put
through
rigorous
QA
process,
fix
bugs
• Defined
SLAs
from
the
experts
• Immediate
hot
bug
fixes
• Built
with
latest
open
source
• Undefined
quality
• No
SLA,
unknown
response
Bmes
• No
bugs
fixes
assured
in
a
Bmely
manner
• From
working
with
100s
of
customers
• Including
hot
bug
fixes
• Based
on
limited
individual
experBse
• Must
support
old
releases
yourself
• Std
license
terms,
SLA,
indemnity
60
61. Next
Steps:
The
Path
to
Produc6on
ü
Couchbase
Overview
–
Done
2. Technical
session
• Who:
Architects/Developers
and
Couchbase
SoluBons
Architect
• Deployment
and
development
best
pracBces
• OperaBons
deep-‐dive
• Tech
Q&A
3. Design
review
• Review
and
feedback
on
design
and
feature
usage
• Verify
assumpBons
and
use
of
Couchbase
APIs
4. Produc6on
deployment
61
62. Thank
you
Couchbase
NoSQL
Document
Database
62
64. Couchbase
Server
Features
• Graphical
monitoring
and
admin
console
with
RESTful
interface
Single
click
cluster
resizing,
powerful
monitoring
accessible
via
every
node
• Online
upgrades,
backups
and
database
maintenance
Database
is
always
online,
applicaBons
can
be
available
24x7x365
• All
nodes
are
iden6cal
Easy
provisioning
on
all
supported
plaxorms:
Linux,
Windows,
MacOS
• Supported
SDKs
for
all
common
languages
Easy
adopBon
by
developers
for
Java,
.NET,
PHP,
Ruby,
C/C++
64
65. Couchbase
performance
with
varying
document
sizes
Consistently
low
latencies
sub-‐millisecond
for
varying
documents
sizes
with
a
mixed
workload
65
66. Enterprise
Edi6on
Subscrip6ons
Standard
Premium
$2499
$4499
Per
node
Per
node
10
X
5
24
X
7
7am
–
5pm
Pacific
Standard
Time
N/A
Phone,
Email,
Web,
Forums
Phone,
Email,
Web,
Forums
P1
Response
Time
5
hours
2
hours
P2
Response
Time
1
day
5
hours
Update
releases
Yes
Yes
Ho{ixes
Yes
Yes
Technical
Alerts
Yes
Yes
Price/node
Licensing
Support
Hours
Hours
of
Opera6on
Support
Channel
hhp://www.couchbase.com/couchbase-‐support-‐and-‐subscripBons
66
68. Challenges
with
a
Memcached
Tier
Problem
Cold
Cache
Heavy
RDBMS
Conten6on
Lack
of
Scalability
Complex
Monitoring
Symptoms
Couchbase
Solu6on
Slowdown
or
collapse
of
the
data
service
layer
due
to
heavily
overloaded
RDBMS
when
memcached
nodes
go
down
(on
failure
or
for
maintenance)
Data
is
automaBcally
replicated
across
the
Couchbase
cluster,
providing
high
availability
of
data
even
on
failures
MulBple
requests
for
data
items
that
do
not
exist
in
the
cache
results
in
sudden
shi`ing
of
load
to
the
relaBonal
database
causing
heavy
contenBon
By
replicaBng
data
across
the
cluster,
Couchbase
Server
provides
consistent
performance
without
shi`ing
load
to
the
RDBMS
layer
Adding
or
removing
memcached
nodes
is
complicated
and
causes
unpredictable
applicaBon
performance
degradaBon
Auto-‐sharding
and
online
rebalancing
in
Couchbase
Server
provides
easy
non-‐
disrupBve
expansion
of
the
cluster
Management
of
individual
Couchbase
Server
provides
an
in-‐built
memcached
nodes
increases
the
admin
console
for
cluster
wide
complexity
of
operaBons
and
lacks
a
management
and
monitoring
as
well
as
single
consistent
view
of
the
caching
RESTful
APIs
for
easy
automaBon
and
layer
third-‐party
integraBon
68
69. Before
and
Azer:
Replacing
Caching
Tier
with
Couchbase
Server
69
70. Memcached
Tier
Replacement:
How
it
Works
• Fully
memcached
protocol
compa6ble
• Easy
to
replace
a
6er
of
individual
memcached
servers
with
a
Couchbase
Server
cluster
• The
cluster
receives
reads
and
writes,
keeps
frequently
accessed
items
in
memory,
persists
and
shards
and
replicates
the
data
amongst
the
cluster
• Reads
and
writes
are
s6ll
as
low
latency
and
high
throughput
as
memcached
• User
gets
all
the
scalability
and
high-‐availability
advantages
of
a
Couchbase
Server
cluster
70
72. How
to
Prepare
Your
Social
Game
for
Massive
Growth
Published
February
2,
2012
Five
days
later…
hhp://mashable.com/2012/02/01/social-‐game-‐prepare-‐growth/
72