This presentation was written by Wagner Bianchi for the presentation on the Oracle Consulting Team/Professional Services meeting that took place in San Francisco/CA.
What's new in MySQL Cluster 7.4 webinar chartsAndrew Morgan
MySQL Cluster powers the subscriber databases of major communication services providers as well as next generation web, cloud, social and mobile applications. It is designed to deliver:
- Real-time, in-memory performance for both OLTP and analytics workloads
- Linear scale-out for both reads and writes
99.999% High Availability
- Transparent, cross-shard transactions and joins
- Update-Anywhere Geographic replication
- SQL or native NoSQL APIs
All that while still providing full ACID transactions.
Get the best out of MySQL Cluster, presentation covers:
- Tuning and optimization to exploit the auto-sharded, distributed design of MySQL Cluster
- Using Adaptive Query Localization to scale cross-shard JOINs
- Data access patterns, schema and query optimizations
- Recommended tuning parameters
Tune in to the on-demand webinar: http://www.mysql.com/news-and-events/on-demand-webinars/display-od-719.html
FOSDEM 2015 - NoSQL and SQL the best of both worldsAndrew Morgan
There’s a lot of excitement around NoSQL Data Stores with the promise of simple access patterns, flexible schemas, scalability and High Availability. The downside comes in the form of losing ACID transactions, consistency, flexible queries and data integrity checks. What if you could have the best of both worlds? This session shows how MySQL Cluster provides simultaneous SQL and native NoSQL access to your data – whether a simple key-value API (Memcached), REST, JavaScript, Java or C++. You will hear how the MySQL Cluster architecture delivers in-memory real-time performance, 99.999% availability, on-line maintenance and linear, horizontal scalability through transparent auto-sharding.
This presentation was written by Wagner Bianchi for the presentation on the Oracle Consulting Team/Professional Services meeting that took place in San Francisco/CA.
What's new in MySQL Cluster 7.4 webinar chartsAndrew Morgan
MySQL Cluster powers the subscriber databases of major communication services providers as well as next generation web, cloud, social and mobile applications. It is designed to deliver:
- Real-time, in-memory performance for both OLTP and analytics workloads
- Linear scale-out for both reads and writes
99.999% High Availability
- Transparent, cross-shard transactions and joins
- Update-Anywhere Geographic replication
- SQL or native NoSQL APIs
All that while still providing full ACID transactions.
Get the best out of MySQL Cluster, presentation covers:
- Tuning and optimization to exploit the auto-sharded, distributed design of MySQL Cluster
- Using Adaptive Query Localization to scale cross-shard JOINs
- Data access patterns, schema and query optimizations
- Recommended tuning parameters
Tune in to the on-demand webinar: http://www.mysql.com/news-and-events/on-demand-webinars/display-od-719.html
FOSDEM 2015 - NoSQL and SQL the best of both worldsAndrew Morgan
There’s a lot of excitement around NoSQL Data Stores with the promise of simple access patterns, flexible schemas, scalability and High Availability. The downside comes in the form of losing ACID transactions, consistency, flexible queries and data integrity checks. What if you could have the best of both worlds? This session shows how MySQL Cluster provides simultaneous SQL and native NoSQL access to your data – whether a simple key-value API (Memcached), REST, JavaScript, Java or C++. You will hear how the MySQL Cluster architecture delivers in-memory real-time performance, 99.999% availability, on-line maintenance and linear, horizontal scalability through transparent auto-sharding.
Conference slides: MySQL Cluster Performance TuningSeveralnines
This presentation goes through performance tuning basics in MySQL Cluster.
It also covers the new parameters and status variables of MySQL Cluster 7.2 to determine issues with e.g disk data performance and query (join) performance.
Conference tutorial: MySQL Cluster as NoSQLSeveralnines
Slides from the 'MySQL Cluster as NoSQL' tutorial at Percona Live MySQL Conference 2012 in London.
Tutorial covers:
*MySQL Cluster administration
* NoSQL options for MySQL Cluster and when to use what
* Memcached (installation and configuration)
* Cluster/J
* NDBAPI
* Benchmarking of different access methods on a live cluster
MySQL Cluster (NDB) - Best Practices Percona Live 2017Severalnines
This presentation by Johan Andersson at Percona Live 2017 in Santa Clara, California gives detailed information on all you need to know to effectively deploy and manage MySQL Cluster technology in your environment.
Choosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQLScaleBase
In this webinar Matt Aslett of 451 Research joins ScaleBase to discuss the benefits and drawbacks of NoSQL, NewSQL & MySQL databases and explores real-life use cases for each.
MOUG17 Keynote: Oracle OpenWorld Major AnnouncementsMonica Li
Midwest Oracle Users Group Training Day 2017 Presentation by Rich Niemiec, Chief Innovation Officer at Viscosity North America.
Catch up on OOW17's top announcements in this 1 hour presentation.
The Real Scoop on Migrating from Oracle DatabasesEDB
During this presentation you will be provided with actionable guidelines to:
• Identify the right applications to migrate
• Easily and safely migrate your applications
• Leverage resources before, during and after your migration
• Learn how to achieve independence from Oracle databases - without sacrificing performance.
The Software as a Service or SaaS market is large and growing. Demands of 24/7 availability, high performance, back-up, security, affordability, scalability, manageability, audit ability and easy integration when delivering your product and or service to your customers, are business challenges which we will address in this presentation. By demonstrating MySQL’s proven ability in this area, we will show how we can help new and seasoned SaaS vendors.
Python Utilities for Managing MySQL DatabasesMats Kindahl
Managing a MySQL database server can become a full time job. What we need are tools that bundle a set of related tasks into a common utility. While there are several such utility libraries to choose, it is often the case that you need to customize them to your needs. The MySQL Utilities library is the answer to that need. It is open source so you can modify and expand it as you see fit.
This is the presentation from OSCON 2011 in Portland.
Severalnines Self-Training: MySQL® Cluster - Part VISeveralnines
Part VI of our free self-training slides on MySQL Cluster.
In this part we cover ’Configuration and Installation'
* Data Node configuration
* SQL Node configuration
* Important parameters
* Installation
* Upgrading
Elastic Scalability in MySQL Fabric Using OpenStackMats Kindahl
Elastic scalability, the ability to quickly adapt to changing demands for resources, is critical to running modern applications. Both over- and underallocation of resources have an impact on a business’s bottom line. OpenStack is a cloud operating system that achieves elastic scalability by managing the allocation of compute, storage, and network resources. MySQL Fabric is a new member of the community enabling large database systems to be managed easily, providing support for handling high availability and sharding. In this session, you will learn how to leverage OpenStack and MySQL Fabric to build a system in which resources can be added on demand, providing elastic scalability, sharding, and high availability as a single system.
During this presentation, Craig Silviera, WW Director of Field Engineering described the steps to take to dramatically reduce your IT infrastructure costs when you make the switch from Oracle.
Craig provided actionable steps to:
• Identify the right applications to migrate
• Easily and safely migrate your applications
• Leverage resources before, during and after your migration
To learn more about migrating your database from Oracle to Postgres,
please email info@enterprisedb.com and someone will follow up with you asap.
DB12c: All You Need to Know About the Resource ManagerMaris Elsins
This presentation is different from the previous uploads as SLOB was used for the testing.
Oracle Database 12c Multitenant provides the highest level of Oracle Database resource efficiency, driven by an improved resource manager. The 12c resource manager effectively allocates resources both within a single database and between multiple pluggable databases in a container. This presentation will review new features of the 12c resource manager, provide guidelines for migration of your current resource management plan to 12c, and will also look into how much overhead the resource manager introduces.
Conference slides: MySQL Cluster Performance TuningSeveralnines
This presentation goes through performance tuning basics in MySQL Cluster.
It also covers the new parameters and status variables of MySQL Cluster 7.2 to determine issues with e.g disk data performance and query (join) performance.
Conference tutorial: MySQL Cluster as NoSQLSeveralnines
Slides from the 'MySQL Cluster as NoSQL' tutorial at Percona Live MySQL Conference 2012 in London.
Tutorial covers:
*MySQL Cluster administration
* NoSQL options for MySQL Cluster and when to use what
* Memcached (installation and configuration)
* Cluster/J
* NDBAPI
* Benchmarking of different access methods on a live cluster
MySQL Cluster (NDB) - Best Practices Percona Live 2017Severalnines
This presentation by Johan Andersson at Percona Live 2017 in Santa Clara, California gives detailed information on all you need to know to effectively deploy and manage MySQL Cluster technology in your environment.
Choosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQLScaleBase
In this webinar Matt Aslett of 451 Research joins ScaleBase to discuss the benefits and drawbacks of NoSQL, NewSQL & MySQL databases and explores real-life use cases for each.
MOUG17 Keynote: Oracle OpenWorld Major AnnouncementsMonica Li
Midwest Oracle Users Group Training Day 2017 Presentation by Rich Niemiec, Chief Innovation Officer at Viscosity North America.
Catch up on OOW17's top announcements in this 1 hour presentation.
The Real Scoop on Migrating from Oracle DatabasesEDB
During this presentation you will be provided with actionable guidelines to:
• Identify the right applications to migrate
• Easily and safely migrate your applications
• Leverage resources before, during and after your migration
• Learn how to achieve independence from Oracle databases - without sacrificing performance.
The Software as a Service or SaaS market is large and growing. Demands of 24/7 availability, high performance, back-up, security, affordability, scalability, manageability, audit ability and easy integration when delivering your product and or service to your customers, are business challenges which we will address in this presentation. By demonstrating MySQL’s proven ability in this area, we will show how we can help new and seasoned SaaS vendors.
Python Utilities for Managing MySQL DatabasesMats Kindahl
Managing a MySQL database server can become a full time job. What we need are tools that bundle a set of related tasks into a common utility. While there are several such utility libraries to choose, it is often the case that you need to customize them to your needs. The MySQL Utilities library is the answer to that need. It is open source so you can modify and expand it as you see fit.
This is the presentation from OSCON 2011 in Portland.
Severalnines Self-Training: MySQL® Cluster - Part VISeveralnines
Part VI of our free self-training slides on MySQL Cluster.
In this part we cover ’Configuration and Installation'
* Data Node configuration
* SQL Node configuration
* Important parameters
* Installation
* Upgrading
Elastic Scalability in MySQL Fabric Using OpenStackMats Kindahl
Elastic scalability, the ability to quickly adapt to changing demands for resources, is critical to running modern applications. Both over- and underallocation of resources have an impact on a business’s bottom line. OpenStack is a cloud operating system that achieves elastic scalability by managing the allocation of compute, storage, and network resources. MySQL Fabric is a new member of the community enabling large database systems to be managed easily, providing support for handling high availability and sharding. In this session, you will learn how to leverage OpenStack and MySQL Fabric to build a system in which resources can be added on demand, providing elastic scalability, sharding, and high availability as a single system.
During this presentation, Craig Silviera, WW Director of Field Engineering described the steps to take to dramatically reduce your IT infrastructure costs when you make the switch from Oracle.
Craig provided actionable steps to:
• Identify the right applications to migrate
• Easily and safely migrate your applications
• Leverage resources before, during and after your migration
To learn more about migrating your database from Oracle to Postgres,
please email info@enterprisedb.com and someone will follow up with you asap.
DB12c: All You Need to Know About the Resource ManagerMaris Elsins
This presentation is different from the previous uploads as SLOB was used for the testing.
Oracle Database 12c Multitenant provides the highest level of Oracle Database resource efficiency, driven by an improved resource manager. The 12c resource manager effectively allocates resources both within a single database and between multiple pluggable databases in a container. This presentation will review new features of the 12c resource manager, provide guidelines for migration of your current resource management plan to 12c, and will also look into how much overhead the resource manager introduces.
In this presentation Nachum Shacham talks about the uses and qualities of Big Data, and how they are utilised where he works at PayPal. He talks about the ultimate goal of extracting business value, as well as unlocking the true value of your data through use of algorithms and sufficient data further down the long tail.
Driving Sales Execution with Compelling MessagingRussell Scherwin
Focuses on compelling messaging as the tip of the arrow for optimizing revenue operations and sales execution.
350+ attendee’s at Atlanta’s Product Camp awarded me best presenter in delivering this.
The little engine(s) that could: scaling online social networksJosep M. Pujol
Presentation of the SIGCOMM 2010 paper: "The little engine(s) that could: scaling online social networks".
The full paper is available at:
http://ccr.sigcomm.org/online/?q=node/642
or at
http://research.tid.es/jmps/
Paper at: http://www2016.net/proceedings/proceedings/p121.pdf
Abstract: Online tracking poses a serious privacy challenge that has drawn significant attention in both academia and industry. Existing approaches for preventing user tracking, based on curated blocklists, suffer from limited coverage and coarse-grained resolution for classification, rely on exceptions that impact sites’ functionality and appearance, and require significant manual maintenance. In this paper we propose a novel approach, based on the concepts leveraged from k-Anonymity, in which users collectively identify unsafe data elements, which have the potential to identify uniquely an individual user, and remove them from requests. We deployed our system to 200,000 German users running the Cliqz Browser or the Cliqz Firefox extension to evaluate its efficiency and feasibility. Results indicate that our approach achieves better privacy protection than blocklists, as provided by Disconnect, while keeping the site breakage to a minimum, even lower than the community-optimized Ad-Block Plus. We also provide evidence of the prevalence and reach of trackers to over 21 million pages of 350,000 unique sites, the largest scale empirical evaluation to date. 95% of the pages visited contain 3rd party requests to potential trackers and 78% attempt to transfer unsafe data. Tracker organizations are also ranked, showing that a single organization can reach up to 42% of all page visits in Germany.
A Global In-memory Data System for MySQLDaniel Austin
This is my presentation from Percona Live! London last year. I describe my Big Data system built on MySQL/NDB and show that we can preserve the relational model for most Big Data purposes in the real world.
"Navigating the Database Universe" by Dr. Michael Stonebraker and Scott Jarr,...lisapaglia
Webinar presentation delivered by Dr. Michael Stonebraker and Scott Jarr of VoltDB on December 11, 2012. www.voltdb.com
The design decisions you make today will have a huge performance impact down the line. Until recently, when it came to databases, the choice was easy. Essentially, you had one option: the RDBMS. Today, there's a new universe of databases being thrown into production — and not always with the greatest success. How do you make the right choice for your next application? Database pioneer Dr. Michael Stonebraker and VoltDB co-founder Scott Jarr have some thoughts.
Big Data Cloud Meetup - Jan 29 2013 - Mike Stonebraker & Scott Jarr of VoltDBBigDataCloud
"Navigating the Database Universe" was the topic of the Big Data Cloud meetup held on Jan 24th 2013 in Santa Clara, CA. This is the presentation made by Mike Stonebraker & Scott Jarr of VoltDB.
This meetup was sponsored by VoltDb.
Jay Kreps on Project Voldemort Scaling Simple Storage At LinkedInLinkedIn
Jay Kreps on Project Voldemort Scaling Simple Storage At LinkedIn. This was a presentation made at QCon 2009 and is embedded on LinkedIn's blog - http://blog.linkedin.com/
NoSQL databases like MongoDB, Elasticsearch, and Cassandra are synonymous with scalability, search, and developer agility. But there’s a downside...having to give up the ease and comfort of SQL.
Or do you?
Join this webcast to learn how the newest databases, like CrateDB and CockroachDB deliver the benefits of NoSQL with the ease of SQL by building SQL engines on top of custom NoSQL technology stacks. Database industry veteran Andy Ellicott, who helped launch Vertica, VoltDB, Cloudant, and now with Crate.io, will provide a no-BS view of current DBMS architectures and predictions for the future of data.
If you’re a DBMS user, this webcast will help you make sense of a very crowded DBMS market and make better-informed decisions for your new tech stacks.
NoSQL is not a buzzword anymore. The array of non- relational technologies have found wide-scale adoption even in non-Internet scale focus areas. With the advent of the Cloud...the churn has increased even more yet there is no crystal clear guidance on adoption techniques and architectural choices surrounding the plethora of options available. This session initiates you into the whys & wherefores, architectural patterns, caveats and techniques that will augment your decision making process & boost your perception of architecting scalable, fault-tolerant & distributed solutions.
OSDC 2018 | The operational brain: how new Paradigms like Machine Learning ar...NETWAYS
With the advent of IoT, companies have the opportunity to put larger and larger volumes of machine data to work to optimize operations like manufacturing production, safety, security, user experience. Yet, they are finding that the old paradigms of processing this data do not help mainstream developers keep pace with the velocity of data, new analytic algorithms, and the need for real-time insight. Jodok Batlogg, founder and CTO of Crate.io, believes that the solution to this problem lies at the nexus of modern open source distributed database architectures, machine learning/AI, and IoT networking. These technologies will combine to create a new data management paradigm that moves beyond traditional conceptions of databases. He believes the future lies in a central nervous system, an “operational brain” that connects directly to sensory inputs and applies artificial intelligence to control, predict, and monitor systems and things in real time. In this session, Jodok will use-real world, in-production manufacturing and cybersecurity examples of “operational brains” at work to explain the new paradigm, and discuss the concrete steps organizations can take to implement them.
Solr cloud the 'search first' nosql database extended deep divelucenerevolution
Presented by Mark Miller, Software Engineer, Cloudera
As the NoSQL ecosystem looks to integrate great search, great search is naturally beginning to expose many NoSQL features. Will these Goliath's collide? Or will they remain specialized while intermingling – two sides of the same coin.
Come learn about where SolrCloud fits into the NoSQL landscape. What can it do? What will it do? And how will the big data, NoSQL, Search ecosystem evolve. If you are interested in Big Data, NoSQL, distributed systems, CAP theorem and other hype filled terms, than this talk may be for you.
Presentation from Bosch Connected World, providing an overview of Blockchain, applications within the IoT, and how to get started evaluating the potential benefits
10-Step Methodology to Building a Single View with MongoDBMat Keep
MongoDB has been used in many single view projects across enterprises of all sizes and industries. In this session, we will share the best practices we have observed and institutionalized over the years. By attending the webinar, you will learn:
- A repeatable, 10-step methodology to successfully delivering a single view
- The required technology capabilities and tools to accelerate project delivery
- Case studies from customers who have built transformational single view applications on MongoDB.
An overview of how MongoDB and Apache Spark are enabling IoT innovators to create new business models.
Presented at the Business of IoT meetup in London on Jan 13th 2016
Overview of security best practices and the new security architecture of MongoDB, including latest features in MongoDB 2.6, including user-defined roles, LDAP, PKI integration, field level redaction, auditing and more
This presentation is designed to assist Developers, Architects and DBAs in navigating the complex waters of HA. It presents:
- Methodology for selecting the right HA solution to meet SLAs;
Tour of the leading certified HA solutions for MySQL;
Operational best practices to implement and support HA.
MySQL Cluster 7.2 added support for the Memcached API, enabling web services to directly query MySQL Cluster using the Memcached API, and adding a persistent, scalable, fault tolerant backend to Memcached.
The slides take you through the design concepts and introduce a sample social media app built using memcached and MySQL Cluster
Replication enhancements in MySQL 5.6, including GTIDs, HA / Self-healing, multi-threaded slaves and more. Slides over design rationale, implementation and how to get started using these new capabilities
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
PayPal Big Data and MySQL Cluster
1. Big Data is a Big Scam (Most of the Time)
Daniel Austin, PayPal Technical Staff
MySQL Connect Conference
September 30, 2012 v1.2
2. Today’s Agenda
Big Myths about Big Data
YESQL: A Counterexample
Q&A
Global In-memory MySQL Confidential and Proprietary 2
3. THE FUNDAMENTAL PROBLEM IN
DISTRIBUTED DATA SYSTEMS
“How Do We Manage Reliable
Distribution of Data Across Geographical
Distances?”
Confidential and Proprietary
4. The NoSQL Solution
• NoSQL Systems provide a solution that relaxes
many of the common constraints of typical
RDBMS systems
– Slow - RDBMS has not scaled with CPUs
– Often require complex data management (SOX,
SOR)
– Costly to build and maintain, slow to change and
adapt
– Intolerant of CAP models (more on this later)
• Non-relational models, usually key-value
• May be batched or streaming
• Not necessarily distributed geographically
Confidential and Proprietary
5. Big Data Myth #1: Big Data = NoSQL
• ‘Big Data’ Refers to a Common Set of Problems
– Large Volumes
– High Rates of Change
• Of Data
• Of Data Models
• Of Data Presentation and Output
– Often Require ‘Fast Data’ as well as ‘Big’
• Near-real Time Analytics
• Mapping Complex Structures
Takeaway: Big Data is the problem, NoSQL is one
(proposed) solution
Confidential and Proprietary
6. 3 Kinds of Big Data Systems
1. Columnar K-V Systems
– Hadoop, Hbase, Cassandra, PNUTs
2. Document-Based
– MongoDB, TerraCotta
3. Graph-Based
– FlockDB, Voldemort
Takeaway: These were originally designed as
solutions to specific problems because no
commercial solution would work.
Confidential and Proprietary
7. Big Data Hype Cycle: Where Are We Now?
There are currently more than 120+ NoSQL
databases listed at nosql-databases.com!
You Are Here ?
As the pace of new technology solutions has slowed, some clear winners have emerged.
Confidential and Proprietary
8. Big Data Myth #2: The CAP Theorem Doesn’t
Say What You Think It Does
• Consistency, Availability, (Network) Partition
• The Real Story: These are not Independent
Variables
• AP =CP (Um, what? But…A != C )
• Variations:
– PACELC (adds latency tolerance)
Takeaway: the real story here is about the tradeoffs
made by designers of different systems, and the
main tradeoff is between consistency and
availability, usually in favor of the latter.
Confidential and Proprietary
9. Big Data Myth: You Need A Big Data System
Well, Maybe….But Before You Go There…
There are essentially two ‘Big Data Problems’:
“I have too much data and it’s coming in too fast to
handle with any RDBMS.”
“I have a lot of data distributed geographically and
need to be able to read and write from anywhere in
near real-time.”
Takeaway: if you have one of these Big Data
problems, a NoSQL solution might work for you.
But there are also other alternatives…
Confidential and Proprietary
10. BIG DATA MYTH #3: BIG DATA AND NOSQL
ARE NEW IDEAS
• The first and most successful
such system is DNS, created in
1983.
• Began with flat files
• Currently serves the entire
Internet (!)
• DNS is an AP system,
availability is #1
• Many extensions complicate a
simple design
• Suggests a new term for CAP-
like ideas: variability
• DNS variability is very high,
often 2-3x the mean
Confidential and Proprietary
11. Today’s Agenda
Big Myths About Big Data
YESQL: A Counterexample
Q&A
Global In-memory MySQL Confidential and Proprietary 11
12. Mission YESQL
“Develop a globally distributed DB For
user-related data.”
• Must Not Fail (99.999%)
• Must Not Lose Data. Period.
• Must Support Transactions
• Must Support (some) SQL
• Must WriteRead 32-bit integer globally in
1000ms
• Maximum Data Volume: 100 TB
• Must Scale Linearly with Costs
Confidential and Proprietary
13. What about “High Performance”?
• Maximum lightspeed distance on Earth’s
Surface: ~67 ms
• Target: data available worldwide in < 1000 ms
Sound Easy?
Think Again!
Confidential and Proprietary
14. WHY MYSQL CLUSTER?
Pro Con
• True HA by design • Some semantic
– Fast recovery limitations on fields
• Supports (some) X- • Size constraints (2
actions TB?)
• Relational Model – Hardware limits
• In-memory also
architecture = high • Higher cost/byte
performance • Requires reasonable
• Disk storage for data partitioning
non-indexed data • Higher complexity
• APIs, APIs, APIs
Confidential and Proprietary
15. How MySQL Cluster Works in 1 Slide
Graphics courtesy dev.mysql.com
Confidential and Proprietary
17. AVAILABILITY DEFINED
• Availability of the entire
system:
n m
Asys = 1 – Π(1-Πri)j V
i=1 j=1 I
P
• Number of Parallel
Components Needed to
Achieve Availability Amin:
Parallel
Serial
Nmin = [ln(1-Amin)/ln(1-r)]
Confidential and Proprietary
18. AWS Meets MySQL Cluster
• Why AWS?
– Cheap and easy infrastructure-in-a-box
(Or so I thought! Ha!)
• Services Used:
– EC2 (Centos 5.3, small instances for mgm
& query nodes, XL for data
– Elastic IPs/ELB
– EBS Volumes
– S3
– Cloudwatch
Confidential and Proprietary
19. ARCHITECTURAL TILES
AWS Availability Zones
Tiling Rules
• Never separate NDB & SQL
A B
• Ndb:2-SQL:1-MGM:1
• Scale by adding more tiles
• Failover 1st to nearest AZ
• Then to nearest DC
• At least 1 replica/AZ
C ELB
• Don’t share nodes
• Mgmt nodes are redundant
Limitations Unused (not present in all locations)
• AWS is network-bound @
250 MBPS – ouch!
• Need specific ACL across AZ Data Mgmt SQL
boundaries Node Node Node
• AZs not uniform!
• No GSLB
• Dynamic IPs
• ELB sticky sessions !reliable
Confidential and Proprietary
20. Architecture Stack
Scale by Tiling
A B A B A B
A B
A B
A B A B
5 AWS Data Centers: US-E, US-W, TK, EU, AS
Confidential and Proprietary
21. Other Technologies Considered
• Paxos
– Elegant-but-complex consensus-based
messaging protocol
– Used in Google Megastore, Bing metadata
• Java Query Caching
– Queries as serialized objects
– Not yet working
• Multiple Ring Architectures
– Even more complicated = no way
Confidential and Proprietary
22. SYSTEM READ/WRITE PERFORMANCE (!)
What we tested:
• 32 & 256 byte char fields
In-region replication tests
• Reads, writes, query speed vs. volume
• Data replication speeds
Results:
• Global replication < 350 ms
• 256 byte read < 10ms worldwide
06/19/2011 06/20/2011 06/21/2011 06/22/2011 06/23/2011
Confidential and Proprietary
23. Data Models and Query Optimization
• Network Latency is an obvious issue
• Data model requires all segments present
in each geo-region
• Parameterized (Linked) Joins
– Adaptive Query Localization (SIP)
technique from Clustra (see Clement
Frazer’s blog for details)
Confidential and Proprietary
24. Conservation of Timestamps or The
Commit Ordering Problem
• Why does commit ordering matter?
• Write operators are non-commutative
[W(d,t1),W(d,t2)] != 0 unless t1=t2
– Can lead to inconsistency
– Can lead to timestamp corruption
– Forcing sequential writes defeats Amdahl’s
rule
• Can show up in GSLB scenarios
Confidential and Proprietary
25. Hard Lessons, Shared
• Be Careful…
– With “Eventual Consistency”-related concepts
– ACID, CAP are not really as well-defined as we’d like
considering how often we invoke them
• MySQL Cluster is a good solution
– Real HA, real SQL
– Notable limitations around fields, datatypes
– Successfully competes with NoSQL systems for most use
cases – better in many cases
• NoSQL Systems
– All have relatively low levels of maturity
– More suitable for simpler key-value models
– Victim of Tech Fashion
Confidential and Proprietary
26. Future Directions
• Alternate solution using Pacemaker,
Heartbeat
– From Yves Trudeau @ Percona
– Uses InnoDB, not NDB
• Implement Memcached plugin
– To test NoSQL functionality, APIs
• Add simple connection-based persistence
to preserve connections during failover
• Better data node distribution
• Better testing & monitoring
Confidential and Proprietary
27. Summing Up On YESQL v0.85
• It works! Far better than expected.
• Very fast, very reliable
• Reduced complexity since v0.7
• AWS poses challenges that private data
centers may not experience
• You can achieve high performance and
availability without giving up relational
models and read consistency!
Confidential and Proprietary
28. The Big Picture on Big Data
• Only use Big Data solutions when you have a real
Big Data problem.
– Don’t be a Dedicated Follower of Tech Fashion!
• Not all Big Data solutions are created equal
– What tradeoffs are most important to you?
– Consistency, Fault Tolerance, Availability,
Performance, Variability
• Is your data model a fit for NoSQL?
– You don’t have to give up the relational model in
most cases, so don’t!
• You can achieve high performance and availability
without giving up relational models and read
consistency! Just say YESQL!
Confidential and Proprietary
29. “In the long run, we are all dead
eventually consistent.”
Maynard Keynes on NoSQL Databases
Twitter: @daniel_b_austin
Email: daaustin@paypal.com
With apologies and thanks to the real DB experts, Andrew Goodman, Yves
Trudeau, Frazer Clement, Daniel Abadi, Kent Beck, and everyone else who
contributed. It really works!