Victor shares developer-focused, database-agnostic performance tips from his experience consulting on a database migration project for a major telecom.
2. Me
ThoughtWorks Consultant
Supported Database Customer Migration Project at Shaw
Cable (Major Canadian telecom) for ~2 years
OOP background
Limited (typical?) SQL experience prior to Shaw
Today: sharing lessons learned from the project
3. Customer Migration @ Shaw in a Nutshell
Large Data Volume
Tables with 10 million+ rows
80+ GB of data per city
Full build (migration) took ~6hrs
MSSQL, MySQL, Oracle, proprietary databases, and flat files
4. Database Development Misconceptions
Only testing with small datasets, but using large ones in production
.. expecting the same performance
5. Reality
Everything changes when large data is involved
Ideally, performance test against comparable real-world volume
80/20 rule
6. Database Development Misconceptions
ORMs will “automagically” take care of everything
var sessionFactory = Fluently.Configure()
.Mappings(m => m.AutoMappings
.Add(AutoMap.AssemblyOf<Product>()))
.BuildSessionFactory();
7. Reality
ORMs are great for small-data greenfield projects
ORMs hide optimization abilities and database-specific features
Performance tweaks become increasingly important the larger the data you are dealing with
8. Reduce Row Size
Use the smallest possible datatype
(SMALLINT instead of BIGINT, etc.)
Use INTEGER instead of GUID for keys
Use fixed length where possible
(CHAR instead of VARCHAR)
Prefer NOT NULL columns over nullable ones
10. NULL != NULL in SQL
SELECT 'SOME RESULT' WHERE NULL = NULL
No rows returned
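This is easy to see for yourself with Python's built-in sqlite3 module; the behaviour is the same in mainstream SQL databases, where `NULL = NULL` evaluates to "unknown" rather than true:

```python
import sqlite3

con = sqlite3.connect(":memory:")

# NULL = NULL evaluates to NULL ("unknown"), not TRUE, so the WHERE filters the row out
rows = con.execute("SELECT 'SOME RESULT' WHERE NULL = NULL").fetchall()

# IS NULL is the correct way to test for NULL
rows_is = con.execute("SELECT 'SOME RESULT' WHERE NULL IS NULL").fetchall()

print(rows)     # []
print(rows_is)  # [('SOME RESULT',)]
```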
11. Empty String != Empty String in Oracle
SELECT 'SOME RESULT' FROM DUAL WHERE '' = ''
No rows returned
(Oracle treats the empty string as NULL)
12. Avoid N+1 Problem
// N+1: one query for the employees, then one more query per employee
for each (SELECT * FROM EMPLOYEES)
    for each (SELECT * FROM SALES WHERE SALES.EMP_ID = employee.EMP_ID)
        // do something with sale

// Better: a single JOIN
for each (SELECT * FROM SALES JOIN EMPLOYEES USING (EMP_ID))
    // do something with sale
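The two approaches can be sketched with Python's sqlite3 module (the table and column names here are illustrative, not from the Shaw project); both produce the same rows, but the N+1 version issues one query per employee:

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
    CREATE TABLE EMPLOYEES (EMP_ID INTEGER PRIMARY KEY, NAME TEXT);
    CREATE TABLE SALES (SALE_ID INTEGER PRIMARY KEY, EMP_ID INTEGER, AMOUNT REAL);
    INSERT INTO EMPLOYEES VALUES (1, 'Ann'), (2, 'Bob');
    INSERT INTO SALES VALUES (10, 1, 99.0), (11, 1, 25.0), (12, 2, 10.0);
""")

# N+1: one query for the employees, then one more query (round trip) per employee
n_plus_1 = []
for (emp_id, _name) in con.execute("SELECT * FROM EMPLOYEES").fetchall():
    for sale in con.execute("SELECT * FROM SALES WHERE EMP_ID = ?", (emp_id,)):
        n_plus_1.append(sale)

# Single JOIN: one query and one round trip for the same rows
joined = con.execute("""
    SELECT SALES.SALE_ID, SALES.EMP_ID, SALES.AMOUNT
    FROM SALES JOIN EMPLOYEES ON SALES.EMP_ID = EMPLOYEES.EMP_ID
""").fetchall()
```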
13. Give the database as much work as possible
Reduce SQL database calls / network round trips
Defer query decisions to the database; let it choose the optimum evaluation plan
Prefer SQL code to procedural logic (loops, cursors, separate calls)
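A minimal sketch of procedural versus set-based work, using Python's sqlite3 (the tables and the 1.1 multiplier are invented for illustration); both end with the same data, but the loop issues one UPDATE per row while the set-based form is a single statement the database can plan as a whole:

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
    CREATE TABLE A (ID INTEGER PRIMARY KEY, AMOUNT REAL);
    CREATE TABLE B (ID INTEGER PRIMARY KEY, AMOUNT REAL);
""")
rows = [(i, float(i)) for i in range(100)]
con.executemany("INSERT INTO A VALUES (?, ?)", rows)
con.executemany("INSERT INTO B VALUES (?, ?)", rows)

# Procedural: fetch every row, compute in the application, write back one by one
# (in a client/server database this is one network round trip per row)
for (row_id, amount) in con.execute("SELECT ID, AMOUNT FROM A").fetchall():
    con.execute("UPDATE A SET AMOUNT = ? WHERE ID = ?", (amount * 1.1, row_id))

# Set-based: one statement; the database chooses the evaluation plan
con.execute("UPDATE B SET AMOUNT = AMOUNT * 1.1")
```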
14. Use BULK operations
sqlldr.exe (Oracle SQL*Loader), bcp.exe (MSSQL), BULK INSERT
FAST insertion of static data
Cleaner code (CSV files instead of INSERT INTO statements)
~10x performance gains observed
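sqlldr and bcp are external command-line tools, but the same idea — loading static data from a CSV in one batched call inside one transaction, instead of row-by-row INSERTs — can be sketched with sqlite3's `executemany` (the city data is invented for illustration):

```python
import csv
import io
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE CITIES (NAME TEXT, POPULATION INTEGER)")

# Static data kept as CSV (as with sqlldr/bcp input files) rather than INSERT statements
csv_data = "Calgary,1306784\nEdmonton,1010899\nVancouver,662248\n"
reader = csv.reader(io.StringIO(csv_data))

# executemany batches all rows through one prepared statement;
# the with-block wraps them in a single transaction (no commit per row)
with con:
    con.executemany("INSERT INTO CITIES VALUES (?, ?)", reader)
```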
15. Add Indexes where needed
Index analogy: the index at the end of a book
Indexes SIGNIFICANTLY speed up searching
(using 'WHERE' criteria)
ORMs don’t add indexes for non-key columns
Determine common searches
~100x performance gains observed
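The effect is easy to observe with EXPLAIN QUERY PLAN; a sketch in Python's sqlite3 (schema invented for illustration) shows the plan switch from a full table scan to an index search once the non-key column is indexed:

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE EMPLOYEES (EMP_ID INTEGER PRIMARY KEY, NAME TEXT)")
con.executemany("INSERT INTO EMPLOYEES VALUES (?, ?)",
                [(i, f"emp{i}") for i in range(1000)])

# Without an index on NAME, the WHERE clause scans every row
plan_before = con.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM EMPLOYEES WHERE NAME = 'emp42'").fetchall()

# ORMs typically index only keys; common non-key search columns need explicit indexes
con.execute("CREATE INDEX IDX_EMP_NAME ON EMPLOYEES (NAME)")
plan_after = con.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM EMPLOYEES WHERE NAME = 'emp42'").fetchall()

print(plan_before[0][-1])  # a SCAN of the whole table
print(plan_after[0][-1])   # a SEARCH using IDX_EMP_NAME
```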
16. Gotcha - Indexes ignored with function usage
Example:
...WHERE UPPER(name) = 'BOB'
The 'name' index will not be used!!
ORMs sometimes insert lower/upper behind the
scenes!!
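The same EXPLAIN QUERY PLAN check shows the gotcha in sqlite3 (illustrative schema): wrapping the indexed column in a function forces a full scan, while the bare column comparison uses the index:

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE EMPLOYEES (EMP_ID INTEGER PRIMARY KEY, NAME TEXT)")
con.execute("CREATE INDEX IDX_EMP_NAME ON EMPLOYEES (NAME)")

# Wrapping the column in a function hides it from the index: full table scan
plan_fn = con.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM EMPLOYEES WHERE UPPER(NAME) = 'BOB'").fetchall()

# The bare column comparison can use the index
plan_plain = con.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM EMPLOYEES WHERE NAME = 'Bob'").fetchall()

print(plan_fn[0][-1])     # SCAN: IDX_EMP_NAME is ignored
print(plan_plain[0][-1])  # SEARCH using IDX_EMP_NAME
```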
18. Statistics
Databases use table statistics (row counts, data ranges, data distribution, etc.) to determine the optimum query evaluation plan
Statistics are not automatically updated when data is changed/inserted!! (unlike indexes)
19. Manually update Statistics on large data changes
Out-of-date statistics can cause the database to make inefficient query evaluation decisions
#1 performance optimization at Shaw
~100x performance gains observed in some situations
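The exact command is database-specific (e.g. DBMS_STATS in Oracle, UPDATE STATISTICS in MSSQL); SQLite's equivalent is ANALYZE, sketched here with an invented schema. ANALYZE records row counts and index selectivity in the `sqlite_stat1` table for the query planner to use:

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE SALES (SALE_ID INTEGER PRIMARY KEY, EMP_ID INTEGER)")
con.execute("CREATE INDEX IDX_SALES_EMP ON SALES (EMP_ID)")
con.executemany("INSERT INTO SALES VALUES (?, ?)",
                [(i, i % 10) for i in range(1000)])

# Manually gather statistics after the large insert
con.execute("ANALYZE")

# The gathered stats are visible in sqlite_stat1:
# 'stat' starts with the total row count, followed by index selectivity figures
stats = con.execute("SELECT tbl, idx, stat FROM sqlite_stat1").fetchall()
print(stats)
```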
20. Other Performance Options
Disable/Remove Constraints
Remove/Minimize Row-based Trigger Usage
Disable Logging
~1.5x performance gains observed
Good choice for test environments
(Be careful!!!)
Use SSDs
~2x performance gains observed
Use RAM
(Be careful!!!)
21. SQL is not Dead
NoSQL is an alternative to, not a replacement for, SQL/RDBMS
22. SQL / RDBMS best for:
Row based data
Static table definitions
Complex Table Relationships / Joins
Transactions