Niko Neugebauer gave a presentation on the columnstore improvements in SQL Server 2016. Some of the key improvements discussed include hybrid transactional/analytical processing (HTAP), new T-SQL syntax for defining columnstore indexes, high availability features like readable secondaries, improved data loading and batch processing performance, new maintenance features, and expanded monitoring capabilities. The presentation provided examples and demonstrations of many of these new columnstore features in SQL Server 2016.
Introduction into the world of Clustered Columnstore Indexes in SQL Server 2014, with explanations of the basic structures and functionalities.
Available data types, limitations and differences to SQL Server 2012 Nonclustered Columnstore Indexes are all described here
Deep dive into Clustered Columnstore structures with information on compression algorithms, compression types, locking and dictionaries, as well as the Batch Processing Mode.
ETL with Clustered Columnstore - PASS Summit 2014Niko Neugebauer
You will find some basic information about ways to extract, load & maintain information with Clustered Columnstore Indexes.
You will need to have some knowledge about Columnstore Indexes structures to use it.
Introduction to CosmosDB - Azure Bootcamp 2018Josh Carlisle
[Session Abstract] Cosmos DB is a globally distributed, multi-model, Serverless, NoSQL database solution that runs on Microsoft Azure. With guaranteed SLAs, various consistency models, and support for multiple APIs, Cosmos DB can have many advantages over common relational database solutions. However, the shift to NoSQL in addition to the numerous configuration options available in Cosmos DB can be a challenge for traditional relational database developers. In this talk, we will take an existing application built on a traditional relational database and update the solution to take advantage of Cosmos DB. Along the way, we will have many decisions to make including which API we should use, how best to model our data, which consistency model to use, and how our data should be indexed, partitioned, and organized. By the end of the talk you should have familiarity with the decisions you will need to make to successfully implement your own solutions on Cosmos DB.
Introducing 3 FREE Smart solutions for SQL Server (Adi Sapir, Docco Labs)
As Database experts, we work with SQL Server Databases on a daily basis. We face the same problems every SQL Administrator and/or developer does. And – we spend our time writing solutions for these problems! In this session Adi will introduce the following 3, totally FREE solutions:
· ClipTable – A revolutionary new *anything* to SQL Table importer
· Database File Explorer – a much easier way to explore our database->filegroups->files->storage mapping
· Log Table Viewer – a complete client/server logger solution for SQL Server
Introduction into the world of Clustered Columnstore Indexes in SQL Server 2014, with explanations of the basic structures and functionalities.
Available data types, limitations and differences to SQL Server 2012 Nonclustered Columnstore Indexes are all described here
Deep dive into Clustered Columnstore structures with information on compression algorithms, compression types, locking and dictionaries, as well as the Batch Processing Mode.
ETL with Clustered Columnstore - PASS Summit 2014Niko Neugebauer
You will find some basic information about ways to extract, load & maintain information with Clustered Columnstore Indexes.
You will need to have some knowledge about Columnstore Indexes structures to use it.
Introduction to CosmosDB - Azure Bootcamp 2018Josh Carlisle
[Session Abstract] Cosmos DB is a globally distributed, multi-model, Serverless, NoSQL database solution that runs on Microsoft Azure. With guaranteed SLAs, various consistency models, and support for multiple APIs, Cosmos DB can have many advantages over common relational database solutions. However, the shift to NoSQL in addition to the numerous configuration options available in Cosmos DB can be a challenge for traditional relational database developers. In this talk, we will take an existing application built on a traditional relational database and update the solution to take advantage of Cosmos DB. Along the way, we will have many decisions to make including which API we should use, how best to model our data, which consistency model to use, and how our data should be indexed, partitioned, and organized. By the end of the talk you should have familiarity with the decisions you will need to make to successfully implement your own solutions on Cosmos DB.
Introducing 3 FREE Smart solutions for SQL Server (Adi Sapir, Docco Labs)
As Database experts, we work with SQL Server Databases on a daily basis. We face the same problems every SQL Administrator and/or developer does. And – we spend our time writing solutions for these problems! In this session Adi will introduce the following 3, totally FREE solutions:
· ClipTable – A revolutionary new *anything* to SQL Table importer
· Database File Explorer – a much easier way to explore our database->filegroups->files->storage mapping
· Log Table Viewer – a complete client/server logger solution for SQL Server
What is Change Data Capture (CDC) and Why is it Important?FlyData Inc.
Check out what Change Data Capture (CDC) is and why it is becoming ever more important. Slides also include useful tips on how to design your CDC implementation.
Oracle GoldenGate for MySQL provides the real-time data replication to capture and deliver data for MySQL databases.This is an overview of the product.
This session will be more of an introduction to Amazon DynamoDB with the background of what actually a NoSQL DB is. We will get to know the other available options for NoSQL DB, followed by a demo.
Scylla Summit 2022: How ScyllaDB Powers This Next Tech CycleScyllaDB
Applications have never been so data-hungry, nor as demanding for scale, speed and availability. Hear from CEO Dor Laor as he shares how ScyllaDB is powering this next tech cycle.
To watch all of the recordings hosted during Scylla Summit 2022 visit our website here: https://www.scylladb.com/summit.
In 2018's user conference keynote MariaDB CEO, Michael Howard, announced an initiative to build a MariaDB DBaaS platform. In this session, the DBaaS team shares how MariaDB is approaching DBaaS, then discusses the role of containers and Kubernetes, the need for infrastructure-agnostic provisioning, support for day-two operations and enterprise requirements for large-scale DBaaS deployments.
Running Scylla on Kubernetes with Scylla OperatorScyllaDB
Kubernetes has quickly become the ad-hoc standard for managing software deployments. The Scylla team recently released the beta version of the Scylla Operator for Kubernetes.
Join us to learn how Kubernetes can be used to automate the deployment, scaling, and various operations of Scylla NoSQL. In this webinar we will:
- Present the mapping between Scylla and Kubernetes entities, and explain their reasoning
- Give insight into the Operator Open Source project, and how you can get involved
- Explain and demo common procedures with Scylla Operator
- Discuss the challenges of running a high performance, persistent application on Kubernetes, and the trade offs we considered
- Share our plans for Scylla on Kubernetes
ClustrixDB: how distributed databases scale outMariaDB plc
ClustrixDB, now part of MariaDB, is a fully distributed and transactional RDBMS for applications with the highest scalability requirements. In this session Robbie Mihalyi, VP of Engineering for ClustrixDB, provides an introduction to ClustrixDB, followed by an in-depth technical overview of its architecture, with a focus on distributed storage, transactions and query processing – and its unique approach to index partitioning.
This DataStage internet Training will furnish you with the capability expected to work with the IBM DataStage. DataStage is an ETL device that uses a graphical documentation for the combination of information. This is the lead result of IBM in Business Intell
Maximizing Database Tuning in SAP SQL AnywhereSAP Technology
This session illustrates the different tools available in SQL Anywhere to analyze performance issues, as well as describes the most common types of performance problems encountered by database developers and administrators. We also take a look at various tips and techniques that will help boost the performance of your SQL Anywhere database.
What is Change Data Capture (CDC) and Why is it Important?FlyData Inc.
Check out what Change Data Capture (CDC) is and why it is becoming ever more important. Slides also include useful tips on how to design your CDC implementation.
Oracle GoldenGate for MySQL provides the real-time data replication to capture and deliver data for MySQL databases.This is an overview of the product.
This session will be more of an introduction to Amazon DynamoDB with the background of what actually a NoSQL DB is. We will get to know the other available options for NoSQL DB, followed by a demo.
Scylla Summit 2022: How ScyllaDB Powers This Next Tech CycleScyllaDB
Applications have never been so data-hungry, nor as demanding for scale, speed and availability. Hear from CEO Dor Laor as he shares how ScyllaDB is powering this next tech cycle.
To watch all of the recordings hosted during Scylla Summit 2022 visit our website here: https://www.scylladb.com/summit.
In 2018's user conference keynote MariaDB CEO, Michael Howard, announced an initiative to build a MariaDB DBaaS platform. In this session, the DBaaS team shares how MariaDB is approaching DBaaS, then discusses the role of containers and Kubernetes, the need for infrastructure-agnostic provisioning, support for day-two operations and enterprise requirements for large-scale DBaaS deployments.
Running Scylla on Kubernetes with Scylla OperatorScyllaDB
Kubernetes has quickly become the ad-hoc standard for managing software deployments. The Scylla team recently released the beta version of the Scylla Operator for Kubernetes.
Join us to learn how Kubernetes can be used to automate the deployment, scaling, and various operations of Scylla NoSQL. In this webinar we will:
- Present the mapping between Scylla and Kubernetes entities, and explain their reasoning
- Give insight into the Operator Open Source project, and how you can get involved
- Explain and demo common procedures with Scylla Operator
- Discuss the challenges of running a high performance, persistent application on Kubernetes, and the trade offs we considered
- Share our plans for Scylla on Kubernetes
ClustrixDB: how distributed databases scale outMariaDB plc
ClustrixDB, now part of MariaDB, is a fully distributed and transactional RDBMS for applications with the highest scalability requirements. In this session Robbie Mihalyi, VP of Engineering for ClustrixDB, provides an introduction to ClustrixDB, followed by an in-depth technical overview of its architecture, with a focus on distributed storage, transactions and query processing – and its unique approach to index partitioning.
This DataStage internet Training will furnish you with the capability expected to work with the IBM DataStage. DataStage is an ETL device that uses a graphical documentation for the combination of information. This is the lead result of IBM in Business Intell
Maximizing Database Tuning in SAP SQL AnywhereSAP Technology
This session illustrates the different tools available in SQL Anywhere to analyze performance issues, as well as describes the most common types of performance problems encountered by database developers and administrators. We also take a look at various tips and techniques that will help boost the performance of your SQL Anywhere database.
Storage Optimization and Operational Simplicity in SAP Adaptive Server Enter...SAP Technology
This presentation will discuss the key storage optimization and operational simplicity features available in SAP ASE and introduce enhancements such as heat map providing the capability to move data to high/low performing storage devices based on access patterns.
Self-serve analytics journey at Celtra: Snowflake, Spark, and DatabricksGrega Kespret
Celtra provides a platform for streamlined ad creation and campaign management used by customers including Porsche, Taco Bell, and Fox to create, track, and analyze their digital display advertising. Celtra’s platform processes billions of ad events daily to give analysts fast and easy access to reports and ad hoc analytics. Celtra’s Grega Kešpret leads a technical dive into Celtra’s data-pipeline challenges and explains how it solved them by combining Snowflake’s cloud data warehouse with Spark to get the best of both.
Topics include:
- Why Celtra changed its pipeline, materializing session representations to eliminate the need to rerun its pipeline
- How and why it decided to use Snowflake rather than an alternative data warehouse or a home-grown custom solution
- How Snowflake complemented the existing Spark environment with the ability to store and analyze deeply nested data with full consistency
- How Snowflake + Spark enables production and ad hoc analytics on a single repository of data
SQL Server 2014 for Developers (Cristian Lefter)ITCamp
There’s a special Microsoft product that has come a long way since its first OS/2 version. Actually this year, the 12th version of the product marks 25 years from the start, 25 years of Microsoft SQL Server.
Over the years, we have seen many amazing and innovative technologies such as SQL CLR, Service Broker, Resource Governor or recently the cloud-based version of SQL Server. And each time we concluded that we have seen it all, another version came to surprise us. SQL Server 2014 is no exception and I guarantee that some of its features will know your socks off!
I will say just “In-Memory OLTP” and “Updatable Clustered Columnstore Indexes” and it should be enough. But there’s more than that! Come to this session to find out for yourself!
Columnstore indexes were first introduced in SQL Server 2012. Columnstore indexes provide the ability to flip-over your indexes from column-based to row-based, leading to dramatic changes in the storage and performance of your data. In this session, we'll review the basic concepts of columnstore indexes and look at the changes coming to them in SQL Server 2014
Maintenance Plans for Beginners (but not only) | Each of experienced administrators used (to some extent) what is called Maintenance Plans - Plans of Conservation. During this session, I'd like to discuss what can be useful for us to provide functionality when we use them and what to look out for. Session at 200 times the forward-300, with the opening of the discussion.
Les index columnstore sont apparus avec SQL Server 2012 et bon nombre de limitations ou d'améliorations ont vu le jour avec SQL Server 2014 et bientôt SQL Server 2016. Il en va de même pour les tables In-Memory à partir de SQL Server 2014. Découvrez lors de cette session comment SQL Server 2016 répond aux besoins d'analyse opérationnelle en temps réel en introduisant et en mixant ces 2 technologies In-Memory
This presentation about Data Warehouse modernization and extending it to the modern data platform by adding Big Data solution using EMR and Spark and streaming data with Kinesis Firehose. In addition, it will cover the use case of complimentory data lake for data warehouse. Moreover, this presentation include ETL tool selection process and ML consideration.
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...Denodo
Presentation slides taken from Fast Data Strategy Roadshow San Francisco Bay Area.
For more Denodo 6-0 demos, please follow this link:https://goo.gl/XkxJjX
Similar to Columnstore improvements in SQL Server 2016 (20)
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on the notifications, alerts, and approval requests using Slack for Bonterra Impact Management. The solutions covered in this webinar can also be deployed for Microsoft Teams.
Interested in deploying notification automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
Generating a custom Ruby SDK for your web service or Rails API using Smithyg2nightmarescribd
Have you ever wanted a Ruby client API to communicate with your web service? Smithy is a protocol-agnostic language for defining services and SDKs. Smithy Ruby is an implementation of Smithy that generates a Ruby SDK using a Smithy model. In this talk, we will explore Smithy and Smithy Ruby to learn how to generate custom feature-rich SDKs that can communicate with any web service, such as a Rails JSON API.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
3. Explore Everything PASS Has to Offer
FREE ONLINE WEBINAR EVENTS FREE 1-DAY LOCAL TRAINING EVENTS
LOCAL USER GROUPS
AROUND THE WORLD
ONLINE SPECIAL INTEREST
USER GROUPS
BUSINESS ANALYTICS TRAINING
VOLUNTEERING OPPORTUNITIES
PASS COMMUNITY NEWSLETTER
BA INSIGHTS NEWSLETTERFREE ONLINE RESOURCES
4. Session Evaluations
ways to access
Go to passSummit.com Download the GuideBook App
and search: PASS Summit 2016
Follow the QR code link displayed
on session signage throughout the
conference venue and in the
program guide
Submit by 5pm
Friday November 6th to
WIN prizes
Your feedback is
important and valuable. 3
5. Niko Neugebauer
Consultant, OH22
• Microsoft Data Platform Professional
• OH22 (http://www.oh22.net)
• Data Platform MVP
• Founder of a couple of Portuguese PASS Chapters
• Creator of CISL – Free & Open Source
Columnstore Indexes Script Library
(https://github.com/NikoNeugebauer/CISL)
Place your
photo here
/WebCaravela @NikoNeugebauer www.nikoport.com
6. Our Today’s Columnstore Agenda p. 1
• HTAP aka Operational Analytics
• Data Warehousing (CCI)
• T-SQL
• High Availability
• Batch Mode Changes
• Data Loading (+Minimal Logging)
• Performance (Batch, SSIS)
10. What is it all about:
• Predictive Analytics, Prescriptive Analytics, Business
Analytics, Machine Learning, Data Mining
But technically it all goes down to the mining
one thing we treasure quite a lot: the Data
Analytics
11. Analytics – is the process of discovery & communication
of meaningful patterns in data.
Reporting - extraction of the aggregated information for
further analysis.
Querying - data extraction process
Analytics Faces
13. • Costs
• Integration Problems (data types, constraints, network
problems, etc)
• Delay for getting the actual data
Traditional Operational Analytics
Problems:
14. Nothing substitutes analytics queries performance,
possible using schemas customized (Star/Snowflake) and/or
pre-aggregated cubes
Modern Operational Analytics notes:
15. Minimizing data latency for analytics
Benefits
No data latency
No ETL
No separate data warehouse
Challenges
Analytics queries are resource intensive and
can cause blocking
Minimizing impact on operational workloads
Sub-optimal execution of analytics on
relational schema
IIS Server
BI analysts
16. Optimizing data latency for analytics
BI analysts
What is operational analytics and what does it
mean to you?
Operational analytics with disk-based tables
Operational analytics with In-Memory OLTP
IIS Server
17. There are now 2 types of Operational Analytics:
Operational Analytics for Disk-Based
Operational Analytics for InMemory
Operational Analytics: SQL Server 2016
29. The Problem:
When we move a row from Delta-Store into the
compressed Row Group, it becomes compressed.
Once it is updated, the row is marked as deleted in
the Deleted Bitmap/Deleted Buffer and the new
value is inserted in Delta-Store.
30. The Solution is the Compression Delay
Default delay is 0 min
alter index PK_NCCi_test_inline
on dbo.ncci_test_inline
set (COMPRESSION_DELAY = 60 Minutes);
34. New T-SQL
• Inline Table definition of Clustered Columnstore
Index
• Inline Table definition of Nonclustered
Columnstore Index
• In-Memory OLTP & Columnstore Indexes
35. New T-SQL for Disk-Based Columnstore
CREATE TABLE [dbo].[FactOnlineSales_CCI](
[OnlineSalesKey] [int] NOT NULL,
[StoreKey] [int] NOT NULL,
[ProductKey] [int] NOT NULL,
[PromotionKey] [int] NOT NULL,
[CurrencyKey] [int] NOT NULL,
[CustomerKey] [int] NOT NULL,
INDEX PK_FactOnlineSales_CCI CLUSTERED COLUMNSTORE
);
CREATE TABLE [dbo].[FactOnlineSales_NCCI](
[OnlineSalesKey] [int] NOT NULL,
[StoreKey] [int] NOT NULL,
[ProductKey] [int] NOT NULL,
[PromotionKey] [int] NOT NULL,
[CurrencyKey] [int] NOT NULL,
[CustomerKey] [int] NOT NULL,
INDEX PK_FactOnlineSales_CCI NONCLUSTERED COLUMNSTORE (OnlineSalesKey, CustomerKey)
);
36. New T-SQL for In-Memory Columnstore
CREATE TABLE [dbo].[FactOnlineSales_Hekaton](
[OnlineSalesKey] [int] NOT NULL,
[StoreKey] [int] NOT NULL,
[ProductKey] [int] NOT NULL,
[PromotionKey] [int] NOT NULL,
[CurrencyKey] [int] NOT NULL,
[CustomerKey] [int] NOT NULL,
Constraint PK_FactOnlineSales_Hekaton PRIMARY KEY NONCLUSTERED ([OnlineSalesKey]),
INDEX CCI_FactOnlineSales_Hekaton
CLUSTERED COLUMNSTORE
) WITH (MEMORY_OPTIMIZED = ON, DURABILITY = SCHEMA_AND_DATA);
ALTER TABLE dbo. FactOnlineSales_Hekaton
ADD INDEX CCI_FactOnlineSales_Hekaton
CLUSTERED COLUMNSTORE;
40. Transactional Replication
We can now control, how the HTAP
Columnstore Indexes are being replicated –
if they should be created on the Subscriber or not.
43. Data Loading Improvements
• Parallel Data Loading finally enabled
• SIMD support
• Delta-Stores are not Page-Compressed!!!
• Under NCCI Delta-Stores are automatically increasing their max size with
each iteration under pressure
• Compression Delay
• SSIS
• Minimal Logging
45. SIMD Improvements
Columnstore is SIMD-aware and uses it for
different purposes, such as:
- Huffman decoding
- Predicate Pushdown
- String Hash
46. Delta-Stores for NCCI
- Under memory pressure the size of the created
Delta-Stores for the Nonclustered Columnstore
Indexes will be doubled for each succeeding
operation from 1048576 rows to ~2 Million
Rows, ~4 Million Rows, ... 32 Million Rows
47. The solution: Compression Delay
Default delay is 0 min
alter index PK_NCCi_test_inline
on dbo.ncci_test_inline
set (COMPRESSION_DELAY = 60 Minutes);
48. SSIS 2016 Improvements
AutoAdjustBufferSize - True
Warning: It’s not enough, you will need to set up the
maximum number of rows per buffer, otherwise
your performance will be extremely slow!
DefaultBufferMaxRows = 1048576
54. Batch Mode Improvements
• Batch Mode support for 1 core execution plan operators
• Batch Mode support for the Sort operator
• Batch Mode support for the Multiple Distinct Count
operations
• Batch Mode support for the Left Anti-Semi Join operators
55. Batch Sorting Improvements
Trace Flag Description
9347 Disables batch mode sort operator
9349 Disables batch mode top sort operator
9358
Disable batch mode sort operations in a complex parallel query in SQL
Server 2016
59. Further Batch Improvements
• Simple Aggregate Predicate Pushdown
• Local Aggregation
• String Predicate Pushdown for Index Scan operator in
Batch Mode
63. Change Tracking / CDC in SQL 2016
Columnstore Change Tracking Change Data Capture Temporal (2016+)
Clustered No No Yes
Nonclustered Yes Yes Yes
Memory-Optimized No No Yes
64. Maintenance Improvements
• Better ALTER INDEX … REORGANIZE
(removes deleted rows, less memory pressure)
• Self-Merging (this is when Row Group is being recompressed
eliminating obsolete row versions that are found in Deleted
Bitmap)
• Intergroup Merging (when the sum of active rows between
multiple Row Groups is less then 1048576, then their
information can be merged into a new single Row Group)
65. Maintenance Improvements
• Being a compressed Row Group (Delta-Stores & Tail
Row Groups are not qualified)
• A Row Group has to have at least 10% of its data being
marked as obsolete in Deleted Bitmap (for In-Memory
this threshold is set on 90%)
• – A Row Group should not being trimmed because of
the dictionary pressure.
66. Row Group Merging in SQL Server 2016
Columnstore Self-Merge Intergroup Merge Cleanup
Clustered Yes Yes Yes
Nonclustered Yes Yes*1 Yes
Memory-Optimized Yes*2 No Yes
71. Configuration Improvement
Large Pages (Trace Flag 834)
should be supported in SQL Server 2016,
or at least the Connect Item for it is closed as fixed.
Take them to DEV & QA Environments and let Microsoft
know the results
72. New Trace Flags
Trace Flag Description
9347 Disables batch mode sort operator
9349 Disables batch mode top sort operator
9358
Disable batch mode sort operations in a complex parallel query in SQL
Server 2016
9389 Enables dynamic memory grant for batch mode operators
10204 Disables merge/recompress during columnstore index reorganization
73. Indexed Views
SQL Server 2016 supports Indexed Views with updatable
Nonclustered Columnstore Indexes!
74. Resources:
My Columnstore Blogpost Series:
http://www.nikoport.com/columnstore
CISL – Open Source Columnstore Library:
https://github.com/NikoNeugebauer/CISL
75. Session Evaluations
ways to access
Go to passSummit.com Download the GuideBook App
and search: PASS Summit 2016
Follow the QR code link displayed
on session signage throughout the
conference venue and in the
program guide
Submit by 5pm
Friday November 6th to
WIN prizes
Your feedback is
important and valuable. 3
76. Thank You
Learn more from
Niko Neugebauer
n.neugebauer@oh22.net
or follow @NikoNeugebauer