SQL Server Statistics

Crafted for the love ♥ of SQL Server
http://www.sqlserverapp.com/
Dedicated to the DBA community

Distribution of data
When a table holds millions of rows, how the data is distributed really matters (to the database!).
For example, it is useful to know that 35% of the employees in an Employee table work from France,
28% from Germany, and so on.
Distribution of data really matters!

Why does it matter so much?
So why all this obsession with data distribution?!
Because only then can SQL Server optimize query execution.
Before it runs a query, SQL Server needs to ‘estimate’ how much data is being fetched.
For example, ‘Is the query fetching about 10% of the data from the table?’
Or ‘Is it getting almost 90% of the data in the table?’
Based on this estimate, the SQL Server Query Optimizer makes several choices about how to
execute the query.
For example, it decides whether or not to use a specific index during the execution.
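
To make the idea concrete, here is a minimal Python sketch of estimating what fraction of a table a filter would return and picking an access path from that estimate. It is only an illustration, not SQL Server's actual cost model: the function names, the 10% cutoff, and every country share other than France (35%) and Germany (28%) are assumptions made up for this example.

```python
from collections import Counter

def estimate_selectivity(values, wanted):
    """Fraction of rows whose value equals `wanted`."""
    counts = Counter(values)
    return counts[wanted] / len(values)

def choose_access_path(selectivity, seek_cutoff=0.10):
    """Pick a plan shape from the estimate (the cutoff is a made-up number)."""
    return "index seek" if selectivity <= seek_cutoff else "table scan"

# Hypothetical Employee.Country column with 100 rows.
countries = ["France"] * 35 + ["Germany"] * 28 + ["Spain"] * 32 + ["Malta"] * 5

for country in ("France", "Malta"):
    fraction = estimate_selectivity(countries, country)
    print(country, fraction, choose_access_path(fraction))
```

A filter on France touches about a third of the rows, so reading the whole table is likely cheaper; a filter on Malta touches only a few rows, so an index pays off.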

Introducing Statistics

Statistics … a crucial SQL technique
SQL Server uses a really cool way to track data distribution.
And that’s what we call Statistics.

Catalog view for Statistics
sys.stats is our hero here!
Not that you are going to look into its data much.
But it is worth getting the hang of it when you find some time, or during your coffee break!
https://docs.microsoft.com/en-us/sql/relational-databases/system-catalog-views/sys-stats-transact-sql

Statistics objects
Each Statistics object is created on one or more columns of a table or indexed view in SQL Server.
For each Statistics object, SQL Server maintains a histogram depicting the distribution of values
in the first column of the Statistics object.
Tidbit: What is a Histogram?
A histogram groups data into ranges and shows how much data falls in each range.
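
The histogram idea can be sketched in a few lines of Python. This is a simplified, fixed-width version for illustration only; SQL Server builds its histogram steps differently. The `ages` data, the bucket width of 10, and both function names are assumptions invented for this example.

```python
from collections import Counter

def build_histogram(values, bucket_width):
    """Count how many values fall in each [start, start + bucket_width) range."""
    counts = Counter((v // bucket_width) * bucket_width for v in values)
    return dict(sorted(counts.items()))

def estimate_rows_between(histogram, bucket_width, low, high):
    """Estimate rows with low <= value < high, assuming values are
    spread evenly inside each bucket."""
    estimate = 0.0
    for start, count in histogram.items():
        end = start + bucket_width
        overlap = max(0, min(end, high) - max(start, low))
        estimate += count * overlap / bucket_width
    return estimate

# Hypothetical Employee.Age column.
ages = [21, 25, 29, 34, 38, 41, 42, 47, 55, 58, 63, 71]
hist = build_histogram(ages, 10)

print(hist)                                      # bucket start -> row count
print(estimate_rows_between(hist, 10, 30, 50))   # whole buckets: exact
print(estimate_rows_between(hist, 10, 25, 35))   # partial buckets: approximate
```

When a range lines up with bucket boundaries the estimate is exact; when it cuts through a bucket it is only an approximation (here 2.5 rows estimated against 3 actual rows for ages 25 to 34), which is the trade a histogram makes for being small.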

When are they created?
When the automatic creation/updating of Statistics is enabled in SQL Server, SQL Server makes a judgment
call on whether it is really necessary to update the Statistics before a query is run.
It makes this call based on how much the data in the related tables has changed since the last Statistics
update.

While the concept of Statistics is really cool, keep in mind that it has a performance cost.
Maintaining the distribution of data for every table and indexed view is not trivial.
Keeping the Statistics up to date is important, but the question is how up to date they need to be.
Updating Statistics on, say, every insert/delete in a table might be way too costly. However, updating
too rarely can also turn out badly, because queries will then be executed (i.e. query plans will be
chosen by the Query Optimizer) using stale values in the Statistics.
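
One way to picture this trade-off is a change counter: count modifications since the last Statistics update and refresh only once they pass a threshold that grows with the table. The Python sketch below shows that idea under stated assumptions. The class name, its methods, and the constants (500 changes plus 20% of the rows) are illustrative and not a statement of SQL Server's exact rule, which varies by version and settings.

```python
from dataclasses import dataclass

@dataclass
class TableStats:
    """Hypothetical bookkeeping for one Statistics object."""
    rows_at_last_update: int
    modifications_since_update: int = 0

    def record_change(self, n=1):
        # Called for every insert, update, or delete on the tracked column.
        self.modifications_since_update += n

    def is_stale(self, base=500, fraction=0.20):
        # Illustrative threshold: a fixed base plus a share of the table size.
        threshold = base + fraction * self.rows_at_last_update
        return self.modifications_since_update > threshold

    def refresh(self, current_rows):
        # Rebuilding the histogram is the costly step, so it runs only when stale.
        self.rows_at_last_update = current_rows
        self.modifications_since_update = 0

stats = TableStats(rows_at_last_update=10_000)
stats.record_change(1_000)
print(stats.is_stale())   # 1,000 changes against a threshold of 2,500
stats.record_change(2_000)
print(stats.is_stale())   # 3,000 changes against a threshold of 2,500
```

A lower threshold keeps plans closer to the real data at the price of more frequent refreshes; a higher one saves work but lets estimates drift.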

How much control do you have over Statistics?

Can you create a Statistics object?
Yes!
But again, be wary of the performance implications of maintaining a Statistics object.

Updating Statistics at your will
This is possible too. You can run an update of the Statistics objects whenever you want.
You can use the stored procedure sp_updatestats.

Be wary that updating Statistics will lead to recompilation of your queries!

See you soon with another interesting SQL Server concept!
Until then … Happy DBA’ing!
Referenced from MSDN
iKosmik
Follow us to get notified on SQL Server concepts and tidbits
