Previously known as
Think Big. Move Fast.
SolidQ
• Founded in 2002 in the USA and Spain
• Established in 2007 in Italy
• More than 1000 customers and more than 200 consultants worldwide
• Dedicated to Data Management on the Microsoft Platform
• Book Authors, Conference Speakers, SQL Server MVPs and Regional Directors
• www.solidq.com
Davide Mauri
• 18 Years of experience on the SQL Server Platform
• Specialized in Data Solution Architecture, Database Design, Performance Tuning, Business Intelligence
• Microsoft SQL Server MVP
• President of UGISS (Italian SQL Server UG)
• Mentor @ SolidQ
• Video, Book & Article Author
• Regular Speaker @ SQL Server events
• Projects, Consulting, Mentoring & Training
Data Analysis
Data Analysis
• Enterprise
  • SQL Server Analysis Services
    • Multidimensional
    • Tabular
• Self-Service
  • Power Pivot
Data Analysis
• Multidimensional -> MDX
• Strictly tied to Kimball concepts: Fact, Measures, Dimensions
• Mature Product
• Provides optimal performance in most cases
• Medium / High Complexity
  • Even for simple situations
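The Fact / Measure / Dimension vocabulary above can be illustrated with a minimal Python sketch. It is not how Analysis Services stores or queries a cube; the table `fact_sales`, its column names, and the helper `aggregate` are hypothetical and exist only to show a measure being summed along chosen dimension attributes.

```python
from collections import defaultdict

# Hypothetical fact table: each row holds dimension attributes plus one numeric measure.
fact_sales = [
    {"year": 2013, "country": "Italy", "amount": 100.0},
    {"year": 2013, "country": "Spain", "amount": 50.0},
    {"year": 2014, "country": "Italy", "amount": 70.0},
    {"year": 2014, "country": "Italy", "amount": 30.0},
]


def aggregate(facts, dimensions, measure):
    """Sum `measure` for every combination of the given dimension attributes."""
    cells = defaultdict(float)
    for row in facts:
        key = tuple(row[d] for d in dimensions)  # one cell per dimension combination
        cells[key] += row[measure]
    return dict(cells)


# The same measure sliced along different dimensions.
by_country = aggregate(fact_sales, ["country"], "amount")
by_year_country = aggregate(fact_sales, ["year", "country"], "amount")
```

A multidimensional engine can precompute and store aggregations like these, which is one reason it tends to perform well; the sketch simply recomputes them on demand.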
Analysis Services Multidimensional
Data Analysis
• Tabular / Power Pivot -> DAX
• Based on the idea of Tables and Relationships
• Based on the concept of “Contexts”
  • Row & Filter Context
• In-Memory Engine
  • Column Store
• Visually very similar to Excel
• Simple for simple things. Can become very complex for medium/complex things.
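The difference between row context and filter context can be sketched in plain Python as a loose analogy. This is not DAX and not how the in-memory engine works; the `sales` table, the `line_total` column, and the `total_sales` function are hypothetical names chosen for illustration.

```python
# Hypothetical table, stored as a list of rows for readability.
sales = [
    {"product": "Bike", "color": "Red", "quantity": 2, "unit_price": 100.0},
    {"product": "Bike", "color": "Blue", "quantity": 1, "unit_price": 120.0},
    {"product": "Helmet", "color": "Red", "quantity": 3, "unit_price": 20.0},
]

# Row context: the expression is evaluated once per row and sees only that row,
# much like a calculated column.
for row in sales:
    row["line_total"] = row["quantity"] * row["unit_price"]


def total_sales(table, **filters):
    """Filter context: aggregate only the rows that survive the active filters,
    much like a measure evaluated under slicers or pivot selections."""
    visible = [r for r in table if all(r[col] == val for col, val in filters.items())]
    return sum(r["line_total"] for r in visible)


all_sales = total_sales(sales)               # no filters: every row is visible
red_sales = total_sales(sales, color="Red")  # same formula, narrower filter context
```

The same `total_sales` definition returns different numbers depending on which filters are active. That is the part that tends to become complex once filters start interacting across related tables.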
Power Pivot & Analysis Services Tabular
Data Analysis
• Data Mining features included in the Multidimensional Engine
• Classification algorithms
  • predict one or more discrete variables, based on the other attributes in the dataset.
• Regression algorithms
  • predict one or more continuous variables, such as profit or loss, based on other attributes in the dataset.
• Segmentation algorithms
  • divide data into groups, or clusters, of items that have similar properties.
• Association algorithms
  • find correlations between different attributes in a dataset.
• Sequence analysis algorithms
  • summarize frequent sequences or episodes in data, such as a Web path flow.
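As one concrete illustration of the association family above, the sketch below computes the two classic measures for a rule such as "bread implies milk": support and confidence. It is a generic textbook calculation, not the Microsoft Association Algorithm itself; the `baskets` data and the function names are hypothetical.

```python
# Hypothetical transactions: each basket is the set of items bought together.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Of the transactions containing `antecedent`, the fraction that also contain `consequent`."""
    return support(antecedent | consequent, transactions) / support(antecedent, transactions)


bread_milk_support = support({"bread", "milk"}, baskets)
bread_to_milk_confidence = confidence({"bread"}, {"milk"}, baskets)
```

Here two of the four baskets contain both bread and milk, and two of the three baskets with bread also contain milk. An association algorithm searches for rules whose support and confidence exceed chosen thresholds.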
Data Analysis
• Microsoft Association Algorithm
• Microsoft Clustering Algorithm
• Microsoft Decision Trees Algorithm
• Microsoft Linear Regression Algorithm
• Microsoft Logistic Regression Algorithm
• Microsoft Naive Bayes Algorithm
• Microsoft Neural Network Algorithm
• Microsoft Sequence Clustering Algorithm
• Microsoft Time Series Algorithm
• Plugin Algorithms
Data Analysis
• Data Mining Language: DMX
• Data Mining also available through an Excel add-in
Data Mining
