• Save
MS SQL SERVER: Microsoft naive bayes algorithm
Upcoming SlideShare
Loading in...5
×

Like this? Share it with your network

Share

MS SQL SERVER: Microsoft naive bayes algorithm

  • 3,337 views
Uploaded on

MS SQL SERVER: Microsoft naive bayes algorithm

MS SQL SERVER: Microsoft naive bayes algorithm

More in: Technology
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads

Views

Total Views
3,337
On Slideshare
3,329
From Embeds
8
Number of Embeds
2

Actions

Shares
Downloads
0
Comments
0
Likes
0

Embeds 8

http://www.dataminingtools.net 6
http://dataminingtools.net 2

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. Microsoft Naive Bayes Algorithm
  • 2. overview
    Naive Bayes Algorithm
    DMX Queries
    Exploring a Naive Bayes Model
    Naive Bayes Principles
    Naive Bayes Parameters
  • 3. Naive Bayes Algorithm
    The Microsoft Naive Bayes algorithm is a classification algorithm provided by Microsoft SQL Server Analysis Services for use in predictive modeling.
    The name Naive Bayes derives from the fact that the algorithm uses Bayes theorem but does not take into account dependencies that may exist, and therefore its assumptions are said to be naive.
  • 4. How to use the Naive Bayes algorithm in SQL server?
    This algorithm is less computationally intense than other Microsoft algorithms
    It is therefore is useful for quickly generating mining models to discover relationships between input columns and predictable columns.
    The algorithm considers each pair of input attribute values and output attribute values.
    Exploring a Naive Bayes model will tell you how your attributes are related to each other.
  • 5. DMX
    When you create a query against a data mining model
    you can create either a content query, which provides details about the patterns discovered in analysis, or you can create a prediction query, which uses the patterns in the model to make predictions for new data
    You can also retrieve metadata about the model by using a query against the data mining schema rowset.
  • 6. DMX Queries
    SELECT MODEL_CATALOG, MODEL_NAME, DATE_CREATED, LAST_PROCESSED, SERVICE_NAME, PREDICTION_ENTITY, FILTER
    FROM
    $system.DMSCHEMA_MINING_MODELS WHERE MODEL_NAME = 'TM_NaiveBayes_Filtered‘
    Getting Model Metadata by Using DMX
    you can find metadata for the model, by querying the data mining schema rowset.
    This might include when the model was created, when the model was last processed, the name of the mining structure that the model is based on, and the name of the columns used as the predictable attribute.
  • 7. DMX Queries
    Retrieving a Summary of Training Data
    Query to retrieve the data from the node specified.
    Because the statistics are stored in a nested table, the FLATTENED keyword is used to make the results easier to view.
    SELECT FLATTENED MODEL_NAME,
    (SELECT ATTRIBUTE_NAME,
    ATTRIBUTE_VALUE, [SUPPORT],
    [PROBABILITY], VALUETYPE
    FROM
    NODE_DISTRIBUTION) AS t FROM
    TM_NaiveBayes.CONTENT WHERE
    NODE_TYPE = 26
  • 8. DMX Queries
    Finding More Information about Attributes
    Example to show how to return information from the model about a particular attribute( here ”Region”)
    The Result of this query is shown in the next slide.
    SELECT NODE_TYPE, NODE_CAPTION, MSOLAP_NODE_SCORE FROM TM_NaiveBayes.CONTENT WHERE ATTRIBUTE_NAME = 'Region'
  • 9. DMX Queries
    Sample Resultto showing
    information from the model about a particular attribute  ”Region”
  • 10. DMX Queries
    SELECT NODE_CAPTION, MSOLAP_NODE_SCORE
    FROM
    TM_NaiveBayes.CONTENT WHERE NODE_TYPE = 10 ORDER BY MSOLAP_NODE_SCORE DESC
    Query returns the
    importance scores of
    all attributes in the
    Model.
    The Result of this query is shown in the next slide.
  • 11. DMX Queries
    query returns the
    importance scores of
    all attributes in the
    Model.
  • 12. Exploring a Naive Bayes Model
    The convenient way to start analyzing a new data set is to create a Naive Bayes model and mark all the non-key columns as both input and predictive.
    The content of each model is presented as a series of nodes.
    A node is an object within a mining model that contains metadata and information about a portion of the model.
    Nodes are arranged in a hierarchy. 
  • 13. Naive Bayes Model Content
  • 14. Exploring a Naive Bayes Model
    The Naive Bayes viewer is accessed through either the BI Development Studio or SQL Management Studio by right-clicking on the model and selecting Browse.
    SQL Server Data Mining provides four different views on Naive Bayes models :
    • Dependency Network:
    Provides a quick display of how all of the attributes in your model are related.
    Each node in the graph represents an attribute, whereas each edge represents a relationship.
    outgoing edge (it is predictive of the attribute in the node at the end of the edge)
    Incoming edge( it is predicted by the other node)
  • 15. Exploring a Naive Bayes Model
    • Attribute Profiles:
    provides you with an exhaustive report of how each input attribute corresponds to each output attribute, one attribute at a time.
    At the top of the Attribute Profiles view, you select which output you want to look at, and the rest of the view shows how all of the input attributes are correlated to the states of the selected output attribute.
  • 16. Exploring a Naive Bayes Model
    • Attribute Characteristics:
    This tab allows you to select an output attribute and value and shows you a description of the cases where that attribute and value occur.
    • Attribute Discrimination:
    Provides the answers to the most interesting question:
    What is the difference between X and Y?
    With this viewer, you choose the attribute you are interested in, and select the states you want to compare.
  • 17. Naive Bayes Principles
    Bayes mathematical methods use a combination of conditional and unconditional probabilities.
    The Naive part of Naive Bayes tells you to treat all of your input attributes as independent of each other with respect to the target variable.
    This may be a faulty assumption, but it allows you to multiply your probabilities to determine the likelihood of each state.
  • 18. Naive Bayes Principles
    The Bayes rule states that if you have a hypothesis Hand evidence about that hypothesis E, then the probability of H is calculated using the following formula:
    P(H | E) = P(E | H) × P(H)
    P(E)
    This simply states that the probability of your hypothesis given the evidence is equal to the probability of the evidence given the hypothesis multiplied by the probability of the hypothesis, and then normalized.
  • 19. Naive Bayes Parameters
    MAXIMUM _INPUT _ATTRIBUTES determines the number of attributes that will be considered as inputs for training.
    If there is more than this number of inputs, the algorithm will select the most important inputs and ignore the rest.
    Setting this parameter to 0 causes the algorithm to consider all attributes.
    The default value is 255.
    MAXIMUM _OUTPUT _ATTRIBUTES determines the number of attributes that will be considered as outputs for training.
    If there is more than this number of outputs, the algorithm will select the most important outputs and ignore the rest.
    Setting this parameter to 0 causes the algorithm to consider all attributes.
    The default value is 255.
  • 20. Naive Bayes Parameters
    MAXIMUM _STATES controls how many states of an attribute are considered. If an attribute has more than this number of states, only the most popular states will be used.
    States that are not selected will be considered to be missing data. This parameter is useful when an attribute has a high cardinality
  • 21. Summary
    Naive Bayes Algorithm
    DMX Queries
    Naive Bayes Model Content
    Exploring a Naive Bayes Model
    Naive Bayes Principles
    Naive Bayes Parameters
  • 22. Visit more self help tutorials
    Pick a tutorial of your choice and browse through it at your own pace.
    The tutorials section is free, self-guiding and will not involve any additional support.
    Visit us at www.dataminingtools.net