RapidMinerNested Sub-processes
Cross ValidationCross validation is a method for estimating the true error of a model. When a model is built from training data, the error on the training data is a rather optimistic estimate of the error rates the model will achieve on unseen data.
Cross ValidationThe aim of building a model is usually to apply the model to new, unseen data--we expect the model to generalize to data other than the training data on which it was built. Thus, we would like to have some method for better approximating the error that might occur in general. Cross validation provides such a method.
Cross ValidationCross validation is also used to evaluate a model in deciding which algorithm to deploy for learning, when choosing from amongst a number of learning algorithms. It can also provide a guide as to the effect of parameter tuning in building a model from a specific algorithm.
Applying Naïve Bayes model to data sample
Validation
Nested Sub-processThe small icon on the lower right corner implies that the operator can be double clicked to view the nested sub-process.
Nested Sub-processDouble click on the operator to see the two sub-processes
Modeling the sub-processes
Results
More Questions?Reach us at support@dataminingtools.netVisit: www.dataminingtools.net

RapidMiner: Nested Subprocesses

  • 1.
  • 2.
    Cross ValidationCross validationis a method for estimating the true error of a model. When a model is built from training data, the error on the training data is a rather optimistic estimate of the error rates the model will achieve on unseen data.
  • 3.
    Cross ValidationThe aimof building a model is usually to apply the model to new, unseen data--we expect the model to generalize to data other than the training data on which it was built. Thus, we would like to have some method for better approximating the error that might occur in general. Cross validation provides such a method.
  • 4.
    Cross ValidationCross validationis also used to evaluate a model in deciding which algorithm to deploy for learning, when choosing from amongst a number of learning algorithms. It can also provide a guide as to the effect of parameter tuning in building a model from a specific algorithm.
  • 5.
    Applying Naïve Bayesmodel to data sample
  • 6.
  • 7.
    Nested Sub-processThe smallicon on the lower right corner implies that the operator can be double clicked to view the nested sub-process.
  • 8.
    Nested Sub-processDouble clickon the operator to see the two sub-processes
  • 9.
  • 10.
  • 11.
    More Questions?Reach usat support@dataminingtools.netVisit: www.dataminingtools.net