SlideShare a Scribd company logo
How Did I Get That Model?
      Session Logs in SPM




         Dan Steinberg
        Salford Systems
http://www.salford-systems.com

       December 2012
SPM Version Numbers

• Salford Systems has begun offering SPM 7.0 to all of our
  customers (as of November 30, 2012) and from this date
  forward our videos and instructional materials will make use
  of the SPM 7 application
• The SPM interface was designed to be as similar as
  possible to the SPM 6 series of software meaning that you
  can follow along for much of the presentation with that
  version (or with version 6 generations of the standalone
  engines
• We will clearly point out features, options, and controls that
  are unique to 7
• The current video and blog is largely relevant to SPM 6 and
  you will not require SPM 7 for most of its content
                    © Copyright Salford Systems 2012
How Did I Get That Model or Result?

• In the course of interactive data analysis modelers typically
  modify many model setup controls and options
• Many modelers will also alter the data they use
   – Construction of new variables during the session
   – Focusing on specific subsets of the data (segments)
   – Using SELECT statements or DELETE records via the built-in
     Salford BASIC programming language
• After a long session of changing this and that you might find
  that you have lost track of exactly what you have done
• You might not realize this until you save work and come
  back to again after a long weekend or after a hiatus of
  several weeks

                     © Copyright Salford Systems 2012
How SPM Can Help You

• SPM and its predecessor standalone data mining products
  CART, MARS, TreeNet, and RandomForests all maintain an
  audit trail of every session
• The audit trail is a set of commands that were generated
  either by you or by the GUI (Graphical User Interface) on
  your behalf as you conducted your session
• The audit trail does not record pure GUI actions you might
  have taken such as viewing the Variable Importance ranking
  or resizing a window
• The audit trail records all file open and save actions and
  commands that set up or run a model


                  © Copyright Salford Systems 2012
The Command Log

• For the log of your current session the simplest way to
  review it is to click on the command log icon on the toolbar
   – The “L” circled in red below




                      © Copyright Salford Systems 2012
Clicking on the “L” icon brings up a text file




          © Copyright Salford Systems 2012
Understanding the Command Log

•   Often you do not need to pay any attention to most of this file as it is
    devoted to setting up default options
•   Instead you will want to focus on essentials such as
     – USE the command connecting you to your data
     – MODEL the command identifying your target
     – KEEP the command listing the predictors you are using
     – Here are our commands related to the setting up of a regression tree
        using the BOSTON.CSV Boston housing data set




                         © Copyright Salford Systems 2012
Saving and Retrieving Command Logs

• You do not have to worry about saving the command logs as they
  are saved for you automatically
• However, you should make your own decision as to where your
  logs will be saved
• We recommend that you to to the EDIT menu and select Options




                    © Copyright Salford Systems 2012
Alternatively, select the check mark toolbar icon



                                             This brings up the same
                                             dialog as does the EDIT
                                             ..Options menu item

                                             Then select the Directories
                                             tab




          © Copyright Salford Systems 2012
Select a convenient location for Temporary Files




In addition to temporary work files we also permanently store command logs for
every session in this location.
Windows will default to a possibly awkward location so we advise changing it
                        © Copyright Salford Systems 2012
SPM 7 automatically navigates to the stored logs


                                  In version 6 applications you will need
                                  to navigate to this location manually if
                                  you wish to open one of the past
                                  session logs

                                  Select this item in SPM 7 to reveal
                                  the log archives as shown on the next
                                  screen




           © Copyright Salford Systems 2012
Directory Listing of Past Session Logs




These are all plain text files with a .TXT extension and have names beginning with
CTRXmmdd where mm=month and dd=date of creation. The remainder of the name is
randomly generated. Files of size 2KB are for sessions that just opened and closed SPM
                            © Copyright Salford Systems 2012
Session Logs Are Permanent

• We do not ever delete session logs but you might want to
  both selectively delete some sessions including the ones in
  which next to nothing was done
• You might also want to rename important logs so that you
  can tell what they are about
• Session logs are updated after each command is generated
  either by you directly from the command line or via the
  commands that the GUI generates for you
• Session logs are generally complete but may lack the very
  last command issued if the application of the Operating
  System subsequently crashed
• Session logs are critical for diagnosing problems as well as
  for determining exactly what you did to obtain a result
                   © Copyright Salford Systems 2012
Command Logs and Groves

• SPM stores all models in a special form we call a grove.
  Groves may optionally be saved and transferred from one
  machine to another including between Windows and
  Linux/UNIX platforms
• Groves store model information and the entire command log
  up to the point at which the model in question was
  generated
• If you build three consecutive models
   – The first model grove will contain the command log up to the point
     that the first model was built
   – The second model grove will contain the command log relevant to
     both the first and the second models
   – The third model grove will contain the command log for al three
     models built
                      © Copyright Salford Systems 2012
Command Log from GUI (Grove)




We can access all commands issued in the session up to the creation of this
model. These commands are saved in the grove file.
                        © Copyright Salford Systems 2012
GUI Grove Based Command Log Display:
                              TreeNet Model
• If a model’s main results are being displayed you will always
  see a “Commands” button towards the bottom of the display




“Commands” button is available for CART, MARS, GPS, and TreeNet models.
Reveals same information as command log up to the moment model was created
Commands available at any future time from the grove if it is saved
                      © Copyright Salford Systems 2012
Hints on Trouble Shooting

• Since the command logs contain literally every command
  issued either by you directly or by the GUI on your behalf it
  serves as a source of information for explaining unexpected
  results
• Some common causes unexpected results include
   – A SELECT command being active or inactive
   – BASIC commands deleting some records, setting certain predictors
     to missing, or altering some predictors
   – Analysis type being changed from Classification to Regression
   – An active LIMIT command preventing a CART tree from growing as
     large as expected or desired
   – Model setup controls altered such as CART growing method or
     number of nodes in TreeNet trees, etc.

                     © Copyright Salford Systems 2012

More Related Content

Viewers also liked

RandomForests for Biomedical Applications
RandomForests for Biomedical ApplicationsRandomForests for Biomedical Applications
RandomForests for Biomedical Applications
Salford Systems
 
Molecular data mining tool advances in hiv
Molecular data mining tool  advances in hivMolecular data mining tool  advances in hiv
Molecular data mining tool advances in hiv
Salford Systems
 
Comparison of statistical methods commonly used in predictive modeling
Comparison of statistical methods commonly used in predictive modelingComparison of statistical methods commonly used in predictive modeling
Comparison of statistical methods commonly used in predictive modeling
Salford Systems
 
Assessing the Asymmetric Information Associated with the Equity Market A CART...
Assessing the Asymmetric Information Associated with the Equity Market A CART...Assessing the Asymmetric Information Associated with the Equity Market A CART...
Assessing the Asymmetric Information Associated with the Equity Market A CART...
Salford Systems
 
Using CART to Unravel Clusters for the Testing of Interactions in Asthma Data...
Using CART to Unravel Clusters for the Testing of Interactions in Asthma Data...Using CART to Unravel Clusters for the Testing of Interactions in Asthma Data...
Using CART to Unravel Clusters for the Testing of Interactions in Asthma Data...
Salford Systems
 
A Study of RandomForests Learning Mechanism with Application to the Identific...
A Study of RandomForests Learning Mechanism with Application to the Identific...A Study of RandomForests Learning Mechanism with Application to the Identific...
A Study of RandomForests Learning Mechanism with Application to the Identific...
Salford Systems
 
A More Transparent Interpretation of Health Club Surveys (YMCA)
A More Transparent Interpretation of Health Club Surveys (YMCA)A More Transparent Interpretation of Health Club Surveys (YMCA)
A More Transparent Interpretation of Health Club Surveys (YMCA)
Salford Systems
 
TreeNet Tree Ensembles and CART Decision Trees: A Winning Combination
TreeNet Tree Ensembles and CART  Decision Trees:  A Winning CombinationTreeNet Tree Ensembles and CART  Decision Trees:  A Winning Combination
TreeNet Tree Ensembles and CART Decision Trees: A Winning Combination
Salford Systems
 
Hybrid cart logit model 1998
Hybrid cart logit model 1998Hybrid cart logit model 1998
Hybrid cart logit model 1998
Salford Systems
 
Evolution of regression ols to gps to mars
Evolution of regression   ols to gps to marsEvolution of regression   ols to gps to mars
Evolution of regression ols to gps to mars
Salford Systems
 

Viewers also liked (20)

RandomForests for Biomedical Applications
RandomForests for Biomedical ApplicationsRandomForests for Biomedical Applications
RandomForests for Biomedical Applications
 
Improved Predictions in Structure-Based Drug Design Using CART and Bayesian M...
Improved Predictions in Structure-Based Drug Design Using CART and Bayesian M...Improved Predictions in Structure-Based Drug Design Using CART and Bayesian M...
Improved Predictions in Structure-Based Drug Design Using CART and Bayesian M...
 
TreeNet Tree Ensembles & CART Decision Trees: A Winning Combination
TreeNet Tree Ensembles & CART Decision Trees:  A Winning CombinationTreeNet Tree Ensembles & CART Decision Trees:  A Winning Combination
TreeNet Tree Ensembles & CART Decision Trees: A Winning Combination
 
Molecular data mining tool advances in hiv
Molecular data mining tool  advances in hivMolecular data mining tool  advances in hiv
Molecular data mining tool advances in hiv
 
Comparison of statistical methods commonly used in predictive modeling
Comparison of statistical methods commonly used in predictive modelingComparison of statistical methods commonly used in predictive modeling
Comparison of statistical methods commonly used in predictive modeling
 
Introduction to TreeNet (2004)
Introduction to TreeNet (2004)Introduction to TreeNet (2004)
Introduction to TreeNet (2004)
 
SPM v7.0 Feature Matrix
SPM v7.0 Feature MatrixSPM v7.0 Feature Matrix
SPM v7.0 Feature Matrix
 
Statistically Significant Quotes To Remember
Statistically Significant Quotes To RememberStatistically Significant Quotes To Remember
Statistically Significant Quotes To Remember
 
Improved Predictions in Structure Based Drug Design Using Cart and Bayesian M...
Improved Predictions in Structure Based Drug Design Using Cart and Bayesian M...Improved Predictions in Structure Based Drug Design Using Cart and Bayesian M...
Improved Predictions in Structure Based Drug Design Using Cart and Bayesian M...
 
Assessing the Asymmetric Information Associated with the Equity Market A CART...
Assessing the Asymmetric Information Associated with the Equity Market A CART...Assessing the Asymmetric Information Associated with the Equity Market A CART...
Assessing the Asymmetric Information Associated with the Equity Market A CART...
 
Using CART to Unravel Clusters for the Testing of Interactions in Asthma Data...
Using CART to Unravel Clusters for the Testing of Interactions in Asthma Data...Using CART to Unravel Clusters for the Testing of Interactions in Asthma Data...
Using CART to Unravel Clusters for the Testing of Interactions in Asthma Data...
 
Some of the new features in SPM 7
Some of the new features in SPM 7Some of the new features in SPM 7
Some of the new features in SPM 7
 
A Study of RandomForests Learning Mechanism with Application to the Identific...
A Study of RandomForests Learning Mechanism with Application to the Identific...A Study of RandomForests Learning Mechanism with Application to the Identific...
A Study of RandomForests Learning Mechanism with Application to the Identific...
 
A More Transparent Interpretation of Health Club Surveys (YMCA)
A More Transparent Interpretation of Health Club Surveys (YMCA)A More Transparent Interpretation of Health Club Surveys (YMCA)
A More Transparent Interpretation of Health Club Surveys (YMCA)
 
CART Training 1999
CART Training 1999CART Training 1999
CART Training 1999
 
TreeNet Tree Ensembles and CART Decision Trees: A Winning Combination
TreeNet Tree Ensembles and CART  Decision Trees:  A Winning CombinationTreeNet Tree Ensembles and CART  Decision Trees:  A Winning Combination
TreeNet Tree Ensembles and CART Decision Trees: A Winning Combination
 
Hybrid cart logit model 1998
Hybrid cart logit model 1998Hybrid cart logit model 1998
Hybrid cart logit model 1998
 
Evolution of regression ols to gps to mars
Evolution of regression   ols to gps to marsEvolution of regression   ols to gps to mars
Evolution of regression ols to gps to mars
 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques
Analysing a Complex Agent-Based Model  Using Data-Mining TechniquesAnalysing a Complex Agent-Based Model  Using Data-Mining Techniques
Analysing a Complex Agent-Based Model Using Data-Mining Techniques
 
2000 KDD Cup Winners
2000 KDD Cup Winners 2000 KDD Cup Winners
2000 KDD Cup Winners
 

Similar to Session Logs Tutorial for SPM

Practical management of development & QA environments for SharePoint 2013
Practical management of development & QA environments for SharePoint 2013Practical management of development & QA environments for SharePoint 2013
Practical management of development & QA environments for SharePoint 2013
SharePointRadi
 
IBM SmartCloud Solutions
IBM SmartCloud Solutions IBM SmartCloud Solutions
IBM SmartCloud Solutions
IBM Danmark
 

Similar to Session Logs Tutorial for SPM (20)

XPages: No Experience Needed
XPages: No Experience NeededXPages: No Experience Needed
XPages: No Experience Needed
 
Em13c features- HotSos 2016
Em13c features- HotSos 2016Em13c features- HotSos 2016
Em13c features- HotSos 2016
 
Monitoring and Tuning Oracle FMW 11g
Monitoring and Tuning Oracle FMW 11gMonitoring and Tuning Oracle FMW 11g
Monitoring and Tuning Oracle FMW 11g
 
Introduction to the IBM Java Tools
Introduction to the IBM Java ToolsIntroduction to the IBM Java Tools
Introduction to the IBM Java Tools
 
FDM to ERPi– Upgrade & Migration Strategies
FDM to ERPi– Upgrade & Migration StrategiesFDM to ERPi– Upgrade & Migration Strategies
FDM to ERPi– Upgrade & Migration Strategies
 
Duma ver3
Duma ver3Duma ver3
Duma ver3
 
Lecture_02_System_Structures.ppt.pdf
Lecture_02_System_Structures.ppt.pdfLecture_02_System_Structures.ppt.pdf
Lecture_02_System_Structures.ppt.pdf
 
Lotus Domino 8.5
Lotus Domino 8.5Lotus Domino 8.5
Lotus Domino 8.5
 
Connect2013: BP105: A Performance Boost for your IBM Lotus Notes Client
Connect2013: BP105: A Performance Boost for your IBM Lotus Notes ClientConnect2013: BP105: A Performance Boost for your IBM Lotus Notes Client
Connect2013: BP105: A Performance Boost for your IBM Lotus Notes Client
 
BP105 - A Performance Boost for your IBM Lotus Notes Client
BP105 - A Performance Boost for your IBM Lotus Notes ClientBP105 - A Performance Boost for your IBM Lotus Notes Client
BP105 - A Performance Boost for your IBM Lotus Notes Client
 
Text mining tutorial
Text mining tutorialText mining tutorial
Text mining tutorial
 
Practical management of development & QA environments for SharePoint 2013
Practical management of development & QA environments for SharePoint 2013Practical management of development & QA environments for SharePoint 2013
Practical management of development & QA environments for SharePoint 2013
 
Command Central Overview
Command Central OverviewCommand Central Overview
Command Central Overview
 
Kscope presentation 2013
Kscope presentation 2013Kscope presentation 2013
Kscope presentation 2013
 
IBM SmartCloud Solutions
IBM SmartCloud Solutions IBM SmartCloud Solutions
IBM SmartCloud Solutions
 
IBM Cognos Dimensional Dashboarding Techniques
IBM Cognos Dimensional Dashboarding TechniquesIBM Cognos Dimensional Dashboarding Techniques
IBM Cognos Dimensional Dashboarding Techniques
 
OS-operating systems - ch02 - part-2-2024
OS-operating systems - ch02 - part-2-2024OS-operating systems - ch02 - part-2-2024
OS-operating systems - ch02 - part-2-2024
 
Bp116
Bp116Bp116
Bp116
 
BP116
BP116BP116
BP116
 
HFM API Deep Dive – Making a Better Financial Management Client
HFM API Deep Dive – Making a Better Financial Management ClientHFM API Deep Dive – Making a Better Financial Management Client
HFM API Deep Dive – Making a Better Financial Management Client
 

More from Salford Systems

Paradigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learningParadigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learning
Salford Systems
 
Global Modeling of Biodiversity and Climate Change
Global Modeling of Biodiversity and Climate ChangeGlobal Modeling of Biodiversity and Climate Change
Global Modeling of Biodiversity and Climate Change
Salford Systems
 
Predicting Hospital Readmission Using TreeNet
Predicting Hospital Readmission Using TreeNetPredicting Hospital Readmission Using TreeNet
Predicting Hospital Readmission Using TreeNet
Salford Systems
 
Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...
Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...
Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...
Salford Systems
 
Fraud Analysis and Other Applications of Unsupervised Learning in Property an...
Fraud Analysis and Other Applications of Unsupervised Learning in Property an...Fraud Analysis and Other Applications of Unsupervised Learning in Property an...
Fraud Analysis and Other Applications of Unsupervised Learning in Property an...
Salford Systems
 

More from Salford Systems (16)

Datascience101presentation4
Datascience101presentation4Datascience101presentation4
Datascience101presentation4
 
Improve Your Regression with CART and RandomForests
Improve Your Regression with CART and RandomForestsImprove Your Regression with CART and RandomForests
Improve Your Regression with CART and RandomForests
 
Churn Modeling-For-Mobile-Telecommunications
Churn Modeling-For-Mobile-Telecommunications Churn Modeling-For-Mobile-Telecommunications
Churn Modeling-For-Mobile-Telecommunications
 
The Do's and Don'ts of Data Mining
The Do's and Don'ts of Data MiningThe Do's and Don'ts of Data Mining
The Do's and Don'ts of Data Mining
 
Introduction to Random Forests by Dr. Adele Cutler
Introduction to Random Forests by Dr. Adele CutlerIntroduction to Random Forests by Dr. Adele Cutler
Introduction to Random Forests by Dr. Adele Cutler
 
9 Data Mining Challenges From Data Scientists Like You
9 Data Mining Challenges From Data Scientists Like You9 Data Mining Challenges From Data Scientists Like You
9 Data Mining Challenges From Data Scientists Like You
 
Using CART For Beginners with A Teclo Example Dataset
Using CART For Beginners with A Teclo Example DatasetUsing CART For Beginners with A Teclo Example Dataset
Using CART For Beginners with A Teclo Example Dataset
 
CART Classification and Regression Trees Experienced User Guide
CART Classification and Regression Trees Experienced User GuideCART Classification and Regression Trees Experienced User Guide
CART Classification and Regression Trees Experienced User Guide
 
Data Mining for Higher Education
Data Mining for Higher EducationData Mining for Higher Education
Data Mining for Higher Education
 
SPM User's Guide: Introducing MARS
SPM User's Guide: Introducing MARSSPM User's Guide: Introducing MARS
SPM User's Guide: Introducing MARS
 
Paradigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learningParadigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learning
 
Global Modeling of Biodiversity and Climate Change
Global Modeling of Biodiversity and Climate ChangeGlobal Modeling of Biodiversity and Climate Change
Global Modeling of Biodiversity and Climate Change
 
Predicting Hospital Readmission Using TreeNet
Predicting Hospital Readmission Using TreeNetPredicting Hospital Readmission Using TreeNet
Predicting Hospital Readmission Using TreeNet
 
Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...
Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...
Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...
 
Fraud Analysis and Other Applications of Unsupervised Learning in Property an...
Fraud Analysis and Other Applications of Unsupervised Learning in Property an...Fraud Analysis and Other Applications of Unsupervised Learning in Property an...
Fraud Analysis and Other Applications of Unsupervised Learning in Property an...
 
Predictions from MARS
Predictions from MARSPredictions from MARS
Predictions from MARS
 

Recently uploaded

Recently uploaded (20)

Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutes
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 

Session Logs Tutorial for SPM

  • 1. How Did I Get That Model? Session Logs in SPM Dan Steinberg Salford Systems http://www.salford-systems.com December 2012
  • 2. SPM Version Numbers • Salford Systems has begun offering SPM 7.0 to all of our customers (as of November 30, 2012) and from this date forward our videos and instructional materials will make use of the SPM 7 application • The SPM interface was designed to be as similar as possible to the SPM 6 series of software meaning that you can follow along for much of the presentation with that version (or with version 6 generations of the standalone engines • We will clearly point out features, options, and controls that are unique to 7 • The current video and blog is largely relevant to SPM 6 and you will not require SPM 7 for most of its content © Copyright Salford Systems 2012
  • 3. How Did I Get That Model or Result? • In the course of interactive data analysis modelers typically modify many model setup controls and options • Many modelers will also alter the data they use – Construction of new variables during the session – Focusing on specific subsets of the data (segments) – Using SELECT statements or DELETE records via the built-in Salford BASIC programming language • After a long session of changing this and that you might find that you have lost track of exactly what you have done • You might not realize this until you save work and come back to again after a long weekend or after a hiatus of several weeks © Copyright Salford Systems 2012
  • 4. How SPM Can Help You • SPM and its predecessor standalone data mining products CART, MARS, TreeNet, and RandomForests all maintain an audit trail of every session • The audit trail is a set of commands that were generated either by you or by the GUI (Graphical User Interface) on your behalf as you conducted your session • The audit trail does not record pure GUI actions you might have taken such as viewing the Variable Importance ranking or resizing a window • The audit trail records all file open and save actions and commands that set up or run a model © Copyright Salford Systems 2012
  • 5. The Command Log • For the log of your current session the simplest way to review it is to click on the command log icon on the toolbar – The “L” circled in red below © Copyright Salford Systems 2012
  • 6. Clicking on the “L” icon brings up a text file © Copyright Salford Systems 2012
  • 7. Understanding the Command Log • Often you do not need to pay any attention to most of this file as it is devoted to setting up default options • Instead you will want to focus on essentials such as – USE the command connecting you to your data – MODEL the command identifying your target – KEEP the command listing the predictors you are using – Here are our commands related to the setting up of a regression tree using the BOSTON.CSV Boston housing data set © Copyright Salford Systems 2012
  • 8. Saving and Retrieving Command Logs • You do not have to worry about saving the command logs as they are saved for you automatically • However, you should make your own decision as to where your logs will be saved • We recommend that you to to the EDIT menu and select Options © Copyright Salford Systems 2012
  • 9. Alternatively, select the check mark toolbar icon This brings up the same dialog as does the EDIT ..Options menu item Then select the Directories tab © Copyright Salford Systems 2012
  • 10. Select a convenient location for Temporary Files In addition to temporary work files we also permanently store command logs for every session in this location. Windows will default to a possibly awkward location so we advise changing it © Copyright Salford Systems 2012
  • 11. SPM 7 automatically navigates to the stored logs In version 6 applications you will need to navigate to this location manually if you wish to open one of the past session logs Select this item in SPM 7 to reveal the log archives as shown on the next screen © Copyright Salford Systems 2012
  • 12. Directory Listing of Past Session Logs These are all plain text files with a .TXT extension and have names beginning with CTRXmmdd where mm=month and dd=date of creation. The remainder of the name is randomly generated. Files of size 2KB are for sessions that just opened and closed SPM © Copyright Salford Systems 2012
  • 13. Session Logs Are Permanent • We do not ever delete session logs but you might want to both selectively delete some sessions including the ones in which next to nothing was done • You might also want to rename important logs so that you can tell what they are about • Session logs are updated after each command is generated either by you directly from the command line or via the commands that the GUI generates for you • Session logs are generally complete but may lack the very last command issued if the application of the Operating System subsequently crashed • Session logs are critical for diagnosing problems as well as for determining exactly what you did to obtain a result © Copyright Salford Systems 2012
  • 14. Command Logs and Groves • SPM stores all models in a special form we call a grove. Groves may optionally be saved and transferred from one machine to another including between Windows and Linux/UNIX platforms • Groves store model information and the entire command log up to the point at which the model in question was generated • If you build three consecutive models – The first model grove will contain the command log up to the point that the first model was built – The second model grove will contain the command log relevant to both the first and the second models – The third model grove will contain the command log for al three models built © Copyright Salford Systems 2012
  • 15. Command Log from GUI (Grove) We can access all commands issued in the session up to the creation of this model. These commands are saved in the grove file. © Copyright Salford Systems 2012
  • 16. GUI Grove Based Command Log Display: TreeNet Model • If a model’s main results are being displayed you will always see a “Commands” button towards the bottom of the display “Commands” button is available for CART, MARS, GPS, and TreeNet models. Reveals same information as command log up to the moment model was created Commands available at any future time from the grove if it is saved © Copyright Salford Systems 2012
  • 17. Hints on Trouble Shooting • Since the command logs contain literally every command issued either by you directly or by the GUI on your behalf it serves as a source of information for explaining unexpected results • Some common causes unexpected results include – A SELECT command being active or inactive – BASIC commands deleting some records, setting certain predictors to missing, or altering some predictors – Analysis type being changed from Classification to Regression – An active LIMIT command preventing a CART tree from growing as large as expected or desired – Model setup controls altered such as CART growing method or number of nodes in TreeNet trees, etc. © Copyright Salford Systems 2012