Data-Driven College Counseling
Michael Discenza
Senior Data Scientist - SchooLinks
SchooLinks | A personalized college and career readiness solution
About Me
● Statistics B.A. + M.A. @ Columbia
● Data & Accountability Team @ Success Academies Charter
Network in NYC
● Data Science @ JPMorgan, @ RUN Ads (Digital ad targeting)
● Currently Data Science @ SchooLinks
● Why am I here and what do I know about college
planning/counseling?
What should you take away?
1) How being data-driven can help you in
counseling
2) Process/Framework to follow
3) Exposure and working knowledge of more advanced
techniques
Why data-driven?
● “Big data”
● It helps businesses figure out what they need to do, or what actions
they need to take to meet their goals - predicting the future? Super
powers?
● Why does it work? It boils down to the scientific method and the
availability of data.
● Businesses saw that this framework for thinking about the world
was helpful for them.
What is data useful for in college
counseling?
Conduct research yourself if you’re in academia or want to publish
a paper about college counseling…
But more likely:
○ Learn how to be more effective individually,
○ Be more effective as a department, or
○ Justify the investment to use a new curriculum/ new
approach (be sure it has a positive outcome)
What are your goals?
● Examples from folks that we work with:
○ Increase the number of students who have meaningful
post graduation plans
○ Increase college going rate
○ Close achievement gap in your school/district
○ College retention
Quantify these Goals
● Goal Metrics
○ Outcomes (matriculation, retention)
○ Process metrics (setting up for success… completion date of
applications)
● KPI - intermediate/progress tracking
○ FAFSA Completion
○ PSAT/SAT/ACT, etc. completion rates
The Process
1) PLAN
2) Get data
3) Prepare/Analyze data
4) Improve based on the learnings from your data
1. Plan
● Background research to make sure your question is a good one for
primary research: i.e. specific to your school and can’t be more
efficiently answered by reading it in a book or elsewhere
● Write down your question
● Make sure it is important and tied to outcomes you care about…
and will yield actionable insights
● Lab notebook (folder on your computer) etc.
● Ensure data access/you will be able to complete the actual
“research”
● List out assumptions
2. Getting Data
● Two main types of data:
○ Outcome data (dependent variable) - college going rate,
students who were accepted into their top 3 choices
(combination of the KPI and the goal metrics we talked about
before)
○ Treatment data (independent variable) - curriculum they used,
programs/extracurricular at schools, sentiment as reported by
surveys
● Sources of data (we’ll talk about strategies for each):
○ Existing data
○ Data you collect
2.1 Getting Data - Using Existing Sources of
Student Data
● Where do you find it/how do you access?
○ SIS data - grades, attendance, participation -> CSV export
○ College tools such as SchooLinks and Naviance, National
Student Clearinghouse data
● What does the data mean, “data generating process”
● FERPA?
2.2 Getting Data - Collecting Your Own Data
● Sources:
○ Structured activities/curriculum exposure (what they do)
○ Surveys/Questionnaires (what they say they do and what
they think)
● Best Practices:
○ Organization is essential: lab notebook, dates/timestamps
○ Pay attention to ID space - your ability to analyze data is
actually really tied to your ability to tie outcome data to
treatment data with a key
3. Preparing Data
● Combining data from different data sets: ID space (key)
○ Vlookup (Excel, Google sheets, Apple Numbers)
○ Joins - SQL, python, etc.
● Messy/missing data, outliers - what to include and not to
include?
● Visualizing data
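The join step above can be sketched in a few lines of plain Python. This is a minimal illustration, not a prescribed workflow: the tables, the `student_id` key, and the `inner_join` helper are all hypothetical, and a VLOOKUP or SQL join does the same job.

```python
# Hypothetical records; "student_id" is the shared key (the "ID space").
treatment = [
    {"student_id": "S1", "curriculum": "A"},
    {"student_id": "S2", "curriculum": "B"},
    {"student_id": "S3", "curriculum": "B"},
]
outcomes = [
    {"student_id": "S1", "enrolled": True},
    {"student_id": "S2", "enrolled": False},
    {"student_id": "S4", "enrolled": True},
]


def inner_join(left, right, key):
    """Return merged rows for key values present in both tables."""
    right_by_key = {row[key]: row for row in right}
    return [
        {**row, **right_by_key[row[key]]}
        for row in left
        if row[key] in right_by_key
    ]


joined = inner_join(treatment, outcomes, "student_id")

# Students with treatment data but no outcome data: decide what to do with them.
unmatched = sorted(
    {row["student_id"] for row in treatment}
    - {row["student_id"] for row in outcomes}
)
```

Listing the unmatched IDs is a quick way to spot missing data before deciding what to include.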
4. Analyzing Data
● All about the relationship between the treatment data and the
outcome data.
● Conditional probability is the most complicated math you’ll
need, and most of these dynamics are really easily visualized with
graphs
● Background: ASCA actually has a pretty useful book - a review
of percentages/probability, etc. focused on giving counselors the
background to do this work - you might be able to pick it up
here if there’s a bookstore
Sample Data Prep
Sample Data Analysis
Students who had B achieved success at a rate of
33%, whereas students who had A achieved success at
a rate of 22%
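A comparison like this is a conditional probability: the share of successes given each treatment. Below is a minimal sketch. The counts are hypothetical, chosen only so that the rates round to the percentages above; the slide does not give group sizes.

```python
from collections import defaultdict

# Hypothetical rows of (treatment, success). Group sizes are assumed.
records = (
    [("A", True)] * 2 + [("A", False)] * 7
    + [("B", True)] * 3 + [("B", False)] * 6
)


def success_rate_by_group(rows):
    """Estimate P(success | treatment) for each treatment group."""
    totals = defaultdict(int)
    successes = defaultdict(int)
    for group, success in rows:
        totals[group] += 1
        successes[group] += int(success)
    return {group: successes[group] / totals[group] for group in totals}


rates = success_rate_by_group(records)
for group, rate in sorted(rates.items()):
    print(f"{group}: {rate:.0%}")
```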
What to do with your findings:
● Apply them yourself
● Share them - if they’re worthwhile for you, they’re probably
worthwhile for the rest of your dept (ideally “generalizable”)
● Communicate them - for larger adoption across a dept or
funding
○ Graphs, writing, speaking
○ Keep it simple
Pretty simple… so what’s the big deal?
Concepts you should be aware of:
● Regression
● Classification
● Multivariate Analysis
● Causal Analysis
● Statistical Confidence (p values)
● Machine Learning
Goal: know how these are useful
Classification
● Determining the class or group of a
case
● Outcome is the probability of a case being
part of a certain group
● Most common use case is binary
classification
● Many different statistical methods
(“families of models”) can be used
Example:
Predicting whether a student will fill out the FAFSA
by a certain date based on academic performance
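One common family of models for binary classification is logistic regression. The sketch below fits it with plain gradient descent on made-up data, with GPA standing in for academic performance. The numbers, the `fit_logistic` helper, and the learning-rate settings are assumptions for illustration, not the method used in the talk.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(xs, ys, lr=0.5, steps=2000):
    """Fit P(y = 1 | x) = sigmoid(w * (x - mean_x) + b) by gradient descent."""
    mean_x = sum(xs) / len(xs)
    w = b = 0.0
    for _ in range(steps):
        grad_w = grad_b = 0.0
        for x, y in zip(xs, ys):
            error = sigmoid(w * (x - mean_x) + b) - y
            grad_w += error * (x - mean_x)
            grad_b += error
        w -= lr * grad_w / len(xs)
        b -= lr * grad_b / len(xs)
    return lambda x: sigmoid(w * (x - mean_x) + b)


# Hypothetical data: GPA and whether the FAFSA was filed by the deadline.
gpa = [2.0, 2.4, 2.8, 3.0, 3.2, 3.5, 3.7, 3.9]
filed = [0, 0, 1, 0, 1, 1, 1, 1]

predict = fit_logistic(gpa, filed)
p_low = predict(2.2)   # predicted probability for a lower-GPA student
p_high = predict(3.8)  # predicted probability for a higher-GPA student
```

The output is a probability per student, which a counselor could use to decide who gets an early reminder.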
Regression
● Predicting continuous outcomes
● Similar to our exercise but instead
of the probability, we’re looking at
the average score for particular
treatment groups or the average
change in one variable for a unit
change in the other (the “slope”)
Example:
Predicting number of AP classes by hours of
extracurricular activity per week
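The “slope” can be computed directly with ordinary least squares. A minimal sketch on hypothetical data follows; the numbers are invented for illustration.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: weekly extracurricular hours and AP classes taken.
hours = [0, 2, 4, 6, 8]
ap_classes = [1, 2, 2, 3, 4]

slope, intercept = fit_line(hours, ap_classes)
print(f"AP classes ≈ {intercept:.2f} + {slope:.2f} × hours")
```

Here the slope reads as the average change in AP classes for each additional hour of activity per week.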
Multivariate Analysis
● Incorporating more than one
independent variable, still only one
response variable
● Can think of it as data in more than two
dimensions
● Think about the effect of one variable
controlling for all others
● Could be in classification setting or
regression setting
http://metabolomicsplatform.com/projects/gc-ms/
Example:
Predicting whether a student will fill out the FAFSA by a
certain date based on academic performance,
demographics, and survey data
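One simple way to see what “controlling for” another variable means is to compare treatment groups within strata of that variable. The sketch below is an assumption-laden illustration: the GPA bands, the workshop flag, and the counts are all hypothetical. A regression with several independent variables does this more generally.

```python
from collections import defaultdict

# Hypothetical rows of (gpa_band, attended_workshop, filed_fafsa).
rows = (
    [("high", "yes", 1)] * 4 + [("high", "yes", 0)] * 1
    + [("high", "no", 1)] * 3 + [("high", "no", 0)] * 1
    + [("low", "yes", 1)] * 1 + [("low", "yes", 0)] * 1
    + [("low", "no", 1)] * 2 + [("low", "no", 0)] * 4
)


def rates_within_strata(records):
    """Filing rate for each (stratum, treatment) combination."""
    counts = defaultdict(lambda: [0, 0])  # key -> [filed, total]
    for stratum, treatment, filed in records:
        counts[(stratum, treatment)][0] += filed
        counts[(stratum, treatment)][1] += 1
    return {key: filed / total for key, (filed, total) in counts.items()}


stratified = rates_within_strata(rows)
for key in sorted(stratified):
    print(key, f"{stratified[key]:.0%}")
```

In this made-up data the gap between workshop and non-workshop students is smaller inside each GPA band than in the pooled data, because most workshop attendees are in the high band.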
Causal Analysis
● Accounting for the fact that example cases
aren’t always assigned to treatment in a
randomized way
● Many techniques, usually require a lot of data,
simplest is Propensity Score Matching (PSM)
● 1) build model to understand probability of
assignment to treatment/control
● 2) Pick groups of subjects in treatment and
control groups that had the same chance of
being assigned to the treatment or control
based on all of the other data (controls for bias)
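Step 2 can be sketched as greedy nearest-neighbor matching. The sketch assumes step 1 is already done, so each student has an estimated propensity score. The scores, the IDs, the `caliper` value, and the `match_on_propensity` helper are hypothetical; real PSM work typically needs far more data and diagnostics.

```python
def match_on_propensity(treated, control, caliper=0.05):
    """Pair each treated subject with the closest unused control subject.

    `treated` and `control` map subject ID -> estimated propensity score.
    Pairs further apart than `caliper` are left unmatched.
    """
    available = dict(control)
    pairs = []
    for t_id, t_score in sorted(treated.items()):
        if not available:
            break
        c_id = min(available, key=lambda c: abs(available[c] - t_score))
        if abs(available[c_id] - t_score) <= caliper:
            pairs.append((t_id, c_id))
            del available[c_id]  # match without replacement
    return pairs


# Hypothetical propensity scores, as if produced by a step-1 model.
treated_scores = {"T1": 0.80, "T2": 0.55, "T3": 0.30}
control_scores = {"C1": 0.78, "C2": 0.52, "C3": 0.10, "C4": 0.57}

pairs = match_on_propensity(treated_scores, control_scores)
```

Outcomes are then compared only across the matched pairs; here `T3` has no control within the caliper and is dropped.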
P-values
● All about quantifying how certain
you are that your finding is a real
finding and not just “random
variation” or statistical noise
● Dependent on sample size and
variability of your data
● Given that there was no true
difference between groups, how
likely would you be to find the
two groups as different as you did
in your analysis?
http://uk.cochrane.org/news/key-statistical-result-interpretation-p-value-plain-english
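The question in the last bullet can be answered by simulation with a permutation test: shuffle the group labels many times and count how often the shuffled gap is at least as large as the observed one. This is a minimal sketch on hypothetical 0/1 outcomes. The function name, the number of permutations, and the seed are assumptions, and other tests, such as a two-proportion z-test, are also common.

```python
import random


def permutation_p_value(group_a, group_b, n_permutations=5000, seed=0):
    """Two-sided permutation test for a difference in success rates."""
    rng = random.Random(seed)

    def gap(a, b):
        return abs(sum(a) / len(a) - sum(b) / len(b))

    observed = gap(group_a, group_b)
    pooled = list(group_a) + list(group_b)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)
        shuffled_gap = gap(pooled[:len(group_a)], pooled[len(group_a):])
        if shuffled_gap >= observed - 1e-12:  # tolerance for float rounding
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_permutations + 1)


# Hypothetical outcomes: 1 = success, 0 = no success.
group_a = [1] * 6 + [0] * 14   # 30% success
group_b = [1] * 12 + [0] * 8   # 60% success

p_value = permutation_p_value(group_a, group_b)
print(f"p ≈ {p_value:.3f}")
```

Larger groups or a wider gap push the p-value down, which is the dependence on sample size and variability noted above.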
Machine Learning
● Figuring out how to encode knowledge
and patterns into structures we can use
● All about predictive accuracy vs.
statistics which is more about
assembling knowledge of the
underlying patterns that we study
● Supervised vs. Unsupervised Learning
● Many different methodologies:
decision trees, Bayesian learning,
deep learning, clustering, expectation
maximization
Additional Resources
Concluding Remarks
● Data skills - more about logic, domain knowledge, and posing
good questions rather than hard technical skills
● Get 80% of the way there with conditional probability
● Use tools to automate workflow and save time
● Ask questions... of your data, your vendors, colleagues, the
internet
Questions?
Contact Info:
Mike@schoolinks.com
Data-Driven College Counseling Techniques

  • 1. Data-Driven College Counseling Michael Discenza Senior Data Scientist - SchooLinks SchooLinks | A personalized college and career readiness solution
  • 2. About Me ● Statistics B.A. + M.A. @ Columbia ● Data & Accountability Team @ Success Academies Charter Network in NYC ● Data Science @JPMorgan, @ RUN Ads (Digital ad targeting) ● Currently Data Science @ SchooLinks ● Why am I here and what do I know about college planning/counseling?
  • 3. What should you take away? 1) How being data driven counseling can help you in counseling 2) Process/Framework to follow 3) Exposure and working knowledge of more advanced techniques
  • 4. Why data-driven? ● “Big data” ● It helps businesses figure out what they need to do, or what actions they need to take to meet their goals - predicting the future? Super powers? ● Why does it work? Boils down to is Scientific Method and the availability of data. ● Businesses saw that this framework for thinking about the world was helpful for them.
  • 5.
  • 6. What is data useful for in college counseling?
  • 7. Conduct research yourself if you’re in academia or want to publish a paper about college counseling… But more likely: ○ Learn how to be more effective individually, ○ Be more effective as a department, or ○ Justify the investment to use a new curriculum/ new approach (be sure it has a positive outcome)
  • 8. What are your goals? ● Examples from folks that we work with: ○ Increase the number of students who have meaningful post graduation plans ○ Increase college going rate ○ Close achievement gap in your school/district ○ College retention
  • 9. Quantify these Goals ● Goal Metrics ○ Outcomes (matriculation, retention) ○ Process metrics (setting up for success… completion date of applications) ● KPI - intermediate/progress tracking ○ FAFSA Completion ○ PSAT/SAT/ACT, etc completion rates
  • 10. The Process 1) PLAN 2) Get data 3) Prepare/Analyze data 4) Improve based on the learnings from your data
  • 11. 1. Plan ● Background research to make sure your question is a good one for primary research: i.e. specific to your school and can’t be more efficiently answered by reading it in a book or elsewhere ● Write down your question ● Make sure it is important and tied to outcomes you are about… and will yield actionable insights ● Lab notebook (folder on your computer) etc. ● Ensure data access/you will be able to complete the actual “research" ● List out assumptions
  • 12. 2. Getting Data ● Two main types of data: ○ Outcome data (dependent variable) - college going rate, students who were accepted into their top 3 choices (combination of the KPI and the goal metrics we talked about before) ○ Treatment data (independent variable) - curriculum they used, programs/extracurricular at schools, sentiment as reported by surveys ● Sources of data (we’ll talk about strategies for each): ○ Existing data ○ Data you collect
  • 13. 2.1 Getting Data - Using Existing Sources of Student Data ● Where do you find it and how do you access it? ○ SIS data - grades, attendance, participation -> CSV export ○ College tools such as SchooLinks and Naviance, National Student Clearinghouse data ● What does the data mean? Understand the “data-generating process” ● FERPA?
  • 14. 2.2 Getting Data - Collecting Your Own Data ● Sources: ○ Structured activities/curriculum exposure (what they do) ○ Surveys/Questionnaires (what they say they do and what they think) ● Best Practices: ○ Organization is essential: lab notebook, dates/timestamps ○ Pay attention to ID space - your ability to analyze data depends on your ability to tie outcome data to treatment data with a key
  • 15. 3. Preparing Data ● Combining data from different data sets: ID space (key) ○ VLOOKUP (Excel, Google Sheets, Apple Numbers) ○ Joins - SQL, Python, etc. ● Messy/missing data, outliers - what to include and what not to include? ● Visualizing data
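The idea of combining data sets on a shared key can be sketched in a few lines of Python. This is only an illustration: the records, the field names (`student_id`, `curriculum`, `enrolled_in_college`), and the helper `left_join` are hypothetical and not taken from the talk. It does roughly what a VLOOKUP or a SQL left join would do.

```python
# Hypothetical records; student_id is the shared key (the "ID space").
treatment_rows = [
    {"student_id": "S001", "curriculum": "A"},
    {"student_id": "S002", "curriculum": "B"},
    {"student_id": "S003", "curriculum": "B"},
]
outcome_rows = [
    {"student_id": "S001", "enrolled_in_college": True},
    {"student_id": "S002", "enrolled_in_college": False},
    # S003 has no outcome record yet (missing data).
]


def left_join(left, right, key):
    """Attach matching right-hand fields to every left-hand row.

    Left rows without a match are kept, so missing data stays visible.
    Assumes each key value appears at most once in `right`.
    """
    right_by_key = {row[key]: row for row in right}
    joined = []
    for row in left:
        match = right_by_key.get(row[key], {})
        joined.append({**row, **match})
    return joined


joined_rows = left_join(treatment_rows, outcome_rows, "student_id")

# Students with treatment data but no outcome data need a decision:
# include them, exclude them, or go find the missing record.
unmatched_ids = [
    row["student_id"] for row in joined_rows if "enrolled_in_college" not in row
]
```

Keeping unmatched rows, rather than silently dropping them, makes the "what to include and what not to include" question an explicit choice.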
  • 16. 4. Analyzing Data ● All about the relationship between the treatment data and the outcome data. ● Conditional probability is the most complicated math you’ll need, and most of these dynamics are easily visualized with graphs ● Background: ASCA actually has a pretty useful book - a review of percentages, probability, etc. focused on giving counselors the background to do this work - you might be able to pick it up here if there’s a bookstore
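As a rough sketch of the conditional probability mentioned above, the success rate within each treatment group is the number of successes in that group divided by the size of the group. The records and the function name `success_rate_by_treatment` below are made up for illustration.

```python
from collections import Counter

# Hypothetical (treatment, succeeded) pairs, one per student.
records = [
    ("A", True), ("A", False), ("A", False), ("A", True),
    ("B", True), ("B", True), ("B", False), ("B", True),
]


def success_rate_by_treatment(rows):
    """Return P(success | treatment) for each treatment group."""
    totals = Counter(treatment for treatment, _ in rows)
    successes = Counter(treatment for treatment, ok in rows if ok)
    return {treatment: successes[treatment] / totals[treatment] for treatment in totals}


rates = success_rate_by_treatment(records)
for treatment, rate in sorted(rates.items()):
    print(f"P(success | treatment {treatment}) = {rate:.0%}")
```

With this toy data, group A succeeds in 2 of 4 cases and group B in 3 of 4, so the two conditional probabilities are 50% and 75%.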
  • 18. Sample Data Analysis Students who had B achieved success at a rate of 33%, whereas students who had A achieved success at a rate of 22%
  • 19. What to do with your findings: ● Apply them yourself ● Share them - if they’re worthwhile for you, they’re probably worthwhile for the rest of your dept (ideally “generalizable”) ● Communicate them - for larger adoption across a dept or funding ○ Graphs, writing, speaking ○ Keep it simple
  • 20. Pretty simple… what’s the big deal? Concepts you should be aware of: ● Regression ● Classification ● Multivariate Analysis ● Causal Analysis ● Statistical Confidence (p-values) ● Machine Learning Goal: know how these are useful
  • 21. Classification ● Determining the class or group of a case ● Outcome is the probability of a case being part of a certain group ● Most common use case is binary classification ● Many different statistical methods (“families of models”) can be used Example: Predicting whether a student will fill out the FAFSA by a certain date based on academic performance
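One possible sketch of binary classification is logistic regression, which is just one of the many model families mentioned above and is picked here only for illustration. The GPA values, the outcomes, and the function names are hypothetical; a real analysis would more likely use a statistics package than hand-written gradient descent.

```python
import math

# Hypothetical data: GPA, and whether the student filed the FAFSA by the deadline.
gpas = [1.8, 2.0, 2.4, 2.7, 3.0, 3.2, 3.5, 3.8]
filed = [0, 0, 0, 1, 0, 1, 1, 1]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(xs, ys, learning_rate=0.1, steps=5000):
    """Fit P(y=1 | x) = sigmoid(w * (x - mean_x) + b) by gradient descent."""
    mean_x = sum(xs) / len(xs)
    centered = [x - mean_x for x in xs]
    w, b = 0.0, 0.0
    for _ in range(steps):
        grad_w = grad_b = 0.0
        for x, y in zip(centered, ys):
            error = sigmoid(w * x + b) - y  # predicted probability minus label
            grad_w += error * x
            grad_b += error
        w -= learning_rate * grad_w / len(xs)
        b -= learning_rate * grad_b / len(xs)
    return w, b, mean_x


def predict_probability(model, x):
    """Predicted probability that a student with GPA x files on time."""
    w, b, mean_x = model
    return sigmoid(w * (x - mean_x) + b)


model = fit_logistic(gpas, filed)
p_low = predict_probability(model, 2.0)
p_high = predict_probability(model, 3.6)
```

The output is a probability for each case; turning it into a yes/no prediction means choosing a cutoff, such as 0.5.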
  • 22. Regression ● Predicting continuous outcomes ● Similar to our exercise, but instead of the probability, we’re looking at the average score for particular treatment groups or the average change in one variable for a unit change in the other (the “slope”) Example: Predicting number of AP classes by hours of extracurricular activity per week
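The “slope” can be computed with ordinary least squares. The sketch below uses invented numbers for weekly extracurricular hours and AP classes; `fit_line` is a hypothetical helper name.

```python
# Hypothetical data: weekly extracurricular hours and number of AP classes taken.
hours_per_week = [0, 2, 4, 6, 8, 10]
ap_classes = [1, 1, 2, 3, 3, 4]


def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sum_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sum_xx = sum((x - mean_x) ** 2 for x in xs)
    slope = sum_xy / sum_xx  # average change in y per one-unit change in x
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = fit_line(hours_per_week, ap_classes)
predicted_at_5_hours = intercept + slope * 5
```

Here the slope is read as the average difference in AP classes associated with one additional hour of extracurricular activity per week. It describes an association in the data, not necessarily a cause.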
  • 23. Multivariate Analysis ● Incorporating more than one independent variable, still only one response variable ● Can think of it as data in more than two dimensions ● Think about the effect of one variable controlling for all others ● Could be in classification setting or regression setting http://metabolomicsplatform.com/projects/gc-ms/ Example: Predicting whether a student will fill out the FAFSA by a certain date based on academic performance, demographics, and survey data
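One simple way to think about “the effect of one variable controlling for all others” is stratification: compare treated and untreated students only within groups that share the same value of the other variable. The sketch below is an assumption-laden illustration, with a made-up workshop treatment and a single made-up control variable (`gpa_band`). A multivariate regression or classification model generalizes the same idea to many variables.

```python
from collections import defaultdict

# Hypothetical rows: (gpa_band, attended_workshop, filed_fafsa_on_time).
rows = [
    ("high", True, True), ("high", True, True), ("high", True, False),
    ("high", False, True), ("high", False, False),
    ("low", True, True), ("low", True, False),
    ("low", False, False), ("low", False, False), ("low", False, True),
]


def rates_within_strata(data):
    """Return {stratum: {treated_flag: success_rate}}."""
    counts = defaultdict(lambda: [0, 0])  # (stratum, treated) -> [successes, total]
    for stratum, treated, success in data:
        counts[(stratum, treated)][0] += int(success)
        counts[(stratum, treated)][1] += 1
    result = defaultdict(dict)
    for (stratum, treated), (wins, total) in counts.items():
        result[stratum][treated] = wins / total
    return {stratum: dict(by_flag) for stratum, by_flag in result.items()}


stratum_rates = rates_within_strata(rows)

# Treated-minus-untreated difference, holding GPA band fixed.
differences = {
    stratum: by_flag[True] - by_flag[False]
    for stratum, by_flag in stratum_rates.items()
}
```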
  • 24. Causal Analysis ● Accounting for the fact that example cases aren’t always assigned to treatment in a randomized way ● Many techniques, which usually require a lot of data; the simplest is Propensity Score Matching (PSM) ● 1) Build a model to understand the probability of assignment to treatment/control ● 2) Pick groups of subjects in the treatment and control groups that had the same chance of being assigned to the treatment or control based on all of the other data (controls for bias)
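The two PSM steps can be sketched with a deliberately simplified example. The big assumption: the propensity “model” here is just the share of treated students within each GPA band, standing in for a fitted model such as logistic regression. With a single banded covariate this reduces to exact matching on the band. All data, field layouts, and function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical students: (student_id, gpa_band, treated, succeeded).
students = [
    ("S01", "high", True, True), ("S02", "high", True, True),
    ("S03", "high", True, False), ("S04", "high", False, True),
    ("S05", "low", True, True), ("S06", "low", False, False),
    ("S07", "low", False, True), ("S08", "low", False, False),
]


def propensity_by_band(rows):
    """Step 1: estimate P(treated | gpa_band) as the treated share per band."""
    treated_count = defaultdict(int)
    total = defaultdict(int)
    for _, band, treated, _ in rows:
        treated_count[band] += int(treated)
        total[band] += 1
    return {band: treated_count[band] / total[band] for band in total}


def match_pairs(rows, scores, caliper=0.05):
    """Step 2: pair each treated student with the closest unused control.

    A treated student is left unmatched if no remaining control has a
    propensity score within `caliper`.
    """
    controls = [row for row in rows if not row[2]]
    pairs = []
    for treated_row in (row for row in rows if row[2]):
        if not controls:
            break
        target = scores[treated_row[1]]
        best = min(controls, key=lambda c: abs(scores[c[1]] - target))
        if abs(scores[best[1]] - target) <= caliper:
            pairs.append((treated_row, best))
            controls.remove(best)
    return pairs


scores = propensity_by_band(students)
pairs = match_pairs(students, scores)

# Average treated-minus-control outcome difference across matched pairs.
effect = sum(int(t[3]) - int(c[3]) for t, c in pairs) / len(pairs) if pairs else None
```

In this toy data only two treated students find a comparable control, which shows why these techniques usually need a lot of data.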
  • 25. P-values ● All about quantifying how certain you are that your finding is a real finding and not just “random variation” or statistical noise ● Dependent on sample size and variability of your data ● Given that there was no true difference between groups, how likely would you be to find the two groups as different as you did in your analysis? http://uk.cochrane.org/news/key-statistical-result-interpretation-p-value-plain-english
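A permutation test is one way to make the p-value question concrete: if group labels did not matter, how often would randomly shuffled labels produce a gap at least as large as the one observed? This is a sketch with invented outcomes; the function name, the number of shuffles, and the seed are arbitrary choices for illustration.

```python
import random

# Hypothetical outcomes (1 = success) for two groups of students.
group_a = [1, 0, 0, 1, 0, 0, 0, 1, 0, 0]
group_b = [1, 1, 0, 1, 1, 0, 1, 1, 0, 1]


def permutation_p_value(a, b, n_shuffles=10000, seed=0):
    """Two-sided permutation test for a difference in success rates."""
    rng = random.Random(seed)  # fixed seed so the result is reproducible
    observed = abs(sum(a) / len(a) - sum(b) / len(b))
    pooled = list(a) + list(b)
    at_least_as_extreme = 0
    for _ in range(n_shuffles):
        rng.shuffle(pooled)  # pretend group labels are assigned at random
        new_a, new_b = pooled[:len(a)], pooled[len(a):]
        diff = abs(sum(new_a) / len(new_a) - sum(new_b) / len(new_b))
        if diff >= observed - 1e-12:  # small tolerance for float rounding
            at_least_as_extreme += 1
    return at_least_as_extreme / n_shuffles


p_value = permutation_p_value(group_a, group_b)
```

A small p-value means a gap this large would be unusual if there were no true difference. With only ten students per group, even a 30% versus 70% gap is not especially rare under random shuffling, which illustrates the dependence on sample size.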
  • 26. Machine Learning ● Figuring out how to encode knowledge and patterns into structures we can use ● All about predictive accuracy, vs. statistics, which is more about assembling knowledge of the underlying patterns that we study ● Supervised vs. Unsupervised Learning ● Many different methodologies: decision trees, Bayesian learning, deep learning, clustering, expectation maximization
  • 28. Concluding Remarks ● Data skills - more about logic, domain knowledge, and posing good questions rather than hard technical skills ● Get 80% of the way there with conditional probability ● Use tools to automate workflow and save time ● Ask questions... of your data, your vendors, colleagues, the internet
  • 29. Questions? Contact Info: Mike@schoolinks.com SchooLinks | A personalized college and career readiness solution