SlideShare a Scribd company logo
1 of 31
Download to read offline
July 10-11, 2019
The Conference Center,
How AI Helps Students
Solve Math Problems
Hwechul Derrick Cho
Principal AI Research Engineer
ST Unitas & The Princeton Review
About the company
An Online Tutoring Service
The Princeton Review’s Earlier Work
History
• Tutoring Service since 1998
• Homework Help Mobile in 2015
• Live tutoring PC to mobile
Opportunity
• Generation Z expects a shorter feedback cycle
We Launched Conects Q&A
DEMO VIDEO
How can we provide students
an answer as fast as possible?
Our Solution: A Problem Search Engine
Example Math Problems
Student’s Query Search Result Top1 Search Result Top2 Search Result Top3
How the Problem Search Engine Works
5
Why a Problem Search Engine is Valuable
Business side
• Lower the tutoring cost
• More questions and more data
Student side
• Find answers quickly
• More affordable tutoring
Steps Taken & Learnings
Define (1) Build In-house Dataset
Needs
• Evaluate performance
• Access similar images; however, no public
dataset matched our case
What we did: Took 1,000 photos
of a problem from our book
Example of The Princeton Review SAT
Define (2) Augmentation and Pairing
True Label: Original Image
(1,000 images)
Original Image
Input Data: Augmented Images
(5 per each original image)
Augmented Images
Finding
Original Image
Define (3) Set Out Baseline Model
How
• Baseline doesn’t need to be state-of-the-art
• Calculated similarity distance of Perceptual Hash (pHash)
Result: Top@5 Accuracy ≈ 30%
Solving (1) Search Similar Images
How
• Use distance of two images’ representation
• Our baseline, pHash, is also image representation
• We used ImageNet models to represent images to vector
Result: Top@5 Accuracy ≈ 50%
Example of image representation architecture
Vector representation
RGB Image
Solving (2) Search Similar Texts
Example of text-only image
Problem: Text-only math problems
Solving (2) Use Amazon Rekognition
• Amazon Rekognition was the fastest way to proof of concept
Result: Top@5 Accuracy ≈ 72%
Amazon Rekognition example
Extracted text
Solving (3) Search Similar Images with Texts
Vector representation
RGB Image
+
• Combine two similarity scores
• Use simple grid search algorithm to find optimal combine factor
Result: Top@5 Accuracy ≈ 81%
Uncovered Blind Spots to Keep Iterating
• Didn’t recognize mathematic symbols or different fonts
• Text extracted from graphs unhelpful
What we did: We built a new dataset which addressed those problems and hand-labeled ourselves
“8. The graph f.x) is given below.
Evaluate Sr(*) adx.3H107146E”
Extracted text
“47 and 48 The graphs of a
function f and its derivative f! are
shown. Which is f' bigger, (-1) or
(1)? f" 47. 48.” Extracted text
Improving our Engine
• Detect important layouts from the image
• Replace Text Extraction (Amazon Rekognition) with our own model in Amazon SageMaker
How we did it:
With Amazon SageMaker, we could
easily deploy and scale our model
SageMaker
Architecture
Achievement: Detecting Layouts
Ours Ground Truth Google Vision API
0.54
0.38
0.05
0.00
0.10
0.20
0.30
0.40
0.50
0.60
Our model Google Vision API AWS Rekognition API
Comparing Layout Analysis Performance
(F-Score)
Achievement: Extracting Text
Extracted text
Future Plans
Future Plan (1): Using User-Labeled Data
Ask students if
search result
was helpful
Future Plan (2): Normalizing the Problem
Determine how to split variables from a problem and normalize
Future Plan (3): Auto-solving
Solve simple questions automatically
“If You Define the Problem Correctly,
You Almost Have the Solution”
Steve Jobs (1955 - 2011)
Fireside Chat
Thank you!

More Related Content

What's hot

My Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine LearningMy Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine Learning
Daniel Tunkelang
 

What's hot (10)

A Beginner's Guide to Machine Learning with Scikit-Learn
A Beginner's Guide to Machine Learning with Scikit-LearnA Beginner's Guide to Machine Learning with Scikit-Learn
A Beginner's Guide to Machine Learning with Scikit-Learn
 
Could a Data Science Program use Data Science Insights?
Could a Data Science Program use Data Science Insights?Could a Data Science Program use Data Science Insights?
Could a Data Science Program use Data Science Insights?
 
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
 
Ml3
Ml3Ml3
Ml3
 
Social Network Visualisation
Social Network VisualisationSocial Network Visualisation
Social Network Visualisation
 
Analysis of Effort Estimation Model in Traditional and Agile (USING METRICS ...
Analysis of Effort Estimation Model in Traditional and Agile  (USING METRICS ...Analysis of Effort Estimation Model in Traditional and Agile  (USING METRICS ...
Analysis of Effort Estimation Model in Traditional and Agile (USING METRICS ...
 
Approximated and User Steerable tSNE for Progressive Visual Analytics
Approximated and User Steerable tSNE for Progressive Visual AnalyticsApproximated and User Steerable tSNE for Progressive Visual Analytics
Approximated and User Steerable tSNE for Progressive Visual Analytics
 
My Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine LearningMy Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine Learning
 
computer notes - Introduction to data structures
computer notes - Introduction to data structurescomputer notes - Introduction to data structures
computer notes - Introduction to data structures
 
1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop
 

Similar to How AI Helps Students Solve Math Problems

DeepSearch_Project_Report
DeepSearch_Project_ReportDeepSearch_Project_Report
DeepSearch_Project_Report
Urjit Patel
 
Net campus2015 antimomusone
Net campus2015 antimomusoneNet campus2015 antimomusone
Net campus2015 antimomusone
DotNetCampus
 
Research Opportunities in India & Keyword Search Over Dynamic Categorized Inf...
Research Opportunities in India & Keyword Search Over Dynamic Categorized Inf...Research Opportunities in India & Keyword Search Over Dynamic Categorized Inf...
Research Opportunities in India & Keyword Search Over Dynamic Categorized Inf...
VNIT-ACM Student Chapter
 
Répondre à la question automatique avec le web
Répondre à la question automatique avec le webRépondre à la question automatique avec le web
Répondre à la question automatique avec le web
Ahmed Hammami
 

Similar to How AI Helps Students Solve Math Problems (20)

Naver learning to rank question answer pairs using hrde-ltc
Naver learning to rank question answer pairs using hrde-ltcNaver learning to rank question answer pairs using hrde-ltc
Naver learning to rank question answer pairs using hrde-ltc
 
DeepSearch_Project_Report
DeepSearch_Project_ReportDeepSearch_Project_Report
DeepSearch_Project_Report
 
Le Machine Learning de A à Z
Le Machine Learning de A à ZLe Machine Learning de A à Z
Le Machine Learning de A à Z
 
Net campus2015 antimomusone
Net campus2015 antimomusoneNet campus2015 antimomusone
Net campus2015 antimomusone
 
PREDICT THE FUTURE , MACHINE LEARNING & BIG DATA
PREDICT THE FUTURE , MACHINE LEARNING & BIG DATAPREDICT THE FUTURE , MACHINE LEARNING & BIG DATA
PREDICT THE FUTURE , MACHINE LEARNING & BIG DATA
 
Research Opportunities in India & Keyword Search Over Dynamic Categorized Inf...
Research Opportunities in India & Keyword Search Over Dynamic Categorized Inf...Research Opportunities in India & Keyword Search Over Dynamic Categorized Inf...
Research Opportunities in India & Keyword Search Over Dynamic Categorized Inf...
 
B017350710
B017350710B017350710
B017350710
 
Efficient Refining Of Why-Not Questions on Top-K Queries
Efficient Refining Of Why-Not Questions on Top-K QueriesEfficient Refining Of Why-Not Questions on Top-K Queries
Efficient Refining Of Why-Not Questions on Top-K Queries
 
Wecp all-india-test-series-program-brochure
Wecp all-india-test-series-program-brochureWecp all-india-test-series-program-brochure
Wecp all-india-test-series-program-brochure
 
Wecp all-india-test-series-program-brochure
Wecp all-india-test-series-program-brochureWecp all-india-test-series-program-brochure
Wecp all-india-test-series-program-brochure
 
ds 1.pptx
ds 1.pptxds 1.pptx
ds 1.pptx
 
Répondre à la question automatique avec le web
Répondre à la question automatique avec le webRépondre à la question automatique avec le web
Répondre à la question automatique avec le web
 
Machine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An IntroMachine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An Intro
 
Dice.com Bay Area Search - Beyond Learning to Rank Talk
Dice.com Bay Area Search - Beyond Learning to Rank TalkDice.com Bay Area Search - Beyond Learning to Rank Talk
Dice.com Bay Area Search - Beyond Learning to Rank Talk
 
Maisa Penha - Art of Possible.pdf
Maisa Penha - Art of Possible.pdfMaisa Penha - Art of Possible.pdf
Maisa Penha - Art of Possible.pdf
 
data-science-pdf-16588.pdf
data-science-pdf-16588.pdfdata-science-pdf-16588.pdf
data-science-pdf-16588.pdf
 
Module 1.3 data exploratory
Module 1.3  data exploratoryModule 1.3  data exploratory
Module 1.3 data exploratory
 
QuerySurge AI webinar
QuerySurge AI webinarQuerySurge AI webinar
QuerySurge AI webinar
 
Machine learning workshop @DYP Pune
Machine learning workshop @DYP PuneMachine learning workshop @DYP Pune
Machine learning workshop @DYP Pune
 
Intelligent Software Engineering: Synergy between AI and Software Engineering
Intelligent Software Engineering: Synergy between AI and Software EngineeringIntelligent Software Engineering: Synergy between AI and Software Engineering
Intelligent Software Engineering: Synergy between AI and Software Engineering
 

More from Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

How AI Helps Students Solve Math Problems

  • 1. July 10-11, 2019 The Conference Center,
  • 2. How AI Helps Students Solve Math Problems Hwechul Derrick Cho Principal AI Research Engineer ST Unitas & The Princeton Review
  • 5. The Princeton Review’s Earlier Work History • Tutoring Service since 1998 • Homework Help Mobile in 2015 • Live tutoring PC to mobile Opportunity • Generation Z expects a shorter feedback cycle
  • 8. How can we provide students an answer as fast as possible?
  • 9. Our Solution: A Problem Search Engine
  • 10. Example Math Problems Student’s Query Search Result Top1 Search Result Top2 Search Result Top3
  • 11. How the Problem Search Engine Works 5
  • 12. Why a Problem Search Engine is Valuable Business side • Lower the tutoring cost • More questions and more data Student side • Find answers quickly • More affordable tutoring
  • 13. Steps Taken & Learnings
  • 14. Define (1) Build In-house Dataset Needs • Evaluate performance • Access similar images; however, no public dataset matched our case What we did: Took 1,000 photos of a problem from our book Example of The Princeton Review SAT
  • 15. Define (2) Augmentation and Pairing True Label: Original Image (1,000 images) Original Image Input Data: Augmented Images (5 per each original image) Augmented Images Finding Original Image
  • 16. Define (3) Set Out Baseline Model How • Baseline doesn’t need to be state-of-the-art • Calculated similarity distance of Perceptual Hash (pHash) Result: Top@5 Accuracy ≈ 30%
  • 17. Solving (1) Search Similar Images How • Use distance of two images’ representation • Our baseline, pHash, is also image representation • We used ImageNet models to represent images to vector Result: Top@5 Accuracy ≈ 50% Example of image representation architecture Vector representation RGB Image
  • 18. Solving (2) Search Similar Texts Example of text-only image Problem: Text-only math problems
  • 19. Solving (2) Use Amazon Rekognition • Amazon Rekognition was the fastest way to proof of concept Result: Top@5 Accuracy ≈ 72% Amazon Rekognition example Extracted text
  • 20. Solving (3) Search Similar Images with Texts Vector representation RGB Image + • Combine two similarity scores • Use simple grid search algorithm to find optimal combine factor Result: Top@5 Accuracy ≈ 81%
  • 21. Uncovered Blind Spots to Keep Iterating • Didn’t recognize mathematic symbols or different fonts • Text extracted from graphs unhelpful What we did: We built a new dataset which addressed those problems and hand-labeled ourselves “8. The graph f.x) is given below. Evaluate Sr(*) adx.3H107146E” Extracted text “47 and 48 The graphs of a function f and its derivative f! are shown. Which is f' bigger, (-1) or (1)? f" 47. 48.” Extracted text
  • 22. Improving our Engine • Detect important layouts from the image • Replace Text Extraction (Amazon Rekognition) with our own model in Amazon SageMaker How we did it: With Amazon SageMaker, we could easily deploy and scale our model SageMaker Architecture
  • 23. Achievement: Detecting Layouts Ours Ground Truth Google Vision API 0.54 0.38 0.05 0.00 0.10 0.20 0.30 0.40 0.50 0.60 Our model Google Vision API AWS Rekognition API Comparing Layout Analysis Performance (F-Score)
  • 26. Future Plan (1): Using User-Labeled Data Ask students if search result was helpful
  • 27. Future Plan (2): Normalizing the Problem Determine how to split variables from a problem and normalize
  • 28. Future Plan (3): Auto-solving Solve simple questions automatically
  • 29. “If You Define the Problem Correctly, You Almost Have the Solution” Steve Jobs (1955 - 2011)