MOOCdb:
Developing Data Standards for MOOCs
Srećko Joksimović
s.joksimovic@ed.ac.uk
@s_joksimovic
Kalyan Veeramachaneni
kalyan@csail.mit.edu
Dragan Gašević
dragan.gasevic@ed.ac.uk
FutureLearn Academic Network Conference
15 June 2015
Data sources
SQL_anonymized_forum.sql
SQL_anonymized_general.sql
SQL_hash_mapping.sql
standard extract.csv
SQL_unanonymizable.sql.gz
clickstream_export.gz
Personal data
Demographic data
…
Data sources
Weekly data packages
auth_user-{site}-analytics.sql
auth_userprofile-{site}-analytics.sql
certificates_generatedcertificate-{site}-analytics.sql
Daily data packages
course_structure-{site}-analytics.json
courseware_studentmodule-{site}-analytics.sql
email_opt_in-{site}-analytics.csv
student_courseenrollment-{site}-analytics.sql
user_api_usercoursetag-{site}-analytics.sql
user_id_map-{site}-analytics.sql
{org}-{course}-{date}-{site}.mongo
wiki_article-{site}-analytics.sql
wiki_articlerevision-{site}-analytics.sql
{org}-{site}-events-{date}.log.gz.gpg
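
The naming patterns above can be used to sort a data package into table exports and event logs before any translation step. The following is a minimal sketch, not part of the original slides: the regular expressions are an assumption about how the placeholders are filled (ISO-style dates, no hyphens in {org} or {site} for event logs), the function name classify is hypothetical, and the .mongo pattern is deliberately left unhandled.

```python
import re

# Patterns mirror the file names listed on the slide. How the placeholders
# are actually filled is an assumption (ISO dates, hyphen-free org/site).
ANALYTICS = re.compile(
    r"^(?P<table>[a-z_]+)-(?P<site>.+)-analytics\.(?P<ext>sql|json|csv)$"
)
EVENTS = re.compile(
    r"^(?P<org>[^-]+)-(?P<site>[^-]+)-events-"
    r"(?P<date>\d{4}-\d{2}-\d{2})\.log\.gz\.gpg$"
)


def classify(name):
    """Return a dict describing which kind of export a file name looks like."""
    match = ANALYTICS.match(name)
    if match:
        return {"kind": "table_export", **match.groupdict()}
    match = EVENTS.match(name)
    if match:
        return {"kind": "event_log", **match.groupdict()}
    # Anything else (including the .mongo dumps) is left for manual handling.
    return {"kind": "unknown"}


if __name__ == "__main__":
    # Sample names are made up for illustration.
    for sample in ("auth_user-edx-analytics.sql",
                   "mitx-edx-events-2015-06-15.log.gz.gpg"):
        print(sample, classify(sample))
```

Keeping the patterns in one place means a change in the provider's naming scheme only needs one edit.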
Challenges
• Analytics across several courses
• Analytics across different platforms
• Analytics across different institutions
• Sharing data
Solution?
• Collaborative data science platform
– Standardize data storage
– Generalizable across courses and data providers
(currently OpenEdX, edX and Coursera)
– “Data being shared without data being
exchanged”
– Sharing and reproducing the results
MOOC data science commons
MOOCdb
Observing mode
- Observed Events table
- Resources table
- Resources Types table
- URLs table
- Resource URLs table
Submitting mode
- Problem Types table
- Problems table
- Submissions table
- Assessments table
Collaborating mode
- Collaborations table
- Collaboration Types table
Feedback mode
- Feedbacks table
- Questions table
- Answers table
- Surveys table
User information
- User PII table
- Global User table
- Course User table
http://moocdb.csail.mit.edu/wiki/index.php?title=MOOCdb
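
One way to use the table list above when translating a new data source is to check which tables a translated database still lacks, grouped by mode. This is a sketch under stated assumptions: the snake_case table identifiers are my rendering of the slide's table names and may differ from the published schema, SQLite stands in for whatever database is actually used, and missing_tables is a hypothetical helper.

```python
import sqlite3

# Modes and tables as listed on the slide. The snake_case identifiers are
# an assumed rendering, not confirmed names from the MOOCdb schema.
MODES = {
    "observing": ["observed_events", "resources", "resource_types",
                  "urls", "resource_urls"],
    "submitting": ["problem_types", "problems", "submissions", "assessments"],
    "collaborating": ["collaborations", "collaboration_types"],
    "feedback": ["feedbacks", "questions", "answers", "surveys"],
    "user_information": ["user_pii", "global_user", "course_user"],
}


def missing_tables(conn):
    """Map each mode to the expected tables that are absent from the database."""
    rows = conn.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
    present = {row[0] for row in rows}
    report = {}
    for mode, tables in MODES.items():
        absent = [t for t in tables if t not in present]
        if absent:
            report[mode] = absent
    return report


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    # Create only the observing-mode tables, with a placeholder column.
    for table in MODES["observing"]:
        conn.execute(f"CREATE TABLE {table} (id INTEGER PRIMARY KEY)")
    print(missing_tables(conn))
```

A check like this only verifies table presence; column-level conformance would need the actual schema definition.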
Collaborative platform and
applications
edX Coursera MOOCdb doc Github repo
Feature factory LabelMe Digital learner quantified Problem analytics
My MOOCViz Social network analysis Forum analysis Dropout prediction
Current state
• Established network of institutions
– MIT, Stanford, University of Michigan, University
of Edinburgh, University of Queensland, University
of Texas (Austin)
• Release of open source software
• Development and release of the first data
analytics framework
Next steps
Digital Learner Quantified
Discussion forum analysis
LabelMe
Problem analytics
Dropout prediction
Social network analysis
Collaboration
• If you are interested in…
– Development
– Feature modeling
– Translating your data
– Testing
kalyan@csail.mit.edu
s.joksimovic@ed.ac.uk
Q&A

Editor's Notes

  1. MOOCdb is our solution for centralizing and generalizing MOOC data organization and for providing general-purpose analytics for MOOC education research.
  2. “How does the amount of time spent on videos during a certain week correlate with performance on the homework?”
  3. Can we have standardized data storage? Sharing and reproducing the results: When they publish research, analysts can share their scripts by depositing them in a public archive, where the scripts are retrievable and cross-referenced to their donor and publication.
  4. The MOOCdb project aims to bring together educational researchers, computer science researchers, machine learning researchers, technologists, and database and big data experts to advance MOOC data science. The project, founded at MIT, includes a platform-agnostic functional data model for data exhaust from MOOCs, a collaborative, open-source, open-access data visualization framework, a crowdsourced knowledge discovery framework, and a privacy-preserving software framework. The team is currently working to release a number of these tools and frameworks as open source. What does MOOCdb provide? Concise data storage: MOOCdb's proposed schema is “lossless” with respect to research-relevant information, i.e., no information is lost in translating raw data to it. Access control levels for anonymized data: The data schema offers an organized means of structuring anonymized user identities to safeguard them further.
  5. Sharing of data extraction scripts: Scripts for data extraction and descriptive statistics extraction can be open source and shared by everyone because they reference data organized according to the schema. Crowdsourcing potential: Machine learning frequently involves humans identifying explanatory variables that could drive a response. Enabling the crowd to help propose variables could greatly scale the community's progress in mining MOOC data.
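
To illustrate why a shared schema makes scripts reusable, here is a minimal sketch of a script for the question in note 2 (video time in a given week versus homework performance). Everything concrete in it is an assumption for illustration: the two tables are flattened stand-ins for MOOCdb's observed-events and assessments tables, the column names (resource_type, week, duration_seconds, grade) are hypothetical, SQLite is used only so the example runs on its own, and the toy rows are invented.

```python
import sqlite3
from math import sqrt

# Hypothetical, flattened stand-ins for two MOOCdb tables. The real schema
# spreads this information over more tables; all column names are assumed.
SCHEMA = """
CREATE TABLE observed_events (user_id INTEGER, resource_type TEXT,
                              week INTEGER, duration_seconds REAL);
CREATE TABLE assessments (user_id INTEGER, week INTEGER, grade REAL);
"""

# Per-user total video time and mean grade for one week.
QUERY = """
SELECT v.user_id, v.video_seconds, h.mean_grade
FROM (SELECT user_id, SUM(duration_seconds) AS video_seconds
      FROM observed_events
      WHERE resource_type = 'video' AND week = :week
      GROUP BY user_id) AS v
JOIN (SELECT user_id, AVG(grade) AS mean_grade
      FROM assessments
      WHERE week = :week
      GROUP BY user_id) AS h
  ON h.user_id = v.user_id
ORDER BY v.user_id
"""


def pearson(xs, ys):
    """Pearson correlation; returns None when it is undefined."""
    n = len(xs)
    if n < 2:
        return None
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        return None
    return cov / (sx * sy)


def video_time_vs_homework(conn, week):
    """Correlate weekly video time with mean homework grade across users."""
    rows = conn.execute(QUERY, {"week": week}).fetchall()
    return pearson([r[1] for r in rows], [r[2] for r in rows])


def demo_connection():
    """Build an in-memory database with invented toy rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO observed_events VALUES (?, ?, ?, ?)", [
        (1, "video", 3, 60.0), (1, "video", 3, 40.0),
        (2, "video", 3, 200.0), (3, "video", 3, 300.0),
        (3, "forum", 3, 999.0),   # not a video event, filtered out
        (1, "video", 4, 500.0),   # different week, filtered out
    ])
    conn.executemany("INSERT INTO assessments VALUES (?, ?, ?)", [
        (1, 3, 50.0), (2, 3, 70.0), (3, 3, 90.0), (1, 4, 10.0),
    ])
    return conn


if __name__ == "__main__":
    print(video_time_vs_homework(demo_connection(), week=3))
```

Because the script refers only to schema-level table and column names, the same file could in principle be run against any course translated into that schema, which is the point of notes 3 and 5.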