SlideShare a Scribd company logo
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, VOL. 7, NO. 3, JULY-SEPTEMBER 2014
Mining Social Media Data for Understanding
Students’ Learning Experiences
Xin Chen, Student Member, IEEE, Mihaela Vorvoreanu, and Krishna Madha
Presented By
Biplab Chandra Debnath
ID: 1015312004
Institute of Information and Communication Technology (IICT)
Bangladesh University of Engineering and Technology (BUET)
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Contents
 Objectives
 Introduction
 Related Works
 Data Collection
 Inductive Content Analysis
 Naïve Bayes Multilevel Classifier
 Comparison Experiment
 Detect Students Problems From Purdue Data Set
 Limitations and Future Work
 Conclusion
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Objectives
 Demonstrating workflow of social media data sense
making for education data mining.
 Integrating both qualitative analysis and large scale data
mining techniques
 Exploring engineering students informal conversations on
twitter.
 Understanding issues and problems students encounter
in their learning experiences.
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Introduction
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Related Work
 Public Discourse on the Web
 Goffman’s theory (notion of front-stage and back-stage of people’s
social performances)
 Mining Twitter Data
 Analyze tweets with hastag #iranElection
 Popular classification model (Decision tree, Logistic regression,
Maximum entropy, Boosting, SVM)
 Learning Analytics and Educational Data Mining
 CMS, VLE, EDM (blackboard.com)
 Identify students academic performances
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
 Radian6 (http://www.salesforce.com/)
 Twitter APIs
 Keywords: engineer, students, campus, class, homework,
professor, and lab.
 Twitter hashtag #engineeringProblems occurring most
frequently
 25,284 tweets with the hashtag #engineeringProblems posted
from 10,239 unique Twitter accounts.
 Considering only 2785 tweets
 39,095 tweets with the hashtag #engineeringProblems posted
from 5,592 unique Twitter accounts (Purdue University)
Data Collection
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
 Non-mutually exclusive categories
Development of Categories
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
 Naïve Bayes classifier is effective on this data set compared to other
multi-label classifiers.
 Text Pre-Processing
 Naïve Bayes multilevel classifier
 Evaluation Measures for Multi-Label Classifier
 Classification Result
Naïve Bayes Multilevel Classification
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
 Remove all #tag, negative emotions, repeating letters
(huuungryyy)
 Used the Krovetz stemmer in the Lemur information
retrieval toolkit
 Remove the common stop words (much, more, all, always,
still, only)
Text Pre-Processing
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Naïve Bayes multilevel classifier
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Naïve Bayes multilevel classifier
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Naïve Bayes multilevel classifier
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Example Based Classification Measures
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Example Based Classification Measures
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Example Based Classification Measures
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Label-Based Evaluation Measures
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Label-Based Evaluation Measures
Macro-averaged F1 is higher for classifiers work better on
smaller categories.
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Label-Based Evaluation Measures
Label based accuracy is not a very effective measure to
account label imbalance.
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Comparison Experiment: SVM and M3L
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Comparison Experiment: SVM and M3L
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Comparison Experiment: SVM and M3L
Same training and testing data sets
 One-versus-all SVM multi-label classifier classified all
tweets into not in the category for all categories.
 Max-Margin Multi-Label classifier takes label correlation.
 The performance is better than the simplistic one-versus-
all SVM classifier.
 But still not as good as the Naive Bayes classifier.
 Because SVM is not a probabilistic model
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Detect Students Problems From Purdue Data Set
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Detect Students Problems From Purdue Data Set
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
Detect Students Problems From Purdue Data Set
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
 First, not all students are active on Twitter.
 Second, consideration on only negative aspect but not
positive on learning experiences
 Third, identified the prominent themes with relatively large
number of tweets in the data.
 Fourth, the qualitative analysis reveals that there are
correlations among the themes.
Limitations
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
 First, The “manipulation” of personal image online may
need to be taken into considerations in future work.
 Second, Future work can compare both the good and bad
things to investigate the tradeoffs with which students
struggle.
 Third, Future work can be done to design more
sophisticated algorithms in order to reveal the hidden
information in the “long tail”.
 Fourth, Future work could specifically address the
correlations among these student problems.
Future Work
© Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016
 Through a qualitative content analysis, we found that
engineering students are largely struggling with the heavy
study load, and are not able to manage it successfully.
 Heavy study load leads to many consequences including
lack of social engagement, sleep problems, and other
psychological and physical health problems.
 This detector can be applied as a monitoring mechanism
to identify at-risk students.
Conclusion

More Related Content

Viewers also liked

Data Mining Ieee Papers Trichy
Data Mining Ieee Papers TrichyData Mining Ieee Papers Trichy
Data Mining Ieee Papers Trichykrish madhi
 
Presentation data mining(1)
Presentation data mining(1)Presentation data mining(1)
Presentation data mining(1)cegonsoft1999
 
Cloud computing 2015 ieee papers Data mining ieee project titles
Cloud computing  2015 ieee papers  Data mining ieee project titlesCloud computing  2015 ieee papers  Data mining ieee project titles
Cloud computing 2015 ieee papers Data mining ieee project titlesDoClick Solutions
 
Project center in trichy @ieee 2016 17 titles for java and dotnet
Project center in trichy @ieee 2016 17 titles for java and dotnetProject center in trichy @ieee 2016 17 titles for java and dotnet
Project center in trichy @ieee 2016 17 titles for java and dotnetElakkiya Triplen
 
MINING HEALTH EXAMINATION RECORDS A GRAPH-BASED APPROACH
MINING HEALTH EXAMINATION RECORDS  A GRAPH-BASED APPROACHMINING HEALTH EXAMINATION RECORDS  A GRAPH-BASED APPROACH
MINING HEALTH EXAMINATION RECORDS A GRAPH-BASED APPROACHNexgen Technology
 
Mining Electronic Health Records for Insights
Mining Electronic Health Records for InsightsMining Electronic Health Records for Insights
Mining Electronic Health Records for InsightsOntotext
 
Practical experience of Scrum through project-based learning mediated by ICT ...
Practical experience of Scrum through project-based learning mediated by ICT ...Practical experience of Scrum through project-based learning mediated by ICT ...
Practical experience of Scrum through project-based learning mediated by ICT ...Sergio Yazyi
 
Final year IEEE,NON IEEE projects for 2013-14 for BCA,BTECH,Diploma,Mtech,MCA
Final year IEEE,NON IEEE projects for 2013-14 for BCA,BTECH,Diploma,Mtech,MCAFinal year IEEE,NON IEEE projects for 2013-14 for BCA,BTECH,Diploma,Mtech,MCA
Final year IEEE,NON IEEE projects for 2013-14 for BCA,BTECH,Diploma,Mtech,MCAprojectsepark
 
Introduction for skills seminar on Search and Data Mining, Master of European...
Introduction for skills seminar on Search and Data Mining, Master of European...Introduction for skills seminar on Search and Data Mining, Master of European...
Introduction for skills seminar on Search and Data Mining, Master of European...Gerben Zaagsma
 
SMART HEALTH PREDICTION USING DATA MINING by Dr.Mahboob Khan Phd
SMART HEALTH PREDICTION USING DATA MINING by Dr.Mahboob Khan PhdSMART HEALTH PREDICTION USING DATA MINING by Dr.Mahboob Khan Phd
SMART HEALTH PREDICTION USING DATA MINING by Dr.Mahboob Khan PhdHealthcare consultant
 
Smart health prediction using data mining by customsoft
Smart health prediction using data mining by customsoftSmart health prediction using data mining by customsoft
Smart health prediction using data mining by customsoftCustom Soft
 
Collaborazione nelle comunità open source: tecniche e strumenti
Collaborazione nelle comunità open source: tecniche e strumentiCollaborazione nelle comunità open source: tecniche e strumenti
Collaborazione nelle comunità open source: tecniche e strumentiFilippo Lanubile
 
What can Bioinformaticians learn from YouTube?
What can Bioinformaticians learn from YouTube?What can Bioinformaticians learn from YouTube?
What can Bioinformaticians learn from YouTube?Matt Wood
 
DOs and DONT’s of Social Analytics
DOs and DONT’s of Social AnalyticsDOs and DONT’s of Social Analytics
DOs and DONT’s of Social AnalyticsChristophe Lauer
 
Towards Discovering the Role of Emotions in Stack Overflow
Towards Discovering the Role of Emotions in Stack OverflowTowards Discovering the Role of Emotions in Stack Overflow
Towards Discovering the Role of Emotions in Stack OverflowNicole Novielli
 
Big Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency ManagementBig Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency ManagementBYTE Project
 

Viewers also liked (20)

Data Mining Ieee Papers Trichy
Data Mining Ieee Papers TrichyData Mining Ieee Papers Trichy
Data Mining Ieee Papers Trichy
 
Presentation data mining(1)
Presentation data mining(1)Presentation data mining(1)
Presentation data mining(1)
 
Cloud computing 2015 ieee papers Data mining ieee project titles
Cloud computing  2015 ieee papers  Data mining ieee project titlesCloud computing  2015 ieee papers  Data mining ieee project titles
Cloud computing 2015 ieee papers Data mining ieee project titles
 
Project center in trichy @ieee 2016 17 titles for java and dotnet
Project center in trichy @ieee 2016 17 titles for java and dotnetProject center in trichy @ieee 2016 17 titles for java and dotnet
Project center in trichy @ieee 2016 17 titles for java and dotnet
 
MINING HEALTH EXAMINATION RECORDS A GRAPH-BASED APPROACH
MINING HEALTH EXAMINATION RECORDS  A GRAPH-BASED APPROACHMINING HEALTH EXAMINATION RECORDS  A GRAPH-BASED APPROACH
MINING HEALTH EXAMINATION RECORDS A GRAPH-BASED APPROACH
 
Mining Electronic Health Records for Insights
Mining Electronic Health Records for InsightsMining Electronic Health Records for Insights
Mining Electronic Health Records for Insights
 
Practical experience of Scrum through project-based learning mediated by ICT ...
Practical experience of Scrum through project-based learning mediated by ICT ...Practical experience of Scrum through project-based learning mediated by ICT ...
Practical experience of Scrum through project-based learning mediated by ICT ...
 
PPT FOR BIG
PPT FOR BIGPPT FOR BIG
PPT FOR BIG
 
Final year IEEE,NON IEEE projects for 2013-14 for BCA,BTECH,Diploma,Mtech,MCA
Final year IEEE,NON IEEE projects for 2013-14 for BCA,BTECH,Diploma,Mtech,MCAFinal year IEEE,NON IEEE projects for 2013-14 for BCA,BTECH,Diploma,Mtech,MCA
Final year IEEE,NON IEEE projects for 2013-14 for BCA,BTECH,Diploma,Mtech,MCA
 
Data mining
Data miningData mining
Data mining
 
Text categorization
Text categorizationText categorization
Text categorization
 
Introduction for skills seminar on Search and Data Mining, Master of European...
Introduction for skills seminar on Search and Data Mining, Master of European...Introduction for skills seminar on Search and Data Mining, Master of European...
Introduction for skills seminar on Search and Data Mining, Master of European...
 
SMART HEALTH PREDICTION USING DATA MINING by Dr.Mahboob Khan Phd
SMART HEALTH PREDICTION USING DATA MINING by Dr.Mahboob Khan PhdSMART HEALTH PREDICTION USING DATA MINING by Dr.Mahboob Khan Phd
SMART HEALTH PREDICTION USING DATA MINING by Dr.Mahboob Khan Phd
 
Smart health prediction using data mining by customsoft
Smart health prediction using data mining by customsoftSmart health prediction using data mining by customsoft
Smart health prediction using data mining by customsoft
 
Collaborazione nelle comunità open source: tecniche e strumenti
Collaborazione nelle comunità open source: tecniche e strumentiCollaborazione nelle comunità open source: tecniche e strumenti
Collaborazione nelle comunità open source: tecniche e strumenti
 
Kaggle's WISE 2014 challenge
Kaggle's WISE 2014 challenge Kaggle's WISE 2014 challenge
Kaggle's WISE 2014 challenge
 
What can Bioinformaticians learn from YouTube?
What can Bioinformaticians learn from YouTube?What can Bioinformaticians learn from YouTube?
What can Bioinformaticians learn from YouTube?
 
DOs and DONT’s of Social Analytics
DOs and DONT’s of Social AnalyticsDOs and DONT’s of Social Analytics
DOs and DONT’s of Social Analytics
 
Towards Discovering the Role of Emotions in Stack Overflow
Towards Discovering the Role of Emotions in Stack OverflowTowards Discovering the Role of Emotions in Stack Overflow
Towards Discovering the Role of Emotions in Stack Overflow
 
Big Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency ManagementBig Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency Management
 

Similar to Data mining on social networks for students learning experiences

Eecs6893 big dataanalytics-lecture1
Eecs6893 big dataanalytics-lecture1Eecs6893 big dataanalytics-lecture1
Eecs6893 big dataanalytics-lecture1Aravindharamanan S
 
Putting Data to Work: Moving science forward together beyond where we thought...
Putting Data to Work: Moving science forward together beyond where we thought...Putting Data to Work: Moving science forward together beyond where we thought...
Putting Data to Work: Moving science forward together beyond where we thought...Erin Robinson
 
Austin Koenig Resume
Austin Koenig ResumeAustin Koenig Resume
Austin Koenig ResumeAustin Koenig
 
AAPBriefingNov2015
AAPBriefingNov2015AAPBriefingNov2015
AAPBriefingNov2015Mark Conway
 
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...Christophe Debruyne
 
AGU Leptoukh Lecture: Putting Data to Work: Moving science forward together b...
AGU Leptoukh Lecture: Putting Data to Work: Moving science forward together b...AGU Leptoukh Lecture: Putting Data to Work: Moving science forward together b...
AGU Leptoukh Lecture: Putting Data to Work: Moving science forward together b...Erin Robinson
 
EDUFEST 2015 at IIT MADRAS - Presentation on Technology Driven Transformation...
EDUFEST 2015 at IIT MADRAS - Presentation on Technology Driven Transformation...EDUFEST 2015 at IIT MADRAS - Presentation on Technology Driven Transformation...
EDUFEST 2015 at IIT MADRAS - Presentation on Technology Driven Transformation...Eduexcellence
 
Introduction to-data-science
Introduction to-data-scienceIntroduction to-data-science
Introduction to-data-scienceAhmad karawash
 
An Engineering Technology Capstone Project The Snow Load Network.pdf
An Engineering Technology Capstone Project  The Snow Load Network.pdfAn Engineering Technology Capstone Project  The Snow Load Network.pdf
An Engineering Technology Capstone Project The Snow Load Network.pdfAshley Hernandez
 
Big Data Analytics IEEE 2015 Projects
Big Data Analytics IEEE 2015 ProjectsBig Data Analytics IEEE 2015 Projects
Big Data Analytics IEEE 2015 ProjectsVijay Karan
 
Towards Quality-Aware Development of Big Data Applications with DICE
Towards Quality-Aware Development of Big Data Applications with DICETowards Quality-Aware Development of Big Data Applications with DICE
Towards Quality-Aware Development of Big Data Applications with DICEPooyan Jamshidi
 
Future of the Higher Education Estate - Impact of COVID-19 and Net-Zero Targe...
Future of the Higher Education Estate - Impact of COVID-19 and Net-Zero Targe...Future of the Higher Education Estate - Impact of COVID-19 and Net-Zero Targe...
Future of the Higher Education Estate - Impact of COVID-19 and Net-Zero Targe...IES VE
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI dayMohammed Barakat
 
UKSG Jisc learninganalytics-3june2016
UKSG Jisc learninganalytics-3june2016UKSG Jisc learninganalytics-3june2016
UKSG Jisc learninganalytics-3june2016Paul Bailey
 
flight data analysis using big data
flight data analysis using big data flight data analysis using big data
flight data analysis using big data Sanjib Mitra
 
Institutional Repository Single Sources of Truth
Institutional Repository Single Sources of TruthInstitutional Repository Single Sources of Truth
Institutional Repository Single Sources of TruthLighton Phiri
 
Certified Big Data Science Analyst (CBDSA)
Certified Big Data Science Analyst (CBDSA)Certified Big Data Science Analyst (CBDSA)
Certified Big Data Science Analyst (CBDSA)GICTTraining
 
ATTRIBUTES AND EVALUATION OF DISCUSSION CONTRIBUTIONSDue Date E.docx
ATTRIBUTES AND EVALUATION OF DISCUSSION CONTRIBUTIONSDue Date E.docxATTRIBUTES AND EVALUATION OF DISCUSSION CONTRIBUTIONSDue Date E.docx
ATTRIBUTES AND EVALUATION OF DISCUSSION CONTRIBUTIONSDue Date E.docxrosemaryralphs52525
 
The SWIMing project
The SWIMing projectThe SWIMing project
The SWIMing projectLD4SC
 

Similar to Data mining on social networks for students learning experiences (20)

Eecs6893 big dataanalytics-lecture1
Eecs6893 big dataanalytics-lecture1Eecs6893 big dataanalytics-lecture1
Eecs6893 big dataanalytics-lecture1
 
Putting Data to Work: Moving science forward together beyond where we thought...
Putting Data to Work: Moving science forward together beyond where we thought...Putting Data to Work: Moving science forward together beyond where we thought...
Putting Data to Work: Moving science forward together beyond where we thought...
 
Austin Koenig Resume
Austin Koenig ResumeAustin Koenig Resume
Austin Koenig Resume
 
AAPBriefingNov2015
AAPBriefingNov2015AAPBriefingNov2015
AAPBriefingNov2015
 
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
 
AGU Leptoukh Lecture: Putting Data to Work: Moving science forward together b...
AGU Leptoukh Lecture: Putting Data to Work: Moving science forward together b...AGU Leptoukh Lecture: Putting Data to Work: Moving science forward together b...
AGU Leptoukh Lecture: Putting Data to Work: Moving science forward together b...
 
EDUFEST 2015 at IIT MADRAS - Presentation on Technology Driven Transformation...
EDUFEST 2015 at IIT MADRAS - Presentation on Technology Driven Transformation...EDUFEST 2015 at IIT MADRAS - Presentation on Technology Driven Transformation...
EDUFEST 2015 at IIT MADRAS - Presentation on Technology Driven Transformation...
 
Introduction to-data-science
Introduction to-data-scienceIntroduction to-data-science
Introduction to-data-science
 
An Engineering Technology Capstone Project The Snow Load Network.pdf
An Engineering Technology Capstone Project  The Snow Load Network.pdfAn Engineering Technology Capstone Project  The Snow Load Network.pdf
An Engineering Technology Capstone Project The Snow Load Network.pdf
 
Big Data Analytics IEEE 2015 Projects
Big Data Analytics IEEE 2015 ProjectsBig Data Analytics IEEE 2015 Projects
Big Data Analytics IEEE 2015 Projects
 
Towards Quality-Aware Development of Big Data Applications with DICE
Towards Quality-Aware Development of Big Data Applications with DICETowards Quality-Aware Development of Big Data Applications with DICE
Towards Quality-Aware Development of Big Data Applications with DICE
 
Future of the Higher Education Estate - Impact of COVID-19 and Net-Zero Targe...
Future of the Higher Education Estate - Impact of COVID-19 and Net-Zero Targe...Future of the Higher Education Estate - Impact of COVID-19 and Net-Zero Targe...
Future of the Higher Education Estate - Impact of COVID-19 and Net-Zero Targe...
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI day
 
UKSG Jisc learninganalytics-3june2016
UKSG Jisc learninganalytics-3june2016UKSG Jisc learninganalytics-3june2016
UKSG Jisc learninganalytics-3june2016
 
flight data analysis using big data
flight data analysis using big data flight data analysis using big data
flight data analysis using big data
 
Jess Dudoff Resume
Jess Dudoff ResumeJess Dudoff Resume
Jess Dudoff Resume
 
Institutional Repository Single Sources of Truth
Institutional Repository Single Sources of TruthInstitutional Repository Single Sources of Truth
Institutional Repository Single Sources of Truth
 
Certified Big Data Science Analyst (CBDSA)
Certified Big Data Science Analyst (CBDSA)Certified Big Data Science Analyst (CBDSA)
Certified Big Data Science Analyst (CBDSA)
 
ATTRIBUTES AND EVALUATION OF DISCUSSION CONTRIBUTIONSDue Date E.docx
ATTRIBUTES AND EVALUATION OF DISCUSSION CONTRIBUTIONSDue Date E.docxATTRIBUTES AND EVALUATION OF DISCUSSION CONTRIBUTIONSDue Date E.docx
ATTRIBUTES AND EVALUATION OF DISCUSSION CONTRIBUTIONSDue Date E.docx
 
The SWIMing project
The SWIMing projectThe SWIMing project
The SWIMing project
 

Recently uploaded

Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBAlireza Kamrani
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Domenico Conte
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单enxupq
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?DOT TECH
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundOppotus
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .NABLAS株式会社
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单ewymefz
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...elinavihriala
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单ewymefz
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单vcaxypu
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhArpitMalhotra16
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单yhkoc
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIAlejandraGmez176757
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单vcaxypu
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJames Polillo
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesStarCompliance.io
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatheahmadsaood
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsalex933524
 

Recently uploaded (20)

Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDB
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 

Data mining on social networks for students learning experiences

  • 1. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, VOL. 7, NO. 3, JULY-SEPTEMBER 2014 Mining Social Media Data for Understanding Students’ Learning Experiences Xin Chen, Student Member, IEEE, Mihaela Vorvoreanu, and Krishna Madha Presented By Biplab Chandra Debnath ID: 1015312004 Institute of Information and Communication Technology (IICT) Bangladesh University of Engineering and Technology (BUET)
  • 2. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Contents  Objectives  Introduction  Related Works  Data Collection  Inductive Content Analysis  Naïve Bayes Multilevel Classifier  Comparison Experiment  Detect Students Problems From Purdue Data Set  Limitations and Future Work  Conclusion
  • 3. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Objectives  Demonstrating workflow of social media data sense making for education data mining.  Integrating both qualitative analysis and large scale data mining techniques  Exploring engineering students informal conversations on twitter.  Understanding issues and problems students encounter in their learning experiences.
  • 4. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Introduction
  • 5. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Related Work  Public Discourse on the Web  Goffman’s theory (notion of front-stage and back-stage of people’s social performances)  Mining Twitter Data  Analyze tweets with hastag #iranElection  Popular classification model (Decision tree, Logistic regression, Maximum entropy, Boosting, SVM)  Learning Analytics and Educational Data Mining  CMS, VLE, EDM (blackboard.com)  Identify students academic performances
  • 6. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016  Radian6 (http://www.salesforce.com/)  Twitter APIs  Keywords: engineer, students, campus, class, homework, professor, and lab.  Twitter hashtag #engineeringProblems occurring most frequently  25,284 tweets with the hashtag #engineeringProblems posted from 10,239 unique Twitter accounts.  Considering only 2785 tweets  39,095 tweets with the hashtag #engineeringProblems posted from 5,592 unique Twitter accounts (Purdue University) Data Collection
  • 7. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016  Non-mutually exclusive categories Development of Categories
  • 8. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016  Naïve Bayes classifier is effective on this data set compared to other multi-label classifiers.  Text Pre-Processing  Naïve Bayes multilevel classifier  Evaluation Measures for Multi-Label Classifier  Classification Result Naïve Bayes Multilevel Classification
  • 9. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016  Remove all #tag, negative emotions, repeating letters (huuungryyy)  Used the Krovetz stemmer in the Lemur information retrieval toolkit  Remove the common stop words (much, more, all, always, still, only) Text Pre-Processing
  • 10. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Naïve Bayes multilevel classifier
  • 11. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Naïve Bayes multilevel classifier
  • 12. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Naïve Bayes multilevel classifier
  • 13. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Example Based Classification Measures
  • 14. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Example Based Classification Measures
  • 15. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Example Based Classification Measures
  • 16. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Label-Based Evaluation Measures
  • 17. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Label-Based Evaluation Measures Macro-averaged F1 is higher for classifiers work better on smaller categories.
  • 18. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Label-Based Evaluation Measures Label based accuracy is not a very effective measure to account label imbalance.
  • 19. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Comparison Experiment: SVM and M3L
  • 20. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Comparison Experiment: SVM and M3L
  • 21. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Comparison Experiment: SVM and M3L Same training and testing data sets  One-versus-all SVM multi-label classifier classified all tweets into not in the category for all categories.  Max-Margin Multi-Label classifier takes label correlation.  The performance is better than the simplistic one-versus- all SVM classifier.  But still not as good as the Naive Bayes classifier.  Because SVM is not a probabilistic model
  • 22. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Detect Students Problems From Purdue Data Set
  • 23. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Detect Students Problems From Purdue Data Set
  • 24. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016 Detect Students Problems From Purdue Data Set
  • 25. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016  First, not all students are active on Twitter.  Second, consideration on only negative aspect but not positive on learning experiences  Third, identified the prominent themes with relatively large number of tweets in the data.  Fourth, the qualitative analysis reveals that there are correlations among the themes. Limitations
  • 26. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016  First, The “manipulation” of personal image online may need to be taken into considerations in future work.  Second, Future work can compare both the good and bad things to investigate the tradeoffs with which students struggle.  Third, Future work can be done to design more sophisticated algorithms in order to reveal the hidden information in the “long tail”.  Fourth, Future work could specifically address the correlations among these student problems. Future Work
  • 27. © Biplab C. Debnath ICT 6522: Data Warehousing and Mining 10th August, 2016  Through a qualitative content analysis, we found that engineering students are largely struggling with the heavy study load, and are not able to manage it successfully.  Heavy study load leads to many consequences including lack of social engagement, sleep problems, and other psychological and physical health problems.  This detector can be applied as a monitoring mechanism to identify at-risk students. Conclusion