SlideShare a Scribd company logo
Text Mining and
Educational Discourse
Dr. Chi-Un Lei, Dept. of Electrical and Electronic Eng.
LASI-HK 2014
(Adopted from LASI workshop 2014)
1
Words from the Speaker
“The key insight communicated through this
workshop is that …
If we can understand the connection between socio-
psychological processes and language by means of
the social signals encoded in them, we can
structure computational models of language
interactions more effectively.”
--- Carolyn Penstein Rosé
2
Outline
 Theoretical: Connection between discourse and
learning
 From rich but implicit constructs to explicit features
that capture the essence for machine learning
 Hands-on: Machine learning for text extraction
and classification
3
Automatic
Analysis
Of
Conversation
Conversational
Interventions
Positive
Learning
Outcomes
Educational Discourse
4
Sociolinguistics
Discourse Analysis
Language
And
Identity
Language
Use
Machine
Learning
Multi-
Level
Modeling
Applied
Statistics
Computational
Models
Of
Discourse
Analysis
Souffle Framework
5
Engagement Engagement
• Transactive
• Knowledge Integration
Person Person
Authority Authority
 Analysis of discussions for learning
Transactivity
 Building on an idea expressed earlier in a
conversation
 Using a reasoning statement
6
We don't want tmax
to be at 570 both
for the material and
[the Environment]
Well, for power and
efficiency, we want a
high tmax, but
environmentally, we
want a lower one.
7
8
System of Engagement
 Showing openness to the existence of other
perspectives
 Examples
 Nuclear is a good choice
 I consider nuclear to be a good choice
 There’s no denying that nuclear is a superior choice
 Is nuclear a good choice?
9
10
11
What is machine learning?
 Automatically or semi-automatically
 Inducing concepts (i.e., rules) from data
 Finding patterns in data (For human and computer)
 Explaining data
 Making predictions
12
Data Learning Algorithm Model
New Data
Prediction
Classification Engine
13
Keep this picture in mind…
 Machine learning isn’t magic
 But it can be useful for identifying meaningful
patterns in your data when used properly
 Proper use requires insight into your data
 Otherwise, GIGO (Garbage In Garbage Out)
 Think like a computer!
14
Machine Learning for Text Mining
 Basic features: “Bag of Words”
 Represent text as a vector where each position
corresponds to a term
15
• Cows make cheese. (110010)
• Cheese make cows. (110010)
• Hamsters eat seeds. (001101)
Cheese
Cows
Eat
Hamsters
Make
Seeds
Basic Types of Features
 Unigram
 Single words (e.g. prefer, sandwhich)
 Bigram
 Pairs of words next to each other (e.g. eat bread)
 Simple lexical patterns
 e.g. “common denominator” versus “common multiple”
 Punctuation
 “You think the answer is 9?” vs. “You think the answer is 9.”
16
Part of Speech (POS) Tagging
 POS bigrams capture syntactic or stylistic
information
 e.g. “the answer which is …” vs “which is the answer”
 Pairs of POS (Part-of-Speech) tags next to each other
 DT_NN: "Determiner"_"Noun, singular or mass “
 NNP_NNP: “Proper noun, singular”_“Proper noun,
singular”
 Examples
 JJR: Adjective, comparative
17
http://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html
Feature Space Customizations
 Machine learning algorithms look for features
that are good predictors, NOT features that
are necessarily meaningful
 Look for approximations
e.g. Don’t need to do a complete syntactic
analysis for questions
Look for question marks
Look for wh-terms that occur immediately before an
auxilliary verb --- Combined features
18
LightSide
 Easy UI
 Feature Extraction
 Model Building /
Machine Learning
 Error Analysis
 Data Structuring
 Free/open-source for
adoption and extension
19
Feature
Extraction
20
Machine
Learning
and
Evaluation
21
Recap …
“The key insight communicated through this
workshop is that …
If we can understand the connection between socio-
psychological processes and language by means of
the social signals encoded in them, we can
structure computational models of language
interactions more effectively.”
--- Carolyn Penstein Rosé
22
Examples of Part of Speech Tagging
1. CC Coordinating conjunction
2. CD Cardinal number
3. DT Determiner
4. EX Existential there
5. FW Foreign word
6. IN Preposition/subord
7. JJ Adjective
8. JJR Adjective, comparative
9. JJS Adjective, superlative
10.LS List item marker
11.MD Modal
12.NN Noun, singular or mass
13.NNS Noun, plural
14.NNP Proper noun, singular
15.NNPS Proper noun, plural
16.PDT Predeterminer
17.POS Possessive ending
18.PRP Personal pronoun
19.PP Possessive pronoun
20.RB Adverb
21.RBR Adverb, comparative
22.RBS Adverb, superlative23
http://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html
Examples of Part of Speech Tagging
23.RP Particle
24.SYM Symbol
25.TO to
26.UH Interjection
27.VB Verb, base form
28.VBD Verb, past tense
29.VBG Verb,
gerund/present participle
30.VBN Verb, past participle
31.VBP Verb, non-3rd ps.
sing. present
32.VBZ Verb, 3rd ps. sing.
present
33.WDT wh-determiner
34.WP wh-pronoun
35.WP Possessive wh-
pronoun
36.WRB wh-adverb
http://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html
Findings
 Transactivity (Berkowitz & Gibbs, 1983)
 Moderating effect on learning (Joshi & Rosé, 2007; Russell, 2005;
Kruger & Tomasello, 1986; Teasley, 1995)
 Moderating effect on knowledge sharing in working groups (Gweon et
al., 2011)
 Engagement (Martin & White, 2005)
 Correlational analysis: Strong correlation between displayed openness
of group members and articulation of reasoning (R = .72) (Dyke et al.,
in press)
 Intervention study: Causal effect on propensity to articulate ideas in
group chats (effect size .6 standard deviations) (Kumar et al., 2011)
 Mediating effect of idea contribution on learning in scientific inquiry
(Wang et al., 2011)
25

More Related Content

Similar to Chi-Un Lei "Text Mining and Educational Discourse"

Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
Pranav Gupta
 
DDH 2021-03-03: Text Processing and Searching in the Medical Domain
DDH 2021-03-03: Text Processing and Searching in the Medical DomainDDH 2021-03-03: Text Processing and Searching in the Medical Domain
DDH 2021-03-03: Text Processing and Searching in the Medical Domain
LuukBoulogne
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
punedevscom
 
REPORT.doc
REPORT.docREPORT.doc
Nlp Sentemental analysis of Tweetr And CaseStudy
Nlp Sentemental analysis of Tweetr And CaseStudyNlp Sentemental analysis of Tweetr And CaseStudy
Nlp Sentemental analysis of Tweetr And CaseStudy
Raza Azeem
 
Cognitive plausibility in learning algorithms
Cognitive plausibility in learning algorithmsCognitive plausibility in learning algorithms
Cognitive plausibility in learning algorithms
André Karpištšenko
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Saurabh Kaushik
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
Toine Bogers
 
Nlp
NlpNlp
Syntax (2).pptx
Syntax (2).pptxSyntax (2).pptx
Syntax (2).pptx
ssuser61590a
 
EuroVis DocuBurst Presentation 2009
EuroVis DocuBurst Presentation 2009EuroVis DocuBurst Presentation 2009
EuroVis DocuBurst Presentation 2009Christopher Collins
 
Simulating meaning: a neural theory of discourse coherence
Simulating meaning: a neural theory of discourse coherenceSimulating meaning: a neural theory of discourse coherence
Simulating meaning: a neural theory of discourse coherence
Terry McDonough
 
Discourse Analysis for Social Research
Discourse Analysis for Social ResearchDiscourse Analysis for Social Research
Discourse Analysis for Social Research
Dominik Lukes
 
Natural lanaguage processing
Natural lanaguage processingNatural lanaguage processing
Natural lanaguage processing
gulshan kumar
 
Multiple Methods and Techniques in Analyzing Computer-Supported Collaborative...
Multiple Methods and Techniques in Analyzing Computer-Supported Collaborative...Multiple Methods and Techniques in Analyzing Computer-Supported Collaborative...
Multiple Methods and Techniques in Analyzing Computer-Supported Collaborative...
CITE
 
Artificial Thinking: can machines reason with analogies?
Artificial Thinking:  can machines reason with analogies? Artificial Thinking:  can machines reason with analogies?
Artificial Thinking: can machines reason with analogies?
Federico Bianchi
 
Midwest km pugh conversational ai and ai for conversation 190809
Midwest km pugh conversational ai and ai for conversation 190809Midwest km pugh conversational ai and ai for conversation 190809
Midwest km pugh conversational ai and ai for conversation 190809
Katrina (Kate) Pugh
 

Similar to Chi-Un Lei "Text Mining and Educational Discourse" (20)

Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
DDH 2021-03-03: Text Processing and Searching in the Medical Domain
DDH 2021-03-03: Text Processing and Searching in the Medical DomainDDH 2021-03-03: Text Processing and Searching in the Medical Domain
DDH 2021-03-03: Text Processing and Searching in the Medical Domain
 
NLP
NLPNLP
NLP
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
REPORT.doc
REPORT.docREPORT.doc
REPORT.doc
 
Nlp Sentemental analysis of Tweetr And CaseStudy
Nlp Sentemental analysis of Tweetr And CaseStudyNlp Sentemental analysis of Tweetr And CaseStudy
Nlp Sentemental analysis of Tweetr And CaseStudy
 
Cognitive plausibility in learning algorithms
Cognitive plausibility in learning algorithmsCognitive plausibility in learning algorithms
Cognitive plausibility in learning algorithms
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Nlp
NlpNlp
Nlp
 
Syntax (2).pptx
Syntax (2).pptxSyntax (2).pptx
Syntax (2).pptx
 
EuroVis DocuBurst Presentation 2009
EuroVis DocuBurst Presentation 2009EuroVis DocuBurst Presentation 2009
EuroVis DocuBurst Presentation 2009
 
Simulating meaning: a neural theory of discourse coherence
Simulating meaning: a neural theory of discourse coherenceSimulating meaning: a neural theory of discourse coherence
Simulating meaning: a neural theory of discourse coherence
 
NLP
NLPNLP
NLP
 
NLP
NLPNLP
NLP
 
Discourse Analysis for Social Research
Discourse Analysis for Social ResearchDiscourse Analysis for Social Research
Discourse Analysis for Social Research
 
Natural lanaguage processing
Natural lanaguage processingNatural lanaguage processing
Natural lanaguage processing
 
Multiple Methods and Techniques in Analyzing Computer-Supported Collaborative...
Multiple Methods and Techniques in Analyzing Computer-Supported Collaborative...Multiple Methods and Techniques in Analyzing Computer-Supported Collaborative...
Multiple Methods and Techniques in Analyzing Computer-Supported Collaborative...
 
Artificial Thinking: can machines reason with analogies?
Artificial Thinking:  can machines reason with analogies? Artificial Thinking:  can machines reason with analogies?
Artificial Thinking: can machines reason with analogies?
 
Midwest km pugh conversational ai and ai for conversation 190809
Midwest km pugh conversational ai and ai for conversation 190809Midwest km pugh conversational ai and ai for conversation 190809
Midwest km pugh conversational ai and ai for conversation 190809
 

More from CITE

Keynote 1: Teaching and Learning Computational Thinking at Scale
Keynote 1: Teaching and Learning Computational Thinking at ScaleKeynote 1: Teaching and Learning Computational Thinking at Scale
Keynote 1: Teaching and Learning Computational Thinking at Scale
CITE
 
Keynote 2: Social Epistemic Cognition in Engineering Learning: Theory, Pedago...
Keynote 2: Social Epistemic Cognition in Engineering Learning: Theory, Pedago...Keynote 2: Social Epistemic Cognition in Engineering Learning: Theory, Pedago...
Keynote 2: Social Epistemic Cognition in Engineering Learning: Theory, Pedago...
CITE
 
Changing Technology Changing Practice: Empowering Staff and Building Capabili...
Changing Technology Changing Practice: Empowering Staff and Building Capabili...Changing Technology Changing Practice: Empowering Staff and Building Capabili...
Changing Technology Changing Practice: Empowering Staff and Building Capabili...
CITE
 
Traditional Large Scale Educational Assessment and the Incorporation of Digit...
Traditional Large Scale Educational Assessment and the Incorporation of Digit...Traditional Large Scale Educational Assessment and the Incorporation of Digit...
Traditional Large Scale Educational Assessment and the Incorporation of Digit...
CITE
 
Scaling up Assessment for Learning
Scaling up Assessment for LearningScaling up Assessment for Learning
Scaling up Assessment for Learning
CITE
 
Seminar on policy study on e-Learning in Informal Learning contexts
Seminar on policy study on e-Learning in Informal Learning contextsSeminar on policy study on e-Learning in Informal Learning contexts
Seminar on policy study on e-Learning in Informal Learning contexts
CITE
 
Seminar on policy study on e-Learning in Formal & Open Learning contexts
Seminar on policy study on e-Learning in Formal & Open Learning contextsSeminar on policy study on e-Learning in Formal & Open Learning contexts
Seminar on policy study on e-Learning in Formal & Open Learning contexts
CITE
 
Prof. Gerald KNEZEK: Implications of Digital Generations for a Learning Society
Prof. Gerald KNEZEK: Implications of Digital Generations for a Learning Society Prof. Gerald KNEZEK: Implications of Digital Generations for a Learning Society
Prof. Gerald KNEZEK: Implications of Digital Generations for a Learning Society
CITE
 
G:\CITERS2015\29May2015\2 Invited-Talk-2-Sidorko-Fred
G:\CITERS2015\29May2015\2 Invited-Talk-2-Sidorko-FredG:\CITERS2015\29May2015\2 Invited-Talk-2-Sidorko-Fred
G:\CITERS2015\29May2015\2 Invited-Talk-2-Sidorko-Fred
CITE
 
Dr. David Gibson: Challenge-Based Learning
Dr. David Gibson: Challenge-Based LearningDr. David Gibson: Challenge-Based Learning
Dr. David Gibson: Challenge-Based Learning
CITE
 
Analogy, Causality, and Discovery in Science: The engines of human thought
Analogy, Causality, and Discovery in Science: The engines of human thoughtAnalogy, Causality, and Discovery in Science: The engines of human thought
Analogy, Causality, and Discovery in Science: The engines of human thought
CITE
 
Educating the Scientific Brain and Mind: Insights from The Science of Learnin...
Educating the Scientific Brain and Mind: Insights from The Science of Learnin...Educating the Scientific Brain and Mind: Insights from The Science of Learnin...
Educating the Scientific Brain and Mind: Insights from The Science of Learnin...
CITE
 
Science of Learning — Why it matters to schools and families?
Science of Learning — Why it matters to schools and families?Science of Learning — Why it matters to schools and families?
Science of Learning — Why it matters to schools and families?
CITE
 
Understanding the self through self bias
Understanding the self through self biasUnderstanding the self through self bias
Understanding the self through self bias
CITE
 
The implementation of "Reading Battle" in Lam Tin Methodist Primary School
The implementation of "Reading Battle" in Lam Tin Methodist Primary SchoolThe implementation of "Reading Battle" in Lam Tin Methodist Primary School
The implementation of "Reading Battle" in Lam Tin Methodist Primary SchoolCITE
 
Strengthening students' reading comprehension ability (both Chinese and Engli...
Strengthening students' reading comprehension ability (both Chinese and Engli...Strengthening students' reading comprehension ability (both Chinese and Engli...
Strengthening students' reading comprehension ability (both Chinese and Engli...CITE
 
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"CITE
 
Xiao Hu "Learning Analytics Initiatives"
Xiao Hu "Learning Analytics Initiatives"Xiao Hu "Learning Analytics Initiatives"
Xiao Hu "Learning Analytics Initiatives"CITE
 
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...CITE
 
Phil Winne "Learning Analytics for Learning Science When N = me"
Phil Winne "Learning Analytics for Learning Science When N = me"Phil Winne "Learning Analytics for Learning Science When N = me"
Phil Winne "Learning Analytics for Learning Science When N = me"CITE
 

More from CITE (20)

Keynote 1: Teaching and Learning Computational Thinking at Scale
Keynote 1: Teaching and Learning Computational Thinking at ScaleKeynote 1: Teaching and Learning Computational Thinking at Scale
Keynote 1: Teaching and Learning Computational Thinking at Scale
 
Keynote 2: Social Epistemic Cognition in Engineering Learning: Theory, Pedago...
Keynote 2: Social Epistemic Cognition in Engineering Learning: Theory, Pedago...Keynote 2: Social Epistemic Cognition in Engineering Learning: Theory, Pedago...
Keynote 2: Social Epistemic Cognition in Engineering Learning: Theory, Pedago...
 
Changing Technology Changing Practice: Empowering Staff and Building Capabili...
Changing Technology Changing Practice: Empowering Staff and Building Capabili...Changing Technology Changing Practice: Empowering Staff and Building Capabili...
Changing Technology Changing Practice: Empowering Staff and Building Capabili...
 
Traditional Large Scale Educational Assessment and the Incorporation of Digit...
Traditional Large Scale Educational Assessment and the Incorporation of Digit...Traditional Large Scale Educational Assessment and the Incorporation of Digit...
Traditional Large Scale Educational Assessment and the Incorporation of Digit...
 
Scaling up Assessment for Learning
Scaling up Assessment for LearningScaling up Assessment for Learning
Scaling up Assessment for Learning
 
Seminar on policy study on e-Learning in Informal Learning contexts
Seminar on policy study on e-Learning in Informal Learning contextsSeminar on policy study on e-Learning in Informal Learning contexts
Seminar on policy study on e-Learning in Informal Learning contexts
 
Seminar on policy study on e-Learning in Formal & Open Learning contexts
Seminar on policy study on e-Learning in Formal & Open Learning contextsSeminar on policy study on e-Learning in Formal & Open Learning contexts
Seminar on policy study on e-Learning in Formal & Open Learning contexts
 
Prof. Gerald KNEZEK: Implications of Digital Generations for a Learning Society
Prof. Gerald KNEZEK: Implications of Digital Generations for a Learning Society Prof. Gerald KNEZEK: Implications of Digital Generations for a Learning Society
Prof. Gerald KNEZEK: Implications of Digital Generations for a Learning Society
 
G:\CITERS2015\29May2015\2 Invited-Talk-2-Sidorko-Fred
G:\CITERS2015\29May2015\2 Invited-Talk-2-Sidorko-FredG:\CITERS2015\29May2015\2 Invited-Talk-2-Sidorko-Fred
G:\CITERS2015\29May2015\2 Invited-Talk-2-Sidorko-Fred
 
Dr. David Gibson: Challenge-Based Learning
Dr. David Gibson: Challenge-Based LearningDr. David Gibson: Challenge-Based Learning
Dr. David Gibson: Challenge-Based Learning
 
Analogy, Causality, and Discovery in Science: The engines of human thought
Analogy, Causality, and Discovery in Science: The engines of human thoughtAnalogy, Causality, and Discovery in Science: The engines of human thought
Analogy, Causality, and Discovery in Science: The engines of human thought
 
Educating the Scientific Brain and Mind: Insights from The Science of Learnin...
Educating the Scientific Brain and Mind: Insights from The Science of Learnin...Educating the Scientific Brain and Mind: Insights from The Science of Learnin...
Educating the Scientific Brain and Mind: Insights from The Science of Learnin...
 
Science of Learning — Why it matters to schools and families?
Science of Learning — Why it matters to schools and families?Science of Learning — Why it matters to schools and families?
Science of Learning — Why it matters to schools and families?
 
Understanding the self through self bias
Understanding the self through self biasUnderstanding the self through self bias
Understanding the self through self bias
 
The implementation of "Reading Battle" in Lam Tin Methodist Primary School
The implementation of "Reading Battle" in Lam Tin Methodist Primary SchoolThe implementation of "Reading Battle" in Lam Tin Methodist Primary School
The implementation of "Reading Battle" in Lam Tin Methodist Primary School
 
Strengthening students' reading comprehension ability (both Chinese and Engli...
Strengthening students' reading comprehension ability (both Chinese and Engli...Strengthening students' reading comprehension ability (both Chinese and Engli...
Strengthening students' reading comprehension ability (both Chinese and Engli...
 
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
 
Xiao Hu "Learning Analytics Initiatives"
Xiao Hu "Learning Analytics Initiatives"Xiao Hu "Learning Analytics Initiatives"
Xiao Hu "Learning Analytics Initiatives"
 
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
 
Phil Winne "Learning Analytics for Learning Science When N = me"
Phil Winne "Learning Analytics for Learning Science When N = me"Phil Winne "Learning Analytics for Learning Science When N = me"
Phil Winne "Learning Analytics for Learning Science When N = me"
 

Recently uploaded

1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
tarandeep35
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
TechSoup
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
Mohammed Sikander
 
Normal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of LabourNormal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of Labour
Wasim Ak
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
Academy of Science of South Africa
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
David Douglas School District
 

Recently uploaded (20)

1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
 
Normal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of LabourNormal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of Labour
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
 

Chi-Un Lei "Text Mining and Educational Discourse"

  • 1. Text Mining and Educational Discourse Dr. Chi-Un Lei, Dept. of Electrical and Electronic Eng. LASI-HK 2014 (Adopted from LASI workshop 2014) 1
  • 2. Words from the Speaker “The key insight communicated through this workshop is that … If we can understand the connection between socio- psychological processes and language by means of the social signals encoded in them, we can structure computational models of language interactions more effectively.” --- Carolyn Penstein Rosé 2
  • 3. Outline  Theoretical: Connection between discourse and learning  From rich but implicit constructs to explicit features that capture the essence for machine learning  Hands-on: Machine learning for text extraction and classification 3 Automatic Analysis Of Conversation Conversational Interventions Positive Learning Outcomes
  • 5. Souffle Framework 5 Engagement Engagement • Transactive • Knowledge Integration Person Person Authority Authority  Analysis of discussions for learning
  • 6. Transactivity  Building on an idea expressed earlier in a conversation  Using a reasoning statement 6 We don't want tmax to be at 570 both for the material and [the Environment] Well, for power and efficiency, we want a high tmax, but environmentally, we want a lower one.
  • 7. 7
  • 8. 8
  • 9. System of Engagement  Showing openness to the existence of other perspectives  Examples  Nuclear is a good choice  I consider nuclear to be a good choice  There’s no denying that nuclear is a superior choice  Is nuclear a good choice? 9
  • 10. 10
  • 11. 11
  • 12. What is machine learning?  Automatically or semi-automatically  Inducing concepts (i.e., rules) from data  Finding patterns in data (For human and computer)  Explaining data  Making predictions 12 Data Learning Algorithm Model New Data Prediction Classification Engine
  • 13. 13
  • 14. Keep this picture in mind…  Machine learning isn’t magic  But it can be useful for identifying meaningful patterns in your data when used properly  Proper use requires insight into your data  Otherwise, GIGO (Garbage In Garbage Out)  Think like a computer! 14
  • 15. Machine Learning for Text Mining  Basic features: “Bag of Words”  Represent text as a vector where each position corresponds to a term 15 • Cows make cheese. (110010) • Cheese make cows. (110010) • Hamsters eat seeds. (001101) Cheese Cows Eat Hamsters Make Seeds
  • 16. Basic Types of Features  Unigram  Single words (e.g. prefer, sandwhich)  Bigram  Pairs of words next to each other (e.g. eat bread)  Simple lexical patterns  e.g. “common denominator” versus “common multiple”  Punctuation  “You think the answer is 9?” vs. “You think the answer is 9.” 16
  • 17. Part of Speech (POS) Tagging  POS bigrams capture syntactic or stylistic information  e.g. “the answer which is …” vs “which is the answer”  Pairs of POS (Part-of-Speech) tags next to each other  DT_NN: "Determiner"_"Noun, singular or mass “  NNP_NNP: “Proper noun, singular”_“Proper noun, singular”  Examples  JJR: Adjective, comparative 17 http://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html
  • 18. Feature Space Customizations  Machine learning algorithms look for features that are good predictors, NOT features that are necessarily meaningful  Look for approximations e.g. Don’t need to do a complete syntactic analysis for questions Look for question marks Look for wh-terms that occur immediately before an auxilliary verb --- Combined features 18
  • 19. LightSide  Easy UI  Feature Extraction  Model Building / Machine Learning  Error Analysis  Data Structuring  Free/open-source for adoption and extension 19
  • 22. Recap … “The key insight communicated through this workshop is that … If we can understand the connection between socio- psychological processes and language by means of the social signals encoded in them, we can structure computational models of language interactions more effectively.” --- Carolyn Penstein Rosé 22
  • 23. Examples of Part of Speech Tagging 1. CC Coordinating conjunction 2. CD Cardinal number 3. DT Determiner 4. EX Existential there 5. FW Foreign word 6. IN Preposition/subord 7. JJ Adjective 8. JJR Adjective, comparative 9. JJS Adjective, superlative 10.LS List item marker 11.MD Modal 12.NN Noun, singular or mass 13.NNS Noun, plural 14.NNP Proper noun, singular 15.NNPS Proper noun, plural 16.PDT Predeterminer 17.POS Possessive ending 18.PRP Personal pronoun 19.PP Possessive pronoun 20.RB Adverb 21.RBR Adverb, comparative 22.RBS Adverb, superlative23 http://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html
  • 24. Examples of Part of Speech Tagging 23.RP Particle 24.SYM Symbol 25.TO to 26.UH Interjection 27.VB Verb, base form 28.VBD Verb, past tense 29.VBG Verb, gerund/present participle 30.VBN Verb, past participle 31.VBP Verb, non-3rd ps. sing. present 32.VBZ Verb, 3rd ps. sing. present 33.WDT wh-determiner 34.WP wh-pronoun 35.WP Possessive wh- pronoun 36.WRB wh-adverb http://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html
  • 25. Findings  Transactivity (Berkowitz & Gibbs, 1983)  Moderating effect on learning (Joshi & Rosé, 2007; Russell, 2005; Kruger & Tomasello, 1986; Teasley, 1995)  Moderating effect on knowledge sharing in working groups (Gweon et al., 2011)  Engagement (Martin & White, 2005)  Correlational analysis: Strong correlation between displayed openness of group members and articulation of reasoning (R = .72) (Dyke et al., in press)  Intervention study: Causal effect on propensity to articulate ideas in group chats (effect size .6 standard deviations) (Kumar et al., 2011)  Mediating effect of idea contribution on learning in scientific inquiry (Wang et al., 2011) 25

Editor's Notes

  1. Don’t forget that you need to prune