SlideShare a Scribd company logo
1 of 52
The Geometry of Learning November 17th, 2009, Utrecht, The Netherlands Fridolin WildKMi, The Open University
(createdwith http://www.wordle.net)
Outline Context & Framing Theories Latent Semantic Analysis (LSA) Social Network Analysis (SNA) Meaningful Interaction Analysis (MIA) Conclusion & Outlook
Context & Theories
Information (96dpi) Information could be the quality of a certain signal. Information could be a logical abstractor, the release mechanism. Information & Knowledge Knowledge could be the delta at the receiver (a paper, a human, a library).
Learning is change Learning is about competence development Competence becomes visible in performance Professional competence is mainly about (re-)constructing and processing information and knowledge from cues Professional competence development is much about learning concepts from language Professional performance is much about demonstrating conceptual knowledge with language Language! What is learning about?
Tying shoelaces Douglas Adams’ ‘meaning of liff’: Epping: The futile movements of forefingers and eyebrows used when failing to attract the attention of waiters and barmen. Shoeburyness: The vague uncomfortable feeling you get when sitting on a seat which is still warm from somebody else's bottom I have been convincingly Sapir-Whorfed by this book. Non-textual concepts things we can’t (easily) learn from language
Latent Semantic Analysis
Word Choice Educated adult understands ~100,000 word forms An average sentence contains 20 tokens.  Thus 100,00020 possible combinationsof words in a sentence  maximum of log2 100,00020= 332 bits in word choice alone. 20! = 2.4 x 1018 possible orders of 20 words = maximum of 61 bits from order of the words.  332/(61+ 332) = 84%word choice (Landauer, 2007)
Latent Semantic Analysis “Humans learn word meanings and how to combine them into passage meaning through experience with ~paragraph unitized verbal environments.”  “They don’t remember all the separate words of a passage; they remember its overall gist or meaning.”  “LSA learns by ‘reading’ ~paragraph unitized texts that represent the environment.” “It doesn’t remember all the separate words of a text it; it remembers its overall gist or meaning.” (Landauer, 2007)
Latent Semantics latent-semantic space In other words:Assumption: language utterances have a semantic structure Problem: structure is obscured by word usage(noise, synonymy, polysemy, …) Solution: map doc-term matrix using conceptual indices derived statistically (truncated SVD) and make similarity comparisons using angles
Input (e.g., documents) term = feature vocabulary = ordered set of features Only the red terms appear in more than one document, so strip the rest. TEXTMATRIX { M } =  Deerwester, Dumais, Furnas, Landauer, and Harshman (1990): Indexing by Latent Semantic Analysis, In: Journal of the American Society for Information Science, 41(6):391-407
Singular Value Decomposition =
Truncated SVD latent-semantic space … we will get a different matrix (different values, but still of the same format as M).
(Landauer, 2007)
Reconstructed, Reduced Matrix m4: Graphminors: A survey
Similarity in a Latent-Semantic Space Query Y dimension Target 1 Angle 1 Angle 2 Target 2 X dimension (Landauer, 2007)
doc2doc - similarities Unreduced = pure vector space model - Based on M = TSD’ - Pearson Correlation  over document vectors reduced - based on M2 = TS2D’ - Pearson Correlation    over document vectors
Typical, simple workflow tm = textmatrix(‘dir/‘) tm = lw_logtf(tm) * gw_idf(tm) space = lsa(tm, dims=dimcalc_share()) tm3 = fold_in(tm, space) as.textmatrix(tm)
Processing Pipeline (with Options) 4 x 12 x 7 x 2 x 3 = 2016 Combinations
b) SVD is computationally expensive From seconds (lower hundreds of documents, optimised linear algebra libraries, truncated SVD) To minutes (hundreds to thousands of documents) To hours (tens and hundreds of thousands) a) SVD factor stability SVD calculates factorsover a given text base; different texts – different factors Problem: avoid unwanted factor changes Solution: folding-in of instead of recalculating Projecting by Folding-In
2 1 vT Folding-In in Detail (cf. Berry et al., 1995) Mk (2) convert „Dk“-format vector to „Mk“-format Tk Sk Dk (1) convert Original Vector to „Dk“-format
The Value of Singular Values Pearson(jahr, wien) Pearson(eu, österreich)
Simple LSA application
Summary Writing: Working Principle (Landauer, 2007)
Summary Writing Gold Standard 1 Gold  Standard 2 Y dimension Gold Standard 3 Essay 1 Essay 2 X dimension
‘Dumb’ Summary Writing (Code) library( "lsa“ )# load package # load training texts trm = textmatrix( "trainingtexts/“ ) trm = lw_bintf( trm ) * gw_idf( trm )# weighting space = lsa( trm )# create an LSA space # fold-in summaries to be tested (including gold standard text) tem = textmatrix( "testessays/", vocabulary=rownames(trm) ) tem_red = fold_in( tem, space ) # score a summary by comparing with  # gold standard text (very simple method!) cor( tem_red[,"goldstandard.txt"], tem_red[,"E1.txt"] ) => 0.7
Evaluating Effectiveness Compare Machine Scores with Human Scores Human-to-Human Correlation  Usually around .6  Increased by familiarity between  assessors, tighter assessment schemes, …  Scores vary even stronger with decreasing  subject familiarity (.8 at high familiarity,  worst test -.07) ,[object Object]
Training Collection: 3 ‘golden essays’, plus 302 documents from a marketing glossary, average length: 56.1 words,[object Object]
(S)NA
Social Network Analysis Existing for a long time (term coined 1954) Basic idea: Actors and Relationships between them (e.g. Interactions) Actors can be people (groups, media, tags, …) Actors and Ties form a Graph (edges and nodes) Within that graph, certain structures can be investigated  Betweenness, Degree of Centrality, Density, Cohesion Structural Patterns can be identified (e.g. the Troll)
Forum Messages
Incidence Matrix msg_id = incident, authorsappear in incidents
DeriveAdjacency Matrix = t(im) %*% im
Visualization: Sociogramme
Measuring Techniques (Sample) Closenesshow close to all others Degree Centralitynumber of (in/out) connections to others Betweennesshow often intermediary Componentse.g. kmeans cluster (k=3)
SNA applications
Co-Authorship Network WI (2005)
Paper Collaboration Prolearn e.g. co-authorships of ~30 deliverables of three work packages (ProLearn NoE) Roles: reviewer (red), editor (green), contributor Size: Prestige() But: type of interaction? Content of interaction? => not possible!
TEL Project Cooperation (2004-2007)
iCamp Collaboration (Y1) Shades of yellow: WP leadership Red: coordinator
MIA
Meaningful Interaction Analysis (MIA) Fusion: Combining LSA with SNA Terms and Documents (or anything else represented with column vectors or row vectors) are mapped into same space by LSA Semantic proximity can be measured between them: how close is a term to a document? (S)NA allows to analyse these resulting graph structures By e.g. cluster or component analysis By e.g. identifying central descriptors for these
The mathemagics behind Meaning Interaction Analysis
Truncated SVD latent-semantic space … we will get a different matrix (different values, but still of the same format as M).
Knowledge Proxy: LSA Part Tk= left-hand sided matrix = ‚term loadings‘  on the singular value Dk= right-hand sided matrix = ‚document loadings‘ on the singular value Multiply them into same space VT = TkSk VD = DkTSk Cosine Distance Matrix over  		 ... = a graph Extension: add author vectors VAthrough cluster centroids or vector addition of their publication vectors latent-semantic space Ofcourse:useexistingspaceandfold inthewholesetsofvectors
Knowledge Proxy: SNA Part:Filter the Network Every vectorhas a cosinedistancetoeveryother (maybe negative)! So: filter forthedesiredsimilaritystrength
ConSpectmonitoring conceptual development
TopicProxy (30 people, 2005)
Spot unwanted fragmentation e.g. two authors work on the same topic, but with different collaborator groups and with different literature Intervention Instrument: automatically recommend to hold a flashmeeting Bringing together what belongs together Wild, Ochoa, Heinze, Crespo, Quick (2009, to appear)
//eof.

More Related Content

What's hot

Basic review on topic modeling
Basic review on  topic modelingBasic review on  topic modeling
Basic review on topic modelingHiroyuki Kuromiya
 
A Simple Introduction to Neural Information Retrieval
A Simple Introduction to Neural Information RetrievalA Simple Introduction to Neural Information Retrieval
A Simple Introduction to Neural Information RetrievalBhaskar Mitra
 
Word Embedding to Document distances
Word Embedding to Document distancesWord Embedding to Document distances
Word Embedding to Document distancesGanesh Borle
 
Topic Models - LDA and Correlated Topic Models
Topic Models - LDA and Correlated Topic ModelsTopic Models - LDA and Correlated Topic Models
Topic Models - LDA and Correlated Topic ModelsClaudia Wagner
 
A Simple Introduction to Word Embeddings
A Simple Introduction to Word EmbeddingsA Simple Introduction to Word Embeddings
A Simple Introduction to Word EmbeddingsBhaskar Mitra
 
TopicModels_BleiPaper_Summary.pptx
TopicModels_BleiPaper_Summary.pptxTopicModels_BleiPaper_Summary.pptx
TopicModels_BleiPaper_Summary.pptxKalpit Desai
 
Neural Models for Document Ranking
Neural Models for Document RankingNeural Models for Document Ranking
Neural Models for Document RankingBhaskar Mitra
 
Language Technology Enhanced Learning
Language Technology Enhanced LearningLanguage Technology Enhanced Learning
Language Technology Enhanced Learningtelss09
 
Topic model, LDA and all that
Topic model, LDA and all thatTopic model, LDA and all that
Topic model, LDA and all thatZhibo Xiao
 
5 Lessons Learned from Designing Neural Models for Information Retrieval
5 Lessons Learned from Designing Neural Models for Information Retrieval5 Lessons Learned from Designing Neural Models for Information Retrieval
5 Lessons Learned from Designing Neural Models for Information RetrievalBhaskar Mitra
 
Topic modeling using big data analytics
Topic modeling using big data analyticsTopic modeling using big data analytics
Topic modeling using big data analyticsFarheen Nilofer
 
Dual Embedding Space Model (DESM)
Dual Embedding Space Model (DESM)Dual Embedding Space Model (DESM)
Dual Embedding Space Model (DESM)Bhaskar Mitra
 
Information Retrieval using Semantic Similarity
Information Retrieval using Semantic SimilarityInformation Retrieval using Semantic Similarity
Information Retrieval using Semantic SimilaritySaswat Padhi
 
Introduction to Probabilistic Latent Semantic Analysis
Introduction to Probabilistic Latent Semantic AnalysisIntroduction to Probabilistic Latent Semantic Analysis
Introduction to Probabilistic Latent Semantic AnalysisNYC Predictive Analytics
 
Vectorland: Brief Notes from Using Text Embeddings for Search
Vectorland: Brief Notes from Using Text Embeddings for SearchVectorland: Brief Notes from Using Text Embeddings for Search
Vectorland: Brief Notes from Using Text Embeddings for SearchBhaskar Mitra
 
Search Engines
Search EnginesSearch Engines
Search Enginesbutest
 

What's hot (20)

Topic Modeling
Topic ModelingTopic Modeling
Topic Modeling
 
Basic review on topic modeling
Basic review on  topic modelingBasic review on  topic modeling
Basic review on topic modeling
 
A Simple Introduction to Neural Information Retrieval
A Simple Introduction to Neural Information RetrievalA Simple Introduction to Neural Information Retrieval
A Simple Introduction to Neural Information Retrieval
 
Word Embedding to Document distances
Word Embedding to Document distancesWord Embedding to Document distances
Word Embedding to Document distances
 
Topic Models - LDA and Correlated Topic Models
Topic Models - LDA and Correlated Topic ModelsTopic Models - LDA and Correlated Topic Models
Topic Models - LDA and Correlated Topic Models
 
A Simple Introduction to Word Embeddings
A Simple Introduction to Word EmbeddingsA Simple Introduction to Word Embeddings
A Simple Introduction to Word Embeddings
 
TopicModels_BleiPaper_Summary.pptx
TopicModels_BleiPaper_Summary.pptxTopicModels_BleiPaper_Summary.pptx
TopicModels_BleiPaper_Summary.pptx
 
Neural Models for Document Ranking
Neural Models for Document RankingNeural Models for Document Ranking
Neural Models for Document Ranking
 
Language Technology Enhanced Learning
Language Technology Enhanced LearningLanguage Technology Enhanced Learning
Language Technology Enhanced Learning
 
Topic model, LDA and all that
Topic model, LDA and all thatTopic model, LDA and all that
Topic model, LDA and all that
 
5 Lessons Learned from Designing Neural Models for Information Retrieval
5 Lessons Learned from Designing Neural Models for Information Retrieval5 Lessons Learned from Designing Neural Models for Information Retrieval
5 Lessons Learned from Designing Neural Models for Information Retrieval
 
Topic modeling using big data analytics
Topic modeling using big data analyticsTopic modeling using big data analytics
Topic modeling using big data analytics
 
Dual Embedding Space Model (DESM)
Dual Embedding Space Model (DESM)Dual Embedding Space Model (DESM)
Dual Embedding Space Model (DESM)
 
Information Retrieval using Semantic Similarity
Information Retrieval using Semantic SimilarityInformation Retrieval using Semantic Similarity
Information Retrieval using Semantic Similarity
 
Canini09a
Canini09aCanini09a
Canini09a
 
Introduction to Probabilistic Latent Semantic Analysis
Introduction to Probabilistic Latent Semantic AnalysisIntroduction to Probabilistic Latent Semantic Analysis
Introduction to Probabilistic Latent Semantic Analysis
 
LDG-basic-slides
LDG-basic-slidesLDG-basic-slides
LDG-basic-slides
 
Vectorland: Brief Notes from Using Text Embeddings for Search
Vectorland: Brief Notes from Using Text Embeddings for SearchVectorland: Brief Notes from Using Text Embeddings for Search
Vectorland: Brief Notes from Using Text Embeddings for Search
 
Search Engines
Search EnginesSearch Engines
Search Engines
 
Topic Models
Topic ModelsTopic Models
Topic Models
 

Similar to The Geometry of Learning

Meaningful Interaction Analysis
Meaningful Interaction AnalysisMeaningful Interaction Analysis
Meaningful Interaction Analysisfridolin.wild
 
Topic Models Based Personalized Spam Filter
Topic Models Based Personalized Spam FilterTopic Models Based Personalized Spam Filter
Topic Models Based Personalized Spam FilterSudarsun Santhiappan
 
Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Bhaskar Mitra
 
Designing, Visualizing and Understanding Deep Neural Networks
Designing, Visualizing and Understanding Deep Neural NetworksDesigning, Visualizing and Understanding Deep Neural Networks
Designing, Visualizing and Understanding Deep Neural Networksconnectbeubax
 
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasksTopic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasksLeonardo Di Donato
 
Simulation of Language Acquisition Walter Daelemans
Simulation of Language Acquisition Walter DaelemansSimulation of Language Acquisition Walter Daelemans
Simulation of Language Acquisition Walter Daelemansbutest
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligencevini89
 
lecture_mooney.ppt
lecture_mooney.pptlecture_mooney.ppt
lecture_mooney.pptbutest
 
Interactive Analysis of Word Vector Embeddings
Interactive Analysis of Word Vector EmbeddingsInteractive Analysis of Word Vector Embeddings
Interactive Analysis of Word Vector Embeddingsgleicher
 
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATIONAN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATIONijnlc
 
Using Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalUsing Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalBhaskar Mitra
 
Cognitive plausibility in learning algorithms
Cognitive plausibility in learning algorithmsCognitive plausibility in learning algorithms
Cognitive plausibility in learning algorithmsAndré Karpištšenko
 
Discovering Novel Information with sentence Level clustering From Multi-docu...
Discovering Novel Information with sentence Level clustering  From Multi-docu...Discovering Novel Information with sentence Level clustering  From Multi-docu...
Discovering Novel Information with sentence Level clustering From Multi-docu...irjes
 
Learning deep structured semantic models for web search
Learning deep structured semantic models for web searchLearning deep structured semantic models for web search
Learning deep structured semantic models for web searchhyunsung lee
 
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...Jonathon Hare
 
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffnL6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffnRwanEnan
 
Learning group dssm - 20170605
Learning group   dssm - 20170605Learning group   dssm - 20170605
Learning group dssm - 20170605Shuai Zhang
 
Latent Semanctic Analysis Auro Tripathy
Latent Semanctic Analysis Auro TripathyLatent Semanctic Analysis Auro Tripathy
Latent Semanctic Analysis Auro TripathyAuro Tripathy
 

Similar to The Geometry of Learning (20)

Meaningful Interaction Analysis
Meaningful Interaction AnalysisMeaningful Interaction Analysis
Meaningful Interaction Analysis
 
Topic Models Based Personalized Spam Filter
Topic Models Based Personalized Spam FilterTopic Models Based Personalized Spam Filter
Topic Models Based Personalized Spam Filter
 
Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)
 
Designing, Visualizing and Understanding Deep Neural Networks
Designing, Visualizing and Understanding Deep Neural NetworksDesigning, Visualizing and Understanding Deep Neural Networks
Designing, Visualizing and Understanding Deep Neural Networks
 
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasksTopic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
 
Simulation of Language Acquisition Walter Daelemans
Simulation of Language Acquisition Walter DaelemansSimulation of Language Acquisition Walter Daelemans
Simulation of Language Acquisition Walter Daelemans
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
lecture_mooney.ppt
lecture_mooney.pptlecture_mooney.ppt
lecture_mooney.ppt
 
Interactive Analysis of Word Vector Embeddings
Interactive Analysis of Word Vector EmbeddingsInteractive Analysis of Word Vector Embeddings
Interactive Analysis of Word Vector Embeddings
 
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATIONAN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
 
Using Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalUsing Text Embeddings for Information Retrieval
Using Text Embeddings for Information Retrieval
 
Cognitive plausibility in learning algorithms
Cognitive plausibility in learning algorithmsCognitive plausibility in learning algorithms
Cognitive plausibility in learning algorithms
 
Discovering Novel Information with sentence Level clustering From Multi-docu...
Discovering Novel Information with sentence Level clustering  From Multi-docu...Discovering Novel Information with sentence Level clustering  From Multi-docu...
Discovering Novel Information with sentence Level clustering From Multi-docu...
 
Vsm lsi
Vsm lsiVsm lsi
Vsm lsi
 
Learning deep structured semantic models for web search
Learning deep structured semantic models for web searchLearning deep structured semantic models for web search
Learning deep structured semantic models for web search
 
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
 
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffnL6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
 
Learning group dssm - 20170605
Learning group   dssm - 20170605Learning group   dssm - 20170605
Learning group dssm - 20170605
 
Latent Semanctic Analysis Auro Tripathy
Latent Semanctic Analysis Auro TripathyLatent Semanctic Analysis Auro Tripathy
Latent Semanctic Analysis Auro Tripathy
 
LSA and PLSA
LSA and PLSALSA and PLSA
LSA and PLSA
 

More from fridolin.wild

Performance Augmentation (Keynote, SIG LT, XR4ALL)
Performance Augmentation (Keynote, SIG LT, XR4ALL)Performance Augmentation (Keynote, SIG LT, XR4ALL)
Performance Augmentation (Keynote, SIG LT, XR4ALL)fridolin.wild
 
Professional TEL 4.0: Performance Augmentation for Industry 4.0
Professional TEL 4.0: Performance Augmentation for Industry 4.0 Professional TEL 4.0: Performance Augmentation for Industry 4.0
Professional TEL 4.0: Performance Augmentation for Industry 4.0 fridolin.wild
 
Performance Augmentation
Performance AugmentationPerformance Augmentation
Performance Augmentationfridolin.wild
 
Reality As A Knowledge Medium
Reality As A Knowledge MediumReality As A Knowledge Medium
Reality As A Knowledge Mediumfridolin.wild
 
ARLEM draft spec - overview
ARLEM draft spec - overviewARLEM draft spec - overview
ARLEM draft spec - overviewfridolin.wild
 
AR community meeting, Seoul, Korea, October 6, 2015
AR community meeting, Seoul, Korea, October 6, 2015AR community meeting, Seoul, Korea, October 6, 2015
AR community meeting, Seoul, Korea, October 6, 2015fridolin.wild
 
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015fridolin.wild
 
IEEE p1589 'ARLEM' virtual meeting, July 8, 2015
IEEE p1589 'ARLEM' virtual meeting, July 8, 2015IEEE p1589 'ARLEM' virtual meeting, July 8, 2015
IEEE p1589 'ARLEM' virtual meeting, July 8, 2015fridolin.wild
 
Special Interest Group on Wearables-Enhanced Leanring (SIG WELL)
Special Interest Group on Wearables-Enhanced Leanring (SIG WELL)Special Interest Group on Wearables-Enhanced Leanring (SIG WELL)
Special Interest Group on Wearables-Enhanced Leanring (SIG WELL)fridolin.wild
 
IEEE augmented reality learning experience model (ARLEM)
IEEE augmented reality learning experience model (ARLEM)IEEE augmented reality learning experience model (ARLEM)
IEEE augmented reality learning experience model (ARLEM)fridolin.wild
 
Learning from meaningful, purposive interaction
Learning from meaningful, purposive interactionLearning from meaningful, purposive interaction
Learning from meaningful, purposive interactionfridolin.wild
 
IEEE augmented reality learning experience model (ARLEM)
IEEE augmented reality learning experience model (ARLEM)IEEE augmented reality learning experience model (ARLEM)
IEEE augmented reality learning experience model (ARLEM)fridolin.wild
 
ARgh! kinesthetic learning
ARgh! kinesthetic learningARgh! kinesthetic learning
ARgh! kinesthetic learningfridolin.wild
 
Lab rats-and-the-moral-maze-v2
Lab rats-and-the-moral-maze-v2Lab rats-and-the-moral-maze-v2
Lab rats-and-the-moral-maze-v2fridolin.wild
 
Quantifying reflection
Quantifying reflectionQuantifying reflection
Quantifying reflectionfridolin.wild
 
Natural Language Processing in R (rNLP)
Natural Language Processing in R (rNLP)Natural Language Processing in R (rNLP)
Natural Language Processing in R (rNLP)fridolin.wild
 
What if...? Technology and Knowledge in the University of the Future
What if...? Technology and Knowledge in the University of the FutureWhat if...? Technology and Knowledge in the University of the Future
What if...? Technology and Knowledge in the University of the Futurefridolin.wild
 

More from fridolin.wild (20)

Performance Augmentation (Keynote, SIG LT, XR4ALL)
Performance Augmentation (Keynote, SIG LT, XR4ALL)Performance Augmentation (Keynote, SIG LT, XR4ALL)
Performance Augmentation (Keynote, SIG LT, XR4ALL)
 
Professional TEL 4.0: Performance Augmentation for Industry 4.0
Professional TEL 4.0: Performance Augmentation for Industry 4.0 Professional TEL 4.0: Performance Augmentation for Industry 4.0
Professional TEL 4.0: Performance Augmentation for Industry 4.0
 
Performance Augmentation
Performance AugmentationPerformance Augmentation
Performance Augmentation
 
Reality As A Knowledge Medium
Reality As A Knowledge MediumReality As A Knowledge Medium
Reality As A Knowledge Medium
 
ARLEM draft spec - overview
ARLEM draft spec - overviewARLEM draft spec - overview
ARLEM draft spec - overview
 
AR community meeting, Seoul, Korea, October 6, 2015
AR community meeting, Seoul, Korea, October 6, 2015AR community meeting, Seoul, Korea, October 6, 2015
AR community meeting, Seoul, Korea, October 6, 2015
 
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
 
IEEE p1589 'ARLEM' virtual meeting, July 8, 2015
IEEE p1589 'ARLEM' virtual meeting, July 8, 2015IEEE p1589 'ARLEM' virtual meeting, July 8, 2015
IEEE p1589 'ARLEM' virtual meeting, July 8, 2015
 
Special Interest Group on Wearables-Enhanced Leanring (SIG WELL)
Special Interest Group on Wearables-Enhanced Leanring (SIG WELL)Special Interest Group on Wearables-Enhanced Leanring (SIG WELL)
Special Interest Group on Wearables-Enhanced Leanring (SIG WELL)
 
IEEE augmented reality learning experience model (ARLEM)
IEEE augmented reality learning experience model (ARLEM)IEEE augmented reality learning experience model (ARLEM)
IEEE augmented reality learning experience model (ARLEM)
 
Learning from meaningful, purposive interaction
Learning from meaningful, purposive interactionLearning from meaningful, purposive interaction
Learning from meaningful, purposive interaction
 
Reality as a Medium
Reality as a MediumReality as a Medium
Reality as a Medium
 
IEEE augmented reality learning experience model (ARLEM)
IEEE augmented reality learning experience model (ARLEM)IEEE augmented reality learning experience model (ARLEM)
IEEE augmented reality learning experience model (ARLEM)
 
ARgh! kinesthetic learning
ARgh! kinesthetic learningARgh! kinesthetic learning
ARgh! kinesthetic learning
 
learning by doing.
learning by doing.learning by doing.
learning by doing.
 
Lab rats-and-the-moral-maze-v2
Lab rats-and-the-moral-maze-v2Lab rats-and-the-moral-maze-v2
Lab rats-and-the-moral-maze-v2
 
Quantifying reflection
Quantifying reflectionQuantifying reflection
Quantifying reflection
 
Natural Language Processing in R (rNLP)
Natural Language Processing in R (rNLP)Natural Language Processing in R (rNLP)
Natural Language Processing in R (rNLP)
 
What if...? Technology and Knowledge in the University of the Future
What if...? Technology and Knowledge in the University of the FutureWhat if...? Technology and Knowledge in the University of the Future
What if...? Technology and Knowledge in the University of the Future
 
Widget- based PLEs
Widget-based PLEsWidget-based PLEs
Widget- based PLEs
 

Recently uploaded

Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 

Recently uploaded (20)

Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 

The Geometry of Learning

  • 1. The Geometry of Learning November 17th, 2009, Utrecht, The Netherlands Fridolin WildKMi, The Open University
  • 3. Outline Context & Framing Theories Latent Semantic Analysis (LSA) Social Network Analysis (SNA) Meaningful Interaction Analysis (MIA) Conclusion & Outlook
  • 5. Information (96dpi) Information could be the quality of a certain signal. Information could be a logical abstractor, the release mechanism. Information & Knowledge Knowledge could be the delta at the receiver (a paper, a human, a library).
  • 6. Learning is change Learning is about competence development Competence becomes visible in performance Professional competence is mainly about (re-)constructing and processing information and knowledge from cues Professional competence development is much about learning concepts from language Professional performance is much about demonstrating conceptual knowledge with language Language! What is learning about?
  • 7. Tying shoelaces Douglas Adams’ ‘meaning of liff’: Epping: The futile movements of forefingers and eyebrows used when failing to attract the attention of waiters and barmen. Shoeburyness: The vague uncomfortable feeling you get when sitting on a seat which is still warm from somebody else's bottom I have been convincingly Sapir-Whorfed by this book. Non-textual concepts things we can’t (easily) learn from language
  • 9. Word Choice Educated adult understands ~100,000 word forms An average sentence contains 20 tokens. Thus 100,00020 possible combinationsof words in a sentence  maximum of log2 100,00020= 332 bits in word choice alone. 20! = 2.4 x 1018 possible orders of 20 words = maximum of 61 bits from order of the words. 332/(61+ 332) = 84%word choice (Landauer, 2007)
  • 10. Latent Semantic Analysis “Humans learn word meanings and how to combine them into passage meaning through experience with ~paragraph unitized verbal environments.” “They don’t remember all the separate words of a passage; they remember its overall gist or meaning.” “LSA learns by ‘reading’ ~paragraph unitized texts that represent the environment.” “It doesn’t remember all the separate words of a text it; it remembers its overall gist or meaning.” (Landauer, 2007)
  • 11. Latent Semantics latent-semantic space In other words:Assumption: language utterances have a semantic structure Problem: structure is obscured by word usage(noise, synonymy, polysemy, …) Solution: map doc-term matrix using conceptual indices derived statistically (truncated SVD) and make similarity comparisons using angles
  • 12. Input (e.g., documents) term = feature vocabulary = ordered set of features Only the red terms appear in more than one document, so strip the rest. TEXTMATRIX { M } = Deerwester, Dumais, Furnas, Landauer, and Harshman (1990): Indexing by Latent Semantic Analysis, In: Journal of the American Society for Information Science, 41(6):391-407
  • 14. Truncated SVD latent-semantic space … we will get a different matrix (different values, but still of the same format as M).
  • 16. Reconstructed, Reduced Matrix m4: Graphminors: A survey
  • 17. Similarity in a Latent-Semantic Space Query Y dimension Target 1 Angle 1 Angle 2 Target 2 X dimension (Landauer, 2007)
  • 18. doc2doc - similarities Unreduced = pure vector space model - Based on M = TSD’ - Pearson Correlation over document vectors reduced - based on M2 = TS2D’ - Pearson Correlation over document vectors
  • 19. Typical, simple workflow tm = textmatrix(‘dir/‘) tm = lw_logtf(tm) * gw_idf(tm) space = lsa(tm, dims=dimcalc_share()) tm3 = fold_in(tm, space) as.textmatrix(tm)
  • 20. Processing Pipeline (with Options) 4 x 12 x 7 x 2 x 3 = 2016 Combinations
  • 21. b) SVD is computationally expensive From seconds (lower hundreds of documents, optimised linear algebra libraries, truncated SVD) To minutes (hundreds to thousands of documents) To hours (tens and hundreds of thousands) a) SVD factor stability SVD calculates factorsover a given text base; different texts – different factors Problem: avoid unwanted factor changes Solution: folding-in of instead of recalculating Projecting by Folding-In
  • 22. 2 1 vT Folding-In in Detail (cf. Berry et al., 1995) Mk (2) convert „Dk“-format vector to „Mk“-format Tk Sk Dk (1) convert Original Vector to „Dk“-format
  • 23. The Value of Singular Values Pearson(jahr, wien) Pearson(eu, österreich)
  • 25. Summary Writing: Working Principle (Landauer, 2007)
  • 26. Summary Writing Gold Standard 1 Gold Standard 2 Y dimension Gold Standard 3 Essay 1 Essay 2 X dimension
  • 27. ‘Dumb’ Summary Writing (Code) library( "lsa“ )# load package # load training texts trm = textmatrix( "trainingtexts/“ ) trm = lw_bintf( trm ) * gw_idf( trm )# weighting space = lsa( trm )# create an LSA space # fold-in summaries to be tested (including gold standard text) tem = textmatrix( "testessays/", vocabulary=rownames(trm) ) tem_red = fold_in( tem, space ) # score a summary by comparing with # gold standard text (very simple method!) cor( tem_red[,"goldstandard.txt"], tem_red[,"E1.txt"] ) => 0.7
  • 28.
  • 29.
  • 30. (S)NA
  • 31. Social Network Analysis Existing for a long time (term coined 1954) Basic idea: Actors and Relationships between them (e.g. Interactions) Actors can be people (groups, media, tags, …) Actors and Ties form a Graph (edges and nodes) Within that graph, certain structures can be investigated Betweenness, Degree of Centrality, Density, Cohesion Structural Patterns can be identified (e.g. the Troll)
  • 33. Incidence Matrix msg_id = incident, authorsappear in incidents
  • 34. DeriveAdjacency Matrix = t(im) %*% im
  • 36. Measuring Techniques (Sample) Closenesshow close to all others Degree Centralitynumber of (in/out) connections to others Betweennesshow often intermediary Componentse.g. kmeans cluster (k=3)
  • 39. Paper Collaboration Prolearn e.g. co-authorships of ~30 deliverables of three work packages (ProLearn NoE) Roles: reviewer (red), editor (green), contributor Size: Prestige() But: type of interaction? Content of interaction? => not possible!
  • 40. TEL Project Cooperation (2004-2007)
  • 41. iCamp Collaboration (Y1) Shades of yellow: WP leadership Red: coordinator
  • 42. MIA
  • 43. Meaningful Interaction Analysis (MIA) Fusion: Combining LSA with SNA Terms and Documents (or anything else represented with column vectors or row vectors) are mapped into same space by LSA Semantic proximity can be measured between them: how close is a term to a document? (S)NA allows to analyse these resulting graph structures By e.g. cluster or component analysis By e.g. identifying central descriptors for these
  • 44. The mathemagics behind Meaning Interaction Analysis
  • 45. Truncated SVD latent-semantic space … we will get a different matrix (different values, but still of the same format as M).
  • 46. Knowledge Proxy: LSA Part Tk= left-hand sided matrix = ‚term loadings‘ on the singular value Dk= right-hand sided matrix = ‚document loadings‘ on the singular value Multiply them into same space VT = TkSk VD = DkTSk Cosine Distance Matrix over ... = a graph Extension: add author vectors VAthrough cluster centroids or vector addition of their publication vectors latent-semantic space Ofcourse:useexistingspaceandfold inthewholesetsofvectors
  • 47. Knowledge Proxy: SNA Part:Filter the Network Every vectorhas a cosinedistancetoeveryother (maybe negative)! So: filter forthedesiredsimilaritystrength
  • 49.
  • 51. Spot unwanted fragmentation e.g. two authors work on the same topic, but with different collaborator groups and with different literature Intervention Instrument: automatically recommend to hold a flashmeeting Bringing together what belongs together Wild, Ochoa, Heinze, Crespo, Quick (2009, to appear)