Invited presentation given at the 2nd workshop on Natural Language Processing in Support of Learning: Metrics, Feedback and Connectivity (NLPSL'10) in Bucharest, Romania, September 15, 2010.
Social Network Analysis and collaborative learningFabio Nascimbeni
The presentation explore how network thinking and social network analysis can be useful to improve learners motivation and performance in collaborative learning settings.
Latent Semantic Analysis (LSA) is a mathematical technique for computationally modeling the meaning of words and larger units of texts. LSA works by applying a mathematical technique called Singular Value Decomposition (SVD) to a term*document matrix containing frequency counts for all words found in the corpus in all of the documents or passages in the corpus. After this SVD application, the meaning of a word is represented as a vector in a multidimensional semantic space, which makes it possible to compare word meanings, for instance by computing the cosine between two word vectors.
LSA has been successfully used in a large variety of language related applications from automatic grading of student essays to predicting click trails in website navigation. In Coh-Metrix (Graesser et al. 2004), a computational tool that produces indices of the linguistic and discourse representations of a text, LSA was used as a measure of text cohesion by assuming that cohesion increases as a function of higher cosine scores between adjacent sentences.
Besides being interesting as a technique for building programs that need to deal with semantics, LSA is also interesting as a model of human cognition. LSA can match human performance on word association tasks and vocabulary test. In this talk, Fridolin will focus on LSA as a tool in modeling language acquisition. After framing the area of the talk with sketching the key concepts learning, information, and competence acquisition, and after outlining presuppositions, an introduction into meaningful interaction analysis (MIA) is given. MIA is a means to inspect learning with the support of language analysis that is geometrical in nature. MIA is a fusion of latent semantic analysis (LSA) combined with network analysis (NA/SNA). LSA, NA/SNA, and MIA are illustrated by several examples.
To Model or Not to Model A Dialogue on the Role ofComputati.docxturveycharlyn
To Model or Not to Model? A Dialogue on the Role of
Computational Modeling in Developmental Science
Vanessa R. Simmering1, Jochen Triesch2, Gedeon O. Deák3, and John P. Spencer4
1 Department of Psychology, University of Wisconsin - Madison
2 Frankfurt Institute for Advanced Studies, Goethe University Frankfurt
3 Department of Cognitive Science, University of California at San Diego
4 Department of Psychology and Delta Center, University of Iowa
Abstract
All sciences use models of some variety to understand complex phenomena. In developmental
science, however, modeling is mostly limited to linear, algebraic descriptions of behavioral data.
Some researchers have suggested that complex mathematical models of developmental
phenomena are a viable (even necessary) tool that provide fertile ground for developing and
testing theory as well as for generating new hypotheses and predictions. This paper explores the
concerns, attitudes, and historical trends that underlie the tension between two cultures: one in
which computational simulations of behavior are an important complement to observation and
experimentation, and another which emphasizes evidence from behavioral experiments and linear
models enhanced by verbal descriptions. This tension is explored as a dialogue between three
characters: Ed (Experimental Developmentalist), Mira (Modeling Inclusive Research Advocate),
and Phil (Philosopher of Science).
Mira, Ed, and Phil are in Santa Fe, New Mexico, for a conference. After attending some of
the colloquia, they meet up at the Georgia O’Keeffe museum. Mira walks up to Ed and Phil
who are standing outside the museum.
Mira: Hey, I just saw a great set of talks—did you go to the symposium called “To Model or
Not To Model”? I thought they hit on some really important points.
Ed: I saw that in the schedule, but I didn’t go. I’m not really interested in modeling.
Mira: That’s too bad, you should have gone; this symposium was designed for people like
you.
Ed: What do you mean, ‘people like me’?
Mira: I mean people who don’t do modeling. The point of the symposium was to highlight
how modeling and empirical approaches can support one another if they stay connected.
Ed: I think empirical approaches are doing just fine, thank you very much.
Mira: I wasn’t trying to criticize non-modeling approaches. My point—rather, the point of
the symposium—was to discuss why developmental scientists from all perspectives should
Correspondence concerning this article should be addressed to Vanessa R. Simmering, Department of Psychology, University of
Wisconsin – Madison, 1202 W. Johnson St., Madison, WI 53706. [email protected]
NIH Public Access
Author Manuscript
Child Dev Perspect. Author manuscript; available in PMC 2011 May 27.
Published in final edited form as:
Child Dev Perspect. 2010 August ; 4(2): 152–158. doi:10.1111/j.1750-8606.2010.00134.x.
N
IH
-P
A
A
uthor M
anuscript
N
IH
-P
A
A
uthor M
anuscript
N
IH
-P
A
A
uthor M
anuscript
care ...
XIP Dashboard: Visual Analytics from Automated Rhetorical Parsing of Scient...Simon Buckingham Shum
XIP Dashboard: Visual Analytics from Automated Rhetorical Parsing of Scientific Metadiscourse
ABSTRACT
A key competency that we seek to build in learners is a critical mind, i.e. ability to engage with the ideas in the literature, and to identify when significant claims are being made in articles. The ability to decode such moves in texts is essential, as is the ability to make such moves in one’s own writing. Computational techniques for extracting them are becoming available, using Natural Language Processing (NLP) tuned to recognize the rhetorical signals that authors use when making a significant scholarly move. After reviewing related NLP work, we introduce the Xerox Incremental Parser (XIP), note previous work to render its output, and then motivate the design of the XIP Dashboard, a set of visual analytics modules built on XIP output, using the LAK/EDM open dataset as a test corpus. We report preliminary user reactions to a paper prototype of such a novel dashboard, describe the visualizations implemented to date, and present user scenarios for learners, educators and researchers. We conclude with a summary of ongoing design refinements, potential platform integrations, and questions that need to be investigated through end-user evaluations.
Performance Augmentation (Keynote, SIG LT, XR4ALL)fridolin.wild
presented at the first meeting of the XR4ALL SIG 'AR/VR for Learning and Training' (http://xr4all.eu/vrar-learning-training/), see event: http://xr4all.eu/event/webinar-on-xr-for-learning-and-training/
Professional TEL 4.0: Performance Augmentation for Industry 4.0 fridolin.wild
Industry 4.0 is on the rise and this coordinated push for automation, big data, and internet-of-things in the smart factory is already causing (and will continue to) disruption in the job market. New skills for 'new collar' jobs are needed and intelligent assistance systems with Augmented Reality, Smart Glasses, and other forms of wearable computing may help to deliver them.
In this talk, Dr. Wild will introduce to the concept of Performance Augmentation and illustrate how challenges for the future can be met at the hand of several examples of intelligent training and live guidance applications in aircraft maintenance, space assembly, and medical diagnostics.
More Related Content
Similar to Monitoring Conceptual Development with Meaningful Interaction Analysis
Social Network Analysis and collaborative learningFabio Nascimbeni
The presentation explore how network thinking and social network analysis can be useful to improve learners motivation and performance in collaborative learning settings.
Latent Semantic Analysis (LSA) is a mathematical technique for computationally modeling the meaning of words and larger units of texts. LSA works by applying a mathematical technique called Singular Value Decomposition (SVD) to a term*document matrix containing frequency counts for all words found in the corpus in all of the documents or passages in the corpus. After this SVD application, the meaning of a word is represented as a vector in a multidimensional semantic space, which makes it possible to compare word meanings, for instance by computing the cosine between two word vectors.
LSA has been successfully used in a large variety of language related applications from automatic grading of student essays to predicting click trails in website navigation. In Coh-Metrix (Graesser et al. 2004), a computational tool that produces indices of the linguistic and discourse representations of a text, LSA was used as a measure of text cohesion by assuming that cohesion increases as a function of higher cosine scores between adjacent sentences.
Besides being interesting as a technique for building programs that need to deal with semantics, LSA is also interesting as a model of human cognition. LSA can match human performance on word association tasks and vocabulary test. In this talk, Fridolin will focus on LSA as a tool in modeling language acquisition. After framing the area of the talk with sketching the key concepts learning, information, and competence acquisition, and after outlining presuppositions, an introduction into meaningful interaction analysis (MIA) is given. MIA is a means to inspect learning with the support of language analysis that is geometrical in nature. MIA is a fusion of latent semantic analysis (LSA) combined with network analysis (NA/SNA). LSA, NA/SNA, and MIA are illustrated by several examples.
To Model or Not to Model A Dialogue on the Role ofComputati.docxturveycharlyn
To Model or Not to Model? A Dialogue on the Role of
Computational Modeling in Developmental Science
Vanessa R. Simmering1, Jochen Triesch2, Gedeon O. Deák3, and John P. Spencer4
1 Department of Psychology, University of Wisconsin - Madison
2 Frankfurt Institute for Advanced Studies, Goethe University Frankfurt
3 Department of Cognitive Science, University of California at San Diego
4 Department of Psychology and Delta Center, University of Iowa
Abstract
All sciences use models of some variety to understand complex phenomena. In developmental
science, however, modeling is mostly limited to linear, algebraic descriptions of behavioral data.
Some researchers have suggested that complex mathematical models of developmental
phenomena are a viable (even necessary) tool that provide fertile ground for developing and
testing theory as well as for generating new hypotheses and predictions. This paper explores the
concerns, attitudes, and historical trends that underlie the tension between two cultures: one in
which computational simulations of behavior are an important complement to observation and
experimentation, and another which emphasizes evidence from behavioral experiments and linear
models enhanced by verbal descriptions. This tension is explored as a dialogue between three
characters: Ed (Experimental Developmentalist), Mira (Modeling Inclusive Research Advocate),
and Phil (Philosopher of Science).
Mira, Ed, and Phil are in Santa Fe, New Mexico, for a conference. After attending some of
the colloquia, they meet up at the Georgia O’Keeffe museum. Mira walks up to Ed and Phil
who are standing outside the museum.
Mira: Hey, I just saw a great set of talks—did you go to the symposium called “To Model or
Not To Model”? I thought they hit on some really important points.
Ed: I saw that in the schedule, but I didn’t go. I’m not really interested in modeling.
Mira: That’s too bad, you should have gone; this symposium was designed for people like
you.
Ed: What do you mean, ‘people like me’?
Mira: I mean people who don’t do modeling. The point of the symposium was to highlight
how modeling and empirical approaches can support one another if they stay connected.
Ed: I think empirical approaches are doing just fine, thank you very much.
Mira: I wasn’t trying to criticize non-modeling approaches. My point—rather, the point of
the symposium—was to discuss why developmental scientists from all perspectives should
Correspondence concerning this article should be addressed to Vanessa R. Simmering, Department of Psychology, University of
Wisconsin – Madison, 1202 W. Johnson St., Madison, WI 53706. [email protected]
NIH Public Access
Author Manuscript
Child Dev Perspect. Author manuscript; available in PMC 2011 May 27.
Published in final edited form as:
Child Dev Perspect. 2010 August ; 4(2): 152–158. doi:10.1111/j.1750-8606.2010.00134.x.
N
IH
-P
A
A
uthor M
anuscript
N
IH
-P
A
A
uthor M
anuscript
N
IH
-P
A
A
uthor M
anuscript
care ...
XIP Dashboard: Visual Analytics from Automated Rhetorical Parsing of Scient...Simon Buckingham Shum
XIP Dashboard: Visual Analytics from Automated Rhetorical Parsing of Scientific Metadiscourse
ABSTRACT
A key competency that we seek to build in learners is a critical mind, i.e. ability to engage with the ideas in the literature, and to identify when significant claims are being made in articles. The ability to decode such moves in texts is essential, as is the ability to make such moves in one’s own writing. Computational techniques for extracting them are becoming available, using Natural Language Processing (NLP) tuned to recognize the rhetorical signals that authors use when making a significant scholarly move. After reviewing related NLP work, we introduce the Xerox Incremental Parser (XIP), note previous work to render its output, and then motivate the design of the XIP Dashboard, a set of visual analytics modules built on XIP output, using the LAK/EDM open dataset as a test corpus. We report preliminary user reactions to a paper prototype of such a novel dashboard, describe the visualizations implemented to date, and present user scenarios for learners, educators and researchers. We conclude with a summary of ongoing design refinements, potential platform integrations, and questions that need to be investigated through end-user evaluations.
Performance Augmentation (Keynote, SIG LT, XR4ALL)fridolin.wild
presented at the first meeting of the XR4ALL SIG 'AR/VR for Learning and Training' (http://xr4all.eu/vrar-learning-training/), see event: http://xr4all.eu/event/webinar-on-xr-for-learning-and-training/
Professional TEL 4.0: Performance Augmentation for Industry 4.0 fridolin.wild
Industry 4.0 is on the rise and this coordinated push for automation, big data, and internet-of-things in the smart factory is already causing (and will continue to) disruption in the job market. New skills for 'new collar' jobs are needed and intelligent assistance systems with Augmented Reality, Smart Glasses, and other forms of wearable computing may help to deliver them.
In this talk, Dr. Wild will introduce to the concept of Performance Augmentation and illustrate how challenges for the future can be met at the hand of several examples of intelligent training and live guidance applications in aircraft maintenance, space assembly, and medical diagnostics.
What if we could change reality and augment it with fantasy? Much of the magic of my childhood can be done by you and me today - using technology. In this talk, I show a few examples of magic, ghosts, and other techno tricks. Short version of a lecture given at the Sunday Assembly Oxford on April 12, 2015.
Presentation on interoperability models for activities and unified reference spaces ('workplaces') given at the Augmented World Expo 2014 in Santa Clara, US.†
Natural Language Processing in R (rNLP)fridolin.wild
The introductory slides of a workshop given to the doctoral school at the Institute of Business Informatics of the Goethe University Frankfurt. The tutorials are available on http://crunch.kmi.open.ac.uk/w/index.php/Tutorials
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
UiPath Test Automation using UiPath Test Suite series, part 6DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 6. In this session, we will cover Test Automation with generative AI and Open AI.
UiPath Test Automation with generative AI and Open AI webinar offers an in-depth exploration of leveraging cutting-edge technologies for test automation within the UiPath platform. Attendees will delve into the integration of generative AI, a test automation solution, with Open AI advanced natural language processing capabilities.
Throughout the session, participants will discover how this synergy empowers testers to automate repetitive tasks, enhance testing accuracy, and expedite the software testing life cycle. Topics covered include the seamless integration process, practical use cases, and the benefits of harnessing AI-driven automation for UiPath testing initiatives. By attending this webinar, testers, and automation professionals can gain valuable insights into harnessing the power of AI to optimize their test automation workflows within the UiPath ecosystem, ultimately driving efficiency and quality in software development processes.
What will you get from this session?
1. Insights into integrating generative AI.
2. Understanding how this integration enhances test automation within the UiPath platform
3. Practical demonstrations
4. Exploration of real-world use cases illustrating the benefits of AI-driven test automation for UiPath
Topics covered:
What is generative AI
Test Automation with generative AI and Open AI.
UiPath integration with generative AI
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
A tale of scale & speed: How the US Navy is enabling software delivery from l...sonjaschweigert1
Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved:
- Reduction in onboarding time from 5 weeks to 1 day
- Improved developer experience and productivity through actionable findings and reduction of false positives
- Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO)
Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production.
We will cover:
- How to remove silos in DevSecOps
- How to build efficient development pipeline roles and component templates
- How to deliver security artifacts that matter for ATO’s (SBOMs, vulnerability reports, and policy evidence)
- How to streamline operations with automated policy checks on container images
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
2. Tying shoelaces Douglas Adams’ ‘meaning of liff’: Epping: The futile movements of forefingers and eyebrows used when failing to attract the attention of waiters and barmen. Shoeburyness: The vague uncomfortable feeling you get when sitting on a seat which is still warm from somebody else's bottom I have been convincingly Sapir-Whorfed by this book. Concepts things we can (easily) learn from or express in language
4. Meaningful Interaction Analysis Two-Mode factor analysis of the co-occurrences in the terminology Results in a latent-semantic vector space Which can be analysed with Network Analysis
13. Evaluation Evaluating effectiveness: measure of the accuracy in representing conceptual development Can be measured with two complementary methods by assessing the external validity of: Concept Annotation: effectiveness in selecting accurate conceptual descriptors (with ratings) Concept Proximity: effectiveness in representing proximity (with card-sorts) By comparing against human ratings of 18 first-year medical students of the University of Manchester Medical School aged 19-21
14. Concept Annotation Annotation of 5 authentic postings again on ‘safe prescribing’ Selection of 10 top-loading concepts Adding of 5 random distracters Participants ranked on Likert Scale of 1 to 5how good the concept described the posting Human Interrater Correlation was measured with free marginal Kappa (Randolph, 2005) Conflated categories (1+2,3,4+5)
15. Concept Proximity (1) Four authentic learner blog postings about ‘safe prescribing’ generated ~ 50 top-loading concepts each Printed on cards Participants grouped them in piles Comparison of participant clustering with kmeans-based clustering in the MIA space 1% of term pairs put into same cluster by more than 12 participants 7% by between 7 and 12 1% term pairs: Spearman’s Rho as interrater correlation
17. Proximity (3) Silhouette width in the MIA space (Rousseeuw, 1986) Silhouette plots depict for each observation, how good the balance between its distances to its other cluster members compared to its distances within the next close cluster is.
18. Conclusion 1st year students do not have much agreement in rating the annotations, could be a sign of heterogeneous frames of reference Activation strength of the 10 concepts has not been taken into account (would be interesting!) Still: pretty good clustering results in the upper range Lower range: could be an artefact of the clustering (clustering of a folded-in posting, not clustering in the space) All in all: points towards rigorous use of thresholds Near human results (at the human overlap) Near human results (producing clearly better results than chance, but no perfect agreement)