SlideShare a Scribd company logo
From NLP to NLU: Why we need
varied, comprehensive, and
stratified knowledge, and how to
use it for Neuro-symbolic AI
Keynote at KnowledgeNLP-AAAI’23
Amit Sheth
Director, AI Institute of South Carolina
University of South Carolina
amit@sc.edu
#AIISC, http://aiisc.ai
AIISC portfolio in Core AI & Translational AI
2
Knowledge-infused
Learning (Neuro-
symbolic
/Hybrid AI)
Knowledge Graph
Development
Deep Learning
Reinforcement Learning
Natural Language
Processing/
Understanding/Generation
Multimodal AI
(IoT/sensor, data
streams, images,
emoji)
Collaborative &
Personal Assistants
Multiagent
Systems
Interpretability/
Explainability/Safety/
Trust/Ethics in AI
Medicine/Healthcare/Nursing
(Nutrition, Neurodevelopmental Disease,
Asthma, Diabetes, Hypertension, Autism,
Aphasia, Cognitive Disorders,
Oncology,...)
Neuroscience
Brain Science
Epidemiology
Education
Social Good/Harm
(Disinformation,
Harassment,Toxic
Content, Deception,
Extremism,
Radicalization)
Public Health
(Mental Health,
Addiction, COVID-19,
Epidemics)
Smart Manufacturing
(Digital Twins, Factory
of Future)
Disaster Management
(Response, Resilience
Pharma: drug
discovery,
vigilance
Autonomous
Systems
(Vehicles)
Automated
Planning
Computer
Vision
Gaming
Cognitive
Science
Science & Engg:
Radiation,
Astrophysics, Civil
Infra & Transportation
Law
Contents
3
1. Challenges with Current LMs
2. Possible Fixes and Limitations
3. Knowledge Infused Neuro-Symbolic AI
4. Before Transformers, Historical Context [Neuro Symbolic AI]
5. After Transformers, Current Context [Neuro Symbolic AI]
6. Future Context [Neuro-Symbolic AI] with Process Knowledge
Infusion.
Challenges with Current LMs
4
Tycoon
Did you mean: tycoon
Did you mean: typhoon
Did you mean: typography
tycoon:
0.00567%
Distributional Semantics:
Language Models are trained to compute the
distributional plausibility of language tokens
from enormous amounts of training tokens.
World Model Semantics:
Relationships and effects among the objects
that the language tokens describe, grounded
in the physical or conceptual reality of the
world humans experience (ontological
commitment).
Do World Model Semantics Arise as an
Emergent Capability of Distributional
Semantics at Scale? [No- not directly, not
specifically, not adequately]
Sheth, A., Ramakrishnan, C., & Thomas, C. (2005). Semantics for the semantic web: The implicit, the formal and the powerful. International Journal on Semantic Web and
Information Systems (IJSWIS), 1(1), 1-18., 2005, link
World Model Semantics from Distributed Semantics at Scale?
1.
Challenges with Current LMs
5
Formally
∀i ⊆ [N], N = Vocabulary Size,
Model the probability:
P(tokeni, …. , tokenN | token1, … tokeni)
Example Usage
Once Trained on Enormous amounts of Data,
Predict:
1. y ~ P(S = “Obama just won the 2032
Election”)?
2. P(S) does not pass a threshold test
=> y = 0, or False.
Tycoon
Did you mean: tycoon
Did you mean: typhoon
Did you mean: typography
tycoon:
0.00567%
Recency : World Model Semantics from Distributed Semantics at Scale
1.
Challenges with Current LMs
6
Distributional Semantics:
Language Models are trained to compute the
distributional plausibility of language tokens
from enormous amounts of training tokens.
E.g., What was the color of the white horse of
Napolean?
It is not very distributionally plausible that a
person asks the answer to a question they
already know
The answer is abundantly clear in the question
But, (a) distribution semantics has problem
with the low probability, and (b) it does not
understand! More challenges arise due to lack
of context.
Common Sense : World Model Semantics from Distributed Semantics at Scale
1.
Challenges with Current LMs
7
Distributional Semantics:
Language Models are trained to compute the
distributional plausibility of language tokens from
enormous amounts of training tokens.
E.g., Mike’s mum had 4 kids; 3 of them are X,Y,Z. What
is the name of 4th kid?
- Missing information is not clear
Answer is obviously Mike
1. Mike is not a pet
2. More than one child cannot have the same name
3. Kid names cannot be any word
4. Impossible answers even after the hint:
Mike, Luis, Drake, and Matilda, and All other
tokens in the input!
Distribution semantics does not have commonsense
and probability does not help.
Challenge - Missing Information: World Model Semantics from Distributed Semantics at Scale
1.
Challenges with Current LMs
8
Distributional Semantics:
Language Models are trained to compute the
distributional plausibility of language tokens
from enormous amounts of training tokens.
E.g., Mike’s mum had 4 kids;3 of them are X,Y,Z.
What is the name of 4th kid?
Human’s first answers (We asked 10 fellow
humans)
1. Mike
2. I think this is a trick question
Distribution semantics does not know to make
human-like assumptions about missing
information.
Challenge - Missing Information: World Model Semantics from Distributed Semantics at Scale
1.
Challenges with Current LMs
9
Distributional Semantics are a Cloud of Probabilities
The
director, AIISC, of,
Amit Sheth
All other words
Prof
I kind of
understand (not
really though)
Dense Representational Spaces
Hallucinations - It makes up things !
1.
Recency
Insufficiency of data alone
Common-sense
Hallucinations
Challenges to be addressed for NLU
10
User-Explainability Application-level Safety
Characteristics of NLU Capable Systems
1.
1.
11
Challenges to be addressed for NLU
Possible Fix: Instruct GPT
12
The Instruct GPT Framework
Ouyang, L., Wu, J., Jiang, X., Almeida, D., Wainwright, C. L., Mishkin, P., ... & Lowe, R. (2022). Training language models to
follow instructions with human feedback. arXiv preprint arXiv:2203.02155. link
No Problem I will make things larger and use
Instruction based Training
Hallucinations - It makes up things !
2.
13
1. 40 humans to capture the breadth of knowledge corresponding to the
data in LLMs seems small
2. The richness of human knowledge is compressed into a mere label
2.
14
Prof: Amit Sheth
Synonyms: Amit
Sheth
Degree:Ph.D.
Synonyms:
Doctorate
Company: AI
Institute,
Abbr: AIISC
Synonyms:
AI Institute,
South Carolina
employee_of
Let’s see what this model of
understanding can yield
Knowledge (Graphs) to the rescue
Fix: Addressing Hallucinations: Recency, Common-Sense, and Implicit Entity Mentions, etc.,
2.
15
World as Concepts vs. World as Probabilities.
Explicit model of recency and common-sense.
Supply missing Knowledge (entities, relationships).
Semantics supported by Knowledge (Graphs)
User-level Explanations and Safety Constraints
2.
Knowledge Infused
Neuro-symbolic AI
3.
3.
Knowledge Infused
Neuro-symbolic AI
Using Graphs in Neural Network
Pipelines
18
3.
Shallow Infusion Semi-Deep Infusion
Shades of KiL - Shallow and Semi-Deep Infusion
Sheth, Gaur, Kursuncu, & Wickramarachchi, (2019). Shades of knowledge-infused learning for enhancing deep learning. IEEE Internet Computing, 23(6), 54-63., link
3.
Shades of KiL - Deep Infusion
19
Deep Infusion
Sheth, Gaur, Kursuncu, & Wickramarachchi, (2019). Shades of knowledge-infused learning for enhancing deep learning. IEEE Internet Computing, 23(6), 54-63., link
Sheth, Gaur, Kursuncu, & Wickramarachchi, (2019). Shades of knowledge-infused learning for enhancing deep learning. IEEE Internet Computing, 23(6), 54-63., link
Characteristics/Method Distributed Semantics Shallow Infusion Semi-Deep Infusion Deep Infusion
Recency U-M- M M+ H
Filling in Missing
Information
U U M H
Hallucinations Unsatisfactory (U)
Get by but not really
solving the problem
(M)
Better but not fully
solve the problem (M+)
Broadly solve the
problem (H)
Characteristics of Knowledge Infusion
3.
20
3.
KiL - Generic Architecture
21
Before Transformers
Historical Context
4.
4.
KiL - SEDO (Shallow Infusion)
23
Gaur, Kursuncu, Alambo,, Sheth, Daniulaityte, Thirunarayan, & Pathak. (2018, October). " Let Me Tell You About Your Mental Health!" Contextualized Classification of
Reddit Posts to DSM-5 for Web-based Intervention. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management (pp. 753-762).,
link
After Transformers
Neuro-Symbolic AI
5.
5.
KiL - K-Adapter (Shallow Infusion)
25
Wang, R., Tang, D., Duan, N., Wei, Z., Huang, X. J., Ji, J., ... & Zhou, M. (2021, August). K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters. In Findings of the
Association for Computational Linguistics: ACL-IJCNLP 2021 (pp. 1405-1418), link
5.
KiL - KALA (Semi-Deep Infusion)
26
Kang, M., Baek, J., & Hwang, S. J. (2022, July). KALA: Knowledge-Augmented Language Model Adaptation. In Proceedings of the 2022 Conference of the North American
Chapter of the Association for Computational Linguistics: Human Language Technologies (pp. 5144-5167)., link
5.
KiL - TDLR (Semi-Deep Infusion)
27
Rawte, V., Chakraborty, M., Roy, K., Gaur, M., Faldu, K., Kikani, P., ... & Sheth, A. P. TDLR: Top Semantic-Down Syntactic Language Representation. In NeurIPS'22 Workshop on
All Things Attention: Bridging Different Perspectives on Attention., link
Knowledge Contexts Leads to Performance Gains With Smaller Models
Knowledge Contexts Leads to Performance Gains With Smaller Models and Smaller
Datasets
5.
TDLR - Results
28
Rawte, V., Chakraborty, M., Roy, K., Gaur, M., Faldu, K., Kikani, P., ... & Sheth, A. P. TDLR: Top Semantic-Down Syntactic Language Representation. In NeurIPS'22 Workshop on
All Things Attention: Bridging Different Perspectives on Attention., link
Knowledge Infused
Neuro-symbolic AI
Integrating Lifted Neural
Representations with Knowledge
Graphs
6.
Neural Network Abstract / Contextualization
ACT DECIDE
reasoning
Planning
Inference
Apply Process
Knowledge: User has
Specific concerns due to
X, Y, Z Concepts
Action:
Further Interact with
System User on their
concerns
Explicit Knowledge
Data
6.
30
Really struggling with my bisexuality which
is causing chaos in my relationship with a
girl. I am equal to worthless for her. I’m
now starting to get drunk because I can’t
cope with the obsessive, intrusive thoughts,
and need to get out of my head.
288291000119102: High risk bisexual behavior
365949003: Health-related behavior finding 365949003: Health-related behavior finding
307077003: Feeling hopeless
365107007: level of mood
225445003: Intrusive thoughts
55956009: Disturbance in content of thought
26628009: Disturbance in thinking
1376001: Obsessive compulsive personality
disorder
Multi-hop
traversal on
medical
knowledge graphs
<is symptom>
Obsessive-compulsive disorder is a disorder
in which people have obsessive, intrusive
thoughts, ideas or sensations that make them
feel driven to do something repetitively
6.
Knowledge Verified Interpretable Prediction through
linking to KG and definitions
31
Gaur, M., Desai, A., Faldu, K., & Sheth, A. (2020). Explainable ai using knowledge graphs. In ACM CoDS-COMAD Conference. Link, slide.
Process Knowledge Structure in C-SSRS
C-SSRS: Columbia Suicide Severity Rating Scale
I wish I could give a shit about what
would make it to the front page. I have
been there and got nothing. Same as
my life. I do have a gun.’, ’I thought I
was talking about it. I am not on a
ledge or something, but I do have my
gun in my lap.’, ’No. I made sure she
got an education and she knows how
to get a job. I also have recently
bought her clothes to make her more
attractive. She has told me she only
loves me because I buy her things.
1. Wish to be dead - Yes
2. Non-specific Active Suicidal
Thoughts - Yes
3. Active Suicidal Ideation with Some
Intent to Act - Yes
4. Label: Suicide Behavior or Attempt
Interpretable for System Users
i.e., Clinicians and Patients
(1,2,3 verify adherence to the
clinical guideline on diagnosis
which a clinician understands)
47%
70%
LLMs Process Knowledge
(Ours)
Agreement with Experts
6.
Knowledge Verified Interpretable Prediction
through Process Knowledge Structures
32
Sheth, A., Gaur, M., Roy, K., Venkataraman, R., & Khandelwal, V. (2022). Process Knowledge-Infused AI: Toward User-Level Explainability, Interpretability, and Safety. IEEE
Internet Computing, 26(5), 76-84., link
Do you feel nervous?
More than half the days
Do you feel irritated or
self destructive?
Do you feel something
extreme might happen to
you?
Are you able to relax?
Do you feel nervous?
More than half the days
Do you feel Irritated?
Are you bothered by
becoming easily annoyed
or irritable?
Are you bothered by any
relaxation troubles?
Knowledge
Infusion using
Medical
Questionnaire
(MedQ)
These questions
are medically
valid and safe, in
right sequence..
Safety
Check
s
6.
Knowledge Verified Interpretable and Safe Text
Generation through Process Knowledge Structures
33
Roy, K., Gaur, M., Soltani, M., Rawte, V., Kalyan, A., & Sheth, A. (2023). ProKnow: Process knowledge for safety constrained and explainable question generation for mental
health diagnostic assistance. Frontiers in Big Data, 5., link
● If the system were to give user-level explanation, it will need to incorporate/use
conceptual model (vocabularies, knowledge graph) used by the user. Purely data
driven system can at best give explanations that ML engineers (developers)
can use.
● Knowledge is multifaceted. I presented diverse knowledge to support different
levels of abstractions for NLU. There will be different knowledge for
abstractions involved in image understanding activity.
● Should you bring knowledge to the data level (embedding) or bring data to
the knowledge level (learn from data and align with knowledge)? For less
demanding intellectual activities (classification, prediction, recommendation) the
former will do; for more demanding intellectual activities (decision making and
taking actions with explanations), latter is most likely needed.
Takeaway
34
35
Acknowledgement
Funding supported in part by NSF Award#:
2133842 EAGER:
Advancing Neuro-symbolic AI with Deep
Knowledge-infused Learning
and other projects (see http://wiki.aiisc.ai)
Learn more:
● Website - http://aiisc.ai (projects, people, opensource, demos, open
data/tools, tutorials, workshops, papers)
● Wiki Page - http://wiki.aiisc.ai
● LinkedIn - http://linkedin.com/company/aiisc
● YouTube - http://youtube.com/aiisc (demos, tutorials, dissertations,
keynotes, invited talks)
Sheth core group
AIISC
Artificial Intelligence Institute of South Carolina (#AIISC)
Kaushik Roy Vedant Khandelwal
Also, Megha Chakraborty, Vipula Rawte, Yuxin Zi
36
Contribution/special thanks (this talk):

More Related Content

Similar to From NLP to NLU: Why we need varied, comprehensive, and stratified knowledge, and how to use it for Neuro-symbolic AI

Edet 637 Dual Coding Theory
Edet 637 Dual Coding TheoryEdet 637 Dual Coding Theory
Edet 637 Dual Coding Theoryguestb8ed61
 
Open Mining Education, Ethics & AI
Open Mining Education, Ethics & AIOpen Mining Education, Ethics & AI
Open Mining Education, Ethics & AI
Robert Farrow
 
Socially-Sensitive Interfaces: From Offline Studies to Interactive Experiences
Socially-Sensitive Interfaces: From Offline Studies to Interactive ExperiencesSocially-Sensitive Interfaces: From Offline Studies to Interactive Experiences
Socially-Sensitive Interfaces: From Offline Studies to Interactive Experiences
Elisabeth André
 
파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)
파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)
파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)
XAIC
 
Artificial Intelligence: The Promise, the Myth, and a Dose of Reality
Artificial Intelligence: The Promise, the Myth, and a Dose of RealityArtificial Intelligence: The Promise, the Myth, and a Dose of Reality
Artificial Intelligence: The Promise, the Myth, and a Dose of Reality
Dagmar Monett
 
using-multiple-intelligence-theory-in-the-mathematics-classroom (1).pdf
using-multiple-intelligence-theory-in-the-mathematics-classroom (1).pdfusing-multiple-intelligence-theory-in-the-mathematics-classroom (1).pdf
using-multiple-intelligence-theory-in-the-mathematics-classroom (1).pdf
MackGarcia
 
PyData Salamanca knowledge infusion in healthcare
PyData Salamanca knowledge infusion in healthcarePyData Salamanca knowledge infusion in healthcare
PyData Salamanca knowledge infusion in healthcare
Artificial Intelligence Institute at UofSC
 
PyData Salamanca knowledge infusion in healthcare
PyData Salamanca knowledge infusion in healthcarePyData Salamanca knowledge infusion in healthcare
PyData Salamanca knowledge infusion in healthcare
Manas Gaur
 
PyData Conference: Knowledge-infused Learning for Healthcare
PyData Conference: Knowledge-infused Learning for HealthcarePyData Conference: Knowledge-infused Learning for Healthcare
PyData Conference: Knowledge-infused Learning for Healthcare
Manas Gaur
 
Week 12 neural basis of consciousness : frontiers in consciousness research
Week 12 neural basis of consciousness : frontiers in consciousness researchWeek 12 neural basis of consciousness : frontiers in consciousness research
Week 12 neural basis of consciousness : frontiers in consciousness research
Nao (Naotsugu) Tsuchiya
 
Brain Research for Teachers & Other Curious Souls, 2013 update
Brain Research for Teachers & Other Curious Souls, 2013 updateBrain Research for Teachers & Other Curious Souls, 2013 update
Brain Research for Teachers & Other Curious Souls, 2013 update
Carolyn K.
 
People with Communication Disability Striving, Thriving,.pptx
People with Communication Disability Striving, Thriving,.pptxPeople with Communication Disability Striving, Thriving,.pptx
People with Communication Disability Striving, Thriving,.pptx
Bronwyn Hemsley
 
Artificial intelligence.pptx
Artificial intelligence.pptxArtificial intelligence.pptx
Singularity Pyramid overview Dec 21
Singularity Pyramid overview Dec 21Singularity Pyramid overview Dec 21
Singularity Pyramid overview Dec 21
Vo Viet Anh
 
More ways of symbol grounding for knowledge graphs?
More ways of symbol grounding for knowledge graphs?More ways of symbol grounding for knowledge graphs?
More ways of symbol grounding for knowledge graphs?
Paul Groth
 
CDE CONFERENCE POSTER Metaphor & elearning
CDE CONFERENCE POSTER Metaphor & elearningCDE CONFERENCE POSTER Metaphor & elearning
CDE CONFERENCE POSTER Metaphor & elearning
Mike Howarth Associates
 
Artificial intelligence - the futuristic world
Artificial intelligence - the futuristic world Artificial intelligence - the futuristic world
Artificial intelligence - the futuristic world
MANASJHAMJ
 
Is the future post-human? IDU planner
Is the future post-human? IDU plannerIs the future post-human? IDU planner
Is the future post-human? IDU planner
cinbarnsley
 

Similar to From NLP to NLU: Why we need varied, comprehensive, and stratified knowledge, and how to use it for Neuro-symbolic AI (20)

Edet 637 Dual Coding Theory
Edet 637 Dual Coding TheoryEdet 637 Dual Coding Theory
Edet 637 Dual Coding Theory
 
Artificial intel
Artificial intelArtificial intel
Artificial intel
 
Open Mining Education, Ethics & AI
Open Mining Education, Ethics & AIOpen Mining Education, Ethics & AI
Open Mining Education, Ethics & AI
 
Socially-Sensitive Interfaces: From Offline Studies to Interactive Experiences
Socially-Sensitive Interfaces: From Offline Studies to Interactive ExperiencesSocially-Sensitive Interfaces: From Offline Studies to Interactive Experiences
Socially-Sensitive Interfaces: From Offline Studies to Interactive Experiences
 
파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)
파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)
파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)
 
Artificial Intelligence: The Promise, the Myth, and a Dose of Reality
Artificial Intelligence: The Promise, the Myth, and a Dose of RealityArtificial Intelligence: The Promise, the Myth, and a Dose of Reality
Artificial Intelligence: The Promise, the Myth, and a Dose of Reality
 
using-multiple-intelligence-theory-in-the-mathematics-classroom (1).pdf
using-multiple-intelligence-theory-in-the-mathematics-classroom (1).pdfusing-multiple-intelligence-theory-in-the-mathematics-classroom (1).pdf
using-multiple-intelligence-theory-in-the-mathematics-classroom (1).pdf
 
PyData Salamanca knowledge infusion in healthcare
PyData Salamanca knowledge infusion in healthcarePyData Salamanca knowledge infusion in healthcare
PyData Salamanca knowledge infusion in healthcare
 
PyData Salamanca knowledge infusion in healthcare
PyData Salamanca knowledge infusion in healthcarePyData Salamanca knowledge infusion in healthcare
PyData Salamanca knowledge infusion in healthcare
 
PyData Conference: Knowledge-infused Learning for Healthcare
PyData Conference: Knowledge-infused Learning for HealthcarePyData Conference: Knowledge-infused Learning for Healthcare
PyData Conference: Knowledge-infused Learning for Healthcare
 
Week 12 neural basis of consciousness : frontiers in consciousness research
Week 12 neural basis of consciousness : frontiers in consciousness researchWeek 12 neural basis of consciousness : frontiers in consciousness research
Week 12 neural basis of consciousness : frontiers in consciousness research
 
Brain Research for Teachers & Other Curious Souls, 2013 update
Brain Research for Teachers & Other Curious Souls, 2013 updateBrain Research for Teachers & Other Curious Souls, 2013 update
Brain Research for Teachers & Other Curious Souls, 2013 update
 
People with Communication Disability Striving, Thriving,.pptx
People with Communication Disability Striving, Thriving,.pptxPeople with Communication Disability Striving, Thriving,.pptx
People with Communication Disability Striving, Thriving,.pptx
 
IS
ISIS
IS
 
Artificial intelligence.pptx
Artificial intelligence.pptxArtificial intelligence.pptx
Artificial intelligence.pptx
 
Singularity Pyramid overview Dec 21
Singularity Pyramid overview Dec 21Singularity Pyramid overview Dec 21
Singularity Pyramid overview Dec 21
 
More ways of symbol grounding for knowledge graphs?
More ways of symbol grounding for knowledge graphs?More ways of symbol grounding for knowledge graphs?
More ways of symbol grounding for knowledge graphs?
 
CDE CONFERENCE POSTER Metaphor & elearning
CDE CONFERENCE POSTER Metaphor & elearningCDE CONFERENCE POSTER Metaphor & elearning
CDE CONFERENCE POSTER Metaphor & elearning
 
Artificial intelligence - the futuristic world
Artificial intelligence - the futuristic world Artificial intelligence - the futuristic world
Artificial intelligence - the futuristic world
 
Is the future post-human? IDU planner
Is the future post-human? IDU plannerIs the future post-human? IDU planner
Is the future post-human? IDU planner
 

Recently uploaded

Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
UiPathCommunity
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
Vlad Stirbu
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
Jen Stirrup
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 

Recently uploaded (20)

Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 

From NLP to NLU: Why we need varied, comprehensive, and stratified knowledge, and how to use it for Neuro-symbolic AI

  • 1. From NLP to NLU: Why we need varied, comprehensive, and stratified knowledge, and how to use it for Neuro-symbolic AI Keynote at KnowledgeNLP-AAAI’23 Amit Sheth Director, AI Institute of South Carolina University of South Carolina amit@sc.edu #AIISC, http://aiisc.ai
  • 2. AIISC portfolio in Core AI & Translational AI 2 Knowledge-infused Learning (Neuro- symbolic /Hybrid AI) Knowledge Graph Development Deep Learning Reinforcement Learning Natural Language Processing/ Understanding/Generation Multimodal AI (IoT/sensor, data streams, images, emoji) Collaborative & Personal Assistants Multiagent Systems Interpretability/ Explainability/Safety/ Trust/Ethics in AI Medicine/Healthcare/Nursing (Nutrition, Neurodevelopmental Disease, Asthma, Diabetes, Hypertension, Autism, Aphasia, Cognitive Disorders, Oncology,...) Neuroscience Brain Science Epidemiology Education Social Good/Harm (Disinformation, Harassment,Toxic Content, Deception, Extremism, Radicalization) Public Health (Mental Health, Addiction, COVID-19, Epidemics) Smart Manufacturing (Digital Twins, Factory of Future) Disaster Management (Response, Resilience Pharma: drug discovery, vigilance Autonomous Systems (Vehicles) Automated Planning Computer Vision Gaming Cognitive Science Science & Engg: Radiation, Astrophysics, Civil Infra & Transportation Law
  • 3. Contents 3 1. Challenges with Current LMs 2. Possible Fixes and Limitations 3. Knowledge Infused Neuro-Symbolic AI 4. Before Transformers, Historical Context [Neuro Symbolic AI] 5. After Transformers, Current Context [Neuro Symbolic AI] 6. Future Context [Neuro-Symbolic AI] with Process Knowledge Infusion.
  • 4. Challenges with Current LMs 4 Tycoon Did you mean: tycoon Did you mean: typhoon Did you mean: typography tycoon: 0.00567% Distributional Semantics: Language Models are trained to compute the distributional plausibility of language tokens from enormous amounts of training tokens. World Model Semantics: Relationships and effects among the objects that the language tokens describe, grounded in the physical or conceptual reality of the world humans experience (ontological commitment). Do World Model Semantics Arise as an Emergent Capability of Distributional Semantics at Scale? [No- not directly, not specifically, not adequately] Sheth, A., Ramakrishnan, C., & Thomas, C. (2005). Semantics for the semantic web: The implicit, the formal and the powerful. International Journal on Semantic Web and Information Systems (IJSWIS), 1(1), 1-18., 2005, link World Model Semantics from Distributed Semantics at Scale? 1.
  • 5. Challenges with Current LMs 5 Formally ∀i ⊆ [N], N = Vocabulary Size, Model the probability: P(tokeni, …. , tokenN | token1, … tokeni) Example Usage Once Trained on Enormous amounts of Data, Predict: 1. y ~ P(S = “Obama just won the 2032 Election”)? 2. P(S) does not pass a threshold test => y = 0, or False. Tycoon Did you mean: tycoon Did you mean: typhoon Did you mean: typography tycoon: 0.00567% Recency : World Model Semantics from Distributed Semantics at Scale 1.
  • 6. Challenges with Current LMs 6 Distributional Semantics: Language Models are trained to compute the distributional plausibility of language tokens from enormous amounts of training tokens. E.g., What was the color of the white horse of Napolean? It is not very distributionally plausible that a person asks the answer to a question they already know The answer is abundantly clear in the question But, (a) distribution semantics has problem with the low probability, and (b) it does not understand! More challenges arise due to lack of context. Common Sense : World Model Semantics from Distributed Semantics at Scale 1.
  • 7. Challenges with Current LMs 7 Distributional Semantics: Language Models are trained to compute the distributional plausibility of language tokens from enormous amounts of training tokens. E.g., Mike’s mum had 4 kids; 3 of them are X,Y,Z. What is the name of 4th kid? - Missing information is not clear Answer is obviously Mike 1. Mike is not a pet 2. More than one child cannot have the same name 3. Kid names cannot be any word 4. Impossible answers even after the hint: Mike, Luis, Drake, and Matilda, and All other tokens in the input! Distribution semantics does not have commonsense and probability does not help. Challenge - Missing Information: World Model Semantics from Distributed Semantics at Scale 1.
  • 8. Challenges with Current LMs 8 Distributional Semantics: Language Models are trained to compute the distributional plausibility of language tokens from enormous amounts of training tokens. E.g., Mike’s mum had 4 kids;3 of them are X,Y,Z. What is the name of 4th kid? Human’s first answers (We asked 10 fellow humans) 1. Mike 2. I think this is a trick question Distribution semantics does not know to make human-like assumptions about missing information. Challenge - Missing Information: World Model Semantics from Distributed Semantics at Scale 1.
  • 9. Challenges with Current LMs 9 Distributional Semantics are a Cloud of Probabilities The director, AIISC, of, Amit Sheth All other words Prof I kind of understand (not really though) Dense Representational Spaces Hallucinations - It makes up things ! 1.
  • 10. Recency Insufficiency of data alone Common-sense Hallucinations Challenges to be addressed for NLU 10 User-Explainability Application-level Safety Characteristics of NLU Capable Systems 1.
  • 11. 1. 11 Challenges to be addressed for NLU
  • 12. Possible Fix: Instruct GPT 12 The Instruct GPT Framework Ouyang, L., Wu, J., Jiang, X., Almeida, D., Wainwright, C. L., Mishkin, P., ... & Lowe, R. (2022). Training language models to follow instructions with human feedback. arXiv preprint arXiv:2203.02155. link No Problem I will make things larger and use Instruction based Training Hallucinations - It makes up things ! 2.
  • 13. 13 1. 40 humans to capture the breadth of knowledge corresponding to the data in LLMs seems small 2. The richness of human knowledge is compressed into a mere label 2.
  • 14. 14 Prof: Amit Sheth Synonyms: Amit Sheth Degree:Ph.D. Synonyms: Doctorate Company: AI Institute, Abbr: AIISC Synonyms: AI Institute, South Carolina employee_of Let’s see what this model of understanding can yield Knowledge (Graphs) to the rescue Fix: Addressing Hallucinations: Recency, Common-Sense, and Implicit Entity Mentions, etc., 2.
  • 15. 15 World as Concepts vs. World as Probabilities. Explicit model of recency and common-sense. Supply missing Knowledge (entities, relationships). Semantics supported by Knowledge (Graphs) User-level Explanations and Safety Constraints 2.
  • 17. 3. Knowledge Infused Neuro-symbolic AI Using Graphs in Neural Network Pipelines
  • 18. 18 3. Shallow Infusion Semi-Deep Infusion Shades of KiL - Shallow and Semi-Deep Infusion Sheth, Gaur, Kursuncu, & Wickramarachchi, (2019). Shades of knowledge-infused learning for enhancing deep learning. IEEE Internet Computing, 23(6), 54-63., link
  • 19. 3. Shades of KiL - Deep Infusion 19 Deep Infusion Sheth, Gaur, Kursuncu, & Wickramarachchi, (2019). Shades of knowledge-infused learning for enhancing deep learning. IEEE Internet Computing, 23(6), 54-63., link
  • 20. Sheth, Gaur, Kursuncu, & Wickramarachchi, (2019). Shades of knowledge-infused learning for enhancing deep learning. IEEE Internet Computing, 23(6), 54-63., link Characteristics/Method Distributed Semantics Shallow Infusion Semi-Deep Infusion Deep Infusion Recency U-M- M M+ H Filling in Missing Information U U M H Hallucinations Unsatisfactory (U) Get by but not really solving the problem (M) Better but not fully solve the problem (M+) Broadly solve the problem (H) Characteristics of Knowledge Infusion 3. 20
  • 21. 3. KiL - Generic Architecture 21
  • 23. 4. KiL - SEDO (Shallow Infusion) 23 Gaur, Kursuncu, Alambo,, Sheth, Daniulaityte, Thirunarayan, & Pathak. (2018, October). " Let Me Tell You About Your Mental Health!" Contextualized Classification of Reddit Posts to DSM-5 for Web-based Intervention. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management (pp. 753-762)., link
  • 25. 5. KiL - K-Adapter (Shallow Infusion) 25 Wang, R., Tang, D., Duan, N., Wei, Z., Huang, X. J., Ji, J., ... & Zhou, M. (2021, August). K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (pp. 1405-1418), link
  • 26. 5. KiL - KALA (Semi-Deep Infusion) 26 Kang, M., Baek, J., & Hwang, S. J. (2022, July). KALA: Knowledge-Augmented Language Model Adaptation. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (pp. 5144-5167)., link
  • 27. 5. KiL - TDLR (Semi-Deep Infusion) 27 Rawte, V., Chakraborty, M., Roy, K., Gaur, M., Faldu, K., Kikani, P., ... & Sheth, A. P. TDLR: Top Semantic-Down Syntactic Language Representation. In NeurIPS'22 Workshop on All Things Attention: Bridging Different Perspectives on Attention., link
  • 28. Knowledge Contexts Leads to Performance Gains With Smaller Models Knowledge Contexts Leads to Performance Gains With Smaller Models and Smaller Datasets 5. TDLR - Results 28 Rawte, V., Chakraborty, M., Roy, K., Gaur, M., Faldu, K., Kikani, P., ... & Sheth, A. P. TDLR: Top Semantic-Down Syntactic Language Representation. In NeurIPS'22 Workshop on All Things Attention: Bridging Different Perspectives on Attention., link
  • 29. Knowledge Infused Neuro-symbolic AI Integrating Lifted Neural Representations with Knowledge Graphs 6.
  • 30. Neural Network Abstract / Contextualization ACT DECIDE reasoning Planning Inference Apply Process Knowledge: User has Specific concerns due to X, Y, Z Concepts Action: Further Interact with System User on their concerns Explicit Knowledge Data 6. 30
  • 31. Really struggling with my bisexuality which is causing chaos in my relationship with a girl. I am equal to worthless for her. I’m now starting to get drunk because I can’t cope with the obsessive, intrusive thoughts, and need to get out of my head. 288291000119102: High risk bisexual behavior 365949003: Health-related behavior finding 365949003: Health-related behavior finding 307077003: Feeling hopeless 365107007: level of mood 225445003: Intrusive thoughts 55956009: Disturbance in content of thought 26628009: Disturbance in thinking 1376001: Obsessive compulsive personality disorder Multi-hop traversal on medical knowledge graphs <is symptom> Obsessive-compulsive disorder is a disorder in which people have obsessive, intrusive thoughts, ideas or sensations that make them feel driven to do something repetitively 6. Knowledge Verified Interpretable Prediction through linking to KG and definitions 31 Gaur, M., Desai, A., Faldu, K., & Sheth, A. (2020). Explainable ai using knowledge graphs. In ACM CoDS-COMAD Conference. Link, slide.
  • 32. Process Knowledge Structure in C-SSRS C-SSRS: Columbia Suicide Severity Rating Scale I wish I could give a shit about what would make it to the front page. I have been there and got nothing. Same as my life. I do have a gun.’, ’I thought I was talking about it. I am not on a ledge or something, but I do have my gun in my lap.’, ’No. I made sure she got an education and she knows how to get a job. I also have recently bought her clothes to make her more attractive. She has told me she only loves me because I buy her things. 1. Wish to be dead - Yes 2. Non-specific Active Suicidal Thoughts - Yes 3. Active Suicidal Ideation with Some Intent to Act - Yes 4. Label: Suicide Behavior or Attempt Interpretable for System Users i.e., Clinicians and Patients (1,2,3 verify adherence to the clinical guideline on diagnosis which a clinician understands) 47% 70% LLMs Process Knowledge (Ours) Agreement with Experts 6. Knowledge Verified Interpretable Prediction through Process Knowledge Structures 32 Sheth, A., Gaur, M., Roy, K., Venkataraman, R., & Khandelwal, V. (2022). Process Knowledge-Infused AI: Toward User-Level Explainability, Interpretability, and Safety. IEEE Internet Computing, 26(5), 76-84., link
  • 33. Do you feel nervous? More than half the days Do you feel irritated or self destructive? Do you feel something extreme might happen to you? Are you able to relax? Do you feel nervous? More than half the days Do you feel Irritated? Are you bothered by becoming easily annoyed or irritable? Are you bothered by any relaxation troubles? Knowledge Infusion using Medical Questionnaire (MedQ) These questions are medically valid and safe, in right sequence.. Safety Check s 6. Knowledge Verified Interpretable and Safe Text Generation through Process Knowledge Structures 33 Roy, K., Gaur, M., Soltani, M., Rawte, V., Kalyan, A., & Sheth, A. (2023). ProKnow: Process knowledge for safety constrained and explainable question generation for mental health diagnostic assistance. Frontiers in Big Data, 5., link
  • 34. ● If the system were to give user-level explanation, it will need to incorporate/use conceptual model (vocabularies, knowledge graph) used by the user. Purely data driven system can at best give explanations that ML engineers (developers) can use. ● Knowledge is multifaceted. I presented diverse knowledge to support different levels of abstractions for NLU. There will be different knowledge for abstractions involved in image understanding activity. ● Should you bring knowledge to the data level (embedding) or bring data to the knowledge level (learn from data and align with knowledge)? For less demanding intellectual activities (classification, prediction, recommendation) the former will do; for more demanding intellectual activities (decision making and taking actions with explanations), latter is most likely needed. Takeaway 34
  • 35. 35 Acknowledgement Funding supported in part by NSF Award#: 2133842 EAGER: Advancing Neuro-symbolic AI with Deep Knowledge-infused Learning and other projects (see http://wiki.aiisc.ai) Learn more: ● Website - http://aiisc.ai (projects, people, opensource, demos, open data/tools, tutorials, workshops, papers) ● Wiki Page - http://wiki.aiisc.ai ● LinkedIn - http://linkedin.com/company/aiisc ● YouTube - http://youtube.com/aiisc (demos, tutorials, dissertations, keynotes, invited talks)
  • 36. Sheth core group AIISC Artificial Intelligence Institute of South Carolina (#AIISC) Kaushik Roy Vedant Khandelwal Also, Megha Chakraborty, Vipula Rawte, Yuxin Zi 36 Contribution/special thanks (this talk):