SlideShare a Scribd company logo
1 of 12
Download to read offline
Unveiling the Reasoning
Abilities of Large
Language Models:
A Comprehensive Review
Diving deep into “Towards Reasoning in Large Language Models:
A Survey”, a survey paper written by Jie Huang and Kevin Chen-
Chuan Chang at the University of Illinois at Urbana-Champaign
in 2023
Presented by Kelly Nguyen
Research
01 02 03 04
Introduction Understanding
Reasoning in LLMs
Towards
Reasoning
Evaluation of
Reasoning
05 06 07 08
Findings,
Challenges, and
Future Directions
Personal Insights:
Ethical Considerations
& Impact
Conclusions
Overview
Published in July 2023, this paper explores the
capabilities of large language models in reasoning,
discussing methods for enhancing reasoning skills
in these models and suggesting future research
directions. Although it primarily focuses on natural
language processing, the review is comprehensive
and up-to-date, possibly providing insights that
could extend to other modalities.
Research
“Towards Reasoning in Large Language Models: A Survey”,
written by Jie Huang and Kevin Chen-Chuan Chang at the
University of Illinois at Urbana-Champaign
As we stand on the brink of a new era in artificial intelligence (AI),
the advancements we’ve witnessed in recent years are not just
accelerating — they’re reshaping our understanding of what
machines are capable of.
This comprehensive survey not only explores how these models are
programmed and trained to simulate reasoning but also challenges us
to consider whether these behaviors signify genuine understanding or
simply sophisticated pattern recognition. As we explore the intricate
dance between data training and emergent machine intelligence, we
uncover insights into how close we truly are to achieving machines
that think and reason like humans.
Introduction
Reasoning, a fundamental component of
human intelligence, involves the ability to
solve problems, make decisions, and
deduce new information from known facts.
Prompting and In-Context Learning
These techniques involve guiding the model through structured prompts
that mimic a logical reasoning process. For example, Chain of Thought
prompting encourages the model to break down a problem into sub-
components before reaching a conclusion.
Fine-Tuning on Specialized Tasks
By adjusting the model parameters specifically for tasks that require
reasoning, researchers can enhance the model’s ability to handle similar
tasks in the future.
Understanding
Reasoning in LLMs
Chain of Thought and
Variants
This approach helps in
cultivating a deeper, more
intuitive reasoning capability in
language models, encouraging
them to mimic human-like
thinking more closely. As a result
of using chain-of-thought (CoT)
prompting, models are trained
to generate comprehensive
rationales that logically lead to
the final answer when faced
with a query.
Rationale Engineering
Rationale engineering is a
strategy designed to enhance
how reasoning is elicited or
utilized in large language
models (LLMs). This process
involves refining rationales —
creating and using more
effective reasoning examples
that guide the models.
Problem
Decomposition
This method, known as
“decomposition” or “divide and
conquer,” involves solving each
part of the problem individually
before synthesizing these
solutions to tackle the
overarching issue. This approach
not only simplifies the problem-
solving process but also aligns
each step toward a
comprehensive resolution of the
whole problem.
Towards Reasoning
in LLMS
To measure the reasoning ability of LLMs, researchers
use benchmarks designed to test various forms of
reasoning such as deductive, inductive, and abductive
reasoning.
Arithmetic Reasoning
1.
Commonsense Reasoning
2.
Symbolic Reasoning
3.
Evaluation of
Reasoning
1. Emergence of Reasoning Abilities
2. Chain of Thought Prompts
3. Human Like Reasoning Patterns
4. Challenges with Complex Reasoning
5. Applicability to Real-World Tasks
6. Enhancing Reasoning Abilities
Findings
Despite their capabilities, LLMs face significant challenges:
Generalization: LLMs often struggle to apply learned
reasoning to new types of problems not covered in their
training data.
Understanding vs. Performance: There is a difference
between performing well on a task and truly
understanding the underlying processes, which LLMs have
yet to bridge.
Challenges and
Future Directions
As LLMs become more capable, their impact on society and
ethical considerations cannot be overstated. Issues such as
data privacy, model bias, and the potential for misuse need to
be addressed to ensure these technologies are used
responsibly.
Personal Insights:
Ethical Considerations
& Impact
“Towards Reasoning in Large Language Models: A Survey”
provides an insightful look into the state of LLMs in terms of
reasoning capabilities. It highlights both the advancements and
the roadblocks, offering a detailed roadmap for future research.
This survey serves not just as an academic review but also as a
critical reflection on the broader implications of deploying these
powerful models in everyday applications.
Conclusion
Thank You
For further questions, feel free to email me at
kelly.nguyen01@sjsu.edu

More Related Content

Similar to Short Story: Unveiling the Reasoning Abilities of Large Language Models by Kelly Nguyen.pdf

2. Research 1 Design Thinking-Poster MUDD 2015
2. Research 1 Design Thinking-Poster MUDD 20152. Research 1 Design Thinking-Poster MUDD 2015
2. Research 1 Design Thinking-Poster MUDD 2015
Maria Violet Miller
 
CRITICAL THINKING1Michael PriebeSouthern New Hampshire U
CRITICAL THINKING1Michael PriebeSouthern New Hampshire UCRITICAL THINKING1Michael PriebeSouthern New Hampshire U
CRITICAL THINKING1Michael PriebeSouthern New Hampshire U
MargenePurnell14
 
ITS 832Chapter 5From Building a Model to Adaptive Robust.docx
ITS 832Chapter 5From Building a Model to Adaptive Robust.docxITS 832Chapter 5From Building a Model to Adaptive Robust.docx
ITS 832Chapter 5From Building a Model to Adaptive Robust.docx
donnajames55
 
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
CITE
 
271_AI Lect Notes.pdf
271_AI Lect Notes.pdf271_AI Lect Notes.pdf
271_AI Lect Notes.pdf
kaxeca4096
 
6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx
priestmanmable
 
6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx
sodhi3
 

Similar to Short Story: Unveiling the Reasoning Abilities of Large Language Models by Kelly Nguyen.pdf (20)

Art%3 a10.1007%2fs40593 015-0087-3
Art%3 a10.1007%2fs40593 015-0087-3Art%3 a10.1007%2fs40593 015-0087-3
Art%3 a10.1007%2fs40593 015-0087-3
 
2. Research 1 Design Thinking-Poster MUDD 2015
2. Research 1 Design Thinking-Poster MUDD 20152. Research 1 Design Thinking-Poster MUDD 2015
2. Research 1 Design Thinking-Poster MUDD 2015
 
Argument Essay Thesis. Definition Argument Thesis Examples - Thesis Title Ide...
Argument Essay Thesis. Definition Argument Thesis Examples - Thesis Title Ide...Argument Essay Thesis. Definition Argument Thesis Examples - Thesis Title Ide...
Argument Essay Thesis. Definition Argument Thesis Examples - Thesis Title Ide...
 
Automatic Assessment of University Teachers Critical Thinking Levels.pdf
Automatic Assessment of University Teachers  Critical Thinking Levels.pdfAutomatic Assessment of University Teachers  Critical Thinking Levels.pdf
Automatic Assessment of University Teachers Critical Thinking Levels.pdf
 
CRITICAL THINKING1Michael PriebeSouthern New Hampshire U
CRITICAL THINKING1Michael PriebeSouthern New Hampshire UCRITICAL THINKING1Michael PriebeSouthern New Hampshire U
CRITICAL THINKING1Michael PriebeSouthern New Hampshire U
 
Critical thinking1 michael priebesouthern new hampshire u
Critical thinking1 michael priebesouthern new hampshire uCritical thinking1 michael priebesouthern new hampshire u
Critical thinking1 michael priebesouthern new hampshire u
 
Intro to lmls
Intro to lmlsIntro to lmls
Intro to lmls
 
1.6 introduction to Logomonic Learning System (LMLS)
1.6 introduction to Logomonic Learning System (LMLS)1.6 introduction to Logomonic Learning System (LMLS)
1.6 introduction to Logomonic Learning System (LMLS)
 
ITS 832Chapter 5From Building a Model to Adaptive Robust.docx
ITS 832Chapter 5From Building a Model to Adaptive Robust.docxITS 832Chapter 5From Building a Model to Adaptive Robust.docx
ITS 832Chapter 5From Building a Model to Adaptive Robust.docx
 
Redefining AI’s Understanding: The Emergence of Concept-Aware Large Language ...
Redefining AI’s Understanding: The Emergence of Concept-Aware Large Language ...Redefining AI’s Understanding: The Emergence of Concept-Aware Large Language ...
Redefining AI’s Understanding: The Emergence of Concept-Aware Large Language ...
 
11
1111
11
 
Motivation, affective aspects, and knowledge maturing
Motivation, affective aspects, and knowledge maturingMotivation, affective aspects, and knowledge maturing
Motivation, affective aspects, and knowledge maturing
 
Rsqrd AI: Recent Advances in Explainable Machine Learning Research
Rsqrd AI: Recent Advances in Explainable Machine Learning ResearchRsqrd AI: Recent Advances in Explainable Machine Learning Research
Rsqrd AI: Recent Advances in Explainable Machine Learning Research
 
Cognitive Tools
Cognitive ToolsCognitive Tools
Cognitive Tools
 
Lo3 Other Elements of Research
Lo3  Other Elements of ResearchLo3  Other Elements of Research
Lo3 Other Elements of Research
 
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
 
271_AI Lect Notes.pdf
271_AI Lect Notes.pdf271_AI Lect Notes.pdf
271_AI Lect Notes.pdf
 
Chapter 8: Theories of Media Cognition and Information Processing (final vers...
Chapter 8: Theories of Media Cognition and Information Processing (final vers...Chapter 8: Theories of Media Cognition and Information Processing (final vers...
Chapter 8: Theories of Media Cognition and Information Processing (final vers...
 
6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx
 
6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx
 

Recently uploaded

Recently uploaded (20)

UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaUNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
 
The Strategic Impact of Buying vs Building in Test Automation
The Strategic Impact of Buying vs Building in Test AutomationThe Strategic Impact of Buying vs Building in Test Automation
The Strategic Impact of Buying vs Building in Test Automation
 
Auto Affiliate AI Earns First Commission in 3 Hours..pdf
Auto Affiliate  AI Earns First Commission in 3 Hours..pdfAuto Affiliate  AI Earns First Commission in 3 Hours..pdf
Auto Affiliate AI Earns First Commission in 3 Hours..pdf
 
Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...
Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...
Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...
 
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
 
Automate your OpenSIPS config tests - OpenSIPS Summit 2024
Automate your OpenSIPS config tests - OpenSIPS Summit 2024Automate your OpenSIPS config tests - OpenSIPS Summit 2024
Automate your OpenSIPS config tests - OpenSIPS Summit 2024
 
Microsoft365_Dev_Security_2024_05_16.pdf
Microsoft365_Dev_Security_2024_05_16.pdfMicrosoft365_Dev_Security_2024_05_16.pdf
Microsoft365_Dev_Security_2024_05_16.pdf
 
architecting-ai-in-the-enterprise-apis-and-applications.pdf
architecting-ai-in-the-enterprise-apis-and-applications.pdfarchitecting-ai-in-the-enterprise-apis-and-applications.pdf
architecting-ai-in-the-enterprise-apis-and-applications.pdf
 
Test Automation Design Patterns_ A Comprehensive Guide.pdf
Test Automation Design Patterns_ A Comprehensive Guide.pdfTest Automation Design Patterns_ A Comprehensive Guide.pdf
Test Automation Design Patterns_ A Comprehensive Guide.pdf
 
Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...
Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...
Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...
 
Abortion Pill Prices Turfloop ](+27832195400*)[ 🏥 Women's Abortion Clinic in ...
Abortion Pill Prices Turfloop ](+27832195400*)[ 🏥 Women's Abortion Clinic in ...Abortion Pill Prices Turfloop ](+27832195400*)[ 🏥 Women's Abortion Clinic in ...
Abortion Pill Prices Turfloop ](+27832195400*)[ 🏥 Women's Abortion Clinic in ...
 
Community is Just as Important as Code by Andrea Goulet
Community is Just as Important as Code by Andrea GouletCommunity is Just as Important as Code by Andrea Goulet
Community is Just as Important as Code by Andrea Goulet
 
Lessons Learned from Building a Serverless Notifications System.pdf
Lessons Learned from Building a Serverless Notifications System.pdfLessons Learned from Building a Serverless Notifications System.pdf
Lessons Learned from Building a Serverless Notifications System.pdf
 
GraphSummit Milan - Visione e roadmap del prodotto Neo4j
GraphSummit Milan - Visione e roadmap del prodotto Neo4jGraphSummit Milan - Visione e roadmap del prodotto Neo4j
GraphSummit Milan - Visione e roadmap del prodotto Neo4j
 
Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...
Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...
Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...
 
Encryption Recap: A Refresher on Key Concepts
Encryption Recap: A Refresher on Key ConceptsEncryption Recap: A Refresher on Key Concepts
Encryption Recap: A Refresher on Key Concepts
 
Modern binary build systems - PyCon 2024
Modern binary build systems - PyCon 2024Modern binary build systems - PyCon 2024
Modern binary build systems - PyCon 2024
 
Novo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMsNovo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMs
 
Rapidoform for Modern Form Building and Insights
Rapidoform for Modern Form Building and InsightsRapidoform for Modern Form Building and Insights
Rapidoform for Modern Form Building and Insights
 
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Workshop -  Architecting Innovative Graph Applications- GraphSummit MilanWorkshop -  Architecting Innovative Graph Applications- GraphSummit Milan
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
 

Short Story: Unveiling the Reasoning Abilities of Large Language Models by Kelly Nguyen.pdf

  • 1. Unveiling the Reasoning Abilities of Large Language Models: A Comprehensive Review Diving deep into “Towards Reasoning in Large Language Models: A Survey”, a survey paper written by Jie Huang and Kevin Chen- Chuan Chang at the University of Illinois at Urbana-Champaign in 2023 Presented by Kelly Nguyen
  • 2. Research 01 02 03 04 Introduction Understanding Reasoning in LLMs Towards Reasoning Evaluation of Reasoning 05 06 07 08 Findings, Challenges, and Future Directions Personal Insights: Ethical Considerations & Impact Conclusions Overview
  • 3. Published in July 2023, this paper explores the capabilities of large language models in reasoning, discussing methods for enhancing reasoning skills in these models and suggesting future research directions. Although it primarily focuses on natural language processing, the review is comprehensive and up-to-date, possibly providing insights that could extend to other modalities. Research “Towards Reasoning in Large Language Models: A Survey”, written by Jie Huang and Kevin Chen-Chuan Chang at the University of Illinois at Urbana-Champaign
  • 4. As we stand on the brink of a new era in artificial intelligence (AI), the advancements we’ve witnessed in recent years are not just accelerating — they’re reshaping our understanding of what machines are capable of. This comprehensive survey not only explores how these models are programmed and trained to simulate reasoning but also challenges us to consider whether these behaviors signify genuine understanding or simply sophisticated pattern recognition. As we explore the intricate dance between data training and emergent machine intelligence, we uncover insights into how close we truly are to achieving machines that think and reason like humans. Introduction
  • 5. Reasoning, a fundamental component of human intelligence, involves the ability to solve problems, make decisions, and deduce new information from known facts. Prompting and In-Context Learning These techniques involve guiding the model through structured prompts that mimic a logical reasoning process. For example, Chain of Thought prompting encourages the model to break down a problem into sub- components before reaching a conclusion. Fine-Tuning on Specialized Tasks By adjusting the model parameters specifically for tasks that require reasoning, researchers can enhance the model’s ability to handle similar tasks in the future. Understanding Reasoning in LLMs
  • 6. Chain of Thought and Variants This approach helps in cultivating a deeper, more intuitive reasoning capability in language models, encouraging them to mimic human-like thinking more closely. As a result of using chain-of-thought (CoT) prompting, models are trained to generate comprehensive rationales that logically lead to the final answer when faced with a query. Rationale Engineering Rationale engineering is a strategy designed to enhance how reasoning is elicited or utilized in large language models (LLMs). This process involves refining rationales — creating and using more effective reasoning examples that guide the models. Problem Decomposition This method, known as “decomposition” or “divide and conquer,” involves solving each part of the problem individually before synthesizing these solutions to tackle the overarching issue. This approach not only simplifies the problem- solving process but also aligns each step toward a comprehensive resolution of the whole problem. Towards Reasoning in LLMS
  • 7. To measure the reasoning ability of LLMs, researchers use benchmarks designed to test various forms of reasoning such as deductive, inductive, and abductive reasoning. Arithmetic Reasoning 1. Commonsense Reasoning 2. Symbolic Reasoning 3. Evaluation of Reasoning
  • 8. 1. Emergence of Reasoning Abilities 2. Chain of Thought Prompts 3. Human Like Reasoning Patterns 4. Challenges with Complex Reasoning 5. Applicability to Real-World Tasks 6. Enhancing Reasoning Abilities Findings
  • 9. Despite their capabilities, LLMs face significant challenges: Generalization: LLMs often struggle to apply learned reasoning to new types of problems not covered in their training data. Understanding vs. Performance: There is a difference between performing well on a task and truly understanding the underlying processes, which LLMs have yet to bridge. Challenges and Future Directions
  • 10. As LLMs become more capable, their impact on society and ethical considerations cannot be overstated. Issues such as data privacy, model bias, and the potential for misuse need to be addressed to ensure these technologies are used responsibly. Personal Insights: Ethical Considerations & Impact
  • 11. “Towards Reasoning in Large Language Models: A Survey” provides an insightful look into the state of LLMs in terms of reasoning capabilities. It highlights both the advancements and the roadblocks, offering a detailed roadmap for future research. This survey serves not just as an academic review but also as a critical reflection on the broader implications of deploying these powerful models in everyday applications. Conclusion
  • 12. Thank You For further questions, feel free to email me at kelly.nguyen01@sjsu.edu