Chain-of-thought prompting decomposes complex reasoning tasks into intermediate natural-language steps, which the model is prompted to produce before stating its final answer. It has been shown to improve arithmetic word-problem solving: few-shot exemplars demonstrate the steps and equations used to reach the answer, and the model imitates that structure on new problems. An ablation study found that including the intermediate reasoning steps outperformed showing only the equation or only the final computed answer. While promising for improving reasoning ability, chain-of-thought prompting may not truly elicit human-like reasoning, and it can be costly to apply, both in the annotation effort needed to write step-by-step exemplars and in the large model sizes required for the benefits to appear.
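To make the technique concrete, the sketch below contrasts a standard few-shot prompt with a chain-of-thought prompt for an arithmetic word problem. This is a minimal illustration, not the original paper's exact setup: the `query_model` stub is a hypothetical stand-in for any text-completion API, and the exemplar wording is illustrative.

```python
# Minimal sketch of chain-of-thought (CoT) prompting for arithmetic
# word problems. `query_model` is a hypothetical placeholder for a
# real large-language-model completion API.

def query_model(prompt: str) -> str:
    """Hypothetical LLM call; wire this to an actual model endpoint."""
    raise NotImplementedError

# Standard few-shot exemplar: the worked example shows only the
# final answer, with no intermediate reasoning.
STANDARD_EXEMPLAR = (
    "Q: Roger has 5 tennis balls. He buys 2 cans of 3 tennis balls "
    "each. How many tennis balls does he have now?\n"
    "A: The answer is 11.\n"
)

# Chain-of-thought exemplar: the same problem, but the answer spells
# out the intermediate steps and equations before the final result.
COT_EXEMPLAR = (
    "Q: Roger has 5 tennis balls. He buys 2 cans of 3 tennis balls "
    "each. How many tennis balls does he have now?\n"
    "A: Roger started with 5 balls. 2 cans of 3 balls each is "
    "2 * 3 = 6 balls. 5 + 6 = 11. The answer is 11.\n"
)

def build_prompt(exemplar: str, question: str) -> str:
    # Few-shot format: worked exemplar(s) followed by the new question.
    return f"{exemplar}\nQ: {question}\nA:"

if __name__ == "__main__":
    question = ("The cafeteria had 23 apples. It used 20 for lunch "
                "and bought 6 more. How many apples does it have?")
    # The CoT prompt leads the model to emit its own reasoning steps
    # before the answer; the standard prompt elicits only an answer.
    print(build_prompt(COT_EXEMPLAR, question))
```

The only difference between the two conditions is the exemplar text; the ablation described above amounts to swapping `COT_EXEMPLAR` for variants that keep only the equation (`2 * 3 = 6, 5 + 6 = 11`) or only the final answer, and comparing accuracy.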