You and Your Research -- LLMs Perspective
1. You and Your Research
LLMs Perspective
Dr Mohamed Elawady
Department of Computer and Information Sciences
University of Strathclyde
4th ML/AI Workshop
14th Sep 2023
2. Agenda
● Introduction: LLMs
● History of LLMs
● LLMs + Chatbots
● LLMs + Research
“I visualise a time when we will be to robots what dogs are to humans, and I’m rooting for the machines.”
Claude Shannon (1916-2001)
https://www.reddit.com/r/ChatGPTMemes/comments/102mvys/yours_sincerely_chatgpt/?rdt=43569
3. Introduction: LLMs
Large Language Model (LLM): Natural Language Processing (NLP) + Deep Learning (DL)
● Basic: Input (text), Output (text)
● How: self-supervised and semi-supervised training over massive text datasets (terabytes); chat-oriented models are then typically fine-tuned further, e.g. with reinforcement learning from human feedback (RLHF).
https://lifearchitect.ai/models/
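The "self-supervised" point above can be sketched in plain Python: the training pairs (context, next token) come from the raw text itself, with no human labelling. This is a toy illustration (whitespace tokenizer, tiny context window), not how production tokenizers work.

```python
def make_training_pairs(text, context_size=3):
    """Slice raw text into (context, next_token) pairs -- the labels
    come from the data itself, which is what 'self-supervised' means."""
    tokens = text.split()  # toy whitespace "tokenizer"
    pairs = []
    for i in range(context_size, len(tokens)):
        context = tokens[i - context_size:i]  # the preceding tokens
        target = tokens[i]                    # the token to predict
        pairs.append((context, target))
    return pairs

pairs = make_training_pairs("the cat sat on the mat", context_size=3)
print(pairs[0])  # (['the', 'cat', 'sat'], 'on')
```

A real LLM does the same thing at scale: every position in terabytes of text yields a free training example.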
4. History of LLMs
Zhao, Wayne Xin, et al. "A survey of large language models." arXiv preprint arXiv:2303.18223 (2023).
● What’s behind
○ Transformers
○ Massive data
○ GPUs
● Popular
○ OpenAI GPT-3/4
○ Google Bard
○ Meta LLaMA
○ Google T5
○ BLOOM
● Coming Soon!
○ DeepMind Gemini
○ OpenAI GPT-5
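The "Transformers" ingredient above boils down to scaled dot-product attention. A minimal pure-Python sketch (toy 2-dimensional vectors, no learned weight matrices, purely illustrative):

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(query, keys, values):
    """Scaled dot-product attention for one query over key/value vectors:
    score each key against the query, normalise with softmax, then
    return the weighted average of the values."""
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
              for key in keys]
    weights = softmax(scores)
    return [sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))]

out = attention([1.0, 0.0], [[1.0, 0.0], [0.0, 1.0]],
                [[10.0, 0.0], [0.0, 10.0]])
print(out)  # leans toward the first value, since the query matches the first key
```

A real Transformer applies this in parallel for every token position (with learned query/key/value projections and many heads); GPUs make that massive parallelism cheap, which is why the three ingredients on this slide go together.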
7. More Resources
● LLM Introduction: Learn Language Models, GitHub Gist:
https://gist.github.com/rain-1/eebd5e5eb2784feecf450324e3341c8d
● Awesome-LLM: a curated list of Large Language Model resources, GitHub:
https://github.com/Hannibal046/Awesome-LLM
● Demos over Hugging Face platform (signup required)
○ Text-to-Text Generation: https://huggingface.co/google/flan-t5-base
○ Text Summarization: https://huggingface.co/facebook/bart-large-cnn
○ Text Generation: https://huggingface.co/bigscience/bloom
8. References
● (GPT-3) Brown, Tom, et al. "Language models are few-shot learners." Advances in neural information processing systems 33
(2020): 1877-1901.
● (GPT-4) OpenAI. “GPT-4 Technical Report.” ArXiv abs/2303.08774 (2023).
● (LaMDA) Thoppilan, Romal, et al. "Lamda: Language models for dialog applications." arXiv preprint arXiv:2201.08239
(2022).
● (SciBERT) Beltagy, Iz, Kyle Lo, and Arman Cohan. "SciBERT: A pretrained language model for scientific text." arXiv preprint
arXiv:1903.10676 (2019).
● (Sentence-bert) Reimers, Nils, and Iryna Gurevych. "Sentence-bert: Sentence embeddings using siamese bert-networks."
arXiv preprint arXiv:1908.10084 (2019).
● (T5) Raffel, Colin, et al. "Exploring the limits of transfer learning with a unified text-to-text transformer." The Journal of
Machine Learning Research 21.1 (2020): 5485-5551.
● (LLaMA) Touvron, Hugo, et al. "Llama: Open and efficient foundation language models." arXiv preprint arXiv:2302.13971
(2023).
● (BLOOM) Scao, Teven Le, et al. "Bloom: A 176b-parameter open-access multilingual language model." arXiv preprint
arXiv:2211.05100 (2022).
● (PaLM) Chowdhery, Aakanksha, et al. "Palm: Scaling language modeling with pathways." arXiv preprint arXiv:2204.02311
(2022).
● (Chinchilla) Hoffmann, Jordan, et al. "Training compute-optimal large language models." arXiv preprint arXiv:2203.15556
(2022).