KGM Mastering Classification and Regression with LLMs: Insights from Kaggle Competitions

•Download as PPTX, PDF•

0 likes•19 views

Sri Ambati

Philipp Singer, Senior Principal Data Scientist, H2O.ai H2O Open Source GenAI World SF 2023

Technology

v
H2O.ai Confidential
Senior Principal Data Scientist at H2O.ai
● Vienna / Austria
● Kaggle GM
● All things deep learning
● I love training models
● H2O Hydrogen Torch
● H2O LLM Studio
Philipp Singer

H2O.ai Confidential
Mastering Classification and Regression with LLMs:
Insights from Kaggle Competitions

v
H2O.ai Confidential
The typical LLM use case
Text generation Q&A / RAG Labeling Agents

v
H2O.ai Confidential
What about classification?
Common business use-case
Classify text into two or more categories
Sentiment classification
Document categorization
Spam detection
Language detection
Topic classification

v
H2O.ai Confidential
The common way
Supervised training
Train a model on labeled data, predict on unlabeled data
Bag of word approach
Vectorize into fixed vocabulary, train gradient boosting models
Transformer approach
Train transformer models like BERT, Roberta, Deberta

v
H2O.ai Confidential
The LLM way
Zero-shot classification
Ask a LLM model for the prediction without training
Zero-shot engineering
Can we improve zero-shot prediction quality?
Fine-tuning LLMs
Fine-tune a LLM model for task-specific classification

v
H2O.ai Confidential
Financial sentiment data
huggingface.co/datasets/financial_phrasebank
● Data from FiQA and Financial PhraseBank
● Subset with >=75% annotator agreement
● Data statistics
○ 3,453 rows
○ Train: 2,589 rows
○ Val: 864 rows
● Labels
○ ~62% neutral
○ ~26% positive
○ ~12% negative
● Simplest majority baseline
○ 0.622 Accuracy
Cramo and Peab
have signed
exclusive five-year
rental agreements
in Finland and have
extended their
existing rental
agreements in the
Swedish market for
another five years.
Construction work
on the Helsinki
Music Centre is to
start this autumn,
with the total cost of
the project
estimated at 140
million euros.
The company's
profit totaled
578,100 in H1 2007,
down 30.9% year-
on-year.

v
H2O.ai Confidential
Zero-shot streamlining is difficult
No training needed
No labels needed
Easy to get started
Prompt engineering tricky
Difficult to automate into business processes
Runtime expensive
Often interpretable results Evaluation still required → labels

v
H2O.ai Confidential
Zero-shot logits approach
GPT
Your task is to analyze the message below and predict whether it has negative, neutral or positive
sentiment.
Return on investment was 16.6% compared to 15.8% in 2004.
The sentiment is

v
H2O.ai Confidential
Zero-shot logits approach

v
H2O.ai Confidential
Classification fine-tuning
GPT Negative
Neutral
Positive
Classification
Head
Finetune

v
H2O.ai Confidential
H2O LLM Studio
github.com/h2oai/h2o-llmstudio
● Easily and effectively fine-tune LLMs
● CLI & No-Code GUI
● SOTA techniques
○ LoRA
○ Quantization
○ Hyperparameter tuning
○ Experiment tracking and advanced evaluation
○ SFT & RLHF
● Newest Problem Type: Causal Classification
● Fully open-sourced

v
H2O.ai Confidential
Fine-tuning approach

H2O.ai Confidential
Kaggle success stories

v
H2O.ai Confidential
H2O.ai Predict the LLM

v
H2O.ai Confidential
CommonLit - Evaluate Student Summaries Competition

v
H2O.ai Confidential
LLM Science Exam Competition

H2O.ai Confidential
philipp.singer@h2o.ai
@ph_singer
Contact

Similar to KGM Mastering Classification and Regression with LLMs: Insights from Kaggle Competitions

Revitalizing CS Component StudioESUG

cleverti - Nearshore outsourcing IT services from PortugalCleverti

M-Venture - NOAH19 BerlinNOAH Advisors

Chunking Content with ConfidenceGunnar Krause

LEGO presentationEdoardo Falchetti

Case Study: ABAP Development Life Cycle and Governance at THE GLOBE AND MAIL ...Virtual Forge

Building Custom GenAI Apps at H2OSri Ambati

The BPO Transformation JourneyCapgemini

CoverLetter and CV HKo 2.2.2017v3Heikki Komulainen

OpenERP Partnership Program - OpenERP EnterpriseSavoir-faire Linux

How to manage software development in a funky way?Peter Horsten

masVenta hybrid-project-management-june-2017 - Agile Austria Graz 2017Rainer Wendt, PMP, PMI-ACP, PMI-PBA, CBAP

Software_AG_IR_Newsletter_12_18_Dec_2015_tcm16-137103Bapi Reddy Medapati

IoT & Embedded systems developmentWitekio

Polycom.pptxNadeem Ganai

May '23 Marketo Engage Seattle MUG Presentation Slides.pptxNate Smitha

Espedia Visual Enterprise OverviewEspedia Consulting

Robust Hybrid rather than Agile or WaterfallAgile Austria Conference

Resume Oliver GirkeOliver Girke

Working Agile in an Ever Changing WorldCapgemini

Similar to KGM Mastering Classification and Regression with LLMs: Insights from Kaggle Competitions (20)

Revitalizing CS Component Studio

cleverti - Nearshore outsourcing IT services from Portugal

M-Venture - NOAH19 Berlin

Chunking Content with Confidence

LEGO presentation

Case Study: ABAP Development Life Cycle and Governance at THE GLOBE AND MAIL ...

Building Custom GenAI Apps at H2O

The BPO Transformation Journey

CoverLetter and CV HKo 2.2.2017v3

OpenERP Partnership Program - OpenERP Enterprise

How to manage software development in a funky way?

masVenta hybrid-project-management-june-2017 - Agile Austria Graz 2017

Software_AG_IR_Newsletter_12_18_Dec_2015_tcm16-137103

IoT & Embedded systems development

Polycom.pptx

May '23 Marketo Engage Seattle MUG Presentation Slides.pptx

Espedia Visual Enterprise Overview

Robust Hybrid rather than Agile or Waterfall

Resume Oliver Girke

Working Agile in an Ever Changing World

Recently uploaded

DMCC Future of Trade Web3 - Special EditionDubai Multi Commodity Centre

Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz

Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst

"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays

"ML in Production",Oleksandr BaganFwdays

DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy

Anypoint Exchange: It’s Not Just a Repo!Manik S Magar

Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz

Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB

What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett

Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited

Artificial intelligence in cctv survelliance.pptxhariprasad279825

Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro

Search Engine Optimization SEO PDF for 2024.pdfRankYa

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106

Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxnull - The Open Security Community

Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi

Recently uploaded (20)

DMCC Future of Trade Web3 - Special Edition

Advanced Test Driven-Development @ php[tek] 2024

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost

Human Factors of XR: Using Human Factors to Design XR Systems

"Federated learning: out of reach no matter how close",Oleksandr Lapshyn

"ML in Production",Oleksandr Bagan

DevoxxFR 2024 Reproducible Builds with Apache Maven

Anypoint Exchange: It’s Not Just a Repo!

Vector Databases 101 - An introduction to the world of Vector Databases

Developer Data Modeling Mistakes: From Postgres to NoSQL

What's New in Teams Calling, Meetings and Devices March 2024

Ensuring Technical Readiness For Copilot in Microsoft 365

Artificial intelligence in cctv survelliance.pptx

Unraveling Multimodality with Large Language Models.pdf

Search Engine Optimization SEO PDF for 2024.pdf

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics

Nell’iperspazio con Rocket: il Framework Web di Rust!

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx

Vertex AI Gemini Prompt Engineering Tips

KGM Mastering Classification and Regression with LLMs: Insights from Kaggle Competitions

1. v H2O.ai Confidential Senior Principal Data Scientist at H2O.ai ● Vienna / Austria ● Kaggle GM ● All things deep learning ● I love training models ● H2O Hydrogen Torch ● H2O LLM Studio Philipp Singer

2. H2O.ai Confidential Mastering Classification and Regression with LLMs: Insights from Kaggle Competitions

3. v H2O.ai Confidential The typical LLM use case Text generation Q&A / RAG Labeling Agents

4. v H2O.ai Confidential What about classification? Common business use-case Classify text into two or more categories Sentiment classification Document categorization Spam detection Language detection Topic classification

5. v H2O.ai Confidential The common way Supervised training Train a model on labeled data, predict on unlabeled data Bag of word approach Vectorize into fixed vocabulary, train gradient boosting models Transformer approach Train transformer models like BERT, Roberta, Deberta

6. v H2O.ai Confidential The LLM way Zero-shot classification Ask a LLM model for the prediction without training Zero-shot engineering Can we improve zero-shot prediction quality? Fine-tuning LLMs Fine-tune a LLM model for task-specific classification

7. H2O.ai Confidential Use case

8. v H2O.ai Confidential Financial sentiment data huggingface.co/datasets/financial_phrasebank ● Data from FiQA and Financial PhraseBank ● Subset with >=75% annotator agreement ● Data statistics ○ 3,453 rows ○ Train: 2,589 rows ○ Val: 864 rows ● Labels ○ ~62% neutral ○ ~26% positive ○ ~12% negative ● Simplest majority baseline ○ 0.622 Accuracy Cramo and Peab have signed exclusive five-year rental agreements in Finland and have extended their existing rental agreements in the Swedish market for another five years. Construction work on the Helsinki Music Centre is to start this autumn, with the total cost of the project estimated at 140 million euros. The company's profit totaled 578,100 in H1 2007, down 30.9% year- on-year.

9. v H2O.ai Confidential Let’s ask h2oGPT

10. v H2O.ai Confidential Zero-shot streamlining is difficult No training needed No labels needed Easy to get started Prompt engineering tricky Difficult to automate into business processes Runtime expensive Often interpretable results Evaluation still required → labels

11. v H2O.ai Confidential Zero-shot logits approach GPT Your task is to analyze the message below and predict whether it has negative, neutral or positive sentiment. Return on investment was 16.6% compared to 15.8% in 2004. The sentiment is

12. v H2O.ai Confidential Zero-shot logits approach

13. v H2O.ai Confidential Zero-shot logits approach

14. v H2O.ai Confidential Classification fine-tuning GPT Negative Neutral Positive Classification Head Finetune

15. v H2O.ai Confidential H2O LLM Studio github.com/h2oai/h2o-llmstudio ● Easily and effectively fine-tune LLMs ● CLI & No-Code GUI ● SOTA techniques ○ LoRA ○ Quantization ○ Hyperparameter tuning ○ Experiment tracking and advanced evaluation ○ SFT & RLHF ● Newest Problem Type: Causal Classification ● Fully open-sourced

16. v H2O.ai Confidential Fine-tuning approach

17. v H2O.ai Confidential Fine-tuning approach

18. H2O.ai Confidential Kaggle success stories

19. v H2O.ai Confidential H2O.ai Predict the LLM

20. v H2O.ai Confidential CommonLit - Evaluate Student Summaries Competition

21. v H2O.ai Confidential LLM Science Exam Competition

22. H2O.ai Confidential philipp.singer@h2o.ai @ph_singer Contact

KGM Mastering Classification and Regression with LLMs: Insights from Kaggle Competitions

Recommended

Recommended

More Related Content

Similar to KGM Mastering Classification and Regression with LLMs: Insights from Kaggle Competitions

Similar to KGM Mastering Classification and Regression with LLMs: Insights from Kaggle Competitions (20)

More from Sri Ambati

More from Sri Ambati (20)

Recently uploaded

Recently uploaded (20)

KGM Mastering Classification and Regression with LLMs: Insights from Kaggle Competitions