CEO & Co-Founder on Data Growth and Storage

•

0 likes•273 views

"The rise of large language models like GPT-4 and generative AI has changed the traditional approach of data and ML teams to engineer their own features and train models in batch on large sets of historic data. Since training of proprietary LLMs is not anymore an affordable option to most organizations, the inference of foundational models via APIs seems to be the natural way of consumption, providing new challenges for the architecture of real-time apps and workflows. In this talk we show how data and machine learning teams can rapidly prototype and deploy real-time ML apps, ingesting real-time data with the help of Apache Kafka® and Airy, an open-source app framework. We will discuss different options to finetune LLMs and „chaining“ them with other ML models at inference in a microservices architecture utilizing Kafka Streams and Kubernetes. We will also discuss how streaming can enable dynamic features for ML models and prompt engineering to integrate with generative AI. At the end of the talk we will give an outlook on the opportunity to dynamically retrain machine learning models in real-time with streaming and batch sources, utilizing Ray and Kubernetes to spin up GPU node pools for model training on demand. In this context, we will also discuss how event streaming can be used for reinforcement learning with human feedback (RLHF) to improve the accuracy of predictions and to make the ML model more robust over time."

Technology

〜
S O U R C E S : S T A T I S T A , G A R T N E R , D A T A A G E 2 0 2 5 , W I T H D A T A F R O M I D C G L O B A L D A T A S P H E R E , N O V 2 0 1 8
Z
E
T
T
A
B
Y
T
E
S

S O U R C E S : S T A T I S T A , G A R T N E R
E - M A I L S , D I R E C T M E S S A G E S , J S O N ,
W O R D D O C U M E N T S , T E X T F I L E S ,
P D F D O C U M E N T S , S P R E A D S H E E T S ,
I M A G E S , A U D I O , V I D E O S
D A T A W A R E H O U S E S ,
C R M , E T C .

S O U R C E S : N E O 4 J , P U B . T O W A R D S A I . N E T

S O U R C E : C O B U S G R E Y L I N G . C O M

CEO & Co-Founder on Data Growth and Storage

Recently uploaded

DMCC Future of Trade Web3 - Special EditionDubai Multi Commodity Centre

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski

CloudStudio User manual (basic edition):comworks

Gen AI in Business - Global Trends Report 2024.pdfAddepto

Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm

Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson

DevEX - reference for building teams, processes, and platformsSergiu Bodiu

"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays

Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi

SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz

Commit 2024 - Secret Management made easyAlfredo García Lavilla

"ML in Production",Oleksandr BaganFwdays

Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge

The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2

Install Stable Diffusion in windows machinePadma Pradeep

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada

Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed

Recently uploaded (20)

DMCC Future of Trade Web3 - Special Edition

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...

CloudStudio User manual (basic edition):

Gen AI in Business - Global Trends Report 2024.pdf

Streamlining Python Development: A Guide to a Modern Project Setup

Are Multi-Cloud and Serverless Good or Bad?

DevEX - reference for building teams, processes, and platforms

"Debugging python applications inside k8s environment", Andrii Soldatenko

Vertex AI Gemini Prompt Engineering Tips

SAP Build Work Zone - Overview L2-L3.pptx

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost

Commit 2024 - Secret Management made easy

"ML in Production",Oleksandr Bagan

Designing IA for AI - Information Architecture Conference 2024

The Future of Software Development - Devin AI Innovative Approach.pdf

Install Stable Diffusion in windows machine

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

Scanning the Internet for External Cloud Exposures via SSL Certs