MLCon Berlin, November 2023
Gen Apps on Google Cloud
PaLM2 and Codey APIs in Action
Márton Kodok
Software Architect at REEA.net
Berlin 2023
Agenda
1. What is Vertex AI?
2. Generative AI / LLM / Foundation Models
3. Exploring Model Garden
4. Code Demo
5. Conclusions
Gen Apps on Google Cloud: PaLM2 and Codey APIs in Action @martonkodok
About me
● Google Developer Expert on Cloud technologies (2016→)
● Champion of Google Cloud Innovators program (2021→)
● Among the top 3 Romanians on Stack Overflow (205k reputation)
● Crafting Cloud Architecture + ML backends at REEA.net
Articles: martonkodok.medium.com
Twitter: @martonkodok
SlideShare: martonkodok
Stack Overflow: pentium10
GitHub: pentium10
Bard - Chat Based AI Tool from Google, Powered by PaLM 2
Available in 46 languages; 238 countries and territories
Demo…
Bard Extensions
Getting results from Gmail
“list the top 10 incidents from email for service name "trademark", group results in a table, order by date, hour descending, extract duration, make an analysis if there is a repeated pattern”
Sum my invoices
“find all emails from this year from Hidroelectrica that contain "factura" and list all invoice values for this year, with dates, in a table, order by descending”
Google Cloud AI Portfolio (diagram)
● Business Users, Developers, AI Practitioners
● Duet AI for Google Workspace, Duet AI for Google Cloud
● Contact Center AI, Healthcare AI, Discovery AI, Document AI
● Generative AI App Builder: Conversation AI, Vertex AI Search
● Vertex AI (End-to-End ML Platform): Model Garden, Generative AI Studio, Generative AI APIs
● Foundation Models: Text, Chat, Code, Image, Video, Audio and Music
● Google Cloud Infrastructure - GPUs/TPUs
Part #1
What is Vertex AI?
“Vertex AI is a managed ML platform for developers”
Vertex AI: Managed unified ML platform
Fine-tuning, 1-click deploy
“You can deploy models on Vertex AI and get an HTTPS endpoint to do inference rapidly and reliably.”
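To make the "HTTPS endpoint" idea concrete, here is a minimal sketch of what an inference call against a deployed model could look like. It only builds the request and does not send it. The project ID, region, endpoint ID, instance fields and access token are hypothetical placeholders; the URL shape follows the public Vertex AI REST convention for endpoint prediction.

```python
import json
import urllib.request


def build_predict_request(project, region, endpoint_id, instances, access_token):
    """Build (but do not send) an HTTPS predict request for a deployed endpoint."""
    url = (
        f"https://{region}-aiplatform.googleapis.com/v1/"
        f"projects/{project}/locations/{region}/endpoints/{endpoint_id}:predict"
    )
    # The request body carries the inputs under the "instances" key.
    body = json.dumps({"instances": instances}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json",
        },
    )


# Hypothetical values, for illustration only.
request = build_predict_request(
    project="my-project",
    region="us-central1",
    endpoint_id="1234567890",
    instances=[{"feature_a": 1.0, "feature_b": "x"}],
    access_token="ACCESS_TOKEN",
)
print(request.full_url)
# Sending it would be: urllib.request.urlopen(request)
```

The instance schema depends entirely on the model that was deployed, so the two feature fields above are stand-ins.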
Vertex AI provides tools to build with Gen AI
● Model Garden: Foundation Models, Open Source Models, Task Specific AutoML and APIs
● Generative AI Studio: Prompt Design, Tuning, Adaptive Layers
● ML Platform: Data Science Workbench (Experiment, Train, Deploy), MLOps
Part #2
Model Garden
Model Garden: Task Specific AutoML and APIs, Open Source Models, Foundation Models
Model Garden: Foundation Models
Multi-task, large-scale, minimal training
Language Foundation Models
● PaLM 2 for Text: Custom language tasks
● PaLM 2 for Chat: Multi-turn conversations with session context
● Codey for Code Generation: Improve coding and debugging
● Chirp: Turn audio containing speech into a formatted text representation
● Imagen: Write text prompts to generate new images or generate new areas of an existing image
Introduction: Generative AI Studio
Console tool, test generative models, rapid prototyping
Custom language Prompt samples
1. Summarization
2. Classification
3. Extraction
4. Writing
5. Ideation
cloud.google.com/vertex-ai/docs/generative-ai/learn/prompt-samples
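The five prompt categories above can be sketched as plain prompt templates. The wording of each template is a hypothetical illustration, not copied from the official prompt samples page; the point is only that each task type is a different instruction wrapped around the same input text.

```python
# Hypothetical prompt templates, one per task category listed above.
PROMPT_TEMPLATES = {
    "summarization": "Provide a brief summary of the following text:\n\n{text}\n\nSummary:",
    "classification": (
        "Classify the sentiment of the following text as positive, "
        "negative, or neutral:\n\n{text}\n\nSentiment:"
    ),
    "extraction": (
        "Extract all company names mentioned in the following text "
        "as a comma-separated list:\n\n{text}\n\nCompanies:"
    ),
    "writing": "Write a short product announcement based on these notes:\n\n{text}\n\nAnnouncement:",
    "ideation": "Suggest five blog post titles about the following topic:\n\n{text}\n\nTitles:",
}


def build_prompt(task, text):
    """Fill the template for the given task category with the input text."""
    if task not in PROMPT_TEMPLATES:
        raise ValueError(f"unknown task: {task!r}")
    return PROMPT_TEMPLATES[task].format(text=text)


print(build_prompt("classification", "The demo went really well."))
```

The resulting string is what would be pasted into Generative AI Studio or sent to a text model as the prompt.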
Part #3
Open-source models
Model Garden: Open Source Models
Deploy, large-scale, fine-tunable
Open Source models
Providing an open-source model ecosystem lets companies look for a model that fits their particular use case.
● Falcon-instruct (PEFT): Popular assistant-style tasks; Falcon-40B-instruct on Vertex AI
● Llama 2: Meta's Llama 2 models on Vertex AI
● Code Llama: Designed for general code synthesis and understanding, with a variant specialized for Python
● 53+ other models
More…
“Large Language Models come with large, sheer (cliff) size problems … deployment issues”
Challenging to serve in practice
1. LLMs have enabled previously unseen task paradigms
2. However, LLMs are challenging to deploy to real-world apps due to their large size
3. A 175B-parameter LLM requires at least 350 GB of GPU memory, which calls for specialized infrastructure
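The 350 GB figure follows from simple arithmetic: 175 billion parameters stored at 2 bytes each (16-bit precision). A small sketch of that calculation is below. It counts model weights only, and the 80 GB per-accelerator capacity is an assumption used purely for illustration.

```python
import math


def weights_memory_gb(num_parameters, bytes_per_parameter=2):
    """Memory needed just to hold the weights, in GB (10^9 bytes)."""
    return num_parameters * bytes_per_parameter / 1e9


def min_accelerators(memory_gb, accelerator_gb=80):
    """Lower bound on devices needed; 80 GB per device is an assumed capacity."""
    return math.ceil(memory_gb / accelerator_gb)


params = 175_000_000_000
fp16_gb = weights_memory_gb(params)             # 2 bytes per parameter
fp32_gb = weights_memory_gb(params, 4)          # 4 bytes per parameter
print(fp16_gb, fp32_gb, min_accelerators(fp16_gb))
```

Activations, caches and serving overhead come on top of this, which is why the slide says "at least".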
“Vertex AI enables model inference via API”
Vertex AI: Managed unified ML platform
Fine-tuning, 1-click deploy
Explore AI models with Colab samples
cloud.google.com/vertex-ai/docs/start/explore-models#test-promptsub
Part #4
Demo: Code repository
Code demo … repository
Sample code for GenAI on Google Cloud
goo.gle/gen-ai-github
TextGenerationModel
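As a rough sketch of how the TextGenerationModel class from the Vertex AI SDK for Python might be used for a text extraction task like the one in the demo: the prompt wording, field names and helper functions below are hypothetical, and the call assumes the google-cloud-aiplatform package is installed and Google Cloud credentials are configured. This is not the code from the demo repository.

```python
def build_extraction_prompt(document: str, fields: list[str]) -> str:
    """Hypothetical prompt asking the model to return selected fields as JSON."""
    field_list = ", ".join(fields)
    return (
        "Extract the following fields from the text below and return them "
        f"as JSON: {field_list}.\n\nText: {document}\n\nJSON:"
    )


def extract_with_palm(document, fields, project, location="us-central1"):
    """Send the extraction prompt to PaLM 2 for Text and return the raw reply."""
    # Imported lazily so the prompt helper works without the SDK installed.
    import vertexai
    from vertexai.language_models import TextGenerationModel

    vertexai.init(project=project, location=location)
    model = TextGenerationModel.from_pretrained("text-bison")
    response = model.predict(
        build_extraction_prompt(document, fields),
        temperature=0.0,        # keep the output as deterministic as possible
        max_output_tokens=256,
    )
    return response.text


print(build_extraction_prompt("Invoice 42 issued on 2023-11-01.", ["invoice_number", "date"]))
```

A low temperature is a common choice for extraction, since the goal is a faithful copy of values rather than creative text.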
Text Extraction
Text Extraction Results
Introduction: Vertex AI Codey APIs
Accelerate software development with Codey
Languages supported by Codey
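A minimal sketch of a Codey code generation request over REST, shown without sending it. The model ID code-bison and the "prefix" instance field follow the public Vertex AI REST convention for publisher models; the project ID, prompt text and parameter values are hypothetical.

```python
import json


def codey_predict_url(project, location="us-central1", model="code-bison"):
    """URL of the publisher-model predict method for a Codey model."""
    return (
        f"https://{location}-aiplatform.googleapis.com/v1/projects/{project}"
        f"/locations/{location}/publishers/google/models/{model}:predict"
    )


def codey_request_body(prefix, temperature=0.2, max_output_tokens=1024):
    """Request body: the instruction goes in 'prefix', tuning knobs in 'parameters'."""
    return {
        "instances": [{"prefix": prefix}],
        "parameters": {
            "temperature": temperature,
            "maxOutputTokens": max_output_tokens,
        },
    }


# Hypothetical project and prompt, for illustration only.
print(codey_predict_url("my-project"))
print(json.dumps(codey_request_body("Write a Python function that reverses a string."), indent=2))
```

The same call is also available through the Python SDK; the REST form is shown here because it makes the request structure visible.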
Fine-tune Codey models
1. Can use your own code
2. Improve model quality
3. Generate code in language variants that follow your own standards
4. Generate code for custom libraries
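Tuning on your own code starts with a dataset of example pairs. The sketch below writes such pairs as JSON Lines, assuming the input_text / output_text field names used for PaLM text model tuning datasets; whether a given Codey model expects exactly this layout should be checked against the tuning documentation. The example pair and the acme_http library name are made up.

```python
import json


def to_tuning_jsonl(examples):
    """Serialize (instruction, code) pairs as JSON Lines, one example per line."""
    lines = [
        json.dumps({"input_text": instruction, "output_text": code})
        for instruction, code in examples
    ]
    return "\n".join(lines) + "\n"


# Hypothetical example pair referencing a made-up in-house library.
examples = [
    (
        "Fetch a user by id using the acme_http client.",
        "def get_user(user_id):\n    return acme_http.get(f'/users/{user_id}')",
    ),
]
jsonl = to_tuning_jsonl(examples)
print(jsonl)
```

Pairs like these are how a tuned model can pick up house standards and custom libraries that the base model has never seen.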
Tune language models
cloud.google.com/vertex-ai/docs/generative-ai/models/tune-models
Codey demo … repository
Part #5
Data engineering
What’s included in Vertex AI?
Components shown on the slide: Data Labeling, AutoML models, DL Environment (DL VM + DL Container), Prediction, Feature Store, Training, Experiments, Data Readiness, Feature Engineering, Training/HP-Tuning, Model Monitoring, Model serving, Understanding/Tuning, Edge, Model Management, Notebooks, Pipelines (Orchestration), Explainable AI, Hybrid AI, Metadata, Vision-Video-NLP-Translate Models, Datasets, Custom Models, Containers, Python, Endpoints, BigQuery ML, BigQuery Models, Publisher Pre-trained models, Model Garden, GenAI Studio, Foundation -> LLM PaLM 2 API
Vertex AI: Managed unified ML platform
Fine-tuning, 1-click deploy
Conclusion
“At the end of the day, the largest model is actually not the right answer”
Vertex AI: From prompt samples to Fine-tuning
cloud.google.com/vertex-ai/docs/generative-ai/models/tune-models
goo.gle/gen-ai-github
cloud.google.com/vertex-ai/docs/generative-ai/learn/prompt-samples
Vertex AI: Enhanced ML developer experience
1. Build with the groundbreaking ML tools that power Google
2. Model Garden provides a curated collection of 100+ models
3. Approachable from the non-ML developer perspective (managed models, fine-tuning, training)
4. Accelerate ML with tooling for pre-trained, open source and custom models
5. Deploy to applications with just one click
6. End-to-end integration for data and AI, with build pipelines that outperform and solve complex ML tasks
Twitter: @martonkodok
Thank you. Q&A.
Reea.net - Integrated web solutions driven by creativity to deliver projects.
Follow for articles:
martonkodok.medium.com
Slides available on:
slideshare.net/martonkodok
Gen Apps on Google Cloud PaLM2 and Codey APIs in Action

More Related Content

Similar to Gen Apps on Google Cloud PaLM2 and Codey APIs in Action

Giovanni Galloro - Make your applications see, understand and talk with Googl...
Giovanni Galloro - Make your applications see, understand and talk with Googl...Giovanni Galloro - Make your applications see, understand and talk with Googl...
Giovanni Galloro - Make your applications see, understand and talk with Googl...
Codemotion
 
GDG DevFest Romania - Architecting for the Google Cloud Platform
GDG DevFest Romania - Architecting for the Google Cloud PlatformGDG DevFest Romania - Architecting for the Google Cloud Platform
GDG DevFest Romania - Architecting for the Google Cloud Platform
Márton Kodok
 
Production ML Systems and Computer Vision with Google Cloud
Production ML Systems and Computer Vision with Google CloudProduction ML Systems and Computer Vision with Google Cloud
Production ML Systems and Computer Vision with Google Cloud
gdgsurrey
 
Optimizing your SparkML pipelines using the latest features in Spark 2.3
Optimizing your SparkML pipelines using the latest features in Spark 2.3Optimizing your SparkML pipelines using the latest features in Spark 2.3
Optimizing your SparkML pipelines using the latest features in Spark 2.3
DataWorks Summit
 
Bhadale group of companies projects portfolio
Bhadale group of companies  projects portfolioBhadale group of companies  projects portfolio
Bhadale group of companies projects portfolio
Vijayananda Mohire
 
Google cloud Study Jam 2023.pptx
Google cloud Study Jam 2023.pptxGoogle cloud Study Jam 2023.pptx
Google cloud Study Jam 2023.pptx
GDSCNiT
 
MobiCloud: Towards Cloud Mobile Hybrid Application Generation using Semantica...
MobiCloud: Towards Cloud Mobile Hybrid Application Generation using Semantica...MobiCloud: Towards Cloud Mobile Hybrid Application Generation using Semantica...
MobiCloud: Towards Cloud Mobile Hybrid Application Generation using Semantica...
Amit Sheth
 
MongoDB World 2018: Building Intelligent Apps with MongoDB & Google Cloud
MongoDB World 2018: Building Intelligent Apps with MongoDB & Google CloudMongoDB World 2018: Building Intelligent Apps with MongoDB & Google Cloud
MongoDB World 2018: Building Intelligent Apps with MongoDB & Google Cloud
MongoDB
 
Machine learning at scale by Amy Unruh from Google
Machine learning at scale by  Amy Unruh from GoogleMachine learning at scale by  Amy Unruh from Google
Machine learning at scale by Amy Unruh from Google
Bill Liu
 
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
DianaGray10
 
20240411 QFM009 Machine Intelligence Reading List March 2024
20240411 QFM009 Machine Intelligence Reading List March 202420240411 QFM009 Machine Intelligence Reading List March 2024
20240411 QFM009 Machine Intelligence Reading List March 2024
Matthew Sinclair
 
Tech leaders guide to effective building of machine learning products
Tech leaders guide to effective building of machine learning productsTech leaders guide to effective building of machine learning products
Tech leaders guide to effective building of machine learning products
Gianmario Spacagna
 
Creating an MVP with Oracle
Creating an MVP with OracleCreating an MVP with Oracle
Creating an MVP with Oracle
Franco Ucci
 
GDG Heraklion - Architecting for the Google Cloud Platform
GDG Heraklion - Architecting for the Google Cloud PlatformGDG Heraklion - Architecting for the Google Cloud Platform
GDG Heraklion - Architecting for the Google Cloud Platform
Márton Kodok
 
ALT-F1 Techtalk 3 - Google AppEngine
ALT-F1 Techtalk 3 - Google AppEngineALT-F1 Techtalk 3 - Google AppEngine
ALT-F1 Techtalk 3 - Google AppEngine
Abdelkrim Boujraf
 
AWS_Meetup_BLR_July_22_Social.pdf
AWS_Meetup_BLR_July_22_Social.pdfAWS_Meetup_BLR_July_22_Social.pdf
AWS_Meetup_BLR_July_22_Social.pdf
Ayyanar Jeyakrishnan
 
Large Language Models, Data & APIs - Integrating Generative AI Power into you...
Large Language Models, Data & APIs - Integrating Generative AI Power into you...Large Language Models, Data & APIs - Integrating Generative AI Power into you...
Large Language Models, Data & APIs - Integrating Generative AI Power into you...
NETUserGroupBern
 
Building Instruqt, a scalable learning platform
Building Instruqt, a scalable learning platformBuilding Instruqt, a scalable learning platform
Building Instruqt, a scalable learning platform
Instruqt
 
MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
 MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ... MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
Databricks
 
The Magic of flutter Comex oman 2019
The Magic of flutter Comex oman 2019The Magic of flutter Comex oman 2019
The Magic of flutter Comex oman 2019
Ahmed Abu Eldahab
 

Similar to Gen Apps on Google Cloud PaLM2 and Codey APIs in Action (20)

Giovanni Galloro - Make your applications see, understand and talk with Googl...
Giovanni Galloro - Make your applications see, understand and talk with Googl...Giovanni Galloro - Make your applications see, understand and talk with Googl...
Giovanni Galloro - Make your applications see, understand and talk with Googl...
 
GDG DevFest Romania - Architecting for the Google Cloud Platform
GDG DevFest Romania - Architecting for the Google Cloud PlatformGDG DevFest Romania - Architecting for the Google Cloud Platform
GDG DevFest Romania - Architecting for the Google Cloud Platform
 
Production ML Systems and Computer Vision with Google Cloud
Production ML Systems and Computer Vision with Google CloudProduction ML Systems and Computer Vision with Google Cloud
Production ML Systems and Computer Vision with Google Cloud
 
Optimizing your SparkML pipelines using the latest features in Spark 2.3
Optimizing your SparkML pipelines using the latest features in Spark 2.3Optimizing your SparkML pipelines using the latest features in Spark 2.3
Optimizing your SparkML pipelines using the latest features in Spark 2.3
 
Bhadale group of companies projects portfolio
Bhadale group of companies  projects portfolioBhadale group of companies  projects portfolio
Bhadale group of companies projects portfolio
 
Google cloud Study Jam 2023.pptx
Google cloud Study Jam 2023.pptxGoogle cloud Study Jam 2023.pptx
Google cloud Study Jam 2023.pptx
 
MobiCloud: Towards Cloud Mobile Hybrid Application Generation using Semantica...
MobiCloud: Towards Cloud Mobile Hybrid Application Generation using Semantica...MobiCloud: Towards Cloud Mobile Hybrid Application Generation using Semantica...
MobiCloud: Towards Cloud Mobile Hybrid Application Generation using Semantica...
 
MongoDB World 2018: Building Intelligent Apps with MongoDB & Google Cloud
MongoDB World 2018: Building Intelligent Apps with MongoDB & Google CloudMongoDB World 2018: Building Intelligent Apps with MongoDB & Google Cloud
MongoDB World 2018: Building Intelligent Apps with MongoDB & Google Cloud
 
Machine learning at scale by Amy Unruh from Google
Machine learning at scale by  Amy Unruh from GoogleMachine learning at scale by  Amy Unruh from Google
Machine learning at scale by Amy Unruh from Google
 
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
 
20240411 QFM009 Machine Intelligence Reading List March 2024
20240411 QFM009 Machine Intelligence Reading List March 202420240411 QFM009 Machine Intelligence Reading List March 2024
20240411 QFM009 Machine Intelligence Reading List March 2024
 
Tech leaders guide to effective building of machine learning products
Tech leaders guide to effective building of machine learning productsTech leaders guide to effective building of machine learning products
Tech leaders guide to effective building of machine learning products
 
Creating an MVP with Oracle
Creating an MVP with OracleCreating an MVP with Oracle
Creating an MVP with Oracle
 
GDG Heraklion - Architecting for the Google Cloud Platform
GDG Heraklion - Architecting for the Google Cloud PlatformGDG Heraklion - Architecting for the Google Cloud Platform
GDG Heraklion - Architecting for the Google Cloud Platform
 
ALT-F1 Techtalk 3 - Google AppEngine
ALT-F1 Techtalk 3 - Google AppEngineALT-F1 Techtalk 3 - Google AppEngine
ALT-F1 Techtalk 3 - Google AppEngine
 
AWS_Meetup_BLR_July_22_Social.pdf
AWS_Meetup_BLR_July_22_Social.pdfAWS_Meetup_BLR_July_22_Social.pdf
AWS_Meetup_BLR_July_22_Social.pdf
 
Large Language Models, Data & APIs - Integrating Generative AI Power into you...
Large Language Models, Data & APIs - Integrating Generative AI Power into you...Large Language Models, Data & APIs - Integrating Generative AI Power into you...
Large Language Models, Data & APIs - Integrating Generative AI Power into you...
 
Building Instruqt, a scalable learning platform
Building Instruqt, a scalable learning platformBuilding Instruqt, a scalable learning platform
Building Instruqt, a scalable learning platform
 
MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
 MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ... MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
 
The Magic of flutter Comex oman 2019
The Magic of flutter Comex oman 2019The Magic of flutter Comex oman 2019
The Magic of flutter Comex oman 2019
 

More from Márton Kodok

Discover BigQuery ML, build your own CREATE MODEL statement
Discover BigQuery ML, build your own CREATE MODEL statementDiscover BigQuery ML, build your own CREATE MODEL statement
Discover BigQuery ML, build your own CREATE MODEL statement
Márton Kodok
 
Cloud Run - the rise of serverless and containerization
Cloud Run - the rise of serverless and containerizationCloud Run - the rise of serverless and containerization
Cloud Run - the rise of serverless and containerization
Márton Kodok
 
BigQuery best practices and recommendations to reduce costs with BI Engine, S...
BigQuery best practices and recommendations to reduce costs with BI Engine, S...BigQuery best practices and recommendations to reduce costs with BI Engine, S...
BigQuery best practices and recommendations to reduce costs with BI Engine, S...
Márton Kodok
 
Cloud Workflows What's new in serverless orchestration and automation
Cloud Workflows What's new in serverless orchestration and automationCloud Workflows What's new in serverless orchestration and automation
Cloud Workflows What's new in serverless orchestration and automation
Márton Kodok
 
Serverless orchestration and automation with Cloud Workflows
Serverless orchestration and automation with Cloud WorkflowsServerless orchestration and automation with Cloud Workflows
Serverless orchestration and automation with Cloud Workflows
Márton Kodok
 
Serverless orchestration and automation with Cloud Workflows
Serverless orchestration and automation with Cloud WorkflowsServerless orchestration and automation with Cloud Workflows
Serverless orchestration and automation with Cloud Workflows
Márton Kodok
 
Serverless orchestration and automation with Cloud Workflows
Serverless orchestration and automation with Cloud WorkflowsServerless orchestration and automation with Cloud Workflows
Serverless orchestration and automation with Cloud Workflows
Márton Kodok
 
BigdataConference Europe - BigQuery ML
BigdataConference Europe - BigQuery MLBigdataConference Europe - BigQuery ML
BigdataConference Europe - BigQuery ML
Márton Kodok
 
DevFest Romania 2020 Keynote: Bringing the Cloud to you.
DevFest Romania 2020 Keynote: Bringing the Cloud to you.DevFest Romania 2020 Keynote: Bringing the Cloud to you.
DevFest Romania 2020 Keynote: Bringing the Cloud to you.
Márton Kodok
 
BigQuery ML - Machine learning at scale using SQL
BigQuery ML - Machine learning at scale using SQLBigQuery ML - Machine learning at scale using SQL
BigQuery ML - Machine learning at scale using SQL
Márton Kodok
 
Applying BigQuery ML on e-commerce data analytics
Applying BigQuery ML on e-commerce data analyticsApplying BigQuery ML on e-commerce data analytics
Applying BigQuery ML on e-commerce data analytics
Márton Kodok
 
Supercharge your data analytics with BigQuery
Supercharge your data analytics with BigQuerySupercharge your data analytics with BigQuery
Supercharge your data analytics with BigQuery
Márton Kodok
 
Vibe Koli 2019 - Utazás az egyetem padjaitól a Google Developer Expertig
Vibe Koli 2019 - Utazás az egyetem padjaitól a Google Developer ExpertigVibe Koli 2019 - Utazás az egyetem padjaitól a Google Developer Expertig
Vibe Koli 2019 - Utazás az egyetem padjaitól a Google Developer Expertig
Márton Kodok
 
BigQuery ML - Machine learning at scale using SQL
BigQuery ML - Machine learning at scale using SQLBigQuery ML - Machine learning at scale using SQL
BigQuery ML - Machine learning at scale using SQL
Márton Kodok
 
Google Cloud Platform Solutions for DevOps Engineers
Google Cloud Platform Solutions  for DevOps EngineersGoogle Cloud Platform Solutions  for DevOps Engineers
Google Cloud Platform Solutions for DevOps Engineers
Márton Kodok
 
Next18 Extended Targu Mures - Bringing the Cloud to you
Next18 Extended Targu Mures - Bringing the Cloud to youNext18 Extended Targu Mures - Bringing the Cloud to you
Next18 Extended Targu Mures - Bringing the Cloud to you
Márton Kodok
 
6. DISZ - Webalkalmazások skálázhatósága a Google Cloud Platformon
6. DISZ - Webalkalmazások skálázhatósága  a Google Cloud Platformon6. DISZ - Webalkalmazások skálázhatósága  a Google Cloud Platformon
6. DISZ - Webalkalmazások skálázhatósága a Google Cloud Platformon
Márton Kodok
 
GCP - A felhőalapú architektúrák és szolgáltatások
GCP - A felhőalapú architektúrák és szolgáltatásokGCP - A felhőalapú architektúrák és szolgáltatások
GCP - A felhőalapú architektúrák és szolgáltatások
Márton Kodok
 
Efikot - Smart City, okos város - a jövőnk kulcsa
Efikot - Smart City, okos város - a jövőnk kulcsaEfikot - Smart City, okos város - a jövőnk kulcsa
Efikot - Smart City, okos város - a jövőnk kulcsa
Márton Kodok
 
CodeCamp Iasi - Creating serverless data analytics system on GCP using BigQuery
CodeCamp Iasi - Creating serverless data analytics system on GCP using BigQueryCodeCamp Iasi - Creating serverless data analytics system on GCP using BigQuery
CodeCamp Iasi - Creating serverless data analytics system on GCP using BigQuery
Márton Kodok
 

More from Márton Kodok (20)

Discover BigQuery ML, build your own CREATE MODEL statement
Discover BigQuery ML, build your own CREATE MODEL statementDiscover BigQuery ML, build your own CREATE MODEL statement
Discover BigQuery ML, build your own CREATE MODEL statement
 
Cloud Run - the rise of serverless and containerization
Cloud Run - the rise of serverless and containerizationCloud Run - the rise of serverless and containerization
Cloud Run - the rise of serverless and containerization
 
BigQuery best practices and recommendations to reduce costs with BI Engine, S...
BigQuery best practices and recommendations to reduce costs with BI Engine, S...BigQuery best practices and recommendations to reduce costs with BI Engine, S...
BigQuery best practices and recommendations to reduce costs with BI Engine, S...
 
Cloud Workflows What's new in serverless orchestration and automation
Cloud Workflows What's new in serverless orchestration and automationCloud Workflows What's new in serverless orchestration and automation
Cloud Workflows What's new in serverless orchestration and automation
 
Serverless orchestration and automation with Cloud Workflows
Serverless orchestration and automation with Cloud WorkflowsServerless orchestration and automation with Cloud Workflows
Serverless orchestration and automation with Cloud Workflows
 
Serverless orchestration and automation with Cloud Workflows
Serverless orchestration and automation with Cloud WorkflowsServerless orchestration and automation with Cloud Workflows
Serverless orchestration and automation with Cloud Workflows
 
Serverless orchestration and automation with Cloud Workflows
Serverless orchestration and automation with Cloud WorkflowsServerless orchestration and automation with Cloud Workflows
Serverless orchestration and automation with Cloud Workflows
 
BigdataConference Europe - BigQuery ML
BigdataConference Europe - BigQuery MLBigdataConference Europe - BigQuery ML
BigdataConference Europe - BigQuery ML
 
DevFest Romania 2020 Keynote: Bringing the Cloud to you.
DevFest Romania 2020 Keynote: Bringing the Cloud to you.DevFest Romania 2020 Keynote: Bringing the Cloud to you.
DevFest Romania 2020 Keynote: Bringing the Cloud to you.
 
BigQuery ML - Machine learning at scale using SQL
BigQuery ML - Machine learning at scale using SQLBigQuery ML - Machine learning at scale using SQL
BigQuery ML - Machine learning at scale using SQL
 
Applying BigQuery ML on e-commerce data analytics
Applying BigQuery ML on e-commerce data analyticsApplying BigQuery ML on e-commerce data analytics
Applying BigQuery ML on e-commerce data analytics
 
Supercharge your data analytics with BigQuery
Supercharge your data analytics with BigQuerySupercharge your data analytics with BigQuery
Supercharge your data analytics with BigQuery
 
Vibe Koli 2019 - Utazás az egyetem padjaitól a Google Developer Expertig
Vibe Koli 2019 - Utazás az egyetem padjaitól a Google Developer ExpertigVibe Koli 2019 - Utazás az egyetem padjaitól a Google Developer Expertig
Vibe Koli 2019 - Utazás az egyetem padjaitól a Google Developer Expertig
 
BigQuery ML - Machine learning at scale using SQL
BigQuery ML - Machine learning at scale using SQLBigQuery ML - Machine learning at scale using SQL
BigQuery ML - Machine learning at scale using SQL
 
Google Cloud Platform Solutions for DevOps Engineers
Google Cloud Platform Solutions  for DevOps EngineersGoogle Cloud Platform Solutions  for DevOps Engineers
Google Cloud Platform Solutions for DevOps Engineers
 
Next18 Extended Targu Mures - Bringing the Cloud to you
Next18 Extended Targu Mures - Bringing the Cloud to youNext18 Extended Targu Mures - Bringing the Cloud to you
Next18 Extended Targu Mures - Bringing the Cloud to you
 
6. DISZ - Webalkalmazások skálázhatósága a Google Cloud Platformon
6. DISZ - Webalkalmazások skálázhatósága  a Google Cloud Platformon6. DISZ - Webalkalmazások skálázhatósága  a Google Cloud Platformon
6. DISZ - Webalkalmazások skálázhatósága a Google Cloud Platformon
 
GCP - A felhőalapú architektúrák és szolgáltatások
GCP - A felhőalapú architektúrák és szolgáltatásokGCP - A felhőalapú architektúrák és szolgáltatások
GCP - A felhőalapú architektúrák és szolgáltatások
 
Efikot - Smart City, okos város - a jövőnk kulcsa
Efikot - Smart City, okos város - a jövőnk kulcsaEfikot - Smart City, okos város - a jövőnk kulcsa
Efikot - Smart City, okos város - a jövőnk kulcsa
 
CodeCamp Iasi - Creating serverless data analytics system on GCP using BigQuery
CodeCamp Iasi - Creating serverless data analytics system on GCP using BigQueryCodeCamp Iasi - Creating serverless data analytics system on GCP using BigQuery
CodeCamp Iasi - Creating serverless data analytics system on GCP using BigQuery
 

Recently uploaded

Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Globus
 
How to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good PracticesHow to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good Practices
Globus
 
De mooiste recreatieve routes ontdekken met RouteYou en FME
De mooiste recreatieve routes ontdekken met RouteYou en FMEDe mooiste recreatieve routes ontdekken met RouteYou en FME
De mooiste recreatieve routes ontdekken met RouteYou en FME
Jelle | Nordend
 
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
informapgpstrackings
 
Cyaniclab : Software Development Agency Portfolio.pdf
Cyaniclab : Software Development Agency Portfolio.pdfCyaniclab : Software Development Agency Portfolio.pdf
Cyaniclab : Software Development Agency Portfolio.pdf
Cyanic lab
 
Why React Native as a Strategic Advantage for Startup Innovation.pdf
Why React Native as a Strategic Advantage for Startup Innovation.pdfWhy React Native as a Strategic Advantage for Startup Innovation.pdf
Why React Native as a Strategic Advantage for Startup Innovation.pdf
ayushiqss
 
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Gen Apps on Google Cloud PaLM2 and Codey APIs in Action

  • 1. MLCon Berlin, November 2023. Gen Apps on Google Cloud: PaLM2 and Codey APIs in Action. Márton Kodok, Software Architect at REEA.net. Berlin 2023
  • 2. Agenda: 1. What is Vertex AI? 2. Generative AI / LLM / Foundation Models 3. Exploring Model Garden 4. Code Demo 5. Conclusions. Gen Apps on Google Cloud: PaLM2 and Codey APIs in Action @martonkodok
  • 3. About me ● Google Developer Expert on Cloud technologies (2016→) ● Champion of Google Cloud Innovators program (2021→) ● Among the top 3 Romanians on Stack Overflow, 205k reputation ● Crafting Cloud Architecture + ML backends at REEA.net. Articles: martonkodok.medium.com Twitter: @martonkodok Slideshare: martonkodok StackOverflow: pentium10 GitHub: pentium10
  • 4. Bard: chat-based AI tool from Google, powered by PaLM 2. Available in 46 languages; 238 countries
  • 8. list the top 10 incidents from email for service name "trademark", group results in table, order by date, hour descending, extract duration, make an analyses if there is a repeated pattern
  • 10. find all emails from this year from Hidroelectrica that contain "factura" and list all invoice values for this year, with dates, in a table, order by descending
  • 11. Google Cloud AI Portfolio: Foundation Models; Vertex AI End-to-End ML Platform; Generative AI App Builder; Text, Chat, Code, Image, Video; Google Cloud Infrastructure - GPUs/TPUs; Contact Center AI; Healthcare AI; Discovery AI; Document AI; Conversation AI; Vertex AI Search; Business Users; AI Practitioners; Developers; Audio and Music; Generative AI Studio; Generative AI APIs; Model Garden; Duet AI for Google Workspace; Duet AI for Google Cloud
  • 13. "Vertex AI is a managed ML platform for developers"
  • 14. Vertex AI: managed unified ML platform. Fine-tuning, 1-click deploy
  • 15. "You can deploy models on Vertex AI and get an HTTPS endpoint to do inference rapidly and reliably."
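As a rough illustration of what "an HTTPS endpoint" means in practice, the sketch below only builds the URL and JSON body for a `:predict` call against a deployed Vertex AI endpoint. The project ID, region, endpoint ID and the instance payload are placeholders invented for this example (the instance schema depends entirely on the deployed model), and sending the request would additionally need an OAuth bearer token, which is left out here.

```python
import json


def build_predict_request(project: str, region: str, endpoint_id: str,
                          instances: list) -> tuple:
    """Return (url, json_body) for a Vertex AI endpoint :predict call.

    Nothing is sent over the network; this only shows the request shape.
    """
    url = (
        f"https://{region}-aiplatform.googleapis.com/v1/"
        f"projects/{project}/locations/{region}/endpoints/{endpoint_id}:predict"
    )
    # The REST API expects the inputs under an "instances" key.
    body = json.dumps({"instances": instances})
    return url, body


# Placeholder values, not real resources.
endpoint_url, endpoint_body = build_predict_request(
    "my-project", "us-central1", "1234567890", [{"feature_a": 1.0}]
)
print(endpoint_url)
print(endpoint_body)
```

The returned pair could be handed to any HTTP client together with an `Authorization: Bearer <token>` header.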
  • 16. Vertex AI provides tools to build with Gen AI. Model Garden: Open Source Models, Task Specific AutoML and APIs, Foundation Models. Generative AI Studio: Tuning, Adaptive Layers, Prompt Design. ML Platform: Data Science Workbench, Experiment, Train, Deploy, MLOps
  • 18. Model Garden: Task Specific AutoML and APIs, Open Source Models, Foundation Models
  • 19. Model Garden, Foundation Models: multi-task, large-scale, minimal training
  • 20. Language Foundation Models. PaLM 2 for Text: custom language tasks. PaLM 2 for Chat: multi-turn conversations with session context. Codey for Code Generation: improve coding and debugging. Chirp: turning audio containing speech into a formatted text representation. Imagen: write text prompts to generate new images or generate new areas of an existing image.
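To make "PaLM 2 for Text" concrete, here is a minimal sketch using the Vertex AI Python SDK as it was available around the time of this talk (`vertexai.language_models.TextGenerationModel`). The project ID is a placeholder, the sampling defaults are illustrative only, and the SDK surface may have changed since, so treat this as an assumption-laden example rather than the code shown in the demo.

```python
def generation_params(temperature: float = 0.2,
                      max_output_tokens: int = 256) -> dict:
    """Collect sampling parameters; the defaults are illustrative only."""
    if temperature < 0:
        raise ValueError("temperature must be non-negative")
    if max_output_tokens < 1:
        raise ValueError("max_output_tokens must be at least 1")
    return {"temperature": temperature, "max_output_tokens": max_output_tokens}


def ask_palm(prompt: str, project: str, location: str = "us-central1") -> str:
    """Send one prompt to PaLM 2 for Text and return the generated text.

    Requires `pip install google-cloud-aiplatform` and application
    default credentials; `project` is a placeholder for a real project ID.
    """
    # Imported lazily so generation_params() works without the SDK installed.
    import vertexai
    from vertexai.language_models import TextGenerationModel

    vertexai.init(project=project, location=location)
    model = TextGenerationModel.from_pretrained("text-bison")
    response = model.predict(prompt, **generation_params())
    return response.text
```

A call such as `ask_palm("Summarize: ...", project="my-project")` would return the model's text; the chat variant follows the same pattern with `ChatModel` and a chat session.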
  • 21. Introduction: Generative AI Studio. Console tool; test generative models; rapid prototyping
  • 22. Generative AI Studio
  • 23. Custom language prompt samples: 1. Summarization 2. Classification 3. Extraction 4. Writing 5. Ideation. cloud.google.com/vertex-ai/docs/generative-ai/learn/prompt-samples
  • 25. Model Garden, Open Source Models: deploy, large-scale, fine-tunable
  • 26. Open Source models. Providing an open source model ecosystem enables companies to look for a model that fits their particular use case. Falcon-instruct (PEFT): popular assistant-style tasks, Falcon-40B-instruct on Vertex AI. Llama 2: Meta's Llama 2 models on Vertex AI. Code Llama: designed for general code synthesis and understanding, with a variant designed for Python. 53+ other models. More…
  • 27. "Large Language Models come with large sheer (cliff) size problems … deployment issues"
  • 28. Challenging to serve in practice: 1. LLMs have enabled a previously unseen task paradigm 2. However, LLMs are challenging to deploy to real-world apps due to their large size 3. A 175B-parameter LLM requires at least 350 GB of GPU memory, using specialized infrastructure
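The 350 GB figure can be reproduced with simple arithmetic, assuming the weights are stored as 16-bit floats (2 bytes per parameter). That precision is an assumption of this sketch, not something the slide states; activations and attention caches come on top, which is consistent with the "at least" on the slide.

```python
def weights_memory_gb(num_params: float, bytes_per_param: int) -> float:
    """Memory needed just to hold the model weights, in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


PARAMS_175B = 175e9

fp16_gb = weights_memory_gb(PARAMS_175B, 2)  # 16-bit floats
fp32_gb = weights_memory_gb(PARAMS_175B, 4)  # 32-bit floats
int8_gb = weights_memory_gb(PARAMS_175B, 1)  # 8-bit quantized weights

print(f"fp16: {fp16_gb:.0f} GB, fp32: {fp32_gb:.0f} GB, int8: {int8_gb:.0f} GB")
# prints "fp16: 350 GB, fp32: 700 GB, int8: 175 GB"
```

Even the quantized case exceeds the memory of a single commodity accelerator, which is the practical reason a managed inference API is attractive.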
  • 29. "Vertex AI enables model inference via API"
  • 30. Vertex AI: managed unified ML platform. Fine-tuning, 1-click deploy
  • 31. Explore AI models with Colab samples: cloud.google.com/vertex-ai/docs/start/explore-models#test-promptsub
  • 33. Code demo … repository
  • 34. Sample code for GenAI on Google Cloud: goo.gle/gen-ai-github
  • 40. Introduction: Vertex AI Codey APIs
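As a hedged sketch of what a Codey API call looks like on the wire, the helper below builds the REST request for the `code-bison` code generation model. The project ID and the prompt are placeholders, the parameter values are illustrative, and no request is actually sent; the request shape follows the publisher-model `:predict` format documented for Codey at the time.

```python
import json


def build_codey_request(project: str, prefix: str,
                        region: str = "us-central1",
                        model: str = "code-bison",
                        temperature: float = 0.2,
                        max_output_tokens: int = 1024) -> tuple:
    """Return (url, json_body) for a Codey code generation request."""
    url = (
        f"https://{region}-aiplatform.googleapis.com/v1/projects/{project}"
        f"/locations/{region}/publishers/google/models/{model}:predict"
    )
    body = {
        # Code generation takes the instruction or partial code as "prefix".
        "instances": [{"prefix": prefix}],
        "parameters": {
            "temperature": temperature,
            "maxOutputTokens": max_output_tokens,
        },
    }
    return url, json.dumps(body)


# Placeholder project and prompt, for illustration only.
codey_url, codey_body = build_codey_request(
    "my-project", "Write a Python function that reverses a string."
)
print(codey_url)
```

The same request shape, with a different model name, applies to the other Google publisher models that use the `:predict` method.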
  • 43. Fine-tune Codey models: 1. can use own code 2. improve model quality 3. generate code in language variants to use our standards 4. generate code for custom libraries
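Tuning on "own code" starts with a dataset of prompt and completion pairs. The sketch below assumes the JSON Lines layout with `input_text` and `output_text` fields that Vertex AI documented for supervised tuning of code generation models; check the current tuning documentation before relying on it. The two example pairs and the `acme_http` library name are invented for illustration.

```python
import json


def to_tuning_jsonl(examples: list) -> str:
    """Serialize (prompt, completion) pairs as one JSON object per line."""
    lines = [
        json.dumps({"input_text": prompt, "output_text": completion})
        for prompt, completion in examples
    ]
    return "\n".join(lines) + "\n"


# Hypothetical in-house examples; acme_http is not a real library.
tuning_examples = [
    ("Write a function that fetches a URL with acme_http.",
     "def fetch(url):\n    return acme_http.get(url).body"),
    ("Write a function that posts JSON with acme_http.",
     "def post(url, data):\n    return acme_http.post(url, json=data).body"),
]

tuning_jsonl = to_tuning_jsonl(tuning_examples)
print(tuning_jsonl)
```

The resulting text would typically be uploaded to a Cloud Storage bucket and referenced when the tuning job is created.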
  • 45. Codey demo … repository
  • 47. What’s included in Vertex AI? Data Labeling AutoML models DL Environment (DL VM + DL Container) Prediction Feature Store Training Experiments Data Readiness Feature Engineering Training/HP-Tuning Model Monitoring Model serving Understanding/Tuning Edge Model Management Notebooks Pipelines (Orchestration) Explainable AI Hybrid AI Model Monitoring Metadata Vision-Video-NLP-Translate Models Datasets Custom Models Containers Python Endpoints BigQuery ML BigQuery Models Publisher Pre-trained models Model Garden GenAI Studio Foundation -> LLM PaLM 2 API
  • 48. Vertex AI: managed unified ML platform. Fine-tuning, 1-click deploy
  • 49. Conclusion
  • 50. "At the end of the day, the largest model is actually not the right answer"
  • 51. Vertex AI: from prompt samples to fine-tuning. cloud.google.com/vertex-ai/docs/generative-ai/learn/prompt-samples goo.gle/gen-ai-github cloud.google.com/vertex-ai/docs/generative-ai/models/tune-models
  • 52. Vertex AI: enhanced ML developer experience. 1. Build with the groundbreaking ML tools that power Google 2. Model Garden provides a curated collection of 100+ models 3. Approachable from the non-ML developer perspective (managed models, fine-tuning training) 4. Accelerate ML with tooling for pre-trained, open source and custom models 5. Deploy to applications with just one click 6. End-to-end integration for data and AI with build pipelines that outperform and solve complex ML tasks
  • 53. Intro to Model Garden and GenApps: slideshare.net/martonkodok
  • 54. Thank you. Q&A. Twitter: @martonkodok. Reea.net - Integrated web solutions driven by creativity to deliver projects. Follow for articles: martonkodok.medium.com Slides available on: slideshare.net/martonkodok