SlideShare a Scribd company logo
1 of 25
© HTEC Group, 2023
Approach To Architecting
GenerativeAI Solutions
Ivan Petrović
Senior Technology Lead @ HTEC Group
Empowering your digital tomorrow
© HTEC Group, 2023
2
Topics
Approach To Architecting Generative AI Solutions
Architecture Dive – Dynamic Agent
Intro To Non-AI Folks
HTEC’s Approach
01
02
03
© HTEC Group, 2023
Approach To Architecting Generative AI Solutions
Intro To
Non-AI Folks
© HTEC Group, 2023
4
What Are LLMs?
Approach To Architecting Generative AI Solutions
Large Language Model
Paris
What is the capital of France?
Output
Input
© HTEC Group, 2023
5
Hosting LLMs
Approach To Architecting Generative AI Solutions
Large Language Model
+
+
© HTEC Group, 2023
6
Multimodal Models
Approach To Architecting Generative AI Solutions
Multimodal
Model
IMU
© HTEC Group, 2023
7
Most Common Use-Cases
Approach To Architecting Generative AI Solutions
Classify
Generate
Rewrite
Cluster
Extract
Search
Summarize
© HTEC Group, 2023
HTEC’s
Approach
Approach To Architecting Generative AI Solutions
© HTEC Group, 2023
9
Throughout our consultative process we apply Cognitive Design principals and Responsible AI practices to ensure human integrity and ethical guidance
Framework
COGNITIVE DESIGN & RESPONSIBLE AI
ACCELERATORS
Analytics
Intelligent
assist
2
1
3
Data
AI/ML
ENGINEERING
DATA
ENGINEERING
DATA
SCIENCE
PRODUCT
DESIGN
Cognitive
Design
CX
SERVICE DESIGN
BUSINESS
DESIGN
Generative AI
Automation &
productivity
Autonomous
systems
Decision
intelligence
Data-driven
innovation
Computer
vision
Knowledge
management
Predictive
analytics
Intelligent
process
automation
Anomaly
detection
AI-powered
security &
compliance
Predictive
maintenance
STEP 1
Problem assessment
STEP 2
Cognitive design
STEP 3
Secure data onboarding & security
STEP 4
AI/ML model development
STEP 5
Continuous solution optimization
STEP 6
Deployment & scaling
Approach To Architecting Generative AI Solutions
© HTEC Group, 2023
10
Prompts and
Prompt Engineering
• Prompts are inputs sent to LLM to elicit a certain response
• Natural language instructions
• Prompt Enginering
• Influence the creativity of LLM
• Halucinations
• Training vs Prompting
Approach To Architecting Generative AI Solutions
Prompt content
Instruction
Context
Output Indicator
Input Data
© HTEC Group, 2023
11
Points of Interest
• Types of learning (LLM)
• Zero shot
• One shot
• Few shot
• Fine tunning
• Vector Databases
• Considerations
• Ethics
• Licencing
• Knowledge cutoff
• Token limitations
• Token modalities
Approach To Architecting Generative AI Solutions
Input
Zero Shot
Example 1
One Shot
Input
Example 1
Few Shot
…
Example N
Input
© HTEC Group, 2023
12
Complexity Hierarchy
Approach To Architecting Generative AI Solutions
Complexity level
No Context
Simple Context
Tool Use
Multi-Agent
Important points
No persistent memory, can be used hirerachaly
Retaining conv. history, conv. buffers, summary,
window
Persisten memory with vector storage
Multiple agents with different prompts, spawning
agents trough interanl main agent API
Use-case example
Document summarisation
Chatbot conversation memory
ChatGPT with plugins
AutoGPT, BabyAGI
© HTEC Group, 2023
Architecture
Dive - Dynamic
Agent
Approach To Architecting Generative AI Solutions
© HTEC Group, 2023
14
Agent Reusability
Configuration Agent type
(container)
Agent
instance
Approach To Architecting Generative AI Solutions
© HTEC Group, 2023
15
Auth
Token
Stream data
Make API calls
Send requests in natural language
Architecture
Approach To Architecting Generative AI Solutions
Validate Auth Token
Send
message
Consume message
Notify on start
Stream response
Upload data with token
Send prompt
Send prompt to LLM
Send
progress
Consume
progress status
Read/write chat
history
© HTEC Group, 2023
16
Send prompt
Send
progress
Auth
Token
Stream data
Make API calls
Send requests in natural language
Authentication
Approach To Architecting Generative AI Solutions
Validate Auth Token
Send
message
Consume message
Notify on start
Stream response
Upload data with token
Send prompt to LLM
Consume
progress status
Read/write chat
history
© HTEC Group, 2023
17
Auth
Token
Stream data
Make API calls
Send requests in natural language
Data Upload
Approach To Architecting Generative AI Solutions
Validate Auth Token
Send
message
Consume message
Notify on start
Stream response
Upload data with token
Send prompt to LLM
Send
progress
Consume
progress status
Read/write chat
history
Send prompt
© HTEC Group, 2023
18
Send
progress
Auth
Token
Stream data
Make API calls
Send requests in natural language
Inference Flow
Approach To Architecting Generative AI Solutions
Validate Auth Token
Send
message
Consume message
Notify on start
Stream response
Upload data with token
Send prompt
Send prompt to LLM
Consume
progress status
Read/write chat
history
© HTEC Group, 2023
19
Auth
Token
Stream data
Make API calls
Send requests in natural language
Notification Flow
Approach To Architecting Generative AI Solutions
Validate Auth Token
Send
message
Consume message
Notify on start
Stream response
Upload data with token
Send prompt to LLM
Send
progress
Consume
progress status
Read/write chat
history
Send prompt
© HTEC Group, 2023
20
Chatbot components
Approach To Architecting Generative AI Solutions
Make API calls
Make API calls
LLM request
Prompt rendering
Multiple techniques
Conversation
history
Memory configuration
Input data
Strategy config
Prompt templates
Consume message
- metadata
- agent configuration
- input data
Progress message Stream response data
Send prompt to LLM
© HTEC Group, 2023
21
Configuration
Approach To Architecting Generative AI Solutions
Make API calls
Make API calls
LLM request
Prompt rendering
Multiple techniques
Conversation
history
Memory configuration
Input data
Strategy config
Prompt templates
Consume message
- metadata
- agent configuration
- input data
Progress message Stream response data
Send prompt to LLM
Notification
© HTEC Group, 2023
22
Agent Type
Approach To Architecting Generative AI Solutions
Make API calls
Make API calls
LLM request
Prompt rendering
Multiple techniques
Conversation
history
Memory configuration
Strategy config
Prompt templates
Consume message
- metadata
- agent configuration
- input data
Progress message Stream response data
Send prompt to LLM
Notification
© HTEC Group, 2023
23
Agent Data Flow
Approach To Architecting Generative AI Solutions
Make API calls
Make API calls
LLM request
Prompt rendering
Multiple techniques
Conversation
history
Memory configuration
Input data
Consume message
- input data
Progress message Stream response data
Send prompt to LLM
Notification
© HTEC Group, 2023
24
Transformation
Data Sources Business Logic End User
Ingestion
Analytics Service
Question Processing
Context Building
Query Generation
Fund Monitoring Platform
Azure Data Lake
(Raw Storage)
Azure Data Lake
(Analytics Storage)
Data Processing
(Spark Pool)
Ingestion Pipeline
UI
Argo CD Azure DevOps Azure Container
Registry
Azure Key Vault Azure OpenAI
Service
Azure Synapse Analytics
Real-world AI and Generative AI
AI-Assisted Fund Performance Analytics
Authentication &
Authorisation
(KeyCloak)
© HTEC Group, 2023
Architecting Generative AI Solutions
QnA
Ivan Petrović
Senior Technology Lead @ HTEC Group

More Related Content

Similar to [DSC Europe 23] Ivan Petrovic - Approach to Architecting Generative AI Solutions

CWIN17 India / Industrialization - ad center -virtual-session rajashree das
CWIN17 India / Industrialization - ad center -virtual-session  rajashree dasCWIN17 India / Industrialization - ad center -virtual-session  rajashree das
CWIN17 India / Industrialization - ad center -virtual-session rajashree dasCapgemini
 
Cloud 12 08 V2
Cloud 12 08 V2Cloud 12 08 V2
Cloud 12 08 V2Pini Cohen
 
Agile Data Integration: How is it possible?
Agile Data Integration: How is it possible?Agile Data Integration: How is it possible?
Agile Data Integration: How is it possible?confluent
 
Corporate Presentation Manam Infotech
Corporate Presentation Manam InfotechCorporate Presentation Manam Infotech
Corporate Presentation Manam InfotechManam Infotech
 
ANIn Bengaluru May 2023 | AI led Enterprise Transformation by Arpit Tandon
ANIn Bengaluru May 2023 | AI led Enterprise Transformation by Arpit TandonANIn Bengaluru May 2023 | AI led Enterprise Transformation by Arpit Tandon
ANIn Bengaluru May 2023 | AI led Enterprise Transformation by Arpit TandonAgileNetwork
 
WSO2Con2024 - Hello Choreo Presentation - Kanchana
WSO2Con2024 - Hello Choreo Presentation - KanchanaWSO2Con2024 - Hello Choreo Presentation - Kanchana
WSO2Con2024 - Hello Choreo Presentation - KanchanaWSO2
 
[DSC Europe 23] Igor Sevo - Engineering Intelligence through Agent Interaction
[DSC Europe 23] Igor Sevo - Engineering Intelligence through Agent Interaction[DSC Europe 23] Igor Sevo - Engineering Intelligence through Agent Interaction
[DSC Europe 23] Igor Sevo - Engineering Intelligence through Agent InteractionDataScienceConferenc1
 
Core Synergetics Presentation 2015-16
Core Synergetics Presentation 2015-16Core Synergetics Presentation 2015-16
Core Synergetics Presentation 2015-16Yojana Karande
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
stackconf 2021 | Data Driven Security
stackconf 2021 | Data Driven Securitystackconf 2021 | Data Driven Security
stackconf 2021 | Data Driven SecurityNETWAYS
 
Content Strategy and Developer Engagement for DevPortals
Content Strategy and Developer Engagement for DevPortalsContent Strategy and Developer Engagement for DevPortals
Content Strategy and Developer Engagement for DevPortalsAxway
 
Resume-Sundar MR
Resume-Sundar MRResume-Sundar MR
Resume-Sundar MRSundar M R
 
Blockchain conference cwin18 mexico
Blockchain conference cwin18 mexicoBlockchain conference cwin18 mexico
Blockchain conference cwin18 mexicoCapgemini
 
ChainDigit your partner for blockchain solutions
ChainDigit   your partner for blockchain solutionsChainDigit   your partner for blockchain solutions
ChainDigit your partner for blockchain solutionsBaiju Jacob
 
Why Automate the Network?
Why Automate the Network?Why Automate the Network?
Why Automate the Network?Hank Preston
 
Gen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdfGen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdfPhilipBasford
 
WSO2 ITALIA SMART TALK #4 - Telefonica Use Case
WSO2 ITALIA SMART TALK #4 - Telefonica Use CaseWSO2 ITALIA SMART TALK #4 - Telefonica Use Case
WSO2 ITALIA SMART TALK #4 - Telefonica Use CaseProfesia Srl, Lynx Group
 

Similar to [DSC Europe 23] Ivan Petrovic - Approach to Architecting Generative AI Solutions (20)

CV_2015_Supreet
CV_2015_SupreetCV_2015_Supreet
CV_2015_Supreet
 
CWIN17 India / Industrialization - ad center -virtual-session rajashree das
CWIN17 India / Industrialization - ad center -virtual-session  rajashree dasCWIN17 India / Industrialization - ad center -virtual-session  rajashree das
CWIN17 India / Industrialization - ad center -virtual-session rajashree das
 
Cloud 12 08 V2
Cloud 12 08 V2Cloud 12 08 V2
Cloud 12 08 V2
 
Agile Data Integration: How is it possible?
Agile Data Integration: How is it possible?Agile Data Integration: How is it possible?
Agile Data Integration: How is it possible?
 
Corporate Presentation Manam Infotech
Corporate Presentation Manam InfotechCorporate Presentation Manam Infotech
Corporate Presentation Manam Infotech
 
ANIn Bengaluru May 2023 | AI led Enterprise Transformation by Arpit Tandon
ANIn Bengaluru May 2023 | AI led Enterprise Transformation by Arpit TandonANIn Bengaluru May 2023 | AI led Enterprise Transformation by Arpit Tandon
ANIn Bengaluru May 2023 | AI led Enterprise Transformation by Arpit Tandon
 
WSO2Con2024 - Hello Choreo Presentation - Kanchana
WSO2Con2024 - Hello Choreo Presentation - KanchanaWSO2Con2024 - Hello Choreo Presentation - Kanchana
WSO2Con2024 - Hello Choreo Presentation - Kanchana
 
[DSC Europe 23] Igor Sevo - Engineering Intelligence through Agent Interaction
[DSC Europe 23] Igor Sevo - Engineering Intelligence through Agent Interaction[DSC Europe 23] Igor Sevo - Engineering Intelligence through Agent Interaction
[DSC Europe 23] Igor Sevo - Engineering Intelligence through Agent Interaction
 
Core synergetics presentation 2015 16
Core synergetics presentation 2015 16Core synergetics presentation 2015 16
Core synergetics presentation 2015 16
 
Core Synergetics Presentation 2015-16
Core Synergetics Presentation 2015-16Core Synergetics Presentation 2015-16
Core Synergetics Presentation 2015-16
 
Core synergetics presentation 2015-16
Core synergetics presentation 2015-16Core synergetics presentation 2015-16
Core synergetics presentation 2015-16
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
stackconf 2021 | Data Driven Security
stackconf 2021 | Data Driven Securitystackconf 2021 | Data Driven Security
stackconf 2021 | Data Driven Security
 
Content Strategy and Developer Engagement for DevPortals
Content Strategy and Developer Engagement for DevPortalsContent Strategy and Developer Engagement for DevPortals
Content Strategy and Developer Engagement for DevPortals
 
Resume-Sundar MR
Resume-Sundar MRResume-Sundar MR
Resume-Sundar MR
 
Blockchain conference cwin18 mexico
Blockchain conference cwin18 mexicoBlockchain conference cwin18 mexico
Blockchain conference cwin18 mexico
 
ChainDigit your partner for blockchain solutions
ChainDigit   your partner for blockchain solutionsChainDigit   your partner for blockchain solutions
ChainDigit your partner for blockchain solutions
 
Why Automate the Network?
Why Automate the Network?Why Automate the Network?
Why Automate the Network?
 
Gen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdfGen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdf
 
WSO2 ITALIA SMART TALK #4 - Telefonica Use Case
WSO2 ITALIA SMART TALK #4 - Telefonica Use CaseWSO2 ITALIA SMART TALK #4 - Telefonica Use Case
WSO2 ITALIA SMART TALK #4 - Telefonica Use Case
 

More from DataScienceConferenc1

[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdfDataScienceConferenc1
 
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...DataScienceConferenc1
 
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdfDataScienceConferenc1
 
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdfDataScienceConferenc1
 
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdfDataScienceConferenc1
 
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptxDataScienceConferenc1
 
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdfDataScienceConferenc1
 
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...DataScienceConferenc1
 
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdfDataScienceConferenc1
 
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...DataScienceConferenc1
 
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...DataScienceConferenc1
 
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdfDataScienceConferenc1
 
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptxDataScienceConferenc1
 
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...DataScienceConferenc1
 
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptxDataScienceConferenc1
 
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...DataScienceConferenc1
 
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...DataScienceConferenc1
 
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptxDataScienceConferenc1
 
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptxDataScienceConferenc1
 
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdfDataScienceConferenc1
 

More from DataScienceConferenc1 (20)

[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
 
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...
 
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf
 
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
 
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
 
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx
 
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf
 
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...
 
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf
 
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...
 
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...
 
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf
 
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx
 
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...
 
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx
 
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...
 
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...
 
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx
 
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
 
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf
 

Recently uploaded

Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictJack Cole
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理pyhepag
 
2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group MeetingAlison Pitt
 
一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理pyhepag
 
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Valters Lauzums
 
2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Calllward7
 
How I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prisonHow I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prisonPayment Village
 
Artificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfArtificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfscitechtalktv
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxDilipVasan
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理pyhepag
 
Formulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfFormulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfRobertoOcampo24
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理pyhepag
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理cyebo
 
Easy and simple project file on mp online
Easy and simple project file on mp onlineEasy and simple project file on mp online
Easy and simple project file on mp onlinebalibahu1313
 
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdfGenerative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdfEmmanuel Dauda
 
Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyRafigAliyev2
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxStephen266013
 
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...ssuserf63bd7
 
The Significance of Transliteration Enhancing
The Significance of Transliteration EnhancingThe Significance of Transliteration Enhancing
The Significance of Transliteration Enhancingmohamed Elzalabany
 
Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Jon Hansen
 

Recently uploaded (20)

Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
 
2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting
 
一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
 
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
 
2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call
 
How I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prisonHow I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prison
 
Artificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfArtificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdf
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
 
Formulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfFormulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdf
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
 
Easy and simple project file on mp online
Easy and simple project file on mp onlineEasy and simple project file on mp online
Easy and simple project file on mp online
 
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdfGenerative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
 
Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
 
The Significance of Transliteration Enhancing
The Significance of Transliteration EnhancingThe Significance of Transliteration Enhancing
The Significance of Transliteration Enhancing
 
Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)
 

[DSC Europe 23] Ivan Petrovic - Approach to Architecting Generative AI Solutions

  • 1. © HTEC Group, 2023 Approach To Architecting GenerativeAI Solutions Ivan Petrović Senior Technology Lead @ HTEC Group Empowering your digital tomorrow
  • 2. © HTEC Group, 2023 2 Topics Approach To Architecting Generative AI Solutions Architecture Dive – Dynamic Agent Intro To Non-AI Folks HTEC’s Approach 01 02 03
  • 3. © HTEC Group, 2023 Approach To Architecting Generative AI Solutions Intro To Non-AI Folks
  • 4. © HTEC Group, 2023 4 What Are LLMs? Approach To Architecting Generative AI Solutions Large Language Model Paris What is the capital of France? Output Input
  • 5. © HTEC Group, 2023 5 Hosting LLMs Approach To Architecting Generative AI Solutions Large Language Model + +
  • 6. © HTEC Group, 2023 6 Multimodal Models Approach To Architecting Generative AI Solutions Multimodal Model IMU
  • 7. © HTEC Group, 2023 7 Most Common Use-Cases Approach To Architecting Generative AI Solutions Classify Generate Rewrite Cluster Extract Search Summarize
  • 8. © HTEC Group, 2023 HTEC’s Approach Approach To Architecting Generative AI Solutions
  • 9. © HTEC Group, 2023 9 Throughout our consultative process we apply Cognitive Design principals and Responsible AI practices to ensure human integrity and ethical guidance Framework COGNITIVE DESIGN & RESPONSIBLE AI ACCELERATORS Analytics Intelligent assist 2 1 3 Data AI/ML ENGINEERING DATA ENGINEERING DATA SCIENCE PRODUCT DESIGN Cognitive Design CX SERVICE DESIGN BUSINESS DESIGN Generative AI Automation & productivity Autonomous systems Decision intelligence Data-driven innovation Computer vision Knowledge management Predictive analytics Intelligent process automation Anomaly detection AI-powered security & compliance Predictive maintenance STEP 1 Problem assessment STEP 2 Cognitive design STEP 3 Secure data onboarding & security STEP 4 AI/ML model development STEP 5 Continuous solution optimization STEP 6 Deployment & scaling Approach To Architecting Generative AI Solutions
  • 10. © HTEC Group, 2023 10 Prompts and Prompt Engineering • Prompts are inputs sent to LLM to elicit a certain response • Natural language instructions • Prompt Enginering • Influence the creativity of LLM • Halucinations • Training vs Prompting Approach To Architecting Generative AI Solutions Prompt content Instruction Context Output Indicator Input Data
  • 11. © HTEC Group, 2023 11 Points of Interest • Types of learning (LLM) • Zero shot • One shot • Few shot • Fine tunning • Vector Databases • Considerations • Ethics • Licencing • Knowledge cutoff • Token limitations • Token modalities Approach To Architecting Generative AI Solutions Input Zero Shot Example 1 One Shot Input Example 1 Few Shot … Example N Input
  • 12. © HTEC Group, 2023 12 Complexity Hierarchy Approach To Architecting Generative AI Solutions Complexity level No Context Simple Context Tool Use Multi-Agent Important points No persistent memory, can be used hirerachaly Retaining conv. history, conv. buffers, summary, window Persisten memory with vector storage Multiple agents with different prompts, spawning agents trough interanl main agent API Use-case example Document summarisation Chatbot conversation memory ChatGPT with plugins AutoGPT, BabyAGI
  • 13. © HTEC Group, 2023 Architecture Dive - Dynamic Agent Approach To Architecting Generative AI Solutions
  • 14. © HTEC Group, 2023 14 Agent Reusability Configuration Agent type (container) Agent instance Approach To Architecting Generative AI Solutions
  • 15. © HTEC Group, 2023 15 Auth Token Stream data Make API calls Send requests in natural language Architecture Approach To Architecting Generative AI Solutions Validate Auth Token Send message Consume message Notify on start Stream response Upload data with token Send prompt Send prompt to LLM Send progress Consume progress status Read/write chat history
  • 16. © HTEC Group, 2023 16 Send prompt Send progress Auth Token Stream data Make API calls Send requests in natural language Authentication Approach To Architecting Generative AI Solutions Validate Auth Token Send message Consume message Notify on start Stream response Upload data with token Send prompt to LLM Consume progress status Read/write chat history
  • 17. © HTEC Group, 2023 17 Auth Token Stream data Make API calls Send requests in natural language Data Upload Approach To Architecting Generative AI Solutions Validate Auth Token Send message Consume message Notify on start Stream response Upload data with token Send prompt to LLM Send progress Consume progress status Read/write chat history Send prompt
  • 18. © HTEC Group, 2023 18 Send progress Auth Token Stream data Make API calls Send requests in natural language Inference Flow Approach To Architecting Generative AI Solutions Validate Auth Token Send message Consume message Notify on start Stream response Upload data with token Send prompt Send prompt to LLM Consume progress status Read/write chat history
  • 19. © HTEC Group, 2023 19 Auth Token Stream data Make API calls Send requests in natural language Notification Flow Approach To Architecting Generative AI Solutions Validate Auth Token Send message Consume message Notify on start Stream response Upload data with token Send prompt to LLM Send progress Consume progress status Read/write chat history Send prompt
  • 20. © HTEC Group, 2023 20 Chatbot components Approach To Architecting Generative AI Solutions Make API calls Make API calls LLM request Prompt rendering Multiple techniques Conversation history Memory configuration Input data Strategy config Prompt templates Consume message - metadata - agent configuration - input data Progress message Stream response data Send prompt to LLM
  • 21. © HTEC Group, 2023 21 Configuration Approach To Architecting Generative AI Solutions Make API calls Make API calls LLM request Prompt rendering Multiple techniques Conversation history Memory configuration Input data Strategy config Prompt templates Consume message - metadata - agent configuration - input data Progress message Stream response data Send prompt to LLM Notification
  • 22. © HTEC Group, 2023 22 Agent Type Approach To Architecting Generative AI Solutions Make API calls Make API calls LLM request Prompt rendering Multiple techniques Conversation history Memory configuration Strategy config Prompt templates Consume message - metadata - agent configuration - input data Progress message Stream response data Send prompt to LLM Notification
  • 23. © HTEC Group, 2023 23 Agent Data Flow Approach To Architecting Generative AI Solutions Make API calls Make API calls LLM request Prompt rendering Multiple techniques Conversation history Memory configuration Input data Consume message - input data Progress message Stream response data Send prompt to LLM Notification
  • 24. © HTEC Group, 2023 24 Transformation Data Sources Business Logic End User Ingestion Analytics Service Question Processing Context Building Query Generation Fund Monitoring Platform Azure Data Lake (Raw Storage) Azure Data Lake (Analytics Storage) Data Processing (Spark Pool) Ingestion Pipeline UI Argo CD Azure DevOps Azure Container Registry Azure Key Vault Azure OpenAI Service Azure Synapse Analytics Real-world AI and Generative AI AI-Assisted Fund Performance Analytics Authentication & Authorisation (KeyCloak)
  • 25. © HTEC Group, 2023 Architecting Generative AI Solutions QnA Ivan Petrović Senior Technology Lead @ HTEC Group

Editor's Notes

  1. Greet the audience, greet the oportunity to speak and share on DSC Provide an short intro if not done before the presentation
  2. Glance over topics that we will cover - First one is something that will help Non AI folks to get an understanding what the heck is this genAI everyone talks about (if there are any) - Overview of how we @ HTEC approaches about thinking and solving these problems and what to consider when building - Go into one of the accelerators we have and an example of the system we have built
  3. This can be an overview for someone who is non AI engineer @ HTEC we have identified that there is a path engineer can take to get into the field of Generative AI and we will touch on that - To enable them to catch up - Learning Plan for engineers Why is it important to emphasise this path to learning and intro to Generative AI
  4. What are LLMs? Number of parameters, RLHF
  5. How LLMs can be used/deployed - on prem, deploying custom LLM on cloud enviorment - Managed by your company - ON EDGE - using 3rd party LLM, e.g. OpenAI via OpenAI API - Hybrid; combination of first two
  6. What modalities are there that LLMs are going if not to be using? - Text - Image - Audio - Sensor data (Meta) Etc. Naglasiti da postoje posebne implementacije koje to omogućavaju - možda reći malo o tome šta je tokenizacija -
  7. What are some of the use cases when LLMs can be used Focus is only on text right now, but we are seeing that image is being used also but not that often.
  8. Before diving into the concreete examples - describe to audience how we at HTEC are approaching the problem of identifying HTEC's
  9. Go trough the steps but from the engineering and problem perspective Emphasis PREDICTIVE WALK BETWEEN RND AND Productization This is a fork of existing workflow and augmented by the needs we have seen to emerge in GenAI Whitepaper was released by our colleagues that targets most of the first 2 steps. We will focus mostly on the middle part right now because it is important to see how much it differs from the traditional AI workflows - What we are observing as new steps emerging, or being more emphasized - Cognitive design - More emphasis on security and monitoring for potential rogue behavior (Agents can require access sensitive data, like query databases, generate queries, etc.) - Accelerators are something you build to help you jump-start the implementation
  10. - What are Prompts? - What is their modality? Considering only text now - What is prompt engineering? - How to handle creativity of the LLMs? - From which parts it consists of? What are tokens?
  11. Postaviti se iz pozicije da smo mi ovo pomagali našim arhitektama... - treniram trenere - šta smo mi pomogli i kako našim kolegama - What and how models can learn and be trained? - Model training – adapting the weights to accommodate new data - Model prompting – selecting model input to achieve desired output - Types of learning - slightly different than one in DL - Ethical consideration - Licensing and knowledge cutoff - Vector Databasees - How the semantic search is done - Token Limitation - Prompt length is limited - prompt contains task specification, additional information, and context information - Token Modalities - Text modality, a token is approx 3 characteds (3 tokens are used per word, on average) - Other modalities use different tokenization schemes - Tendencija da 'e modeli da budu skoro near real time... - Promena u poslednjih mesec dva koliko je - GROK je near real time - Recent update to openAI
  12. Talk about the complexity - Increasing complexity provides more capabitilities to be achieved - Stateless API – Two approaches - chat completition - text completition - Hint kako utiče kontekst i da li veličlina konteksta je bitna - AutoGPT malo bolje objasniti - šta su obećali a šta je moguće
  13. Dive into an example how an Dynamic Agent would be implemented Implementation depends on the use-case Focus is on reusability and modularity of the system that is being implemented
  14. Building the solution, modularity and configurability is the key. This is one of the ways how to create an accelearator
  15. Sinhroni i asinhrona komunikacija Ideja za regione, multiregion za lokalne limite Crtica ka spolja C4 L2 nivo dijagrama - Da postoje kontejneri Naglasiti kao simplified Vehicle ka arhitektama da oni budu svesni šta sve postoji ovde o čemu moraju voditi rešenja
  16. Sinhroni i asinhrona komunikacija Ideja za regione, multiregion za lokalne limite Crtica ka spolja C4 L2 nivo dijagrama - Da postoje kontejneri Naglasiti kao simplified Vehicle ka arhitektama da oni budu svesni šta sve postoji ovde o čemu moraju voditi rešenja
  17. Sinhroni i asinhrona komunikacija Ideja za regione, multiregion za lokalne limite Crtica ka spolja C4 L2 nivo dijagrama - Da postoje kontejneri Naglasiti kao simplified Vehicle ka arhitektama da oni budu svesni šta sve postoji ovde o čemu moraju voditi rešenja Get SAS Token
  18. Sinhroni i asinhrona komunikacija Ideja za regione, multiregion za lokalne limite Crtica ka spolja C4 L2 nivo dijagrama - Da postoje kontejneri Naglasiti kao simplified Vehicle ka arhitektama da oni budu svesni šta sve postoji ovde o čemu moraju voditi rešenja RPC - job based sistem za obradu nekih duga;kih zadataka - sistem koji ne zavisi od request response timeouta Progress queue Da bi korisnik znao gde se trenutno nalazi sistem 100 documents to process
  19. Sinhroni i asinhrona komunikacija Ideja za regione, multiregion za lokalne limite Crtica ka spolja C4 L2 nivo dijagrama - Da postoje kontejneri Naglasiti kao simplified Vehicle ka arhitektama da oni budu svesni šta sve postoji ovde o čemu moraju voditi rešenja
  20. Not constrained to function calling for certain tasks This enables the system to be interchangable