[DSC Europe 23] Ivan Petrovic - Approach to Architecting Generative AI Solutions

© HTEC Group, 2023
Approach To Architecting
GenerativeAI Solutions
Ivan Petrović
Senior Technology Lead @ HTEC Group
Empowering your digital tomorrow

© HTEC Group, 2023
2
Topics
Approach To Architecting Generative AI Solutions
Architecture Dive – Dynamic Agent
Intro To Non-AI Folks
HTEC’s Approach
01
02
03

© HTEC Group, 2023
Intro To
Non-AI Folks

© HTEC Group, 2023
4
What Are LLMs?
Large Language Model
Paris
What is the capital of France?
Output
Input

© HTEC Group, 2023
5
Hosting LLMs
Large Language Model
+
+

© HTEC Group, 2023
6
Multimodal Models
Multimodal
Model
IMU

© HTEC Group, 2023
7
Most Common Use-Cases
Classify
Generate
Rewrite
Cluster
Extract
Search
Summarize

© HTEC Group, 2023
HTEC’s
Approach

© HTEC Group, 2023
9
Throughout our consultative process we apply Cognitive Design principals and Responsible AI practices to ensure human integrity and ethical guidance
Framework
COGNITIVE DESIGN & RESPONSIBLE AI
ACCELERATORS
Analytics
Intelligent
assist
2
1
3
Data
AI/ML
ENGINEERING
DATA
ENGINEERING
DATA
SCIENCE
PRODUCT
DESIGN
Cognitive
Design
CX
SERVICE DESIGN
BUSINESS
DESIGN
Generative AI
Automation &
productivity
Autonomous
systems
Decision
intelligence
Data-driven
innovation
Computer
vision
Knowledge
management
Predictive
analytics
Intelligent
process
automation
Anomaly
detection
AI-powered
security &
compliance
Predictive
maintenance
STEP 1
Problem assessment
STEP 2
Cognitive design
STEP 3
Secure data onboarding & security
STEP 4
AI/ML model development
STEP 5
Continuous solution optimization
STEP 6
Deployment & scaling

© HTEC Group, 2023
10
Prompts and
Prompt Engineering
• Prompts are inputs sent to LLM to elicit a certain response
• Natural language instructions
• Prompt Enginering
• Influence the creativity of LLM
• Halucinations
• Training vs Prompting
Prompt content
Instruction
Context
Output Indicator
Input Data

© HTEC Group, 2023
11
Points of Interest
• Types of learning (LLM)
• Zero shot
• One shot
• Few shot
• Fine tunning
• Vector Databases
• Considerations
• Ethics
• Licencing
• Knowledge cutoff
• Token limitations
• Token modalities
Input
Zero Shot
Example 1
One Shot
Input
Example 1
Few Shot
…
Example N
Input

© HTEC Group, 2023
12
Complexity Hierarchy
Complexity level
No Context
Simple Context
Tool Use
Multi-Agent
Important points
No persistent memory, can be used hirerachaly
Retaining conv. history, conv. buffers, summary,
window
Persisten memory with vector storage
Multiple agents with different prompts, spawning
agents trough interanl main agent API
Use-case example
Document summarisation
Chatbot conversation memory
ChatGPT with plugins
AutoGPT, BabyAGI

© HTEC Group, 2023
Architecture
Dive - Dynamic
Agent

© HTEC Group, 2023
14
Agent Reusability
Configuration Agent type
(container)
Agent
instance

© HTEC Group, 2023
15
Auth
Token
Stream data
Make API calls
Send requests in natural language
Architecture
Validate Auth Token
Send
message
Consume message
Notify on start
Stream response
Upload data with token
Send prompt
Send prompt to LLM
Send
progress
Consume
progress status
Read/write chat
history

© HTEC Group, 2023
16
Send prompt
Send
progress
Auth
Token
Stream data
Make API calls
Authentication
Validate Auth Token
Send
message
Consume message
Notify on start
Stream response
Send prompt to LLM
Consume
progress status
Read/write chat
history

© HTEC Group, 2023
17
Auth
Token
Stream data
Make API calls
Data Upload
Validate Auth Token
Send
message
Consume message
Notify on start
Stream response
Send prompt to LLM
Send
progress
Consume
progress status
Read/write chat
history
Send prompt

© HTEC Group, 2023
18
Send
progress
Auth
Token
Stream data
Make API calls
Inference Flow
Validate Auth Token
Send
message
Consume message
Notify on start
Stream response
Send prompt
Send prompt to LLM
Consume
progress status
Read/write chat
history

© HTEC Group, 2023
19
Auth
Token
Stream data
Make API calls
Notification Flow
Validate Auth Token
Send
message
Consume message
Notify on start
Stream response
Send prompt to LLM
Send
progress
Consume
progress status
Read/write chat
history
Send prompt

© HTEC Group, 2023
20
Chatbot components
Make API calls
Make API calls
LLM request
Prompt rendering
Multiple techniques
Conversation
history
Memory configuration
Input data
Strategy config
Prompt templates
Consume message
- metadata
- agent configuration
- input data
Progress message Stream response data
Send prompt to LLM

© HTEC Group, 2023
21
Configuration
Make API calls
Make API calls
LLM request
Prompt rendering
Multiple techniques
Conversation
history
Input data
Strategy config
Prompt templates
Consume message
- metadata
- input data
Send prompt to LLM
Notification

© HTEC Group, 2023
22
Agent Type
Make API calls
Make API calls
LLM request
Prompt rendering
Multiple techniques
Conversation
history
Strategy config
Prompt templates
Consume message
- metadata
- input data
Send prompt to LLM
Notification

© HTEC Group, 2023
23
Agent Data Flow
Make API calls
Make API calls
LLM request
Prompt rendering
Multiple techniques
Conversation
history
Input data
Consume message
- input data
Send prompt to LLM
Notification

© HTEC Group, 2023
24
Transformation
Data Sources Business Logic End User
Ingestion
Analytics Service
Question Processing
Context Building
Query Generation
Fund Monitoring Platform
Azure Data Lake
(Raw Storage)
Azure Data Lake
(Analytics Storage)
Data Processing
(Spark Pool)
Ingestion Pipeline
UI
Argo CD Azure DevOps Azure Container
Registry
Azure Key Vault Azure OpenAI
Service
Azure Synapse Analytics
Real-world AI and Generative AI
AI-Assisted Fund Performance Analytics
Authentication &
Authorisation
(KeyCloak)

[DSC Europe 23] Ivan Petrovic - Approach to Architecting Generative AI Solutions

Recommended

Recommended

More Related Content

Similar to [DSC Europe 23] Ivan Petrovic - Approach to Architecting Generative AI Solutions

Similar to [DSC Europe 23] Ivan Petrovic - Approach to Architecting Generative AI Solutions (20)

More from DataScienceConferenc1

More from DataScienceConferenc1 (20)

Recently uploaded

Recently uploaded (20)

[DSC Europe 23] Ivan Petrovic - Approach to Architecting Generative AI Solutions

Editor's Notes