Amazon Q and Amazon Bedrock, fully managed vs. custom - 2025-06-25
AIConf 2025
Overview of AWS services about machine learning and generative AI.
https://www.aiconf.it/e/sessione/3727/Amazon-Q-and-Amazon-Bedrock-fully-managed-vs-custom
AI Conf 2025
Oltre500 progetti su AWS
Corley Cloud è una realtà certificata
con innumerevoli riconoscimenti e
un portfolio di centinaia di progetti
AWS sviluppati in diversi ambiti:
cloud native, migrazione, machine
learning & AI, serverless, IoT,
sicurezza e cloudOps.
Advanced Partner AWS
AI Conf 2025
Whatare the steps of ML ?
➔ The data may arrive ready for learning,
but often some processing is needed
➔ Model training could be delegated to an
AI system, except for custom steps
➔ Evaluation is a prediction for which we
know the expected values, for which we
can calculate metrics
➔ The prediction works on new data
processed with point 1 with the best
model saved in point 3
Preparation
Training
& Tuning
Testing
& Evaluation
Prediction
(inference)
AI Conf 2025
Arethere other steps or actors in ML ?
➔ Embeddings are objects that contain
information about text, images, videos,
audio or code
➔ The prompt is the text that contains the
behavior that the model must have, the
instructions to follow to respond to the
request posed.
➔ Augmented Generation (AG) techniques
allow us to exploit generalist models by
providing them with instructions (the
prompt), context (an extract of the
embeddings) and a request to obtain a
specific response.
Question
AG
Answer
Embeddings
& Prompt
LM
AI Conf 2025
AWSServices Comparison for a Chatbot
Services Difficulty Embeddings Training $ Inference $
Amazon Q
Business
$0.264 / hour
/ 200MB
$20 / user /
mo
Bedrock
fine tuning $2 / 1000
queries
$0.0079 / 1k
tokens
$30 / hour
Bedrock
on demand
$0.00072 for input / 1k tokens
and for output / 1k tokens
Amazon
SageMaker
$2 / 1000
queries
$0.921 / hour $0.921 / hour
97.
AI Conf 2025
AWSServices Comparison for a Chatbot
Services Difficulty Embeddings Training $ Inference $
Amazon Q
Business
$0.264 / hour
/ 200MB
$20 / user /
mo
Bedrock
fine tuning $2 / 1000
queries
$0.0079 / 1k
tokens
$30 / hour
Bedrock
on demand
$0.00072 for input / 1k tokens
and for output / 1k tokens
Amazon
SageMaker
$2 / 1000
queries
$0.921 / hour $0.921 / hour
98.
AI Conf 2025
AWSServices Comparison for a Chatbot
Services Difficulty Embeddings Training $ Inference $
Amazon Q
Business
$0.264 / hour
/ 200MB
$20 / user /
mo
Bedrock
fine tuning $2 / 1000
queries
$0.0079 / 1k
tokens
$30 / hour
Bedrock
on demand
$0.00072 for input / 1k tokens
and for output / 1k tokens
Amazon
SageMaker
$2 / 1000
queries
$0.921 / hour $0.921 / hour
99.
AI Conf 2025
AWSServices Comparison for a Chatbot
Services Difficulty Embeddings Training $ Inference $
Amazon Q
Business
$0.264 / hour
/ 200MB
$20 / user /
mo
Bedrock
fine tuning $2 / 1000
queries
$0.0079 / 1k
tokens
$30 / hour
Bedrock
on demand
$0.00072 for input / 1k tokens
and for output / 1k tokens
Amazon
SageMaker
$2 / 1000
queries
$0.921 / hour $0.921 / hour
100.
AI Conf 2025
ServicesEmbeddings $ Training $ Inference $
Amazon Q Business 190 20 (user / mo)
Bedrock fine tuning
2
1.5089 22320
Bedrock on demand 1.0714 (per 1k token)
Amazon SageMaker
2 0.0154
3.68 (serverless)
685.22 (provisioned)
AWS Services Comparison for a Chatbot
Excluded from costs: ML storage, data processing and provisioned concurrency (serverless only)
Example: 1 training of 191011 tokens + 1 request of 20s for every hour, every day for a month
101.
AI Conf 2025
ServicesEmbeddings $ Training $ Inference $
Amazon Q Business 190 20 (user / mo)
Bedrock fine tuning
2
1.5089 22320
Bedrock on demand 1.0714 (per 1k token)
Amazon SageMaker
2 0.0154
3.68 (serverless)
685.22 (provisioned)
AWS Services Comparison for a Chatbot
Excluded from costs: ML storage, data processing and provisioned concurrency (serverless only)
Example: 1 training of 191011 tokens + 1 request of 20s for every hour, every day for a month
102.
AI Conf 2025
ServicesEmbeddings $ Training $ Inference $
Amazon Q Business 190 20 (user / mo)
Bedrock fine tuning
2
1.5089 22320
Bedrock on demand 1.0714 (per 1k token)
Amazon SageMaker
2 0.0154
3.68 (serverless)
685.22 (provisioned)
AWS Services Comparison for a Chatbot
Excluded from costs: ML storage, data processing and provisioned concurrency (serverless only)
Example: 1 training of 191011 tokens + 1 request of 20s for every hour, every day for a month
103.
AI Conf 2025
ServicesEmbeddings $ Training $ Inference $
Amazon Q Business 190 20 (user / mo)
Bedrock fine tuning
2
1.5089 22320
Bedrock on demand 1.0714 (per 1k token)
Amazon SageMaker
2 0.0154
3.68 (serverless)
685.22 (provisioned)
AWS Services Comparison for a Chatbot
Excluded from costs: ML storage, data processing and provisioned concurrency (serverless only)
Example: 1 training of 191011 tokens + 1 request of 20s for every hour, every day for a month
104.
AI Conf 2025
ServicesEmbeddings $ Training $ Inference $
Amazon Q Business 190
Bedrock fine tuning
2
1.5089 22320
Bedrock on demand 1.0714 (per 1k token)
Amazon SageMaker
2 0.0154
3.68 (serverless)
685.22 (provisioned)
AWS Services Comparison for a Chatbot
Excluded from costs: ML storage, data processing and provisioned concurrency (serverless only)
Example: 1 training of 191011 tokens + 1 request of 20s for every hour, every day for a month