SlideShare a Scribd company logo
1 of 44
Serverless
Generative AI
AWS User Groups of Florida
Fort Lauderdale, FL, USA
February 27th, 2024
Patrick Hannah
CTO
CloudHesive
AWS User Groups of Florida – Updates
We are back to In-Person Meetups and working towards a monthly cadence
Always open to ideas on how we can improve the content and format!
Collaborate with us after the MeetUp!
Future MeetUps – Presenters? Topics? Formats?
Slideshare – Keep an eye on our MeetUp Page – we will post a link to the Slides
Slack – Keep the conversation going
Today’s MeetUp Format
Feel free to ask questions throughout the session!
Dedicated Q&A at the end
Topic
In this session, I will unravel the complexities of serverless
generative AI, offering insights into its architecture, applications,
and potential impact on businesses across various industries.
Whether you're a seasoned AWS practitioner or just starting your
journey into cloud computing, this presentation promises to
broaden your horizons and spark new ideas.
Inspiration
“I'm wondering if there is a feature request to create something like a saved
query in Athena that can be executed via a CloudWatch Event?”
The AWS Step Functions service integration with Amazon Athena enables you to use
Step Functions to start and stop query execution, and get query results
AWS User Groups of Florida MeetUp - AWS API Architectures - Scott
Hendrickson, Partner Solutions Architect, AWS
Data sources and resolvers are how AWS AppSync translates GraphQL requests and
fetches information from your AWS resources
AWS Well Architected Framework Serverless Application Lens
If your Lambda function is not performing custom logic while integrating with other
AWS services, chances are that it may be unnecessary
Who doesn’t like connecting things together?
Compute’s Transition to Serverless
Compute - EC2 Bare Metal (Intel, AMD, Graviton, M1)
Compute - EC2 Virtual > Bare Metal (Xen, KVM/Nitro)
Containers - Fargate > ContainderD (was DockerD) > EC2
Serverless - Lambda > Firecracker (Micro VM) > EC2
Serverless’ Flavors
High Level Abstractions
SaaS (Connect)
Hybrid Abstractions
PaaS (DynamoDB)
Low Level Abstractions
IaaS (Lambda)
Service Categories
Analytics
Application Integration
AR & VR
AWS Cost Management
Blockchain
Business Applications
Compute
Customer Engagement
Database
Developer Tools
End User Computing
Game Tech
Internet of Things
Machine Learning
Management & Governance
Media Services
Migration & Transfer
Mobile
Networking & Content Delivery
Quantum Technologies
Robotics
Satellite
Security, Identity, & Compliance
Storage
Workload Personas
Migrated
Server Based
Migrated & Optimized
Blends of Server and Service Based
Serverless/Native
Service Based
Orchestrated
ECS, EKS, K8s
Inherited
Wildcard!
Hybrid
Wildcard!
Well Architected Framework
Operational Excellence
Security
Reliability
Performance Efficiency
Cost Optimization
Sustainability
Cloud Workload Lifecycle Management
Workload
Architecture
Monitoring
Automation
Processes
Integration
Workload + Architecture Drives Service Selection
Containers
Container File
Versioning
Multi-threaded/Single-task
Minutes to Days
Per VM/Per Hour
Virtual Machines
AMI
Patching
Multi-threaded/Multi-task
Hours to Months
Per VM/Per Hour
Functions/Services
Code
Versioning
Single-threaded/Single-task
Microseconds to Seconds
Per Memory/Second/Per Request
Automation + Processes Drives Lifecycle Management Selection
Organizations
Cross-Account Asset Management + Governance
Control Tower
Account vending/default standardization
Service Catalog
Workload platform vending/default standardization
CloudFormation
IaC
Ephemeral Compute + API Managed Data/Control Plane for
Persistence Tiers
Hands off/Lights out
Processes
Patching
Backup/Restore Testing
Failover Testing (AZ)
Credential Rotation/Credential Audit
Event Response Testing
Incident Response Testing
Performance Testing
Performance/Cost Review
Vulnerability/Penetration Testing
Integration
AI/ML Options
Generalized
Specialized
“Balanced”
Generative AI in the context of AWS
Amazon Bedrock
Amazon SageMaker, Studio and Canvas (and Redshift Inferences)
NVIDIA GPU-powered Amazon EC2 instances
AWS Tranium
AWS Inferentia
Amazon EC2 UltraClusters
Amazon Q: Business, AWS, QuickSight, Connect, Supply Chain, Code
Catalyst, IDE, Code Transformation, Query Editor (Redshift)
PartyRock
AWS CodeWhisperer
AWS HealthScribe
Generative AI in the context of AWS
Services that accelerate development for AWS
Services that are powered by it – No-code data connectors/Zero
ETL, Instance Selection, Console to Code (and AppComposer),
Natural Language Querying, Code Scanning, Datazone
(Descriptions)
Services that accelerate development for you – Lex
(Conversational FAQ, Slot Resolution, Bot builder, Utterance
Generator), Personalize (Themes), Transcribe (Summarization)
Services improved by it – Alexa
Rationalization
Why Serverless – how does serverless change how we incept,
launch, and iterate product?
Why GenAI – how does Generative AI change how we think
about solving problems with data?
Foundational Model
Bedrock Operationalization
Non-functional
Regional Considerations
FM Subscription
Throughput/Quotas
Security
Operational Monitoring
Traffic Flow (Private Link)
Functional
Prompt Engineering
Tokens
Model Parameters
Inference Parameters
Sessions
Databases that can be used to store Vector Embeddings
OpenSearch/Serverless
Redis Enterprise and MemoryDB
Pinecone
Aurora (Postgres)
RDS (Postgres)
MongoDB
DocumentDB
Neptune
Machine Learning
Amazon Augmented AI - Easily implement human review of machine learning predictions
Amazon CodeGuru - Intelligent recommendations for building and running modern applications
Amazon Comprehend - Analyze Unstructured Text
Amazon Comprehend Medical - Amazon Comprehend Medical uses machine learning to extract
insights and relationships from medical text.
AWS DeepComposer - AWS DeepComposer allows developers of all skill levels to get started with
Generative AI.
AWS DeepLens - Deep Learning Enabled Video Camera
AWS DeepRacer - Fully autonomous 1/18th scale race car, driven by machine learning
Amazon DevOps Guru - ML-powered cloud operations service to improve application availability.
Amazon Forecast - Amazon Forecast is a fully-managed service for accurate time-series
forecasting
Amazon Fraud Detector - Detect more online fraud faster using machine learning
Amazon HealthLake - Making sense of health data
Amazon Kendra - Highly accurate enterprise search service powered by machine learning
AWS HealthImaging
Amazon Lex - Build Voice and Text Chatbots
Amazon Lookout for Equipment - Detect abnormal equipment behavior by analyzing sensor data
Amazon Lookout for Metrics - Accurately detect anomalies in your business metrics and quickly
understand why
Amazon Lookout for Vision - Identify defects using computer vision to automate quality inspection.
Amazon Monitron - End-to-end system for equipment monitoring
Amazon Omics - Transform omics data into insights.
AWS Panorama - Enabling computer vision applications at the edge
Amazon Personalize - Amazon Personalize helps you easily add real-time recommendations to
your apps
Amazon Polly - Turn Text into Lifelike Speech
Amazon Rekognition - Search and Analyze Images
Amazon SageMaker - Build, Train, and Deploy Machine Learning Models
Amazon Textract - Easily extract text and data from virtually any document
Amazon Transcribe - Powerful Speech Recognition
Amazon Translate - Powerful Neural Machine Translation
Amazon Bedrock
Primary Services
API Tier
API Gateway – API Management
AppSync – GraphQL API
Application (Execution)/Code Tier
Lambda – Serverless Compute
Data Store Tier
DynamoDB – Key/Value Data Base
Service Tier
Event Bridge/Step Functions – Event Bus, Low Code/No Code Workflow
Athena – Interactive Query Service
S3 – Object Storage
Glue – Data Integration Service
Options for APIs
Client > API Gateway HTTP > Things
Client > API Gateway REST > Things
Client > AppSync GraphQL > Things
Client > Application Load Balancer > Lambda
Client > Lambda Function URLs
Client > CloudFront (Authorizer) > Lambda
Client > AWS IoT
Options to call AWS services w/o Lambda
APIs
API Gateway > AWS Services
AppSync > GraphQL > Resolvers > AWS Services
Event
Step Functions > AWS Services
EventBridge
API Gateway Integrations
AWS
Service
Lambda
AWS Proxy
Service
Lambda
HTTP
HTTP Proxy
Mock
AppSync Resolvers
DynamoDB
RDS
OpenSearch
Lambda
HTTP
Sync versus Async
Can the payload fit in the size/time constraints
What is the impact to the client?
Step Functions Optimized Integrations
Lambda
Batch
DynamoDB
ECS/Fargate
SNS
SQS
Glue, DataBrew
SageMaker
EMR
CodeBuild
Athena
EKS
API Gateway
EventBridge
Step Functions
HTTP Destinations (New) - https://aws.amazon.com/blogs/aws/external-endpoints-and-testing-of-task-states-now-available-in-aws-step-functions/
Bedrock (New)- https://aws.amazon.com/about-aws/whats-new/2023/11/aws-step-functions-optimized-integration-bedrock/
Options for Event Buses/Messaging/Queuing
DynamoDB > Triggers
CloudWatch Logs > Metrics > Alarms / Lambda
CloudWatch Metrics > Destination
Kinesis > Lambda
Event Bridge (DLQ Support) > Lambda
SQS (DLQ Support) > Lambda
SNS (DLQ Support) > Lambda
(DLQ Support) Lambda
Twitter @radzikowski_m
Serverless Data Stores - The Easy Button
S3 Query – Query objects in S3, through S3
Athena (and S3 and Glue) – Query objects in S3, Presto
AppFlow – Data Integration Platform
Profiles
Wisdom
Tasks
Serverless Data Stores
DynamoDB – Key/Value
Timescale – Time Series
Keyspaces – Cassandra
QLDB – Ledger
Aurora – Relational
Prometheus – Prometheus
Grafana – Grafana
MWAA – Airflow
General Considerations
Multi-Region? Single-Region? Which Region(s)?
Which Services?
What will they cost? How are they metered/billed?
How far do we need to scale?
What compliance requirements do we need to meet?
What tools do we have in our reach? (Frameworks, Patterns,
etc.)
API Gateway
Development (Isolation, Stages, SAM)
Client Security (Certificates, API Keys, Authorizers)
Gateway Security (WAF, Throttling)
Endpoint Type (Edge optimized, Regional, Private, API Cache)
Integration (Methods, Proxy, Response Codes)
Operationalization (CloudWatch Logs, CloudWatch Metrics,
Access Logging, X-Ray
Testing (Direct, PostMan)
Lambda
Runtime
Pre-Warming
Sizing/Timeouts
Development (Isolation, Versions, SAM, Cloud9, Parameterization)
Integration (Methods, Response Codes)
Security (KMS, Execution Role)
Operationalization (CloudWatch Logs, CloudWatch Metrics, X-Ray)
Testing (Direct)
“The Rest”
Development (Coding Best Practices, Runtime, RDBMS, DevOps)
Data Stores that are not Serverless (Sizing, CloudWatch, Logs, Events,
Backup/Recovery, Multi-AZ, Database “Stuff”)
Trade-off
VPC (Public Subnets, Private Subnets, Security Groups)
Typical of Legacy Integrations, Non-Serverless Data Stores, etc.
General (What are all of the things we need to think about when we create a
new AWS account?)
“Landing Zone”
Conclusion
AWS continues to increase the breadth and depth of their service offerings
I wish it did that
I didn’t know I needed that
It’s easier to get started today than it was yesterday
Simplicity
Support
Cost
Lessons Learned
Regional Availability
Flexibility of implementation to change FMs (or even support custom FMs) and tune FM specific parameters
Conclusion
Generative AI and API Access to Generative AI services (like Bedrock) can be an easy button
Not an end all – value can be found in context, which takes us back to needing a strong data foundation
Priorities are still priorities – customers don’t care about Generative AI if your customers have needs unfulfilled by the product or by Generative AI
Customers may also need to be led to it – if the customer isn’t asking, pushing it on them won’t help – they need education
Consider sustainability when choosing an approach – Maslow’s Hammer
Don’t forget about team enablement
Limited by your imagination and ability to execute
References
https://docs.aws.amazon.com/wellarchitected/latest/serverless-applications-lens/wellarchitected-
serverless-applications-lens.pdf – Well Architected Serverless Application Lens
https://docs.aws.amazon.com/apigateway/latest/developerguide/getting-started-aws-proxy.html – API
Gateway Service Proxy Example
https://docs.aws.amazon.com/apigateway/latest/developerguide/websocket-api-chat-app.html – API
Gateway Websocket Example
https://docs.aws.amazon.com/appsync/latest/devguide/tutorials.html – AppSync Tutorials
https://docs.aws.amazon.com/appsync/latest/devguide/tutorial-dynamodb-resolvers.html – AppSync
Tutorial DynamoDB Resolver
https://docs.aws.amazon.com/lambda/latest/dg/lambda-urls.html – Lambda URLS
https://docs.aws.amazon.com/step-functions/latest/dg/connect-supported-services.html – Step Functions
Supported Services
https://docs.aws.amazon.com/step-functions/latest/dg/sample-athena-query.html – Step Functions
Athena Query
0800-860-2040
sales-latam@cloudhesive.com
cloudhesive.com
Fort Lauderdale
2419 E. Commercial Blvd, Ste. 300
Ft. Lauderdale, Florida
USA
Buenos Aires
Av. Del Libertador 6680, Piso 6
CABA, Ciudad de Buenos Aires
Argentina
Santiago de Chile
Cerro El Plomo 5420 SB1, Oficina 15
Nueva Las Condes, Santiago de Chile
Chile

More Related Content

Similar to Serverless Generative AI on AWS, AWS User Groups of Florida

How to build and deploy serverless apps - AWS Summit Cape Town 2018
How to build and deploy serverless apps - AWS Summit Cape Town 2018How to build and deploy serverless apps - AWS Summit Cape Town 2018
How to build and deploy serverless apps - AWS Summit Cape Town 2018Amazon Web Services
 
Deep Dive on Serverless Web Applications - AWS May 2016 Webinar Series
Deep Dive on Serverless Web Applications - AWS May 2016 Webinar SeriesDeep Dive on Serverless Web Applications - AWS May 2016 Webinar Series
Deep Dive on Serverless Web Applications - AWS May 2016 Webinar SeriesAmazon Web Services
 
SRV203 Getting Started with AWS Lambda and the Serverless Cloud
SRV203 Getting Started with AWS Lambda and the Serverless CloudSRV203 Getting Started with AWS Lambda and the Serverless Cloud
SRV203 Getting Started with AWS Lambda and the Serverless CloudAmazon Web Services
 
Getting Started with AWS Lambda and Serverless Computing
Getting Started with AWS Lambda and Serverless ComputingGetting Started with AWS Lambda and Serverless Computing
Getting Started with AWS Lambda and Serverless ComputingKristana Kane
 
Raleigh DevDay 2017: Build a serverless web application in one day workshop
Raleigh DevDay 2017: Build a serverless web application in one day workshopRaleigh DevDay 2017: Build a serverless web application in one day workshop
Raleigh DevDay 2017: Build a serverless web application in one day workshopAmazon Web Services
 
Primeros pasos en desarrollo serverless
Primeros pasos en desarrollo serverlessPrimeros pasos en desarrollo serverless
Primeros pasos en desarrollo serverlessjavier ramirez
 
Build and run applications without thinking about servers
Build and run applications without thinking about serversBuild and run applications without thinking about servers
Build and run applications without thinking about serversAmazon Web Services
 
Testing of Serverless Application on Amazon WebService Cloud
Testing of Serverless Application on Amazon WebService CloudTesting of Serverless Application on Amazon WebService Cloud
Testing of Serverless Application on Amazon WebService CloudRustam Zeynalov
 
DPD:AWS Developer Training
DPD:AWS Developer TrainingDPD:AWS Developer Training
DPD:AWS Developer TrainingJosh Curtis
 
20180111 we bde-bs - serverless url shortener
20180111   we bde-bs - serverless url shortener20180111   we bde-bs - serverless url shortener
20180111 we bde-bs - serverless url shortenerLuca Bianchi
 
Bluesoft @ AWS re:Invent 2017 + AWS 101
Bluesoft @ AWS re:Invent 2017 + AWS 101Bluesoft @ AWS re:Invent 2017 + AWS 101
Bluesoft @ AWS re:Invent 2017 + AWS 101André Faria Gomes
 
AWS Chicago user group: AWS Platform for .NET Developers
AWS Chicago user group: AWS Platform for .NET DevelopersAWS Chicago user group: AWS Platform for .NET Developers
AWS Chicago user group: AWS Platform for .NET DevelopersAWS Chicago
 
So You Think You're an AWS Master aka Serverless Computing
So You Think You're an AWS Master aka Serverless ComputingSo You Think You're an AWS Master aka Serverless Computing
So You Think You're an AWS Master aka Serverless ComputingAmazon Web Services
 
Connect and Interconnect – The Mesh of Event-Driven Compute and Marvelous Vir...
Connect and Interconnect – The Mesh of Event-Driven Compute and Marvelous Vir...Connect and Interconnect – The Mesh of Event-Driven Compute and Marvelous Vir...
Connect and Interconnect – The Mesh of Event-Driven Compute and Marvelous Vir...Amazon Web Services
 
Cost Optimization for Microsoft Workloads on AWS - AWS Transformation Day: Sa...
Cost Optimization for Microsoft Workloads on AWS - AWS Transformation Day: Sa...Cost Optimization for Microsoft Workloads on AWS - AWS Transformation Day: Sa...
Cost Optimization for Microsoft Workloads on AWS - AWS Transformation Day: Sa...Amazon Web Services
 
Hong Kong AWS Summit 2017 - Keynote
Hong Kong AWS Summit 2017 - KeynoteHong Kong AWS Summit 2017 - Keynote
Hong Kong AWS Summit 2017 - KeynoteAmazon Web Services
 
Accelerate your Cloud Success with Platform Services
Accelerate your Cloud Success with Platform ServicesAccelerate your Cloud Success with Platform Services
Accelerate your Cloud Success with Platform ServicesAmazon Web Services
 
SMC301 The State of Serverless Computing
SMC301 The State of Serverless ComputingSMC301 The State of Serverless Computing
SMC301 The State of Serverless ComputingAmazon Web Services
 

Similar to Serverless Generative AI on AWS, AWS User Groups of Florida (20)

How to build and deploy serverless apps - AWS Summit Cape Town 2018
How to build and deploy serverless apps - AWS Summit Cape Town 2018How to build and deploy serverless apps - AWS Summit Cape Town 2018
How to build and deploy serverless apps - AWS Summit Cape Town 2018
 
Deep Dive on Serverless Web Applications - AWS May 2016 Webinar Series
Deep Dive on Serverless Web Applications - AWS May 2016 Webinar SeriesDeep Dive on Serverless Web Applications - AWS May 2016 Webinar Series
Deep Dive on Serverless Web Applications - AWS May 2016 Webinar Series
 
SRV203 Getting Started with AWS Lambda and the Serverless Cloud
SRV203 Getting Started with AWS Lambda and the Serverless CloudSRV203 Getting Started with AWS Lambda and the Serverless Cloud
SRV203 Getting Started with AWS Lambda and the Serverless Cloud
 
Getting Started with AWS Lambda and Serverless Computing
Getting Started with AWS Lambda and Serverless ComputingGetting Started with AWS Lambda and Serverless Computing
Getting Started with AWS Lambda and Serverless Computing
 
Raleigh DevDay 2017: Build a serverless web application in one day workshop
Raleigh DevDay 2017: Build a serverless web application in one day workshopRaleigh DevDay 2017: Build a serverless web application in one day workshop
Raleigh DevDay 2017: Build a serverless web application in one day workshop
 
Media Workloads on AWS
Media Workloads on AWSMedia Workloads on AWS
Media Workloads on AWS
 
Primeros pasos en desarrollo serverless
Primeros pasos en desarrollo serverlessPrimeros pasos en desarrollo serverless
Primeros pasos en desarrollo serverless
 
Build and run applications without thinking about servers
Build and run applications without thinking about serversBuild and run applications without thinking about servers
Build and run applications without thinking about servers
 
Testing of Serverless Application on Amazon WebService Cloud
Testing of Serverless Application on Amazon WebService CloudTesting of Serverless Application on Amazon WebService Cloud
Testing of Serverless Application on Amazon WebService Cloud
 
DPD:AWS Developer Training
DPD:AWS Developer TrainingDPD:AWS Developer Training
DPD:AWS Developer Training
 
20180111 we bde-bs - serverless url shortener
20180111   we bde-bs - serverless url shortener20180111   we bde-bs - serverless url shortener
20180111 we bde-bs - serverless url shortener
 
Bluesoft @ AWS re:Invent 2017 + AWS 101
Bluesoft @ AWS re:Invent 2017 + AWS 101Bluesoft @ AWS re:Invent 2017 + AWS 101
Bluesoft @ AWS re:Invent 2017 + AWS 101
 
AWS Chicago user group: AWS Platform for .NET Developers
AWS Chicago user group: AWS Platform for .NET DevelopersAWS Chicago user group: AWS Platform for .NET Developers
AWS Chicago user group: AWS Platform for .NET Developers
 
So You Think You're an AWS Master aka Serverless Computing
So You Think You're an AWS Master aka Serverless ComputingSo You Think You're an AWS Master aka Serverless Computing
So You Think You're an AWS Master aka Serverless Computing
 
Connect and Interconnect – The Mesh of Event-Driven Compute and Marvelous Vir...
Connect and Interconnect – The Mesh of Event-Driven Compute and Marvelous Vir...Connect and Interconnect – The Mesh of Event-Driven Compute and Marvelous Vir...
Connect and Interconnect – The Mesh of Event-Driven Compute and Marvelous Vir...
 
Cost Optimization for Microsoft Workloads on AWS - AWS Transformation Day: Sa...
Cost Optimization for Microsoft Workloads on AWS - AWS Transformation Day: Sa...Cost Optimization for Microsoft Workloads on AWS - AWS Transformation Day: Sa...
Cost Optimization for Microsoft Workloads on AWS - AWS Transformation Day: Sa...
 
Hong Kong AWS Summit 2017 - Keynote
Hong Kong AWS Summit 2017 - KeynoteHong Kong AWS Summit 2017 - Keynote
Hong Kong AWS Summit 2017 - Keynote
 
Reinvent recap
Reinvent recapReinvent recap
Reinvent recap
 
Accelerate your Cloud Success with Platform Services
Accelerate your Cloud Success with Platform ServicesAccelerate your Cloud Success with Platform Services
Accelerate your Cloud Success with Platform Services
 
SMC301 The State of Serverless Computing
SMC301 The State of Serverless ComputingSMC301 The State of Serverless Computing
SMC301 The State of Serverless Computing
 

More from CloudHesive

Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...
Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...
Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...CloudHesive
 
Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...
Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...
Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...CloudHesive
 
Accelerating Business and Research Through Automation and Artificial Intellig...
Accelerating Business and Research Through Automation and Artificial Intellig...Accelerating Business and Research Through Automation and Artificial Intellig...
Accelerating Business and Research Through Automation and Artificial Intellig...CloudHesive
 
Amazon Connect Rethink Your Contact Center with CloudHesive.pptx
Amazon Connect Rethink Your Contact Center with CloudHesive.pptxAmazon Connect Rethink Your Contact Center with CloudHesive.pptx
Amazon Connect Rethink Your Contact Center with CloudHesive.pptxCloudHesive
 
ConnectPath Introduction
ConnectPath IntroductionConnectPath Introduction
ConnectPath IntroductionCloudHesive
 
Modernize your contact center with ConnectPath CX v2.pdf
Modernize your contact center with ConnectPath CX v2.pdfModernize your contact center with ConnectPath CX v2.pdf
Modernize your contact center with ConnectPath CX v2.pdfCloudHesive
 
Modernize your contact center with ConnectPath CX — Chart.pdf
Modernize your contact center with ConnectPath CX — Chart.pdfModernize your contact center with ConnectPath CX — Chart.pdf
Modernize your contact center with ConnectPath CX — Chart.pdfCloudHesive
 
End User Computing at CloudHesive.pptx
End User Computing at CloudHesive.pptxEnd User Computing at CloudHesive.pptx
End User Computing at CloudHesive.pptxCloudHesive
 
Analytics at CloudHesive
Analytics at CloudHesiveAnalytics at CloudHesive
Analytics at CloudHesiveCloudHesive
 
Supporting your CMMC initiatives with Sumo Logic
Supporting your CMMC initiatives with Sumo LogicSupporting your CMMC initiatives with Sumo Logic
Supporting your CMMC initiatives with Sumo LogicCloudHesive
 
Best Practices and Resources to Effectively Manage and Optimize Your AWS Costs
Best Practices and Resources to Effectively Manage and Optimize Your AWS CostsBest Practices and Resources to Effectively Manage and Optimize Your AWS Costs
Best Practices and Resources to Effectively Manage and Optimize Your AWS CostsCloudHesive
 
Serverless data and analytics on AWS for operations
Serverless data and analytics on AWS for operations Serverless data and analytics on AWS for operations
Serverless data and analytics on AWS for operations CloudHesive
 
reInvent reCap 2022
reInvent reCap 2022reInvent reCap 2022
reInvent reCap 2022CloudHesive
 
Serverless without Code (Lambda)
Serverless without Code (Lambda)Serverless without Code (Lambda)
Serverless without Code (Lambda)CloudHesive
 
AWS Advanced Analytics Automation Toolkit (AAA)
AWS Advanced Analytics Automation Toolkit (AAA)AWS Advanced Analytics Automation Toolkit (AAA)
AWS Advanced Analytics Automation Toolkit (AAA)CloudHesive
 
AWS Control Tower
AWS Control TowerAWS Control Tower
AWS Control TowerCloudHesive
 
Security on AWS, 2021 Edition Meetup
Security on AWS, 2021 Edition MeetupSecurity on AWS, 2021 Edition Meetup
Security on AWS, 2021 Edition MeetupCloudHesive
 
5 minutes on security
5 minutes on security5 minutes on security
5 minutes on securityCloudHesive
 
NIST Cybersecurity Framework (CSF) on the Public Cloud
NIST Cybersecurity Framework (CSF) on the Public CloudNIST Cybersecurity Framework (CSF) on the Public Cloud
NIST Cybersecurity Framework (CSF) on the Public CloudCloudHesive
 

More from CloudHesive (20)

Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...
Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...
Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...
 
Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...
Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...
Amazon Connect & AI - Shaping the Future of Customer Interactions - GenAI and...
 
Accelerating Business and Research Through Automation and Artificial Intellig...
Accelerating Business and Research Through Automation and Artificial Intellig...Accelerating Business and Research Through Automation and Artificial Intellig...
Accelerating Business and Research Through Automation and Artificial Intellig...
 
Amazon Connect Rethink Your Contact Center with CloudHesive.pptx
Amazon Connect Rethink Your Contact Center with CloudHesive.pptxAmazon Connect Rethink Your Contact Center with CloudHesive.pptx
Amazon Connect Rethink Your Contact Center with CloudHesive.pptx
 
ConnectPath Introduction
ConnectPath IntroductionConnectPath Introduction
ConnectPath Introduction
 
Modernize your contact center with ConnectPath CX v2.pdf
Modernize your contact center with ConnectPath CX v2.pdfModernize your contact center with ConnectPath CX v2.pdf
Modernize your contact center with ConnectPath CX v2.pdf
 
Modernize your contact center with ConnectPath CX — Chart.pdf
Modernize your contact center with ConnectPath CX — Chart.pdfModernize your contact center with ConnectPath CX — Chart.pdf
Modernize your contact center with ConnectPath CX — Chart.pdf
 
End User Computing at CloudHesive.pptx
End User Computing at CloudHesive.pptxEnd User Computing at CloudHesive.pptx
End User Computing at CloudHesive.pptx
 
Analytics at CloudHesive
Analytics at CloudHesiveAnalytics at CloudHesive
Analytics at CloudHesive
 
Supporting your CMMC initiatives with Sumo Logic
Supporting your CMMC initiatives with Sumo LogicSupporting your CMMC initiatives with Sumo Logic
Supporting your CMMC initiatives with Sumo Logic
 
Best Practices and Resources to Effectively Manage and Optimize Your AWS Costs
Best Practices and Resources to Effectively Manage and Optimize Your AWS CostsBest Practices and Resources to Effectively Manage and Optimize Your AWS Costs
Best Practices and Resources to Effectively Manage and Optimize Your AWS Costs
 
Serverless data and analytics on AWS for operations
Serverless data and analytics on AWS for operations Serverless data and analytics on AWS for operations
Serverless data and analytics on AWS for operations
 
reInvent reCap 2022
reInvent reCap 2022reInvent reCap 2022
reInvent reCap 2022
 
Serverless without Code (Lambda)
Serverless without Code (Lambda)Serverless without Code (Lambda)
Serverless without Code (Lambda)
 
AWS Advanced Analytics Automation Toolkit (AAA)
AWS Advanced Analytics Automation Toolkit (AAA)AWS Advanced Analytics Automation Toolkit (AAA)
AWS Advanced Analytics Automation Toolkit (AAA)
 
AWS Control Tower
AWS Control TowerAWS Control Tower
AWS Control Tower
 
Security on AWS, 2021 Edition Meetup
Security on AWS, 2021 Edition MeetupSecurity on AWS, 2021 Edition Meetup
Security on AWS, 2021 Edition Meetup
 
Security on AWS
Security on AWSSecurity on AWS
Security on AWS
 
5 minutes on security
5 minutes on security5 minutes on security
5 minutes on security
 
NIST Cybersecurity Framework (CSF) on the Public Cloud
NIST Cybersecurity Framework (CSF) on the Public CloudNIST Cybersecurity Framework (CSF) on the Public Cloud
NIST Cybersecurity Framework (CSF) on the Public Cloud
 

Recently uploaded

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 

Recently uploaded (20)

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 

Serverless Generative AI on AWS, AWS User Groups of Florida

  • 1. Serverless Generative AI AWS User Groups of Florida Fort Lauderdale, FL, USA February 27th, 2024 Patrick Hannah CTO CloudHesive
  • 2. AWS User Groups of Florida – Updates We are back to In-Person Meetups and working towards a monthly cadence Always open to ideas on how we can improve the content and format! Collaborate with us after the MeetUp! Future MeetUps – Presenters? Topics? Formats? Slideshare – Keep an eye on our MeetUp Page – we will post a link to the Slides Slack – Keep the conversation going Today’s MeetUp Format Feel free to ask questions throughout the session! Dedicated Q&A at the end
  • 3. Topic In this session, I will unravel the complexities of serverless generative AI, offering insights into its architecture, applications, and potential impact on businesses across various industries. Whether you're a seasoned AWS practitioner or just starting your journey into cloud computing, this presentation promises to broaden your horizons and spark new ideas.
  • 4. Inspiration “I'm wondering if there is a feature request to create something like a saved query in Athena that can be executed via a CloudWatch Event?” The AWS Step Functions service integration with Amazon Athena enables you to use Step Functions to start and stop query execution, and get query results AWS User Groups of Florida MeetUp - AWS API Architectures - Scott Hendrickson, Partner Solutions Architect, AWS Data sources and resolvers are how AWS AppSync translates GraphQL requests and fetches information from your AWS resources AWS Well Architected Framework Serverless Application Lens If your Lambda function is not performing custom logic while integrating with other AWS services, chances are that it may be unnecessary
  • 5. Who doesn’t like connecting things together?
  • 6. Compute’s Transition to Serverless Compute - EC2 Bare Metal (Intel, AMD, Graviton, M1) Compute - EC2 Virtual > Bare Metal (Xen, KVM/Nitro) Containers - Fargate > ContainderD (was DockerD) > EC2 Serverless - Lambda > Firecracker (Micro VM) > EC2
  • 7. Serverless’ Flavors High Level Abstractions SaaS (Connect) Hybrid Abstractions PaaS (DynamoDB) Low Level Abstractions IaaS (Lambda)
  • 8. Service Categories Analytics Application Integration AR & VR AWS Cost Management Blockchain Business Applications Compute Customer Engagement Database Developer Tools End User Computing Game Tech Internet of Things Machine Learning Management & Governance Media Services Migration & Transfer Mobile Networking & Content Delivery Quantum Technologies Robotics Satellite Security, Identity, & Compliance Storage
  • 9. Workload Personas Migrated Server Based Migrated & Optimized Blends of Server and Service Based Serverless/Native Service Based Orchestrated ECS, EKS, K8s Inherited Wildcard! Hybrid Wildcard!
  • 10. Well Architected Framework Operational Excellence Security Reliability Performance Efficiency Cost Optimization Sustainability
  • 11. Cloud Workload Lifecycle Management Workload Architecture Monitoring Automation Processes Integration
  • 12. Workload + Architecture Drives Service Selection Containers Container File Versioning Multi-threaded/Single-task Minutes to Days Per VM/Per Hour Virtual Machines AMI Patching Multi-threaded/Multi-task Hours to Months Per VM/Per Hour Functions/Services Code Versioning Single-threaded/Single-task Microseconds to Seconds Per Memory/Second/Per Request
  • 13. Automation + Processes Drives Lifecycle Management Selection Organizations Cross-Account Asset Management + Governance Control Tower Account vending/default standardization Service Catalog Workload platform vending/default standardization CloudFormation IaC Ephemeral Compute + API Managed Data/Control Plane for Persistence Tiers Hands off/Lights out
  • 14. Processes Patching Backup/Restore Testing Failover Testing (AZ) Credential Rotation/Credential Audit Event Response Testing Incident Response Testing Performance Testing Performance/Cost Review Vulnerability/Penetration Testing
  • 17. Generative AI in the context of AWS Amazon Bedrock Amazon SageMaker, Studio and Canvas (and Redshift Inferences) NVIDIA GPU-powered Amazon EC2 instances AWS Tranium AWS Inferentia Amazon EC2 UltraClusters Amazon Q: Business, AWS, QuickSight, Connect, Supply Chain, Code Catalyst, IDE, Code Transformation, Query Editor (Redshift) PartyRock AWS CodeWhisperer AWS HealthScribe
  • 18. Generative AI in the context of AWS Services that accelerate development for AWS Services that are powered by it – No-code data connectors/Zero ETL, Instance Selection, Console to Code (and AppComposer), Natural Language Querying, Code Scanning, Datazone (Descriptions) Services that accelerate development for you – Lex (Conversational FAQ, Slot Resolution, Bot builder, Utterance Generator), Personalize (Themes), Transcribe (Summarization) Services improved by it – Alexa
  • 19. Rationalization Why Serverless – how does serverless change how we incept, launch, and iterate product? Why GenAI – how does Generative AI change how we think about solving problems with data?
  • 21. Bedrock Operationalization Non-functional Regional Considerations FM Subscription Throughput/Quotas Security Operational Monitoring Traffic Flow (Private Link) Functional Prompt Engineering Tokens Model Parameters Inference Parameters Sessions
  • 22.
  • 23.
  • 24. Databases that can be used to store Vector Embeddings OpenSearch/Serverless Redis Enterprise and MemoryDB Pinecone Aurora (Postgres) RDS (Postgres) MongoDB DocumentDB Neptune
  • 25. Machine Learning Amazon Augmented AI - Easily implement human review of machine learning predictions Amazon CodeGuru - Intelligent recommendations for building and running modern applications Amazon Comprehend - Analyze Unstructured Text Amazon Comprehend Medical - Amazon Comprehend Medical uses machine learning to extract insights and relationships from medical text. AWS DeepComposer - AWS DeepComposer allows developers of all skill levels to get started with Generative AI. AWS DeepLens - Deep Learning Enabled Video Camera AWS DeepRacer - Fully autonomous 1/18th scale race car, driven by machine learning Amazon DevOps Guru - ML-powered cloud operations service to improve application availability. Amazon Forecast - Amazon Forecast is a fully-managed service for accurate time-series forecasting Amazon Fraud Detector - Detect more online fraud faster using machine learning Amazon HealthLake - Making sense of health data Amazon Kendra - Highly accurate enterprise search service powered by machine learning AWS HealthImaging Amazon Lex - Build Voice and Text Chatbots Amazon Lookout for Equipment - Detect abnormal equipment behavior by analyzing sensor data Amazon Lookout for Metrics - Accurately detect anomalies in your business metrics and quickly understand why Amazon Lookout for Vision - Identify defects using computer vision to automate quality inspection. Amazon Monitron - End-to-end system for equipment monitoring Amazon Omics - Transform omics data into insights. AWS Panorama - Enabling computer vision applications at the edge Amazon Personalize - Amazon Personalize helps you easily add real-time recommendations to your apps Amazon Polly - Turn Text into Lifelike Speech Amazon Rekognition - Search and Analyze Images Amazon SageMaker - Build, Train, and Deploy Machine Learning Models Amazon Textract - Easily extract text and data from virtually any document Amazon Transcribe - Powerful Speech Recognition Amazon Translate - Powerful Neural Machine Translation Amazon Bedrock
  • 26.
  • 27. Primary Services API Tier API Gateway – API Management AppSync – GraphQL API Application (Execution)/Code Tier Lambda – Serverless Compute Data Store Tier DynamoDB – Key/Value Data Base Service Tier Event Bridge/Step Functions – Event Bus, Low Code/No Code Workflow Athena – Interactive Query Service S3 – Object Storage Glue – Data Integration Service
  • 28. Options for APIs Client > API Gateway HTTP > Things Client > API Gateway REST > Things Client > AppSync GraphQL > Things Client > Application Load Balancer > Lambda Client > Lambda Function URLs Client > CloudFront (Authorizer) > Lambda Client > AWS IoT
  • 29. Options to call AWS services w/o Lambda APIs API Gateway > AWS Services AppSync > GraphQL > Resolvers > AWS Services Event Step Functions > AWS Services EventBridge
  • 30. API Gateway Integrations AWS Service Lambda AWS Proxy Service Lambda HTTP HTTP Proxy Mock
  • 32. Sync versus Async Can the payload fit in the size/time constraints What is the impact to the client?
  • 33. Step Functions Optimized Integrations Lambda Batch DynamoDB ECS/Fargate SNS SQS Glue, DataBrew SageMaker EMR CodeBuild Athena EKS API Gateway EventBridge Step Functions HTTP Destinations (New) - https://aws.amazon.com/blogs/aws/external-endpoints-and-testing-of-task-states-now-available-in-aws-step-functions/ Bedrock (New)- https://aws.amazon.com/about-aws/whats-new/2023/11/aws-step-functions-optimized-integration-bedrock/
  • 34. Options for Event Buses/Messaging/Queuing DynamoDB > Triggers CloudWatch Logs > Metrics > Alarms / Lambda CloudWatch Metrics > Destination Kinesis > Lambda Event Bridge (DLQ Support) > Lambda SQS (DLQ Support) > Lambda SNS (DLQ Support) > Lambda (DLQ Support) Lambda
  • 36. Serverless Data Stores - The Easy Button S3 Query – Query objects in S3, through S3 Athena (and S3 and Glue) – Query objects in S3, Presto AppFlow – Data Integration Platform Profiles Wisdom Tasks
  • 37. Serverless Data Stores DynamoDB – Key/Value Timescale – Time Series Keyspaces – Cassandra QLDB – Ledger Aurora – Relational Prometheus – Prometheus Grafana – Grafana MWAA – Airflow
  • 38. General Considerations Multi-Region? Single-Region? Which Region(s)? Which Services? What will they cost? How are they metered/billed? How far do we need to scale? What compliance requirements do we need to meet? What tools do we have in our reach? (Frameworks, Patterns, etc.)
  • 39. API Gateway Development (Isolation, Stages, SAM) Client Security (Certificates, API Keys, Authorizers) Gateway Security (WAF, Throttling) Endpoint Type (Edge optimized, Regional, Private, API Cache) Integration (Methods, Proxy, Response Codes) Operationalization (CloudWatch Logs, CloudWatch Metrics, Access Logging, X-Ray Testing (Direct, PostMan)
  • 40. Lambda Runtime Pre-Warming Sizing/Timeouts Development (Isolation, Versions, SAM, Cloud9, Parameterization) Integration (Methods, Response Codes) Security (KMS, Execution Role) Operationalization (CloudWatch Logs, CloudWatch Metrics, X-Ray) Testing (Direct)
  • 41. “The Rest” Development (Coding Best Practices, Runtime, RDBMS, DevOps) Data Stores that are not Serverless (Sizing, CloudWatch, Logs, Events, Backup/Recovery, Multi-AZ, Database “Stuff”) Trade-off VPC (Public Subnets, Private Subnets, Security Groups) Typical of Legacy Integrations, Non-Serverless Data Stores, etc. General (What are all of the things we need to think about when we create a new AWS account?) “Landing Zone”
  • 42. Conclusion AWS continues to increase the breadth and depth of their service offerings I wish it did that I didn’t know I needed that It’s easier to get started today than it was yesterday Simplicity Support Cost Lessons Learned Regional Availability Flexibility of implementation to change FMs (or even support custom FMs) and tune FM specific parameters Conclusion Generative AI and API Access to Generative AI services (like Bedrock) can be an easy button Not an end all – value can be found in context, which takes us back to needing a strong data foundation Priorities are still priorities – customers don’t care about Generative AI if your customers have needs unfulfilled by the product or by Generative AI Customers may also need to be led to it – if the customer isn’t asking, pushing it on them won’t help – they need education Consider sustainability when choosing an approach – Maslow’s Hammer Don’t forget about team enablement Limited by your imagination and ability to execute
  • 43. References https://docs.aws.amazon.com/wellarchitected/latest/serverless-applications-lens/wellarchitected- serverless-applications-lens.pdf – Well Architected Serverless Application Lens https://docs.aws.amazon.com/apigateway/latest/developerguide/getting-started-aws-proxy.html – API Gateway Service Proxy Example https://docs.aws.amazon.com/apigateway/latest/developerguide/websocket-api-chat-app.html – API Gateway Websocket Example https://docs.aws.amazon.com/appsync/latest/devguide/tutorials.html – AppSync Tutorials https://docs.aws.amazon.com/appsync/latest/devguide/tutorial-dynamodb-resolvers.html – AppSync Tutorial DynamoDB Resolver https://docs.aws.amazon.com/lambda/latest/dg/lambda-urls.html – Lambda URLS https://docs.aws.amazon.com/step-functions/latest/dg/connect-supported-services.html – Step Functions Supported Services https://docs.aws.amazon.com/step-functions/latest/dg/sample-athena-query.html – Step Functions Athena Query
  • 44. 0800-860-2040 sales-latam@cloudhesive.com cloudhesive.com Fort Lauderdale 2419 E. Commercial Blvd, Ste. 300 Ft. Lauderdale, Florida USA Buenos Aires Av. Del Libertador 6680, Piso 6 CABA, Ciudad de Buenos Aires Argentina Santiago de Chile Cerro El Plomo 5420 SB1, Oficina 15 Nueva Las Condes, Santiago de Chile Chile

Editor's Notes

  1. +Websockets