SlideShare a Scribd company logo
1 of 24
Download to read offline
Webinar:
Why a Multi-Cloud Strategy
Matters for Your AI Platform
Tarik Bennett
tarik.bennett@alluxio.com
February 27th, 2023
Senior Solutions Engineer
@ Alluxio
Tarik Bennett
2
Balancing performance, scalability, and cost
Agnostic data layer
Best practices for hybrid and multi-cloud
Agenda
Managing Costs
Cloud Agility or Resource Availability
Training Efficiency
Primary Scenarios Addressed
Source: Gartner 2023
1. By 2028, the adoption of AI will culminate in over
50% of cloud compute resources… up from less
than 10% in 2023.
2. Global spending on public cloud services is forecast to increase 20.4% in 2024… the
source of growth will be combination of cloud vendor price increases and increased
utilization.
3. Deep learning models fed by images, internet-scale applications or even telemetry data
have ever growing data requirements.
AI Adoption is Ballooning Cloud Costs
● Efficient distributed computing
● Workload scheduling
● Modernizing or reducing legacy storage
● Minimizing data movement
● Improving data access
● Increasing scalability
Efficiencies via Platform Improvements
Source: Gartner 2023
According to the survey, almost half (47%) of C-suite
executives don’t feel prepared for the accelerating rate
of technological change.
Further, only 27% claim their organizations are ready to scale up generative AI, and 44% say it
will take more than six months to do so and take advantage of the potential benefits.
Scalability and Cloud Agility
Technical
● Improves scalability
● Enables hybrid cloud
● Expanded access to GPUs
● Best-of-breed AI tools available
Non-Technical
● Leverage in cloud negotiations
● Security and governance, privacy, etc
● Service resilience
● Flexible access to the most
cost-effective resources
Why Multi-Cloud?
Agility Comes with Some Overhead
● Data replication between DCs or regions
Multi-Cloud Challenges
Source: Alluxio
Agility Comes with Some Overhead
● Data replication between DCs or regions
● Disruptive, costly or prolonged migrations to upgrade
HDFS
Object
Store
Multi-Cloud Challenges
Agility Comes with Some Overhead
● Data replication between DCs or regions
● Disruptive, costly or prolonged migrations to upgrade
● Overlapping resources in cloud + on-prem
compute compute compute
Multi-Cloud Challenges
Agility Comes with Some Overhead
● Data replication between DCs or regions
● Disruptive, costly or prolonged migrations to upgrade
● Overlapping resources in cloud + on-prem
● Need to address non-technical requirements within CSPs
Multi-Cloud Challenges
Given Multi-Cloud Benefits for AI, You Can Optimize
● Simplify wherever possible
● Reduce replication wherever possible
● Finding cost efficiencies via caching or other means
● Increase data locality
● Unify data access
● Increase throughput of commodity storage
● Reduce bandwidth congestion
Best Practices
● Multi-Cloud architecture
○ Google Cloud Platform (GCP)
○ Oracle® Cloud Infrastructure (OCI)
● Data orchestration and caching
Uber Multi-Cloud Architecture (Future)
Source: Uber Jing Zhao 2024
Alluxio Intro
16
Alluxio Data Platform
High Performance data access, unified global view
18
Portability via Alluxio Kubernetes Operator
Reduced Data Replication
Source: Alluxio
Some data cannot be persisted in the cloud. Security teams will often
approve ephemeral cache, while other options will be denied.
High Performance Data Access
Sensitive model
training data
Data evicted
from the cache
Benefits of Caching for Sensitive Data
Standalone Cluster
High Performance Data Access Layer
Data from multiple sources served to GPU nodes
Virtual Caching Across Local
GPU Storage
Data source synced to Virtual Alluxio Storage and
shared between GPU nodes
Alluxio Deployment Options for AI
Case Study
BUSINESS BENEFIT:
TECH BENEFIT:
Increase GPU
utilization
50%
93%
File System
Training
Data
Training
Data
M
o
d
e
l
s
Training
Data
Models
Model
Training
Model
Training
Model
Deployment
Model
Inference
Downstream
Applications
Model
Update
Training Clouds Offline Cloud Online Cloud
APAC Quora CASE STUDY:
High Performance AI Platform for LLM
2 - 4X faster
time-to-market
Before Alluxio: (1) Low GPU Utilization, (2) Overloaded Storage, (3) Network Congestion & Slow Model Refresh
Thank You!

More Related Content

Similar to Alluxio Monthly Webinar | Why a Multi-Cloud Strategy Matters for Your AI Platform

Similar to Alluxio Monthly Webinar | Why a Multi-Cloud Strategy Matters for Your AI Platform (20)

Solving enterprise challenges through scale out storage & big compute final
Solving enterprise challenges through scale out storage & big compute finalSolving enterprise challenges through scale out storage & big compute final
Solving enterprise challenges through scale out storage & big compute final
 
Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...
Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...
Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...
 
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
 
Impact of Cloud Computing on IT Infrastructure Support.pdf
Impact of Cloud Computing on IT Infrastructure Support.pdfImpact of Cloud Computing on IT Infrastructure Support.pdf
Impact of Cloud Computing on IT Infrastructure Support.pdf
 
Slides: Accelerating Queries on Cloud Data Lakes
Slides: Accelerating Queries on Cloud Data LakesSlides: Accelerating Queries on Cloud Data Lakes
Slides: Accelerating Queries on Cloud Data Lakes
 
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
 
A Successful Journey to the Cloud with Data Virtualization
A Successful Journey to the Cloud with Data VirtualizationA Successful Journey to the Cloud with Data Virtualization
A Successful Journey to the Cloud with Data Virtualization
 
Data Orchestration for the Hybrid Cloud Era
Data Orchestration for the Hybrid Cloud EraData Orchestration for the Hybrid Cloud Era
Data Orchestration for the Hybrid Cloud Era
 
Fog Computing Platform
Fog Computing PlatformFog Computing Platform
Fog Computing Platform
 
Green Cloud Computing :Emerging Technology
Green Cloud Computing :Emerging TechnologyGreen Cloud Computing :Emerging Technology
Green Cloud Computing :Emerging Technology
 
Equinix microsoft 2019 use case playbook
Equinix microsoft 2019 use case playbookEquinix microsoft 2019 use case playbook
Equinix microsoft 2019 use case playbook
 
Maximize the Capabilities of Oracle® Golden Gate: Replicate Data Bi-Direction...
Maximize the Capabilities of Oracle® Golden Gate: Replicate Data Bi-Direction...Maximize the Capabilities of Oracle® Golden Gate: Replicate Data Bi-Direction...
Maximize the Capabilities of Oracle® Golden Gate: Replicate Data Bi-Direction...
 
Cloud Migration.pdf
Cloud Migration.pdfCloud Migration.pdf
Cloud Migration.pdf
 
Optimizing Your Hybrid IT Strategy
Optimizing Your Hybrid IT StrategyOptimizing Your Hybrid IT Strategy
Optimizing Your Hybrid IT Strategy
 
Coud computing
Coud computingCoud computing
Coud computing
 
Peek into Neo4j Product Strategy and Roadmap
Peek into Neo4j Product Strategy and RoadmapPeek into Neo4j Product Strategy and Roadmap
Peek into Neo4j Product Strategy and Roadmap
 
Big Data Bellevue Meetup | Enhancing Python Data Loading in the Cloud for AI/ML
Big Data Bellevue Meetup | Enhancing Python Data Loading in the Cloud for AI/MLBig Data Bellevue Meetup | Enhancing Python Data Loading in the Cloud for AI/ML
Big Data Bellevue Meetup | Enhancing Python Data Loading in the Cloud for AI/ML
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud Era
 
Conduit - A Lightweight Data Virtualization Tool
Conduit - A Lightweight Data Virtualization ToolConduit - A Lightweight Data Virtualization Tool
Conduit - A Lightweight Data Virtualization Tool
 
Overview of GovCloud Today
Overview of GovCloud TodayOverview of GovCloud Today
Overview of GovCloud Today
 

More from Alluxio, Inc.

More from Alluxio, Inc. (20)

Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud
Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-CloudAlluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud
Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud
 
Optimizing Data Access for Analytics And AI with Alluxio
Optimizing Data Access for Analytics And AI with AlluxioOptimizing Data Access for Analytics And AI with Alluxio
Optimizing Data Access for Analytics And AI with Alluxio
 
Speed Up Presto at Uber with Alluxio Caching
Speed Up Presto at Uber with Alluxio CachingSpeed Up Presto at Uber with Alluxio Caching
Speed Up Presto at Uber with Alluxio Caching
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at Scale
 
Alluxio Monthly Webinar | Five Disruptive Trends that Every Data & AI Leader...
Alluxio Monthly Webinar | Five Disruptive Trends that Every  Data & AI Leader...Alluxio Monthly Webinar | Five Disruptive Trends that Every  Data & AI Leader...
Alluxio Monthly Webinar | Five Disruptive Trends that Every Data & AI Leader...
 
Data Infra Meetup | FIFO Queues are All You Need for Cache Eviction
Data Infra Meetup | FIFO Queues are All You Need for Cache EvictionData Infra Meetup | FIFO Queues are All You Need for Cache Eviction
Data Infra Meetup | FIFO Queues are All You Need for Cache Eviction
 
Data Infra Meetup | Accelerate Your Trino/Presto Queries - Gain the Alluxio Edge
Data Infra Meetup | Accelerate Your Trino/Presto Queries - Gain the Alluxio EdgeData Infra Meetup | Accelerate Your Trino/Presto Queries - Gain the Alluxio Edge
Data Infra Meetup | Accelerate Your Trino/Presto Queries - Gain the Alluxio Edge
 
Data Infra Meetup | Accelerate Distributed PyTorch/Ray Workloads in the Cloud
Data Infra Meetup | Accelerate Distributed PyTorch/Ray Workloads in the CloudData Infra Meetup | Accelerate Distributed PyTorch/Ray Workloads in the Cloud
Data Infra Meetup | Accelerate Distributed PyTorch/Ray Workloads in the Cloud
 
Data Infra Meetup | ByteDance's Native Parquet Reader
Data Infra Meetup | ByteDance's Native Parquet ReaderData Infra Meetup | ByteDance's Native Parquet Reader
Data Infra Meetup | ByteDance's Native Parquet Reader
 
Data Infra Meetup | Uber's Data Storage Evolution
Data Infra Meetup | Uber's Data Storage EvolutionData Infra Meetup | Uber's Data Storage Evolution
Data Infra Meetup | Uber's Data Storage Evolution
 
Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...
Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...
Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...
 
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...
 
AI Infra Day | The AI Infra in the Generative AI Era
AI Infra Day | The AI Infra in the Generative AI EraAI Infra Day | The AI Infra in the Generative AI Era
AI Infra Day | The AI Infra in the Generative AI Era
 
AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kube...
AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kube...AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kube...
AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kube...
 
AI Infra Day | The Generative AI Market And Intel AI Strategy and Product Up...
AI Infra Day | The Generative AI Market  And Intel AI Strategy and Product Up...AI Infra Day | The Generative AI Market  And Intel AI Strategy and Product Up...
AI Infra Day | The Generative AI Market And Intel AI Strategy and Product Up...
 
AI Infra Day | Composable PyTorch Distributed with PT2 @ Meta
AI Infra Day | Composable PyTorch Distributed with PT2 @ MetaAI Infra Day | Composable PyTorch Distributed with PT2 @ Meta
AI Infra Day | Composable PyTorch Distributed with PT2 @ Meta
 
AI Infra Day | Model Lifecycle Management Quality Assurance at Uber Scale
AI Infra Day | Model Lifecycle Management Quality Assurance at Uber ScaleAI Infra Day | Model Lifecycle Management Quality Assurance at Uber Scale
AI Infra Day | Model Lifecycle Management Quality Assurance at Uber Scale
 
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWS
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWSAlluxio Monthly Webinar | Efficient Data Loading for Model Training on AWS
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWS
 
Alluxio + Eckerson Webinar | Simplifying and Accelerating Data Access for AI/...
Alluxio + Eckerson Webinar | Simplifying and Accelerating Data Access for AI/...Alluxio + Eckerson Webinar | Simplifying and Accelerating Data Access for AI/...
Alluxio + Eckerson Webinar | Simplifying and Accelerating Data Access for AI/...
 
Alluxio Monthly Webinar - Accelerate AI Path to Production
Alluxio Monthly Webinar - Accelerate AI Path to ProductionAlluxio Monthly Webinar - Accelerate AI Path to Production
Alluxio Monthly Webinar - Accelerate AI Path to Production
 

Recently uploaded

Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
VictoriaMetrics
 

Recently uploaded (20)

Evolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI EraEvolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI Era
 
What Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationWhat Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the Situation
 
WSO2Con2024 - Software Delivery in Hybrid Environments
WSO2Con2024 - Software Delivery in Hybrid EnvironmentsWSO2Con2024 - Software Delivery in Hybrid Environments
WSO2Con2024 - Software Delivery in Hybrid Environments
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
 
WSO2Con2024 - Unleashing the Financial Potential of 13 Million People
WSO2Con2024 - Unleashing the Financial Potential of 13 Million PeopleWSO2Con2024 - Unleashing the Financial Potential of 13 Million People
WSO2Con2024 - Unleashing the Financial Potential of 13 Million People
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
WSO2CON 2024 - Software Engineering for Digital Businesses
WSO2CON 2024 - Software Engineering for Digital BusinessesWSO2CON 2024 - Software Engineering for Digital Businesses
WSO2CON 2024 - Software Engineering for Digital Businesses
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!
 
Driving Innovation: Scania's API Revolution with WSO2
Driving Innovation: Scania's API Revolution with WSO2Driving Innovation: Scania's API Revolution with WSO2
Driving Innovation: Scania's API Revolution with WSO2
 
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
 
WSO2Con2024 - From Blueprint to Brilliance: WSO2's Guide to API-First Enginee...
WSO2Con2024 - From Blueprint to Brilliance: WSO2's Guide to API-First Enginee...WSO2Con2024 - From Blueprint to Brilliance: WSO2's Guide to API-First Enginee...
WSO2Con2024 - From Blueprint to Brilliance: WSO2's Guide to API-First Enginee...
 
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...
 
Artyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxArtyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptx
 
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
 
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
 
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
 
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
 
WSO2CON2024 - Why Should You Consider Ballerina for Your Next Integration
WSO2CON2024 - Why Should You Consider Ballerina for Your Next IntegrationWSO2CON2024 - Why Should You Consider Ballerina for Your Next Integration
WSO2CON2024 - Why Should You Consider Ballerina for Your Next Integration
 

Alluxio Monthly Webinar | Why a Multi-Cloud Strategy Matters for Your AI Platform

  • 1. Webinar: Why a Multi-Cloud Strategy Matters for Your AI Platform Tarik Bennett tarik.bennett@alluxio.com February 27th, 2023
  • 2. Senior Solutions Engineer @ Alluxio Tarik Bennett 2
  • 3. Balancing performance, scalability, and cost Agnostic data layer Best practices for hybrid and multi-cloud Agenda
  • 4. Managing Costs Cloud Agility or Resource Availability Training Efficiency Primary Scenarios Addressed
  • 5. Source: Gartner 2023 1. By 2028, the adoption of AI will culminate in over 50% of cloud compute resources… up from less than 10% in 2023. 2. Global spending on public cloud services is forecast to increase 20.4% in 2024… the source of growth will be combination of cloud vendor price increases and increased utilization. 3. Deep learning models fed by images, internet-scale applications or even telemetry data have ever growing data requirements. AI Adoption is Ballooning Cloud Costs
  • 6. ● Efficient distributed computing ● Workload scheduling ● Modernizing or reducing legacy storage ● Minimizing data movement ● Improving data access ● Increasing scalability Efficiencies via Platform Improvements
  • 7. Source: Gartner 2023 According to the survey, almost half (47%) of C-suite executives don’t feel prepared for the accelerating rate of technological change. Further, only 27% claim their organizations are ready to scale up generative AI, and 44% say it will take more than six months to do so and take advantage of the potential benefits. Scalability and Cloud Agility
  • 8. Technical ● Improves scalability ● Enables hybrid cloud ● Expanded access to GPUs ● Best-of-breed AI tools available Non-Technical ● Leverage in cloud negotiations ● Security and governance, privacy, etc ● Service resilience ● Flexible access to the most cost-effective resources Why Multi-Cloud?
  • 9. Agility Comes with Some Overhead ● Data replication between DCs or regions Multi-Cloud Challenges Source: Alluxio
  • 10. Agility Comes with Some Overhead ● Data replication between DCs or regions ● Disruptive, costly or prolonged migrations to upgrade HDFS Object Store Multi-Cloud Challenges
  • 11. Agility Comes with Some Overhead ● Data replication between DCs or regions ● Disruptive, costly or prolonged migrations to upgrade ● Overlapping resources in cloud + on-prem compute compute compute Multi-Cloud Challenges
  • 12. Agility Comes with Some Overhead ● Data replication between DCs or regions ● Disruptive, costly or prolonged migrations to upgrade ● Overlapping resources in cloud + on-prem ● Need to address non-technical requirements within CSPs Multi-Cloud Challenges
  • 13. Given Multi-Cloud Benefits for AI, You Can Optimize ● Simplify wherever possible ● Reduce replication wherever possible ● Finding cost efficiencies via caching or other means ● Increase data locality ● Unify data access ● Increase throughput of commodity storage ● Reduce bandwidth congestion Best Practices
  • 14. ● Multi-Cloud architecture ○ Google Cloud Platform (GCP) ○ Oracle® Cloud Infrastructure (OCI) ● Data orchestration and caching Uber Multi-Cloud Architecture (Future) Source: Uber Jing Zhao 2024
  • 16. 16 Alluxio Data Platform High Performance data access, unified global view
  • 17.
  • 18. 18 Portability via Alluxio Kubernetes Operator
  • 20. Some data cannot be persisted in the cloud. Security teams will often approve ephemeral cache, while other options will be denied. High Performance Data Access Sensitive model training data Data evicted from the cache Benefits of Caching for Sensitive Data
  • 21. Standalone Cluster High Performance Data Access Layer Data from multiple sources served to GPU nodes Virtual Caching Across Local GPU Storage Data source synced to Virtual Alluxio Storage and shared between GPU nodes Alluxio Deployment Options for AI
  • 23. BUSINESS BENEFIT: TECH BENEFIT: Increase GPU utilization 50% 93% File System Training Data Training Data M o d e l s Training Data Models Model Training Model Training Model Deployment Model Inference Downstream Applications Model Update Training Clouds Offline Cloud Online Cloud APAC Quora CASE STUDY: High Performance AI Platform for LLM 2 - 4X faster time-to-market Before Alluxio: (1) Low GPU Utilization, (2) Overloaded Storage, (3) Network Congestion & Slow Model Refresh