SlideShare a Scribd company logo
1 of 48
Joseph E. Gonzalez
Co-director of the RISE Lab
jegonzal@cs.berkeley.edu
AI-Systems
Machine Learning
in the Cloud
What is cloud computing?
“The interesting thing about Cloud
Computing is that we’ve redefined
Cloud Computing to include
everything that we already do. . . . I
don’t understand what we would do
differently in the light of Cloud
Computing other than change the
wording of some of our ads.”
-- Larry Ellison,
Wall Street Journal, 2008
3
Quote from “Above the Clouds: A Berkeley View of Cloud Computing”
4
2016
8 years later …
“If ‘cloud computing’has a
meaning, it is not a way of doing
computing, but rather a way of
thinking about computing: a devil-
may-care approach which says,
‘Don't ask questions. Don't worry
about who controls your computing
or who holds your data. Don't check
for a hook hidden inside our service
before you swallow it. Trust
companies without hesitation.’In
other words, ‘Be a sucker.’”
-- Richard Stallman,
Boston Review, 2010
5
https://bostonreview.net/articles/richard-stallman-free-software-drm/
Made the case for cloud computing
Illusion of infinite resources
Elimination of up-front costs
Pay-per-use
Economies of scale for everyone
Simplified operations
Higher hardware utilization
Published in 2009 and received over
8,4000 citations
Early paper on cloud
computing and helped drive
excitement in the field.
2006-2011
Defining Characteristics of the Cloud
 The illusion of infinite computing resources available on demand
 The elimination of an up-front commitment by users
 The ability to pay for use of computing resources on a short-term basis as needed
 Essentially Utility Computing
Public Cloud: when these services are available to the general public
Private Cloud: when these services are sold within a business
Quote from “Above the Clouds: A Berkeley View of Cloud Computing”
The Beginning of the Cloud
1961: John McCarthy presents idea of Utility Computing
2006: Amazon Web Services (AWS) Launched
2008: Google Cloud Launched
2008: Azure Cloud Launched
2009: Berkeley writes: “Above the Clouds: A Berkeley View of Cloud
Computing”
Utility Computing is an old idea
“If computers of the kind I have advocated become the computers of the
future, then computing may someday be organized as a public utility just
as the telephone system is a public utility... The computer utility could
become the basis of a new and important industry.”
– John McCarthy
9
Early Pioneer in Artificial Intelligence
Turing award for contributions to AI
Public Cloud
(>$300B)
Why did it take until 2006?
 Change is business models for computing
 Web2.0 – low commitment, self-serve, pay-as-you-go -- not enough capital
to own equipment.
 Existing big-tech companies were aggressively exploiting economies
of scale to drive down costs
 Renting becomes cheaper than buying machines
 New workloads emerged to leverage elasticity:
 Web applications needed to handle demand surges
 Data intensive apps leveraged scale -- fast is cheap
10
Economics of the Cloud
 CapEx to OpEx: transition from large up-front capital expenditures to
operational expenditures
 More money to spend on your launching your business
 Improved Utilization through statistical multiplexing
 real-world server utilization for a single business ~ 5% to 20%.
 Economies of scales
 Negotiate lower hardware prices
 Spread management costs
 Leverage existing investments
11
The Cloud Enabled Academic Research
 Access to the latest hardware
 Ability to burst experiments near conference deadlines
 Usually…
 Ability for students to build and evaluate large-scale systems
 I would frequently run concurrent experiments with hundreds of machines each!
 industrial adoption
 Companies can evaluate open-source (academic) big data tools without big upfront
investment in hardware.
Y-axis scales are hidden
because they are
embarrassing
What about the Cloud?
 Access to latest GPUs and TPUs drove AI research
 used a LOT OF CREDITS (thank you AWS, Azure, & Google!)
13
We only had to “pay” for
this region.
Conference
Deadlines
The Elasticity
of the cloud
drove us to rethink
our approach to AI Research
14
Time
Accuracy
Deadline
Time
Accuracy
Deadline
Time
Accuracy
Deadline
Model
learning rate
𝒍, weight
decay d
GPU 1
GPU 2
GPU 3
GPU 4
GPU 5
GPU 6
GPU 7
GPU 8
Time (mins)
00:00 60:00
GPU 1
GPU 2
GPU 3
GPU 4
GPU 5
GPU 6
GPU 7
GPU 8
Time (mins)
00:00 45:00
Exploitation
Exploration
GPU 1
GPU 2
GPU 3
GPU 4
GPU 5
GPU 6
GPU 7
GPU 8
Time (mins)
00:00 45:00
Exploitation
1.0 2.0 4.0 8.0
Number of GPUs
0.0
2.0
4.0
6.0
8.0
Throughput
Scaling
1.684x
3.107x
5.735x
ResNet50
Linear
Scaling
1.0 2.0 4.0 8.0
Number of GPUs
0.0
2.0
4.0
6.0
8.0
Throughput
Scaling
1.684x
3.107x
5.735x
ResNet50
GPU 1
GPU 2
GPU 3
GPU 4
GPU 5
GPU 6
GPU 7
GPU 8
Time (mins)
00:00 45:00
Linear
Scaling
Exploitation
Resources = Machines Resources = Money
Fixed Cluster Cloud
GPU-mins
Resources = Machines Resources = Money
Fixed Cluster Cloud
deadline
start Time
GPU-mins
Resources = Machines Resources = Money
Fixed Cluster Cloud
deadline
start Time
GPU-mins
deadline
start Time
GPU-mins
More
parallel
exploration
Less Parallel
Exploitation
GPU 1
GPU 2
GPU 3
GPU 4
Time (mins)
00:00 Deadline
Liaw et al.
GPU 1
GPU 2
GPU 3
GPU 4
Time (mins)
00:00 Deadline
GPU 5
GPU 1
GPU 2
GPU 3
GPU 4
Time (mins)
00:00 Deadline
GPU 5
GPU 1
GPU 2
GPU 3
GPU 4
Time (mins)
00:00 Deadline
GPU 5
GPU 6
GPU 1
GPU 2
GPU 3
GPU 4
Time (mins)
00:00 Deadline
GPU 5
GPU 6
GPU 1
GPU 2
GPU 3
GPU 4
Time (mins)
00:00 Deadline
GPU 5
GPU 6
When it comes to machine-time allocation
Time (mins)
00:00 Deadline
GPU 1
GPU 2
GPU 3
GPU 4
Triangles > Rectangles
GPU 1
GPU 2
GPU 3
GPU 4
Time (mins)
00:00 Deadline
GPU 5
GPU 6
Think outside the box
Cloud computing  Infinite resources with a finite budget (volume
constraints).
31
Compute
Storage
Cluster
Computing
Compute
Storage
Cloud
Computing
The cloud as a utility
“Serverless”
32
Compute
Storage
Cloud
Computing
33
©2019 RISELab
Canonical example
Upload
Cloud Storage
Save
thumbnail
Save
path
Cloud Storage
Cloud Database
Cloud Server
Canonical example
Upload
Cloud Storage
Save
thumbnail
Save
path
Cloud Storage
Cloud Database
Cloud Function
FaaS
BaaS
BaaS
BaaS
36
©2019 RISELab
AWS Lambda
Introduced Function as a Service (FaaS)
Autoscaling done right
Highly elastic - adapts quickly
Scales down to zero
Fine-grained 100 ms billing increment
Cloud provider shares risk and responsibility for utilization
Strong isolation allowing multi-tenant multiplexing
Benefits from scale of Amazon’s platform & ecosystem of APIs
A Layer to Simplify Using the Cloud
Three essential qualities of serverless
computing
 Hides servers and the complexity of programming and operating
them
 Offers a pay-per-use cost model with no charge for idle resources
 Has excellent autoscaling so resources match demand closely
Airport Analogy
When you arrive at the destination airport and need to get to your hotel
you could:
1. Buy a car and drive [Legacy on premise systems]
 Long term investment and you are responsible for everything
2. Rent a car and drive [Serverfull]
 You are still responsible for fuel, parking, insurance, …
3. Take an Uber. [Serverless]
 You are paying only for the transportation you need
Where is the cloud headed?
With each phase of the cloud
(and computing), we raised the
abstraction.
This continuous process of
simplification enables
concurrent innovation on
both sides of the boundary.
It also improves portability and
transforms the market.
Sky Computing
Sky Example
45
ML Pipeline
Data
proc
Training Serving
Secure
Data proc
Training Serving
Sky
● Use Azure Confidential Computing for secure data processing
● Use Google Cloud for training on TPUs (fastest and cheapest)
● Use AWS for serving on Inferentia (cheapest)
Conjectures for the Future
● We will continue to race towards utility-oriented computing
○ Pay for consumption and not capacity
● Higher levels of abstraction will
○ Reduce operational complexity (burden shifts to the cloud + ML)
○ Drive more rapid innovation in cloud hardware
○ Enable applications to more easily span multiple clouds
● Sky computing is the inevitable future of computing
46
Readings This Week
47
Pollux: Co-adaptive Cluster Scheduling for
Goodput-Optimized Deep Learning
 Published in OSDI’21 – Best Paper Award
 Why we chose it?
 Good example of work exploring scheduling of for ML
 Addresses ML and Systems concerns: throughput, improvements in accuracy,
fairness?
 Things to think about:
 Implications for elasticity?
 What about hyperparameter search?
48
The Sky Above the Clouds [Unpublished]
 Draft of the vision paper describing Sky research agenda
 Do Not Distribute
 Feedback will help the paper (be critical!)
 Makes a case for both the inevitability and need for research in “Sky
Computing”
 Things to think about:
 Presentation of premise [what is proposed?]
 Role of data
 Role of research
 ML Systems Research Case?
49
FrugalML: How to Use ML Prediction APIs More
Accurately and Cheaply
 Published in NuerIPS’20
 Example of a “Sky Computing” ML research direction
 Combining competing prediction services to improve accuracy and reduce
costs.
 Potentially exciting new research direction!
 Things to think about:
 Latency
 Out-of-domain performance
 Uncertainty calibration and model biases
 Mathematical presentations
50

More Related Content

Similar to cloud_computing_for_ml_sys_hhhhhhhhhhhhhhhhhhhhhhhhhhh

Cloud computing – An Overview
Cloud computing – An OverviewCloud computing – An Overview
Cloud computing – An OverviewKannan Subbiah
 
Overview of Cloud Computing
Overview of Cloud ComputingOverview of Cloud Computing
Overview of Cloud ComputingDr Ganesh Iyer
 
What is Cloud Computing? A Complete Guide
What is Cloud Computing? A Complete GuideWhat is Cloud Computing? A Complete Guide
What is Cloud Computing? A Complete GuideAlaina Carter
 
What is Cloud Computing? A Complete Guide
What is Cloud Computing? A Complete GuideWhat is Cloud Computing? A Complete Guide
What is Cloud Computing? A Complete GuideMarianne Harness
 
Privacy Issues of Cloud Computing in the Federal Sector
Privacy Issues of Cloud Computing in the Federal SectorPrivacy Issues of Cloud Computing in the Federal Sector
Privacy Issues of Cloud Computing in the Federal SectorLew Oleinick
 
Introduction to Cloud computing
Introduction to Cloud computingIntroduction to Cloud computing
Introduction to Cloud computingMathews Job
 
Cloud storage & cloud computing
Cloud storage & cloud computingCloud storage & cloud computing
Cloud storage & cloud computingMichele Minighin
 
Cloud Roundtable at Microsoft Switzerland
Cloud Roundtable at Microsoft Switzerland Cloud Roundtable at Microsoft Switzerland
Cloud Roundtable at Microsoft Switzerland mictc
 
Cloud computing presentation
Cloud computing presentationCloud computing presentation
Cloud computing presentationAhmed Abdisalan
 

Similar to cloud_computing_for_ml_sys_hhhhhhhhhhhhhhhhhhhhhhhhhhh (20)

Cloud computing – An Overview
Cloud computing – An OverviewCloud computing – An Overview
Cloud computing – An Overview
 
cloud computing models
cloud computing modelscloud computing models
cloud computing models
 
Overview of Cloud Computing
Overview of Cloud ComputingOverview of Cloud Computing
Overview of Cloud Computing
 
Slide_Deck_CloudAttack.pptx
Slide_Deck_CloudAttack.pptxSlide_Deck_CloudAttack.pptx
Slide_Deck_CloudAttack.pptx
 
What is Cloud Computing? A Complete Guide
What is Cloud Computing? A Complete GuideWhat is Cloud Computing? A Complete Guide
What is Cloud Computing? A Complete Guide
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computing
 
What is Cloud Computing? A Complete Guide
What is Cloud Computing? A Complete GuideWhat is Cloud Computing? A Complete Guide
What is Cloud Computing? A Complete Guide
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Cloud Computing Essays
Cloud Computing EssaysCloud Computing Essays
Cloud Computing Essays
 
Computer project
Computer projectComputer project
Computer project
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computing
 
Privacy Issues of Cloud Computing in the Federal Sector
Privacy Issues of Cloud Computing in the Federal SectorPrivacy Issues of Cloud Computing in the Federal Sector
Privacy Issues of Cloud Computing in the Federal Sector
 
Introduction to Cloud computing
Introduction to Cloud computingIntroduction to Cloud computing
Introduction to Cloud computing
 
Cloud storage & cloud computing
Cloud storage & cloud computingCloud storage & cloud computing
Cloud storage & cloud computing
 
Cloud Roundtable at Microsoft Switzerland
Cloud Roundtable at Microsoft Switzerland Cloud Roundtable at Microsoft Switzerland
Cloud Roundtable at Microsoft Switzerland
 
Cloud computingjun28
Cloud computingjun28Cloud computingjun28
Cloud computingjun28
 
Cloud computingjun28
Cloud computingjun28Cloud computingjun28
Cloud computingjun28
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computing
 
Cloud computing presentation
Cloud computing presentationCloud computing presentation
Cloud computing presentation
 

Recently uploaded

Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...shivangimorya083
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 

Recently uploaded (20)

Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 

cloud_computing_for_ml_sys_hhhhhhhhhhhhhhhhhhhhhhhhhhh

  • 1. Joseph E. Gonzalez Co-director of the RISE Lab jegonzal@cs.berkeley.edu AI-Systems Machine Learning in the Cloud
  • 2. What is cloud computing?
  • 3. “The interesting thing about Cloud Computing is that we’ve redefined Cloud Computing to include everything that we already do. . . . I don’t understand what we would do differently in the light of Cloud Computing other than change the wording of some of our ads.” -- Larry Ellison, Wall Street Journal, 2008 3 Quote from “Above the Clouds: A Berkeley View of Cloud Computing”
  • 5. “If ‘cloud computing’has a meaning, it is not a way of doing computing, but rather a way of thinking about computing: a devil- may-care approach which says, ‘Don't ask questions. Don't worry about who controls your computing or who holds your data. Don't check for a hook hidden inside our service before you swallow it. Trust companies without hesitation.’In other words, ‘Be a sucker.’” -- Richard Stallman, Boston Review, 2010 5 https://bostonreview.net/articles/richard-stallman-free-software-drm/
  • 6. Made the case for cloud computing Illusion of infinite resources Elimination of up-front costs Pay-per-use Economies of scale for everyone Simplified operations Higher hardware utilization Published in 2009 and received over 8,4000 citations Early paper on cloud computing and helped drive excitement in the field. 2006-2011
  • 7. Defining Characteristics of the Cloud  The illusion of infinite computing resources available on demand  The elimination of an up-front commitment by users  The ability to pay for use of computing resources on a short-term basis as needed  Essentially Utility Computing Public Cloud: when these services are available to the general public Private Cloud: when these services are sold within a business Quote from “Above the Clouds: A Berkeley View of Cloud Computing”
  • 8. The Beginning of the Cloud 1961: John McCarthy presents idea of Utility Computing 2006: Amazon Web Services (AWS) Launched 2008: Google Cloud Launched 2008: Azure Cloud Launched 2009: Berkeley writes: “Above the Clouds: A Berkeley View of Cloud Computing”
  • 9. Utility Computing is an old idea “If computers of the kind I have advocated become the computers of the future, then computing may someday be organized as a public utility just as the telephone system is a public utility... The computer utility could become the basis of a new and important industry.” – John McCarthy 9 Early Pioneer in Artificial Intelligence Turing award for contributions to AI Public Cloud (>$300B)
  • 10. Why did it take until 2006?  Change is business models for computing  Web2.0 – low commitment, self-serve, pay-as-you-go -- not enough capital to own equipment.  Existing big-tech companies were aggressively exploiting economies of scale to drive down costs  Renting becomes cheaper than buying machines  New workloads emerged to leverage elasticity:  Web applications needed to handle demand surges  Data intensive apps leveraged scale -- fast is cheap 10
  • 11. Economics of the Cloud  CapEx to OpEx: transition from large up-front capital expenditures to operational expenditures  More money to spend on your launching your business  Improved Utilization through statistical multiplexing  real-world server utilization for a single business ~ 5% to 20%.  Economies of scales  Negotiate lower hardware prices  Spread management costs  Leverage existing investments 11
  • 12. The Cloud Enabled Academic Research  Access to the latest hardware  Ability to burst experiments near conference deadlines  Usually…  Ability for students to build and evaluate large-scale systems  I would frequently run concurrent experiments with hundreds of machines each!  industrial adoption  Companies can evaluate open-source (academic) big data tools without big upfront investment in hardware.
  • 13. Y-axis scales are hidden because they are embarrassing What about the Cloud?  Access to latest GPUs and TPUs drove AI research  used a LOT OF CREDITS (thank you AWS, Azure, & Google!) 13 We only had to “pay” for this region. Conference Deadlines
  • 14. The Elasticity of the cloud drove us to rethink our approach to AI Research 14
  • 18. Model learning rate 𝒍, weight decay d GPU 1 GPU 2 GPU 3 GPU 4 GPU 5 GPU 6 GPU 7 GPU 8 Time (mins) 00:00 60:00 GPU 1 GPU 2 GPU 3 GPU 4 GPU 5 GPU 6 GPU 7 GPU 8 Time (mins) 00:00 45:00 Exploitation Exploration
  • 19. GPU 1 GPU 2 GPU 3 GPU 4 GPU 5 GPU 6 GPU 7 GPU 8 Time (mins) 00:00 45:00 Exploitation 1.0 2.0 4.0 8.0 Number of GPUs 0.0 2.0 4.0 6.0 8.0 Throughput Scaling 1.684x 3.107x 5.735x ResNet50 Linear Scaling
  • 20. 1.0 2.0 4.0 8.0 Number of GPUs 0.0 2.0 4.0 6.0 8.0 Throughput Scaling 1.684x 3.107x 5.735x ResNet50 GPU 1 GPU 2 GPU 3 GPU 4 GPU 5 GPU 6 GPU 7 GPU 8 Time (mins) 00:00 45:00 Linear Scaling Exploitation
  • 21. Resources = Machines Resources = Money Fixed Cluster Cloud
  • 22. GPU-mins Resources = Machines Resources = Money Fixed Cluster Cloud deadline start Time GPU-mins
  • 23. Resources = Machines Resources = Money Fixed Cluster Cloud deadline start Time GPU-mins deadline start Time GPU-mins More parallel exploration Less Parallel Exploitation
  • 24. GPU 1 GPU 2 GPU 3 GPU 4 Time (mins) 00:00 Deadline Liaw et al.
  • 25. GPU 1 GPU 2 GPU 3 GPU 4 Time (mins) 00:00 Deadline GPU 5
  • 26. GPU 1 GPU 2 GPU 3 GPU 4 Time (mins) 00:00 Deadline GPU 5
  • 27. GPU 1 GPU 2 GPU 3 GPU 4 Time (mins) 00:00 Deadline GPU 5 GPU 6
  • 28. GPU 1 GPU 2 GPU 3 GPU 4 Time (mins) 00:00 Deadline GPU 5 GPU 6
  • 29. GPU 1 GPU 2 GPU 3 GPU 4 Time (mins) 00:00 Deadline GPU 5 GPU 6
  • 30. When it comes to machine-time allocation Time (mins) 00:00 Deadline GPU 1 GPU 2 GPU 3 GPU 4 Triangles > Rectangles GPU 1 GPU 2 GPU 3 GPU 4 Time (mins) 00:00 Deadline GPU 5 GPU 6
  • 31. Think outside the box Cloud computing  Infinite resources with a finite budget (volume constraints). 31 Compute Storage Cluster Computing Compute Storage Cloud Computing
  • 32. The cloud as a utility “Serverless” 32 Compute Storage Cloud Computing
  • 35. Canonical example Upload Cloud Storage Save thumbnail Save path Cloud Storage Cloud Database Cloud Function FaaS BaaS BaaS BaaS
  • 36. 36 ©2019 RISELab AWS Lambda Introduced Function as a Service (FaaS) Autoscaling done right Highly elastic - adapts quickly Scales down to zero Fine-grained 100 ms billing increment Cloud provider shares risk and responsibility for utilization Strong isolation allowing multi-tenant multiplexing Benefits from scale of Amazon’s platform & ecosystem of APIs
  • 37. A Layer to Simplify Using the Cloud
  • 38. Three essential qualities of serverless computing  Hides servers and the complexity of programming and operating them  Offers a pay-per-use cost model with no charge for idle resources  Has excellent autoscaling so resources match demand closely
  • 39. Airport Analogy When you arrive at the destination airport and need to get to your hotel you could: 1. Buy a car and drive [Legacy on premise systems]  Long term investment and you are responsible for everything 2. Rent a car and drive [Serverfull]  You are still responsible for fuel, parking, insurance, … 3. Take an Uber. [Serverless]  You are paying only for the transportation you need
  • 40. Where is the cloud headed?
  • 41. With each phase of the cloud (and computing), we raised the abstraction. This continuous process of simplification enables concurrent innovation on both sides of the boundary. It also improves portability and transforms the market.
  • 43. Sky Example 45 ML Pipeline Data proc Training Serving Secure Data proc Training Serving Sky ● Use Azure Confidential Computing for secure data processing ● Use Google Cloud for training on TPUs (fastest and cheapest) ● Use AWS for serving on Inferentia (cheapest)
  • 44. Conjectures for the Future ● We will continue to race towards utility-oriented computing ○ Pay for consumption and not capacity ● Higher levels of abstraction will ○ Reduce operational complexity (burden shifts to the cloud + ML) ○ Drive more rapid innovation in cloud hardware ○ Enable applications to more easily span multiple clouds ● Sky computing is the inevitable future of computing 46
  • 46. Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning  Published in OSDI’21 – Best Paper Award  Why we chose it?  Good example of work exploring scheduling of for ML  Addresses ML and Systems concerns: throughput, improvements in accuracy, fairness?  Things to think about:  Implications for elasticity?  What about hyperparameter search? 48
  • 47. The Sky Above the Clouds [Unpublished]  Draft of the vision paper describing Sky research agenda  Do Not Distribute  Feedback will help the paper (be critical!)  Makes a case for both the inevitability and need for research in “Sky Computing”  Things to think about:  Presentation of premise [what is proposed?]  Role of data  Role of research  ML Systems Research Case? 49
  • 48. FrugalML: How to Use ML Prediction APIs More Accurately and Cheaply  Published in NuerIPS’20  Example of a “Sky Computing” ML research direction  Combining competing prediction services to improve accuracy and reduce costs.  Potentially exciting new research direction!  Things to think about:  Latency  Out-of-domain performance  Uncertainty calibration and model biases  Mathematical presentations 50

Editor's Notes

  1. At the core of all machine learning is the curve of a models accuracy over training time. Read through any ML paper today and you will probably see a curve like this or an accuracy number which is simply the accuracy attained by the time the authors needed to publish their results. This is a very limited view of the model training process however, so let’s zoom out to see what this process looks like over the weeks before this conference or production deadline. Each of the lines in this graph represent a model with a different hyperparameter configuration, where a hyperparameter is any parameter that can affect the models training, and you can see that there is a surge of trial hyperparameters training very close to the deadline in order to obtain a model with the highest possible accuracy. This process is hyperparameter tuning and it is the most compute intensive part of the model development process. Since the space of possible hyperparameters is incredibly large and it takes a lot of compute to train even one model to convergence, developers will launch a subset of possible configurations, stopping some of these trials early if they do not look promising in order to reallocate those compute resources. The choice of hyperparameter has a drastic affect on final accuracy so its very important to find and train the optimal or close to optimal trial depected here in red. It is this deadline focused hyperparameter tuning that we are interested in for this talk. More specifically, how can we sufficiently explore the space of possible configurations to find a good trial and train it so that we can return a highly accuracte model given a fixed deadline and resource constraints.
  2. Hyperparameter tuning is a necessary but often dreaded step in the machine learning pipeline. Every model contains parameters that need to be set in training, for example learning rate or weight decay, that can drastically affect the learning process and thus the final accuracy. For example this graph shows the accuracy curves during training of a vision model set with different hyperparameter configurations. As you can see, the final accuracies are all over the place, and this effect is seen across almost all models and datasets. While this may
  3. Explain sub linear parallelism
  4. Explain sub linear parallelism
  5. Explain the concept of resrouce time
  6. Explain the concept of resrouce time
  7. Explain the concept of resrouce time
  8. Motivate traingles from squares
  9. Motivate traingles from squares
  10. Motivate traingles from squares
  11. Motivate traingles from squares
  12. Motivate traingles from squares
  13. Motivate traingles from squares
  14. Motivate traingles from squares
  15. Johann Started with you
  16. Johann App Engine launched earlier, very similar, some key differences
  17. TODO animate this slide, add example TcaplusDB
  18. TODO animate this slide, add example TcaplusDB