Data Science | Design | Technology
https://www.meetup.com/DSDTMTL
May 26, 2021
Please, don't forget to
mute yourself
JL Maréchaux
DSDT Co-Organizer
Marc G. Bellemare
Research Scientist
Google Research
Brain Team
https://www.meetup.com/DSDTMTL
Agenda
3:45 - 4:00 Arrival & Networking
4:00 - 4:15 News & Intro
4:15 - 5:15 How can reinforcement learning help us fly balloons in the stratosphere?
5:15 - 5:30 Virtual Snack & Networking
DSDT Meetup - May 26, 2021
A special thanks to our contributors…
Thanks
Merci
The
(virtual)
venue
sponsor
& snacks
The brain
...
Virtual Meetups
Until we can do in-person events
again in Montreal…
Past (and future) presentations
available on Slideshare.
http://www.slideshare.net/DSDT_MTL
Monthly cadence, on Wednesdays.
Incredible sessions already planned for May, June and July.
Contact us with your expectations & ideas.
What is coming in 2021
Topics: ML Validation, Reinforcement Learning, Explainable AI, RNN & Time Series
Dates: April 28, May 26, June 16, July 21
Your ideas,
your meetup.
http://bit.ly/DSDTsurvey2021
Our 2021 campaign to fight against
poverty and social exclusion.
Data Science.
Design.
Technology.
https://centraide-mtl.org/dsdtmtl
How can reinforcement learning help us fly balloons in the stratosphere?
Marc G. Bellemare
Decisions from data: Controlling complex systems with reinforcement learning
Marc G. Bellemare (1), Salvatore Candido (2), Pablo Samuel Castro (1), Jun Gong (2), Marlos C. Machado (1), Subhodeep Moitra (1), Sameera S. Ponda (2), Ziyu Wang (1)
(1) Google Research, Brain team
(2) Loon
With thanks to: Beth Reid, Joshua Greaves, Bradley Rhodes, and many more
https://www.nature.com/articles/s41586-020-2939-8
Image credit: https://bejofo.net/ttt
Reinforcement learning = trial and error
data → decisions
Trial and error and cats
Credit assignment is at the heart of RL
Credit assignment via the Bellman equation
Markov decision process
Implemented as a
Deep neural network
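A minimal sketch of credit assignment via the Bellman equation, Q(s, a) = r + γ max over a' of Q(s', a'). The three-state chain, its rewards and the discount factor are hypothetical and chosen only for illustration; a lookup table stands in for the deep neural network.

```python
# Hypothetical 3-state chain MDP (not from the talk):
# moving "right" from state 2 reaches the goal and pays reward 1.
GAMMA = 0.9  # discount factor (assumed)
STATES = [0, 1, 2]
ACTIONS = ["left", "right"]


def step(state, action):
    """Deterministic toy dynamics: returns (next_state, reward, done)."""
    if action == "right":
        if state == 2:
            return state, 1.0, True  # goal reached, episode ends
        return state + 1, 0.0, False
    return max(state - 1, 0), 0.0, False


def bellman_backup(q):
    """One sweep of Q(s, a) <- r + gamma * max_a' Q(s', a')."""
    new_q = {}
    for s in STATES:
        for a in ACTIONS:
            s2, r, done = step(s, a)
            future = 0.0 if done else max(q[(s2, b)] for b in ACTIONS)
            new_q[(s, a)] = r + GAMMA * future
    return new_q


q = {(s, a): 0.0 for s in STATES for a in ACTIONS}
for _ in range(50):
    q = bellman_backup(q)
```

After the sweeps, the single reward at the goal has been propagated backwards: states further from the goal carry a smaller, discounted share of the credit.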
80 units; Tesauro (1995)
40 x 256 convolutional filters; Silver et al. (2017)
Deep reinforcement learning
Many RL problems are...
Underactuated
Partially observable
Loon proprietary
The quasi-biennial oscillation, Baldwin et al. (2001)
312 Days in the Stratosphere, Loon, Oct 28 2020.
Long-term objective, binary signal
Partial observability
Limited power
Underactuated system, stochastic dynamics
StationSeeker in equations
1) Wind score.
2) Per-altitude score.
3) Setpoint to max. scoring altitude.
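The three steps above can be sketched as follows. The slide does not give the actual equations, so the scoring rule here (cosine between the wind vector and the direction to the station) is purely an assumption, as are the function names and the sample wind column.

```python
import math


def wind_score(wind_uv, to_station_uv):
    """Assumed score: cosine of the angle between wind and bearing to station."""
    wu, wv = wind_uv
    su, sv = to_station_uv
    norm = math.hypot(wu, wv) * math.hypot(su, sv)
    if norm == 0.0:
        return 0.0  # calm wind or balloon already at the station
    return (wu * su + wv * sv) / norm


def choose_setpoint(wind_column, to_station_uv):
    """Score every altitude, then return the best altitude and all scores.

    wind_column maps altitude (km) to an (east, north) wind vector in m/s.
    """
    scores = {alt: wind_score(w, to_station_uv) for alt, w in wind_column.items()}
    return max(scores, key=scores.get), scores


# Hypothetical wind column; the station lies due west of the balloon.
column = {16.0: (5.0, 0.0), 17.0: (-4.0, 1.0), 18.0: (0.0, 6.0)}
setpoint, scores = choose_setpoint(column, to_station_uv=(-1.0, 0.0))
```

In this toy column the 17 km wind blows almost directly toward the station, so it becomes the altitude setpoint.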
Deep reinforcement learning for balloon navigation
+16 ambient variables
Forecast + measurements + Gaussian process = wind column
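A minimal sketch of how a forecast and a measurement might be blended with a Gaussian process, assuming the forecast acts as the prior mean and a single noisy wind measurement is available. The kernel, its hyperparameters and the flat forecast are illustrative assumptions, not values from the talk.

```python
import math


def rbf(z1, z2, variance=9.0, length_km=1.0):
    """Squared-exponential kernel over altitude (hyperparameters assumed)."""
    return variance * math.exp(-0.5 * ((z1 - z2) / length_km) ** 2)


def posterior_wind(z, forecast, z_obs, wind_obs, noise_var=0.25):
    """GP posterior mean and variance at altitude z given one measurement.

    forecast: function mapping altitude (km) to forecast wind (m/s),
    used as the prior mean.
    """
    gain = rbf(z, z_obs) / (rbf(z_obs, z_obs) + noise_var)
    mean = forecast(z) + gain * (wind_obs - forecast(z_obs))
    var = rbf(z, z) - gain * rbf(z, z_obs)
    return mean, var


def flat_forecast(z):
    """Hypothetical forecast: 5 m/s at every altitude."""
    return 5.0


# The balloon measures 8 m/s at 17 km.
near = posterior_wind(17.0, flat_forecast, z_obs=17.0, wind_obs=8.0)
far = posterior_wind(22.0, flat_forecast, z_obs=17.0, wind_obs=8.0)
```

Close to the measurement the estimate follows the observation with low variance; far from it the estimate falls back to the forecast with the full prior variance, which is the uncertainty a controller would want to see in a wind column.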
The ERA5 reanalysis (dataset) provides baseline winds
Like real, but low resolution.
Baseline winds are upsampled using procedural noise:
Statistically plausible
High resolution
Effectively infinite supply
The simulator
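The upsampling idea can be sketched in one dimension, assuming simple seeded value noise as a stand-in for whatever procedural noise the simulator uses (the slide does not say). All names and parameters are hypothetical.

```python
import math
import random


def lerp(a, b, t):
    """Linear interpolation between a and b."""
    return a + (b - a) * t


def value_noise(x, seed, amplitude=1.0):
    """Smooth 1-D value noise: seeded random lattice values, cosine-blended."""
    i = math.floor(x)
    t = x - i
    left = random.Random(f"{seed}:{i}").uniform(-amplitude, amplitude)
    right = random.Random(f"{seed}:{i + 1}").uniform(-amplitude, amplitude)
    smooth_t = (1 - math.cos(math.pi * t)) / 2
    return lerp(left, right, smooth_t)


def upsample(coarse, factor, seed, noise_scale=4.0, amplitude=1.0):
    """Interpolate coarse winds to `factor` points per cell and add noise."""
    fine = []
    for k in range((len(coarse) - 1) * factor + 1):
        x = k / factor
        i = min(int(x), len(coarse) - 2)
        base = lerp(coarse[i], coarse[i + 1], x - i)
        fine.append(base + value_noise(x * noise_scale, seed, amplitude))
    return fine


coarse_winds = [3.0, 5.0, 4.0]  # hypothetical low-resolution baseline (m/s)
field_a = upsample(coarse_winds, factor=8, seed=1)
field_b = upsample(coarse_winds, factor=8, seed=2)
```

Each seed yields a different high-resolution field around the same baseline, which is one way to read "effectively infinite supply".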
Design and training
2-day training simulations
In the tropics (+/- 25 lat.)
Starting up to 200km away
Light filtering of “impossible” conditions
Distributional predictions (QR-DQN)
Distributed training:
100 actors
4 replay buffers
1 GPU
1.1B training steps (~30 days wall time)
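For the distributional predictions, a minimal sketch of the quantile Huber loss used by QR-DQN, written in plain Python for readability. The function names and the toy inputs are illustrative; a real agent would compute this on batches inside a deep learning framework.

```python
def huber(u, kappa=1.0):
    """Huber loss: quadratic near zero, linear beyond kappa."""
    a = abs(u)
    return 0.5 * u * u if a <= kappa else kappa * (a - 0.5 * kappa)


def quantile_huber_loss(predicted, targets, kappa=1.0):
    """Sum over predicted quantiles of the mean loss against target samples.

    predicted: N quantile estimates of the return distribution.
    targets: sample returns from the bootstrapped target distribution.
    """
    n = len(predicted)
    total = 0.0
    for i, theta in enumerate(predicted):
        tau = (2 * i + 1) / (2 * n)  # quantile midpoint for estimate i
        for target in targets:
            u = target - theta
            weight = abs(tau - (1.0 if u < 0 else 0.0))
            total += weight * huber(u, kappa) / kappa
    return total / len(targets)


loss = quantile_huber_loss([1.0, 2.0], [1.0, 2.0])
```

The asymmetric weight is what pushes each output toward its own quantile, so the network predicts a distribution of returns rather than a single expected value.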
Pacific Ocean Experiment
26 Oct 2019 – 25 Jan 2020
13 balloons
Total 2884 RL flight hours
Longest RL flight ~16 days
0N 114W
“StationSeeker” balloons
“Perciatelli” balloons
StationSeeker: TWR50 72%, Power 33 W
Perciatelli: TWR50 79%, Power 29 W
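A sketch of how a TWR50-style metric could be computed, assuming TWR50 denotes the fraction of flight time spent within 50 km of the station (the slide does not spell this out). The track below is made up; only the station location, 0N 114W, comes from the slides.

```python
import math

EARTH_RADIUS_KM = 6371.0


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def time_within_range(track, station, radius_km=50.0):
    """Fraction of equally spaced position samples within radius_km of station."""
    inside = sum(haversine_km(lat, lon, *station) <= radius_km for lat, lon in track)
    return inside / len(track)


station = (0.0, -114.0)  # 0N 114W
# Hypothetical positions sampled at a fixed interval.
track = [(0.0, -114.0), (0.2, -114.1), (0.4, -114.0), (1.0, -113.0)]
twr50 = time_within_range(track, station)
```

Three of the four samples fall inside the 50 km circle, so this toy track scores 75%.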
312 Days in the Stratosphere, Loon, Oct 28 2020.
www.flightradar24.com
Why does this matter?
Bellemare, Naddaf, Veness, Bowling (2013)
Mnih, Silver, Kavukcuoglu, Rusu, Veness, Bellemare, et al. (Nature, 2015)
Levine et al. (2016)
Kalashnikov, Irpan, et al. (2018)
Silver, Huang, Maddison, et al. (2016, 2017)
Bard, Foerster, Chandar, et al. (2020)
OpenAI et al. (2019)
Vinyals et al. (2019)
Deep reinforcement learning
Mirhoseini, Goldie, et al. (arXiv, 2020)
Won et al. (2020)
Glavic et al. (2017)
Ie et al. (2019)
Reddy et al. (2018)
Merci / Thank You
@DsdtMtl
Data Science | Design | Technology
(Check for next DSDT meetup at https://www.meetup.com/DSDTmtl)
http://bit.ly/dsdtmtl-in
