Distributed deep rl on spark strata singapore

•

3 likes•1,976 views

This talk briefly covers deep reinforcemeent learning on spark and the benefits of using large scale commodity compute with gpus for ease of running simulations as well as distributed training for use cases that aren't games such as network intrusion and risk. This talk also briefly mentions rl4j and our work with openai gym.

Data & Analytics

SKYMIND INTELLIGENCE LAYER (SKIL)
REFERENCE ARCHITECTURE

Overview
● Why am I up here?
● Reinforcement Learning
● Use cases
● Demo!
● Deep Reinforcement Learning
● Rl4j
● Dl4j
● Spark/RL - why?

Why am I up
here?
Wrote this -->
Book Giveaway!

Reinforcement
Learning
● Learn a “policy” with repeated trial
and error
● An agent explores a search space
● Learns from rewards and penalties
each time it takes a step
● Think of win/lose scenarios
● Rewards/punishment set by an
“environment”
Credit:
http://ai.berkeley.edu/reinforcement.ht
ml

Use cases (not
games!)
● Risk analysis (loans)
● Network Intrusion
● Learning patterns from
simulations (MCMC)

Deep
Reinforcement
Learning
● Teach a neural net from environment
● Policy determines gradient descent steps
● Most work has been based on raw frames
from games (pixel input)
● Various techniques (A3C,Policy Gradients,Deep
Q,..)
● Core idea: Neural net has a softmax
(probability distribution) mapped to actions to
take in an environment

RL4j
● Deep Reinforcement Learning
library for Java
● Openai Gym Intregration
● Deep Reinforcement Learning
with DL4j
● Implementations of A3C,DeepQ,
Policy Gradients
● Openai Gym Java Bindings

Dl4j
● Import keras models
● Focus on running in production
● Integrate with existing big data ecosystem
● Transparent usage of cpus and gpus
● End to end ecosystem for building data
products (not just algorithms!)

Spark/RL Why?
● Spark is distributed compute
● A lot of simulations and
environments to run
● Distributed workers running
experiments in parallel
● Data Parallelism with neural nets

Summary
● Spark for orchestrating simulations
● Spark for distributed training
● Integrated storage with HDFS
● Orchestrate GPU based spark jobs
● Easy to hook in to production (java/scala)
● Great streaming ecosystem for incremental
updates

Distributed deep rl on spark strata singapore

Viewers also liked

Deep Learning with GPUs in Production - AI By the BayAdam Gibson

Deep learning in production with the bestAdam Gibson

SKIL - Dl4j in the wild meetupAdam Gibson

Dl4j in the wildAdam Gibson

Anomaly detection in deep learning (Updated) EnglishAdam Gibson

Productionizing dl from the ground upAdam Gibson

Hadoop summit 2016Adam Gibson

Recurrent nets and sensorsAdam Gibson

Deep learning with Hortonworks and Apache Spark - Hortonworks technical workshopHortonworks

Future of ai on the jvmAdam Gibson

Deep Learning using Spark and DL4J for fun and profitDataWorks Summit/Hadoop Summit

Anomaly Detection in Deep Learning (Updated)Adam Gibson

The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...Big Data Spain

Strata Beijing - Deep Learning in Production on SparkAdam Gibson

H2O World - Top 10 Deep Learning Tips & Tricks - Arno CandelSri Ambati

August 2016 HUG: Open Source Big Data Ingest with StreamSets Data Collector Yahoo Developer Network

Suneel Marthi - Deep Learning with Apache Flink and DL4JFlink Forward

Apache Hadoop 3.0 What's new in YARN and MapReduceDataWorks Summit/Hadoop Summit

August 2016 HUG: Better together: Fast Data with Apache Spark™ and Apache Ign...Yahoo Developer Network

Dynamic and Static ModelingSaurabh Kumar

Viewers also liked (20)

Deep Learning with GPUs in Production - AI By the Bay

Deep learning in production with the best

SKIL - Dl4j in the wild meetup

Dl4j in the wild

Anomaly detection in deep learning (Updated) English

Productionizing dl from the ground up

Hadoop summit 2016

Recurrent nets and sensors

Deep learning with Hortonworks and Apache Spark - Hortonworks technical workshop

Future of ai on the jvm

Deep Learning using Spark and DL4J for fun and profit

Anomaly Detection in Deep Learning (Updated)

The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...

Strata Beijing - Deep Learning in Production on Spark

H2O World - Top 10 Deep Learning Tips & Tricks - Arno Candel

August 2016 HUG: Open Source Big Data Ingest with StreamSets Data Collector

Suneel Marthi - Deep Learning with Apache Flink and DL4J

Apache Hadoop 3.0 What's new in YARN and MapReduce

August 2016 HUG: Better together: Fast Data with Apache Spark™ and Apache Ign...

Dynamic and Static Modeling

Similar to Distributed deep rl on spark strata singapore

Sequential Decision Making in RecommendationsJaya Kawale

Ai architectureand designpatternsgdc2009SinisterM

Memory-based Reinforcement LearningHung Le

Is Production RL at a tipping point?M Waleed Kadous

The Risks of YOLOing-2.pdfHacken

How DeepMind Mastered The Game Of GoTim Riser

Self-supervised Learning Lecture NoteSangwoo Mo

Building a deep learning ai.pptxDaniel Slater

Rewrite the whole damn thingCrypto Cg

Neural networks with pythonTom Dierickx

Avoiding GraphQL insecurities with OWASP SKF - OWASP HU meetupDavide Cioccia

Performance optimization techniques for Java codeAttila Balazs

"Deep Reinforcement Learning for Optimal Order Placement in a Limit Order Boo...Quantopian

Salt Identification Challengekenluck2001

OISF - Continuous Skills Improvement for EveryoneCiNPA Security SIG

A brief overview of Reinforcement Learning applied to gamesThomas da Silva Paula

LearningKit.pptbutest

anintroductiontoreinforcementlearning-180912151720.pdfssuseradaf5f

An introduction to reinforcement learningSubrat Panda, PhD

Validating Big Data Pipelines - Big Data Spain 2018Holden Karau

Similar to Distributed deep rl on spark strata singapore (20)

Sequential Decision Making in Recommendations

Ai architectureand designpatternsgdc2009

Memory-based Reinforcement Learning

Is Production RL at a tipping point?

The Risks of YOLOing-2.pdf

How DeepMind Mastered The Game Of Go

Self-supervised Learning Lecture Note

Building a deep learning ai.pptx

Rewrite the whole damn thing

Neural networks with python

Avoiding GraphQL insecurities with OWASP SKF - OWASP HU meetup

Performance optimization techniques for Java code

"Deep Reinforcement Learning for Optimal Order Placement in a Limit Order Boo...

Salt Identification Challenge

OISF - Continuous Skills Improvement for Everyone

A brief overview of Reinforcement Learning applied to games

LearningKit.ppt

anintroductiontoreinforcementlearning-180912151720.pdf

An introduction to reinforcement learning

Validating Big Data Pipelines - Big Data Spain 2018

Recently uploaded

VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh9953056974 Low Rate Call Girls In Saket, Delhi NCR

VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor

04242024_CCC TUG_Joins and Relationshipsccctableauusergroup

Brighton SEO | April 2024 | Data StorytellingNeil Barnes

Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda

Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha

{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal

High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh

Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna

Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth

Invezz.com - Grow your wealth with trading signalsInvezz1

Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa

Predicting Employee Churn: A Data-Driven Approach Project PresentationBoston Institute of Analytics

Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten

Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...shivangimorya083

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083

FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg

EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster

Recently uploaded (20)

VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...

Dubai Call Girls Wifey O52&786472 Call Girls Dubai

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh

VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...

04242024_CCC TUG_Joins and Relationships

Brighton SEO | April 2024 | Data Storytelling

Customer Service Analytics - Make Sense of All Your Data.pptx

Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...

{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...

High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...

Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...

Unveiling Insights: The Role of a Data Analyst

Invezz.com - Grow your wealth with trading signals

Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf

Predicting Employee Churn: A Data-Driven Approach Project Presentation

Log Analysis using OSSEC sasoasasasas.pptx

Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call

FESE Capital Markets Fact Sheet 2024 Q1.pdf

EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx

Distributed deep rl on spark strata singapore

2. SKYMIND INTELLIGENCE LAYER (SKIL) REFERENCE ARCHITECTURE

3. Overview ● Why am I up here? ● Reinforcement Learning ● Use cases ● Demo! ● Deep Reinforcement Learning ● Rl4j ● Dl4j ● Spark/RL - why?

4. Why am I up here? Wrote this --> Book Giveaway!

5. Reinforcement Learning ● Learn a “policy” with repeated trial and error ● An agent explores a search space ● Learns from rewards and penalties each time it takes a step ● Think of win/lose scenarios ● Rewards/punishment set by an “environment” Credit: http://ai.berkeley.edu/reinforcement.ht ml

6. Use cases (not games!) ● Risk analysis (loans) ● Network Intrusion ● Learning patterns from simulations (MCMC)

7. Demo! Cartpole (Hello world of RL)

8. Deep Reinforcement Learning ● Teach a neural net from environment ● Policy determines gradient descent steps ● Most work has been based on raw frames from games (pixel input) ● Various techniques (A3C,Policy Gradients,Deep Q,..) ● Core idea: Neural net has a softmax (probability distribution) mapped to actions to take in an environment

9. RL4j ● Deep Reinforcement Learning library for Java ● Openai Gym Intregration ● Deep Reinforcement Learning with DL4j ● Implementations of A3C,DeepQ, Policy Gradients ● Openai Gym Java Bindings

10. Dl4j

11. Dl4j ● Import keras models ● Focus on running in production ● Integrate with existing big data ecosystem ● Transparent usage of cpus and gpus ● End to end ecosystem for building data products (not just algorithms!)

12. Spark/RL Why? ● Spark is distributed compute ● A lot of simulations and environments to run ● Distributed workers running experiments in parallel ● Data Parallelism with neural nets

13. Summary ● Spark for orchestrating simulations ● Spark for distributed training ● Integrated storage with HDFS ● Orchestrate GPU based spark jobs ● Easy to hook in to production (java/scala) ● Great streaming ecosystem for incremental updates

Distributed deep rl on spark strata singapore

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (20)

Similar to Distributed deep rl on spark strata singapore

Similar to Distributed deep rl on spark strata singapore (20)

More from Adam Gibson

More from Adam Gibson (17)

Recently uploaded

Recently uploaded (20)

Distributed deep rl on spark strata singapore