Exem flamingo meetup-7th-sparkr
남종환
An introduction to SparkR and an analysis case study
1.
Fraud Detection with SparkR
Big Data Division | FEA Team, 남종환. Understand SparkR and implement Credit Card Fraud Detection.
2.
Contents: 1. Spark Overview 2. SparkR Motivation / Creator / History 3. SparkR Architecture 4. SparkR Usage 5. Demo – Credit Card Fraud Detection 6. Future Work
3.
Spark Overview
4.
SparkR Motivation
5.
SparkR Creator http://shivaram.org/#projects https://www.linkedin.com/in/zonghengyang/
6.
SparkR History SparkR DataFrame https://en.wikipedia.org/wiki/Apache_Spark SparkR Start
7.
SparkR Architecture https://www.slideshare.net/databricks/recent-developments-in-sparkr-for-advanced-analytics
8.
SparkR Usage – How to Run
• Run from Script
• Run from RStudio
9.
SparkR Usage - API http://spark.apache.org/docs/latest/api/R/index.html
10.
SparkR Usage - API http://spark.apache.org/docs/latest/api/R/index.html
11.
SparkR Usage - Data Conversion https://www.slideshare.net/databricks/recent-developments-in-sparkr-for-advanced-analytics
12.
SparkR Usage – Descriptive https://www.slideshare.net/databricks/recent-developments-in-sparkr-for-advanced-analytics
13.
SparkR Usage – Descriptive https://www.slideshare.net/databricks/recent-developments-in-sparkr-for-advanced-analytics
14.
SparkR Usage – Descriptive https://www.slideshare.net/databricks/recent-developments-in-sparkr-for-advanced-analytics
15.
SparkR Usage – Predictive https://www.slideshare.net/databricks/recent-developments-in-sparkr-for-advanced-analytics
16.
SparkR Usage – Predictive https://www.slideshare.net/databricks/recent-developments-in-sparkr-for-advanced-analytics
17.
SparkR Usage – 1.6 to 2.0
18.
SparkR Future Directions https://www.slideshare.net/databricks/recent-developments-in-sparkr-for-advanced-analytics
19.
Reference links
• http://spark.apache.org/docs/latest/api/R/index.html
• http://hoondongkim.blogspot.kr/search/label/SparkML
• https://www.youtube.com/watch?v=cSnkb7HYdc0&feature=youtu.be
• https://www.slideshare.net/databricks/recent-developments-in-sparkr-for-advanced-analytics
• https://github.com/hoxo-m/SparkRext
20.
Demo
• Credit Card Fraud Detection
• From R To SparkR
https://www.kaggle.com/dalpozz/creditcardfraud
21.
Demo
• Dimension : 284,807 rows x 31 columns
• Description
• Time : time
• V1 - V29 : card-usage data; the meaning cannot be inferred from the variable names
• Class : 1 = Fraud, 0 = normal
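The column layout and the Class labels described on this slide can be sketched in a few lines. This is a plain-Python illustration rather than the SparkR code used in the demo; the names `COLUMNS`, `class_distribution`, and `toy_rows` are hypothetical, and the toy rows are made up rather than taken from the Kaggle data.

```python
from collections import Counter

# Column layout as described on the slide: Time, V1..V29, Class (31 columns).
COLUMNS = ["Time"] + [f"V{i}" for i in range(1, 30)] + ["Class"]


def class_distribution(rows):
    """Count rows per label; Class is 1 for fraud and 0 for normal."""
    counts = Counter(int(row["Class"]) for row in rows)
    return {"normal": counts.get(0, 0), "fraud": counts.get(1, 0)}


# Hypothetical toy rows (all zeros), with one row marked as fraud.
toy_rows = [dict.fromkeys(COLUMNS, 0.0) for _ in range(5)]
toy_rows[3]["Class"] = 1

print(len(COLUMNS))
print(class_distribution(toy_rows))
```

Counting labels this way is a quick check of how many fraud rows a sample actually contains before any model is fitted.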
22.
Demo
• Work Flow ( Logistic Regression )
• Data collection
• Data Sampling
• Predictive Modelling
• Split data 70 : 30
• Convert R data to SparkDataFrame
• Logistic Regression
• Check output class distribution
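The modelling steps of this workflow (70 : 30 split, logistic regression, output class distribution) can be sketched as follows. This is a minimal plain-Python stand-in, not the SparkR code from the demo: the data is synthetic, the model is fitted with a hand-written batch gradient descent instead of Spark's model fitting, and the sampling and SparkDataFrame conversion steps are left out. All function and variable names are hypothetical.

```python
import math
import random
from collections import Counter


def split_70_30(rows, seed=0):
    """Shuffle a copy of the rows, then cut it 70:30 into train and test."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = len(shuffled) * 7 // 10
    return shuffled[:cut], shuffled[cut:]


def sigmoid(z):
    # Clamp z so math.exp cannot overflow on extreme inputs.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(rows, epochs=200, lr=0.1):
    """Fit weights and bias by batch gradient descent on the log loss."""
    n_features = len(rows[0][0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for features, label in rows:
            z = sum(w * x for w, x in zip(weights, features)) + bias
            error = sigmoid(z) - label
            for j, x in enumerate(features):
                grad_w[j] += error * x
            grad_b += error
        weights = [w - lr * g / len(rows) for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / len(rows)
    return weights, bias


def predict(model, features):
    """Return 1 (fraud) when the predicted probability is at least 0.5."""
    weights, bias = model
    z = sum(w * x for w, x in zip(weights, features)) + bias
    return 1 if sigmoid(z) >= 0.5 else 0


# Synthetic, well-separated toy data: 20 "fraud" rows and 80 "normal" rows.
rng = random.Random(42)
rows = []
for i in range(100):
    label = 1 if i % 5 == 0 else 0
    center = 2.0 if label == 1 else -2.0
    rows.append(([rng.gauss(center, 0.5), rng.gauss(center, 0.5)], label))

train, test = split_70_30(rows)
model = fit_logistic(train)
accuracy = sum(predict(model, f) == y for f, y in test) / len(test)
distribution = Counter(predict(model, f) for f, _ in test)

print(len(train), len(test))
print(accuracy)
print(dict(distribution))
```

On a heavily imbalanced data set, the predicted class distribution is worth checking alongside accuracy, since a model that labels every row as normal can still score a high accuracy.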
23.
Thank you
Big Data Division | FEA Team, 남종환