This document explains how to initialize a Spark context in REPL/local mode, i.e. in the sbt console from the command prompt. It also includes test code and useful links you can follow to get good hands-on experience with data analytics.
Requirements:
• A computer running Windows 7/8/10
• sbt installed
• Scala (I used 2.10.6)
• Java (I used 1.8.0_181)
Creating a Maven-compliant folder structure
Please follow the steps below:
1. Go to any directory in Windows and create the folder structure below.
Ex: C:\Users\(systemname)
2. Create a folder with any name (I created ie_spark) and create the sub-folder structure shown below (see the layout sketch at the end of this section).
Ex: C:\Users\(systemname)\ie_spark\src\main\scala
3. Create a file called build.sbt with the content below and place it in the following path:
Ex: C:\Users\(systemname)\ie_spark
• The content of the build.sbt file is as follows:
name := "ie_spark_example"
version := "0.1.1"
scalaVersion := "2.10.6" // my Scala version is 2.10.6
libraryDependencies += "org.apache.spark" % "spark-core_2.10" % "1.6.2"
• Basically, we are giving sbt instructions for resolving the library dependencies listed in build.sbt.
• When you compile Scala code using sbt, it looks at build.sbt to see which dependencies are needed, goes to the relevant repositories, and downloads them so that they are available when your program needs them.
• In other words, we are declaring that our Scala program requires the Apache Spark Core library, so sbt will go to the Maven repository and grab this library along with all of its dependencies.
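For reference, this is roughly what the project layout should look like once the folders and build.sbt are in place (a sketch; (systemname) stands for your own Windows user name):
C:\Users\(systemname)\ie_spark\
    build.sbt
    src\
        main\
            scala\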
• Now go to the command prompt and perform the following steps:
1. Change directory to ie_spark. Use the following command to change the directory:
cd C:\Users\(systemname)\ie_spark
2. Run the command sbt console and press Enter.
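A rough sketch of what this session looks like (the exact prompt text and output will vary on your machine; the first run may take a while because sbt downloads the Spark dependencies):
C:\> cd C:\Users\(systemname)\ie_spark
C:\Users\(systemname)\ie_spark> sbt console
... (sbt resolves and downloads the dependencies, then starts the Scala REPL)
scala>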
• First create a SparkConf object, where you also specify the Spark master. Since you are executing this on a single laptop, specify "local" mode here:
val conf = new org.apache.spark.SparkConf().setAppName("IE SPARK").setMaster("local")
The next step is to create the SparkContext (the entry point to the Spark cluster and its APIs) using the conf object:
val sc = new org.apache.spark.SparkContext(conf)
Figure: creating the configuration and initializing the Spark context.
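As a quick sanity check (this uses the standard SparkContext API; the res number in your session may differ), you can confirm which master the context is running with:
scala> sc.master
res0: String = local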
Testing:
• A video is attached for your reference, showing that the Spark context is working as expected.
• You can also check by using the code below:
scala> val parallel = sc.parallelize(Array("Ankit", "Vinay")) // first input
parallel: org.apache.spark.rdd.RDD[String] = ParallelCollectionRDD[27] at parallelize at <console>:24
scala> val par2 = sc.parallelize(Array("praveen", "Raju", "Ankit")) // second input
par2: org.apache.spark.rdd.RDD[String] = ParallelCollectionRDD[28] at parallelize at <console>:24
scala> parallel.union(par2).collect // union output
res21: Array[String] = Array(Ankit, Vinay, praveen, Raju)
scala> parallel.intersection(par2).collect // intersection output
res21: Array[String] = Array(Ankit)
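As a further check, you can run one more simple transformation and action on an RDD (a small sketch; the res number in your session will differ):
scala> sc.parallelize(1 to 10).filter(_ % 2 == 0).count()
res22: Long = 5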