GET STARTED WITH R FOR DATA SCIENCE
© Copyright 2023. United States Data Science Institute. All Rights Reserved. usdsi.org
No modern organization can afford to ignore the importance of data
science. By leveraging the power of data science, many companies
have reached heights that were once out of reach. By properly
analyzing their data, these organizations can make more accurate,
data-driven decisions that not only improve their business
operations but also enhance their customer experience.
In its May 2023 ranking of the fastest-growing jobs, the World
Economic Forum placed data science jobs 5th on the list. The data
science market is also growing rapidly and is expected to reach a
market value of $501 billion by 2032, as reported by Precedence
Research.
With millions of terabytes of data generated every day, businesses
are looking for skilled data science professionals who can process
this enormous amount of data in their organization’s best interest.
This is the perfect time to step into this domain and build a
successful career.
Several programming languages and tools come in handy for data
science tasks, such as Python, R, Scala, Java, and SQL. This guide
covers one of the most popular languages used for data science: the
R programming language. If you are a beginner, this document
introduces the basics you need to get started with R.
What is R?
R is one of the most popular programming
languages among data science professionals, who
use it specifically for data analysis and statistical
computing. It was created by Ross Ihaka and
Robert Gentleman at the University of Auckland,
New Zealand, in the 1990s, and it has since gained
immense popularity in the data science community.
Its extensive libraries and robust statistical
capabilities have earned it a large user base.
How to Install R?
Installing R is straightforward. It is available
for all major platforms, including Windows, macOS,
and Linux. Before you can start using it, download
it from the Comprehensive R Archive Network (CRAN)
website (https://cran.r-project.org/) and install
it on your system. Make sure you download the
latest version so you can use the latest features.
The website also provides installation
instructions for each operating system.
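Once R is installed, a quick check from the R console can confirm that it runs. The sketch below is only an illustration and uses functions that ship with base R:

```r
# Print the version string of the R installation running this code.
print(R.version.string)

# Show platform and attached-package details, which is useful
# information to include when asking for help online.
print(sessionInfo())
```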
Getting started with RStudio
Now that you have downloaded and installed R on
your system, it is time to get started with
RStudio, a popular Integrated Development
Environment (IDE) that makes working with R more
user-friendly. Even though R can be used from
the command line, many data science
professionals prefer an IDE such as RStudio for
their work.
RStudio can be downloaded from its official
website,
https://www.rstudio.com/products/rstudio/download/.
After installing it, you will see a user-friendly
interface with panels for writing code, viewing
data, and visualizing results.
Basic concepts of R:
Let us now look at the basics of R and its
components. The first thing to understand is
that R is an object-oriented language: every
operation you perform in R works on objects.
These objects, the building blocks of R, are:
Variables
In R, you can assign values to variables using the assignment operator ‘<-’ or ‘=’. For example,
‘x <- 10’ assigns the value 10 to the variable ‘x’.
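As a minimal sketch of assignment, the lines below use both operators. The names ‘y’ and ‘total’ are illustrative and do not come from the original slides:

```r
# Assign with the arrow operator, the form most R code uses.
x <- 10

# '=' also performs assignment at the top level of a script.
y = 5

# Variables can then be combined in expressions.
total <- x + y
print(total)  # prints [1] 15
```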
Data Structures
Data structures are the nouns of programming in R: data items of different types are organized
into data structures. These data structures can take the form of:
• Vectors - one-dimensional arrays that can hold multiple values of the same data type
• Data frames - two-dimensional tables used to store data, with rows and columns, similar to
spreadsheets
• Lists - containers that can hold elements of different data types
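The three structures listed above can be sketched with base R alone. The names and values here are made up for illustration:

```r
# Vector: every element shares one type (numeric here).
ages <- c(25, 32, 47)

# Data frame: a table whose columns are equal-length vectors.
people <- data.frame(
  name = c("Ana", "Ben", "Cara"),
  age = ages
)

# List: elements may have different types and lengths.
record <- list(id = 1L, tags = c("a", "b"), active = TRUE)

print(length(ages))  # number of elements in the vector
print(dim(people))   # rows and columns of the data frame
str(record)          # compact view of the list's structure
```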
Functions
Another thing that makes R a popular programming language is its built-in functions, which perform
a wide range of tasks. For example, the ‘mean()’ function calculates the mean of a vector of
numbers.
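A short sketch of calling ‘mean()’ follows. The vector ‘scores’ is a hypothetical example, and the second call shows the ‘na.rm’ argument, which tells ‘mean()’ to skip missing values:

```r
# A numeric vector of illustrative values.
scores <- c(4, 8, 15, 16, 23, 42)

# mean() sums the values (108) and divides by the count (6).
avg <- mean(scores)
print(avg)  # prints [1] 18

# With a missing value, na.rm = TRUE drops it before averaging.
avg_no_na <- mean(c(1, NA, 3), na.rm = TRUE)
print(avg_no_na)  # prints [1] 2
```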
Packages
Packages in R are collections of functions, data, and documentation. By installing and loading R
packages, you can extend R’s functionality for specific data science tasks. The
‘install.packages()’ function installs packages, while the ‘library()’ function loads them. For
example, ‘install.packages("dplyr")’ installs the ‘dplyr’ package and ‘library(dplyr)’ loads it.
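One way to combine the two functions is to install a package only when it is missing and then load it. This is a sketch, not a required pattern: the choice of ‘dplyr’ and of the CRAN cloud mirror are assumptions for illustration, and the first run needs an internet connection.

```r
# Install 'dplyr' only if it is not already available.
# The 'repos' value is one public CRAN mirror; any mirror works.
if (!requireNamespace("dplyr", quietly = TRUE)) {
  install.packages("dplyr", repos = "https://cloud.r-project.org")
}

# Load the package so its functions are available in this session.
library(dplyr)
```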
Data Manipulation in R
Data manipulation is one of the most important parts of data analysis, and R offers several packages for
data manipulation tasks. ‘dplyr’ and ‘tidyr’ are two such packages; they make the tasks described below
easier:
Data Import
You need to import data before you can start working on it. R can work with various data formats,
such as CSV, Excel, and different databases. The ‘read.csv()’ function is commonly used for reading CSV
files.
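To keep the import step self-contained, the sketch below first writes a tiny CSV to a temporary file and then reads it back with ‘read.csv()’. The file contents and the name ‘csv_path’ are hypothetical; in practice you would pass the path of your own file.

```r
# Create a small CSV in a temporary location for the demonstration.
csv_path <- tempfile(fileext = ".csv")
writeLines(c("name,age", "Ana,25", "Ben,32", "Cara,47"), csv_path)

# read.csv() returns a data frame; the first line becomes the header.
people <- read.csv(csv_path)
print(people)
```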
Data Exploration
After the data has been imported, you can explore it using functions like ‘head()’,
‘tail()’, and ‘summary()’. These functions give you a quick overview of your dataset.
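The exploration functions named above can be tried on ‘mtcars’, a dataset that ships with R. Using it here is an assumption made for illustration, since the original slides do not name a dataset:

```r
# Load the built-in example dataset.
data(mtcars)

# First and last three rows of the data frame.
first_rows <- head(mtcars, 3)
last_rows <- tail(mtcars, 3)

# Minimum, quartiles, mean, and maximum of one numeric column.
mpg_summary <- summary(mtcars$mpg)

print(first_rows)
print(last_rows)
print(mpg_summary)
```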
Data Filtering
The ‘filter()’ function in ‘dplyr’ is used to subset data based on specific criteria. For example,
to keep only the rows where age is more than 30, pass the condition ‘age > 30’ to ‘filter()’.
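The age filter described above might look like the following sketch. It assumes the ‘dplyr’ package is installed, and the ‘people’ data frame is made up for illustration:

```r
library(dplyr)

# Hypothetical data with an 'age' column.
people <- data.frame(
  name = c("Ana", "Ben", "Cara", "Dev"),
  age = c(25, 32, 47, 30)
)

# Keep only the rows where age is strictly greater than 30.
over_30 <- filter(people, age > 30)
print(over_30)
```

Note that a row with age exactly 30 is excluded, because the condition uses ‘>’ rather than ‘>=’.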
Data Transformation
Data transformation refers to modifying variables or creating new ones. The ‘mutate()’ function
in ‘dplyr’ is commonly used for this.
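As a sketch of a transformation with ‘mutate()’, the code below adds two derived columns. It assumes ‘dplyr’ is installed; the column names and values are hypothetical:

```r
library(dplyr)

# Hypothetical measurements.
people <- data.frame(
  name = c("Ana", "Ben"),
  height_cm = c(160, 180),
  weight_kg = c(64, 81)
)

# Add a converted column, then a column computed from it.
# Columns created earlier in the same mutate() call can be reused.
people <- mutate(
  people,
  height_m = height_cm / 100,
  bmi = weight_kg / height_m^2
)
print(people)
```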
Data Aggregation
Data aggregation involves summarizing or grouping data to obtain insights.
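Grouping and summarizing can be sketched with the ‘group_by()’ and ‘summarise()’ functions from ‘dplyr’. The slides do not name these functions, so treat their use here, along with the made-up ‘sales’ data, as an assumption:

```r
library(dplyr)

# Hypothetical sales records.
sales <- data.frame(
  region = c("North", "South", "North", "South", "North"),
  amount = c(100, 250, 150, 50, 200)
)

# Group rows by region, then compute one summary row per group.
by_region <- sales %>%
  group_by(region) %>%
  summarise(
    total = sum(amount),
    average = mean(amount),
    orders = n()
  )
print(by_region)
```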
Data Visualization in R
Data visualization is at the core of any data science project. It refers to creating beautiful and interactive
visuals of the findings from data analysis, so that complex insights can be conveyed easily to
stakeholders. In R, packages like ‘ggplot2’ can be used for data visualization. With this package, the
following types of plots can be created:
Scatter Plot
It is used to visualize the relationship between two numerical variables.
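A scatter plot in ‘ggplot2’ could be sketched as follows. The example assumes ‘ggplot2’ is installed and uses the built-in ‘mtcars’ dataset, which the original slides do not mention:

```r
library(ggplot2)

# Map car weight to the x axis and fuel economy to the y axis,
# then draw one point per car.
scatter <- ggplot(mtcars, aes(x = wt, y = mpg)) +
  geom_point() +
  labs(x = "Weight (1000 lbs)", y = "Miles per gallon")

# Printing the plot object renders it on the active graphics device.
print(scatter)
```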
Histogram
Histograms are used to represent the distribution of a single variable.
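A histogram follows the same pattern with a different geom. This sketch assumes ‘ggplot2’ is installed; the ‘mtcars’ dataset and the bin width of 5 are illustrative choices:

```r
library(ggplot2)

# Only one variable is mapped; geom_histogram() counts how many
# values fall into each bin of width 5.
hist_plot <- ggplot(mtcars, aes(x = mpg)) +
  geom_histogram(binwidth = 5) +
  labs(x = "Miles per gallon", y = "Number of cars")

print(hist_plot)
```

Changing ‘binwidth’ trades detail for smoothness, so it is worth trying a few values on your own data.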
Bar Chart
It is suitable for visualizing categorical data.
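For categorical data, a bar chart can count how often each category occurs. The sketch assumes ‘ggplot2’ is installed and treats the cylinder count in the built-in ‘mtcars’ dataset as a category; both are illustrative assumptions:

```r
library(ggplot2)

# factor() turns the numeric cylinder count into categories;
# geom_bar() then draws one bar per category, sized by row count.
bar_plot <- ggplot(mtcars, aes(x = factor(cyl))) +
  geom_bar() +
  labs(x = "Cylinders", y = "Number of cars")

print(bar_plot)
```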
These are only a few examples of how you can use ggplot2 for data visualization. You can
explore it further and use your creativity to visualize your data in your own way. Since R and its
packages offer great flexibility, you can customize the visualizations to suit your findings.
Statistical Analysis in R
What distinguishes R from other programming languages is its statistical capabilities, which make it an
ideal choice for data scientists. With R, you can perform various statistical tests and analyses
to derive meaningful insights from your data. Some common statistical techniques in R include:
• T-tests - for comparing the means of two groups
• ANOVA - analysis of variance, for comparing the means of multiple groups
• Linear regression - for modeling relationships between variables
• Correlation - measures the strength and direction of the relationship between two numerical
variables
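Each technique in the list above has a counterpart in base R’s ‘stats’ package. The sketch below runs all four on the built-in ‘mtcars’ dataset; the dataset and the choice of variables are assumptions made for illustration:

```r
data(mtcars)

# T-test: compare mean mpg between automatic (am = 0)
# and manual (am = 1) cars.
t_result <- t.test(mpg ~ am, data = mtcars)

# ANOVA: compare mean mpg across the three cylinder groups.
anova_fit <- aov(mpg ~ factor(cyl), data = mtcars)

# Linear regression: model mpg as a linear function of weight.
lm_fit <- lm(mpg ~ wt, data = mtcars)

# Correlation: strength and direction of the weight/mpg relationship.
r <- cor(mtcars$wt, mtcars$mpg)

print(t_result)
print(summary(anova_fit))
print(coef(lm_fit))
print(r)
```

In this dataset heavier cars tend to have lower fuel economy, so both the regression slope for ‘wt’ and the correlation come out negative.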
Conclusion
These are some basics of R that can help you get
started with this programming language for data
science. Remember that R is a comprehensive
programming language that offers a large number of
functions and packages. Fully understanding its
functionality takes a lot of practice on different
types of data and projects.
The more you practice, the clearer the concepts will
become. So download it now, join online
communities, dive into video tutorials, enroll in a
data science certification, and master this incredible tool
for data science.
LOCATIONS
info@usdsi.org | www.usdsi.org
Arizona
1345 E. Chandler BLVD.,
Suite 111-D Phoenix,
AZ 85048,
info.az@usdsi.org
Connecticut
680 E Main Street
#699, Stamford, CT 06901
info.ct@usdsi.org
Illinois
1 East Erie St, Suite 525
Chicago, IL 60611
info.il@usdsi.org
Singapore
No 7 Temasek Boulevard #12-07
Suntec Tower One, Singapore 038987
info.sg@usdsi.org
United Kingdom
29 Whitmore Road, Whitnash
Leamington Spa, Warwickshire,
United Kingdom CV31 2JQ
info.uk@usdsi.org
About
The United States Data Science Institute
(USDSI®) is deemed a high-end and in-depth
technical certification provider for Data Science
Professionals and leads the global panorama in
Data Science Organizational Transformation,
Innovation, and Leadership. USDSI® researches,
designs, and certifies personnel who enter or
engage in various emerging Data Science
Majors.

More Related Content

Similar to GET STARTED WITH R FOR DATA SCIENCE

Coding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistanceCoding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistancephdAssistance1
 
Overview of tools for data analysis and visualisation (2021)
Overview of tools for data analysis and visualisation (2021)Overview of tools for data analysis and visualisation (2021)
Overview of tools for data analysis and visualisation (2021)Marié Roux
 
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...phdAssistance1
 
Overview data analyis and visualisation tools 2020
Overview data analyis and visualisation tools 2020Overview data analyis and visualisation tools 2020
Overview data analyis and visualisation tools 2020Marié Roux
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsFredReynolds2
 
Python vs. r for data science
Python vs. r for data sciencePython vs. r for data science
Python vs. r for data scienceHugo Shi
 
Big Data - Analytics with R
Big Data - Analytics with RBig Data - Analytics with R
Big Data - Analytics with RTechsparks
 
2019 DSA 105 Introduction to Data Science Week 4
2019 DSA 105 Introduction to Data Science Week 42019 DSA 105 Introduction to Data Science Week 4
2019 DSA 105 Introduction to Data Science Week 4Ferdin Joe John Joseph PhD
 
Big Data Analytics with R
Big Data Analytics with RBig Data Analytics with R
Big Data Analytics with RGreat Wide Open
 
The Study of the Large Scale Twitter on Machine Learning
The Study of the Large Scale Twitter on Machine LearningThe Study of the Large Scale Twitter on Machine Learning
The Study of the Large Scale Twitter on Machine LearningIRJET Journal
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Data science | What is Data science
Data science | What is Data scienceData science | What is Data science
Data science | What is Data scienceShilpaKrishna6
 
Key Skills Required for Data Engineering
Key Skills Required for Data EngineeringKey Skills Required for Data Engineering
Key Skills Required for Data EngineeringFibonalabs
 
The Best Programming Langauge for Data Science.pptx
The Best Programming Langauge for Data Science.pptxThe Best Programming Langauge for Data Science.pptx
The Best Programming Langauge for Data Science.pptxAvinash Sharma
 
Top Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practicesTop Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practicesSpringPeople
 

Similar to GET STARTED WITH R FOR DATA SCIENCE (20)

Coding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistanceCoding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - Phdassistance
 
Airline Data Analysis
Airline Data AnalysisAirline Data Analysis
Airline Data Analysis
 
Reason To learn & use r
Reason To learn & use rReason To learn & use r
Reason To learn & use r
 
Overview of tools for data analysis and visualisation (2021)
Overview of tools for data analysis and visualisation (2021)Overview of tools for data analysis and visualisation (2021)
Overview of tools for data analysis and visualisation (2021)
 
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
 
R for data analytics
R for data analyticsR for data analytics
R for data analytics
 
Overview data analyis and visualisation tools 2020
Overview data analyis and visualisation tools 2020Overview data analyis and visualisation tools 2020
Overview data analyis and visualisation tools 2020
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
 
Python vs. r for data science
Python vs. r for data sciencePython vs. r for data science
Python vs. r for data science
 
Big Data - Analytics with R
Big Data - Analytics with RBig Data - Analytics with R
Big Data - Analytics with R
 
Toolboxes for data scientists
Toolboxes for data scientistsToolboxes for data scientists
Toolboxes for data scientists
 
2019 DSA 105 Introduction to Data Science Week 4
2019 DSA 105 Introduction to Data Science Week 42019 DSA 105 Introduction to Data Science Week 4
2019 DSA 105 Introduction to Data Science Week 4
 
Big Data Analytics with R
Big Data Analytics with RBig Data Analytics with R
Big Data Analytics with R
 
The Study of the Large Scale Twitter on Machine Learning
The Study of the Large Scale Twitter on Machine LearningThe Study of the Large Scale Twitter on Machine Learning
The Study of the Large Scale Twitter on Machine Learning
 
Python para Manual de Ciência de Dados
Python para Manual de Ciência de DadosPython para Manual de Ciência de Dados
Python para Manual de Ciência de Dados
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Data science | What is Data science
Data science | What is Data scienceData science | What is Data science
Data science | What is Data science
 
Key Skills Required for Data Engineering
Key Skills Required for Data EngineeringKey Skills Required for Data Engineering
Key Skills Required for Data Engineering
 
The Best Programming Langauge for Data Science.pptx
The Best Programming Langauge for Data Science.pptxThe Best Programming Langauge for Data Science.pptx
The Best Programming Langauge for Data Science.pptx
 
Top Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practicesTop Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practices
 

More from USDSI

Comparing Data Science, Big Data, and Data Analytics.pdf
Comparing Data Science, Big Data, and Data Analytics.pdfComparing Data Science, Big Data, and Data Analytics.pdf
Comparing Data Science, Big Data, and Data Analytics.pdfUSDSI
 
Data Science Careers in 2024
Data Science Careers in 2024Data Science Careers in 2024
Data Science Careers in 2024USDSI
 
FROM DATA TO DOLLARS DISCOVER THE IMPACT OF DATA SCIENCE ON BUSINESS.pdf
FROM DATA TO DOLLARS DISCOVER THE IMPACT OF DATA SCIENCE ON BUSINESS.pdfFROM DATA TO DOLLARS DISCOVER THE IMPACT OF DATA SCIENCE ON BUSINESS.pdf
FROM DATA TO DOLLARS DISCOVER THE IMPACT OF DATA SCIENCE ON BUSINESS.pdfUSDSI
 
Factsheet: Data Science Careers in 2024
Factsheet: Data Science Careers in 2024Factsheet: Data Science Careers in 2024
Factsheet: Data Science Careers in 2024USDSI
 
The Limitless Possibilities in Data Science.pdf
The Limitless Possibilities in Data Science.pdfThe Limitless Possibilities in Data Science.pdf
The Limitless Possibilities in Data Science.pdfUSDSI
 
Master Data-Driven Decision-Making in 2024
Master Data-Driven Decision-Making in 2024Master Data-Driven Decision-Making in 2024
Master Data-Driven Decision-Making in 2024USDSI
 
IS GENERATIVE AI BENEFICIAL FOR DATA ENGINEER.pdf
IS GENERATIVE AI BENEFICIAL FOR DATA ENGINEER.pdfIS GENERATIVE AI BENEFICIAL FOR DATA ENGINEER.pdf
IS GENERATIVE AI BENEFICIAL FOR DATA ENGINEER.pdfUSDSI
 
TOP DATA SCIENCE TRENDS 2024-01.pdf
TOP DATA SCIENCE TRENDS 2024-01.pdfTOP DATA SCIENCE TRENDS 2024-01.pdf
TOP DATA SCIENCE TRENDS 2024-01.pdfUSDSI
 
Become a data science professional
Become a data science professionalBecome a data science professional
Become a data science professionalUSDSI
 
CERTIFIED SENIOR DATA SCIENTIST (CSDS™)
CERTIFIED SENIOR DATA SCIENTIST (CSDS™)CERTIFIED SENIOR DATA SCIENTIST (CSDS™)
CERTIFIED SENIOR DATA SCIENTIST (CSDS™)USDSI
 
Do You Know How Virtual Numbers Can Help You.pdf
Do You Know How Virtual Numbers Can Help You.pdfDo You Know How Virtual Numbers Can Help You.pdf
Do You Know How Virtual Numbers Can Help You.pdfUSDSI
 
Why is Data Science a Popular Career Choice.pdf
Why is Data Science a Popular Career Choice.pdfWhy is Data Science a Popular Career Choice.pdf
Why is Data Science a Popular Career Choice.pdfUSDSI
 
Case_Study_WhatsApp_Business_API_solution_for_NoBroker.pdf
Case_Study_WhatsApp_Business_API_solution_for_NoBroker.pdfCase_Study_WhatsApp_Business_API_solution_for_NoBroker.pdf
Case_Study_WhatsApp_Business_API_solution_for_NoBroker.pdfUSDSI
 
15 DATA SCIENCE TRENDS TO RULE IN 2023.pdf
15 DATA SCIENCE TRENDS TO RULE IN 2023.pdf15 DATA SCIENCE TRENDS TO RULE IN 2023.pdf
15 DATA SCIENCE TRENDS TO RULE IN 2023.pdfUSDSI
 
Automate Interaction With IVR To Boost Customer Satisfaction.pdf
Automate Interaction With IVR To Boost Customer Satisfaction.pdfAutomate Interaction With IVR To Boost Customer Satisfaction.pdf
Automate Interaction With IVR To Boost Customer Satisfaction.pdfUSDSI
 
Artificial Intelligence Career In 2023
Artificial Intelligence Career In 2023Artificial Intelligence Career In 2023
Artificial Intelligence Career In 2023USDSI
 
CERTIFIED ARTIFICIAL INTELLIGENCE ENGINEER - CAIE™
CERTIFIED ARTIFICIAL INTELLIGENCE ENGINEER - CAIE™CERTIFIED ARTIFICIAL INTELLIGENCE ENGINEER - CAIE™
CERTIFIED ARTIFICIAL INTELLIGENCE ENGINEER - CAIE™USDSI
 
INTERNATIONAL SCHOLARSHIP EXAM POLICY 2023.pdf
INTERNATIONAL SCHOLARSHIP EXAM POLICY 2023.pdfINTERNATIONAL SCHOLARSHIP EXAM POLICY 2023.pdf
INTERNATIONAL SCHOLARSHIP EXAM POLICY 2023.pdfUSDSI
 
6 Major libraries used for Data Science and Machine Learning applications in ...
6 Major libraries used for Data Science and Machine Learning applications in ...6 Major libraries used for Data Science and Machine Learning applications in ...
6 Major libraries used for Data Science and Machine Learning applications in ...USDSI
 
Why Should Businesses Set Up An IVR- Interactive Voice Response?
Why Should Businesses Set Up An IVR- Interactive Voice Response?Why Should Businesses Set Up An IVR- Interactive Voice Response?
Why Should Businesses Set Up An IVR- Interactive Voice Response?USDSI
 

More from USDSI (20)

Comparing Data Science, Big Data, and Data Analytics.pdf
Comparing Data Science, Big Data, and Data Analytics.pdfComparing Data Science, Big Data, and Data Analytics.pdf
Comparing Data Science, Big Data, and Data Analytics.pdf
 
Data Science Careers in 2024
Data Science Careers in 2024Data Science Careers in 2024
Data Science Careers in 2024
 
FROM DATA TO DOLLARS DISCOVER THE IMPACT OF DATA SCIENCE ON BUSINESS.pdf
FROM DATA TO DOLLARS DISCOVER THE IMPACT OF DATA SCIENCE ON BUSINESS.pdfFROM DATA TO DOLLARS DISCOVER THE IMPACT OF DATA SCIENCE ON BUSINESS.pdf
FROM DATA TO DOLLARS DISCOVER THE IMPACT OF DATA SCIENCE ON BUSINESS.pdf
 
Factsheet: Data Science Careers in 2024
Factsheet: Data Science Careers in 2024Factsheet: Data Science Careers in 2024
Factsheet: Data Science Careers in 2024
 
The Limitless Possibilities in Data Science.pdf
The Limitless Possibilities in Data Science.pdfThe Limitless Possibilities in Data Science.pdf
The Limitless Possibilities in Data Science.pdf
 
Master Data-Driven Decision-Making in 2024
Master Data-Driven Decision-Making in 2024Master Data-Driven Decision-Making in 2024
Master Data-Driven Decision-Making in 2024
 
IS GENERATIVE AI BENEFICIAL FOR DATA ENGINEER.pdf
IS GENERATIVE AI BENEFICIAL FOR DATA ENGINEER.pdfIS GENERATIVE AI BENEFICIAL FOR DATA ENGINEER.pdf
IS GENERATIVE AI BENEFICIAL FOR DATA ENGINEER.pdf
 
TOP DATA SCIENCE TRENDS 2024-01.pdf
TOP DATA SCIENCE TRENDS 2024-01.pdfTOP DATA SCIENCE TRENDS 2024-01.pdf
TOP DATA SCIENCE TRENDS 2024-01.pdf
 
Become a data science professional
Become a data science professionalBecome a data science professional
Become a data science professional
 
CERTIFIED SENIOR DATA SCIENTIST (CSDS™)
CERTIFIED SENIOR DATA SCIENTIST (CSDS™)CERTIFIED SENIOR DATA SCIENTIST (CSDS™)
CERTIFIED SENIOR DATA SCIENTIST (CSDS™)
 
Do You Know How Virtual Numbers Can Help You.pdf
Do You Know How Virtual Numbers Can Help You.pdfDo You Know How Virtual Numbers Can Help You.pdf
Do You Know How Virtual Numbers Can Help You.pdf
 
Why is Data Science a Popular Career Choice.pdf
Why is Data Science a Popular Career Choice.pdfWhy is Data Science a Popular Career Choice.pdf
Why is Data Science a Popular Career Choice.pdf
 
Case_Study_WhatsApp_Business_API_solution_for_NoBroker.pdf
Case_Study_WhatsApp_Business_API_solution_for_NoBroker.pdfCase_Study_WhatsApp_Business_API_solution_for_NoBroker.pdf
Case_Study_WhatsApp_Business_API_solution_for_NoBroker.pdf
 
15 DATA SCIENCE TRENDS TO RULE IN 2023.pdf
15 DATA SCIENCE TRENDS TO RULE IN 2023.pdf15 DATA SCIENCE TRENDS TO RULE IN 2023.pdf
15 DATA SCIENCE TRENDS TO RULE IN 2023.pdf
 
Automate Interaction With IVR To Boost Customer Satisfaction.pdf
Automate Interaction With IVR To Boost Customer Satisfaction.pdfAutomate Interaction With IVR To Boost Customer Satisfaction.pdf
Automate Interaction With IVR To Boost Customer Satisfaction.pdf
 
Artificial Intelligence Career In 2023
Artificial Intelligence Career In 2023Artificial Intelligence Career In 2023
Artificial Intelligence Career In 2023
 
CERTIFIED ARTIFICIAL INTELLIGENCE ENGINEER - CAIE™
CERTIFIED ARTIFICIAL INTELLIGENCE ENGINEER - CAIE™CERTIFIED ARTIFICIAL INTELLIGENCE ENGINEER - CAIE™
CERTIFIED ARTIFICIAL INTELLIGENCE ENGINEER - CAIE™
 
INTERNATIONAL SCHOLARSHIP EXAM POLICY 2023.pdf
INTERNATIONAL SCHOLARSHIP EXAM POLICY 2023.pdfINTERNATIONAL SCHOLARSHIP EXAM POLICY 2023.pdf
INTERNATIONAL SCHOLARSHIP EXAM POLICY 2023.pdf
 
6 Major libraries used for Data Science and Machine Learning applications in ...
6 Major libraries used for Data Science and Machine Learning applications in ...6 Major libraries used for Data Science and Machine Learning applications in ...
6 Major libraries used for Data Science and Machine Learning applications in ...
 
Why Should Businesses Set Up An IVR- Interactive Voice Response?
Why Should Businesses Set Up An IVR- Interactive Voice Response?Why Should Businesses Set Up An IVR- Interactive Voice Response?
Why Should Businesses Set Up An IVR- Interactive Voice Response?
 

Recently uploaded

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 

Recently uploaded (20)

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 

GET STARTED WITH R FOR DATA SCIENCE

  • 1. GET STARTED WITH R FOR DATA SCIENCE
    © Copyright 2023. United States Data Science Institute. All Rights Reserved. usdsi.org
  • 2. In modern-day business, no organization can afford to ignore the importance of data science. By leveraging the power of data science, many companies have reached heights that were once impossible. By properly analyzing their data, these organizations are able to make more accurate, data-driven decisions that not only improve their business operations but also enhance their customers' experience.

    A World Economic Forum ranking of the fastest-growing jobs, released in May 2023, placed data science jobs 5th on the list. The data science market is also growing rapidly and is expected to reach a market value of $501 billion by 2032, as reported by Precedence Research.

    With millions of terabytes of data generated every day, businesses are looking for skilled data science professionals who can process this enormous amount of data in their organization's best interest. This makes it a good time to step into the domain and build a successful career.

    Data science tasks can be performed with several programming languages and tools, such as Python, R, Scala, Java, and SQL. This guide covers one of the most popular languages used for data science: the R programming language. If you are a beginner, this document introduces what you need to know to get started with R.
  • 3. What is R?
    R is among the most popular programming languages that data science professionals use specifically for data analysis and statistical computing. It was created by Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand, in the 1990s. Since then, R has gained immense popularity in the data science community. Because the language offers extensive libraries and robust statistical capabilities, it has built up a huge user base since its inception.

    How to Install R?
    Installing R is very easy. It is available for all major platforms, including Windows, macOS, and Linux. Before you can start using it, you need to download it from the Comprehensive R Archive Network (CRAN) website (https://cran.r-project.org/) and install it on your system. Make sure you download the latest version so that you can use the latest features. The website also provides installation instructions for your operating system.

    Getting started with RStudio
    Now that you have downloaded and installed R on your system, it is time to get started with RStudio, a popular Integrated Development Environment (IDE) that makes working with R more user-friendly. Even though R can be used from the command line, many data science professionals prefer to work in an IDE such as RStudio. RStudio can be downloaded from its official website, https://www.rstudio.com/products/rstudio/download/. After installing it, you will see a user-friendly interface with panels for writing code, viewing data, and visualizing results.

    Basic concepts of R:
    Let us now get started with the basics of R and its components. The first thing to understand is that everything you work with in R is an object, so every operation you perform in R revolves around objects. These objects, also known as the building blocks of R, are:
  • 4. Variables
    In R, you can assign values to variables using the assignment operator <- or =. For example, x <- 10 assigns the value 10 to the variable ‘x’.

    Data Structures
    Data structures are the "nouns" of programming in R: data items of different types are organized into data structures. These data structures can take the form of:
    - Vectors: one-dimensional arrays that can hold multiple values of the same data type
    - Data frames: two-dimensional tables used to store data, with rows and columns, similar to spreadsheets
    - Lists: containers that can hold elements of different data types

    Functions
    Another thing that makes R a popular programming language is its built-in functions, which perform a wide range of tasks. For example, you can use the ‘mean()’ function to calculate the mean of a vector of numbers.
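The building blocks described on this slide can be sketched in a few lines of base R. This is a minimal illustration rather than code from the original slides; the names `scores`, `people`, and `record` and all of the values are made up for the example.

```r
# Assign a value to a variable with <- (the = operator also works)
x <- 10

# Vector: one-dimensional, every element has the same data type
scores <- c(82, 90, 77, 95)

# Data frame: a two-dimensional table with rows and named columns
people <- data.frame(
  name = c("Ana", "Ben", "Cara"),
  age  = c(28, 35, 42)
)

# List: a container whose elements may have different data types
record <- list(id = 1L, name = "Ana", scores = scores)

# Built-in function: mean() computes the average of a numeric vector
avg_score <- mean(scores)
print(avg_score)
```

Running `str(people)` or `str(record)` afterwards is a quick way to see how each structure is organized.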
  • 5. Packages
    Packages in R are collections of functions, data, and documentation. By installing and loading R packages, you can extend R's functionality for specific data science tasks. The ‘install.packages()’ function installs packages, while the ‘library()’ function loads them.

    Data Manipulation in R
    Data manipulation is one of the most important parts of data analysis, and R offers several packages for it. ‘dplyr’ and ‘tidyr’ are two such packages, and they make the tasks below easy to perform:

    Data Import
    You need to import the data before you can start working on it. R can work with various data formats, such as CSV and Excel files, as well as different databases. The ‘read.csv()’ function is commonly used for reading CSV files.

    Data Exploration
    After the data has been imported, you can explore it using functions like ‘head()’, ‘tail()’, and ‘summary()’. These functions give you a quick overview of your dataset.

    Data Filtering
    The ‘filter()’ function in ‘dplyr’ is used to subset data based on specific criteria. For example, to keep only the rows where age is more than 30, you filter on a condition such as age > 30.
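The steps on this slide (loading a package, importing a CSV file, exploring it, and filtering it) might look roughly like the sketch below. It assumes the ‘dplyr’ package is already installed; the temporary CSV file, its columns, and the name `over_30` are hypothetical and exist only so that the example runs on its own.

```r
# install.packages("dplyr")  # run once if dplyr is not installed yet
library(dplyr)

# Write a small made-up CSV to a temporary file so the example is self-contained
csv_path <- tempfile(fileext = ".csv")
writeLines(c(
  "name,age,city",
  "Ana,28,Phoenix",
  "Ben,35,Chicago",
  "Cara,42,Stamford",
  "Dev,31,Chicago"
), csv_path)

# Data import: read the CSV file into a data frame
people <- read.csv(csv_path)

# Data exploration: first rows, last rows, and per-column summaries
head(people, 2)
tail(people, 2)
summary(people)

# Data filtering: keep only the rows where age is more than 30
over_30 <- filter(people, age > 30)
print(over_30)
```

In real work, `csv_path` would simply be the path to your own file, for example a CSV exported from a spreadsheet.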
  • 6. Data Transformation
    Data transformation refers to the modification of existing variables or the creation of new ones. The ‘mutate()’ function in ‘dplyr’ is commonly used for this.

    Data Aggregation
    Data aggregation involves summarizing or grouping data to obtain insights.

    Data Visualization in R
    Data visualization is at the core of any data science project. It refers to creating beautiful and interactive visuals of the findings from the data analysis, so that complex insights can be conveyed easily to stakeholders. In R, packages like ‘ggplot2’ can be used for data visualization. With this package, the following types of plots can be created:

    Scatter Plot
    It is used to visualize the relationship between two numerical variables.
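A rough sketch of transformation, aggregation, and a scatter plot follows. It assumes ‘dplyr’ and ‘ggplot2’ are installed. The slide does not name an aggregation function, so the use of `group_by()` with `summarise()` is an assumption, and the `sales` data frame and its columns are invented for illustration.

```r
library(dplyr)
library(ggplot2)

# Made-up sales data for illustration
sales <- data.frame(
  region = c("East", "East", "West", "West", "West"),
  units  = c(10, 15, 8, 12, 20),
  price  = c(2.5, 2.5, 3.0, 3.0, 3.0)
)

# Data transformation: mutate() creates a new column from existing ones
sales <- mutate(sales, revenue = units * price)

# Data aggregation: group the rows by region, then summarise each group
by_region <- sales %>%
  group_by(region) %>%
  summarise(
    total_revenue = sum(revenue),
    mean_units    = mean(units)
  )
print(by_region)

# Scatter plot: relationship between two numerical variables
scatter <- ggplot(sales, aes(x = units, y = revenue)) +
  geom_point() +
  labs(title = "Revenue vs. units sold", x = "Units", y = "Revenue")

# Call print(scatter), or type scatter in the console, to draw the plot
```

Keeping the plot in a variable makes it easy to add further layers, such as a trend line, before drawing it.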
  • 7. Histogram
    Histograms are used to represent the distribution of a single variable.

    Bar Chart
    It is suitable for visualizing categorical data.

    These are just a few examples of how you can use ggplot2 for data visualization. You can explore it further and use your creativity to visualize your data in your own way. Since R and its packages offer great flexibility, you can customize the visualizations to suit your findings.

    Statistical Analysis in R
    What distinguishes R from other programming languages is its statistical capabilities, which make it an ideal choice for data scientists. With R, you can perform various statistical tests and analyses to derive meaningful insights from your data. Some common statistical functions in R include:
    - T-tests: for comparing the means of two groups
    - ANOVA: analysis of variance tests for comparing multiple groups
    - Linear regression: for modeling relationships between variables
    - Correlation: measures the strength and direction of the relationship between two numerical variables
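The plot types and statistical methods listed on this slide can be tried out on `mtcars`, a small dataset that ships with R. The sketch below assumes ‘ggplot2’ is installed. The choice of dataset, columns, and variable names is for illustration only and does not come from the original slides.

```r
library(ggplot2)

# Histogram: distribution of a single numerical variable (miles per gallon)
mpg_hist <- ggplot(mtcars, aes(x = mpg)) +
  geom_histogram(bins = 10) +
  labs(title = "Distribution of mpg", x = "Miles per gallon", y = "Count")

# Bar chart: counts per category (number of cylinders treated as a category)
cyl_bar <- ggplot(mtcars, aes(x = factor(cyl))) +
  geom_bar() +
  labs(title = "Cars by cylinder count", x = "Cylinders", y = "Count")

# Call print(mpg_hist) or print(cyl_bar) to draw the plots

# T-test: compare mean mpg between automatic (am = 0) and manual (am = 1) cars
t_result <- t.test(mpg ~ am, data = mtcars)

# ANOVA: compare mean mpg across the cylinder groups
anova_fit <- aov(mpg ~ factor(cyl), data = mtcars)

# Linear regression: model mpg as a function of car weight
lm_fit <- lm(mpg ~ wt, data = mtcars)

# Correlation: strength and direction of the mpg-weight relationship
cor_value <- cor(mtcars$mpg, mtcars$wt)

print(t_result)
print(summary(anova_fit))
print(coef(lm_fit))
print(cor_value)
```

In this dataset, heavier cars tend to have lower mpg, so both the regression slope for `wt` and the correlation come out negative.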
  • 8. Conclusion
    These are some of the basics of R that can help you get started with this programming language for data science. Remember that R is a comprehensive programming language that offers a large number of functions and packages. It takes a lot of practice, on many different types of data and projects, to fully understand all its functionalities.

    The more you practice, the clearer the concepts will become. So download R now, join online communities, work through video tutorials, enroll in a data science certification, and master this incredible tool for data science.
  • 9. LOCATIONS
    info@usdsi.org | www.usdsi.org
    - Arizona: 1345 E. Chandler BLVD., Suite 111-D, Phoenix, AZ 85048, info.az@usdsi.org
    - Connecticut: 680 E Main Street #699, Stamford, CT 06901, info.ct@usdsi.org
    - Illinois: 1 East Erie St, Suite 525, Chicago, IL 60611, info.il@usdsi.org
    - Singapore: No 7 Temasek Boulevard #12-07, Suntec Tower One, Singapore 038987, info.sg@usdsi.org
    - United Kingdom: 29 Whitmore Road, Whitnash, Leamington Spa, Warwickshire, United Kingdom CV31 2JQ, info.uk@usdsi.org

    About
    The United States Data Science Institute (USDSI®) is deemed a high-end and in-depth technical certification provider for Data Science Professionals and leads the global panorama in Data Science Organizational Transformation, Innovation, and Leadership. USDSI® researches, designs, and certifies personnel who enter or engage in various emerging Data Science Majors.

    GROW BIG WITH DATA SCIENTIST EXPERTISE VIA CERTIFICATIONS