SlideShare a Scribd company logo
1 of 19
INTRODUCTION TO DATA SCIENCE
DATA SCIENCE TOOLS
DATA SCIENCE TOOLS FOR DATA MANIPULATION
DATA SCIENCE TOOLS FOR EDA
www.edureka.co
DATA SCIENCE TOOLS FOR DATA STORAGE
DATA SCIENCE TOOLS FOR DATA VISUALIZATION
INTRODUCTION TO DATA SCIENCE
www.edureka.co
Introduction To Data Science
www.edureka.co
Data Science is the process of extracting knowledge and insights from data by
using scientific methods.
Data Science involves collecting, analysing and modelling data to solve real-world problems. It is
used for fraud detection, disease detection, recommendation engines and so on.
DATA SCIENCE TOOLS
www.edureka.co
Data Science Tools come with pre-defined functions, algorithms, and a very user-friendly GUI.
Hence, they can be used to build convoluted Machine Learning models without the use of a
programming language.
DATA SCIENCE TOOLS
Data Science
Data Collection
Exploratory Data Analysis
Data Modelling
Data Visualization
www.edureka.co
DATA SCIENCE TOOLS FOR DATA STORAGE
www.edureka.co
Scale and manage massive
amounts of data
Hadoop Distributed File System
(HDFS) for data storage
Integrate with , Hadoop
MapReduce, Hadoop YARN
www.edureka.co
Data processing via Apache
Hadoop and Spark clusters
The default storage system is
Windows Azure Blob
Provides Microsoft R Server
www.edureka.co
DATA SCIENCE TOOLS FOR EDA
www.edureka.co
Data Integration tool based on
Extract Transform Load architecture
Extract Transform Load tool
to manage data
Support for distributed processing, grid
computing, adaptive load balancing.
www.edureka.co
Data processing, building
Machine Learning models, etc
Support for integrating Hadoop
framework
Generate predictive models
through automated modelling
www.edureka.co
DATA SCIENCE TOOLS FOR DATA MODELLING
www.edureka.co
Easy to apply Machine Learning
Supports GLM, Boosting ML models
& Deep Learning
Support to integrate with Apache
Hadoop
www.edureka.co
Supports parallel programming to
perform data analysis, data
modelling, etc
Tests and trains Machine Learning
models at lightning fast speed
Makes model evaluation much
easier.
www.edureka.co
DATA SCIENCE TOOLS FOR VISUALIZATION
www.edureka.co
Can visualize massive data sets to find
correlations and patterns
Create customized reports and
dashboards
Support to integrate with Apache
Hadoop
www.edureka.co
Clear & concise visualizations
Supports in-memory data
processing
Automatically generates data
associations
www.edureka.co
www.edureka.co

More Related Content

What's hot

What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
Simplilearn
 

What's hot (20)

Introduction to Data Science and Analytics
Introduction to Data Science and AnalyticsIntroduction to Data Science and Analytics
Introduction to Data Science and Analytics
 
Data science and Artificial Intelligence
Data science and Artificial IntelligenceData science and Artificial Intelligence
Data science and Artificial Intelligence
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
Data analytics
Data analyticsData analytics
Data analytics
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data Science
 
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
 
Machine Learning in R
Machine Learning in RMachine Learning in R
Machine Learning in R
 
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
 
Introduction to Python for Data Science
Introduction to Python for Data ScienceIntroduction to Python for Data Science
Introduction to Python for Data Science
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Machine Learning in Healthcare Diagnostics
Machine Learning in Healthcare DiagnosticsMachine Learning in Healthcare Diagnostics
Machine Learning in Healthcare Diagnostics
 
Data science
Data scienceData science
Data science
 
Data science
Data scienceData science
Data science
 
Data science
Data scienceData science
Data science
 
Career in Data Science
Career in Data ScienceCareer in Data Science
Career in Data Science
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analytics
 
Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analytics
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
PPT on Data Science Using Python
PPT on Data Science Using PythonPPT on Data Science Using Python
PPT on Data Science Using Python
 

Similar to Top 8 Data Science Tools | Open Source Tools for Data Scientists | Edureka

Untitled document.pdf
Untitled document.pdfUntitled document.pdf
Untitled document.pdf
MuhammadTahiriqbal13
 
DATA MINING DC Presentation.pptx
DATA MINING DC Presentation.pptxDATA MINING DC Presentation.pptx
DATA MINING DC Presentation.pptx
SaravanaD2
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data Analytics
Ravi Teja
 
Making an impact with data science
Making an impact  with data scienceMaking an impact  with data science
Making an impact with data science
Jordan Engbers
 

Similar to Top 8 Data Science Tools | Open Source Tools for Data Scientists | Edureka (20)

Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptx
 
Untitled document.pdf
Untitled document.pdfUntitled document.pdf
Untitled document.pdf
 
Best Data Science course in Delhi HTS institute
Best Data Science course in Delhi HTS instituteBest Data Science course in Delhi HTS institute
Best Data Science course in Delhi HTS institute
 
DATA MINING DC Presentation.pptx
DATA MINING DC Presentation.pptxDATA MINING DC Presentation.pptx
DATA MINING DC Presentation.pptx
 
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
 
Coding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistanceCoding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - Phdassistance
 
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
 
OVERVIEW OF DATA SCIENCE (3).pdf
OVERVIEW OF DATA SCIENCE (3).pdfOVERVIEW OF DATA SCIENCE (3).pdf
OVERVIEW OF DATA SCIENCE (3).pdf
 
Ch1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxCh1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptx
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data Analytics
 
data science
data sciencedata science
data science
 
data science
data sciencedata science
data science
 
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptx
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptxUnlocking Insights_ The Power of Data Analytics in the Modern World.pptx
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptx
 
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGargColloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
 
Fair by design
Fair by designFair by design
Fair by design
 
Information & data science (1) converted
Information & data science (1) convertedInformation & data science (1) converted
Information & data science (1) converted
 
25 Best Data Mining Tools in 2022
25 Best Data Mining Tools in 202225 Best Data Mining Tools in 2022
25 Best Data Mining Tools in 2022
 
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
 
Making an impact with data science
Making an impact  with data scienceMaking an impact  with data science
Making an impact with data science
 

More from Edureka!

More from Edureka! (20)

What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | Edureka
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | Edureka
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 

Top 8 Data Science Tools | Open Source Tools for Data Scientists | Edureka

  • 1.
  • 2. INTRODUCTION TO DATA SCIENCE DATA SCIENCE TOOLS DATA SCIENCE TOOLS FOR DATA MANIPULATION DATA SCIENCE TOOLS FOR EDA www.edureka.co DATA SCIENCE TOOLS FOR DATA STORAGE DATA SCIENCE TOOLS FOR DATA VISUALIZATION
  • 3. INTRODUCTION TO DATA SCIENCE www.edureka.co
  • 4. Introduction To Data Science www.edureka.co Data Science is the process of extracting knowledge and insights from data by using scientific methods. Data Science involves collecting, analysing and modelling data to solve real-world problems. It is used for fraud detection, disease detection, recommendation engines and so on.
  • 6. Data Science Tools come with pre-defined functions, algorithms, and a very user-friendly GUI. Hence, they can be used to build convoluted Machine Learning models without the use of a programming language. DATA SCIENCE TOOLS Data Science Data Collection Exploratory Data Analysis Data Modelling Data Visualization www.edureka.co
  • 7. DATA SCIENCE TOOLS FOR DATA STORAGE www.edureka.co
  • 8. Scale and manage massive amounts of data Hadoop Distributed File System (HDFS) for data storage Integrate with , Hadoop MapReduce, Hadoop YARN www.edureka.co
  • 9. Data processing via Apache Hadoop and Spark clusters The default storage system is Windows Azure Blob Provides Microsoft R Server www.edureka.co
  • 10. DATA SCIENCE TOOLS FOR EDA www.edureka.co
  • 11. Data Integration tool based on Extract Transform Load architecture Extract Transform Load tool to manage data Support for distributed processing, grid computing, adaptive load balancing. www.edureka.co
  • 12. Data processing, building Machine Learning models, etc Support for integrating Hadoop framework Generate predictive models through automated modelling www.edureka.co
  • 13. DATA SCIENCE TOOLS FOR DATA MODELLING www.edureka.co
  • 14. Easy to apply Machine Learning Supports GLM, Boosting ML models & Deep Learning Support to integrate with Apache Hadoop www.edureka.co
  • 15. Supports parallel programming to perform data analysis, data modelling, etc Tests and trains Machine Learning models at lightning fast speed Makes model evaluation much easier. www.edureka.co
  • 16. DATA SCIENCE TOOLS FOR VISUALIZATION www.edureka.co
  • 17. Can visualize massive data sets to find correlations and patterns Create customized reports and dashboards Support to integrate with Apache Hadoop www.edureka.co
  • 18. Clear & concise visualizations Supports in-memory data processing Automatically generates data associations www.edureka.co