Data Science Software and Tools
Introduction
Data science is a very hot trend right now. You may have read that many data science projects
are under way, and you may have heard that a great deal of data is now available. You have
probably also heard about data mining, text mining, social network analysis, and Big Data. So,
what are they?
Data mining usually deals with numerical data, while text mining usually deals with textual data.
Data mining usually follows the CRISP-DM process to identify new patterns and knowledge.
Extracted from: https://en.m.wikipedia.org/wiki/Cross-industry_standard_process_for_data_mining
Social network analysis is used to analyze social networks such as Facebook and Weibo. It models
them as graphs of nodes and edges, which can be directed or undirected. Big Data refers to data
that is too large to process on a single computer, so we usually use a parallel or distributed
system such as Hadoop to process it.
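The difference between a directed and an undirected graph can be sketched in a few lines of Python. This is only an illustration using the standard library; the user names and the "follows" relationships are hypothetical, and real social network analysis would normally rely on a dedicated graph library.

```python
from collections import defaultdict


def build_graph(edges, directed=False):
    """Return an adjacency mapping {node: set of directly connected nodes}."""
    graph = defaultdict(set)
    for source, target in edges:
        graph[source].add(target)
        # Touch the target so every node appears as a key,
        # even when it has no outgoing edges.
        graph[target]
        if not directed:
            # In an undirected graph the edge works in both directions.
            graph[target].add(source)
    return dict(graph)


def degree(graph, node):
    """Number of nodes reachable in one step (out-degree if directed)."""
    return len(graph[node])


# Hypothetical relationships between three users.
follows = [("ann", "bob"), ("bob", "cat"), ("ann", "cat")]

friendship = build_graph(follows, directed=False)  # mutual connections
following = build_graph(follows, directed=True)    # one-way "follows"

print(degree(friendship, "cat"))  # 2
print(degree(following, "cat"))   # 0
```

In the undirected version "cat" is connected to both other users, while in the directed version "cat" follows nobody, which is why the two degrees differ.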
The following are some very popular data science tools.
R Programming
R is well known for statistics, visualization, and statistical learning. It is open source and
widely used in the research community. R has many extension packages that allow data scientists and
statisticians to do data mining, text analysis, data visualization, and Big Data analysis. R is the
programming language, and RStudio is an integrated development environment for it. Packages such as
Rattle and ggplot2 support predictive analysis and data visualization.
Extracted from: http://rprogramming.net/download-and-install-rstudio/
Python Programming
Python is a high-level language with object-oriented features. This means that developers can
write scripts and programs that model real-world objects. Python has many libraries: SciPy and
NumPy for statistics, scikit-learn for predictive analytics, and Matplotlib for data visualization.
While R was initially developed for statistics, Python is a general-purpose programming language
that can also be used to build full applications.
Extracted from: https://deparkes.co.uk/2012/10/29/winpython-a-matlab-alternative/
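As a rough illustration of the object-oriented point, the sketch below models a customer as an object and computes two descriptive statistics from it. The `Customer` class, its fields, and the figures are hypothetical. To keep it runnable without extra installs, it uses Python's built-in `statistics` module instead of SciPy or NumPy.

```python
import statistics
from dataclasses import dataclass


@dataclass
class Customer:
    """A real-world entity modelled as an object (hypothetical fields)."""
    name: str
    monthly_spend: list

    def average_spend(self):
        # Arithmetic mean of the monthly figures.
        return statistics.mean(self.monthly_spend)

    def spend_spread(self):
        # Sample standard deviation of the monthly figures.
        return statistics.stdev(self.monthly_spend)


customer = Customer("Alice", [120.0, 80.0, 100.0])
print(customer.average_spend())  # 100.0
print(customer.spend_spread())   # 20.0
```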
Excel
Excel can do a lot of data analysis, including data visualization with charts. It can be used to
conduct statistical analysis, including descriptive statistics. Inferential statistics and regression
are available through Excel's data analysis add-ins. You can extend Excel with VBA. For prediction
using machine learning, you will usually have to use R or Python.
Extracted from: https://chrome.google.com/webstore/detail/excel-online/iljnkagajgfdmfnnidjijobijlfjfgnb
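In its simplest form, the regression mentioned above is a least-squares line fit. The following Python sketch shows that calculation for readers who want to see what happens behind such a tool. It is not Excel's implementation, and the sample data is made up for illustration.

```python
def linear_regression(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length samples with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of cross deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if sxx == 0:
        raise ValueError("x values must not all be identical")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: advertising spend vs. units sold.
spend = [1, 2, 3, 4]
units = [3, 5, 7, 9]
slope, intercept = linear_regression(spend, units)
print(slope, intercept)  # 2.0 1.0
```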
SAS
SAS is for advanced analytics, data management, and social media analytics, and offers an
advanced, robust data science suite. SAS is well known for business intelligence analysis on large
data sets. SAS has topped the Gartner Magic Quadrant list and integrates with Python, R, and
Hadoop. SAS Enterprise Guide offers a GUI for SAS programming in data analysis, and SAS
Enterprise Miner offers predictive analytics.
Extracted from: http://support.sas.com/documentation/cdl/en/gridref/63292/HTML/default/viewer.htm#p0l098ovcs9xtbn1f4cv3eexy0d0.htm
SPSS
SPSS is a competitor of SAS. It is widely regarded as an industry standard for data mining and
offers advanced analytics. SPSS Statistics offers advanced statistical analysis, including
descriptive statistics, inferential statistics, regression, and data visualization. SPSS Modeler
offers predictive analytics with statistical learning and machine learning algorithms, as well as
text analysis plugins for analyzing textual data.
Extracted from: https://developer.ibm.com/predictiveanalytics/2015/05/14/solving-business-problems-ibm-spss-modeler-churn-model/
DSTK – Data Science Toolkit 3
DSTK (Data Science Toolkit 3) is a set of data and text mining software tools that follows the
CRISP-DM model. DSTK offers data understanding through statistical and text analysis, data
preparation through normalization and text processing, and modeling and evaluation with machine
learning and statistical learning algorithms. ChartPlotter is a new addition to the DSTK tools; it
lets you build interactive Plotly JS charts and dashboards in minutes, using only mouse clicks.
DSTK Studio allows you to build recommendation and prediction data products.
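To make the data preparation step more concrete, here is a minimal Python sketch of one common form of normalization, min-max scaling. Choosing min-max scaling is an assumption made for illustration only; the text does not say which normalization DSTK applies, and this is not DSTK code.

```python
def min_max_normalize(values):
    """Rescale values linearly so the smallest maps to 0.0 and the largest to 1.0."""
    low, high = min(values), max(values)
    if high == low:
        # All values are identical: there is no spread to rescale.
        return [0.0 for _ in values]
    return [(v - low) / (high - low) for v in values]


# Hypothetical column of customer ages.
ages = [20, 30, 40, 60]
print(min_max_normalize(ages))  # [0.0, 0.25, 0.5, 1.0]
```

Putting every column on the same 0 to 1 scale keeps a feature with large raw values from dominating distance-based models.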
DSTK 3 consists of DSTK Engine, DSTK ScriptWriter, DSTK Studio, DSTK Text Explorer,
and DSTK ChartPlotter. DSTK Engine is a simplified R, focused on data mining. DSTK ScriptWriter
offers a GUI for writing scripts for DSTK Engine. DSTK Studio offers an SPSS Statistics-like GUI
for data mining, DSTK Text Explorer offers a GUI for text mining, and DSTK ChartPlotter offers a
GUI for data visualization. DSTK does not have the level of advanced analytics found in SPSS and
SAS, but it is more cost-effective and is aimed at smaller companies that need analytics but do
not need advanced analytics.
DSTK Engine and DSTK ScriptWriter are free of charge and have been uploaded to
SourceForge.net under the GNU GPL license. DSTK Studio, Text Explorer, and ChartPlotter,
however, require a small fee of $59 USD to help support us. A demo version of DSTK
Studio and DSTK Text Explorer is included in the DSTK 3 package, but you can only use them 10
times.
Visit: http://dstk.tech for more information.
Text Link Analysis using DSTK Text Explorer. You do not have to read all the customers’ opinions.
DSTK Studio
DSTK ScriptWriter. It is free, but you need to write scripts.
References
https://developer.ibm.com/predictiveanalytics/2015/05/14/solving-business-problems-ibm-spss-modeler-churn-model/
http://support.sas.com/documentation/cdl/en/gridref/63292/HTML/default/viewer.htm#p0l098ovcs9xtbn1f4cv3eexy0d0.htm
https://chrome.google.com/webstore/detail/excel-online/iljnkagajgfdmfnnidjijobijlfjfgnb
https://deparkes.co.uk/2012/10/29/winpython-a-matlab-alternative/
http://rprogramming.net/download-and-install-rstudio/
https://en.m.wikipedia.org/wiki/Cross-industry_standard_process_for_data_mining
https://www.upwork.com/hiring/data/big-data-science-tools/
