Advanced Analytics Tools
What is Data Analytics?
Data analytics is the science of analyzing raw data in order to draw
conclusions from it.
Data Analytics Pipeline
Collect → Refine → Store → Analyze → Present
Data Acquisition
How To Collect Data!
★ REST API
★ From end users
★ Web scrape
★ Email and cloud storage
★ Client’s server
Tool for Data Acquisition: Requests
A library for making HTTP requests in Python.
Key Features:
• Keep-alive & Connection Pooling
• Sessions with Cookie Persistence
Tool for Data Acquisition: BeautifulSoup
A library for parsing HTML and XML documents.
Key Features:
• Multiple parser support (e.g., lxml, html5lib, and others)
• Creates a parse tree that is easy to navigate
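To make the parse-and-navigate idea concrete, here is a minimal sketch that pulls links out of a small HTML snippet. It deliberately uses Python's standard-library `html.parser` as a stand-in, not BeautifulSoup itself, so it runs without extra installs. The `LinkCollector` class, the `extract_links` helper and the sample HTML are all hypothetical names made up for illustration.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (href, text) pairs for every <a href=...> tag in a document."""

    def __init__(self):
        super().__init__()
        self.links = []      # finished (href, text) pairs
        self._href = None    # href of the <a> currently open, if any
        self._text = []      # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Parse an HTML string and return its links as (href, text) tuples."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links


SAMPLE_HTML = """
<html><body>
  <h1>Reports</h1>
  <ul>
    <li><a href="/q1.csv">Q1 data</a></li>
    <li><a href="/q2.csv">Q2 <b>data</b></a></li>
  </ul>
</body></html>
"""

print(extract_links(SAMPLE_HTML))
```

A dedicated parsing library typically replaces the hand-written event handlers with direct tree navigation, which is the convenience the slide refers to.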
Tools for Data Acquisition: Flask, Flask-RESTPlus and Swagger UI
Flask is a micro web framework written in Python.
Flask-RESTPlus is an extension for Flask that adds support for
quickly building REST APIs. It automatically generates API
documentation, which can be viewed in Swagger UI.
Data Pre-Processing and Storage
How To Clean Data!
★ Remove duplicates
★ Validate
★ Handle missing data
★ Fix errors
★ Filter outliers
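The cleaning steps listed above can be sketched with the standard library alone. This is an illustrative outline, not a prescribed method: the sample CSV, the column names and the helper functions are invented for the example. Two choices are assumptions: rows with missing or malformed values are dropped rather than imputed, and outliers are filtered with the common 1.5 × IQR rule.

```python
import csv
import io
import statistics

# Hypothetical raw data: one duplicate row, one missing amount,
# one malformed amount and one extreme value.
RAW_CSV = """id,city,amount
1,Dhaka,120
2,Pune,95
2,Pune,95
3,Delhi,
4,Dhaka,110
5,Pune,abc
6,Delhi,105
7,Dhaka,5000
"""


def load_rows(text):
    """Read CSV text into a list of dicts (one per row)."""
    return list(csv.DictReader(io.StringIO(text)))


def drop_duplicates(rows):
    """Keep the first occurrence of each identical row."""
    seen, unique = set(), []
    for row in rows:
        key = tuple(row.items())
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def parse_amounts(rows):
    """Validate 'amount': keep only rows whose value parses as a number."""
    valid = []
    for row in rows:
        try:
            valid.append({**row, "amount": float(row["amount"])})
        except ValueError:
            continue  # missing ('') or malformed ('abc') value is dropped
    return valid


def filter_outliers(rows, k=1.5):
    """Drop rows whose amount lies outside [Q1 - k*IQR, Q3 + k*IQR]."""
    amounts = [r["amount"] for r in rows]
    q1, _, q3 = statistics.quantiles(amounts, n=4, method="inclusive")
    low, high = q1 - k * (q3 - q1), q3 + k * (q3 - q1)
    return [r for r in rows if low <= r["amount"] <= high]


rows = load_rows(RAW_CSV)
clean = filter_outliers(parse_amounts(drop_duplicates(rows)))
print(len(rows), "raw rows ->", len(clean), "clean rows")
```

Whether to drop or impute missing values depends on the dataset. Dropping keeps the sketch short but discards information.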
How To Store Data!
★ RDBMS
★ ORM
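As a rough illustration of the RDBMS route, the sketch below stores a few cleaned records in SQLite through Python's built-in `sqlite3` module and reads back an aggregate. The `sales` table, its columns and the sample records are assumptions made for the example. An ORM would wrap the same operations in mapped classes instead of raw SQL strings.

```python
import sqlite3

# In-memory database so the sketch leaves no file behind.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales ("
    "id INTEGER PRIMARY KEY, city TEXT NOT NULL, amount REAL NOT NULL)"
)

# Hypothetical cleaned records: (id, city, amount).
records = [
    (1, "Dhaka", 120.0),
    (2, "Pune", 95.0),
    (4, "Dhaka", 110.0),
    (6, "Delhi", 105.0),
]

# The connection as a context manager commits on success
# and rolls back if an exception is raised.
with conn:
    conn.executemany(
        "INSERT INTO sales (id, city, amount) VALUES (?, ?, ?)", records
    )

# Parameterized inserts above avoid SQL injection; the query below
# aggregates per city inside the database.
totals = conn.execute(
    "SELECT city, SUM(amount) FROM sales GROUP BY city ORDER BY city"
).fetchall()
print(totals)
conn.close()
```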
Tool for Data Manipulation: Pandas
A library for data manipulation and analysis.
Key Features:
• Loading data into in-memory data objects from different file formats.
• Data alignment and integrated handling of missing data.
Tool for Database Operations: SQLAlchemy
SQLAlchemy is a popular SQL toolkit and Object Relational Mapper.
Key Features:
• Function-based query construction.
• Multiple database support (e.g., SQLite, PostgreSQL, MySQL, Oracle,
MS-SQL, Firebird, Sybase and others).
Data Analysis
How To Analyze Data!
★ Five-number summary
(minimum, 1st quartile, median,
3rd quartile, maximum)
★ Average
★ Standard Deviation
★ Ratio
★ Interval
★ Trends
★ Aggregate and group by
★ Regression
★ Clustering
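Several of the measures listed above can be computed with Python's `statistics` module. The sketch below is illustrative only: the sample numbers and helper names are made up, the quartiles use the "inclusive" method (other quartile conventions give slightly different values), and the regression is an ordinary least-squares line fitted by hand.

```python
import statistics
from collections import defaultdict


def five_number_summary(data):
    """Return (minimum, Q1, median, Q3, maximum) for a list of numbers."""
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    return min(data), q1, median, q3, max(data)


def group_mean(pairs):
    """Aggregate (key, value) pairs into {key: mean of values}."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return {key: statistics.mean(vals) for key, vals in groups.items()}


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    mx, my = statistics.mean(xs), statistics.mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, my - slope * mx


# Hypothetical sample data.
values = [4, 8, 15, 16, 23, 42]
sales = [("Dhaka", 120.0), ("Pune", 95.0), ("Dhaka", 110.0), ("Delhi", 105.0)]

print("five-number summary:", five_number_summary(values))
print("average:", statistics.mean(values))
print("sample standard deviation:", statistics.stdev(values))
print("mean amount by city:", group_mean(sales))
print("trend (slope, intercept):",
      fit_line([1, 2, 3, 4, 5], [2.1, 3.9, 6.2, 7.8, 10.0]))
```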
Tools for Data Analysis: R and RStudio
R is a popular programming language for data analysis. RStudio
is an IDE for R.
[Figure: original classes compared with clusters found by k-means]
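The k-means clustering referred to on this slide can be sketched in a few lines. The slide presents it with R, while the version below is plain Python for consistency with the other examples here. It follows the usual assign-then-update loop (Lloyd's algorithm). The farthest-point initialization is an assumption chosen to keep the result deterministic. Random or k-means++ seeding is more common in practice. The sample points are made up.

```python
import math


def kmeans(points, k, iterations=100):
    """Cluster equal-length numeric tuples into k groups.

    Returns (labels, centroids), where labels[i] is the cluster index
    assigned to points[i].
    """
    # Deterministic farthest-point initialization (an illustrative choice):
    # start from the first point, then repeatedly add the point that is
    # farthest from all centroids chosen so far.
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centroids))
        )

    labels = None
    for _ in range(iterations):
        # Assignment step: nearest centroid by Euclidean distance.
        new_labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centroids are too
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = tuple(
                    sum(axis) / len(members) for axis in zip(*members)
                )
    return labels, centroids


# Two well-separated hypothetical groups of 2-D points.
points = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.1),
          (8.0, 8.0), (8.2, 7.9), (7.9, 8.3)]
labels, centroids = kmeans(points, k=2)
print(labels)
print(centroids)
```

Comparing the returned labels with known classes, as the figure does, shows how well the clusters recover the original grouping.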
Data Presentation
How To Visualize Data!
★ Charts
○ Line
○ Bar
○ Pie
○ Scatter
★ Graphs
★ Maps
○ Bubble
○ Polygon
★ Dashboards
Tools for Data Visualization: Plotly and Matplotlib
Plotly: An interactive graphing library.
Matplotlib: A plotting library for Python.
Tool for Data Visualization: Apache Superset
A Data Visualization and Data Exploration Platform.
Key Features:
• It supports any data source that SQLAlchemy supports, and
allows querying with SQL.
• Superset allows sharing dashboards.
• It comes with security features such as Authentication,
User Management and Roles.
Other Notable Tools
★ Excel
★ Tableau Public
★ Grafana
★ Microsoft Power BI
★ And many more . . .
Challenges
1. Poor quality data
2. Data privacy and security
3. Weak infrastructure
4. Data from multiple sources
5. Scaling data analysis
Links
Flask: https://flask.palletsprojects.com/en/2.0.x/
Flask-RESTPlus: https://flask-restplus.readthedocs.io/en/stable/
SQLAlchemy: https://www.sqlalchemy.org/
R Programming Language: https://www.r-project.org/
k-means Clustering: https://en.wikipedia.org/wiki/K-means_clustering
Plotly: https://plotly.com/
Superset Docs: https://superset.apache.org/docs/intro
Presentation GitHub Link: https://github.com/saadrumon/basic-data-analytics-tools-presentation.git
Thank You
Any Questions?
“Information is the oil of the 21st century, and analytics is the combustion engine.”
- Peter Sondergaard

