SlideShare a Scribd company logo
Data Skills for Digital Era
The Top Data Skills You Need To Get Hired
Main Focus
Data Science Business Intelligence
Big Data Data Engineering
Mohtat@ut.ac.ir 2
Data Science
Math & Statistics
Computer Science
Subject Matter Expertise
Mohtat@ut.ac.ir 4
Data Science is an
interdisciplinary field about
processes and systems to
extract knowledge or
insights from data, which is
a continuation of some of
the data analysis fields such
as statistics, data mining,
and predictive analytics,
similar to Knowledge
Discovery in
Databases (KDD).
Types of Analytics
Descriptive
Diagnostic
Prescriptive
Predictive
Mohtat@ut.ac.ir 6
Data
Science
Technology
Application
Mohtat@ut.ac.ir 8
Critical Skills for Data Scientists
Python
R
SQL
Data Mining Tools
Knime , ReapidMiner,
IBM SPSS Modeler
Excel
BI Tools
Tableau, Power BI, Qlik
Mohtat@ut.ac.ir 9
Top Python Libraries in Data Science
TensorFlow
“TensorFlow is an open source
software library for numerical
computation using data flow graphs.
PyTorch
“PyTorch is a Python package that
provides Deep neural networks built
on a tape-based autograd system
Numpy
“NumPy is the fundamental
package needed for scientific
computing with Python.
Scikit-Learn
“scikit-learn is a Python module for
machine learning built on NumPy,
SciPy and matplotlib.
Keras
“Keras is a high-level neural networks
API, written in Python and capable of
running on top of TensorFlow, CNTK,
or Theano.
Scipy
“SciPy is open-source software for
mathematics, science, and engineering.
Pandas
“pandas is a Python package providing
fast, flexible, and expressive data
structures designed to make working
with "relational" or "labeled" data both
easy and intuitive
Matplotlib
“Matplotlib is a Python 2D plotting
library which produces publication-
quality figures in a variety of
hardcopy formats and interactive
environments across platforms.
Scrapy
“Scrapy is a fast high-level web crawling
and web scraping framework, used to
crawl websites and extract structured
data from their pages.
Mohtat@ut.ac.ir 10
Top Skills every Data Scientist needs to Master
TensorFlow Keras Hadoop Spark Hive Java Matlab
Mohtat@ut.ac.ir 11
Most Essential Skills for Data Scientists
Complex Problem Solving
Team Working
Emotional Intelligence
Creativity
Critical Thinking
Negotiation
Mohtat@ut.ac.ir 12
Applied Data Science with Python
Michigan University(Coursera)
Basic Data Visualization Machine Learning Text Mining SNA
Applied Text Mining in Python
Introduction to Data Science in Python
Applied Plotting, Charting & Data
Representation in Python
Applied Machine Learning in Python Applied Social Network Analysis in
Python
Mohtat@ut.ac.ir 13LOGO HERE
Data Science Books
14
Business Intelligence
encompasses a wide variety of
tools, applications and
methodologies that enable
organizations to collect data
from internal systems and
external sources; prepare it for
analysis; develop and run
queries against that data; and
create reports, dashboards and
data visualizations to make the
analytical results available to
corporate decision-makers, as
well as operational workers.
BI
Mohtat@ut.ac.ir 17
Business Skills
Link to Business Strategy
Define Priorities
Define BI Vision
Lead Organization / BPR
Analytics Skills
Data Mining
Social BI
IT Skills
Infrastructure
Build Technology
Data Integration & Quality
Business
Intelligence
Architect
Simple is what it needs in business
Top Business Intelligence Skills
SQL
Data Warehousing
Data Analysis
Tableau
ETL
23%
85%
28%
41%
65%
Mohtat@ut.ac.ir 20
28%
Top Business Intelligence Skills
Business Analyst
Oracle
SQL Server BI
Business Process
Data Modeling 17%
85%
19%
21%
22%
Mohtat@ut.ac.ir 21
19%
Top Business Intelligence Tools
Tableau Power BI Qlik
Your Choice Is Clear
Mohtat@ut.ac.ir 22
Big Data
Volume
Terabyte
Distribute
Big Table
Velocity
Real-time
Stream Processing
Variety
Structured
Unstructured
Text, Image, Video
Mohtat@ut.ac.ir 27
Big data is a term used to
refer to data sets that are
too large or complex for
traditional data-processing
application software to
adequately deal with.
It’s what organizations do
with the data that matters.
Big data can be analyzed
for insights that lead to
better decisions and
strategic business moves.
Hadoop Ecosystem
3 Types of Big Data Jobs
1 2
3
Big Data Developer
Big Data Administration
Big Data Analytics
Mohtat@ut.ac.ir 29
Top Big Data Programming Languages
Not only Hadoop, many other big data analysis tools like Storm,
Spark, and Kafka are written in Java and run on the JVM
Java
Python is a simple, open-source, general-purpose language.
Hence, it is easy to learn Python for anyone.. With its rich set
of utilities and libraries and easy-to-use features, it works
wonder for big data processing and analysis.
Python
Scala is a rival of Java and Python in the world of Data Science
and becoming more and more popular due to extensive use of
Apache Spark in Big data Hadoop industry.
Scala
Mohtat@ut.ac.ir 30
Pathway to Success
Success
Apache Hadoop
Apache Spark
Start
NoSQL Database
Data Analytics
Data Visualization
Mohtat@ut.ac.ir 31
Big Data Companies & Vendors
Cloudera, Inc. is a US-based
software company that
provides a software platform
for data engineering, data
warehousing, machine
learning and analytics that
runs in the cloud or on
premises
Cloudera
MapR is a business software
company headquartered in
Santa Clara, California. MapR
provides access to a variety of
data sources from a single
computer cluster, including big
data workloads
MapR
Hortonworks is a data software
company based in Santa Clara,
California that develops,
supports, and provides expertise
on a set of open-source software
designed to manage data and
processing for things such as IOT,
single view of X, and advanced
analytics and machine learning
Hortonworks
34
‫داده‬‫کالن‬ ‫زیرساخت‬ ‫اجرا‬ ‫و‬ ‫نصب‬
Mohtat@ut.ac.ir
35
‫داده‬‫کالن‬ ‫زیرساخت‬ ‫اجرا‬ ‫و‬ ‫نصب‬
Mohtat@ut.ac.ir
Big Data Specialization
Michigan University(Coursera)
Introduction to Big Data
Big Data Modeling and
Management Systems
Big Data Integration and Processing
Machine Learning With Big Data
Graph Analytics for Big Data
Mohtat@ut.ac.ir 36LOGO HERE
Apache Spark
Berkeley University
Mohtat@ut.ac.ir 37LOGO HERE
Big Data Book
38
Data Scientist VS Data Engineer
Mohtat@ut.ac.ir 40
Dolor sit ametis
Data Engineering
Data Scientist
Data Pipelines
Visualization & Storytelling
Programming
Modeling & Advance Analytics
Math & Statistics
System Implementation
How To Become A Data Engineer
Linux
NoSQL & SQL
Python / Java / Scala
Agile Development
Data Ingestion
Processing Frameworks
Mohtat@ut.ac.ir 42
Best Data Processing Frameworks
MapReduce is a programming model
and an associated implementation for
processing and generating big data
sets with a parallel, distributed
algorithm on a cluster
Apache Spark is an open-
source distributed
general-purpose cluster-
computing framework.
Apache Storm is a free
and open source
distributed realtime
computation system.
The core of Apache Flink
is a distributed streaming
dataflow engine written in
Java and Scala
43
Cassandra
Best NoSQL Database
Mohtat@ut.ac.ir 44
Data Ingestion Tools
Apache Kafka
SSIS & ODI
Apache NiFi
Logstash
Mohtat@ut.ac.ir 45
Mohtat@ut.ac.ir
https://www.linkedin.com/in/mohtat
https://www.t.me/DataAnalysis
Contact Us
Thank You

More Related Content

What's hot

Data Mining and Business Intelligence Tools
Data Mining and Business Intelligence ToolsData Mining and Business Intelligence Tools
Data Mining and Business Intelligence Tools
Motaz Saad
 
Accelerating Insight - Smart Data Lake Customer Success Stories
Accelerating Insight - Smart Data Lake Customer Success StoriesAccelerating Insight - Smart Data Lake Customer Success Stories
Accelerating Insight - Smart Data Lake Customer Success Stories
Cambridge Semantics
 
Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)
Caserta
 
Big data course | big data training | big data classes
Big data course | big data training | big data classesBig data course | big data training | big data classes
Big data course | big data training | big data classes
NaviWalker
 
Evaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics PlatformsEvaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics Platforms
Teradata Aster
 
Big data and Predictive Analytics By : Professor Lili Saghafi
Big data and Predictive Analytics By : Professor Lili SaghafiBig data and Predictive Analytics By : Professor Lili Saghafi
Big data and Predictive Analytics By : Professor Lili Saghafi
Professor Lili Saghafi
 
Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...
Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...
Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...
Edureka!
 
Brochure_Big-Data_Offerings
Brochure_Big-Data_OfferingsBrochure_Big-Data_Offerings
Brochure_Big-Data_Offerings
Anisha Lamba
 
From Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital StrategyFrom Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital Strategy
Cambridge Semantics
 
dsl & bigdata
dsl & bigdatadsl & bigdata
dsl & bigdata
Andzhey Arshavskiy
 
Unit i big data introduction
Unit  i big data introductionUnit  i big data introduction
Unit i big data introduction
SujaMaryD
 
Datascienceindia article
Datascienceindia articleDatascienceindia article
Datascienceindia article
HimanshuPise1
 
Future of Data - Big Data
Future of Data - Big DataFuture of Data - Big Data
Future of Data - Big Data
shankar_radhakrishnan
 
Data analytics & its Trends
Data analytics & its TrendsData analytics & its Trends
Data analytics & its Trends
Dr.K.Sreenivas Rao
 
Exploring Big Data Analytics Tools
Exploring Big Data Analytics ToolsExploring Big Data Analytics Tools
Exploring Big Data Analytics Tools
Multisoft Virtual Academy
 
Bigdata
BigdataBigdata
Ehr challenges [bigdata]
Ehr challenges [bigdata]Ehr challenges [bigdata]
Ehr challenges [bigdata]
Nesma Almoazamy
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
Nasrin Hussain
 
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
Experfy
 
The Year of the Graph
The Year of the GraphThe Year of the Graph
The Year of the Graph
Cambridge Semantics
 

What's hot (20)

Data Mining and Business Intelligence Tools
Data Mining and Business Intelligence ToolsData Mining and Business Intelligence Tools
Data Mining and Business Intelligence Tools
 
Accelerating Insight - Smart Data Lake Customer Success Stories
Accelerating Insight - Smart Data Lake Customer Success StoriesAccelerating Insight - Smart Data Lake Customer Success Stories
Accelerating Insight - Smart Data Lake Customer Success Stories
 
Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)
 
Big data course | big data training | big data classes
Big data course | big data training | big data classesBig data course | big data training | big data classes
Big data course | big data training | big data classes
 
Evaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics PlatformsEvaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics Platforms
 
Big data and Predictive Analytics By : Professor Lili Saghafi
Big data and Predictive Analytics By : Professor Lili SaghafiBig data and Predictive Analytics By : Professor Lili Saghafi
Big data and Predictive Analytics By : Professor Lili Saghafi
 
Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...
Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...
Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...
 
Brochure_Big-Data_Offerings
Brochure_Big-Data_OfferingsBrochure_Big-Data_Offerings
Brochure_Big-Data_Offerings
 
From Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital StrategyFrom Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital Strategy
 
dsl & bigdata
dsl & bigdatadsl & bigdata
dsl & bigdata
 
Unit i big data introduction
Unit  i big data introductionUnit  i big data introduction
Unit i big data introduction
 
Datascienceindia article
Datascienceindia articleDatascienceindia article
Datascienceindia article
 
Future of Data - Big Data
Future of Data - Big DataFuture of Data - Big Data
Future of Data - Big Data
 
Data analytics & its Trends
Data analytics & its TrendsData analytics & its Trends
Data analytics & its Trends
 
Exploring Big Data Analytics Tools
Exploring Big Data Analytics ToolsExploring Big Data Analytics Tools
Exploring Big Data Analytics Tools
 
Bigdata
BigdataBigdata
Bigdata
 
Ehr challenges [bigdata]
Ehr challenges [bigdata]Ehr challenges [bigdata]
Ehr challenges [bigdata]
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
 
The Year of the Graph
The Year of the GraphThe Year of the Graph
The Year of the Graph
 

Similar to Data Skills for Digital Era-مهارت های داده ای

Data Skills for Digital Era
Data Skills for Digital EraData Skills for Digital Era
Data Skills for Digital Era
Mohamadreza Mohtat
 
Python para Manual de Ciência de Dados
Python para Manual de Ciência de DadosPython para Manual de Ciência de Dados
Python para Manual de Ciência de Dados
Rafael Oliveira Bitcoin
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
Denodo
 
Ch1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxCh1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptx
AbderrahmanABID2
 
PYTHON FOR DATA SCIENCE- EXPLAINED IN 6 EASY STEPS
PYTHON FOR DATA SCIENCE- EXPLAINED IN 6 EASY STEPSPYTHON FOR DATA SCIENCE- EXPLAINED IN 6 EASY STEPS
PYTHON FOR DATA SCIENCE- EXPLAINED IN 6 EASY STEPS
USDSI
 
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
phdAssistance1
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
Denodo
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptx
NagarajanG35
 
Coding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistanceCoding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - Phdassistance
phdAssistance1
 
Advanced Analytics and Artificial Intelligence - Transforming Your Business T...
Advanced Analytics and Artificial Intelligence - Transforming Your Business T...Advanced Analytics and Artificial Intelligence - Transforming Your Business T...
Advanced Analytics and Artificial Intelligence - Transforming Your Business T...
David J Rosenthal
 
Bhadale group of companies our technology ecosystem
Bhadale group of companies our technology ecosystemBhadale group of companies our technology ecosystem
Bhadale group of companies our technology ecosystem
Vijayananda Mohire
 
12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf
12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf
12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf
CIOWomenMagazine
 
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Simplilearn
 
BDW Chicago 2016 - John K. Thompson, GM for Advanced Analytics Dell Statisti...
BDW Chicago 2016 - John K. Thompson, GM for Advanced Analytics  Dell Statisti...BDW Chicago 2016 - John K. Thompson, GM for Advanced Analytics  Dell Statisti...
BDW Chicago 2016 - John K. Thompson, GM for Advanced Analytics Dell Statisti...
Big Data Week
 
Data Analytics in your IoT Solution Fukiat Julnual, Technical Evangelist, Mic...
Data Analytics in your IoT SolutionFukiat Julnual, Technical Evangelist, Mic...Data Analytics in your IoT SolutionFukiat Julnual, Technical Evangelist, Mic...
Data Analytics in your IoT Solution Fukiat Julnual, Technical Evangelist, Mic...
BAINIDA
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
Sreedhar Chowdam
 
A Data Fabric for All Things Intelligent
A Data Fabric for All Things IntelligentA Data Fabric for All Things Intelligent
A Data Fabric for All Things Intelligent
Denodo
 
Data Mining Tools for Your Business | Dotechtalk
Data Mining Tools for Your Business | DotechtalkData Mining Tools for Your Business | Dotechtalk
Data Mining Tools for Your Business | Dotechtalk
DOTECHTALK
 
Data science presentation
Data science presentationData science presentation
Data science presentation
MSDEVMTL
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
FredReynolds2
 

Similar to Data Skills for Digital Era-مهارت های داده ای (20)

Data Skills for Digital Era
Data Skills for Digital EraData Skills for Digital Era
Data Skills for Digital Era
 
Python para Manual de Ciência de Dados
Python para Manual de Ciência de DadosPython para Manual de Ciência de Dados
Python para Manual de Ciência de Dados
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
 
Ch1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxCh1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptx
 
PYTHON FOR DATA SCIENCE- EXPLAINED IN 6 EASY STEPS
PYTHON FOR DATA SCIENCE- EXPLAINED IN 6 EASY STEPSPYTHON FOR DATA SCIENCE- EXPLAINED IN 6 EASY STEPS
PYTHON FOR DATA SCIENCE- EXPLAINED IN 6 EASY STEPS
 
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptx
 
Coding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistanceCoding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - Phdassistance
 
Advanced Analytics and Artificial Intelligence - Transforming Your Business T...
Advanced Analytics and Artificial Intelligence - Transforming Your Business T...Advanced Analytics and Artificial Intelligence - Transforming Your Business T...
Advanced Analytics and Artificial Intelligence - Transforming Your Business T...
 
Bhadale group of companies our technology ecosystem
Bhadale group of companies our technology ecosystemBhadale group of companies our technology ecosystem
Bhadale group of companies our technology ecosystem
 
12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf
12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf
12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf
 
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
 
BDW Chicago 2016 - John K. Thompson, GM for Advanced Analytics Dell Statisti...
BDW Chicago 2016 - John K. Thompson, GM for Advanced Analytics  Dell Statisti...BDW Chicago 2016 - John K. Thompson, GM for Advanced Analytics  Dell Statisti...
BDW Chicago 2016 - John K. Thompson, GM for Advanced Analytics Dell Statisti...
 
Data Analytics in your IoT Solution Fukiat Julnual, Technical Evangelist, Mic...
Data Analytics in your IoT SolutionFukiat Julnual, Technical Evangelist, Mic...Data Analytics in your IoT SolutionFukiat Julnual, Technical Evangelist, Mic...
Data Analytics in your IoT Solution Fukiat Julnual, Technical Evangelist, Mic...
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
A Data Fabric for All Things Intelligent
A Data Fabric for All Things IntelligentA Data Fabric for All Things Intelligent
A Data Fabric for All Things Intelligent
 
Data Mining Tools for Your Business | Dotechtalk
Data Mining Tools for Your Business | DotechtalkData Mining Tools for Your Business | Dotechtalk
Data Mining Tools for Your Business | Dotechtalk
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
 

More from Hosseinieh Ershad Public Library

تجربه مشتریان داده محور
تجربه مشتریان داده محورتجربه مشتریان داده محور
تجربه مشتریان داده محور
Hosseinieh Ershad Public Library
 
محصول داده محور
محصول داده محورمحصول داده محور
محصول داده محور
Hosseinieh Ershad Public Library
 
محصول داده محور
محصول داده محورمحصول داده محور
محصول داده محور
Hosseinieh Ershad Public Library
 
مباشرت داده: نقشی نوین فراتر از تخصص
مباشرت داده: نقشی نوین فراتر از تخصصمباشرت داده: نقشی نوین فراتر از تخصص
مباشرت داده: نقشی نوین فراتر از تخصص
Hosseinieh Ershad Public Library
 
از مباشرتِ داده‌ها تا حکمرانیِ داده‌ها
از مباشرتِ داده‌ها تا حکمرانیِ داده‌هااز مباشرتِ داده‌ها تا حکمرانیِ داده‌ها
از مباشرتِ داده‌ها تا حکمرانیِ داده‌ها
Hosseinieh Ershad Public Library
 
فرهنگِ داده‌محور در سازمان
 فرهنگِ داده‌محور در سازمان فرهنگِ داده‌محور در سازمان
فرهنگِ داده‌محور در سازمان
Hosseinieh Ershad Public Library
 
مهارت های داده ای
مهارت های داده ایمهارت های داده ای
مهارت های داده ای
Hosseinieh Ershad Public Library
 
همسویی داده با اهداف سازمانی
همسویی داده با اهداف سازمانیهمسویی داده با اهداف سازمانی
همسویی داده با اهداف سازمانی
Hosseinieh Ershad Public Library
 
Business Data Alignment-همراستاییِ داده‌ها با اهداف سازمانی
Business Data Alignment-همراستاییِ داده‌ها با اهداف سازمانیBusiness Data Alignment-همراستاییِ داده‌ها با اهداف سازمانی
Business Data Alignment-همراستاییِ داده‌ها با اهداف سازمانی
Hosseinieh Ershad Public Library
 
Data driven m arketing and design-بازاریابی داده محور و تأثیر طراحی داده محور
Data driven m arketing and design-بازاریابی داده محور و تأثیر طراحی داده محورData driven m arketing and design-بازاریابی داده محور و تأثیر طراحی داده محور
Data driven m arketing and design-بازاریابی داده محور و تأثیر طراحی داده محور
Hosseinieh Ershad Public Library
 
Data driven design-طراحی داده محور
Data driven design-طراحی داده محورData driven design-طراحی داده محور
Data driven design-طراحی داده محور
Hosseinieh Ershad Public Library
 
استارتاپ + داده
استارتاپ + دادهاستارتاپ + داده
استارتاپ + داده
Hosseinieh Ershad Public Library
 
Data driven innovation
Data driven innovationData driven innovation
Data driven innovation
Hosseinieh Ershad Public Library
 
چارچوب سیاستی داده حکومتی باز در حوزه علم و فناوری
چارچوب سیاستی داده حکومتی باز در حوزه علم و فناوری چارچوب سیاستی داده حکومتی باز در حوزه علم و فناوری
چارچوب سیاستی داده حکومتی باز در حوزه علم و فناوری
Hosseinieh Ershad Public Library
 
Data Strategy
Data StrategyData Strategy
استراتژی داده
استراتژی دادهاستراتژی داده
استراتژی داده
Hosseinieh Ershad Public Library
 
مديريت زنجيره تأمين رویکرد داده محور
مديريت زنجيره تأمين رویکرد داده محورمديريت زنجيره تأمين رویکرد داده محور
مديريت زنجيره تأمين رویکرد داده محور
Hosseinieh Ershad Public Library
 
زنجیره تامین داده محور و انقلاب صنعتی چهارم
زنجیره تامین داده محور و انقلاب صنعتی چهارمزنجیره تامین داده محور و انقلاب صنعتی چهارم
زنجیره تامین داده محور و انقلاب صنعتی چهارم
Hosseinieh Ershad Public Library
 
Data driven industery-صنعت داده محور
Data driven industery-صنعت داده محورData driven industery-صنعت داده محور
Data driven industery-صنعت داده محور
Hosseinieh Ershad Public Library
 
صنعت داده محور
صنعت داده محورصنعت داده محور
صنعت داده محور
Hosseinieh Ershad Public Library
 

More from Hosseinieh Ershad Public Library (20)

تجربه مشتریان داده محور
تجربه مشتریان داده محورتجربه مشتریان داده محور
تجربه مشتریان داده محور
 
محصول داده محور
محصول داده محورمحصول داده محور
محصول داده محور
 
محصول داده محور
محصول داده محورمحصول داده محور
محصول داده محور
 
مباشرت داده: نقشی نوین فراتر از تخصص
مباشرت داده: نقشی نوین فراتر از تخصصمباشرت داده: نقشی نوین فراتر از تخصص
مباشرت داده: نقشی نوین فراتر از تخصص
 
از مباشرتِ داده‌ها تا حکمرانیِ داده‌ها
از مباشرتِ داده‌ها تا حکمرانیِ داده‌هااز مباشرتِ داده‌ها تا حکمرانیِ داده‌ها
از مباشرتِ داده‌ها تا حکمرانیِ داده‌ها
 
فرهنگِ داده‌محور در سازمان
 فرهنگِ داده‌محور در سازمان فرهنگِ داده‌محور در سازمان
فرهنگِ داده‌محور در سازمان
 
مهارت های داده ای
مهارت های داده ایمهارت های داده ای
مهارت های داده ای
 
همسویی داده با اهداف سازمانی
همسویی داده با اهداف سازمانیهمسویی داده با اهداف سازمانی
همسویی داده با اهداف سازمانی
 
Business Data Alignment-همراستاییِ داده‌ها با اهداف سازمانی
Business Data Alignment-همراستاییِ داده‌ها با اهداف سازمانیBusiness Data Alignment-همراستاییِ داده‌ها با اهداف سازمانی
Business Data Alignment-همراستاییِ داده‌ها با اهداف سازمانی
 
Data driven m arketing and design-بازاریابی داده محور و تأثیر طراحی داده محور
Data driven m arketing and design-بازاریابی داده محور و تأثیر طراحی داده محورData driven m arketing and design-بازاریابی داده محور و تأثیر طراحی داده محور
Data driven m arketing and design-بازاریابی داده محور و تأثیر طراحی داده محور
 
Data driven design-طراحی داده محور
Data driven design-طراحی داده محورData driven design-طراحی داده محور
Data driven design-طراحی داده محور
 
استارتاپ + داده
استارتاپ + دادهاستارتاپ + داده
استارتاپ + داده
 
Data driven innovation
Data driven innovationData driven innovation
Data driven innovation
 
چارچوب سیاستی داده حکومتی باز در حوزه علم و فناوری
چارچوب سیاستی داده حکومتی باز در حوزه علم و فناوری چارچوب سیاستی داده حکومتی باز در حوزه علم و فناوری
چارچوب سیاستی داده حکومتی باز در حوزه علم و فناوری
 
Data Strategy
Data StrategyData Strategy
Data Strategy
 
استراتژی داده
استراتژی دادهاستراتژی داده
استراتژی داده
 
مديريت زنجيره تأمين رویکرد داده محور
مديريت زنجيره تأمين رویکرد داده محورمديريت زنجيره تأمين رویکرد داده محور
مديريت زنجيره تأمين رویکرد داده محور
 
زنجیره تامین داده محور و انقلاب صنعتی چهارم
زنجیره تامین داده محور و انقلاب صنعتی چهارمزنجیره تامین داده محور و انقلاب صنعتی چهارم
زنجیره تامین داده محور و انقلاب صنعتی چهارم
 
Data driven industery-صنعت داده محور
Data driven industery-صنعت داده محورData driven industery-صنعت داده محور
Data driven industery-صنعت داده محور
 
صنعت داده محور
صنعت داده محورصنعت داده محور
صنعت داده محور
 

Recently uploaded

STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
74nqk8xf
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
mzpolocfi
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
Sachin Paul
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 

Recently uploaded (20)

STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 

Data Skills for Digital Era-مهارت های داده ای

  • 1. Data Skills for Digital Era The Top Data Skills You Need To Get Hired
  • 2. Main Focus Data Science Business Intelligence Big Data Data Engineering Mohtat@ut.ac.ir 2
  • 3.
  • 4. Data Science Math & Statistics Computer Science Subject Matter Expertise Mohtat@ut.ac.ir 4 Data Science is an interdisciplinary field about processes and systems to extract knowledge or insights from data, which is a continuation of some of the data analysis fields such as statistics, data mining, and predictive analytics, similar to Knowledge Discovery in Databases (KDD).
  • 7. Critical Skills for Data Scientists Python R SQL Data Mining Tools Knime , ReapidMiner, IBM SPSS Modeler Excel BI Tools Tableau, Power BI, Qlik Mohtat@ut.ac.ir 9
  • 8. Top Python Libraries in Data Science TensorFlow “TensorFlow is an open source software library for numerical computation using data flow graphs. PyTorch “PyTorch is a Python package that provides Deep neural networks built on a tape-based autograd system Numpy “NumPy is the fundamental package needed for scientific computing with Python. Scikit-Learn “scikit-learn is a Python module for machine learning built on NumPy, SciPy and matplotlib. Keras “Keras is a high-level neural networks API, written in Python and capable of running on top of TensorFlow, CNTK, or Theano. Scipy “SciPy is open-source software for mathematics, science, and engineering. Pandas “pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with "relational" or "labeled" data both easy and intuitive Matplotlib “Matplotlib is a Python 2D plotting library which produces publication- quality figures in a variety of hardcopy formats and interactive environments across platforms. Scrapy “Scrapy is a fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. Mohtat@ut.ac.ir 10
  • 9. Top Skills every Data Scientist needs to Master TensorFlow Keras Hadoop Spark Hive Java Matlab Mohtat@ut.ac.ir 11
  • 10. Most Essential Skills for Data Scientists Complex Problem Solving Team Working Emotional Intelligence Creativity Critical Thinking Negotiation Mohtat@ut.ac.ir 12
  • 11. Applied Data Science with Python Michigan University(Coursera) Basic Data Visualization Machine Learning Text Mining SNA Applied Text Mining in Python Introduction to Data Science in Python Applied Plotting, Charting & Data Representation in Python Applied Machine Learning in Python Applied Social Network Analysis in Python Mohtat@ut.ac.ir 13LOGO HERE
  • 13.
  • 14. Business Intelligence encompasses a wide variety of tools, applications and methodologies that enable organizations to collect data from internal systems and external sources; prepare it for analysis; develop and run queries against that data; and create reports, dashboards and data visualizations to make the analytical results available to corporate decision-makers, as well as operational workers. BI Mohtat@ut.ac.ir 17 Business Skills Link to Business Strategy Define Priorities Define BI Vision Lead Organization / BPR Analytics Skills Data Mining Social BI IT Skills Infrastructure Build Technology Data Integration & Quality
  • 16. Top Business Intelligence Skills SQL Data Warehousing Data Analysis Tableau ETL 23% 85% 28% 41% 65% Mohtat@ut.ac.ir 20 28%
  • 17. Top Business Intelligence Skills Business Analyst Oracle SQL Server BI Business Process Data Modeling 17% 85% 19% 21% 22% Mohtat@ut.ac.ir 21 19%
  • 18. Top Business Intelligence Tools Tableau Power BI Qlik Your Choice Is Clear Mohtat@ut.ac.ir 22
  • 19.
  • 20.
  • 21.
  • 22.
  • 23. Big Data Volume Terabyte Distribute Big Table Velocity Real-time Stream Processing Variety Structured Unstructured Text, Image, Video Mohtat@ut.ac.ir 27 Big data is a term used to refer to data sets that are too large or complex for traditional data-processing application software to adequately deal with. It’s what organizations do with the data that matters. Big data can be analyzed for insights that lead to better decisions and strategic business moves.
  • 25. 3 Types of Big Data Jobs 1 2 3 Big Data Developer Big Data Administration Big Data Analytics Mohtat@ut.ac.ir 29
  • 26. Top Big Data Programming Languages Not only Hadoop, many other big data analysis tools like Storm, Spark, and Kafka are written in Java and run on the JVM Java Python is a simple, open-source, general-purpose language. Hence, it is easy to learn Python for anyone.. With its rich set of utilities and libraries and easy-to-use features, it works wonder for big data processing and analysis. Python Scala is a rival of Java and Python in the world of Data Science and becoming more and more popular due to extensive use of Apache Spark in Big data Hadoop industry. Scala Mohtat@ut.ac.ir 30
  • 27. Pathway to Success Success Apache Hadoop Apache Spark Start NoSQL Database Data Analytics Data Visualization Mohtat@ut.ac.ir 31
  • 28. Big Data Companies & Vendors Cloudera, Inc. is a US-based software company that provides a software platform for data engineering, data warehousing, machine learning and analytics that runs in the cloud or on premises Cloudera MapR is a business software company headquartered in Santa Clara, California. MapR provides access to a variety of data sources from a single computer cluster, including big data workloads MapR Hortonworks is a data software company based in Santa Clara, California that develops, supports, and provides expertise on a set of open-source software designed to manage data and processing for things such as IOT, single view of X, and advanced analytics and machine learning Hortonworks
  • 31. Big Data Specialization Michigan University(Coursera) Introduction to Big Data Big Data Modeling and Management Systems Big Data Integration and Processing Machine Learning With Big Data Graph Analytics for Big Data Mohtat@ut.ac.ir 36LOGO HERE
  • 34.
  • 35. Data Scientist VS Data Engineer Mohtat@ut.ac.ir 40 Dolor sit ametis Data Engineering Data Scientist Data Pipelines Visualization & Storytelling Programming Modeling & Advance Analytics Math & Statistics System Implementation
  • 36. How To Become A Data Engineer Linux NoSQL & SQL Python / Java / Scala Agile Development Data Ingestion Processing Frameworks Mohtat@ut.ac.ir 42
  • 37. Best Data Processing Frameworks MapReduce is a programming model and an associated implementation for processing and generating big data sets with a parallel, distributed algorithm on a cluster Apache Spark is an open- source distributed general-purpose cluster- computing framework. Apache Storm is a free and open source distributed realtime computation system. The core of Apache Flink is a distributed streaming dataflow engine written in Java and Scala 43
  • 39. Data Ingestion Tools Apache Kafka SSIS & ODI Apache NiFi Logstash Mohtat@ut.ac.ir 45
  • 40.