3. STUFF I DO
Data & Analytics Cloud Enablement Leader
Service Delivery, Project Sponsorship, Technical
Leadership, People Management
https://www.cloudreach.com/careers/
3
4. STUFF THAT’S HAPPENING
757ColorCoded
We exist to educate and empower local people of color to
achieve careers in technology and improve their lives.
May 25, 2019 at 2:00 PM: Build a Website, Part II: WordPress 101
Slover Library (Lower Level), 235 E Plume St, Norfolk, VA 23510
https://757colorcoded.org
https://757ColorCoded.slack.com
4
5. STUFF THAT’S HAPPENING
RevolutionConf
RevolutionConf is a two-day, platform and language
agnostic software development conference.
June 6-7, 2019
Wyndham Oceanfront, Virginia Beach, VA 23451
https://revolutionconf.com
https://diversity.revolutionconf.com
5
6. STUFF THAT’S HAPPENING
SQL Saturday
A day of Data Platform and SQL Server training for all levels. Admittance
to this event is free, however there is a $12 fee for lunch &
refreshments. Please register soon as seating is limited.
June 8, 2019
ECPI University, 5555 Greenwich Road, Virginia Beach, VA 23462
https://www.sqlsaturday.com/839/EventHome.aspx
6
7. AGENDA
⊗ Some Definitions
⊗ The Big Data Problem
⊗ The Data Science Hierarchy of Needs
⊗ Data Science & Analytics Roles
7
9. Big data is a term used to refer
to data sets that are too large
or complex for traditional data-
processing application software
to adequately deal with.
9
https://en.wikipedia.org/wiki/Big_data
10. TERABYTES
⊗ 1 TB = 1,500 CD-ROMs
⊗ 2 TB = 130,000 digital photos
⊗ 10 TB = 1 year of data from the
Hubble Space Telescope
10
https://www.lifewire.com/terabytes-gigabytes-amp-petabytes-how-big-are-they-4125169
11. PETABYTES
⊗ 1 PB = 20,000,000 filing cabinets
⊗ 1 PB = 10,000 hours of TV
⊗ 2.5 PB = capacity of a human brain
11
https://www.makeuseof.com/tag/memory-sizes-gigabytes-terabytes-petabytes/
12. Data analytics is the pursuit of
extracting meaning from raw
data using specialized computer
systems. These systems
transform, organize, and model
the data to draw conclusions
and identify patterns...
12
https://www.informatica.com/services-and-training/glossary-of-terms/data-analytics-definition.html
14. Data science is an
interdisciplinary field that uses
scientific methods, processes,
algorithms and systems to
extract knowledge and insights
from data in various forms, both
structured and unstructured…
14
https://en.wikipedia.org/wiki/Data_science
16. Machine learning (ML) is the scientific
study of algorithms and statistical
models that computer systems use to
effectively perform a specific task
without using explicit instructions,
relying on patterns and inference
instead.
16
https://en.wikipedia.org/wiki/Machine_learning
17. Artificial intelligence: the theory and
development of computer systems able
to perform tasks that normally require
human intelligence, such as visual
perception, speech recognition,
decision-making, and translation
between languages.
17
https://en.oxforddictionaries.com/definition/artificial_intelligence
19. The never-ending stream of
information is incredibly useful
for businesses, but it can also
be a challenge to draw relevant
insights from such a large data
pool.
19
https://marketinginsidergroup.com/strategy/big-data-trends-you-should-know-about-in-2018-infographic/
24. COMMON SKILLS
Skills
⊗ Excellent written & verbal communication
⊗ Effective collaboration
⊗ Database knowledge (MySQL, PostgreSQL, Cassandra, MongoDB, Redis)
⊗ Proficiency in SQL
24
25. DATA ENGINEER
Responsibilities
⊗ Build and maintain architectures that
support big data systems such as
ETL pipelines.
⊗ Create applications and tools to
support data scientists and data
analysts.
⊗ Collaborate with data scientists to
build algorithms to derive meaning
from data sets.
Skills
⊗ Python, Java, Scala, C++
⊗ Spark, PySpark
⊗ Jupyter notebooks
⊗ Data warehousing
⊗ Data storage
⊗ ETL
⊗ Basic machine learning (Tensorflow,
PyTorch, MXNet)
25
26. DATA ANALYST
Responsibilities
⊗ Collect and clean data from disparate
data sources for analysis.
⊗ Identify, analyze, and interpret data to
uncover patterns or trends.
⊗ Generate domain-specific reporting,
visualizations, and dashboards.
Skills
⊗ Business Intelligence & Data
Visualization Tools (Tableau, Power
BI, Qlik)
⊗ Microsoft Excel
⊗ Analytics Tools (Google Analytics,
Google Tag Manager, etc.)
26
27. DATA SCIENTIST
Responsibilities
⊗ Collect and clean data from disparate
data sources for analysis.
⊗ Train and deploy models to predict
outcomes.
⊗ Communicate findings to business
stakeholders.
Skills
⊗ R, Python, Java, Scala
⊗ Jupyter notebooks
⊗ Machine learning (Tensorflow,
PyTorch, MXNet)
⊗ Statistics, Linear Algebra
⊗ Business Intelligence & Data
Visualization Tools (Tableau, Power
BI, Qlik)
27
29. GET STARTED
29
Courses
⊗ Kaggle: Your Home for Data Science
⊗ Coursera
⊗ Udemy
⊗ A Cloud Guru ($29/month)
Certifications
⊗ Microsoft Professional Program in Big Data* ($99)
⊗ AWS Certified Cloud Practitioner ($100)
⊗ Google Cloud Certified Associate Cloud Engineer ($125)