
Data Sciences Learning



This presentation covers tools and techniques used in data science, data analytics, and data engineering. It is a collection of graphics and tabular data for quick learning.


Data Sciences Learning

  1. 1. General Learning Data Sciences | AI | Big Data Analytics
  2. 2. Tasks To Do • Install SQL Server and communicate with it via a client • Cloud deployments • Understand Linux architecture and basic commands • Understand IP addressing • Understand hypervisors • Understand protocols: DNS, DHCP, HTTP, SSL, TLS, HTTPS, FTP, SMTP • Master Python and TensorFlow • What are microservices, and how do they differ from APIs? • HashiCorp’s Terraform • Study the Bahria University research groups: https://bahria.edu.pk/oric/
  3. 3. Companies to work for in future • u-blox Lahore https://www.u-blox.com/en/job-openings • NETSOL • TERESOL • CONTOUR SOFTWARE https://contour-software.com/careers/#Jobs • TERADATA http://nicat.pk/
  4. 4. Active Job Openings • https://www.u-blox.com/en/job-openings#Open-jobs • https://contour-software.com/careers/#Jobs
  5. 5. DS Tools and Requirement
     Tool: Requirement
     Kafka: Big Data messaging
     Terraform: Multi-cloud management through code
     TensorFlow: Low-level software library created by Google to implement ML models and solve complex numerical problems
     Hadoop: Big Data online storage
     Docker: (requirement not given)
     Keras: High-level deep learning API in Python for easy implementation and computation of neural networks
     Apache Spark: Big Data stream handling in real time
     Kubernetes: (requirement not given)
     PyTorch: Low-level API developed by Facebook for NLP and computer vision; comparable to NumPy, with GPU acceleration
     Tableau: (requirement not given)
  6. 6. DS Tools and Requirement
  7. 7. DASHBOARDING TOOLS
  8. 8. Useful resources: Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for Beginners | Edureka - YouTube
  9. 9. LINUX, R/Python, HDFS, HIVE, YARN, MAPREDUCE
  10. 10. Data Scientists Jobs @ DUBAI • data scientist job dubai (google.com) • IMP: https://g.co/kgs/AEjBMy
  11. 11. Data Science jobs @ Glassdoor
  12. 12. Data Science jobs @ Glassdoor
  13. 13. TOP COMPANIES WORKING IN DATA SCIENCE IN DUBAI • Eurasian Resources Group - ERG • Cobblestone • Kognitiv Corporation • Careem • First Abu Dhabi Bank • VISA • DATABUZZ LTD • Foodics • nybl • Constellation Software, Inc. • The Emirates Group • TRANSFERWISE • TMC • Binance.US • Al Futtaim • Agility • ARTEFACT • MARS • Landmark Group • Amazon Middle East and NA • UHRS • RAK BANK • Millennium Plaza Hotel Dubai • Standard Chartered Bank • Affaan Technologies • Siemens • DataRobot • Arthur Lawrence • Parsons International • Manipal Academy of Higher Education, Dubai • GMG • WOW AI LLC • APCO Worldwide • Accenture • BlackSky • Swvl • Dataiku • Emirates NBD • Procter & Gamble • Zayed University • Mastercard
  14. 14. Linux – Shells vs Scripts
  15. 15. Traditional Programming vs. Machine Learning
  16. 16. Polyglot • The ability to use multiple programming languages within one software platform
  17. 17. Device to interconnect various LANs
  18. 18. Coaxial / Fiber Optics
  19. 19. Data Center Standards / Compliances
  20. 20. Source: https://www.youtube.com/watch?v=BzJvVBxSEOM
  21. 21. https://www.youtube.com/watch?v=iyES7UwJfvw
  22. 22. Excel
  23. 23. Cluster Computing / Programming • A computer cluster is a set of computers that work together so that they can be viewed as a single system. Unlike grid computers, computer clusters have each node set to perform the same task, controlled and scheduled by software.
  24. 24. https://www.youtube.com/watch?v=8BBDxzJL6fY In a cluster setup, one IP address is bound to multiple servers. If one server goes down, another can respond.
  25. 25. High Performance Cluster
  26. 26. The processing/computing requirement is either too large or takes too long on standard computers.
  27. 27. If a task takes 3 months on an on-premises cluster of 16 PCs with 4-core processors each (= 64 cores), the same task can be done in just 16 hours on 125,000 cores in the cloud at the same or no incremental cost. This illustrates the benefit of cloud computing over an on-premises cluster.
  28. 28. CLOUD BENEFITS
  29. 29. https://youtu.be/WXoIcjA5a9Y https://www.youtube.com/watch?v=rvqCqK2Lpjg https://www.youtube.com/watch?v=k7zu3NXEiGY
  30. 30. BATCH PROCESSING
  31. 31. DEDICATED SERVING LAYER
  32. 32. What is actually happening? Descriptive analysis • What will happen next? Predictive analysis • What to do now? Prescriptive analysis • What was the reason? Diagnostic analysis
  33. 33. Top Big Data Technologies | Big Data Tools Tutorial | Big Data Hadoop Training | Edureka - YouTube
  34. 34. For predictive analysis
  35. 35. In-memory computing capabilities for real-time big data streams
  36. 36. SPARK VS HADOOP https://www.youtube.com/watch?v=9mELEARcxJo
  37. 37. The most widely-used engine for scalable computing Thousands of companies, including 80% of the Fortune 500, use Apache Spark™. Over 2,000 contributors to the open source project from industry and academia.
  38. 38. Hadoop handles batch processing only, whereas Spark also supports real-time processing.
  39. 39. PySpark https://www.youtube.com/watch?v=5dARTeE6OpU
  40. 40. What is Kafka? A data pipeline messaging system
  41. 41. What is Kafka (explained again) • A messaging system • Simplifies management of data pipelines • Retains messages even when a pipeline is disrupted by a network issue • Any sink of the messaging system can subscribe to a data pipeline • Queue and publish-subscribe models
  42. 42. What is terraform in Hindi/Urdu | Lec-01 | Terraform tutorial for beginners | Infrastructure as Code - YouTube
  43. 43. https://www.youtube.com/watch?v=4L86D_fU6sQ
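
Slide 24 describes a cluster in which one IP address fronts several servers, so another server answers when one is down. The sketch below illustrates only that selection logic. The addresses and the `route` and `first_healthy` helpers are hypothetical, and a real cluster would typically rely on a load balancer or a floating IP rather than application code.

```python
from typing import Callable, Optional, Sequence, Set

# Hypothetical addresses: one shared IP in front of three backend servers.
SHARED_IP = "10.0.0.100"
BACKENDS = ["10.0.0.11", "10.0.0.12", "10.0.0.13"]


def first_healthy(servers: Sequence[str],
                  is_up: Callable[[str], bool]) -> Optional[str]:
    """Return the first server that passes the health check, or None."""
    for server in servers:
        if is_up(server):
            return server
    return None


def route(down: Set[str]) -> Optional[str]:
    """Pick the backend that answers a request sent to SHARED_IP."""
    return first_healthy(BACKENDS, lambda server: server not in down)


print(route(set()))            # all servers up
print(route({"10.0.0.11"}))    # first server down, the next one responds
```

Clients only ever see `SHARED_IP`; which backend answers is decided behind it.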
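
The figures on slide 27 can be sanity-checked with core-hour arithmetic. The sketch below assumes three 30-day months and perfect linear scaling, which real workloads rarely achieve, so it gives only a lower bound on the cloud run time.

```python
HOURS_PER_DAY = 24
DAYS_IN_THREE_MONTHS = 90  # assumption: three 30-day months


def core_hours(cores: int, hours: float) -> float:
    """Total compute consumed: number of cores times wall-clock hours."""
    return cores * hours


# On-premises cluster from the slide: 16 PCs with 4 cores each.
on_prem_cores = 16 * 4
on_prem_core_hours = core_hours(on_prem_cores,
                                DAYS_IN_THREE_MONTHS * HOURS_PER_DAY)

# Cloud cluster from the slide.
cloud_cores = 125_000

# Wall-clock time if the work were spread perfectly over all cloud cores.
ideal_cloud_hours = on_prem_core_hours / cloud_cores

print(f"on-premises work: {on_prem_core_hours:,} core-hours")
print(f"ideal cloud run time: {ideal_cloud_hours:.2f} hours")
```

Under these assumptions the ideal run time is roughly one hour, so the 16 hours quoted on the slide leaves room for scheduling and communication overhead.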
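
Slide 41 mentions message retention and the queue and publish-subscribe models. The toy in-memory log below sketches how per-group read offsets can provide both. `MiniLog` and the group names are hypothetical stand-ins and are not Kafka's API.

```python
from collections import defaultdict
from typing import Dict, List, Optional


class MiniLog:
    """Append-only message log; a toy stand-in, not Kafka's API."""

    def __init__(self) -> None:
        self._messages: List[str] = []
        # Group name -> index of the next message that group will read.
        self._offsets: Dict[str, int] = defaultdict(int)

    def publish(self, message: str) -> None:
        self._messages.append(message)

    def poll(self, group: str) -> Optional[str]:
        """Return the next unread message for this group, or None."""
        offset = self._offsets[group]
        if offset >= len(self._messages):
            return None
        self._offsets[group] = offset + 1
        return self._messages[offset]


log = MiniLog()
for event in ["a", "b", "c"]:
    log.publish(event)

# Publish-subscribe: independent groups each receive every message.
analytics = [log.poll("analytics") for _ in range(3)]
billing = [log.poll("billing") for _ in range(3)]

# Queue: workers sharing one group split the messages between them.
worker_1 = log.poll("workers")
worker_2 = log.poll("workers")

# Retention: reading does not delete, so a late group starts from the beginning.
late = log.poll("audit")
```

Because reading only advances an offset, a consumer that was offline can resume where it stopped, which matches the retention point on the slide.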
