The Center for Information & Communication Technology (CICT) at IBA provides professional education in information technology, information systems, and computer science through diplomas, certifications, and workshops. It was established in 2016 to provide high-quality professional education to the public and private sectors in Pakistan. CICT works with renowned faculty who conduct courses and workshops contributing to digitalization in education and professional fields.
2. education providing professional diplomas, certifications, and workshops in the field of Information technology, Information
Systems, and Computer Science. The CICT was established in 2016 providing high-quality professional education to the private
and public sectors in Pakistan. The CICT has been associated with renowned faculty members who conduct certain courses and
workshops which contribute to the digitalization in the educational and professional sectors.
The Center for Information & Communication Technology (CICT) aspires to meet the highest standards of IT excellence
required in pursuit of management strategies.
The Center for Information & Communication Technology (CICT) is to provide excellent teaching and research environment
specially in Information Technology to produce students/professionals who distinguish themselves by their professional
competence, research, entrepreneurship, humanistic outlook, ethical rectitude, pragmatic approach to problem solving,
managerial skills and ability to respond to the challenge of socio-economic development to serve as the vanguard of techno-
industrial transformation of the society.
3. • Be respectful to your teachers as well as your classmates, any kind of disrespect or misbehavior will be subject to dismissal from the course.
• Be on time for your classes and avoid being irregular.
• All students are required to carry their ID Cards provided by the administration to avoid any inconveniences while entering the premise.
• In case of Card loss, the student is required to report to the department to state their case.
• Avoid usage of mobile devices during your sessions, the instructor has a full right to dismiss any student who has been found using the phone during
class.
• Do not bring any food or drinks inside your classrooms especially computer labs as it is strictly prohibited.
• Don't forget to turn off the machine once you are done using it.
• Do not plug in external devices without scanning them for computer viruses.
Note: IBA plays a vital role in maintaining decorum which applies to every candidate entering the premises. Keeping in mind that the campus is Smoke-
free any candidate carrying out any mal activity is reluctantly charged & may result is dismissal from the course.
4. The convergence of big data and machine learning with technologies such as cloud services, sensors, ubiquitous computing, mobile
devices, and the Internet of Things has created vast new opportunities for business. Analytics has become a competitive and
sustainable advantage for many organizations. To harness the benefits of big data and machine learning, however, business
leaders face the pressing challenge of not only acquiring the right technologies and talent to analyze and interpret the data but also
weaving a data-centric mindset into the organization's structure and cultural fabric.
This Four-Month Diploma will empower me with the skills and confidence to tackle data-driven opportunities and accelerate data-
analysis transformation in the organization. Through lectures, case studies, and discussions, real-world insights will be gained on
various applications of big data analytics and machine learning, and how they can be used to fuel better decision-making within the
context of the attendee's own Department/Organization.
5. Mr. Sohail Imran:
For more than 18 years, Mr. Sohail Imran is conducting
training and workshops for databases (SQL and
NoSQL), Big Data Infrastructure, and Machine
Learning for different institutes, universities, and the
corporate sector. More than 8 years of professional
experience in Big Data Analytics, Data Science, Data
Mining, Data Warehousing, and DBMS (SQL and
NoSQL). Providing consultancy in designing and
developing Big Data Analytics platforms using Java,
RapideMiner, Radoop, Python, Hadoop, Hive, Spark,
Kafka, Spark Streaming, Storm, etc.
Mr. Muhammad Rizwan:
Muhammad Rizwan provides digital leadership to
organizations, from strategy to execution, globally and
locally. In his 25+ corporate career, he has worked in both
public and private equity spaces with technology-led and
digitally-enabled businesses in various management
vernaculars, including C-Suite. He holds a certificate from
Stanford University in Machine Learning. A master's degree
from Hamdard. Bachelor's degree in Statistics/Commerce
and Information Systems. He also carries an international
diploma in Software Engineering. Currently, heading the
information systems at Dollar Industries (Pvt) Ltd. He has
worked for Hino Pak Motors, Karachi Stock Exchange, and
CPLC (Car theft software) projects.
Dr. Affan Alim:
Dr. Muhammad Affan Alim has 16 years of teaching, research, and development experience in
Machine Learning, Deep Learning, Data Science, pattern classification, computer vision,
Optimization of models, and statistical & mathematical analysis. He also has several years of
professional experience in software development in Pakistan and the United Kingdom (UK). He has
developed several industry-based projects.
Mr. Muhammad Shamim Ahmed:
With overall 10 years of experience in Project management, Oracle functional consultation,
and 4 years as an Oracle University trainer, Mr. Shamim has been a great asset to IBA
CICT. He has executed 5 projects to date in various capacities, with proven experience with
the complete life cycle of Oracle EBS implementation. He has been actively involved in all
stages of project management such as initiation, Business Blueprint (AS-IS Document),
GAP Analysis, Solution design, and much more. He is exceptionally motivated, energetic,
and enthusiastic about learning and teaching
6. • History and Evolution of Python
• Advantages of Apache Spark with Python in a Big Data Environment
• Setting up Big Data Programming Development Environments
• Programming Language Basics
• Collections and their types
• Conditional Control Structures
• Iterative Control Structures
• Methods with Practice Examples
• Module and File I/O
• Object-Oriented Programming Concepts
• Apache Spark and Python for Machine Learning
• Exam
I. Python Foundations for Big Data Analytics
• Intro to data science, Role of the database designer, data engineer, data analyst, and data scientist. what data scientists do.
• The available format of data and what types of data a data scientist received. The life cycle of data science, data science competitions
• The difference between wrangling and feature engineering, the steps of wrangling, and its detail
• Reading of different methods of the dataset using pandas python
Steps of feature engineering for machine learning. During understanding, some real examples will also be discussed. Requirement of the tools and techniques of data science.
• Missing data imputation using pandas. How to find the missing values in the dataset, what handling of missing values is important
• Fillna() method with different parameters for missing values.
• Drop all rows of missing data in Data Frame, and drop missing data rows with respect to a specific column.
• How to fill using aggregate values, how to fill forward and backward, and fill with reference to other columns. Practice questions
• The real application-based problem for missing values
• What is the outlier, what is the impact of an outlier in data, and how do find and visualize the outliers?
• Outlier removal strategies, Performing winsorization, a python implementation
II. Big Data Wrangling
7. • Three case studies for EDA
• Exploratory data analytics
• Handling the structuring issues in the dataset, inconsistencies in date, and any other attributes. Unnecessary character attachment, Exploring these issues
• Handling the structuring issues using regex
• Categorical Variables: Encoding Categorical Variables: on hot encoding, dummy encoding, effect encoding, pros and cons of a categorical variable encoding
• Discussion of project
• Live demonstration of Kaggle and its features
• how Kaggle will helpful for data scientist
• Exam
II. Big Data Wrangling
III. Business Intelligence (BI) and Big Data Visualization
• Introduction to BI & commonly using BI tools.
• Power BI introduction & its components.
• Power View, Query, Pivot & Power BI Service
• Introduction to Power Query and its usage.
• Basic Power BI Navigation.
Basic Power BI Charts.
• Column Chart.
• Stacked Column Chart.
• Pie Chart.
• Donut Chart.
• Funnel Chart.
• Ribbon Chart.
• Include and Exclude.
• Export data from Visual.
Maps in Power BI Desktop
• Map.
• Filled Map.
• Map with Pie Chart.
• Formatting in Map.
• Background Changes in Map.
• Map of Pakistan in Power BI.
• Map of Australia.
Table & Matrix in Power BI Desktop
• Creating a Simple Table.
• Formatting in Table.
• Conditional Formatting in Table.
• Changing Aggregation in Table.
• Creating a Matrix in Power BI.
• Conditional Formatting in Matrix.
• Automatic Hierarchy in Matrix.
Other Charts in Power BI Desktop
• Line Chart.
• Drill down in Line Chart.
• Area Chart.
• Line vs Column Chart.
• Scatter Plot.
• Waterfall Chart.
8. Cards and Filters in Power BI Desktop.
• Number Card.
• Text Card.
• Date Card.
• Multi-Row Card.
• Filter on Visual.
• Filter on Page.
• Filter on All Pages.
• Drill through.
III. Business Intelligence (BI) and Big Data Visualization
Slicers in Power BI Desktop
• Slicer for Text.
• Format Text Slicer.
• Date Slicer.
• Format Date Slicer .
• Number Slicer.
Advanced Charts in Power BI Desktop
• Animated Bar Chart Race.
• Drill Down Donut Chart.
• Drill Down Column Chart.
• Word Cloud.
• Sankey Chart.
• Infographic.
• Play Axis.
• Scroller.
• Sunburst Chart.
• 10- Histogram
Objects and Actions (Hyperlinks) in PBI
• Insert Image.
• Insert Text.
• Insert Shapes.
• Insert Buttons.
• Action - Web URL.
• Action - Page Navigation.
• Action - Bookmark Action.
• Action - Drill through.
Power BI Service Introduction
• Creating a Superstore Report.
• Create an Account on Power BI Service.
• Publish Report to Power BI Service Account.
• Export (PPT, PDF, PBIX) Report and Share.
• Comment, Share and Subscribe to a report.
• Create a dashboard in Power BI Service.
• Problem in Power BI Dashboard & its solution.
• Automatic Refresh - Data Gatewayn.
• Exam
9. • Introduction to NoSQL databases
• Comparison with SQL databases
• Document NoSQL Store
• Introduction and Installation
• Basic commands
• Document NoSQL Data Modeling.
• Practice exercises
• Integration of Document NoSQL database with Apache Spark
Machine Learning
• Practice exercises
• Graph NoSQL Store
• Introduction and Installation
• Basic commands
• Graph NoSQL Data Modeling
• Integration of Graph NoSQL database with Apache Spark
Machine Learning
• Practice exercises
• Key-Value NoSQL Store
• Introduction and Installation
• Exam
IV. Big Data Management Systems with NoSQL Data Stores V. Machine Learning for Big Data
• What is Machine Learning, and what tools are required for
learning it
• Differences between classification and regression-based
problems, Supervise and unsupervised categories, and real-
world examples
• Machine learning protocol for implementation
• Linear regression; How it works, mathematical and graphical
representation of LR
• Python implementation of Linear regression
• Discuss the performance metrics for regression-based
problems
• Logistic regression; how it works, Decision boundaries,
Sigmoid function,
• Python implementation of Logistic regression
• Discuss the performance metrics for classification-based
problems
• The real-life problem of logistic regression
• For regression and classification-based problems
• K-neighbour nearest
• Support vector machine
• Discussion of overfitting and underfitting
• Cross-validation
• Hold out cross-validation
10. • K-fold and its types of cross-validation
• Leave one out cross-validation
• Bootstrap cross-validation
• Python implementation of Cross validation
• Classification and Regression
• Decision tree
• Random forest
• Parameter behavior of both algorithms
• Overfitting and underfitting handling
• Hyper-parameter
• Un supervised learning; K-mean
• Feature selection; PCA
• Discussion of project
• live demonstration on Kaggle submission
• real-life problem solving
• Final Exam
• Infrastructure Development for Real-Time Big Data Analytics
• Streaming Introduction
• Big Data Pipelines: The Rise of Real-Time
V. Machine Learning for Big Data VI. Case Study
Stream processing with Apache Storm
• How does Twitter compute trends
• Improve performance using distributed processing
• Building blocks of Storm Topologies
• Adding Parallelism in a Storm Topology
• Components of Storm Cluster
• A simple Hello World Topology
• Implementing Bolt & Submitting a Topology
Processing Data using Files
• Reading Data from a file
• Representing Data using Tuples
• Accessing Data from Tuples
• Writing Data to a File
• Assignment 1
11. VI. Case Study
Spark Streaming
• Streaming Architecture
• Deployment of Collection and Message Queuing Tiers
• Introduction of message queuing tier using Apache Kafka
Running The Collection Tier (Part II - Sending Data)
Data Access Tier
• Introduction to Data Access tier - MongoDB
• Exploring Spring Reactive
• Exposing Data Access tier in browser
• Analysis Tier
• Introduction to Analysis tier - Apache Spark
• Plug-in Spark Analysis Tier to Our Pipelines
• A brief overview of Spark RDDs
• Fault Tolerance
• Kafka Connect
• Assignment 2
Brief introduction to
• DaLambda vs Kafka architecture
• taFrame, DataSets, and SparkSQL
• Spark Structured Streaming
Benefits of Kappa architecture.
Building Data Pipelines using Apache Airflow
• Advantages of using DAGs in Apache Airflow
• Apache Airflow UI
• Building DAG using Airflow
• Airflow Monitoring and Logging
• Assignment 3
VII. Final Exam