3. Business Intelligence
The term Business Intelligence (BI) refers to
Technologies
Applications and
Practices
for the :
Collection
Integration
Analysis and
Presentation
4. WHY?
•Smart decisions
• Decision Support System (DSS)
•Different sources:
◦ Facebook
◦ Twitter
◦ LinkedIn
◦ Instagram etc.
•Different formats of data
◦ Structured
◦ Unstructured
5. Collection
•Staging
• Full Load/Truncate Load
• Incremental Load
•Replication
•Loading
•Transformation
• Aggregation
• Union
• Joining
• Expression
• Filter etc.
6. Integration
Source Systems Central Storage System Target System Reporting End
ETL ETL
Sybase IQ,
SQL, MS
ACCESS,
MY SQL,
SPARK,
HIVE,
HANA,
HADOOP
Tableau
BI Tools
SAP
HANA
Etc.
Inform-
atica,
IBM -
Websp
here
DataSt
age,
SAP –
Busine-
ss
Objects
IBM –
Cognos,
SAS –
Data,
Oracle -
Data
Integrat
or
18. Tools
Hadoop
◦ Open source software framework
◦ Handle very large data sets.
◦ Two main parts: Hadoop Distributed File System (HDFS) and MapReduce (Processor)
◦ HDFS is the storage component.
◦ MapReduce is the processing engine of Hadoop.
◦ Hadoop processes data by delivering code to nodes to process in parallel.
Apache Spark
◦ Quickly growing data analytics tool.
◦ Open source framework for cluster computing
◦ Spark is frequently used as an alternate to Hadoop's MapReduce because it is able to analyze data up to 100
times faster for certain applications.
◦ Common use cases for Apache Spark include streaming data, machine learning and interactive analysis
19. Tools
Apache Hive
◦ SQL-on-Hadoop data processing engine
◦ Apache Hive excels at batch processing of ETL jobs and SQL queries
◦ Hive utilizes a query language called HiveQLbased on SQL, but does not strictly follow the SQL-92 standard.
SAP HANA
◦ Real-time analytics
Sybase IQ
◦ Database server optimized for analytics/BI
20. Tools
Informatica
◦ Informatica PowerCenter is a widely used extraction, transformation and loading (ETL) tool used in building
enterprise data warehouses
Tableau
◦ Tableau is a business intelligence (BI) tool that can help you create beautiful and visually-appealing reports,
charts, graphs and dashboards using your data.
◦ These reports are interactive and can easily be shared with anyone.
◦ This data visualization software is extremely fast and easy to use as it has a drag and drop interface, so you do
not need to be a techie to use it.