SlideShare a Scribd company logo
International Association of Scientific Innovation and Research (IASIR)
(An Association Unifying the Sciences, Engineering, and Applied Research)
International Journal of Engineering, Business and Enterprise
Applications (IJEBEA)
www.iasir.net
IJEBEA 14-267; © 2014, IJEBEA All Rights Reserved Page 105
ISSN (Print): 2279-0020
ISSN (Online): 2279-0039
Integration of Big Data in Banking Sector to Speed up the Analytical
Process
1
Prof. Dr. P.K. Srimani, F.N.A.Sc. 2
Prof. Rajasekharaiah K.M.
1
Former Chairman, Dept. of Computer Science & Maths, Bangalore University
Director, R & D, Bangalore University, Bangalore, India.
2
Professor & HOD, Department of Computer Science and Engineering,
JnanaVikas Institute of Technology, Bangalore Mysore High Way, Bidadi, Bangalore,
Visvesvaraya Technological University (VTU), Belgaum, Karnataka, India.
____________________________________________________________________________________________________
Abstract: In banking area, we find Big Data which is scattered in different places or sources in heterogeneous
format using different Databases or Files. Hence, it is very difficult to analyze the data fastly for making
Decision Support System (DSS). In this paper, we have developed a High Level Design (HLD) of Data
Warehouse system and making the whole process or the system automated using ETL (Extraction,
Transformation and Loading) tools like IBM InfoSphere Information Server, PowerCenter Informatica etc/, In
the first phase, Hadoop Data Warehouse is designed by integrating Big Data from various sources like Oracle
DB’s, DB2, Sybase, SAP, Data Marts, Flat Files, on WEB SPHERE etc. into a Warehouse in a single format
and in one place. Hence, we use ETL tool – Informatica to integrate all banking data and also use “ERWIN”
for warehouse design and “SQL LOADER” for fast data transfer. It can be operated on Windows and/or Unix
O/s platform. In order to integrate all this data, initially we design a Multi-dimensional Modeling of Data
(MDMD) by using Star Schema and Snow Flake Schema. Secondly, we pool all the data in one area called
“Staging Area”, from this we make ETL process of all data into Data Warehouse.
Keywords: Hadoop Data Warehouse, heterogeneous data, Database files, Flat files, HLD, automated , ETL,
Informatica, Web Sphere, Staging Area
_________________________________________________________________________________________
I. INTRODUCTION:
In this paper, a detailed study of the banking system which uses OLTP (On Line Transaction Processing) for
handling the day-to-day transactions and to generate the business analysis reports is made. The existing system
provides limited options for analyst to generate reports for future business forecasting and also to develop
business strategies. Further, these reports do not support system applications and thus cannot meet the
requirements of the Bank to enhance their business objectives.
Currently the Big Data in the business is competitive in all directions vertically, horizontally and parallelly. The
success of the banking sector or organizations depends on the effectiveness of the use of technology, tools and
services in meeting the customer’s requirements and their satisfaction.
Certain developmental activities in this direction move through a set of planned strategies consisting of
establishment of clear objectives and goals, from the generation of ideas to concept development, service
design, prototyping, service launch and customer feedback. As mentioned here some expert of literature exists
in this direction but have served major drawbacks. Hence, the present study is carried out [1, 3, 10, 11].
II. OBJECTIVES AND GOALS:
A. Objectives:
Our research will dwell in the following area:
 Data Mining both from structured and unstructured data
 Mapping from heterogeneous sources of data through Staging Area into DWH
 Big Data integration and analytics to speed up the process for querying or report generation
B. Goals:
Our research goal is to create DWH using ETL tool – Informatica. This tool is used for analyze DW and
provides us various reports of the Bank [2].
The results/solutions are compared with other business analytical tools and prove that the advantages in our
solutions are the best to practice and to implement in all business enterprises.
III. PROBLEM DOMAIN:
Presently, the Big Data is scattered in various sources and also in different formats. We are facing the following
problems –
P.K. Srimani et al., International Journal of Engineering, Business and Enterprise Applications, 8(2), March-May., 2014, pp. 105-110
IJEBEA 14-267; © 2014, IJEBEA All Rights Reserved Page 106
 It is very difficult to analyze those data fastly
 Limited options for analysis
 Limited options for analyst to generate reports
 Reports taken are not sufficient or sometimes short falls for DSS like business forecasting and to
develop business strategies
 Reports even do not support some system applications and can’t meet the requirements of the Bank to
enhance their business
IV. DESIGN PROCESS & DEPLOYMENT:
In Fig. 1, the technical diagram of a complex Data Warehouse Architecture (DWA) is presented,
Implementations are done by using the following Hardware and Software’s: [5, 6, 8, 9]
A Bank needs the development and design of an analytical DWH which is inextricably linked to various
business needs. The various design process which involves are – [6, 9]
Figure 1: Technical Diagram of Complex Data Warehouse Architecture
1. OLTP – Transaction Processing
The input to the DWH (Data Warehouse) if from various sources likes –
 Oracle DWH tables , dup files, data files etc
 Flat Files or Text Files, Excel Sheet etc.
2. CRS and SRS
(Customer and System Requirement Specifications)
3. ETL (Extraction Transformation & Loading) specifications
Involves Source Data to the Target Data
4. HLD – High Level Documents
Description of the tools used and naming conventions
5. DWH – Data Warehouse Design
It involves three phases of design –
 Conceptual Design - (Dimensions and Fact Tables)
 Logical Design - (Using Dimension Modeling Technique, Attributes and Constraints)
 Physical Design - (Data type, Data size, Data Tables and SQL statements)
6. Loading into DWH
(Loading all data from different sources into one storage area i.e. Staging Area into DWH and in one format to
make query/retrieval of various reports easily)
7. Testing
(Nest step is to test the loaded data by using Unit and System Testing)
Unit Testing is done by developer by writing SQL procedure or query.
System Testing is done by using Software Testing Tools.
8. Certification
(We have to complete ETL specifications with mappings done by developers. If our design meets the ETL
specifications then it is implemented.)
9. Production Phase
(This is the final phase where in further enhancements are carried out depending upon the customer’s need or
requirements, after it is successful, full implementation will be done.) (See Fig. 2)
V. CASE STUDY of AFFIN BANK, MALAYSIA:
In our research, we implement Data Warehouse Architecture (DWA – Fig.2) which deals with heterogeneous
data sets. In the first phase, we have created and designed the Data Warehouse, Dimensions and Fact tables. In
the second phase, we are going to mapping with source and target data marts. The bank has a need for an
P.K. Srimani et al., International Journal of Engineering, Business and Enterprise Applications, 8(2), March-May., 2014, pp. 105-110
IJEBEA 14-267; © 2014, IJEBEA All Rights Reserved Page 107
analytical data warehouse and a leading bank since from 30 years. Since it is fiancé cum facilitator banking
company,
Figure 2: Systems Development Life Cycle Phases (SDLC)
it has to be linked with many kinds of business enhancements and competitive edge over business using
Information Technology to make –
 Better decisions
 Dedicated better customer services
 Business Intelligence Analysis
Further, Bank offers the following additional services to their customers –
1. Offshore Finance
2. Commercial Fiancé
3. Trade Finance
4. Vehicle Fiancé
5. Housing Finance
The Bank have number of branches all over South Africa and searching for new business avenues, attracting
more new investments and to increase number of customers by using various medias, promoting new finance
schemes, implementing new business strategies and decisions. [7]
A. SCOPE:
The below Fig.4 describes the HLD – High Level Documents requirements of the Data Warehouse System. It is
meant for use by the designers and developers and will be the basis for validating the final deliverables of the
system.
Figure 3: 5 Phases of Data Warehouse Architecture:
S
T
A
G
I
N
G
A
R
E
A
E
T
L
T
O
O
L
S
D
W
H
Source n
Source1
Source 2
PHASE 3 PH 4 & 5PHASE 1 & 2
O
L
A
P
S
T
A
G
I
N
G
A
R
E
A
P.K. Srimani et al., International Journal of Engineering, Business and Enterprise Applications, 8(2), March-May., 2014, pp. 105-110
IJEBEA 14-267; © 2014, IJEBEA All Rights Reserved Page 108
Figure 4: Physical Data Integration
VI. PROPOSED SOLUTIONS:
By considering all the above problems and reports, we are finding solutions as – [2, 5, 8]
 All these data is to be integrated in a single format and pooling in one place, (see Fig. 2 DWH -
implementation) so that the mining will be effective and efficient report/information for making proper
business analysis and decision making
 Using latest ETL technology tools as mentioned earlier for fast processing of data
 Mapping is done by using source and target data
 Whole process is made Automated by using the above ETL tool – Informatica Power Center Ver.9.0.
 Creating High Level Design (HLD) of DW System and making the whole process Automated
 Creating of Dimensions (MDDM )and Fact tables
 Using ‘ERWIN’ for DW design
 Using ‘SQL-LOADER’ for fast data transfer
 We design Multi Dimensional Modeling of Data (MDMD) in order to integrate all the data by using Star
and Snowflake Schema
 After all the above operations is over we will pool all the data in an intermediate area called ‘STAGING
AREA’ (Ref. Fig.3)
 Finally, from Staging Area, we are going to pool all data into DW by using ETL (See Figs. (3) to (6)).
Figure5: ETL Process Figure 6: Stakeholders who uses reports
P.K. Srimani et al., International Journal of Engineering, Business and Enterprise Applications, 8(1), March-May., 2014, pp. xx-xx
IJEBEA 14-267; © 2014, IJEBEA All Rights Reserved Page 109
VII. ADVANTAGES
The following are the advantages of the system over the existing system of the Bank:-
 User friendly, easy to handle and flexible in all reports
 Cost is reduced by using this tool and also saves lot of time
 Uncovering those details which are lacking right before
 Validations are made throughout the entire process to avoid occurrence of errors
 Error handling and exceptions are made easy by redirecting to a particular box by naming its path.
 Error handling with error descriptions are also populated in the system

VIII. APPLICATIONS
The applications are widely used in banking sector and the following are various stakeholders –
 Business Analysts and Executives
 Senior Managers
 Top level and Middle level management people to take DSS in their business
 Management Information System tool –
 To make forecasting of business
 To analyze trend identification
 To make market analysis
 To make competitive business edge in the market
 To create global market
 Also supports OLAP applications and to generate various reports

IX. CONCLUSION
In this paper, we concluded that the Data Integration of Banking Finance System is successfully designed,
developed, tested and implemented with case study. Care is taken for data validation check at each level of data
flow. Further, the Software is friendly, menu driven, easy accessible and maintainable.
X. FUTURE ENHANCEMENT
Future enhancements can be done to control data redundancy, data independence, data accuracy and integrity
and also recovery from failure.
REFERRENCES
[1]. Inmon W.H .”Building the Data Warehouse”, Second Edition ,J Wiley and Sons ,New York,1996
[2]. B de ville (2001),”Microsoft Data Mining :Integrated Business Intelligence for E-Commerce and knowledge Management”.
Boston: Digital press.
[3]. Frawley W Piatetsky –Shapiro G and Matheus C ,”Knowledge Discovery in Databases” An overview”.Al Magazine,Fall
1992,pgs 213-228
[4]. Integrate the Insight An oracle approach to integrate the big data and white paper. 5. 2012” IBM Global Training outlook “
March 2012.http://www.research.ibm.com/files/pdfs/goto_booklets_executive_review_march_12pdf”.
[5]. ”Data warehousing Life cycle and ETL tool kit. Informatica Guide Ralph Kimball
[6]. D Pyle (2003) “Business Modeling and Data mining” Morgan Kaufmann, an Francisco, CA
[7]. Barry D Data Warehouse from architecture to implementation Addison Wesley 1997.
[8]. Krulj D “Design and implementation of data warehouse systems .M.Sc. Thesis, Faculty of Organizational sciences, Belgrade
2003.
[9]. Lohr ,Steve .”The Age of Big Data” “New York Times.11 Feb 2012.http://www.nytimescom/2012/02/12/sunday-review/big-
datas-impact-in-the-world.html? r=2 & pagewanted=all
[10]. Manyika,James,Michel Chui, Brad Brown, Jacques Bughin ,Richard Dobbs, Charles Rexburg and Angela H.Byers.”Big data:
The net frontier for innovations, competition and productivity c Kinsey Global Institute (2011) 1-137 May 2011.
[11]. Boyd ,Dana and Crawford,Kate “Six Provocations for Big Data”Working Paper –Oxford Internet Institute
21Sept.2011http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1926431 Boyd, Dana and Crawford, Kate.
[12]. Bohanec .M (2001) What is Decision Support? Proceedings Information Society IS-2001: Data Mining and Decision Support in
action! (pp 86-89), Ljubljana, Slovenia
[13]. Bajec,M & Krisper,M (2005) .A Methodology and Tool Support for Managing Business Rules in Organizations ,Information
Systems,30,423-443
[14]. Holsheimer,M (1999) data mining by Business Users :Integrating Data Mining in Business Process. Proceedings International
Conference on Knowledge Discovery and Data Mining KDD-99( p.p 266-291) ,San Diego USA:ACM.
ACKNOWLEDGEMENT
One of the author’s Mr. Rajasekharaiah K.M. thanks Ms. Chhaya Dule, Asst.Prof. Jyothy Institute of Technology, Bangalore for her
valuable suggestions.
AUTHOR:
Presently Mr. Rajasekharaiah K.M. is working as Professor & HOD Department of Computer Science &
Engineering, Jnana Vikas Institute of Technology, Bangalore. He has done M.Tech. in Computer Science &
Engg. M.Sc. Information Technology, M.Phil. in Computer Science, and PGDIT from reputed Universities,
India. He is having 30+ years of total experience including 16 years of Industrial experiences. He is a Life fellow
Member of Indian Society for Technical Education (ISTE), New Delhi. He is presently pursuing the doctoral
degree in the Branch of Computer Science & Engineering, in the domain area of Data Mining & Warehousing.
P.K. Srimani et al., International Journal of Engineering, Business and Enterprise Applications, 8(2), March-May., 2014, pp. 105-110
IJEBEA 14-267; © 2014, IJEBEA All Rights Reserved Page 110
He has research publications in reputed national and international journals. His other area of interests are DBMS, Software Engg., Software
Architecture, Computer Networks, Programming Languages, Data Structures and Mobile Computing. He is also a resource scholar for other
Engineering Colleges/University
Screen Shots, Reports and Dashboard Snapshots
Report: 1 Report: 2
Report: 3 Report: 4
Report: 5 Report: 6

More Related Content

What's hot

N1803017478
N1803017478N1803017478
N1803017478
IOSR Journals
 
IRJET- Intelligence Extraction using Various Machine Learning Algorithms
IRJET- Intelligence Extraction using Various Machine Learning AlgorithmsIRJET- Intelligence Extraction using Various Machine Learning Algorithms
IRJET- Intelligence Extraction using Various Machine Learning Algorithms
IRJET Journal
 
IRJET- Comparative Study of ETL and E-LT in Data Warehousing
IRJET- Comparative Study of ETL and E-LT in Data WarehousingIRJET- Comparative Study of ETL and E-LT in Data Warehousing
IRJET- Comparative Study of ETL and E-LT in Data Warehousing
IRJET Journal
 
Estimation of Functional Size of a Data Warehouse System using COSMIC FSM Method
Estimation of Functional Size of a Data Warehouse System using COSMIC FSM MethodEstimation of Functional Size of a Data Warehouse System using COSMIC FSM Method
Estimation of Functional Size of a Data Warehouse System using COSMIC FSM Method
idescitation
 
VenkatSubbaReddy_Resume
VenkatSubbaReddy_ResumeVenkatSubbaReddy_Resume
VenkatSubbaReddy_Resume
Venkata SubbaReddy
 
Planning Data Warehouse
Planning Data WarehousePlanning Data Warehouse
Planning Data Warehouse
Fahri Firdausillah
 
Data Warehousing & Basic Architectural Framework
Data Warehousing & Basic Architectural FrameworkData Warehousing & Basic Architectural Framework
Data Warehousing & Basic Architectural Framework
Dr. Sunil Kr. Pandey
 
Research Article
Research ArticleResearch Article
Research Article
sparwaiz
 
Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...
Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...
Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...
IJCSIS Research Publications
 
Getting relational database from legacy data mdre approach
Getting relational database from legacy data mdre approachGetting relational database from legacy data mdre approach
Getting relational database from legacy data mdre approach
Alexander Decker
 
Issue in Data warehousing and OLAP in E-business
Issue in Data warehousing and OLAP in E-businessIssue in Data warehousing and OLAP in E-business
Issue in Data warehousing and OLAP in E-business
Technical Advisor at Iraqi Government
 
Fbdl enabling comprehensive_data_services
Fbdl enabling comprehensive_data_servicesFbdl enabling comprehensive_data_services
Fbdl enabling comprehensive_data_services
Cindy Irby
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
Dr. Sunil Kr. Pandey
 
Unit 5
Unit 5 Unit 5
Unit 5
sasisanjeev
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
Edureka!
 
Understanding System and Architecture for Big Data
Understanding System and Architecture for Big DataUnderstanding System and Architecture for Big Data
Understanding System and Architecture for Big Data
IBM India Smarter Computing
 
Dss
DssDss
Systems Lifecycle workbook
Systems Lifecycle workbookSystems Lifecycle workbook
Systems Lifecycle workbook
MISY
 
People soft basics
People soft basicsPeople soft basics
People soft basics
technicalguru
 

What's hot (19)

N1803017478
N1803017478N1803017478
N1803017478
 
IRJET- Intelligence Extraction using Various Machine Learning Algorithms
IRJET- Intelligence Extraction using Various Machine Learning AlgorithmsIRJET- Intelligence Extraction using Various Machine Learning Algorithms
IRJET- Intelligence Extraction using Various Machine Learning Algorithms
 
IRJET- Comparative Study of ETL and E-LT in Data Warehousing
IRJET- Comparative Study of ETL and E-LT in Data WarehousingIRJET- Comparative Study of ETL and E-LT in Data Warehousing
IRJET- Comparative Study of ETL and E-LT in Data Warehousing
 
Estimation of Functional Size of a Data Warehouse System using COSMIC FSM Method
Estimation of Functional Size of a Data Warehouse System using COSMIC FSM MethodEstimation of Functional Size of a Data Warehouse System using COSMIC FSM Method
Estimation of Functional Size of a Data Warehouse System using COSMIC FSM Method
 
VenkatSubbaReddy_Resume
VenkatSubbaReddy_ResumeVenkatSubbaReddy_Resume
VenkatSubbaReddy_Resume
 
Planning Data Warehouse
Planning Data WarehousePlanning Data Warehouse
Planning Data Warehouse
 
Data Warehousing & Basic Architectural Framework
Data Warehousing & Basic Architectural FrameworkData Warehousing & Basic Architectural Framework
Data Warehousing & Basic Architectural Framework
 
Research Article
Research ArticleResearch Article
Research Article
 
Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...
Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...
Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...
 
Getting relational database from legacy data mdre approach
Getting relational database from legacy data mdre approachGetting relational database from legacy data mdre approach
Getting relational database from legacy data mdre approach
 
Issue in Data warehousing and OLAP in E-business
Issue in Data warehousing and OLAP in E-businessIssue in Data warehousing and OLAP in E-business
Issue in Data warehousing and OLAP in E-business
 
Fbdl enabling comprehensive_data_services
Fbdl enabling comprehensive_data_servicesFbdl enabling comprehensive_data_services
Fbdl enabling comprehensive_data_services
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Unit 5
Unit 5 Unit 5
Unit 5
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Understanding System and Architecture for Big Data
Understanding System and Architecture for Big DataUnderstanding System and Architecture for Big Data
Understanding System and Architecture for Big Data
 
Dss
DssDss
Dss
 
Systems Lifecycle workbook
Systems Lifecycle workbookSystems Lifecycle workbook
Systems Lifecycle workbook
 
People soft basics
People soft basicsPeople soft basics
People soft basics
 

Viewers also liked

Ijebea14 276
Ijebea14 276Ijebea14 276
Ijebea14 276
Iasir Journals
 
Ijebea14 270
Ijebea14 270Ijebea14 270
Ijebea14 270
Iasir Journals
 
Ijebea14 277
Ijebea14 277Ijebea14 277
Ijebea14 277
Iasir Journals
 
Ijebea14 272
Ijebea14 272Ijebea14 272
Ijebea14 272
Iasir Journals
 
Ijebea14 278
Ijebea14 278Ijebea14 278
Ijebea14 278
Iasir Journals
 
Ijebea14 271
Ijebea14 271Ijebea14 271
Ijebea14 271
Iasir Journals
 
Ijebea14 285
Ijebea14 285Ijebea14 285
Ijebea14 285
Iasir Journals
 
Ijebea14 287
Ijebea14 287Ijebea14 287
Ijebea14 287
Iasir Journals
 

Viewers also liked (8)

Ijebea14 276
Ijebea14 276Ijebea14 276
Ijebea14 276
 
Ijebea14 270
Ijebea14 270Ijebea14 270
Ijebea14 270
 
Ijebea14 277
Ijebea14 277Ijebea14 277
Ijebea14 277
 
Ijebea14 272
Ijebea14 272Ijebea14 272
Ijebea14 272
 
Ijebea14 278
Ijebea14 278Ijebea14 278
Ijebea14 278
 
Ijebea14 271
Ijebea14 271Ijebea14 271
Ijebea14 271
 
Ijebea14 285
Ijebea14 285Ijebea14 285
Ijebea14 285
 
Ijebea14 287
Ijebea14 287Ijebea14 287
Ijebea14 287
 

Similar to Ijebea14 267

MODERN DATA PIPELINE
MODERN DATA PIPELINEMODERN DATA PIPELINE
MODERN DATA PIPELINE
IRJET Journal
 
Design and implementation of the web (extract, transform, load) process in da...
Design and implementation of the web (extract, transform, load) process in da...Design and implementation of the web (extract, transform, load) process in da...
Design and implementation of the web (extract, transform, load) process in da...
IAESIJAI
 
Jeeva_Resume
Jeeva_ResumeJeeva_Resume
Jeeva_Resume
Jeeva Ramasamy
 
Arunkumar_Resume
Arunkumar_ResumeArunkumar_Resume
Arunkumar_Resume
Arunkumar Santharam
 
Abdul ETL Resume
Abdul ETL ResumeAbdul ETL Resume
Abdul ETL Resume
Abdul mohammed
 
Richa_Profile
Richa_ProfileRicha_Profile
Richa_Profile
Richa Sharma
 
ABHIJEET MURLIDHAR GHAG Axisbank
ABHIJEET MURLIDHAR GHAG AxisbankABHIJEET MURLIDHAR GHAG Axisbank
ABHIJEET MURLIDHAR GHAG Axisbank
Abhijeet Ghag
 
Gowthami_Resume
Gowthami_ResumeGowthami_Resume
Gowthami_Resume
Gowthami Subramaniam
 
ETL Profile-Rajnish Kumar
ETL Profile-Rajnish KumarETL Profile-Rajnish Kumar
ETL Profile-Rajnish Kumar
Rajnish Kumar
 
IRJET- Business Intelligence using Hadoop
IRJET-  	  Business Intelligence using HadoopIRJET-  	  Business Intelligence using Hadoop
IRJET- Business Intelligence using Hadoop
IRJET Journal
 
Lokesh_Reddy_Datastage_Resume
Lokesh_Reddy_Datastage_ResumeLokesh_Reddy_Datastage_Resume
Lokesh_Reddy_Datastage_Resume
Lokesh Reddy
 
Mani_Sagar_ETL
Mani_Sagar_ETLMani_Sagar_ETL
Mani_Sagar_ETL
Mani Sagar
 
Business Intelligence Module 3
Business Intelligence Module 3Business Intelligence Module 3
Business Intelligence Module 3
Home
 
PratikGhosh_Resume_Final
PratikGhosh_Resume_FinalPratikGhosh_Resume_Final
PratikGhosh_Resume_Final
Pratik Ghosh
 
Nitin - Data Specialist
Nitin - Data SpecialistNitin - Data Specialist
Nitin - Data Specialist
Nitin singhal
 
PeterBarnumGenResume
PeterBarnumGenResumePeterBarnumGenResume
PeterBarnumGenResume
Peter Barnum
 
IRJET- Data Analytics & Visualization using Qlik
IRJET- Data Analytics & Visualization using QlikIRJET- Data Analytics & Visualization using Qlik
IRJET- Data Analytics & Visualization using Qlik
IRJET Journal
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
Ting Yin
 
RajeshS_ETL
RajeshS_ETLRajeshS_ETL
RajeshS_ETL
Rajesh Sundaram
 
Resume1
Resume1Resume1

Similar to Ijebea14 267 (20)

MODERN DATA PIPELINE
MODERN DATA PIPELINEMODERN DATA PIPELINE
MODERN DATA PIPELINE
 
Design and implementation of the web (extract, transform, load) process in da...
Design and implementation of the web (extract, transform, load) process in da...Design and implementation of the web (extract, transform, load) process in da...
Design and implementation of the web (extract, transform, load) process in da...
 
Jeeva_Resume
Jeeva_ResumeJeeva_Resume
Jeeva_Resume
 
Arunkumar_Resume
Arunkumar_ResumeArunkumar_Resume
Arunkumar_Resume
 
Abdul ETL Resume
Abdul ETL ResumeAbdul ETL Resume
Abdul ETL Resume
 
Richa_Profile
Richa_ProfileRicha_Profile
Richa_Profile
 
ABHIJEET MURLIDHAR GHAG Axisbank
ABHIJEET MURLIDHAR GHAG AxisbankABHIJEET MURLIDHAR GHAG Axisbank
ABHIJEET MURLIDHAR GHAG Axisbank
 
Gowthami_Resume
Gowthami_ResumeGowthami_Resume
Gowthami_Resume
 
ETL Profile-Rajnish Kumar
ETL Profile-Rajnish KumarETL Profile-Rajnish Kumar
ETL Profile-Rajnish Kumar
 
IRJET- Business Intelligence using Hadoop
IRJET-  	  Business Intelligence using HadoopIRJET-  	  Business Intelligence using Hadoop
IRJET- Business Intelligence using Hadoop
 
Lokesh_Reddy_Datastage_Resume
Lokesh_Reddy_Datastage_ResumeLokesh_Reddy_Datastage_Resume
Lokesh_Reddy_Datastage_Resume
 
Mani_Sagar_ETL
Mani_Sagar_ETLMani_Sagar_ETL
Mani_Sagar_ETL
 
Business Intelligence Module 3
Business Intelligence Module 3Business Intelligence Module 3
Business Intelligence Module 3
 
PratikGhosh_Resume_Final
PratikGhosh_Resume_FinalPratikGhosh_Resume_Final
PratikGhosh_Resume_Final
 
Nitin - Data Specialist
Nitin - Data SpecialistNitin - Data Specialist
Nitin - Data Specialist
 
PeterBarnumGenResume
PeterBarnumGenResumePeterBarnumGenResume
PeterBarnumGenResume
 
IRJET- Data Analytics & Visualization using Qlik
IRJET- Data Analytics & Visualization using QlikIRJET- Data Analytics & Visualization using Qlik
IRJET- Data Analytics & Visualization using Qlik
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
 
RajeshS_ETL
RajeshS_ETLRajeshS_ETL
RajeshS_ETL
 
Resume1
Resume1Resume1
Resume1
 

More from Iasir Journals

ijetcas14 650
ijetcas14 650ijetcas14 650
ijetcas14 650
Iasir Journals
 
Ijetcas14 648
Ijetcas14 648Ijetcas14 648
Ijetcas14 648
Iasir Journals
 
Ijetcas14 647
Ijetcas14 647Ijetcas14 647
Ijetcas14 647
Iasir Journals
 
Ijetcas14 643
Ijetcas14 643Ijetcas14 643
Ijetcas14 643
Iasir Journals
 
Ijetcas14 641
Ijetcas14 641Ijetcas14 641
Ijetcas14 641
Iasir Journals
 
Ijetcas14 639
Ijetcas14 639Ijetcas14 639
Ijetcas14 639
Iasir Journals
 
Ijetcas14 632
Ijetcas14 632Ijetcas14 632
Ijetcas14 632
Iasir Journals
 
Ijetcas14 624
Ijetcas14 624Ijetcas14 624
Ijetcas14 624
Iasir Journals
 
Ijetcas14 619
Ijetcas14 619Ijetcas14 619
Ijetcas14 619
Iasir Journals
 
Ijetcas14 615
Ijetcas14 615Ijetcas14 615
Ijetcas14 615
Iasir Journals
 
Ijetcas14 608
Ijetcas14 608Ijetcas14 608
Ijetcas14 608
Iasir Journals
 
Ijetcas14 605
Ijetcas14 605Ijetcas14 605
Ijetcas14 605
Iasir Journals
 
Ijetcas14 604
Ijetcas14 604Ijetcas14 604
Ijetcas14 604
Iasir Journals
 
Ijetcas14 598
Ijetcas14 598Ijetcas14 598
Ijetcas14 598
Iasir Journals
 
Ijetcas14 594
Ijetcas14 594Ijetcas14 594
Ijetcas14 594
Iasir Journals
 
Ijetcas14 593
Ijetcas14 593Ijetcas14 593
Ijetcas14 593
Iasir Journals
 
Ijetcas14 591
Ijetcas14 591Ijetcas14 591
Ijetcas14 591
Iasir Journals
 
Ijetcas14 589
Ijetcas14 589Ijetcas14 589
Ijetcas14 589
Iasir Journals
 
Ijetcas14 585
Ijetcas14 585Ijetcas14 585
Ijetcas14 585
Iasir Journals
 
Ijetcas14 584
Ijetcas14 584Ijetcas14 584
Ijetcas14 584
Iasir Journals
 

More from Iasir Journals (20)

ijetcas14 650
ijetcas14 650ijetcas14 650
ijetcas14 650
 
Ijetcas14 648
Ijetcas14 648Ijetcas14 648
Ijetcas14 648
 
Ijetcas14 647
Ijetcas14 647Ijetcas14 647
Ijetcas14 647
 
Ijetcas14 643
Ijetcas14 643Ijetcas14 643
Ijetcas14 643
 
Ijetcas14 641
Ijetcas14 641Ijetcas14 641
Ijetcas14 641
 
Ijetcas14 639
Ijetcas14 639Ijetcas14 639
Ijetcas14 639
 
Ijetcas14 632
Ijetcas14 632Ijetcas14 632
Ijetcas14 632
 
Ijetcas14 624
Ijetcas14 624Ijetcas14 624
Ijetcas14 624
 
Ijetcas14 619
Ijetcas14 619Ijetcas14 619
Ijetcas14 619
 
Ijetcas14 615
Ijetcas14 615Ijetcas14 615
Ijetcas14 615
 
Ijetcas14 608
Ijetcas14 608Ijetcas14 608
Ijetcas14 608
 
Ijetcas14 605
Ijetcas14 605Ijetcas14 605
Ijetcas14 605
 
Ijetcas14 604
Ijetcas14 604Ijetcas14 604
Ijetcas14 604
 
Ijetcas14 598
Ijetcas14 598Ijetcas14 598
Ijetcas14 598
 
Ijetcas14 594
Ijetcas14 594Ijetcas14 594
Ijetcas14 594
 
Ijetcas14 593
Ijetcas14 593Ijetcas14 593
Ijetcas14 593
 
Ijetcas14 591
Ijetcas14 591Ijetcas14 591
Ijetcas14 591
 
Ijetcas14 589
Ijetcas14 589Ijetcas14 589
Ijetcas14 589
 
Ijetcas14 585
Ijetcas14 585Ijetcas14 585
Ijetcas14 585
 
Ijetcas14 584
Ijetcas14 584Ijetcas14 584
Ijetcas14 584
 

Recently uploaded

Tools & Techniques for Commissioning and Maintaining PV Systems W-Animations ...
Tools & Techniques for Commissioning and Maintaining PV Systems W-Animations ...Tools & Techniques for Commissioning and Maintaining PV Systems W-Animations ...
Tools & Techniques for Commissioning and Maintaining PV Systems W-Animations ...
Transcat
 
Call For Paper -3rd International Conference on Artificial Intelligence Advan...
Call For Paper -3rd International Conference on Artificial Intelligence Advan...Call For Paper -3rd International Conference on Artificial Intelligence Advan...
Call For Paper -3rd International Conference on Artificial Intelligence Advan...
ijseajournal
 
SENTIMENT ANALYSIS ON PPT AND Project template_.pptx
SENTIMENT ANALYSIS ON PPT AND Project template_.pptxSENTIMENT ANALYSIS ON PPT AND Project template_.pptx
SENTIMENT ANALYSIS ON PPT AND Project template_.pptx
b0754201
 
OOPS_Lab_Manual - programs using C++ programming language
OOPS_Lab_Manual - programs using C++ programming languageOOPS_Lab_Manual - programs using C++ programming language
OOPS_Lab_Manual - programs using C++ programming language
PreethaV16
 
Transformers design and coooling methods
Transformers design and coooling methodsTransformers design and coooling methods
Transformers design and coooling methods
Roger Rozario
 
AI-Based Home Security System : Home security
AI-Based Home Security System : Home securityAI-Based Home Security System : Home security
AI-Based Home Security System : Home security
AIRCC Publishing Corporation
 
P5 Working Drawings.pdf floor plan, civil
P5 Working Drawings.pdf floor plan, civilP5 Working Drawings.pdf floor plan, civil
P5 Working Drawings.pdf floor plan, civil
AnasAhmadNoor
 
Pressure Relief valve used in flow line to release the over pressure at our d...
Pressure Relief valve used in flow line to release the over pressure at our d...Pressure Relief valve used in flow line to release the over pressure at our d...
Pressure Relief valve used in flow line to release the over pressure at our d...
cannyengineerings
 
Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
Divyanshu
 
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
upoux
 
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
upoux
 
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENTNATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
Addu25809
 
一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理
uqyfuc
 
Supermarket Management System Project Report.pdf
Supermarket Management System Project Report.pdfSupermarket Management System Project Report.pdf
Supermarket Management System Project Report.pdf
Kamal Acharya
 
Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...
Prakhyath Rai
 
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
ydzowc
 
Mechatronics material . Mechanical engineering
Mechatronics material . Mechanical engineeringMechatronics material . Mechanical engineering
Mechatronics material . Mechanical engineering
sachin chaurasia
 
Blood finder application project report (1).pdf
Blood finder application project report (1).pdfBlood finder application project report (1).pdf
Blood finder application project report (1).pdf
Kamal Acharya
 
Zener Diode and its V-I Characteristics and Applications
Zener Diode and its V-I Characteristics and ApplicationsZener Diode and its V-I Characteristics and Applications
Zener Diode and its V-I Characteristics and Applications
Shiny Christobel
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
VANDANAMOHANGOUDA
 

Recently uploaded (20)

Tools & Techniques for Commissioning and Maintaining PV Systems W-Animations ...
Tools & Techniques for Commissioning and Maintaining PV Systems W-Animations ...Tools & Techniques for Commissioning and Maintaining PV Systems W-Animations ...
Tools & Techniques for Commissioning and Maintaining PV Systems W-Animations ...
 
Call For Paper -3rd International Conference on Artificial Intelligence Advan...
Call For Paper -3rd International Conference on Artificial Intelligence Advan...Call For Paper -3rd International Conference on Artificial Intelligence Advan...
Call For Paper -3rd International Conference on Artificial Intelligence Advan...
 
SENTIMENT ANALYSIS ON PPT AND Project template_.pptx
SENTIMENT ANALYSIS ON PPT AND Project template_.pptxSENTIMENT ANALYSIS ON PPT AND Project template_.pptx
SENTIMENT ANALYSIS ON PPT AND Project template_.pptx
 
OOPS_Lab_Manual - programs using C++ programming language
OOPS_Lab_Manual - programs using C++ programming languageOOPS_Lab_Manual - programs using C++ programming language
OOPS_Lab_Manual - programs using C++ programming language
 
Transformers design and coooling methods
Transformers design and coooling methodsTransformers design and coooling methods
Transformers design and coooling methods
 
AI-Based Home Security System : Home security
AI-Based Home Security System : Home securityAI-Based Home Security System : Home security
AI-Based Home Security System : Home security
 
P5 Working Drawings.pdf floor plan, civil
P5 Working Drawings.pdf floor plan, civilP5 Working Drawings.pdf floor plan, civil
P5 Working Drawings.pdf floor plan, civil
 
Pressure Relief valve used in flow line to release the over pressure at our d...
Pressure Relief valve used in flow line to release the over pressure at our d...Pressure Relief valve used in flow line to release the over pressure at our d...
Pressure Relief valve used in flow line to release the over pressure at our d...
 
Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
 
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
 
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
 
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENTNATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
 
一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理
 
Supermarket Management System Project Report.pdf
Supermarket Management System Project Report.pdfSupermarket Management System Project Report.pdf
Supermarket Management System Project Report.pdf
 
Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...
 
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
 
Mechatronics material . Mechanical engineering
Mechatronics material . Mechanical engineeringMechatronics material . Mechanical engineering
Mechatronics material . Mechanical engineering
 
Blood finder application project report (1).pdf
Blood finder application project report (1).pdfBlood finder application project report (1).pdf
Blood finder application project report (1).pdf
 
Zener Diode and its V-I Characteristics and Applications
Zener Diode and its V-I Characteristics and ApplicationsZener Diode and its V-I Characteristics and Applications
Zener Diode and its V-I Characteristics and Applications
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
 

Ijebea14 267

  • 1. International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Engineering, Business and Enterprise Applications (IJEBEA) www.iasir.net IJEBEA 14-267; © 2014, IJEBEA All Rights Reserved Page 105 ISSN (Print): 2279-0020 ISSN (Online): 2279-0039 Integration of Big Data in Banking Sector to Speed up the Analytical Process 1 Prof. Dr. P.K. Srimani, F.N.A.Sc. 2 Prof. Rajasekharaiah K.M. 1 Former Chairman, Dept. of Computer Science & Maths, Bangalore University Director, R & D, Bangalore University, Bangalore, India. 2 Professor & HOD, Department of Computer Science and Engineering, JnanaVikas Institute of Technology, Bangalore Mysore High Way, Bidadi, Bangalore, Visvesvaraya Technological University (VTU), Belgaum, Karnataka, India. ____________________________________________________________________________________________________ Abstract: In banking area, we find Big Data which is scattered in different places or sources in heterogeneous format using different Databases or Files. Hence, it is very difficult to analyze the data fastly for making Decision Support System (DSS). In this paper, we have developed a High Level Design (HLD) of Data Warehouse system and making the whole process or the system automated using ETL (Extraction, Transformation and Loading) tools like IBM InfoSphere Information Server, PowerCenter Informatica etc/, In the first phase, Hadoop Data Warehouse is designed by integrating Big Data from various sources like Oracle DB’s, DB2, Sybase, SAP, Data Marts, Flat Files, on WEB SPHERE etc. into a Warehouse in a single format and in one place. Hence, we use ETL tool – Informatica to integrate all banking data and also use “ERWIN” for warehouse design and “SQL LOADER” for fast data transfer. It can be operated on Windows and/or Unix O/s platform. In order to integrate all this data, initially we design a Multi-dimensional Modeling of Data (MDMD) by using Star Schema and Snow Flake Schema. Secondly, we pool all the data in one area called “Staging Area”, from this we make ETL process of all data into Data Warehouse. Keywords: Hadoop Data Warehouse, heterogeneous data, Database files, Flat files, HLD, automated , ETL, Informatica, Web Sphere, Staging Area _________________________________________________________________________________________ I. INTRODUCTION: In this paper, a detailed study of the banking system which uses OLTP (On Line Transaction Processing) for handling the day-to-day transactions and to generate the business analysis reports is made. The existing system provides limited options for analyst to generate reports for future business forecasting and also to develop business strategies. Further, these reports do not support system applications and thus cannot meet the requirements of the Bank to enhance their business objectives. Currently the Big Data in the business is competitive in all directions vertically, horizontally and parallelly. The success of the banking sector or organizations depends on the effectiveness of the use of technology, tools and services in meeting the customer’s requirements and their satisfaction. Certain developmental activities in this direction move through a set of planned strategies consisting of establishment of clear objectives and goals, from the generation of ideas to concept development, service design, prototyping, service launch and customer feedback. As mentioned here some expert of literature exists in this direction but have served major drawbacks. Hence, the present study is carried out [1, 3, 10, 11]. II. OBJECTIVES AND GOALS: A. Objectives: Our research will dwell in the following area:  Data Mining both from structured and unstructured data  Mapping from heterogeneous sources of data through Staging Area into DWH  Big Data integration and analytics to speed up the process for querying or report generation B. Goals: Our research goal is to create DWH using ETL tool – Informatica. This tool is used for analyze DW and provides us various reports of the Bank [2]. The results/solutions are compared with other business analytical tools and prove that the advantages in our solutions are the best to practice and to implement in all business enterprises. III. PROBLEM DOMAIN: Presently, the Big Data is scattered in various sources and also in different formats. We are facing the following problems –
  • 2. P.K. Srimani et al., International Journal of Engineering, Business and Enterprise Applications, 8(2), March-May., 2014, pp. 105-110 IJEBEA 14-267; © 2014, IJEBEA All Rights Reserved Page 106  It is very difficult to analyze those data fastly  Limited options for analysis  Limited options for analyst to generate reports  Reports taken are not sufficient or sometimes short falls for DSS like business forecasting and to develop business strategies  Reports even do not support some system applications and can’t meet the requirements of the Bank to enhance their business IV. DESIGN PROCESS & DEPLOYMENT: In Fig. 1, the technical diagram of a complex Data Warehouse Architecture (DWA) is presented, Implementations are done by using the following Hardware and Software’s: [5, 6, 8, 9] A Bank needs the development and design of an analytical DWH which is inextricably linked to various business needs. The various design process which involves are – [6, 9] Figure 1: Technical Diagram of Complex Data Warehouse Architecture 1. OLTP – Transaction Processing The input to the DWH (Data Warehouse) if from various sources likes –  Oracle DWH tables , dup files, data files etc  Flat Files or Text Files, Excel Sheet etc. 2. CRS and SRS (Customer and System Requirement Specifications) 3. ETL (Extraction Transformation & Loading) specifications Involves Source Data to the Target Data 4. HLD – High Level Documents Description of the tools used and naming conventions 5. DWH – Data Warehouse Design It involves three phases of design –  Conceptual Design - (Dimensions and Fact Tables)  Logical Design - (Using Dimension Modeling Technique, Attributes and Constraints)  Physical Design - (Data type, Data size, Data Tables and SQL statements) 6. Loading into DWH (Loading all data from different sources into one storage area i.e. Staging Area into DWH and in one format to make query/retrieval of various reports easily) 7. Testing (Nest step is to test the loaded data by using Unit and System Testing) Unit Testing is done by developer by writing SQL procedure or query. System Testing is done by using Software Testing Tools. 8. Certification (We have to complete ETL specifications with mappings done by developers. If our design meets the ETL specifications then it is implemented.) 9. Production Phase (This is the final phase where in further enhancements are carried out depending upon the customer’s need or requirements, after it is successful, full implementation will be done.) (See Fig. 2) V. CASE STUDY of AFFIN BANK, MALAYSIA: In our research, we implement Data Warehouse Architecture (DWA – Fig.2) which deals with heterogeneous data sets. In the first phase, we have created and designed the Data Warehouse, Dimensions and Fact tables. In the second phase, we are going to mapping with source and target data marts. The bank has a need for an
  • 3. P.K. Srimani et al., International Journal of Engineering, Business and Enterprise Applications, 8(2), March-May., 2014, pp. 105-110 IJEBEA 14-267; © 2014, IJEBEA All Rights Reserved Page 107 analytical data warehouse and a leading bank since from 30 years. Since it is fiancé cum facilitator banking company, Figure 2: Systems Development Life Cycle Phases (SDLC) it has to be linked with many kinds of business enhancements and competitive edge over business using Information Technology to make –  Better decisions  Dedicated better customer services  Business Intelligence Analysis Further, Bank offers the following additional services to their customers – 1. Offshore Finance 2. Commercial Fiancé 3. Trade Finance 4. Vehicle Fiancé 5. Housing Finance The Bank have number of branches all over South Africa and searching for new business avenues, attracting more new investments and to increase number of customers by using various medias, promoting new finance schemes, implementing new business strategies and decisions. [7] A. SCOPE: The below Fig.4 describes the HLD – High Level Documents requirements of the Data Warehouse System. It is meant for use by the designers and developers and will be the basis for validating the final deliverables of the system. Figure 3: 5 Phases of Data Warehouse Architecture: S T A G I N G A R E A E T L T O O L S D W H Source n Source1 Source 2 PHASE 3 PH 4 & 5PHASE 1 & 2 O L A P S T A G I N G A R E A
  • 4. P.K. Srimani et al., International Journal of Engineering, Business and Enterprise Applications, 8(2), March-May., 2014, pp. 105-110 IJEBEA 14-267; © 2014, IJEBEA All Rights Reserved Page 108 Figure 4: Physical Data Integration VI. PROPOSED SOLUTIONS: By considering all the above problems and reports, we are finding solutions as – [2, 5, 8]  All these data is to be integrated in a single format and pooling in one place, (see Fig. 2 DWH - implementation) so that the mining will be effective and efficient report/information for making proper business analysis and decision making  Using latest ETL technology tools as mentioned earlier for fast processing of data  Mapping is done by using source and target data  Whole process is made Automated by using the above ETL tool – Informatica Power Center Ver.9.0.  Creating High Level Design (HLD) of DW System and making the whole process Automated  Creating of Dimensions (MDDM )and Fact tables  Using ‘ERWIN’ for DW design  Using ‘SQL-LOADER’ for fast data transfer  We design Multi Dimensional Modeling of Data (MDMD) in order to integrate all the data by using Star and Snowflake Schema  After all the above operations is over we will pool all the data in an intermediate area called ‘STAGING AREA’ (Ref. Fig.3)  Finally, from Staging Area, we are going to pool all data into DW by using ETL (See Figs. (3) to (6)). Figure5: ETL Process Figure 6: Stakeholders who uses reports
  • 5. P.K. Srimani et al., International Journal of Engineering, Business and Enterprise Applications, 8(1), March-May., 2014, pp. xx-xx IJEBEA 14-267; © 2014, IJEBEA All Rights Reserved Page 109 VII. ADVANTAGES The following are the advantages of the system over the existing system of the Bank:-  User friendly, easy to handle and flexible in all reports  Cost is reduced by using this tool and also saves lot of time  Uncovering those details which are lacking right before  Validations are made throughout the entire process to avoid occurrence of errors  Error handling and exceptions are made easy by redirecting to a particular box by naming its path.  Error handling with error descriptions are also populated in the system  VIII. APPLICATIONS The applications are widely used in banking sector and the following are various stakeholders –  Business Analysts and Executives  Senior Managers  Top level and Middle level management people to take DSS in their business  Management Information System tool –  To make forecasting of business  To analyze trend identification  To make market analysis  To make competitive business edge in the market  To create global market  Also supports OLAP applications and to generate various reports  IX. CONCLUSION In this paper, we concluded that the Data Integration of Banking Finance System is successfully designed, developed, tested and implemented with case study. Care is taken for data validation check at each level of data flow. Further, the Software is friendly, menu driven, easy accessible and maintainable. X. FUTURE ENHANCEMENT Future enhancements can be done to control data redundancy, data independence, data accuracy and integrity and also recovery from failure. REFERRENCES [1]. Inmon W.H .”Building the Data Warehouse”, Second Edition ,J Wiley and Sons ,New York,1996 [2]. B de ville (2001),”Microsoft Data Mining :Integrated Business Intelligence for E-Commerce and knowledge Management”. Boston: Digital press. [3]. Frawley W Piatetsky –Shapiro G and Matheus C ,”Knowledge Discovery in Databases” An overview”.Al Magazine,Fall 1992,pgs 213-228 [4]. Integrate the Insight An oracle approach to integrate the big data and white paper. 5. 2012” IBM Global Training outlook “ March 2012.http://www.research.ibm.com/files/pdfs/goto_booklets_executive_review_march_12pdf”. [5]. ”Data warehousing Life cycle and ETL tool kit. Informatica Guide Ralph Kimball [6]. D Pyle (2003) “Business Modeling and Data mining” Morgan Kaufmann, an Francisco, CA [7]. Barry D Data Warehouse from architecture to implementation Addison Wesley 1997. [8]. Krulj D “Design and implementation of data warehouse systems .M.Sc. Thesis, Faculty of Organizational sciences, Belgrade 2003. [9]. Lohr ,Steve .”The Age of Big Data” “New York Times.11 Feb 2012.http://www.nytimescom/2012/02/12/sunday-review/big- datas-impact-in-the-world.html? r=2 & pagewanted=all [10]. Manyika,James,Michel Chui, Brad Brown, Jacques Bughin ,Richard Dobbs, Charles Rexburg and Angela H.Byers.”Big data: The net frontier for innovations, competition and productivity c Kinsey Global Institute (2011) 1-137 May 2011. [11]. Boyd ,Dana and Crawford,Kate “Six Provocations for Big Data”Working Paper –Oxford Internet Institute 21Sept.2011http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1926431 Boyd, Dana and Crawford, Kate. [12]. Bohanec .M (2001) What is Decision Support? Proceedings Information Society IS-2001: Data Mining and Decision Support in action! (pp 86-89), Ljubljana, Slovenia [13]. Bajec,M & Krisper,M (2005) .A Methodology and Tool Support for Managing Business Rules in Organizations ,Information Systems,30,423-443 [14]. Holsheimer,M (1999) data mining by Business Users :Integrating Data Mining in Business Process. Proceedings International Conference on Knowledge Discovery and Data Mining KDD-99( p.p 266-291) ,San Diego USA:ACM. ACKNOWLEDGEMENT One of the author’s Mr. Rajasekharaiah K.M. thanks Ms. Chhaya Dule, Asst.Prof. Jyothy Institute of Technology, Bangalore for her valuable suggestions. AUTHOR: Presently Mr. Rajasekharaiah K.M. is working as Professor & HOD Department of Computer Science & Engineering, Jnana Vikas Institute of Technology, Bangalore. He has done M.Tech. in Computer Science & Engg. M.Sc. Information Technology, M.Phil. in Computer Science, and PGDIT from reputed Universities, India. He is having 30+ years of total experience including 16 years of Industrial experiences. He is a Life fellow Member of Indian Society for Technical Education (ISTE), New Delhi. He is presently pursuing the doctoral degree in the Branch of Computer Science & Engineering, in the domain area of Data Mining & Warehousing.
  • 6. P.K. Srimani et al., International Journal of Engineering, Business and Enterprise Applications, 8(2), March-May., 2014, pp. 105-110 IJEBEA 14-267; © 2014, IJEBEA All Rights Reserved Page 110 He has research publications in reputed national and international journals. His other area of interests are DBMS, Software Engg., Software Architecture, Computer Networks, Programming Languages, Data Structures and Mobile Computing. He is also a resource scholar for other Engineering Colleges/University Screen Shots, Reports and Dashboard Snapshots Report: 1 Report: 2 Report: 3 Report: 4 Report: 5 Report: 6