This document summarizes Evgeny Babaskin's projects for Volkswagen Finanz and Volkswagen Group RUS. The projects aimed to integrate data from various sources and to build a data warehouse and reporting system. Key results included reducing data quality problems in master data, automatically synchronizing data between systems, and developing more than 127 tailor-made reports for different business units. The solutions used IBM DataStage, DB2, and Alphablox to extract, transform, load, and analyze data.
3. Volkswagen Finanz Data integration
The goal of this project was to eliminate a number of data quality problems in master data (clients, dealers, contracts) and in legacy files with leasing information.
I used a mixed approach based on ETL (extract-transform-load) and web services.
The core of the platform is IBM middleware: DataStage, DB2, and WebSphere Application Server.
The cleansing solution is built on a nonlinear statistical approach (proprietary Java plug-ins for the Information Server).
Results:
High performance of the whole platform.
High precision in data deduplication (duplicates in the ERP and FI systems fell from 23% at the beginning of the project to 0.3% after cleansing).
The solution significantly simplified ETL procedures for the data warehouse.
Data from legacy sources has been uploaded to the ERP system.
The ERP and FI systems synchronize data automatically.
The delay in uploading credit reports from banks (in semi-structured Excel format) has been reduced significantly.
Evgeny BABASKIN
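The duplicate percentages on this slide depend on how a duplicate is defined. The following is only a minimal sketch of one possible measurement, assuming exact matching on a normalized name; it is not the proprietary statistical cleansing used in the project. The function names `normalize` and `duplicate_rate` and the sample records are hypothetical.

```python
from collections import Counter


def normalize(name: str) -> str:
    """Lower-case the name and collapse runs of whitespace (assumed matching key)."""
    return " ".join(name.lower().split())


def duplicate_rate(names: list[str]) -> float:
    """Return the share of records that repeat another record's normalized key."""
    if not names:
        return 0.0
    counts = Counter(normalize(n) for n in names)
    # Every occurrence beyond the first one of a key counts as a redundant record.
    redundant = sum(c - 1 for c in counts.values())
    return redundant / len(names)


# Made-up sample records, for illustration only.
records = ["Autohaus Nord", "autohaus   nord", "Bank Alpha", "Dealer Beta"]
rate = duplicate_rate(records)
print(f"duplicate rate: {rate:.1%}")
```

A rate measured this way before and after cleansing gives figures comparable in form to the 23% and 0.3% quoted above, although the real metric may have been defined differently.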
5. Volkswagen Finanz Data integration. Example
I would like to present a short example of one of the integration processes: exact data matching for the "agents" table (clients, dealers, banks, etc.).
The information in this table MUST be managed, but the company uses many data sources and was not able to manage the data properly.
The idea is to implement a solution that changes the existing systems as little as possible.
We defined a point at which the data about an agent is "known" and should be synchronized between systems. While a new portion of information is in a "draft" state, it cannot be used to produce documents; from that point of view it does not exist.
Results:
Only one simple dialog window is needed.
Communication with the systems is done through SOAP (a standard protocol).
The methodology and solution are reusable and have been used extensively to correct mistakes in reports from banks and other external systems.
Data quality has been improved significantly.
A module for the reporting system has been developed to control data quality.
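The "draft" versus "known" rule described on this slide can be sketched as a small state gate. This is an illustrative sketch under stated assumptions, not the project's implementation: the real systems were reached through SOAP, while here a hypothetical in-memory `TargetSystem` stands in for them, and all class and function names are invented for the example.

```python
from dataclasses import dataclass, field
from enum import Enum


class State(Enum):
    DRAFT = "draft"   # record exists only locally and must not be used
    KNOWN = "known"   # record is confirmed and synchronized


@dataclass
class Agent:
    agent_id: str
    name: str
    state: State = State.DRAFT


@dataclass
class TargetSystem:
    """Hypothetical stand-in for an ERP or FI system behind a SOAP endpoint."""
    name: str
    agents: dict[str, str] = field(default_factory=dict)

    def upsert(self, agent: Agent) -> None:
        self.agents[agent.agent_id] = agent.name


def can_produce_documents(agent: Agent) -> bool:
    """Draft records are treated as if they did not exist."""
    return agent.state is State.KNOWN


def confirm_and_sync(agent: Agent, systems: list[TargetSystem]) -> None:
    """Mark the agent as known and push it to every connected system."""
    agent.state = State.KNOWN
    for system in systems:
        system.upsert(agent)


erp, fi = TargetSystem("ERP"), TargetSystem("FI")
dealer = Agent("A-1", "Example Dealer")
print(can_produce_documents(dealer))  # still a draft at this point
confirm_and_sync(dealer, [erp, fi])
print(erp.agents == fi.agents)
```

Keeping the transition in a single function mirrors the slide's point that only one synchronization point is needed, so the existing systems change as little as possible.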
7. Volkswagen Finanz Data warehouse and Reporting system
The reporting system is built on the IBM Alphablox framework, which supports the development of web-based reports.
A number of additional modules have been developed: XML/XSL-based interfaces for reports (improving the reusability of interface components), an e-mail module for automatic notification, integration with IBM InfoSphere DataStage (the integration platform), and a module for dashboards (KPIs for top management).
Data is uploaded to the DWH database using DataStage parallel jobs (with a number of proprietary Java transformers and parallel functions).
Results:
One-window approach: all information about the current state and evolution of the enterprise is available in one application (subject to security rules).
3 modules for different groups of users (KPIs, reporting for managers, and a data quality check module for administrators).
High performance.
Significantly shorter delays in the development cycle.
8. Volkswagen Group RUS Data warehouse and Reporting system
Task:
Develop a high-performance solution to produce analytical reports for all divisions of VW GR (car importer).
All divisions have different approaches to reporting (from OLAP to "document"-formatted reports).
The data warehouse should also serve as an integration solution between internal and external systems (warranty data, sales results, etc.).
Tools and solutions:
IBM DB2 Alphablox (reporting framework) + XML-based interfaces
IBM DB2 (database)
IBM InfoSphere DataStage (ETL tool)
Results:
One-window approach.
More than 127 tailor-made reports for different groups of users (Sales, Aftersales, Warranty, Logistics, Marketing, Call center, etc.).