SlideShare a Scribd company logo
1 of 64
Download to read offline
Big Data Pilot Demo Days
Despina Kopanaki (FORTH) dkopanaki@ics.forth.gr
Marieke Willems (Trust-IT) m.willems@trust-itservices.com
Andrea Schillaci (Trust-IT) a.schillaci@trust-itservices.com
BDV PPP Summit 2020 went virtual
29/05/20 2
Why Big Data Pilot Demo Days?
The new data-driven industrial revolution highlights the need for
big data technologies to unlock the potential in various
application domains.
BDV PPP projects I-BiDaaS, BigDataStack and Track & Know and
deliver innovative technologies to address the emerging needs of
data operations and applications.
To fully exploit the sustainability of the developed technologies,
the projects onboarded pilots that exhibit their applicability in a
wide variety of sectors.
In their third and final year, the projects are ready to demonstrate
the developed and implemented technologies to interested end-
users from industry as well as technology providers, for further
adoption.
29/05/20 3
BDV PPP Projects Join Forces
29/05/20 4
Industrial-Driven Big Data as a Self-Service Solution
Holistic stack for big data applications and
operations
Big Data for Mobility Tracking Knowledge
Extraction in Urban Areas
BDV PPP projects joining forces to showcase application of innovative technologies in a
variety of domains, fostering futher adoption, contributing to Europe’s digital future.
Big Data Pilot Demo Days - A Series of Webinars
5
I-BiDaaS Application
to the Financial Sector
1
BigDataStack Connected
Consumer demo
Track & Know applied to the
fleet management sector
2 3
456
7 8 9
A BigDataStack
Seafarer’s Tale on Real
Time Shipping
I-BiDaaS Application to the
Telecommunication Sector
I-BiDaaS Application to the
manufacturing sector
Track & Know applied to
patient mobility in the
healthcare sector
BigDataStack
Smart Insurance
Track & Know applied to
the automotive
insurance sector
Today’s Main Topic –
Big Data as a Self-Service Solution
I-BiDaaS overview
CaixaBank’s Pitches: Setting the requirements
I-BiDaaS architecture: Scientific & Technical view; how it
addresses the requirements set by CaixaBank.
Step by Step demonstration of I-BiDaaS solution and its
application to the banking sector.
Questions & Answers
29/05/20 6
Webinar Speakers
729/05/2020
Assistant Professor at the Department of
Mathematics and Informatics, Faculty of Sciences,
University of Novi Sad, Serbia
I-BiDaaS Scientific & Technical Manager.
Project Manager at
Security Innovation & Transformation,
CaixaBank, Barcelona
Dr. Dušan Jakovetić
University of Novi Sad, Serbia
Dr. Ramon Martin
de Pozuelo
CaixaBank
I-BiDaaS Application to the Financial Sector
Thursday, May 21, 2020 - 14:00-15:00 CEST
Big Data Pilot Demo Days
Dusan Jakovetic
Ass. Professor, University of Novi Sad, Faculty of Sciences, Serbia;
I-BiDaaS Scientific & Technical Manager
I-BiDaaS Overview
I-BiDaaS Application to the Financial Sector
Thursday, May 21, 2020 - 14:00-15:00 CEST
Identity card
http://www.ibidaas.eu/ @Ibidaas https://www.linkedin.com/in/i-bidaas/
1010
I-BiDaaS Consortium
1. FOUNDATION FOR RESEARCH AND TECHNOLOGY HELLAS (FORTH)
2. BARCELONA SUPERCOMPUTING CENTER - CENTRO NACIONAL DE
SUPERCOMPUTACION (BSC)
3. IBM ISRAEL - SCIENCE AND TECHNOLOGY LTD (IBM)
4. CENTRO RICERCHE FIAT SCPA (CRF)
5. SOFTWARE AG (SAG)
6. CAIXABANK, S.A (CAIXA)
7. THE UNIVERSITY OF MANCHESTER (UNIMAN)
8. ECOLE NATIONALE DES PONTS ET CHAUSSEES (ENPC)
9. ATOS SPAIN SA (ATOS)
10. AEGIS IT RESEARCH LTD (AEGIS)
11. INFORMATION TECHNOLOGY FOR MARKET LEADERSHIP (ITML)
12. University of Novi Sad Faculty of Sciences Serbia (UNSPMF)
13. TELEFONICA INVESTIGACION Y DESARROLLO SA (TID)
1111
Motivation
12
EuropeanDataEconomy
Essential resource for growth,
competitiveness, innovation, job creation
and societal progress in general
Organizations leverage
data pools to drive value
The rise of the demand for platforms in the
market empowering end users to analyze
The convergence of internet of things
(IoT), cloud, and big data transforms our
economy and society
Self-service solutions are transformative
for organizations
Building a European Data Economy (Jan
2017)
Continue to struggle to turn opportunity
from big data into realized gains
Companies call upon expert analysts and
consultants to assist them
A completely new paradigm towards big
data analytics
The right knowledge, and insights decision-
makers need to make the right decisions.
Towards a common European data space
(Apr 2018)
Towards a thriving data-driven economy
(Jul 2014)
Digital Single Market
12
Our Vision
A complete and safe environment for
methodological big data experimentation
Tool and services to increase the
quality of data analytics
A Big Data as a Self-Service solution that helps
breaking industrial data silos and boosts EU's
data-driven economy
Tools and services for fast ingestion
and consolidation of both realistic
and fabricated data
Tools and services for the management
of heterogeneous infrastructures
including elasticity
Increases impact in research community
and contributes to industrial innovation
capacity
1313
Project Statement
I-BiDaaS aims to empower users to easily utilize and interact
with big data technologies, by designing, building, and
demonstrating, a unified framework that:
significantly increases the speed of data analysis while coping
with the rate of data asset growth, and facilitates cross-
domain data-flow towards a thriving data-driven EU
economy.
I-BiDaaS will be tangibly validated by three real-world,
industry-lead experiments.
1414
CAIXA
Enhance control of customers to online banking
Advanced analysis of bank transfer payment in
financial terminal
Analysis of relationships through IP address
CRF
Production process of aluminium casting
Maintenance and monitoring of production
assets
TID
Accurate location prediction with high traffic and
visibility
Optimization of placement of telecommunication
equipment
Quality of Service in Call Centers
Telecommunications
Industry
Banking/Finance Industry
Manufacturing Industry
Application / Experimentation
1515
16
I-BiDaaS application domains
16
Ramon Martín de Pozuelo
Project Manager at Security Innovation & Transformation at
CaixaBank
Setting the Pilot Requirements
I-BiDaaS Application to the Financial Sector
Thursday, May 21, 2020 - 14:00-15:00 CEST
18
CaixaBank and the Use of Data
CaixaBank is the leading financial group in Spain,
both in banking and insurance and it is developing a
strategy of diversification with stakes in international
banks and also within leading service companies.
13.8M Customers
4.2K Branches
9.1K ATMs
4.8M Mobile Banking
5.8M On-line banking
32K Employees
In January 2014 CaixaBank created the Big Data Department
We manage 1.247 TB of information only in our Big data
Department of more than 100 internal people providing Data analytics services to all the
Organization.
Due to Regulation constraints all the infrastructure and the analysis is done internally
18
19
Why do we need I-BiDaaS?
Requirements
Lack of agility in our Datapool for extracting data externally.
Security procedures and constraints are necessary but hinder and slow down data sharing
processes.
We have tons of data but confidential.
We are the data managers but the real owners of the data are our customers.
Current data sharing situation in CaixaBank and I-BiDaaS approach
Requirement Control
Data privacy
Data of customers (e.g., social graph) and external suppliers
(e.g., SIEM) can not be accessed or shared with other partners
Regulation
Compliance
All activities related to CaixaBank within the project must comply
with the relevant regulations (e.g., ISO 27001, GDPR)
Fraud and
Security
Analytics
Use cases presented will be related to the improvement of
security and the prevention of fraud.
Objective: Exploit the I-BiDaaS platform to gain agility, efficiency and flexibility in our analytics for security.
19
§ Breaking external and cross-sectorial data silos while
complying Regulation
§ Sharing Data models with other FI
§ Sharing Data models with other sectors
§ Following ECB & Banco de España constraints, we already have
proved with I-BiDaas that this is possible
Why do we need I-BiDaaS?
20
§ Secure Self-Service Infrastructure
§ Being able to outsource Big Data Infrastructure preserving
privacy & security
§ To grow in a dedicated and specialized infrastructure
§ To count on dedicated and specialized specialists
Why do we need I-BiDaaS?
21
§ Competitiveness & Innovation
§ Fast, agile and specialized adaptation to new
technology.
§ Vs Current proprietary infrastructure
§ Efficiency
§ Reducing the costs and time of analyzing large datasets
Why do we need I-BiDaaS?
22
Dusan Jakovetic
Assistant Professor, University of Novi Sad, Serbia;
I-BiDaaS Scientific & Technical Manager
Big Data Architecture
I-BiDaaS Application to the Financial Sector
Thursday, May 21, 2020 - 14:00-15:00 CEST
• Expert mode
Analyze your DataUsers
• Import your data
• Self-service mode
Data
• Fabricate Data
• Stream & Batch Analytics
• Expert: Upload your code
• Self-service: Select an
algorithm from the pool
Results
• Visualize the results
• Share models
Do it
yourself
In a flexible
manner
Break data silos Safe environment Interact with Big Data
technologies
Increase speed of
data analysis
Cope with the rate of
data asset growth
Intra- and inter-
domain data-
flow
Benefits of using I-BiDaaS
• Co-develop mode
The I-BiDaaS Solution: Front-end
• Co-develop: custom end-
to-end application
• Tokenize data
24
• Expert mode
Analyze your DataUsers
• Import your data
• Self-service mode
Data
• Fabricate Data
• Stream & Batch Analytics
• Expert: Upload your code
• Self-service: Select an
algorithm from the pool
Results
• Visualize the results
• Share models
Do it
yourself
In a flexible
manner
Break data silos Safe environment Interact with Big Data
technologies
Increase speed of
data analysis
Cope with the rate of
data asset growth
Intra- and inter-
domain data-
flow
Benefits of using I-BiDaaS
• Co-develop mode
The I-BiDaaS Solution: Front-end
• Co-develop: custom end-
to-end application
• Tokenize data
25
• Expert mode
Analyze your DataUsers
• Import your data
• Self-service mode
Data
• Fabricate Data
• Stream & Batch Analytics
• Expert: Upload your code
• Self-service: Select an
algorithm from the pool
Results
• Visualize the results
• Share models
Do it
yourself
In a flexible
manner
Break data silos Safe environment Interact with Big Data
technologies
Increase speed of
data analysis
Cope with the rate of
data asset growth
Intra- and inter-
domain data-
flow
Benefits of using I-BiDaaS
• Co-develop mode
The I-BiDaaS Solution: Front-end
• Co-develop: custom end-
to-end application
• Tokenize data
26
Specify data
fabrication rules
Fabricate & save the fabricated
data in the I-BiDaaS external
platform [e.g., ATOS cloud]
Visualize data/data sample
[e.g., graphs, charts, etc.]
Select pre-defined analytics from
the pool / use code template to
write a new script (COMPSs)
Set algorithm parameters/choose
default values
Tune alg. paramet. inside the
visualis. without coding [e.g.,
sliding window]
Save the Big Data
pipeline/analytics on the
local premises [e.g.,
company’s cluster]
Specify real data sources
in the local warehouse for
processing
Execute the saved
pipeline/analytics on the real
data
Visualize execution &
results
Extract new fabrication rules &
save the rules;
save the pipeline & analytics
Fabricate/Tokenize
Experiment&test
Runatlocalprem.&realdata
Execute a trial experiment;
Select amount of resources
visualize execution time & results
I-BiDaaS – Prototypical Experimental Workflow
27
Specify data
fabrication rules
Fabricate & save the fabricated
data in the I-BiDaaS external
platform [e.g., ATOS cloud]
Visualize data/data sample
[e.g., graphs, charts, etc.]
Select pre-defined analytics from
the pool / use code template to
write a new script (COMPSs)
Set algorithm parameters/choose
default values
Tune alg. paramet. inside the
visualis. without coding [e.g.,
sliding window]
Save the Big Data
pipeline/analytics on the
local premises [e.g.,
company’s cluster]
Specify real data sources
in the local warehouse for
processing
Execute the saved
pipeline/analytics on the real
data
Visualize execution &
results
Extract new fabrication rules &
save the rules;
save the pipeline & analytics
Execute a trial experiment;
Select amount of resources
visualize execution time & results
Fabricate/Tokenize
Experiment&test
Runatlocalprem.&realdata
I-BiDaaS – Experimental Workflow
28
The I-BiDaaS Solution:
Architecture/back-end
Medium to long term business decisions
Data
Fabrication
Platform
(IBM)
Refined
specifications
for data
fabrication
GPU-accelerated Analytics
(FORTH)
Apama Complex Event
Processing (SAG)
Streaming Analytics
Batch Processing
Advanced ML (UNSPMF)
COMPsProgramming
Model(BSC)
Query Partitioning
Infrastructurelayer: Private cloud; Commodity cluster; GPUs
Pre-defined
Queries
SQL-like
interface
Domain
Language
Programming
API
User Interface
Resource managementand orchestration (ATOS)
Advanced
Visualis.
Advanced IT services for
Big Data processing tasks;
Open source pool of ML
algorithms
Data ingestion
Programming
Interface /
Sequential
Programming
(AEGIS+SAG)
(AEGIS)
COMPsRuntime
(BSC)
Distributed
large scale
layer
Application layer
UniversalMessaging(SAG)
TestData
Fabrication(IBM)
Meta-
data;
Data
descri-
ption
Hecuba tools
(BSC)
Short term decisions
real time alerts
Model structure improvements
Learnedpatterns correlations
29
WP2:
Data, user interface, visualization
Technologies:
• IBM TDF
• SAG UM
• AEGIS AVT
Medium to long term business decisions
Data
Fabrication
Platform
(IBM)
Refined
specifications
for data
fabrication
GPU-accelerated Analytics
(FORTH)
Apama Complex Event
Processing (SAG)
Streaming Analytics
Batch Processing
Advanced ML (UNSPMF)
COMPs Programming
Model (BSC)
Query Partitioning
Infrastructure layer: Private cloud; Commodity cluster; GPUs
Pre-defined
Queries
SQL-like
interface
Domain
Language
Programming
API
User Interface
Resource management and orchestration (ATOS)
Advanced
Visualis.
Advanced IT services for
Big Data processing tasks;
Open source pool of ML
algorithms
Programming
Interface /
Sequential
Programming
(AEGIS+SAG)
(AEGIS)
COMPs Runtime
(BSC)
Distributed
large scale
layer
Application layer
UniversalMessaging(SAG)
DataFabrication
Platform(IBM)
Meta-
data;
Data
descri-
ption
Hecuba tools
(BSC)
Short term decisions
real time alerts
Model structure improvements
Learned patterns correlations
http://ibidaas.eu/tools
30
Medium to long term business decisions
Data
Fabrication
Platform
(IBM)
Refined
specifications
for data
fabrication
GPU-accelerated Analytics
(FORTH)
Apama Complex Event
Processing (SAG)
Streaming Analytics
Batch Processing
Advanced ML (UNSPMF)
COMPs Programming
Model (BSC)
Query Partitioning
Infrastructure layer: Private cloud; Commodity cluster; GPUs
Pre-defined
Queries
SQL-like
interface
Domain
Language
Programming
API
User Interface
Resource management and orchestration (ATOS)
Advanced
Visualis.
Advanced IT services for
Big Data processing tasks;
Open source pool of ML
algorithms
Programming
Interface /
Sequential
Programming
(AEGIS+SAG)
(AEGIS)
COMPs Runtime
(BSC)
Distributed
large scale
layer
Application layer
UniversalMessaging(SAG)
DataFabrication
Platform(IBM)
Meta-
data;
Data
descri-
ption
Hecuba tools
(BSC)
Short term decisions
real time alerts
Model structure improvements
Learned patterns correlations
WP3:
Batch analytics
Technologies:
• BSC COMPSs
• BSC Hecuba
• BSC Qbeast
• Advanced ML (UNSPMF)
http://ibidaas.eu/tools
31
Medium to long term business decisions
Data
Fabrication
Platform
(IBM)
Refined
specifications
for data
fabrication
GPU-accelerated Analytics
(FORTH)
Apama Complex Event
Processing (SAG)
Streaming Analytics
Batch Processing
Advanced ML (UNSPMF)
COMPs Programming
Model (BSC)
Query Partitioning
Infrastructure layer: Private cloud; Commodity cluster; GPUs
Pre-defined
Queries
SQL-like
interface
Domain
Language
Programming
API
User Interface
Resource management and orchestration (ATOS)
Advanced
Visualis.
Advanced IT services for
Big Data processing tasks;
Open source pool of ML
algorithms
Programming
Interface /
Sequential
Programming
(AEGIS+SAG)
(AEGIS)
COMPs Runtime
(BSC)
Distributed
large scale
layer
Application layer
UniversalMessaging(SAG)
DataFabrication
Platform(IBM)
Meta-
data;
Data
descri-
ption
Hecuba tools
(BSC)
Short term decisions
real time alerts
Model structure improvements
Learned patterns correlations
WP4:
Streaming analytics
Technologies:
• SAG Apama CEP
• FORTH GPU-accel. analytics
http://ibidaas.eu/tools
32
Medium to long term business decisions
Data
Fabrication
Platform
(IBM)
Refined
specifications
for data
fabrication
GPU-accelerated Analytics
(FORTH)
Apama Complex Event
Processing (SAG)
Streaming Analytics
Batch Processing
Advanced ML (UNSPMF)
COMPs Programming
Model (BSC)
Query Partitioning
Infrastructure layer: Private cloud; Commodity cluster; GPUs
Pre-defined
Queries
SQL-like
interface
Domain
Language
Programming
API
User Interface
Resource management and orchestration (ATOS)
Advanced
Visualis.
Advanced IT services for
Big Data processing tasks;
Open source pool of ML
algorithms
Programming
Interface /
Sequential
Programming
(AEGIS+SAG)
(AEGIS)
COMPs Runtime
(BSC)
Distributed
large scale
layer
Application layer
UniversalMessaging(SAG)
DataFabrication
Platform(IBM)
Meta-
data;
Data
descri-
ption
Hecuba tools
(BSC)
Short term decisions
real time alerts
Model structure improvements
Learned patterns correlations
WP5:
Resource mgmt & integration
Technologies:
• ATOS Resource mgmt
• ITML integration services
http://ibidaas.eu/tools
33
Data fabrication capabilities
Solution flexibility
Easy to code programming paradigm
High code reusability
Key Features & Innovations
https://www.ibidaas.eu/deliverables
34
Data fabrication capabilities
Solution flexibility
Easy to code programming paradigm
High code reusability
CAIXA: From 3 months to 1-2
weeks for a new proof-of-
concept Big Data technology
Key Features & Innovations
https://www.ibidaas.eu/deliverables
35
Data fabrication capabilities
Solution flexibility
Easy to code programming paradigm
High code reusability
CAIXA: Bank transfers use case:
~3-4 less time to analysis
Key Features & Innovations
https://www.ibidaas.eu/deliverables
36
GPU-accelerated analytics; Synergy of CEP and GPU-
accelerated analytics for streaming data
Feedback from analytics to data fabrication
Feedback from analytics to problem modelling
Demonstrated on use cases across 3 different data providers
and 3 different industries
https://www.ibidaas.eu/deliverables
Key Features & Innovations (Cont’d)
37
Heterogeneous
Data Sources
Medium to long term business decisions
Data
Fabrication
Platform
(IBM)
Refined
specifications
for data
fabrication
GPU-accelerated Analytics
(FORTH)
Apama Complex Event
Processing (SAG)
Streaming Analytics
Batch Processing
Advanced ML (UNSPMF)
COMPs Programming
Model (BSC)
Query Partitioning
Infrastructure layer: Private cloud; Commodity cluster; GPUs
Pre-defined
Queries
SQL-like
interface
Domain
Language
Programming
API
User Interface
Resource management and orchestration (ATOS)
Advanced
Visualis.
Advanced IT services for
Big Data processing tasks;
Open source pool of ML
algorithms
Data ingestion and
integration
Programming
Interface /
Sequential
Programming
(AEGIS+SAG)
(AEGIS)
COMPs Runtime
(BSC)
Distributed
large scale
layer
Application layer
UniversalMessaging(SAG)
DataFabrication
Platform(IBM)
Meta-
data;
Data
descri-
ption
Hecuba tools
(BSC)
Short term decisions
real time alerts
Model structure improvements
Learned patterns correlations
https://www.ibidaas.eu/deliverables
I-BiDaaS Solution:
A CaixaBank Use Case Example
38
Heterogeneous
Data Sources
Medium to long term business decisions
Data
Fabrication
Platform
(IBM)
Refined
specifications
for data
fabrication
GPU-accelerated Analytics
(FORTH)
Apama Complex Event
Processing (SAG)
Streaming Analytics
Batch Processing
Advanced ML (UNSPMF)
COMPs Programming
Model (BSC)
Query Partitioning
Infrastructure layer: Private cloud; Commodity cluster; GPUs
Pre-defined
Queries
SQL-like
interface
Domain
Language
Programming
API
User Interface
Resource management and orchestration (ATOS)
Advanced
Visualis.
Advanced IT services for
Big Data processing tasks;
Open source pool of ML
algorithms
Data ingestion and
integration
Programming
Interface /
Sequential
Programming
(AEGIS+SAG)
(AEGIS)
COMPs Runtime
(BSC)
Distributed
large scale
layer
Application layer
UniversalMessaging(SAG)
DataFabrication
Platform(IBM)
Meta-
data;
Data
descri-
ption
Hecuba tools
(BSC)
Short term decisions
real time alerts
Model structure improvements
Learned patterns correlations
https://www.ibidaas.eu/deliverables
I-BiDaaS Solution:
A CaixaBank Use Case Example
Data fabrication:
75% time reduction for data
access by external
stakeholders
Real time analytics:
Detecting relations of
users in real time
Solution flexibility:
Found more useful
results than before the
project
39
Ramon Martín de Pozuelo
Project Manager at Security Innovation & Transformation at
CaixaBank
Financial Pilot: step by step
I-BiDaaS Application to the Financial Sector
Thursday, May 21, 2020 - 14:00-15:00 CEST
Current situation
§ Fraud Detection Analytics in CaixaBank
§ Currently, CaixaBank analytics are executed in-house.
§ Analytics lifecycle:
§ Security analysis data storage: DataPool (Oracle), Qradar (IBM)
§ Data exploration phase to build a model/query (expensive to run)
§ Execution of model/query on data in production mode
§ This process is executed periodically.
Data Pool
Data
exploration
Production
Analysis
Data model,
Query/program
41
CaixaBank Datapool Platform
Dataset recipe or
Tokenised Data
Analytics tools and
visualization
Data generation
Streams
Batch
I-BiDaaS Platform
Algorithms, expertise
I-BiDaaS solution
42
CaixaBank Data Roadmap
§ Synthetic data usage:
§ We explored the synthetic
data solution in the MVP.
§ New opportunities:
§ The possibility to work with
real data outside CaixaBank
in a secure way.
§ We moved from totally
synthetic approach to
tokenized real datasets.
§ We include a comparison
between synthetic and real
data to know better the
differences
43
Initial Data & use cases
Enhance control over
third party agencies
Facilitate the
analysis / detection
of connections
from external
suppliers.
Advanced analysis of
bank transfer payment
in financial terminal
Facilitate the
analysis / detection
of fraudulent
transfers through
Financial Terminal.
Analysis of
relationships through
IP address
Facilitate the
analysis / detection
of user
relationships with
the same
residential IP.
Building of a social
graph
Choose a graph-
oriented DB.
Test technologies
and tools for the
treatment of the
graph.
Validate the use of synthetic data for analysis, if the rules act in the same
situations as with the real data.
Establish testing
environment for new
Big Data tools without
sensitive data
constraints
44
Final Data & use cases
Enhance control of
customers to online banking
Facilitate the analysis /
detection of fraudulent
mobile to mobile bank
transfers in online
banking.
Advanced analysis of bank
transfer payment in financial
terminal
Facilitate the analysis /
detection of fraudulent
transfers through
Financial Terminal.
Analysis of relationships
through IP address
Facilitate the analysis /
detection of user
relationships with the
same residential IP.
Validate the use of synthetic
data for analysis, if the rules
act in the same situations as
with the real data.
Establish testing environment for new Big Data tools outside of
CaixaBank premises.
Open CaixaBank data to a wider community and explore novel data
analytics methodologies.
Syntheticand
realtokenizeddatasets
45
Use Case I-BiDaaS dataset Data
Enhance control of customers to
online banking
Online banking connections Real tokenized
Advanced analysis of bank transfer
payment in financial terminal
Bank transfers Real tokenized
Analysis of relationships through IP
address
IP address Synthetic / real
tokenized
Definition of the relationship
Synthetic Data Generation
Generation of non-
sensible data via rule
definition
Non-
sensitive
data Analytics
§ Business Goal:
The transaction between two people related
by an IP is not Fraud.
§ Use Case Goal:
Validation of Synthetic Data usage.
46
CAIXA 1st case MVP use case
47
CAIXA 1st case MVP use case
48
CAIXA 1st case MVP use case
49
CAIXA 1st case MVP use case
50
CAIXA 1st case MVP use case
51
CAIXA 1st case MVP use case
52
CAIXA 1st case MVP use case
Streamline CaixaBank processes to grant permits for data
access at the start of a project with an external provider
Streamline the process of establishing work environment
and scenario for PoCs without the need to use sensitive
data.
Synthetic data
53
How I-BiDaaS is helping us?
Big Data Analytics objective: Discover fraudulent scenarios from our data by analysing the presence or not of the client.
IDENTITY NUMBER
ADDITIONAL NOTES
TIME OF THE TRANSFER
Ensure the security of our
data: Decide what are we
going to share clear, tokenized
or encrypted.
The challenge relies on finding
the limit of what and how real
data can be shared to comply
to regulation and not lose
aditional and valuable
information for analytics.
Use case high-level objective: Breaking internal and external silos
Financial Operation Data, Security Management (SIEM), etc.
1.000kk of rows per month
54
CAIXA 2nd case
Analyse bank transfers executed from employees financial terminal in the name of a customer.
Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of
the movement.)
Point of view:
• Final User
• Programmer
CONTEXTOBJECTIVES
• Identify and glue events, followed by enriching the transfer payment dataset
• Encrypt the data without lossing value
• Advanced analytics
55
CAIXA 2nd case
Analyse bank transfers executed from employees financial terminal in the name of a customer.
Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of
the movement.)
56
CAIXA 2nd case
Analyse bank transfers executed from employees financial terminal in the name of a customer.
Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of
the movement.)
57
CAIXA 2nd case
Analyse bank transfers executed from employees financial terminal in the name of a customer.
Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of
the movement.)
Comparing solutions and processes to analyse
data outside CaixaBank premises
Data Analytics
comercial products
58
CAIXA 2nd case
Analyse bank transfers executed from employees financial terminal in the name of a customer.
Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of
the movement.)
59
CAIXA 2nd case
Analyse bank transfers executed from employees financial terminal in the name of a customer.
Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of
the movement.)
60
CAIXA 2nd case
Analyse bank transfers executed from employees financial terminal in the name of a customer.
Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of
the movement.)
DataRobot comparison results
• Custom solutions:
• I-BiDaaS provide more flexibility in the definition of your own code, scoring metrics, etc.
• Unsupervised learning:
• DataRobot has very limited number of unsupervised learning models. I-BiDaaS can provide
much more detailed results on unsupervised learning use cases.
61
CAIXA 2nd case
Analyse bank transfers executed from employees financial terminal in the name of a customer.
Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of
the movement.)
Establish testing environment for new Big
Data analytics tools outside of CaixaBank
premises.
Open CaixaBank data to a wider community
and explore novel data analytics
methodologies.
High-level objective: Breaking internal and external silos
Financial Operation Data, Security Management (SIEM), etc.
Real Tokenised Data
62
How I-BiDaaS is helping us?
Benefits KPIs
To increase the efficiency and competitiveness in the
management of its vast and complex amounts of data.
75% time reduction data access from external
stakeholders using synthetic data (From 6 to
1.5 days).
To break data silos not only internally, but also fostering and
triggering internal procedures to open data to external
stakeholders.
Real data accessed by at least 6 different
external entities skipping long-time data
access procedures.
To evaluate Big Data analytics tools with real-life use cases of
CaixaBank in a much more agile way.
I-BiDaaS overall solution and tools
experimentation with 3 different industrial
use cases with real data.
63
CaixaBank benefits from I-BiDaaS
Questions?
Thank you!
Your feedback is valuable for us!
https://bit.ly/2Zl19IJ

More Related Content

What's hot

Data Virtualization: Fulfilling The Digital Transformation Requirement In Ban...
Data Virtualization: Fulfilling The Digital Transformation Requirement In Ban...Data Virtualization: Fulfilling The Digital Transformation Requirement In Ban...
Data Virtualization: Fulfilling The Digital Transformation Requirement In Ban...Denodo
 
EDI: Federation of data services to foster the adoption of data-driven AI in ...
EDI: Federation of data services to foster the adoption of data-driven AI in ...EDI: Federation of data services to foster the adoption of data-driven AI in ...
EDI: Federation of data services to foster the adoption of data-driven AI in ...European Data Incubator (EDI)
 
Collaborative Data Management: How Crowdsourcing Can Help To Manage Data
Collaborative Data Management: How Crowdsourcing Can Help To Manage DataCollaborative Data Management: How Crowdsourcing Can Help To Manage Data
Collaborative Data Management: How Crowdsourcing Can Help To Manage DataEdward Curry
 
Creating a Healthcare Data Fabric, and Providing a Single, Unified, and Curat...
Creating a Healthcare Data Fabric, and Providing a Single, Unified, and Curat...Creating a Healthcare Data Fabric, and Providing a Single, Unified, and Curat...
Creating a Healthcare Data Fabric, and Providing a Single, Unified, and Curat...Denodo
 
A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)Denodo
 
Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)Denodo
 
Reinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital TransformationReinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital TransformationDenodo
 
GDPR and Data Ethics considerations in personal data sharing
GDPR and Data Ethics considerations in personal data sharingGDPR and Data Ethics considerations in personal data sharing
GDPR and Data Ethics considerations in personal data sharingBig Data Value Association
 
Three Dimensions of Data as a Service
Three Dimensions of Data as a ServiceThree Dimensions of Data as a Service
Three Dimensions of Data as a ServiceDenodo
 
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...European Data Forum
 
Key Modules for a trsuted and privacy preserving personal data marketplace
Key Modules for a trsuted and privacy preserving personal data marketplaceKey Modules for a trsuted and privacy preserving personal data marketplace
Key Modules for a trsuted and privacy preserving personal data marketplaceBig Data Value Association
 
New Horizons for a Data-Driven Economy – A Roadmap for Big Data in Europe
New Horizons for a Data-Driven Economy – A Roadmap for Big Data in Europe New Horizons for a Data-Driven Economy – A Roadmap for Big Data in Europe
New Horizons for a Data-Driven Economy – A Roadmap for Big Data in Europe inside-BigData.com
 
Session 1 - Introduction to i4Trust Data Spaces, building blocks, and roles |...
Session 1 - Introduction to i4Trust Data Spaces, building blocks, and roles |...Session 1 - Introduction to i4Trust Data Spaces, building blocks, and roles |...
Session 1 - Introduction to i4Trust Data Spaces, building blocks, and roles |...FIWARE
 
International Conference on Big Data and Block chain (BDAB 2020)
International Conference on Big Data and Block chain (BDAB 2020)International Conference on Big Data and Block chain (BDAB 2020)
International Conference on Big Data and Block chain (BDAB 2020)IJDKP
 
NIIT and Denodo: Business Continuity Planning in the times of the Covid-19 Pa...
NIIT and Denodo: Business Continuity Planning in the times of the Covid-19 Pa...NIIT and Denodo: Business Continuity Planning in the times of the Covid-19 Pa...
NIIT and Denodo: Business Continuity Planning in the times of the Covid-19 Pa...Denodo
 
Enabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
Enabling a Bimodal IT Framework for Advanced Analytics with Data VirtualizationEnabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
Enabling a Bimodal IT Framework for Advanced Analytics with Data VirtualizationDenodo
 
Europe rules – making the fair data economy flourish
Europe rules – making the fair data economy flourishEurope rules – making the fair data economy flourish
Europe rules – making the fair data economy flourishSitra / Hyvinvointi
 
Call for Papers - International Conference on Big Data and Blockchain (BDAB 2...
Call for Papers - International Conference on Big Data and Blockchain (BDAB 2...Call for Papers - International Conference on Big Data and Blockchain (BDAB 2...
Call for Papers - International Conference on Big Data and Blockchain (BDAB 2...IJNSA Journal
 

What's hot (20)

Data Virtualization: Fulfilling The Digital Transformation Requirement In Ban...
Data Virtualization: Fulfilling The Digital Transformation Requirement In Ban...Data Virtualization: Fulfilling The Digital Transformation Requirement In Ban...
Data Virtualization: Fulfilling The Digital Transformation Requirement In Ban...
 
EDI: Federation of data services to foster the adoption of data-driven AI in ...
EDI: Federation of data services to foster the adoption of data-driven AI in ...EDI: Federation of data services to foster the adoption of data-driven AI in ...
EDI: Federation of data services to foster the adoption of data-driven AI in ...
 
Collaborative Data Management: How Crowdsourcing Can Help To Manage Data
Collaborative Data Management: How Crowdsourcing Can Help To Manage DataCollaborative Data Management: How Crowdsourcing Can Help To Manage Data
Collaborative Data Management: How Crowdsourcing Can Help To Manage Data
 
Creating a Healthcare Data Fabric, and Providing a Single, Unified, and Curat...
Creating a Healthcare Data Fabric, and Providing a Single, Unified, and Curat...Creating a Healthcare Data Fabric, and Providing a Single, Unified, and Curat...
Creating a Healthcare Data Fabric, and Providing a Single, Unified, and Curat...
 
A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)
 
Data Vaults
Data VaultsData Vaults
Data Vaults
 
Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)
 
Reinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital TransformationReinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital Transformation
 
GDPR and Data Ethics considerations in personal data sharing
GDPR and Data Ethics considerations in personal data sharingGDPR and Data Ethics considerations in personal data sharing
GDPR and Data Ethics considerations in personal data sharing
 
Three Dimensions of Data as a Service
Three Dimensions of Data as a ServiceThree Dimensions of Data as a Service
Three Dimensions of Data as a Service
 
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
 
Key Modules for a trsuted and privacy preserving personal data marketplace
Key Modules for a trsuted and privacy preserving personal data marketplaceKey Modules for a trsuted and privacy preserving personal data marketplace
Key Modules for a trsuted and privacy preserving personal data marketplace
 
Grid07 8 Osborne
Grid07 8 OsborneGrid07 8 Osborne
Grid07 8 Osborne
 
New Horizons for a Data-Driven Economy – A Roadmap for Big Data in Europe
New Horizons for a Data-Driven Economy – A Roadmap for Big Data in Europe New Horizons for a Data-Driven Economy – A Roadmap for Big Data in Europe
New Horizons for a Data-Driven Economy – A Roadmap for Big Data in Europe
 
Session 1 - Introduction to i4Trust Data Spaces, building blocks, and roles |...
Session 1 - Introduction to i4Trust Data Spaces, building blocks, and roles |...Session 1 - Introduction to i4Trust Data Spaces, building blocks, and roles |...
Session 1 - Introduction to i4Trust Data Spaces, building blocks, and roles |...
 
International Conference on Big Data and Block chain (BDAB 2020)
International Conference on Big Data and Block chain (BDAB 2020)International Conference on Big Data and Block chain (BDAB 2020)
International Conference on Big Data and Block chain (BDAB 2020)
 
NIIT and Denodo: Business Continuity Planning in the times of the Covid-19 Pa...
NIIT and Denodo: Business Continuity Planning in the times of the Covid-19 Pa...NIIT and Denodo: Business Continuity Planning in the times of the Covid-19 Pa...
NIIT and Denodo: Business Continuity Planning in the times of the Covid-19 Pa...
 
Enabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
Enabling a Bimodal IT Framework for Advanced Analytics with Data VirtualizationEnabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
Enabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
 
Europe rules – making the fair data economy flourish
Europe rules – making the fair data economy flourishEurope rules – making the fair data economy flourish
Europe rules – making the fair data economy flourish
 
Call for Papers - International Conference on Big Data and Blockchain (BDAB 2...
Call for Papers - International Conference on Big Data and Blockchain (BDAB 2...Call for Papers - International Conference on Big Data and Blockchain (BDAB 2...
Call for Papers - International Conference on Big Data and Blockchain (BDAB 2...
 

Similar to BigDataPilotDemoDays - I-BiDaaS Application to the Financial Sector Webinar

BigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector Webinar
BigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector WebinarBigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector Webinar
BigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector WebinarBig Data Value Association
 
Big Data Value Association (BDVA) - Intro Slide Pack
Big Data Value Association (BDVA) - Intro Slide PackBig Data Value Association (BDVA) - Intro Slide Pack
Big Data Value Association (BDVA) - Intro Slide PackStuart Campbell
 
Idsa, smau, milano, 2019 10-24
Idsa, smau, milano, 2019 10-24Idsa, smau, milano, 2019 10-24
Idsa, smau, milano, 2019 10-24Nadia Fabrizio
 
BICS empowers predictive analytics and customer centricity with a Hadoop base...
BICS empowers predictive analytics and customer centricity with a Hadoop base...BICS empowers predictive analytics and customer centricity with a Hadoop base...
BICS empowers predictive analytics and customer centricity with a Hadoop base...DataWorks Summit
 
Big Data Pilot Demo Days A Big DataStack Seafer's tale of Real-time Shipping
Big Data Pilot Demo Days A Big DataStack Seafer's tale of Real-time ShippingBig Data Pilot Demo Days A Big DataStack Seafer's tale of Real-time Shipping
Big Data Pilot Demo Days A Big DataStack Seafer's tale of Real-time ShippingBig Data Value Association
 
Policy Cloud Data Driven - Policies against Radicalisation
Policy Cloud Data Driven - Policies against RadicalisationPolicy Cloud Data Driven - Policies against Radicalisation
Policy Cloud Data Driven - Policies against RadicalisationBig Data Value Association
 
Policy Cloud Data Driven Policies against Radicalisation
Policy Cloud Data Driven Policies against RadicalisationPolicy Cloud Data Driven Policies against Radicalisation
Policy Cloud Data Driven Policies against RadicalisationBig Data Value Association
 
BDE-BDVA Webinar: Arne Berre and Ana Garcia slides for BDVA/BDE Webinar
BDE-BDVA Webinar: Arne Berre and Ana Garcia slides for BDVA/BDE WebinarBDE-BDVA Webinar: Arne Berre and Ana Garcia slides for BDVA/BDE Webinar
BDE-BDVA Webinar: Arne Berre and Ana Garcia slides for BDVA/BDE WebinarBigData_Europe
 
Cloud computing for making indonesia 4.0
Cloud computing for making indonesia 4.0 Cloud computing for making indonesia 4.0
Cloud computing for making indonesia 4.0 PT Datacomm Diangraha
 
Rick Mutsaers Informatica
Rick Mutsaers InformaticaRick Mutsaers Informatica
Rick Mutsaers InformaticaBigDataExpo
 
Technology Transformation Services
Technology Transformation ServicesTechnology Transformation Services
Technology Transformation ServicesAtos
 
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)Denodo
 
EDI's view on Digital Innovation Hubs Working Group Meeting on Big Data and A...
EDI's view on Digital Innovation Hubs Working Group Meeting on Big Data and A...EDI's view on Digital Innovation Hubs Working Group Meeting on Big Data and A...
EDI's view on Digital Innovation Hubs Working Group Meeting on Big Data and A...European Data Incubator (EDI)
 
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...Michael Lew
 
Session 4 - A practical journey on how to use the DataBench Toolbox
Session 4 - A practical journey on how to use the DataBench ToolboxSession 4 - A practical journey on how to use the DataBench Toolbox
Session 4 - A practical journey on how to use the DataBench ToolboxDataBench
 
Innovative and digital solutions for circularity and sustainability in textiles
Innovative and digital solutions for circularity and sustainability in textilesInnovative and digital solutions for circularity and sustainability in textiles
Innovative and digital solutions for circularity and sustainability in textilesCISUFLO
 

Similar to BigDataPilotDemoDays - I-BiDaaS Application to the Financial Sector Webinar (20)

BigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector Webinar
BigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector WebinarBigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector Webinar
BigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector Webinar
 
Big Data Value Association (BDVA) - Intro Slide Pack
Big Data Value Association (BDVA) - Intro Slide PackBig Data Value Association (BDVA) - Intro Slide Pack
Big Data Value Association (BDVA) - Intro Slide Pack
 
Idsa, smau, milano, 2019 10-24
Idsa, smau, milano, 2019 10-24Idsa, smau, milano, 2019 10-24
Idsa, smau, milano, 2019 10-24
 
BICS empowers predictive analytics and customer centricity with a Hadoop base...
BICS empowers predictive analytics and customer centricity with a Hadoop base...BICS empowers predictive analytics and customer centricity with a Hadoop base...
BICS empowers predictive analytics and customer centricity with a Hadoop base...
 
Big Data Pilot Demo Days A Big DataStack Seafer's tale of Real-time Shipping
Big Data Pilot Demo Days A Big DataStack Seafer's tale of Real-time ShippingBig Data Pilot Demo Days A Big DataStack Seafer's tale of Real-time Shipping
Big Data Pilot Demo Days A Big DataStack Seafer's tale of Real-time Shipping
 
Policy Cloud Data Driven - Policies against Radicalisation
Policy Cloud Data Driven - Policies against RadicalisationPolicy Cloud Data Driven - Policies against Radicalisation
Policy Cloud Data Driven - Policies against Radicalisation
 
Policy Cloud Data Driven Policies against Radicalisation
Policy Cloud Data Driven Policies against RadicalisationPolicy Cloud Data Driven Policies against Radicalisation
Policy Cloud Data Driven Policies against Radicalisation
 
Big Data Pilot Days
Big Data Pilot DaysBig Data Pilot Days
Big Data Pilot Days
 
BDE-BDVA Webinar: Arne Berre and Ana Garcia slides for BDVA/BDE Webinar
BDE-BDVA Webinar: Arne Berre and Ana Garcia slides for BDVA/BDE WebinarBDE-BDVA Webinar: Arne Berre and Ana Garcia slides for BDVA/BDE Webinar
BDE-BDVA Webinar: Arne Berre and Ana Garcia slides for BDVA/BDE Webinar
 
Study: #Big Data in #Austria
Study: #Big Data in #AustriaStudy: #Big Data in #Austria
Study: #Big Data in #Austria
 
Cloud computing for making indonesia 4.0
Cloud computing for making indonesia 4.0 Cloud computing for making indonesia 4.0
Cloud computing for making indonesia 4.0
 
Rick Mutsaers Informatica
Rick Mutsaers InformaticaRick Mutsaers Informatica
Rick Mutsaers Informatica
 
Technology Transformation Services
Technology Transformation ServicesTechnology Transformation Services
Technology Transformation Services
 
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
 
EDI's view on Digital Innovation Hubs Working Group Meeting on Big Data and A...
EDI's view on Digital Innovation Hubs Working Group Meeting on Big Data and A...EDI's view on Digital Innovation Hubs Working Group Meeting on Big Data and A...
EDI's view on Digital Innovation Hubs Working Group Meeting on Big Data and A...
 
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
 
Session 4 - A practical journey on how to use the DataBench Toolbox
Session 4 - A practical journey on how to use the DataBench ToolboxSession 4 - A practical journey on how to use the DataBench Toolbox
Session 4 - A practical journey on how to use the DataBench Toolbox
 
Innovative and digital solutions for circularity and sustainability in textiles
Innovative and digital solutions for circularity and sustainability in textilesInnovative and digital solutions for circularity and sustainability in textiles
Innovative and digital solutions for circularity and sustainability in textiles
 
Big Data a Catalunya
Big Data a CatalunyaBig Data a Catalunya
Big Data a Catalunya
 
Big Data a Catalunya
Big Data a CatalunyaBig Data a Catalunya
Big Data a Catalunya
 

More from Big Data Value Association

Data Privacy, Security in personal data sharing
Data Privacy, Security in personal data sharingData Privacy, Security in personal data sharing
Data Privacy, Security in personal data sharingBig Data Value Association
 
Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...
Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...
Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...Big Data Value Association
 
Three pillars for building a Smart Data Ecosystem: Trust, Security and Privacy
Three pillars for building a Smart Data Ecosystem: Trust, Security and PrivacyThree pillars for building a Smart Data Ecosystem: Trust, Security and Privacy
Three pillars for building a Smart Data Ecosystem: Trust, Security and PrivacyBig Data Value Association
 
Market into context - Three pillars for building a Smart Data Ecosystem: Trus...
Market into context - Three pillars for building a Smart Data Ecosystem: Trus...Market into context - Three pillars for building a Smart Data Ecosystem: Trus...
Market into context - Three pillars for building a Smart Data Ecosystem: Trus...Big Data Value Association
 
BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...
BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...
BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...Big Data Value Association
 
BDV Skills Accreditation - Big Data skilling in Emilia-Romagna
BDV Skills Accreditation - Big Data skilling in Emilia-Romagna BDV Skills Accreditation - Big Data skilling in Emilia-Romagna
BDV Skills Accreditation - Big Data skilling in Emilia-Romagna Big Data Value Association
 
BDV Skills Accreditation - EIT labels for professionals
BDV Skills Accreditation - EIT labels for professionalsBDV Skills Accreditation - EIT labels for professionals
BDV Skills Accreditation - EIT labels for professionalsBig Data Value Association
 
BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...
BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...
BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...Big Data Value Association
 
BDV Skills Accreditation - Objectives of the workshop
BDV Skills Accreditation - Objectives of the workshopBDV Skills Accreditation - Objectives of the workshop
BDV Skills Accreditation - Objectives of the workshopBig Data Value Association
 
BDV Skills Accreditation - Welcome introduction to the workshop
BDV Skills Accreditation - Welcome introduction to the workshopBDV Skills Accreditation - Welcome introduction to the workshop
BDV Skills Accreditation - Welcome introduction to the workshopBig Data Value Association
 
BDV Skills Accreditation - Definition and ensuring of digital roles and compe...
BDV Skills Accreditation - Definition and ensuring of digital roles and compe...BDV Skills Accreditation - Definition and ensuring of digital roles and compe...
BDV Skills Accreditation - Definition and ensuring of digital roles and compe...Big Data Value Association
 
Virtual BenchLearning - DeepHealth - Needs & Requirements for Benchmarking
Virtual BenchLearning - DeepHealth - Needs & Requirements for BenchmarkingVirtual BenchLearning - DeepHealth - Needs & Requirements for Benchmarking
Virtual BenchLearning - DeepHealth - Needs & Requirements for BenchmarkingBig Data Value Association
 
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...Big Data Value Association
 
Policy Cloud Data Driven Policies against Radicalisation - Technical Overview
Policy Cloud Data Driven Policies against Radicalisation - Technical OverviewPolicy Cloud Data Driven Policies against Radicalisation - Technical Overview
Policy Cloud Data Driven Policies against Radicalisation - Technical OverviewBig Data Value Association
 
Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...
Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...
Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...Big Data Value Association
 
BDVA i Spaces - What they are, how to become one, value and collaborations
BDVA i Spaces - What they are, how to become one, value and collaborationsBDVA i Spaces - What they are, how to become one, value and collaborations
BDVA i Spaces - What they are, how to become one, value and collaborationsBig Data Value Association
 

More from Big Data Value Association (20)

Data Privacy, Security in personal data sharing
Data Privacy, Security in personal data sharingData Privacy, Security in personal data sharing
Data Privacy, Security in personal data sharing
 
Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...
Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...
Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...
 
Three pillars for building a Smart Data Ecosystem: Trust, Security and Privacy
Three pillars for building a Smart Data Ecosystem: Trust, Security and PrivacyThree pillars for building a Smart Data Ecosystem: Trust, Security and Privacy
Three pillars for building a Smart Data Ecosystem: Trust, Security and Privacy
 
Market into context - Three pillars for building a Smart Data Ecosystem: Trus...
Market into context - Three pillars for building a Smart Data Ecosystem: Trus...Market into context - Three pillars for building a Smart Data Ecosystem: Trus...
Market into context - Three pillars for building a Smart Data Ecosystem: Trus...
 
BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...
BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...
BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...
 
BDV Skills Accreditation - Big Data skilling in Emilia-Romagna
BDV Skills Accreditation - Big Data skilling in Emilia-Romagna BDV Skills Accreditation - Big Data skilling in Emilia-Romagna
BDV Skills Accreditation - Big Data skilling in Emilia-Romagna
 
BDV Skills Accreditation - EIT labels for professionals
BDV Skills Accreditation - EIT labels for professionalsBDV Skills Accreditation - EIT labels for professionals
BDV Skills Accreditation - EIT labels for professionals
 
BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...
BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...
BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...
 
BDV Skills Accreditation - Objectives of the workshop
BDV Skills Accreditation - Objectives of the workshopBDV Skills Accreditation - Objectives of the workshop
BDV Skills Accreditation - Objectives of the workshop
 
BDV Skills Accreditation - Welcome introduction to the workshop
BDV Skills Accreditation - Welcome introduction to the workshopBDV Skills Accreditation - Welcome introduction to the workshop
BDV Skills Accreditation - Welcome introduction to the workshop
 
BDV Skills Accreditation - Definition and ensuring of digital roles and compe...
BDV Skills Accreditation - Definition and ensuring of digital roles and compe...BDV Skills Accreditation - Definition and ensuring of digital roles and compe...
BDV Skills Accreditation - Definition and ensuring of digital roles and compe...
 
Virtual BenchLearning - Data Bench Framework
Virtual BenchLearning - Data Bench FrameworkVirtual BenchLearning - Data Bench Framework
Virtual BenchLearning - Data Bench Framework
 
Virtual BenchLearning - DeepHealth - Needs & Requirements for Benchmarking
Virtual BenchLearning - DeepHealth - Needs & Requirements for BenchmarkingVirtual BenchLearning - DeepHealth - Needs & Requirements for Benchmarking
Virtual BenchLearning - DeepHealth - Needs & Requirements for Benchmarking
 
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
 
Policy Cloud Data Driven Policies against Radicalisation - Technical Overview
Policy Cloud Data Driven Policies against Radicalisation - Technical OverviewPolicy Cloud Data Driven Policies against Radicalisation - Technical Overview
Policy Cloud Data Driven Policies against Radicalisation - Technical Overview
 
Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...
Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...
Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...
 
BDVA i Spaces - What they are, how to become one, value and collaborations
BDVA i Spaces - What they are, how to become one, value and collaborationsBDVA i Spaces - What they are, how to become one, value and collaborations
BDVA i Spaces - What they are, how to become one, value and collaborations
 
BDVA i Spaces - ITA Aragón
BDVA i Spaces - ITA AragónBDVA i Spaces - ITA Aragón
BDVA i Spaces - ITA Aragón
 
BDVA i Spaces - ahedd
BDVA i Spaces - aheddBDVA i Spaces - ahedd
BDVA i Spaces - ahedd
 
Policy Cloud Data Driven - Technical overview
Policy Cloud Data Driven - Technical overviewPolicy Cloud Data Driven - Technical overview
Policy Cloud Data Driven - Technical overview
 

Recently uploaded

wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...KarteekMane1
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfSubhamKumar3239
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 

Recently uploaded (20)

wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdf
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 

BigDataPilotDemoDays - I-BiDaaS Application to the Financial Sector Webinar

  • 1. Big Data Pilot Demo Days Despina Kopanaki (FORTH) dkopanaki@ics.forth.gr Marieke Willems (Trust-IT) m.willems@trust-itservices.com Andrea Schillaci (Trust-IT) a.schillaci@trust-itservices.com
  • 2. BDV PPP Summit 2020 went virtual 29/05/20 2
  • 3. Why Big Data Pilot Demo Days? The new data-driven industrial revolution highlights the need for big data technologies to unlock the potential in various application domains. BDV PPP projects I-BiDaaS, BigDataStack and Track & Know and deliver innovative technologies to address the emerging needs of data operations and applications. To fully exploit the sustainability of the developed technologies, the projects onboarded pilots that exhibit their applicability in a wide variety of sectors. In their third and final year, the projects are ready to demonstrate the developed and implemented technologies to interested end- users from industry as well as technology providers, for further adoption. 29/05/20 3
  • 4. BDV PPP Projects Join Forces 29/05/20 4 Industrial-Driven Big Data as a Self-Service Solution Holistic stack for big data applications and operations Big Data for Mobility Tracking Knowledge Extraction in Urban Areas BDV PPP projects joining forces to showcase application of innovative technologies in a variety of domains, fostering futher adoption, contributing to Europe’s digital future.
  • 5. Big Data Pilot Demo Days - A Series of Webinars 5 I-BiDaaS Application to the Financial Sector 1 BigDataStack Connected Consumer demo Track & Know applied to the fleet management sector 2 3 456 7 8 9 A BigDataStack Seafarer’s Tale on Real Time Shipping I-BiDaaS Application to the Telecommunication Sector I-BiDaaS Application to the manufacturing sector Track & Know applied to patient mobility in the healthcare sector BigDataStack Smart Insurance Track & Know applied to the automotive insurance sector
  • 6. Today’s Main Topic – Big Data as a Self-Service Solution I-BiDaaS overview CaixaBank’s Pitches: Setting the requirements I-BiDaaS architecture: Scientific & Technical view; how it addresses the requirements set by CaixaBank. Step by Step demonstration of I-BiDaaS solution and its application to the banking sector. Questions & Answers 29/05/20 6
  • 7. Webinar Speakers 729/05/2020 Assistant Professor at the Department of Mathematics and Informatics, Faculty of Sciences, University of Novi Sad, Serbia I-BiDaaS Scientific & Technical Manager. Project Manager at Security Innovation & Transformation, CaixaBank, Barcelona Dr. Dušan Jakovetić University of Novi Sad, Serbia Dr. Ramon Martin de Pozuelo CaixaBank
  • 8. I-BiDaaS Application to the Financial Sector Thursday, May 21, 2020 - 14:00-15:00 CEST Big Data Pilot Demo Days
  • 9. Dusan Jakovetic Ass. Professor, University of Novi Sad, Faculty of Sciences, Serbia; I-BiDaaS Scientific & Technical Manager I-BiDaaS Overview I-BiDaaS Application to the Financial Sector Thursday, May 21, 2020 - 14:00-15:00 CEST
  • 10. Identity card http://www.ibidaas.eu/ @Ibidaas https://www.linkedin.com/in/i-bidaas/ 1010
  • 11. I-BiDaaS Consortium 1. FOUNDATION FOR RESEARCH AND TECHNOLOGY HELLAS (FORTH) 2. BARCELONA SUPERCOMPUTING CENTER - CENTRO NACIONAL DE SUPERCOMPUTACION (BSC) 3. IBM ISRAEL - SCIENCE AND TECHNOLOGY LTD (IBM) 4. CENTRO RICERCHE FIAT SCPA (CRF) 5. SOFTWARE AG (SAG) 6. CAIXABANK, S.A (CAIXA) 7. THE UNIVERSITY OF MANCHESTER (UNIMAN) 8. ECOLE NATIONALE DES PONTS ET CHAUSSEES (ENPC) 9. ATOS SPAIN SA (ATOS) 10. AEGIS IT RESEARCH LTD (AEGIS) 11. INFORMATION TECHNOLOGY FOR MARKET LEADERSHIP (ITML) 12. University of Novi Sad Faculty of Sciences Serbia (UNSPMF) 13. TELEFONICA INVESTIGACION Y DESARROLLO SA (TID) 1111
  • 12. Motivation 12 EuropeanDataEconomy Essential resource for growth, competitiveness, innovation, job creation and societal progress in general Organizations leverage data pools to drive value The rise of the demand for platforms in the market empowering end users to analyze The convergence of internet of things (IoT), cloud, and big data transforms our economy and society Self-service solutions are transformative for organizations Building a European Data Economy (Jan 2017) Continue to struggle to turn opportunity from big data into realized gains Companies call upon expert analysts and consultants to assist them A completely new paradigm towards big data analytics The right knowledge, and insights decision- makers need to make the right decisions. Towards a common European data space (Apr 2018) Towards a thriving data-driven economy (Jul 2014) Digital Single Market 12
  • 13. Our Vision A complete and safe environment for methodological big data experimentation Tool and services to increase the quality of data analytics A Big Data as a Self-Service solution that helps breaking industrial data silos and boosts EU's data-driven economy Tools and services for fast ingestion and consolidation of both realistic and fabricated data Tools and services for the management of heterogeneous infrastructures including elasticity Increases impact in research community and contributes to industrial innovation capacity 1313
  • 14. Project Statement I-BiDaaS aims to empower users to easily utilize and interact with big data technologies, by designing, building, and demonstrating, a unified framework that: significantly increases the speed of data analysis while coping with the rate of data asset growth, and facilitates cross- domain data-flow towards a thriving data-driven EU economy. I-BiDaaS will be tangibly validated by three real-world, industry-lead experiments. 1414
  • 15. CAIXA Enhance control of customers to online banking Advanced analysis of bank transfer payment in financial terminal Analysis of relationships through IP address CRF Production process of aluminium casting Maintenance and monitoring of production assets TID Accurate location prediction with high traffic and visibility Optimization of placement of telecommunication equipment Quality of Service in Call Centers Telecommunications Industry Banking/Finance Industry Manufacturing Industry Application / Experimentation 1515
  • 17. Ramon Martín de Pozuelo Project Manager at Security Innovation & Transformation at CaixaBank Setting the Pilot Requirements I-BiDaaS Application to the Financial Sector Thursday, May 21, 2020 - 14:00-15:00 CEST
  • 18. 18 CaixaBank and the Use of Data CaixaBank is the leading financial group in Spain, both in banking and insurance and it is developing a strategy of diversification with stakes in international banks and also within leading service companies. 13.8M Customers 4.2K Branches 9.1K ATMs 4.8M Mobile Banking 5.8M On-line banking 32K Employees In January 2014 CaixaBank created the Big Data Department We manage 1.247 TB of information only in our Big data Department of more than 100 internal people providing Data analytics services to all the Organization. Due to Regulation constraints all the infrastructure and the analysis is done internally 18
  • 19. 19 Why do we need I-BiDaaS? Requirements Lack of agility in our Datapool for extracting data externally. Security procedures and constraints are necessary but hinder and slow down data sharing processes. We have tons of data but confidential. We are the data managers but the real owners of the data are our customers. Current data sharing situation in CaixaBank and I-BiDaaS approach Requirement Control Data privacy Data of customers (e.g., social graph) and external suppliers (e.g., SIEM) can not be accessed or shared with other partners Regulation Compliance All activities related to CaixaBank within the project must comply with the relevant regulations (e.g., ISO 27001, GDPR) Fraud and Security Analytics Use cases presented will be related to the improvement of security and the prevention of fraud. Objective: Exploit the I-BiDaaS platform to gain agility, efficiency and flexibility in our analytics for security. 19
  • 20. § Breaking external and cross-sectorial data silos while complying Regulation § Sharing Data models with other FI § Sharing Data models with other sectors § Following ECB & Banco de España constraints, we already have proved with I-BiDaas that this is possible Why do we need I-BiDaaS? 20
  • 21. § Secure Self-Service Infrastructure § Being able to outsource Big Data Infrastructure preserving privacy & security § To grow in a dedicated and specialized infrastructure § To count on dedicated and specialized specialists Why do we need I-BiDaaS? 21
  • 22. § Competitiveness & Innovation § Fast, agile and specialized adaptation to new technology. § Vs Current proprietary infrastructure § Efficiency § Reducing the costs and time of analyzing large datasets Why do we need I-BiDaaS? 22
  • 23. Dusan Jakovetic Assistant Professor, University of Novi Sad, Serbia; I-BiDaaS Scientific & Technical Manager Big Data Architecture I-BiDaaS Application to the Financial Sector Thursday, May 21, 2020 - 14:00-15:00 CEST
  • 24. • Expert mode Analyze your DataUsers • Import your data • Self-service mode Data • Fabricate Data • Stream & Batch Analytics • Expert: Upload your code • Self-service: Select an algorithm from the pool Results • Visualize the results • Share models Do it yourself In a flexible manner Break data silos Safe environment Interact with Big Data technologies Increase speed of data analysis Cope with the rate of data asset growth Intra- and inter- domain data- flow Benefits of using I-BiDaaS • Co-develop mode The I-BiDaaS Solution: Front-end • Co-develop: custom end- to-end application • Tokenize data 24
  • 25. • Expert mode Analyze your DataUsers • Import your data • Self-service mode Data • Fabricate Data • Stream & Batch Analytics • Expert: Upload your code • Self-service: Select an algorithm from the pool Results • Visualize the results • Share models Do it yourself In a flexible manner Break data silos Safe environment Interact with Big Data technologies Increase speed of data analysis Cope with the rate of data asset growth Intra- and inter- domain data- flow Benefits of using I-BiDaaS • Co-develop mode The I-BiDaaS Solution: Front-end • Co-develop: custom end- to-end application • Tokenize data 25
  • 26. • Expert mode Analyze your DataUsers • Import your data • Self-service mode Data • Fabricate Data • Stream & Batch Analytics • Expert: Upload your code • Self-service: Select an algorithm from the pool Results • Visualize the results • Share models Do it yourself In a flexible manner Break data silos Safe environment Interact with Big Data technologies Increase speed of data analysis Cope with the rate of data asset growth Intra- and inter- domain data- flow Benefits of using I-BiDaaS • Co-develop mode The I-BiDaaS Solution: Front-end • Co-develop: custom end- to-end application • Tokenize data 26
  • 27. Specify data fabrication rules Fabricate & save the fabricated data in the I-BiDaaS external platform [e.g., ATOS cloud] Visualize data/data sample [e.g., graphs, charts, etc.] Select pre-defined analytics from the pool / use code template to write a new script (COMPSs) Set algorithm parameters/choose default values Tune alg. paramet. inside the visualis. without coding [e.g., sliding window] Save the Big Data pipeline/analytics on the local premises [e.g., company’s cluster] Specify real data sources in the local warehouse for processing Execute the saved pipeline/analytics on the real data Visualize execution & results Extract new fabrication rules & save the rules; save the pipeline & analytics Fabricate/Tokenize Experiment&test Runatlocalprem.&realdata Execute a trial experiment; Select amount of resources visualize execution time & results I-BiDaaS – Prototypical Experimental Workflow 27
  • 28. Specify data fabrication rules Fabricate & save the fabricated data in the I-BiDaaS external platform [e.g., ATOS cloud] Visualize data/data sample [e.g., graphs, charts, etc.] Select pre-defined analytics from the pool / use code template to write a new script (COMPSs) Set algorithm parameters/choose default values Tune alg. paramet. inside the visualis. without coding [e.g., sliding window] Save the Big Data pipeline/analytics on the local premises [e.g., company’s cluster] Specify real data sources in the local warehouse for processing Execute the saved pipeline/analytics on the real data Visualize execution & results Extract new fabrication rules & save the rules; save the pipeline & analytics Execute a trial experiment; Select amount of resources visualize execution time & results Fabricate/Tokenize Experiment&test Runatlocalprem.&realdata I-BiDaaS – Experimental Workflow 28
  • 29. The I-BiDaaS Solution: Architecture/back-end Medium to long term business decisions Data Fabrication Platform (IBM) Refined specifications for data fabrication GPU-accelerated Analytics (FORTH) Apama Complex Event Processing (SAG) Streaming Analytics Batch Processing Advanced ML (UNSPMF) COMPsProgramming Model(BSC) Query Partitioning Infrastructurelayer: Private cloud; Commodity cluster; GPUs Pre-defined Queries SQL-like interface Domain Language Programming API User Interface Resource managementand orchestration (ATOS) Advanced Visualis. Advanced IT services for Big Data processing tasks; Open source pool of ML algorithms Data ingestion Programming Interface / Sequential Programming (AEGIS+SAG) (AEGIS) COMPsRuntime (BSC) Distributed large scale layer Application layer UniversalMessaging(SAG) TestData Fabrication(IBM) Meta- data; Data descri- ption Hecuba tools (BSC) Short term decisions real time alerts Model structure improvements Learnedpatterns correlations 29
  • 30. WP2: Data, user interface, visualization Technologies: • IBM TDF • SAG UM • AEGIS AVT Medium to long term business decisions Data Fabrication Platform (IBM) Refined specifications for data fabrication GPU-accelerated Analytics (FORTH) Apama Complex Event Processing (SAG) Streaming Analytics Batch Processing Advanced ML (UNSPMF) COMPs Programming Model (BSC) Query Partitioning Infrastructure layer: Private cloud; Commodity cluster; GPUs Pre-defined Queries SQL-like interface Domain Language Programming API User Interface Resource management and orchestration (ATOS) Advanced Visualis. Advanced IT services for Big Data processing tasks; Open source pool of ML algorithms Programming Interface / Sequential Programming (AEGIS+SAG) (AEGIS) COMPs Runtime (BSC) Distributed large scale layer Application layer UniversalMessaging(SAG) DataFabrication Platform(IBM) Meta- data; Data descri- ption Hecuba tools (BSC) Short term decisions real time alerts Model structure improvements Learned patterns correlations http://ibidaas.eu/tools 30
  • 31. Medium to long term business decisions Data Fabrication Platform (IBM) Refined specifications for data fabrication GPU-accelerated Analytics (FORTH) Apama Complex Event Processing (SAG) Streaming Analytics Batch Processing Advanced ML (UNSPMF) COMPs Programming Model (BSC) Query Partitioning Infrastructure layer: Private cloud; Commodity cluster; GPUs Pre-defined Queries SQL-like interface Domain Language Programming API User Interface Resource management and orchestration (ATOS) Advanced Visualis. Advanced IT services for Big Data processing tasks; Open source pool of ML algorithms Programming Interface / Sequential Programming (AEGIS+SAG) (AEGIS) COMPs Runtime (BSC) Distributed large scale layer Application layer UniversalMessaging(SAG) DataFabrication Platform(IBM) Meta- data; Data descri- ption Hecuba tools (BSC) Short term decisions real time alerts Model structure improvements Learned patterns correlations WP3: Batch analytics Technologies: • BSC COMPSs • BSC Hecuba • BSC Qbeast • Advanced ML (UNSPMF) http://ibidaas.eu/tools 31
  • 32. Medium to long term business decisions Data Fabrication Platform (IBM) Refined specifications for data fabrication GPU-accelerated Analytics (FORTH) Apama Complex Event Processing (SAG) Streaming Analytics Batch Processing Advanced ML (UNSPMF) COMPs Programming Model (BSC) Query Partitioning Infrastructure layer: Private cloud; Commodity cluster; GPUs Pre-defined Queries SQL-like interface Domain Language Programming API User Interface Resource management and orchestration (ATOS) Advanced Visualis. Advanced IT services for Big Data processing tasks; Open source pool of ML algorithms Programming Interface / Sequential Programming (AEGIS+SAG) (AEGIS) COMPs Runtime (BSC) Distributed large scale layer Application layer UniversalMessaging(SAG) DataFabrication Platform(IBM) Meta- data; Data descri- ption Hecuba tools (BSC) Short term decisions real time alerts Model structure improvements Learned patterns correlations WP4: Streaming analytics Technologies: • SAG Apama CEP • FORTH GPU-accel. analytics http://ibidaas.eu/tools 32
  • 33. Medium to long term business decisions Data Fabrication Platform (IBM) Refined specifications for data fabrication GPU-accelerated Analytics (FORTH) Apama Complex Event Processing (SAG) Streaming Analytics Batch Processing Advanced ML (UNSPMF) COMPs Programming Model (BSC) Query Partitioning Infrastructure layer: Private cloud; Commodity cluster; GPUs Pre-defined Queries SQL-like interface Domain Language Programming API User Interface Resource management and orchestration (ATOS) Advanced Visualis. Advanced IT services for Big Data processing tasks; Open source pool of ML algorithms Programming Interface / Sequential Programming (AEGIS+SAG) (AEGIS) COMPs Runtime (BSC) Distributed large scale layer Application layer UniversalMessaging(SAG) DataFabrication Platform(IBM) Meta- data; Data descri- ption Hecuba tools (BSC) Short term decisions real time alerts Model structure improvements Learned patterns correlations WP5: Resource mgmt & integration Technologies: • ATOS Resource mgmt • ITML integration services http://ibidaas.eu/tools 33
  • 34. Data fabrication capabilities Solution flexibility Easy to code programming paradigm High code reusability Key Features & Innovations https://www.ibidaas.eu/deliverables 34
  • 35. Data fabrication capabilities Solution flexibility Easy to code programming paradigm High code reusability CAIXA: From 3 months to 1-2 weeks for a new proof-of- concept Big Data technology Key Features & Innovations https://www.ibidaas.eu/deliverables 35
  • 36. Data fabrication capabilities Solution flexibility Easy to code programming paradigm High code reusability CAIXA: Bank transfers use case: ~3-4 less time to analysis Key Features & Innovations https://www.ibidaas.eu/deliverables 36
  • 37. GPU-accelerated analytics; Synergy of CEP and GPU- accelerated analytics for streaming data Feedback from analytics to data fabrication Feedback from analytics to problem modelling Demonstrated on use cases across 3 different data providers and 3 different industries https://www.ibidaas.eu/deliverables Key Features & Innovations (Cont’d) 37
  • 38. Heterogeneous Data Sources Medium to long term business decisions Data Fabrication Platform (IBM) Refined specifications for data fabrication GPU-accelerated Analytics (FORTH) Apama Complex Event Processing (SAG) Streaming Analytics Batch Processing Advanced ML (UNSPMF) COMPs Programming Model (BSC) Query Partitioning Infrastructure layer: Private cloud; Commodity cluster; GPUs Pre-defined Queries SQL-like interface Domain Language Programming API User Interface Resource management and orchestration (ATOS) Advanced Visualis. Advanced IT services for Big Data processing tasks; Open source pool of ML algorithms Data ingestion and integration Programming Interface / Sequential Programming (AEGIS+SAG) (AEGIS) COMPs Runtime (BSC) Distributed large scale layer Application layer UniversalMessaging(SAG) DataFabrication Platform(IBM) Meta- data; Data descri- ption Hecuba tools (BSC) Short term decisions real time alerts Model structure improvements Learned patterns correlations https://www.ibidaas.eu/deliverables I-BiDaaS Solution: A CaixaBank Use Case Example 38
  • 39. Heterogeneous Data Sources Medium to long term business decisions Data Fabrication Platform (IBM) Refined specifications for data fabrication GPU-accelerated Analytics (FORTH) Apama Complex Event Processing (SAG) Streaming Analytics Batch Processing Advanced ML (UNSPMF) COMPs Programming Model (BSC) Query Partitioning Infrastructure layer: Private cloud; Commodity cluster; GPUs Pre-defined Queries SQL-like interface Domain Language Programming API User Interface Resource management and orchestration (ATOS) Advanced Visualis. Advanced IT services for Big Data processing tasks; Open source pool of ML algorithms Data ingestion and integration Programming Interface / Sequential Programming (AEGIS+SAG) (AEGIS) COMPs Runtime (BSC) Distributed large scale layer Application layer UniversalMessaging(SAG) DataFabrication Platform(IBM) Meta- data; Data descri- ption Hecuba tools (BSC) Short term decisions real time alerts Model structure improvements Learned patterns correlations https://www.ibidaas.eu/deliverables I-BiDaaS Solution: A CaixaBank Use Case Example Data fabrication: 75% time reduction for data access by external stakeholders Real time analytics: Detecting relations of users in real time Solution flexibility: Found more useful results than before the project 39
  • 40. Ramon Martín de Pozuelo Project Manager at Security Innovation & Transformation at CaixaBank Financial Pilot: step by step I-BiDaaS Application to the Financial Sector Thursday, May 21, 2020 - 14:00-15:00 CEST
  • 41. Current situation § Fraud Detection Analytics in CaixaBank § Currently, CaixaBank analytics are executed in-house. § Analytics lifecycle: § Security analysis data storage: DataPool (Oracle), Qradar (IBM) § Data exploration phase to build a model/query (expensive to run) § Execution of model/query on data in production mode § This process is executed periodically. Data Pool Data exploration Production Analysis Data model, Query/program 41
  • 42. CaixaBank Datapool Platform Dataset recipe or Tokenised Data Analytics tools and visualization Data generation Streams Batch I-BiDaaS Platform Algorithms, expertise I-BiDaaS solution 42
  • 43. CaixaBank Data Roadmap § Synthetic data usage: § We explored the synthetic data solution in the MVP. § New opportunities: § The possibility to work with real data outside CaixaBank in a secure way. § We moved from totally synthetic approach to tokenized real datasets. § We include a comparison between synthetic and real data to know better the differences 43
  • 44. Initial Data & use cases Enhance control over third party agencies Facilitate the analysis / detection of connections from external suppliers. Advanced analysis of bank transfer payment in financial terminal Facilitate the analysis / detection of fraudulent transfers through Financial Terminal. Analysis of relationships through IP address Facilitate the analysis / detection of user relationships with the same residential IP. Building of a social graph Choose a graph- oriented DB. Test technologies and tools for the treatment of the graph. Validate the use of synthetic data for analysis, if the rules act in the same situations as with the real data. Establish testing environment for new Big Data tools without sensitive data constraints 44
  • 45. Final Data & use cases Enhance control of customers to online banking Facilitate the analysis / detection of fraudulent mobile to mobile bank transfers in online banking. Advanced analysis of bank transfer payment in financial terminal Facilitate the analysis / detection of fraudulent transfers through Financial Terminal. Analysis of relationships through IP address Facilitate the analysis / detection of user relationships with the same residential IP. Validate the use of synthetic data for analysis, if the rules act in the same situations as with the real data. Establish testing environment for new Big Data tools outside of CaixaBank premises. Open CaixaBank data to a wider community and explore novel data analytics methodologies. Syntheticand realtokenizeddatasets 45
  • 46. Use Case I-BiDaaS dataset Data Enhance control of customers to online banking Online banking connections Real tokenized Advanced analysis of bank transfer payment in financial terminal Bank transfers Real tokenized Analysis of relationships through IP address IP address Synthetic / real tokenized Definition of the relationship Synthetic Data Generation Generation of non- sensible data via rule definition Non- sensitive data Analytics § Business Goal: The transaction between two people related by an IP is not Fraud. § Use Case Goal: Validation of Synthetic Data usage. 46 CAIXA 1st case MVP use case
  • 47. 47 CAIXA 1st case MVP use case
  • 48. 48 CAIXA 1st case MVP use case
  • 49. 49 CAIXA 1st case MVP use case
  • 50. 50 CAIXA 1st case MVP use case
  • 51. 51 CAIXA 1st case MVP use case
  • 52. 52 CAIXA 1st case MVP use case
  • 53. Streamline CaixaBank processes to grant permits for data access at the start of a project with an external provider Streamline the process of establishing work environment and scenario for PoCs without the need to use sensitive data. Synthetic data 53 How I-BiDaaS is helping us?
  • 54. Big Data Analytics objective: Discover fraudulent scenarios from our data by analysing the presence or not of the client. IDENTITY NUMBER ADDITIONAL NOTES TIME OF THE TRANSFER Ensure the security of our data: Decide what are we going to share clear, tokenized or encrypted. The challenge relies on finding the limit of what and how real data can be shared to comply to regulation and not lose aditional and valuable information for analytics. Use case high-level objective: Breaking internal and external silos Financial Operation Data, Security Management (SIEM), etc. 1.000kk of rows per month 54 CAIXA 2nd case Analyse bank transfers executed from employees financial terminal in the name of a customer. Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of the movement.)
  • 55. Point of view: • Final User • Programmer CONTEXTOBJECTIVES • Identify and glue events, followed by enriching the transfer payment dataset • Encrypt the data without lossing value • Advanced analytics 55 CAIXA 2nd case Analyse bank transfers executed from employees financial terminal in the name of a customer. Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of the movement.)
  • 56. 56 CAIXA 2nd case Analyse bank transfers executed from employees financial terminal in the name of a customer. Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of the movement.)
  • 57. 57 CAIXA 2nd case Analyse bank transfers executed from employees financial terminal in the name of a customer. Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of the movement.)
  • 58. Comparing solutions and processes to analyse data outside CaixaBank premises Data Analytics comercial products 58 CAIXA 2nd case Analyse bank transfers executed from employees financial terminal in the name of a customer. Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of the movement.)
  • 59. 59 CAIXA 2nd case Analyse bank transfers executed from employees financial terminal in the name of a customer. Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of the movement.)
  • 60. 60 CAIXA 2nd case Analyse bank transfers executed from employees financial terminal in the name of a customer. Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of the movement.)
  • 61. DataRobot comparison results • Custom solutions: • I-BiDaaS provide more flexibility in the definition of your own code, scoring metrics, etc. • Unsupervised learning: • DataRobot has very limited number of unsupervised learning models. I-BiDaaS can provide much more detailed results on unsupervised learning use cases. 61 CAIXA 2nd case Analyse bank transfers executed from employees financial terminal in the name of a customer. Potential fraudulent transfer or bad practices (e.g. check that the client was present in the time of the movement.)
  • 62. Establish testing environment for new Big Data analytics tools outside of CaixaBank premises. Open CaixaBank data to a wider community and explore novel data analytics methodologies. High-level objective: Breaking internal and external silos Financial Operation Data, Security Management (SIEM), etc. Real Tokenised Data 62 How I-BiDaaS is helping us?
  • 63. Benefits KPIs To increase the efficiency and competitiveness in the management of its vast and complex amounts of data. 75% time reduction data access from external stakeholders using synthetic data (From 6 to 1.5 days). To break data silos not only internally, but also fostering and triggering internal procedures to open data to external stakeholders. Real data accessed by at least 6 different external entities skipping long-time data access procedures. To evaluate Big Data analytics tools with real-life use cases of CaixaBank in a much more agile way. I-BiDaaS overall solution and tools experimentation with 3 different industrial use cases with real data. 63 CaixaBank benefits from I-BiDaaS
  • 64. Questions? Thank you! Your feedback is valuable for us! https://bit.ly/2Zl19IJ