This document proposes using continuous embedding spaces and neural networks to analyze bank transaction data without manual labeling. It aims to better understand customer behaviors by discovering relationships in large, unlabeled transaction datasets. The approach represents transaction elements as vectors using Word2Vec, and trains a neural network to predict customer attributes like business segments from the transaction vectors. Initial experiments show the model can accurately predict business segments based on transaction information alone. Future work could use these techniques to identify patterns for applications like fraud detection and personalized product offers.
Continuous Embedding Spaces for Bank Transaction Data
1. CONTINUOUS EMBEDDING SPACES
for BANK TRANSACTION DATA
Ali Batuhan Dayıoğlugil 1
and Yusuf Sinan Akgül2
1
Yeditepe University, Cybersoft R&D Center, İstanbul, Turkey
2
Gebze Technical University, Kocaeli, Turkey
ISMIS 2017, WARSAW
2. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 2 of 18
OUTLINE
● Introduction & Motivations
● Modern NLP Techniques
● Purpose of the Work
● An NLP Approach for Bank Data
● Experiments
● Future Works
3. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 3 of 18
Introduction
● Growing data: todays treasure
● Health, e-commerce, banking, assurance, games...
● Biggest challenge is extracting valuable
information
● Behavior recognition
● Forecasting
● Detecting abnormal activities
4. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 4 of 18
Introduction
In case of Banks:
● Millions of transactions everyday
● Processing of data:
● rule based systems
● domain experts
● Missing complicated relations and patterns between same
and different customer transactions (limited capacity)
● As a result
● Less successful product offers
● Economical loss due to undetected frauds
● Dissatisfaction of customers
5. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 5 of 18
Motivations
● New era in banking: Smart systems
– Learning models
– Automatically discovering hidden patterns
– Faster than human decisions
– Less prone to human errors
● Essential tasks by Smart Systems:
– Fraud Detection
– New product offers
– Customer behavior analysis
6. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 6 of 18
Modern NLP Techniques
● Representation of words with their hidden features:
● Continuous embedding workspaces
– Fast in process, highly promising, domain independent, fully unsupervised...
● Application Fields:
– Language modeling, language translations
7. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 7 of 18
Nature of Bank Data
● Specifications:
– Structured and ordered (Row-Column)
– Large variety of attributes;
● Demographic, Transactional, Logs, Engineered Features
– Various customer behavior
– Fast, Cumulative, High Volume
– Lack of labeled data
● Role of Domain Experts
– Segmentation
– Campaign Planning
– Customer Propensity & New product design
– Outlier Identification
8. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 8 of 18
Nature of Bank Data
● Popular techniques for financial data extraction:
– Rule based approaches
● Difficult to keep up to date
● Hard to add/remove rules regularly
– Supervised machine learning methods
● Incapable of detecting deep relations in data
● Require labeled data
– Domain Experts
● Subjective decisions
● Prone to human errors
● Not enough resource to process all data
9. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 9 of 18
Purpose of the Work
“Predicting customer behavior using transaction and
demographic data without any manual labeling”
● Explaining deep relations in data
● Objective approach
● Up-to-date models
● Decisions with less domain expert dependent
10. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 10 of 18
NLP Design over Bank Data
● Similarities:
– Time series transactions ~ Sentences in a context
– Attributes of transactions ~ Words in a sentence
● Successful results of continuous embedding
spaces with libraries: Word2Vec, GloVe and
FastText
● Need of numerical inputs for machine learning
models
11. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 11 of 18
NLP Design over Bank Data
● Transaction Data:
– TRX= {Ti
}, i=1..ni
● Transaction Elements
– Ti = {ti,j}, j=1..m
● Mostly categorical attributes
● Numerical elements are clustered and irrelevant elements are ignored
● Transaction elements
are converted to vectors
and compared by cosine
similarity
12. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 12 of 18
Experiment: Space Transformation
● Data:
– 1.8 M transaction data (4 weeks) of a medium-sized Turkish
Bank
– Enriched with customer demographic data
– 8 categorical and 2 numerical attributes (clustered)
– Dictionary size is 137
● Vector Transformation
– Word2Vec library
● Skip-gram, 20-length vectors (for each element value)
● Comparison of vectors in 2D (PCA)
13. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 13 of 18
Experiment: Space Transformation
● Nearest neighbours of three transaction element values
(Age, business segment and profession respectively) with
respect to cosine similarities
14. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 14 of 18
Experiment: Space Transformation
● PCA of business segment element value embedding vectors
15. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 14 of 18
Experiment: Space Transformation
●
Embedding vectors of same element values with artificially divided
‘High-income’ value
16. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 15 of 18
Experiment: Classification
● Predicting business segment vectors Bi (20) using Transaction vectors i
(180)
(without segment vector)
● Aim: model produce vectors positions close to business segment element value
vectors
● ANN parameters:
– Cosine similarity (loss func.)
– Stochastic Gradient
Descent as optimizer
– tanh as activation func.
– Single hidden layer, 60 nodes
– Learning rate: 0,018
– Batch size: 100
17. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 16 of 18
Experiment: Classification
● 4 cross-fold validation with designed ANN model
● Comparison of predicted business segment vectors i
with real vectors Bi and selecting nearest (top) 5
● Proposed model accuracies for business segment
attribute:
18. 28/07/2017Dayioglugil A. B., Akgul Y. S. Page 17 of 18
Future Work
● Extracting deep and hidden patterns in customer
transactions may be used in:
– Creating relevant products
– Detecting fraud attempts by examining customer
behaviours
– Defining abnormal customer activities