SlideShare a Scribd company logo
1 of 36
Foreign Exchange Data Crawling and
Analysis for Knowledge Discovery
Leading to Informative Decision
Making
Data mining
Dr. Nasser Ghadiri
Mostafa Arjmand
The Presentation Include The Following :
• Introduction
• Data Coverage And Size
• Framework Structure And The Data Fetching Mechanism
• Experimental study
• Ranking And Classification
• Conclusions
2
Introduction
3
Introduction :
• Foreign exchange (Forex) market
• Development a framework
• The framework allows streaming and visualization of historical (previous) and current currency
prices in close to real time
• The framework benchmarks every monitored broker to decide whether he/she is trustworthy
4
Broker
• However, human decision making process is mostly subjective and emotional
• It is not necessary that every broker considers the right factors affecting exchange rate, or even gives the
right weight to each
5
CTS-Forex Performance Study
Forex monitoring system designed to :
• Automatically track and monitor brokers
• By fetching, visualizing and analyzing their
announced exchange rate
• Our current effort focuses on EUR/USD exchange
rate
• covers 24.1% of forex market
CTS FOREX PERFORMANCE STUDY MAIN FEATURES
6
Main Components Of the Proposed Framework :
Visualizing the Data Streams
CTS FOREX PERFORMANCE STUDY MAIN FEATURES
The ability to visualize previously captured data
as well as current data is essential for a expert
1) Single Broker Visualization
2) Multiple Brokers Visualization
3) Ranking and Classification Visualization
7
Main Components Of the Proposed Framework :
Analyzing Captured Data
CTS FOREX PERFORMANCE STUDY MAIN FEATURES
The ability to analyze and interpret the captured
data provides useful information for enhancing
the decision making process
Short
term
Medium
term
Long
term
8
DATA COVERAGE AND SIZE
9
Data Coverage And Size :
The Data Structure
Snapshot : The data for each broker
• Date of the snapshot
• Number of seconds recorded in the snapshot
Tick : Each piece of data is called a tick
• Date of the tick
• The ASK price of the currency
• The BID price of the currency
Recording Periods
Indeed data is the main source for knowledge discovery leading to informative decision making
10
4/5 are ignored
Data Coverage And Size :
Brokers Data Summary
• They monitored a total of 52 brokers
• All the collected data used is from the
MetaTrader 4 platform
BROKERS SUMMARY TABLE
11
Data Coverage And Size :
Daily Data Distribution (Monthly Stacked)
12
FRAMEWORK STRUCTURE AND
DATA FETCHING MECHANISM
13
Framework Structure And The Data Fetching Mechanism
System Block Diagram 14
Framework Features
• Main function of this framework is capturing and visualizing stream data
• The number of online brokers is increasing
• This rapid increase has to be handled by our framework by having the ability of easily add new
brokers
• Brokers use different platforms to provide traders with several features with minimum modifications
• The framework should provide runtime visualization as well as visualization of previously captured data
15
Framework Features
Framework should provide the ability to analyze data at three levels:
1) Short term analysis
2) Medium term analysis
3) Long term analysis
16
Available Resources
Meta Trader 4
• The software consists of two components:
• Client
• Server
• The server is used by brokers
• To provide a client component to their clients
• The client provides end users with the ability to :
• View live streaming prices
• Place orders
• Write their own scripts that automate the trading process
17
Client Applications Installation
• First mission is install the client applications (Meta Trader) to
start fetching the data
• The problem we faced was fitting all clients on the same
computer as they consume a lot of memory
• Distribute client applications
• save more memory for short term analysis
18
Local Data Centralization
Client-Server ModelDatabase
Insertion and Selection Model
The first model we came up with was a client/server model
The second model is a database insertion and selection model
Although the first showed better results
• Decided to stick to the second model
• It provides better integrity
19
Global Data Centralization
Global Data Centralization
We have all the fetched data in each computer centralized in its
fetcher application
• This allows us to run short term analysis and
• Some medium term analysis
• But it does not allow us to run long term analysis
• brokers ranking
• benchmarking
• Although centralizing data will have the drawback
20
Online/Local Visualization
One of the implemented features was a visualization system for online users
• Track their favorite brokers and compare them against others
• The first model was storing data into a MySQL database
• The second model was pre-compiling the data into JSON data files
We still had to find a way to pre-compile the data stored locally on our
computers and push it to the online server
Database model VS JSON model21
Manager Application
Online Visualization Structure
The manager application will be responsible for
• Running a lot of medium and long term analysis
• Ranking the brokers
• Benchmarking them
• Prioritizing the snapshots
22
EXPERIMENTAL STUDY
23
Experimental study
Windows 8 Pro with 32GB installed memory (RAM).
Intel Core i7-3770 CPU @ 3.40GHz and 3.40 GHz.
The first disk is a HDD second is a SSD.
DATA SET SUMMARY
Local centralization study
Visualization component study
24
Local Data Centralization Study
Local Data Centralization (Model 1 vs Model 2 HHD) Week 4
25
Visualization Study
WEEK 1 - VISUALIZATION THROUGHPUT RESULTS (MODEL 1 VS MODEL 2)
WEEK 2 - VISUALIZATION THROUGHPUT RESULTS (MODEL 1 VS MODEL 2)
Fetching data
from the
database
Fetching data
from pre-
compiled JSON
files
26
RANKING AND CLASSIFICATION
27
Ranking And Classification
Our main focus is to study performance of brokers
and find trustworthy ones
The two approaches are supplementary
28
Feed-price performance
Feed price performance key features Spread Consistency
29
Ranking And Classification
1) The Algorithm:
30
Ranking And Classification
• The ranking algorithm was just the first step for distinguishing good brokers
• We do not have any gold standard rules to specify ranges of good and bad brokers’ scores
• Therefore, we decided to use a clustering algorithm
• For each ranked snapshot, we cluster the brokers based on their final scores
• Thus we are clustering one-dimensional data.
31
Clustering
Hierarchical
Density Based
K-Means
Distribution of Brokers w.r.t. their Most Common Class
In our case, we want to find trustworthy brokers, gray, and bad ones
32
Ranking And Classification
2) Membership:
To determine whether a broker is good or not, we have to study their memberships in all classes during
their life cycle
33
CONCLUSIONS
34
CONCLUSIONS
• We Present a group of issues that exist in the current forex market
• Designed and implemented a full framework
• To monitor a list of brokers by fetching their data continuously
• Comparing them to each other and thus speed up and enhance domain experts’
decision making process
35
36
THANK YOU FOR LISTENING
•‫مطالب‬‫تکراری‬
•‫شده‬ ‫ارائه‬ ‫کار‬ ‫با‬ ‫مقاله‬ ‫تیتر‬ ‫نداشتن‬ ‫تطابق‬
•‫شده‬ ‫ارائه‬ ‫پراکنده‬ ‫صورت‬ ‫به‬ ‫آینده‬ ‫کارهای‬
•‫و‬ ‫شکلها‬ ‫آوردن‬ ‫در‬ ‫سازماندهی‬ ‫و‬ ‫نظم‬ ‫داشتن‬ ‫عدم‬‫جداول‬
•‫غیر‬ ‫برای‬ ‫چارچوب‬‫از‬EUR-USD
•‫بندی‬ ‫خوشه‬ ‫مورد‬ ‫در‬ ‫کم‬ ‫توضیحات‬
•‫مطالب‬ ‫پراکندگی‬
•‫داده‬ ‫پنجم‬ ‫یک‬‫ها‬
•‫شده‬ ‫ارائه‬ ‫چارچوب‬ ‫با‬ ‫کار‬ ‫نحوه‬ ‫چگونگی‬

More Related Content

Similar to Foreaign Exchange Data Crawling and Analysis for Knowledge Discovery Leading to Informative Decision Making

AIS PPt.pptx
AIS PPt.pptxAIS PPt.pptx
AIS PPt.pptxdereje33
 
An example of a successful proof of concept
An example of a successful proof of conceptAn example of a successful proof of concept
An example of a successful proof of conceptETLSolutions
 
Information Transformation: Planning for an Effective Implementation
Information Transformation: Planning for an Effective ImplementationInformation Transformation: Planning for an Effective Implementation
Information Transformation: Planning for an Effective ImplementationKyriba Corporation
 
Using PySpark to Scale Markov Decision Problems for Policy Exploration
Using PySpark to Scale Markov Decision Problems for Policy ExplorationUsing PySpark to Scale Markov Decision Problems for Policy Exploration
Using PySpark to Scale Markov Decision Problems for Policy ExplorationDatabricks
 
10-Step Methodology to Building a Single View with MongoDB
10-Step Methodology to Building a Single View with MongoDB10-Step Methodology to Building a Single View with MongoDB
10-Step Methodology to Building a Single View with MongoDBMat Keep
 
SYNOPSIS ON BANK MANAGEMENT SYSTEM
SYNOPSIS ON BANK MANAGEMENT SYSTEMSYNOPSIS ON BANK MANAGEMENT SYSTEM
SYNOPSIS ON BANK MANAGEMENT SYSTEMNitish Xavier Tirkey
 
Sharing CTMS Data Between Sponsors and CROs
Sharing CTMS Data Between Sponsors and CROsSharing CTMS Data Between Sponsors and CROs
Sharing CTMS Data Between Sponsors and CROsPerficient
 
Online-Voting-System.doc
Online-Voting-System.docOnline-Voting-System.doc
Online-Voting-System.docShangaviS2
 
Requirements document for big data use cases
Requirements document for big data use casesRequirements document for big data use cases
Requirements document for big data use casesAllied Consultants
 
MongoBD London 2013: Real World MongoDB: Use Cases from Financial Services pr...
MongoBD London 2013: Real World MongoDB: Use Cases from Financial Services pr...MongoBD London 2013: Real World MongoDB: Use Cases from Financial Services pr...
MongoBD London 2013: Real World MongoDB: Use Cases from Financial Services pr...MongoDB
 
DSS Case on Planpower
DSS Case on PlanpowerDSS Case on Planpower
DSS Case on PlanpowerSubhasis Maji
 
Cdisc sdtm implementation_process _v1
Cdisc sdtm implementation_process _v1Cdisc sdtm implementation_process _v1
Cdisc sdtm implementation_process _v1ray4hz
 
Online e-voting
Online e-votingOnline e-voting
Online e-votingaeioou
 
3210cpmslideshare-1610302073773770419.pdf
3210cpmslideshare-1610302073773770419.pdf3210cpmslideshare-1610302073773770419.pdf
3210cpmslideshare-1610302073773770419.pdfPabitraMitra3
 
The Dodd-Frank Software Solution
The Dodd-Frank Software SolutionThe Dodd-Frank Software Solution
The Dodd-Frank Software SolutionWG Consulting
 
Critical Path Method (CPM) & Network Analysis
Critical Path Method (CPM) & Network AnalysisCritical Path Method (CPM) & Network Analysis
Critical Path Method (CPM) & Network AnalysisAsif Sultan Razu
 

Similar to Foreaign Exchange Data Crawling and Analysis for Knowledge Discovery Leading to Informative Decision Making (20)

AIS PPt.pptx
AIS PPt.pptxAIS PPt.pptx
AIS PPt.pptx
 
An example of a successful proof of concept
An example of a successful proof of conceptAn example of a successful proof of concept
An example of a successful proof of concept
 
Information Transformation: Planning for an Effective Implementation
Information Transformation: Planning for an Effective ImplementationInformation Transformation: Planning for an Effective Implementation
Information Transformation: Planning for an Effective Implementation
 
Using PySpark to Scale Markov Decision Problems for Policy Exploration
Using PySpark to Scale Markov Decision Problems for Policy ExplorationUsing PySpark to Scale Markov Decision Problems for Policy Exploration
Using PySpark to Scale Markov Decision Problems for Policy Exploration
 
10-Step Methodology to Building a Single View with MongoDB
10-Step Methodology to Building a Single View with MongoDB10-Step Methodology to Building a Single View with MongoDB
10-Step Methodology to Building a Single View with MongoDB
 
SYNOPSIS ON BANK MANAGEMENT SYSTEM
SYNOPSIS ON BANK MANAGEMENT SYSTEMSYNOPSIS ON BANK MANAGEMENT SYSTEM
SYNOPSIS ON BANK MANAGEMENT SYSTEM
 
The art of project estimation
The art of project estimationThe art of project estimation
The art of project estimation
 
Sharing CTMS Data Between Sponsors and CROs
Sharing CTMS Data Between Sponsors and CROsSharing CTMS Data Between Sponsors and CROs
Sharing CTMS Data Between Sponsors and CROs
 
Online-Voting-System.doc
Online-Voting-System.docOnline-Voting-System.doc
Online-Voting-System.doc
 
Presentation2
Presentation2Presentation2
Presentation2
 
Requirements document for big data use cases
Requirements document for big data use casesRequirements document for big data use cases
Requirements document for big data use cases
 
MongoBD London 2013: Real World MongoDB: Use Cases from Financial Services pr...
MongoBD London 2013: Real World MongoDB: Use Cases from Financial Services pr...MongoBD London 2013: Real World MongoDB: Use Cases from Financial Services pr...
MongoBD London 2013: Real World MongoDB: Use Cases from Financial Services pr...
 
Benchmarking Faceted Browsing Capabilities of Triple Stores
Benchmarking Faceted Browsing Capabilities of Triple StoresBenchmarking Faceted Browsing Capabilities of Triple Stores
Benchmarking Faceted Browsing Capabilities of Triple Stores
 
DSS Case on Planpower
DSS Case on PlanpowerDSS Case on Planpower
DSS Case on Planpower
 
Cdisc sdtm implementation_process _v1
Cdisc sdtm implementation_process _v1Cdisc sdtm implementation_process _v1
Cdisc sdtm implementation_process _v1
 
Online e-voting
Online e-votingOnline e-voting
Online e-voting
 
3210cpmslideshare-1610302073773770419.pdf
3210cpmslideshare-1610302073773770419.pdf3210cpmslideshare-1610302073773770419.pdf
3210cpmslideshare-1610302073773770419.pdf
 
Group-9.pptx
Group-9.pptxGroup-9.pptx
Group-9.pptx
 
The Dodd-Frank Software Solution
The Dodd-Frank Software SolutionThe Dodd-Frank Software Solution
The Dodd-Frank Software Solution
 
Critical Path Method (CPM) & Network Analysis
Critical Path Method (CPM) & Network AnalysisCritical Path Method (CPM) & Network Analysis
Critical Path Method (CPM) & Network Analysis
 

More from Mostafa Arjmand

امنیت سایبر فیزیکی زیرساخت شبکه برق هوشمند
امنیت سایبر فیزیکی زیرساخت شبکه برق هوشمندامنیت سایبر فیزیکی زیرساخت شبکه برق هوشمند
امنیت سایبر فیزیکی زیرساخت شبکه برق هوشمندMostafa Arjmand
 
Monomi: Practical Analytical Query Processing over Encrypted Data
Monomi: Practical Analytical Query Processing over Encrypted DataMonomi: Practical Analytical Query Processing over Encrypted Data
Monomi: Practical Analytical Query Processing over Encrypted DataMostafa Arjmand
 
Security and trust in semantic web
Security  and trust in semantic webSecurity  and trust in semantic web
Security and trust in semantic webMostafa Arjmand
 
Smart city dynamic road lane management a smart city application
Smart city  dynamic road lane management a smart city applicationSmart city  dynamic road lane management a smart city application
Smart city dynamic road lane management a smart city applicationMostafa Arjmand
 
Internet of Things A Vision, Architectural Elements, and Future Directions
Internet of Things A Vision, Architectural Elements, and Future Directions Internet of Things A Vision, Architectural Elements, and Future Directions
Internet of Things A Vision, Architectural Elements, and Future Directions Mostafa Arjmand
 

More from Mostafa Arjmand (6)

امنیت سایبر فیزیکی زیرساخت شبکه برق هوشمند
امنیت سایبر فیزیکی زیرساخت شبکه برق هوشمندامنیت سایبر فیزیکی زیرساخت شبکه برق هوشمند
امنیت سایبر فیزیکی زیرساخت شبکه برق هوشمند
 
Monomi: Practical Analytical Query Processing over Encrypted Data
Monomi: Practical Analytical Query Processing over Encrypted DataMonomi: Practical Analytical Query Processing over Encrypted Data
Monomi: Practical Analytical Query Processing over Encrypted Data
 
Commitment scheme
Commitment schemeCommitment scheme
Commitment scheme
 
Security and trust in semantic web
Security  and trust in semantic webSecurity  and trust in semantic web
Security and trust in semantic web
 
Smart city dynamic road lane management a smart city application
Smart city  dynamic road lane management a smart city applicationSmart city  dynamic road lane management a smart city application
Smart city dynamic road lane management a smart city application
 
Internet of Things A Vision, Architectural Elements, and Future Directions
Internet of Things A Vision, Architectural Elements, and Future Directions Internet of Things A Vision, Architectural Elements, and Future Directions
Internet of Things A Vision, Architectural Elements, and Future Directions
 

Recently uploaded

Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHC Sai Kiran
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvLewisJB
 
Introduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxIntroduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxk795866
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
 
Effects of rheological properties on mixing
Effects of rheological properties on mixingEffects of rheological properties on mixing
Effects of rheological properties on mixingviprabot1
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
EduAI - E learning Platform integrated with AI
EduAI - E learning Platform integrated with AIEduAI - E learning Platform integrated with AI
EduAI - E learning Platform integrated with AIkoyaldeepu123
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfme23b1001
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)Dr SOUNDIRARAJ N
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 

Recently uploaded (20)

Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECH
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvv
 
Introduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxIntroduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptx
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
POWER SYSTEMS-1 Complete notes examples
POWER SYSTEMS-1 Complete notes  examplesPOWER SYSTEMS-1 Complete notes  examples
POWER SYSTEMS-1 Complete notes examples
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
 
Effects of rheological properties on mixing
Effects of rheological properties on mixingEffects of rheological properties on mixing
Effects of rheological properties on mixing
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
EduAI - E learning Platform integrated with AI
EduAI - E learning Platform integrated with AIEduAI - E learning Platform integrated with AI
EduAI - E learning Platform integrated with AI
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdf
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 

Foreaign Exchange Data Crawling and Analysis for Knowledge Discovery Leading to Informative Decision Making

  • 1. Foreign Exchange Data Crawling and Analysis for Knowledge Discovery Leading to Informative Decision Making Data mining Dr. Nasser Ghadiri Mostafa Arjmand
  • 2. The Presentation Include The Following : • Introduction • Data Coverage And Size • Framework Structure And The Data Fetching Mechanism • Experimental study • Ranking And Classification • Conclusions 2
  • 4. Introduction : • Foreign exchange (Forex) market • Development a framework • The framework allows streaming and visualization of historical (previous) and current currency prices in close to real time • The framework benchmarks every monitored broker to decide whether he/she is trustworthy 4
  • 5. Broker • However, human decision making process is mostly subjective and emotional • It is not necessary that every broker considers the right factors affecting exchange rate, or even gives the right weight to each 5
  • 6. CTS-Forex Performance Study Forex monitoring system designed to : • Automatically track and monitor brokers • By fetching, visualizing and analyzing their announced exchange rate • Our current effort focuses on EUR/USD exchange rate • covers 24.1% of forex market CTS FOREX PERFORMANCE STUDY MAIN FEATURES 6
  • 7. Main Components Of the Proposed Framework : Visualizing the Data Streams CTS FOREX PERFORMANCE STUDY MAIN FEATURES The ability to visualize previously captured data as well as current data is essential for a expert 1) Single Broker Visualization 2) Multiple Brokers Visualization 3) Ranking and Classification Visualization 7
  • 8. Main Components Of the Proposed Framework : Analyzing Captured Data CTS FOREX PERFORMANCE STUDY MAIN FEATURES The ability to analyze and interpret the captured data provides useful information for enhancing the decision making process Short term Medium term Long term 8
  • 10. Data Coverage And Size : The Data Structure Snapshot : The data for each broker • Date of the snapshot • Number of seconds recorded in the snapshot Tick : Each piece of data is called a tick • Date of the tick • The ASK price of the currency • The BID price of the currency Recording Periods Indeed data is the main source for knowledge discovery leading to informative decision making 10 4/5 are ignored
  • 11. Data Coverage And Size : Brokers Data Summary • They monitored a total of 52 brokers • All the collected data used is from the MetaTrader 4 platform BROKERS SUMMARY TABLE 11
  • 12. Data Coverage And Size : Daily Data Distribution (Monthly Stacked) 12
  • 13. FRAMEWORK STRUCTURE AND DATA FETCHING MECHANISM 13
  • 14. Framework Structure And The Data Fetching Mechanism System Block Diagram 14
  • 15. Framework Features • Main function of this framework is capturing and visualizing stream data • The number of online brokers is increasing • This rapid increase has to be handled by our framework by having the ability of easily add new brokers • Brokers use different platforms to provide traders with several features with minimum modifications • The framework should provide runtime visualization as well as visualization of previously captured data 15
  • 16. Framework Features Framework should provide the ability to analyze data at three levels: 1) Short term analysis 2) Medium term analysis 3) Long term analysis 16
  • 17. Available Resources Meta Trader 4 • The software consists of two components: • Client • Server • The server is used by brokers • To provide a client component to their clients • The client provides end users with the ability to : • View live streaming prices • Place orders • Write their own scripts that automate the trading process 17
  • 18. Client Applications Installation • First mission is install the client applications (Meta Trader) to start fetching the data • The problem we faced was fitting all clients on the same computer as they consume a lot of memory • Distribute client applications • save more memory for short term analysis 18
  • 19. Local Data Centralization Client-Server ModelDatabase Insertion and Selection Model The first model we came up with was a client/server model The second model is a database insertion and selection model Although the first showed better results • Decided to stick to the second model • It provides better integrity 19
  • 20. Global Data Centralization Global Data Centralization We have all the fetched data in each computer centralized in its fetcher application • This allows us to run short term analysis and • Some medium term analysis • But it does not allow us to run long term analysis • brokers ranking • benchmarking • Although centralizing data will have the drawback 20
  • 21. Online/Local Visualization One of the implemented features was a visualization system for online users • Track their favorite brokers and compare them against others • The first model was storing data into a MySQL database • The second model was pre-compiling the data into JSON data files We still had to find a way to pre-compile the data stored locally on our computers and push it to the online server Database model VS JSON model21
  • 22. Manager Application Online Visualization Structure The manager application will be responsible for • Running a lot of medium and long term analysis • Ranking the brokers • Benchmarking them • Prioritizing the snapshots 22
  • 24. Experimental study Windows 8 Pro with 32GB installed memory (RAM). Intel Core i7-3770 CPU @ 3.40GHz and 3.40 GHz. The first disk is a HDD second is a SSD. DATA SET SUMMARY Local centralization study Visualization component study 24
  • 25. Local Data Centralization Study Local Data Centralization (Model 1 vs Model 2 HHD) Week 4 25
  • 26. Visualization Study WEEK 1 - VISUALIZATION THROUGHPUT RESULTS (MODEL 1 VS MODEL 2) WEEK 2 - VISUALIZATION THROUGHPUT RESULTS (MODEL 1 VS MODEL 2) Fetching data from the database Fetching data from pre- compiled JSON files 26
  • 28. Ranking And Classification Our main focus is to study performance of brokers and find trustworthy ones The two approaches are supplementary 28
  • 29. Feed-price performance Feed price performance key features Spread Consistency 29
  • 30. Ranking And Classification 1) The Algorithm: 30
  • 31. Ranking And Classification • The ranking algorithm was just the first step for distinguishing good brokers • We do not have any gold standard rules to specify ranges of good and bad brokers’ scores • Therefore, we decided to use a clustering algorithm • For each ranked snapshot, we cluster the brokers based on their final scores • Thus we are clustering one-dimensional data. 31
  • 32. Clustering Hierarchical Density Based K-Means Distribution of Brokers w.r.t. their Most Common Class In our case, we want to find trustworthy brokers, gray, and bad ones 32
  • 33. Ranking And Classification 2) Membership: To determine whether a broker is good or not, we have to study their memberships in all classes during their life cycle 33
  • 35. CONCLUSIONS • We Present a group of issues that exist in the current forex market • Designed and implemented a full framework • To monitor a list of brokers by fetching their data continuously • Comparing them to each other and thus speed up and enhance domain experts’ decision making process 35
  • 36. 36 THANK YOU FOR LISTENING •‫مطالب‬‫تکراری‬ •‫شده‬ ‫ارائه‬ ‫کار‬ ‫با‬ ‫مقاله‬ ‫تیتر‬ ‫نداشتن‬ ‫تطابق‬ •‫شده‬ ‫ارائه‬ ‫پراکنده‬ ‫صورت‬ ‫به‬ ‫آینده‬ ‫کارهای‬ •‫و‬ ‫شکلها‬ ‫آوردن‬ ‫در‬ ‫سازماندهی‬ ‫و‬ ‫نظم‬ ‫داشتن‬ ‫عدم‬‫جداول‬ •‫غیر‬ ‫برای‬ ‫چارچوب‬‫از‬EUR-USD •‫بندی‬ ‫خوشه‬ ‫مورد‬ ‫در‬ ‫کم‬ ‫توضیحات‬ •‫مطالب‬ ‫پراکندگی‬ •‫داده‬ ‫پنجم‬ ‫یک‬‫ها‬ •‫شده‬ ‫ارائه‬ ‫چارچوب‬ ‫با‬ ‫کار‬ ‫نحوه‬ ‫چگونگی‬

Editor's Notes

  1. As explained in the next section, our framework avoids recording information all the time; instead it only records 1/5 of the available data. The other 4/5 are ignored since our algorithm tends to study the feed-price performance and brokers are expected to be consistent all the time so it shouldn’t really matter what time portion is recorded for the analysis. The number of recording periods and the number of seconds per recording can be easily modified in our system from the administrator’s portal.
  2. In our situation, we used two computers, with 4GB RAM each, to install client applications. We installed 30 applications on the first computer and 25 applications on the second computer.
  3. Saturday and Sunday when the market is closed First used to study the visualization component. The second dataset includes week 3 and week 4 to study local data centralization. The local centralization study compares two models by testing every tick separately, and watch the throughput of each model. The visualization component study compares two models by visualizing chunks of ticks at the same time.
  4. The first model is a client/server model that directly sends updated ticks to the fetcher application through a socket with a predefined port number. The second model is a database model that inserts every updated tick into a database to be fetched later by the fetcher application.
  5. record only one minute every five minutes. Test were not run simultaneously but consecutively. However, Table X shows results of the experiments which were conducted in the second week scenarios, where each test was run simultaneously to check the effect it has on the throughput. It seems that it had an effect in model 1 but not in model 2.
  6. The ranking algorithm runs separately on each snapshot period. then we create weekly, monthly, and yearly reports by finding the average score of all snapshot periods combined together.
  7. 10% threshold provided the best distribution that helps in differentiating between trusted brokers (class 1 and class 2 with minimum values), gray brokers (class 3 with maximum value), and bad brokers (class 4)