SlideShare a Scribd company logo
Legal Informatics w/ AWS CloudSearch &
High-Performance Financial Market Apps
                         AWS Michigan Meetup

                            October 9th, 2012

                           http://www.solidlogic.com
                                                       1
Objectives


 ‣   Introduce Quantitative Trading


 ‣   Present a case study on AWS usage in Quantitative Trading System
     Evaluation.


 ‣   Discuss potential improvements upon our presented architecture.




                                                    http://www.solidlogic.com
                                                                                2
About
 Us




        http://www.solidlogic.com
                                    3
Solid Logic Technology develops innovative custom
technology solutions across a variety of industries
using leading software, infrastructure and
business practices.




                                  http://www.solidlogic.com
                                                              4
About us
Our expertise                                              Industry experience
Infrastructure and cloud computing                         ‣   Financial and legal services
‣   Scalable, programmatic infrastructure                  ‣   Logistics
    management
                                                           ‣   Automotive
‣   Strategic data center design
                                                           ‣   Defense and homeland security
‣   VMware architecture and management
                                                           ‣   Consumer sales and service
‣   Multi-cloud development and deployment
‣   Scalable web infrastructure with CDN
                                                           ‣   Academic and scientific research
‣   Security and compliance methods and
    implementation
Software development                                       Company Information
‣   Analytical solutions - simulation, optimization, big   ‣   Founded in 2011
    data, natural language processing, quant. finance
                                                           ‣   Entirely mobile company
‣   Enterprise content management, workflow
    solutions, system integration                          ‣   Develop both internal projects (IP) and
‣   Oracle Transportation Management                           client software solutions
‣   Database technology (Oracle, Vertica, Postgres,
    Cassandra, etc.)
‣   Web application and website development

                                                                           http://www.solidlogic.com
                                                                                                         5
Solid Logic Management Team
‣       Eric Detterman, CEO and Co-Founder
    •        Professional Experience
         -         Legal IT Business Analyst, Lean Startup, Cloud Computing, Processing Engineering and Consulting
         -         Researched and developed core investment strategies for Birmingham, MI RIA
               -       Currently in production and managing > $20M, AUM growth > 50% annually
         -         Proprietary trading (equities, futures, options), web and software development
    •        Education: B.S. Economics – Oakland University
‣       Michael Bommarito, CIO and Partner
    •        Relevant Experience
         -         “Big data” consultant, Oracle ERP architect, Linux cluster administrator.
         -         Software developer - NYC-based quantitative hedge fund
         -         Consultant - multiple quantitative hedge funds
    •        Education : M.S.E Financial Engineering, M.S. Political Science, B.S. Mathematics –
             University of Michigan
‣       Ronald Redmer, Board Member and Lead Technical Advisor
    •        Relevant Experience
         -         CIO, National Default Exchange (NDeX), a business unit of The Dolan Company (NYSE:DM)
         -         CEO defense supplier company, Airport systems software, CEO auto testing company, Affina –
                   software dev mgr, EDS - tech lead                           http://www.solidlogic.com
                                                                                                                     6
Case Study:
Proprietary Trading Simulation




                     http://www.solidlogic.com
                                                 7
Case Study: Proprietary Trading Simulation


 Quantitative Trading and Investment Systems:
 ‣ (Loose) Definition:
   •   Rules-based mathematical ‘model’ created by testing and validating a hypothesis
       about how a tradable market acts or optimizing parameters to create an equation
       to describe the market.
   •   The goal is to outperform the broad market (S&P 500) or some benchmark after
       costs.
 ‣ Example Strategy:
   •   Investment universe = ~50 Fidelity Mutual Funds
   •   Strategy #1: Invest in the top six ranked mutual funds based on proprietary
       momentum (p0 > p-1) based ranking algorithm. Analyze and rank fund universe
       every 45 days and re-allocate.
   •   Strategy #2: Invest in the top six ranked mutual funds based on proprietary mean
       reversion (p0 > p-1) based ranking algorithm. Analyze and rank fund universe
       every 45 days and re-allocate.                       http://www.solidlogic.com
                                                                                        8
Case Study: Proprietary Trading Simulation


        Hypothetical Example




                                 http://www.solidlogic.com
                                                             9
Case Study: Proprietary Trading Simulation


 Challenge:                                         Scope:
 ‣       Characterize the performance and           ‣   Assets                               62
         sensitivity of an equity trading
                                                    ‣   Tests/asset                          96
         system across input parameters and
         market conditions                          ‣   Total tests                   5,952
 ‣       Optimize parameters based on profit
         and risk measures                          Test Information
 ‣       Estimated runtime is unacceptable
                                                    ‣   Mean components/asset            395
         on local workstation (>1 month)
 ‣       Primary bottlenecks are in dense           ‣   Points/component               3,135
         linear algebra operations                  ‣   Points/test              1,238,325
     •      Spectral decomposition (ARPACK)
            Pairwise comparison of higher-order
                                                        Total elements 7,370,510,400
     •
            distribution moments (M-M arithmetic)   ‣


                                                                 http://www.solidlogic.com
                                                                                                  10
Case Study: Proprietary Trading Simulation


 Potential solutions:
 ‣ Run on existing hardware – wait for results
 ‣ Physical or virtualized servers with supporting job schedulers –
   requires hardware, software, and specialized labor
 ‣ Setup cloud infrastructure to process work – requires software
   and specialized labor




                                             http://www.solidlogic.com
                                                                         11
Trading Simulation: Architecture
This was our initial version – Not overly elegant, but works very well
with minimal effort to setup. Easy to improve upon.




                                                             Strategy Test
    Trading System                                              Results
   Source Code and                Custom                     (S3 Buckets)
      Config Data                 Created
       (Git Repo)                AMIs (x16)

                               Availability Zone

                               US East Region




                                                   http://www.solidlogic.com
                                                                               12
Trading Simulation: Test Process


     Trading
     System
   Source Code
    (Git Repo)




                                                   Strategy Test
                                                    Results (S3
                        Custom                       Buckets)
                        Created
                       AMIs (x16)

                     Availability Zone

                     US East Region
      Local
   Development
   Environment


                                         http://www.solidlogic.com
                                                                     13
Trading Simulation: Overview


 Technology Solution:                      Compute Instance (x16):
 ‣       Built an optimized simulation     ‣   88 Elastic Compute Units (ECU)
         environment as virtual image      ‣   2x Xeon E5-2670s-16 cores
         (AWS EC2 AMI)
                                           ‣   60.5GB RAM
 ‣       Provisioned and configured
                                           ‣   10GbE, dual NIC
         centralized storage (AWS S3)
                                           ‣   3+TB instance scratch
     •     Experiment configuration
     •     Simulation input                Total Compute Resources:
     •     Simulation output               ‣   1408 ECUs
     •     Post-processed results          ‣   512 concurrent threads (HT)
 ‣       Fully automated deployment of     ‣   968GB RAM
         simulation to instances through
         master source control system      (1 ECU~=5GFlops)
         (git)                                       http://www.solidlogic.com
                                                                                 14
Trading Simulation: AMI Creation Process


 ‣ Use standard Ubuntu Server 12.04.1 LTS for Cluster Instances
   AMI x64 (ami-eb7bcf82)
   •   cc2.8xLarge – 88ECUs, 16 cores, 60.5GB RAM

 ‣ Install git, s3cmd, PostgreSQL JDBC drivers
 ‣ Install and configure test environment and all dependencies
 ‣ Create new AMI based on the above




                                                 http://www.solidlogic.com
                                                                             15
Trading Simulation: Test Execution


 ‣ For each test instance…..
   •    ssh -X -i /home/ericd/.aws/first/name.pem ubuntu@IP
   •    cd /home/ubuntu/testcode/tradingsystemsales
   •    git pull
   •    cd /usr/local/testcode//bin
   •    sudo ./testcode -nodesktop

   •     parameterSweepSingleNode('Yes','Yes',
       'homeubuntutestcodetradingsystemsalesmodelsdailyAdaptiveStateSpaceSPYdatamasterlist.mat',
       '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/data/etfs', 'csv', '',
       '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/data/etfs', 1,
       'homeubuntutestcodetradingsystemsalesmodelsdailyAdaptiveStateSpaceSPYdataETFsToTest.csv',
       '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/data/etfs', 'mat',
       '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/src', 'No', 'No')

   •     parameterSweepSingleNode('No','Yes',
       'homeubuntutestcodetradingsystemsalesmodelsdailyAdaptiveStateSpaceSPYdatamasterlist.mat',
       '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/data/etfs', 'csv', '',
       '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/data/etfs', 1,
       'homeubuntutestcodetradingsystemsalesmodelsdailyAdaptiveStateSpaceSPYdataETFsToTest.csv',
       '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/data/etfs', 'mat',
       '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/src', 'No', 'No')



                                                                                     http://www.solidlogic.com
                                                                                                                 16
Trading Simulation: Initial Test Results


 ‣ Result sets saved to S3 buckets using S3cmd
   •   Approximately 6000 result sets




                                        http://www.solidlogic.com
                                                                    17
Trading Simulation: Output




‣ Run Time:
 •   Cloud: 45 hours
 •   Single-seat: 1-2 months
 •   Order of magnitude improvement in time!
                                         http://www.solidlogic.com
                                                                     18
Trading Simulation: Economics

                                      Co-            On-         Spot
                      On-Site
                                    Location       Demand       Pricing           Cost Model Details
Server Hardware /                                                             ‣    Cost estimates using assumptions
                     $69,045        $69,045         $1,647         $193
Instance Usage
                                                                                   and calculations in Cost Comparison
Network Hardware       13,809         13,809                -             -
                                                                                   Worksheet
Hardware Maint.        24,856         24,856                -             -   ‣    Costs represent one year annualized
                                                                                   costs. Assumes a useful life of three
Operating System                -              -            -             -
                                                                                   years for purchased equipment
Power and Cooling       9,907                  -            -             -   ‣    1= Cost savings using On-Site as
Data Center                                                                        baseline
Construction / Co-      8,618         65,136                -             -
Location Expense
                                                                              ‣    2= On-Site and Co-Location assume
Admin. / Remote
                                                                                   100% usage
                     105,000             240                -             -
Hands Support                                                                 ‣    3= Based on actual 686 machine
Data Transfer               1              4            1                 1        hours used
Total                $231,237 $173,091              $1,647         $193



Cost Savings1             N/A        25.12%         99.29%       99.92%

$ / Compute Hr.2,3     $26.40         $19.76         $2.40        $0.28

                                                                                         http://www.solidlogic.com
                                                                                                                           19
Trading Simulation: Next Steps


 Potential Improvements:
 ‣ Develop improved cloud infrastructure management tools
   • Allocation of work across instances
   • Allow user defined completion time and programmatically
     scale compute resources to work towards goal
   • Spread work across unused internal and available external
     compute resources




                                           http://www.solidlogic.com
                                                                       20
Thank you


    Eric Detterman                            Michael Bommarito
    CEO, Co-Founder                                   CIO, Partner
  Eric.Detterman@solidlogic.com              Michael.Bommarito@solidlogic.com

    Direct: (248) 792 – 8001                        Direct: (646) 450 – 3387




                               (248) 792 – 8000
                               www.solidlogic.com

                          330 East Maple Rd. #231
                           Birmingham, MI 48009                                 21

More Related Content

Similar to Finance Trading in The Cloud - AWS Michigan Meetup

ICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data Science
Karan Sachdeva
 
Data Engineer's Lunch #60: Series - Developing Enterprise Consciousness
Data Engineer's Lunch #60: Series - Developing Enterprise ConsciousnessData Engineer's Lunch #60: Series - Developing Enterprise Consciousness
Data Engineer's Lunch #60: Series - Developing Enterprise Consciousness
Anant Corporation
 
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Big Data:  InterConnect 2016 Session on Getting Started with Big Data AnalyticsBig Data:  InterConnect 2016 Session on Getting Started with Big Data Analytics
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Cynthia Saracco
 
A journey to faster, repeatable data commercialization
A journey to faster, repeatable data commercializationA journey to faster, repeatable data commercialization
A journey to faster, repeatable data commercialization
Institute of Contemporary Sciences
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
DATAVERSITY
 
엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...
엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...
엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...
Amazon Web Services Korea
 
Boot camp - Migration to AWS
Boot camp - Migration to AWSBoot camp - Migration to AWS
Boot camp - Migration to AWS
Amazon Web Services
 
All-Flash Versus Hybrid VMware Virtual SAN™: Performance vs. Price
All-Flash Versus Hybrid VMware Virtual SAN™: Performance vs. Price All-Flash Versus Hybrid VMware Virtual SAN™: Performance vs. Price
All-Flash Versus Hybrid VMware Virtual SAN™: Performance vs. Price
Western Digital
 
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture MaturityADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
DATAVERSITY
 
Get ready for_an_autonomous_data_driven_future_ext
Get ready for_an_autonomous_data_driven_future_extGet ready for_an_autonomous_data_driven_future_ext
Get ready for_an_autonomous_data_driven_future_ext
Oracle Developers
 
Enterprise Blockchains – A Pragmatic & Realistic Guide for CIOs/CDOS
Enterprise Blockchains – A Pragmatic & Realistic Guide for CIOs/CDOSEnterprise Blockchains – A Pragmatic & Realistic Guide for CIOs/CDOS
Enterprise Blockchains – A Pragmatic & Realistic Guide for CIOs/CDOS
Iceventure
 
Meetup Spark UDF performance
Meetup Spark UDF performanceMeetup Spark UDF performance
Meetup Spark UDF performance
Guilherme Braccialli
 
Estrategias para explotar las tendencias de SaaS y Cloud Computing
Estrategias para explotar las tendencias de SaaS y Cloud ComputingEstrategias para explotar las tendencias de SaaS y Cloud Computing
Estrategias para explotar las tendencias de SaaS y Cloud Computing
Software Guru
 
Infrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
Infrastructure Solutions for Deploying AI/ML/DL Workloads at ScaleInfrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
Infrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
Robb Boyd
 
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesInsights into Real-world Data Management Challenges
Insights into Real-world Data Management Challenges
DataWorks Summit
 
Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...
Indrajit Poddar
 
Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...
Domino Data Lab
 
Journey to cloud engineering
Journey to cloud engineeringJourney to cloud engineering
Journey to cloud engineering
Md. Sadhan Sarker
 
Cloud-native Application Lifecycle Management
Cloud-native Application Lifecycle ManagementCloud-native Application Lifecycle Management
Cloud-native Application Lifecycle Management
Neil Gehani
 
PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...
PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...
PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...
Deepak Chandramouli
 

Similar to Finance Trading in The Cloud - AWS Michigan Meetup (20)

ICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data Science
 
Data Engineer's Lunch #60: Series - Developing Enterprise Consciousness
Data Engineer's Lunch #60: Series - Developing Enterprise ConsciousnessData Engineer's Lunch #60: Series - Developing Enterprise Consciousness
Data Engineer's Lunch #60: Series - Developing Enterprise Consciousness
 
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Big Data:  InterConnect 2016 Session on Getting Started with Big Data AnalyticsBig Data:  InterConnect 2016 Session on Getting Started with Big Data Analytics
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
 
A journey to faster, repeatable data commercialization
A journey to faster, repeatable data commercializationA journey to faster, repeatable data commercialization
A journey to faster, repeatable data commercialization
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
 
엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...
엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...
엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...
 
Boot camp - Migration to AWS
Boot camp - Migration to AWSBoot camp - Migration to AWS
Boot camp - Migration to AWS
 
All-Flash Versus Hybrid VMware Virtual SAN™: Performance vs. Price
All-Flash Versus Hybrid VMware Virtual SAN™: Performance vs. Price All-Flash Versus Hybrid VMware Virtual SAN™: Performance vs. Price
All-Flash Versus Hybrid VMware Virtual SAN™: Performance vs. Price
 
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture MaturityADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
 
Get ready for_an_autonomous_data_driven_future_ext
Get ready for_an_autonomous_data_driven_future_extGet ready for_an_autonomous_data_driven_future_ext
Get ready for_an_autonomous_data_driven_future_ext
 
Enterprise Blockchains – A Pragmatic & Realistic Guide for CIOs/CDOS
Enterprise Blockchains – A Pragmatic & Realistic Guide for CIOs/CDOSEnterprise Blockchains – A Pragmatic & Realistic Guide for CIOs/CDOS
Enterprise Blockchains – A Pragmatic & Realistic Guide for CIOs/CDOS
 
Meetup Spark UDF performance
Meetup Spark UDF performanceMeetup Spark UDF performance
Meetup Spark UDF performance
 
Estrategias para explotar las tendencias de SaaS y Cloud Computing
Estrategias para explotar las tendencias de SaaS y Cloud ComputingEstrategias para explotar las tendencias de SaaS y Cloud Computing
Estrategias para explotar las tendencias de SaaS y Cloud Computing
 
Infrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
Infrastructure Solutions for Deploying AI/ML/DL Workloads at ScaleInfrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
Infrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
 
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesInsights into Real-world Data Management Challenges
Insights into Real-world Data Management Challenges
 
Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...
 
Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...
 
Journey to cloud engineering
Journey to cloud engineeringJourney to cloud engineering
Journey to cloud engineering
 
Cloud-native Application Lifecycle Management
Cloud-native Application Lifecycle ManagementCloud-native Application Lifecycle Management
Cloud-native Application Lifecycle Management
 
PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...
PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...
PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...
 

Recently uploaded

GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Ukraine
 
A Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's ArchitectureA Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's Architecture
ScyllaDB
 
What is an RPA CoE? Session 2 – CoE Roles
What is an RPA CoE?  Session 2 – CoE RolesWhat is an RPA CoE?  Session 2 – CoE Roles
What is an RPA CoE? Session 2 – CoE Roles
DianaGray10
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
Enterprise Knowledge
 
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
manji sharman06
 
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeckPoznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
FilipTomaszewski5
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
Pablo Gómez Abajo
 
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
Fwdays
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
AstuteBusiness
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
operationspcvita
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
Neo4j
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Neo4j
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
Neo4j
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
christinelarrosa
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
DanBrown980551
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
Fwdays
 
Christine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptxChristine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptx
christinelarrosa
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
DianaGray10
 

Recently uploaded (20)

GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
 
A Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's ArchitectureA Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's Architecture
 
What is an RPA CoE? Session 2 – CoE Roles
What is an RPA CoE?  Session 2 – CoE RolesWhat is an RPA CoE?  Session 2 – CoE Roles
What is an RPA CoE? Session 2 – CoE Roles
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
 
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
 
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeckPoznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
 
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
 
Christine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptxChristine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptx
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
 

Finance Trading in The Cloud - AWS Michigan Meetup

  • 1. Legal Informatics w/ AWS CloudSearch & High-Performance Financial Market Apps AWS Michigan Meetup October 9th, 2012 http://www.solidlogic.com 1
  • 2. Objectives ‣ Introduce Quantitative Trading ‣ Present a case study on AWS usage in Quantitative Trading System Evaluation. ‣ Discuss potential improvements upon our presented architecture. http://www.solidlogic.com 2
  • 3. About Us http://www.solidlogic.com 3
  • 4. Solid Logic Technology develops innovative custom technology solutions across a variety of industries using leading software, infrastructure and business practices. http://www.solidlogic.com 4
  • 5. About us Our expertise Industry experience Infrastructure and cloud computing ‣ Financial and legal services ‣ Scalable, programmatic infrastructure ‣ Logistics management ‣ Automotive ‣ Strategic data center design ‣ Defense and homeland security ‣ VMware architecture and management ‣ Consumer sales and service ‣ Multi-cloud development and deployment ‣ Scalable web infrastructure with CDN ‣ Academic and scientific research ‣ Security and compliance methods and implementation Software development Company Information ‣ Analytical solutions - simulation, optimization, big ‣ Founded in 2011 data, natural language processing, quant. finance ‣ Entirely mobile company ‣ Enterprise content management, workflow solutions, system integration ‣ Develop both internal projects (IP) and ‣ Oracle Transportation Management client software solutions ‣ Database technology (Oracle, Vertica, Postgres, Cassandra, etc.) ‣ Web application and website development http://www.solidlogic.com 5
  • 6. Solid Logic Management Team ‣ Eric Detterman, CEO and Co-Founder • Professional Experience - Legal IT Business Analyst, Lean Startup, Cloud Computing, Processing Engineering and Consulting - Researched and developed core investment strategies for Birmingham, MI RIA - Currently in production and managing > $20M, AUM growth > 50% annually - Proprietary trading (equities, futures, options), web and software development • Education: B.S. Economics – Oakland University ‣ Michael Bommarito, CIO and Partner • Relevant Experience - “Big data” consultant, Oracle ERP architect, Linux cluster administrator. - Software developer - NYC-based quantitative hedge fund - Consultant - multiple quantitative hedge funds • Education : M.S.E Financial Engineering, M.S. Political Science, B.S. Mathematics – University of Michigan ‣ Ronald Redmer, Board Member and Lead Technical Advisor • Relevant Experience - CIO, National Default Exchange (NDeX), a business unit of The Dolan Company (NYSE:DM) - CEO defense supplier company, Airport systems software, CEO auto testing company, Affina – software dev mgr, EDS - tech lead http://www.solidlogic.com 6
  • 7. Case Study: Proprietary Trading Simulation http://www.solidlogic.com 7
  • 8. Case Study: Proprietary Trading Simulation Quantitative Trading and Investment Systems: ‣ (Loose) Definition: • Rules-based mathematical ‘model’ created by testing and validating a hypothesis about how a tradable market acts or optimizing parameters to create an equation to describe the market. • The goal is to outperform the broad market (S&P 500) or some benchmark after costs. ‣ Example Strategy: • Investment universe = ~50 Fidelity Mutual Funds • Strategy #1: Invest in the top six ranked mutual funds based on proprietary momentum (p0 > p-1) based ranking algorithm. Analyze and rank fund universe every 45 days and re-allocate. • Strategy #2: Invest in the top six ranked mutual funds based on proprietary mean reversion (p0 > p-1) based ranking algorithm. Analyze and rank fund universe every 45 days and re-allocate. http://www.solidlogic.com 8
  • 9. Case Study: Proprietary Trading Simulation Hypothetical Example http://www.solidlogic.com 9
  • 10. Case Study: Proprietary Trading Simulation Challenge: Scope: ‣ Characterize the performance and ‣ Assets 62 sensitivity of an equity trading ‣ Tests/asset 96 system across input parameters and market conditions ‣ Total tests 5,952 ‣ Optimize parameters based on profit and risk measures Test Information ‣ Estimated runtime is unacceptable ‣ Mean components/asset 395 on local workstation (>1 month) ‣ Primary bottlenecks are in dense ‣ Points/component 3,135 linear algebra operations ‣ Points/test 1,238,325 • Spectral decomposition (ARPACK) Pairwise comparison of higher-order Total elements 7,370,510,400 • distribution moments (M-M arithmetic) ‣ http://www.solidlogic.com 10
  • 11. Case Study: Proprietary Trading Simulation Potential solutions: ‣ Run on existing hardware – wait for results ‣ Physical or virtualized servers with supporting job schedulers – requires hardware, software, and specialized labor ‣ Setup cloud infrastructure to process work – requires software and specialized labor http://www.solidlogic.com 11
  • 12. Trading Simulation: Architecture This was our initial version – Not overly elegant, but works very well with minimal effort to setup. Easy to improve upon. Strategy Test Trading System Results Source Code and Custom (S3 Buckets) Config Data Created (Git Repo) AMIs (x16) Availability Zone US East Region http://www.solidlogic.com 12
  • 13. Trading Simulation: Test Process Trading System Source Code (Git Repo) Strategy Test Results (S3 Custom Buckets) Created AMIs (x16) Availability Zone US East Region Local Development Environment http://www.solidlogic.com 13
  • 14. Trading Simulation: Overview Technology Solution: Compute Instance (x16): ‣ Built an optimized simulation ‣ 88 Elastic Compute Units (ECU) environment as virtual image ‣ 2x Xeon E5-2670s-16 cores (AWS EC2 AMI) ‣ 60.5GB RAM ‣ Provisioned and configured ‣ 10GbE, dual NIC centralized storage (AWS S3) ‣ 3+TB instance scratch • Experiment configuration • Simulation input Total Compute Resources: • Simulation output ‣ 1408 ECUs • Post-processed results ‣ 512 concurrent threads (HT) ‣ Fully automated deployment of ‣ 968GB RAM simulation to instances through master source control system (1 ECU~=5GFlops) (git) http://www.solidlogic.com 14
  • 15. Trading Simulation: AMI Creation Process ‣ Use standard Ubuntu Server 12.04.1 LTS for Cluster Instances AMI x64 (ami-eb7bcf82) • cc2.8xLarge – 88ECUs, 16 cores, 60.5GB RAM ‣ Install git, s3cmd, PostgreSQL JDBC drivers ‣ Install and configure test environment and all dependencies ‣ Create new AMI based on the above http://www.solidlogic.com 15
  • 16. Trading Simulation: Test Execution ‣ For each test instance….. • ssh -X -i /home/ericd/.aws/first/name.pem ubuntu@IP • cd /home/ubuntu/testcode/tradingsystemsales • git pull • cd /usr/local/testcode//bin • sudo ./testcode -nodesktop • parameterSweepSingleNode('Yes','Yes', 'homeubuntutestcodetradingsystemsalesmodelsdailyAdaptiveStateSpaceSPYdatamasterlist.mat', '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/data/etfs', 'csv', '', '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/data/etfs', 1, 'homeubuntutestcodetradingsystemsalesmodelsdailyAdaptiveStateSpaceSPYdataETFsToTest.csv', '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/data/etfs', 'mat', '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/src', 'No', 'No') • parameterSweepSingleNode('No','Yes', 'homeubuntutestcodetradingsystemsalesmodelsdailyAdaptiveStateSpaceSPYdatamasterlist.mat', '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/data/etfs', 'csv', '', '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/data/etfs', 1, 'homeubuntutestcodetradingsystemsalesmodelsdailyAdaptiveStateSpaceSPYdataETFsToTest.csv', '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/data/etfs', 'mat', '/home/ubuntu/testcode/tradingsystemsales/models/daily/AdaptiveStateSpaceSPY/src', 'No', 'No') http://www.solidlogic.com 16
  • 17. Trading Simulation: Initial Test Results ‣ Result sets saved to S3 buckets using S3cmd • Approximately 6000 result sets http://www.solidlogic.com 17
  • 18. Trading Simulation: Output ‣ Run Time: • Cloud: 45 hours • Single-seat: 1-2 months • Order of magnitude improvement in time! http://www.solidlogic.com 18
  • 19. Trading Simulation: Economics Co- On- Spot On-Site Location Demand Pricing Cost Model Details Server Hardware / ‣ Cost estimates using assumptions $69,045 $69,045 $1,647 $193 Instance Usage and calculations in Cost Comparison Network Hardware 13,809 13,809 - - Worksheet Hardware Maint. 24,856 24,856 - - ‣ Costs represent one year annualized costs. Assumes a useful life of three Operating System - - - - years for purchased equipment Power and Cooling 9,907 - - - ‣ 1= Cost savings using On-Site as Data Center baseline Construction / Co- 8,618 65,136 - - Location Expense ‣ 2= On-Site and Co-Location assume Admin. / Remote 100% usage 105,000 240 - - Hands Support ‣ 3= Based on actual 686 machine Data Transfer 1 4 1 1 hours used Total $231,237 $173,091 $1,647 $193 Cost Savings1 N/A 25.12% 99.29% 99.92% $ / Compute Hr.2,3 $26.40 $19.76 $2.40 $0.28 http://www.solidlogic.com 19
  • 20. Trading Simulation: Next Steps Potential Improvements: ‣ Develop improved cloud infrastructure management tools • Allocation of work across instances • Allow user defined completion time and programmatically scale compute resources to work towards goal • Spread work across unused internal and available external compute resources http://www.solidlogic.com 20
  • 21. Thank you Eric Detterman Michael Bommarito CEO, Co-Founder CIO, Partner Eric.Detterman@solidlogic.com Michael.Bommarito@solidlogic.com Direct: (248) 792 – 8001 Direct: (646) 450 – 3387 (248) 792 – 8000 www.solidlogic.com 330 East Maple Rd. #231 Birmingham, MI 48009 21