Grid Asia2008 Low Latency Data Grid

•Download as PPT, PDF•

0 likes•478 views

Investment banks rely extensively on grids to dramatically increase throughput for their calculations for analytics (especially risk). The traditional design pattern involves executing compute intensive workflows where jobs require movement of large data files to the compute nodes, calculation results creating files which then are again consumed by the next job in the flow. Increasingly, the pattern is shifting to running short lived tasks where the bottleneck is data i.e. the time spent to move data back and forth between compute nodes can be overwhelming - turning a compute bound job to be a IO bound one. For instance, real time pricing for financial derivative instruments could just take a few milliseconds, but, the time required for the data transfer could be hundreds of milliseconds. The talk focuses on one architectural pattern gaining popularity - move the compute to the data. The data is partitioned in grid memory across many nodes and the compute task is routed to the node with the right data set provisioned based on the data hints it provides during launch. We discuss the features of the main-memory based data grid solution that uses different data partitioning policies such as hashing or data relationship based to manage data across a large cluster of nodes. We also discuss techniques for rebalancing data and behavior across the Grid nodes to achieve the best throughput and lowest latency.

Technology

Low Latency Data Grids in Finance Jags Ramnarayan Chief Architect GemStone Systems [email_address]

Background on GemStone Systems ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Use of Grid computing in finance ,[object Object],[object Object],[object Object]

State of affairs – Risk Analytics ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

State of affairs – Pricing (derivatives) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Where is the problem? Compute farm Data warehouses Rational databases ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],File system CPU bound job turns into a IO bound Job Grid Scheduler

Data Fabric for Risk Analytics When data is stored, it is transparently replicated and/or partitioned; Redundant storage can be in memory and/or on disk— ensures continuous availability Keep reference data replicated on many; partition trade data Machine nodes can be added dynamically to expand storage capacity or to handle increased client load Pool memory (and disk) across cluster ; parallelize data access and computation to achieve very high aggregate throughput

Data Fabric for Risk Analytics TaskFlow - As results are generated push events to compute nodes to initiate subsequent computation Avoid bulk data transfer across tasks or Jobs Thousands of compute nodes can maintain local cache of most frequently used data; Optionally use local disk for overflow Move reference data to local cache Synchronous read through, write through or Asynchronous write-behind to other data sources and sinks

Move business logic to data f 1 , f 2 , … f n FIFO Queue Data fabric Resources Exec functions Sept Trades Submit (f1) -> AggregateHighValueTrades(<input data>, “ where trades.month=‘Sept ’) Function (f1) Function (f2) ,[object Object],[object Object],[object Object],[object Object],[object Object]

Key lessons ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

What's hot

Are New Orleans Data Centers Making Green Strategies a Priority? (SlideShare)SP Home Run Inc.

Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...yashbheda

Analysis of big data in pandemic case Muh Saleh

Big data toolsNovita Sari

ThilgaTHILAKAVATHIRAMRAJ

Big Data EcosystemIvo Vachkov

Big Data and OSS at IBMBoulder Java User's Group

Data Centers In USmsirmajritchie

NoSQL Type, Bigdata, and AnalyticsSandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW

Nagios Conference 2013 - Thomas Dunbar - Building Technology for Storage Syst...Nagios

Trends in Database ManagementMarlon Jamera

Data Center Automation - Cisco ASAP Data CenterE.S.G. JR. Consulting, Inc.

Data warehouseingSajan Sahu

Three Things to Consider When Making Investments in Your Big Data InfrastructureFlyData Inc.

BigDataShankar R

Big data introductionChirag Ahuja

Thinking Outside the TableOntotext

R programming analysisdigitaladitya

Top 10 data science technologiesBrainware University

Big data frameworksCuelogic Technologies Pvt. Ltd.

What's hot (20)

Are New Orleans Data Centers Making Green Strategies a Priority? (SlideShare)

Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...

Analysis of big data in pandemic case

Big data tools

Thilga

Big Data Ecosystem

Big Data and OSS at IBM

Data Centers In US

NoSQL Type, Bigdata, and Analytics

Nagios Conference 2013 - Thomas Dunbar - Building Technology for Storage Syst...

Trends in Database Management

Data Center Automation - Cisco ASAP Data Center

Data warehouseing

Three Things to Consider When Making Investments in Your Big Data Infrastructure

BigData

Big data introduction

Thinking Outside the Table

R programming analysis

Top 10 data science technologies

Big data frameworks

Similar to Grid Asia2008 Low Latency Data Grid

Hdf5Smith Kim

(Speaker Notes Version) Architecting An Enterprise Storage Platform Using Obj...Niraj Tolia

Introduction to Data WarehousingJason S

BigdataShankar R

Waters Grid & HPC Coursejimliddle

DM Radio Webinar: Adopting a Streaming-Enabled ArchitectureDATAVERSITY

Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...Maginatics

Big data analysis concepts and referencesInformation Security Awareness Group

Enterprise Data and Analytics Architecture Overview for Electric UtilityPrajesh Bhattacharya

BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...Big Data Week

Best practices and trends in people softHazelknight Media & Entertainment Pvt Ltd

Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...Kevin Mao

Introduction Big DataFrank Kienle

ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...DATAVERSITY

Alluxio - Virtual Unified File System Alluxio, Inc.

London Cloud Computing Meetup: From GigaSpaces to the Cloud - a demonstration...Skills Matter

Vikram Andem Big Data Strategy @ IATA Technology Roadmap IT Strategy Group

Hadoop introductionSubhas Kumar Ghosh

Big Data .. Are you ready for the next wave?Mahmoud Sabri

TSE_Pres12.pptxssuseracaaae2

Similar to Grid Asia2008 Low Latency Data Grid (20)

Hdf5

(Speaker Notes Version) Architecting An Enterprise Storage Platform Using Obj...

Introduction to Data Warehousing

Bigdata

Waters Grid & HPC Course

DM Radio Webinar: Adopting a Streaming-Enabled Architecture

Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...

Big data analysis concepts and references

Enterprise Data and Analytics Architecture Overview for Electric Utility

BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...

Best practices and trends in people soft

Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...

Introduction Big Data

ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...

Alluxio - Virtual Unified File System

London Cloud Computing Meetup: From GigaSpaces to the Cloud - a demonstration...

Vikram Andem Big Data Strategy @ IATA Technology Roadmap

Hadoop introduction

Big Data .. Are you ready for the next wave?

TSE_Pres12.pptx

Recently uploaded

How to write a Business Continuity PlanDatabarracks

Data governance with Unity Catalog PresentationKnoldus Inc.

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada

How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3

Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers

Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765

Decarbonising Buildings: Making a net-zero built environment a realityIES VE

How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe

Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen

Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González

TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc

Top 10 Hubspot Development Companies in 2024TopCSSGallery

Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3

[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3

Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani

Scale your database traffic with Read & Write split using MySQL RouterMydbops

So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda

Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll

Recently uploaded (20)

How to write a Business Continuity Plan

Data governance with Unity Catalog Presentation

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024

How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx

Design pattern talk by Kaya Weers - 2024 (v2)

Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration

Decarbonising Buildings: Making a net-zero built environment a reality

How AI, OpenAI, and ChatGPT impact business and software.

Testing tools and AI - ideas what to try with some tool examples

Generative Artificial Intelligence: How generative AI works.pdf

TrustArc Webinar - How to Build Consumer Trust Through Data Privacy

Top 10 Hubspot Development Companies in 2024

Moving Beyond Passwords: FIDO Paris Seminar.pdf

[Webinar] SpiraTest - Setting New Standards in Quality Assurance

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx

Potential of AI (Generative AI) in Business: Learnings and Insights

Scale your database traffic with Read & Write split using MySQL Router

So einfach geht modernes Roaming fuer Notes und Nomad.pdf

Emixa Mendix Meetup 11 April 2024 about Mendix Native development

Grid Asia2008 Low Latency Data Grid

1. Low Latency Data Grids in Finance Jags Ramnarayan Chief Architect GemStone Systems [email_address]

7. Data Fabric for Risk Analytics When data is stored, it is transparently replicated and/or partitioned; Redundant storage can be in memory and/or on disk— ensures continuous availability Keep reference data replicated on many; partition trade data Machine nodes can be added dynamically to expand storage capacity or to handle increased client load Pool memory (and disk) across cluster ; parallelize data access and computation to achieve very high aggregate throughput

8. Data Fabric for Risk Analytics TaskFlow - As results are generated push events to compute nodes to initiate subsequent computation Avoid bulk data transfer across tasks or Jobs Thousands of compute nodes can maintain local cache of most frequently used data; Optionally use local disk for overflow Move reference data to local cache Synchronous read through, write through or Asynchronous write-behind to other data sources and sinks

10.

Grid Asia2008 Low Latency Data Grid

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Grid Asia2008 Low Latency Data Grid

Similar to Grid Asia2008 Low Latency Data Grid (20)

Recently uploaded

Recently uploaded (20)

Grid Asia2008 Low Latency Data Grid