SlideShare a Scribd company logo
1 of 7
Download to read offline
Massive Scalability With InterSystems IRIS Data Platform
Technology Brief
Introduction
Faced with the enormous and ever-growing
amounts of data being generated in the world
today, software architects need to pay special
attention to the scalability of their solutions. They
must also design systems that can, when needed,
handle many thousands of concurrent users. It’s
not easy, but designing for massive scalability is
an absolute necessity.
Software architects have several options for
designing scalable systems. They can scale
vertically by using bigger machines with
dozens of cores. They can use data distribution
(replication) techniques to scale horizontally
for increasing numbers of users. And they can
scale data volume horizontally through the
use of a data partitioning strategy. In practice,
software architects will employ several of these
techniques, trading off hardware costs, code
complexity, and ease of deployment to suit their
particular needs.
This paper will discuss how InterSystems IRIS
Data Platform™ supports vertical scalability
and horizontal scalability of both user and
data volumes. It will outline several options for
distributing and partitioning data and/or user
volume, giving scenarios in which each option
would be particularly useful. Finally, this paper
will talk about how InterSystems IRIS helps
simplify the configuration and provisioning of
distributed systems.
Vertical Scaling
Perhaps the simplest way to scale is to do so
“vertically” — deploy on a bigger machine with
more CPUs and memory. InterSystems IRIS
supports parallel processing of SQL and includes
technology for optimizing the use of CPUs in
multi-core machines.
However, there are practical limits to what can be
achieved through vertical scaling alone. For one
thing, even the largest available machine may not
be able to handle the enormous data volumes
and workloads required by modern applications.
Also, “big iron” can be prohibitively expensive.
Many organizations find it more cost-effective to
buy, say, four 16-core servers than one 64-core
machine.
Capacity planning for single-server architectures
can be difficult, especially for solutions that are
likely to have widely varying workloads. Having
the ability to handle peak loads may result in
wasteful underutilization during off hours. On
the other hand, having too few cores may cause
performance to slow to a crawl during periods of
high usage. In addition, increasing the capacity
of a single server architecture implies buying an
entire new machine. Adding capacity “on the fly”
is impossible.
In short, although it is important for software to
leverage the full potential of the hardware on
which it is deployed, vertical scalability alone
is not enough to meet all but the most static
workloads.
Massive Scalability With InterSystems IRIS Data Platform 2
Technology Brief
Horizontal Scaling
For all of the reasons given above, most
organizations seeking massive scalability
will deploy on networked systems, scaling
workloads and/or data volumes “horizontally”
by distributing the work across multiple servers.
Typically, each server in the network will be an
affordable machine, but larger servers may be
used if needed to take advantage of vertical
scalability as well.
Software architects will recognize that no
two workloads are the same. Some modern
applications may be accessed by hundreds of
thousands of users concurrently, racking up very
high numbers of small transactions per second.
Others may only have a handful of users, but
query petabytes worth of data. Both are very
demanding workloads, but they require different
approaches to scalability. We will start by
considering each scenario separately.
Horizontal Scaling of User Volume
To support a huge number of concurrent users
(or transactions), InterSystems IRIS relies on
unique data caching technology called Enterprise
Cache Protocol (ECP).
Within a network of servers, one will be
configured as the data server where data is
persisted. The others will be configured as
application servers. Each application server runs
an instance of InterSystems IRIS and presents
data to the application as though it were a local
database. Data is not persisted on the application
servers. They are there to provide cache and CPU
processing power.
User sessions are distributed among the
application servers, typically through a load
balancer, and queries are satisfied from the
local application server cache, if possible.
Application servers will only retrieve data from
the data server if necessary. InterSystems IRIS
automatically synchronizes data between all
cluster participants.
With the compute work taken care of by the
application servers, the data server can be
dedicated mostly to persisting transaction
outcomes. Application servers can easily be
added to, or removed from, the cluster as
workloads vary. For example, in a retail use case,
you may want to add a few application servers
to deal with the exceptional load of Black Friday
and switch them off again after the holiday
season has finished.
Application servers are most useful for
applications where large numbers of transactions
must be performed, but each transaction only
affects a relatively small portion of the entire
data set. Deployments that use application
servers with ECP have been shown to support
many thousands of concurrent users in a variety
of industries.
Figure 1: Workload Distribution with Enterprise
Cache Protocol
cach
data
I
intelligent
InterSystems IRIS
workload distribution
cache cache
code code
cache
data
Massive Scalability With InterSystems IRIS Data Platform 3
 
Horizontal Scaling of Data Volume
When queries — usually analytic queries — must
access a large amount of data, the “working
dataset” that needs to be cached in order to
support the query workload efficiently may
exceed the memory capacity on a single
machine. InterSystems IRIS provides a capability
called sharding, which physically partitions large
database tables across multiple server instances.
Applications still access a single logical table on
an instance designated as the shard master. The
shard master decomposes incoming queries and
sends them to the shard servers, each of which
holds a distinct portion of the table data and
associated indices. The shard servers process
the shard-local queries in parallel, and send their
results back to the shard server for aggregation.
Data is partitioned among shard servers
according to a shard key, which can be
automatically managed by the system, or defined
by the software architect based on selected
table columns. Through the careful selection of
shard keys, tables that are often joined can be
co-sharded, so rows from those tables that would
typically be joined together are stored on the
same shard server, enabling the join to happen
entirely local to each shard server, and thus
maximizing parallelization and performance.
As data volumes grow, additional shards
can easily be added. Sharding is completely
transparent to the application and to users.
Not all tables need to be sharded. For example,
in analytics applications, facts tables (e.g. orders
in a retail scenario) are usually very large, and will
be sharded. The much smaller dimensions tables
(e.g. product, point of sale, etc.) will not be. Non-
sharded tables are persisted on the shard master.
If a query requires joins between sharded and
non-sharded tables, or if data from two different
shards must be joined, InterSystems IRIS uses
a highly efficient ECP-based mechanism to
correctly and efficiently satisfy the request. In
these cases, InterSystems IRIS will only share
between shards the rows that are needed, rather
than broadcasting entire tables over the network,
as many other technologies would. InterSystems
IRIS transparently improves the efficiency and
performance of big data query workloads
through sharding, without limiting the types of
queries that can be satisfied.
The InterSystems IRIS architecture enables
complex multi-table joins when querying
distributed, partitioned data sets without
requiring co-sharding, without replicating
data, and without requiring entire tables to be
broadcast across networks.
Figure 2: Sharding with Intelligent Inter-Shard
Communication
Technology Brief
InterSystems IRIS
intelligent inter-shard communication
with workload distribution
cache cache
shard master
cache cache
data data
rows
Shard Master
cache cache
data data
InterSystems IRIS
intelligent inter-shard communication
S
on
cache
code
rows
code code
Massive Scalability With InterSystems IRIS Data Platform 4
Scaling Both User and
Data Volumes
Many modern solutions must simultaneously
support both a high transaction rate (user
volume) and analytics on large volumes of data.
One example: a private wealth management
application that provides dashboards
summarizing clients’ portfolios and risk, in real
time based on current market data.
InterSystems IRIS enables such Hybrid
Transactional and Analytical Processing (HTAP)
applications by allowing application servers and
sharding to be used in combination. Application
servers can be added to the architecture pictured
in Figure 2 to distribute the workload on the
shard master. Workloads and data volumes
can be scaled independently of each other,
depending on the needs of the application.
 
When applications require the ultimate in
scalability (for example, if a predictive model
must score every record in a large table while
new records are being ingested and queried at
the same time) each individual data shard can act
as the data server in an ECP model. We refer to
the application servers that share the workloads
on data shards as “query shards.” This, combined
with the transparent mechanisms for ensuring
high availability of an InterSystems IRIS cluster,
provides solution architects with everything they
need to satisfy their solution’s unique scalability
and reliability requirements.
The comparative performance and efficiency
of the InterSystems IRIS approach to sharding
has been demonstrated and documented in a
benchmark test validated by a major technology
analyst firm. (Read their report here).1
In tests of
an actual mission critical financial services use
case, InterSystems IRIS was shown to be faster
than several highly specialized databases, while
requiring less hardware and accessing more data.
Technology Brief
Figure 3: Sharding and Workload Distribution
Massive Scalability With InterSystems IRIS Data Platform 5
Technology Brief
Flexible Deployment with
InterSystems Cloud Manager
InterSystems IRIS gives software developers
a great deal of flexibility when it comes to
designing a highly efficient, scalable solution.
But scalability may come at the cost of increased
complexity, as additional servers, taking on a
variety of roles, are added to the architecture.
To simplify the provisioning and deployment
of servers (whether physical or virtual) in a
distributed architecture, InterSystems IRIS uses
InterSystems Cloud Manager.
InterSystems Cloud Manager (which is included
with InterSystems IRIS) enables simple scripts
to be used for configuring InterSystems IRIS
containers as a data server, shard server, shard
master, application server, etc. Containers can
be deployed easily in public or private clouds.
They are also easily decommissioned, so scalable
architectures can be designed to grow or
contract with fluctuating needs.
Conclusion
Massive scalability is a requirement for modern
applications, particularly Hybrid Transactional
and Analytical Processing applications that
must handle very high workloads and data
volumes simultaneously. InterSystems IRIS Data
Platform gives software architects options for
cost-effectively scaling their applications. It
supports vertical scaling, application servers for
horizontally scaling user volume, and a highly
efficient approach to sharding for horizontally
scaling data volume that eliminates the need for
network broadcasts. All these technologies can
be used independently or in combination to tailor
a scalable architecture to an application’s specific
requirements.
For more information about InterSystems IRIS
Data Platform, visit InterSystems.com/IRIS.
Massive Scalability With InterSystems IRIS Data Platform 6
About InterSystems
InterSystems is the engine behind the world’s
most important applications. In healthcare,
finance, government, and other sectors where
lives and livelihoods are at stake, InterSystems
is the power behind what matters™. Founded in
1978, InterSystems is a privately held company
headquartered in Cambridge, Massachusetts
(USA), with offices worldwide, and its software
products are used daily by millions of people in
more than 80 countries. For more information,
visit InterSystems.com.
1
InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight, Kerry Dolan, Senior IT Validation Analyst, Enterprise Strategy Group, March 2018.
Massive Scalability With InterSystems IRIS Data Platform 8
InterSystems Corporation • One Memorial Drive • Cambridge, MA 02142 • InterSystems.com • Tel: +1.617.621.0600
© 2018 InterSystems Corporation. All rights reserved. InterSystems IRIS Data Platform is a trademark of InterSystems Corporation. 010518

More Related Content

What's hot

Exploring Microsoft Azure Infrastructures
Exploring Microsoft Azure InfrastructuresExploring Microsoft Azure Infrastructures
Exploring Microsoft Azure InfrastructuresCCG
 
Iaetsd mapreduce streaming over cassandra datasets
Iaetsd mapreduce streaming over cassandra datasetsIaetsd mapreduce streaming over cassandra datasets
Iaetsd mapreduce streaming over cassandra datasetsIaetsd Iaetsd
 
Big_SQL_3.0_Whitepaper
Big_SQL_3.0_WhitepaperBig_SQL_3.0_Whitepaper
Big_SQL_3.0_WhitepaperScott Gray
 
Hybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop ImplementationsHybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop ImplementationsDavid Portnoy
 
How One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 Million
How One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 MillionHow One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 Million
How One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 MillionDataWorks Summit
 
TheETLBottleneckinBigDataAnalytics(1)
TheETLBottleneckinBigDataAnalytics(1)TheETLBottleneckinBigDataAnalytics(1)
TheETLBottleneckinBigDataAnalytics(1)ruchabhandiwad
 
Business Growth Is Fueled By Your Event-Centric Digital Strategy
Business Growth Is Fueled By Your Event-Centric Digital StrategyBusiness Growth Is Fueled By Your Event-Centric Digital Strategy
Business Growth Is Fueled By Your Event-Centric Digital Strategyzitipoff
 
Hadoop Powers Modern Enterprise Data Architectures
Hadoop Powers Modern Enterprise Data ArchitecturesHadoop Powers Modern Enterprise Data Architectures
Hadoop Powers Modern Enterprise Data ArchitecturesDataWorks Summit
 
TCS_DATA_ANALYSIS_REPORT_ADITYA
TCS_DATA_ANALYSIS_REPORT_ADITYATCS_DATA_ANALYSIS_REPORT_ADITYA
TCS_DATA_ANALYSIS_REPORT_ADITYAAditya Srinivasan
 
Open source stak of big data techs open suse asia
Open source stak of big data techs   open suse asiaOpen source stak of big data techs   open suse asia
Open source stak of big data techs open suse asiaMuhammad Rifqi
 
No sql databases explained
No sql databases explainedNo sql databases explained
No sql databases explainedSalil Mehendale
 
Big Data Analytics Projects - Real World with Pentaho
Big Data Analytics Projects - Real World with PentahoBig Data Analytics Projects - Real World with Pentaho
Big Data Analytics Projects - Real World with PentahoMark Kromer
 
Designing modern dw and data lake
Designing modern dw and data lakeDesigning modern dw and data lake
Designing modern dw and data lakepunedevscom
 

What's hot (20)

Exploring Microsoft Azure Infrastructures
Exploring Microsoft Azure InfrastructuresExploring Microsoft Azure Infrastructures
Exploring Microsoft Azure Infrastructures
 
Iaetsd mapreduce streaming over cassandra datasets
Iaetsd mapreduce streaming over cassandra datasetsIaetsd mapreduce streaming over cassandra datasets
Iaetsd mapreduce streaming over cassandra datasets
 
Big_SQL_3.0_Whitepaper
Big_SQL_3.0_WhitepaperBig_SQL_3.0_Whitepaper
Big_SQL_3.0_Whitepaper
 
SAP HORTONWORKS
SAP HORTONWORKSSAP HORTONWORKS
SAP HORTONWORKS
 
Hadoop in a Nutshell
Hadoop in a NutshellHadoop in a Nutshell
Hadoop in a Nutshell
 
Hybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop ImplementationsHybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop Implementations
 
Microsoft AI Platform Whitepaper
Microsoft AI Platform WhitepaperMicrosoft AI Platform Whitepaper
Microsoft AI Platform Whitepaper
 
Hadoop 2.0 and yarn
Hadoop 2.0 and yarnHadoop 2.0 and yarn
Hadoop 2.0 and yarn
 
How One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 Million
How One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 MillionHow One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 Million
How One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 Million
 
hive lab
hive labhive lab
hive lab
 
TheETLBottleneckinBigDataAnalytics(1)
TheETLBottleneckinBigDataAnalytics(1)TheETLBottleneckinBigDataAnalytics(1)
TheETLBottleneckinBigDataAnalytics(1)
 
Business Growth Is Fueled By Your Event-Centric Digital Strategy
Business Growth Is Fueled By Your Event-Centric Digital StrategyBusiness Growth Is Fueled By Your Event-Centric Digital Strategy
Business Growth Is Fueled By Your Event-Centric Digital Strategy
 
Hadoop Powers Modern Enterprise Data Architectures
Hadoop Powers Modern Enterprise Data ArchitecturesHadoop Powers Modern Enterprise Data Architectures
Hadoop Powers Modern Enterprise Data Architectures
 
TCS_DATA_ANALYSIS_REPORT_ADITYA
TCS_DATA_ANALYSIS_REPORT_ADITYATCS_DATA_ANALYSIS_REPORT_ADITYA
TCS_DATA_ANALYSIS_REPORT_ADITYA
 
Open source stak of big data techs open suse asia
Open source stak of big data techs   open suse asiaOpen source stak of big data techs   open suse asia
Open source stak of big data techs open suse asia
 
No sql databases explained
No sql databases explainedNo sql databases explained
No sql databases explained
 
Big data landscape
Big data landscapeBig data landscape
Big data landscape
 
Big Data Analytics Projects - Real World with Pentaho
Big Data Analytics Projects - Real World with PentahoBig Data Analytics Projects - Real World with Pentaho
Big Data Analytics Projects - Real World with Pentaho
 
Designing modern dw and data lake
Designing modern dw and data lakeDesigning modern dw and data lake
Designing modern dw and data lake
 
Big data
Big dataBig data
Big data
 

Similar to Massive sacalabilitty with InterSystems IRIS Data Platform

Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy snehal parikh
 
CouchBase The Complete NoSql Solution for Big Data
CouchBase The Complete NoSql Solution for Big DataCouchBase The Complete NoSql Solution for Big Data
CouchBase The Complete NoSql Solution for Big DataDebajani Mohanty
 
Predictive maintenance withsensors_in_utilities_
Predictive maintenance withsensors_in_utilities_Predictive maintenance withsensors_in_utilities_
Predictive maintenance withsensors_in_utilities_Tina Zhang
 
Fundamentals of big data analytics and Hadoop
Fundamentals of big data analytics and HadoopFundamentals of big data analytics and Hadoop
Fundamentals of big data analytics and HadoopArchana Gopinath
 
ManMachine&Mathematics_Arup_Ray_Ext
ManMachine&Mathematics_Arup_Ray_ExtManMachine&Mathematics_Arup_Ray_Ext
ManMachine&Mathematics_Arup_Ray_ExtArup Ray
 
IEEE Paper - A Study Of Cloud Computing Environments For High Performance App...
IEEE Paper - A Study Of Cloud Computing Environments For High Performance App...IEEE Paper - A Study Of Cloud Computing Environments For High Performance App...
IEEE Paper - A Study Of Cloud Computing Environments For High Performance App...Angela Williams
 
What is Scalability and How can affect on overall system performance of database
What is Scalability and How can affect on overall system performance of databaseWhat is Scalability and How can affect on overall system performance of database
What is Scalability and How can affect on overall system performance of databaseAlireza Kamrani
 
InterSystems IRIS Data Platform: A Unified Platform for Powering Real-Time, D...
InterSystems IRIS Data Platform: A Unified Platform for Powering Real-Time, D...InterSystems IRIS Data Platform: A Unified Platform for Powering Real-Time, D...
InterSystems IRIS Data Platform: A Unified Platform for Powering Real-Time, D...InterSystems
 
Hadoop and Big Data Analytics | Sysfore
Hadoop and Big Data Analytics | SysforeHadoop and Big Data Analytics | Sysfore
Hadoop and Big Data Analytics | SysforeSysfore Technologies
 
Introduction to Information Storage.pptx
Introduction to Information Storage.pptxIntroduction to Information Storage.pptx
Introduction to Information Storage.pptxNISHASOMSCS113
 
A Survey on Data Mapping Strategy for data stored in the storage cloud 111
A Survey on Data Mapping Strategy for data stored in the storage cloud  111A Survey on Data Mapping Strategy for data stored in the storage cloud  111
A Survey on Data Mapping Strategy for data stored in the storage cloud 111NavNeet KuMar
 
Data Modernization_Harinath Susairaj.pptx
Data Modernization_Harinath Susairaj.pptxData Modernization_Harinath Susairaj.pptx
Data Modernization_Harinath Susairaj.pptxArunPandiyan890855
 
CS8091_BDA_Unit_IV_Stream_Computing
CS8091_BDA_Unit_IV_Stream_ComputingCS8091_BDA_Unit_IV_Stream_Computing
CS8091_BDA_Unit_IV_Stream_ComputingPalani Kumar
 
My aws cp study (autosaved)
My aws cp study (autosaved)My aws cp study (autosaved)
My aws cp study (autosaved)Vishnu Sure
 
IaaS - Infrastructure as a Service
IaaS - Infrastructure as a ServiceIaaS - Infrastructure as a Service
IaaS - Infrastructure as a ServiceRajind Ruparathna
 
Data-Intensive Technologies for Cloud Computing
Data-Intensive Technologies for CloudComputingData-Intensive Technologies for CloudComputing
Data-Intensive Technologies for Cloud Computinghuda2018
 

Similar to Massive sacalabilitty with InterSystems IRIS Data Platform (20)

Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy
 
CouchBase The Complete NoSql Solution for Big Data
CouchBase The Complete NoSql Solution for Big DataCouchBase The Complete NoSql Solution for Big Data
CouchBase The Complete NoSql Solution for Big Data
 
Predictive maintenance withsensors_in_utilities_
Predictive maintenance withsensors_in_utilities_Predictive maintenance withsensors_in_utilities_
Predictive maintenance withsensors_in_utilities_
 
Fundamentals of big data analytics and Hadoop
Fundamentals of big data analytics and HadoopFundamentals of big data analytics and Hadoop
Fundamentals of big data analytics and Hadoop
 
Scaling apps using azure cloud services
Scaling apps using azure cloud servicesScaling apps using azure cloud services
Scaling apps using azure cloud services
 
ManMachine&Mathematics_Arup_Ray_Ext
ManMachine&Mathematics_Arup_Ray_ExtManMachine&Mathematics_Arup_Ray_Ext
ManMachine&Mathematics_Arup_Ray_Ext
 
IEEE Paper - A Study Of Cloud Computing Environments For High Performance App...
IEEE Paper - A Study Of Cloud Computing Environments For High Performance App...IEEE Paper - A Study Of Cloud Computing Environments For High Performance App...
IEEE Paper - A Study Of Cloud Computing Environments For High Performance App...
 
What is Scalability and How can affect on overall system performance of database
What is Scalability and How can affect on overall system performance of databaseWhat is Scalability and How can affect on overall system performance of database
What is Scalability and How can affect on overall system performance of database
 
Rama1
Rama1Rama1
Rama1
 
InterSystems IRIS Data Platform: A Unified Platform for Powering Real-Time, D...
InterSystems IRIS Data Platform: A Unified Platform for Powering Real-Time, D...InterSystems IRIS Data Platform: A Unified Platform for Powering Real-Time, D...
InterSystems IRIS Data Platform: A Unified Platform for Powering Real-Time, D...
 
Hadoop and Big Data Analytics | Sysfore
Hadoop and Big Data Analytics | SysforeHadoop and Big Data Analytics | Sysfore
Hadoop and Big Data Analytics | Sysfore
 
Introduction to Information Storage.pptx
Introduction to Information Storage.pptxIntroduction to Information Storage.pptx
Introduction to Information Storage.pptx
 
A Survey on Data Mapping Strategy for data stored in the storage cloud 111
A Survey on Data Mapping Strategy for data stored in the storage cloud  111A Survey on Data Mapping Strategy for data stored in the storage cloud  111
A Survey on Data Mapping Strategy for data stored in the storage cloud 111
 
Data Modernization_Harinath Susairaj.pptx
Data Modernization_Harinath Susairaj.pptxData Modernization_Harinath Susairaj.pptx
Data Modernization_Harinath Susairaj.pptx
 
CS8091_BDA_Unit_IV_Stream_Computing
CS8091_BDA_Unit_IV_Stream_ComputingCS8091_BDA_Unit_IV_Stream_Computing
CS8091_BDA_Unit_IV_Stream_Computing
 
My aws cp study (autosaved)
My aws cp study (autosaved)My aws cp study (autosaved)
My aws cp study (autosaved)
 
Sdn in big data
Sdn in big dataSdn in big data
Sdn in big data
 
IaaS - Infrastructure as a Service
IaaS - Infrastructure as a ServiceIaaS - Infrastructure as a Service
IaaS - Infrastructure as a Service
 
Case study 9
Case study 9Case study 9
Case study 9
 
Data-Intensive Technologies for Cloud Computing
Data-Intensive Technologies for CloudComputingData-Intensive Technologies for CloudComputing
Data-Intensive Technologies for Cloud Computing
 

More from Robert Bira

Une nouvelle plateforme de donnée pour vos applications transactionnelles et ...
Une nouvelle plateforme de donnée pour vos applications transactionnelles et ...Une nouvelle plateforme de donnée pour vos applications transactionnelles et ...
Une nouvelle plateforme de donnée pour vos applications transactionnelles et ...Robert Bira
 
InterSystems IRIS Data Platform : Machine learning on the way
InterSystems IRIS Data Platform : Machine learning on the wayInterSystems IRIS Data Platform : Machine learning on the way
InterSystems IRIS Data Platform : Machine learning on the wayRobert Bira
 
InterSystems brochure
InterSystems brochureInterSystems brochure
InterSystems brochureRobert Bira
 
Infographie Retail
Infographie RetailInfographie Retail
Infographie RetailRobert Bira
 
IDC Executive Brief
IDC Executive Brief IDC Executive Brief
IDC Executive Brief Robert Bira
 
Utiliser InterSystems IRIS pour permettre la transformation digitale du Retail
Utiliser InterSystems IRIS pour permettre la transformation digitale du RetailUtiliser InterSystems IRIS pour permettre la transformation digitale du Retail
Utiliser InterSystems IRIS pour permettre la transformation digitale du RetailRobert Bira
 
Transformer le secteur du Retail grâce à une meilleure utilisation de l'IT
Transformer le secteur du Retail grâce à une meilleure utilisation de l'IT Transformer le secteur du Retail grâce à une meilleure utilisation de l'IT
Transformer le secteur du Retail grâce à une meilleure utilisation de l'IT Robert Bira
 
The valule of Multi-model Databases
The valule of Multi-model DatabasesThe valule of Multi-model Databases
The valule of Multi-model DatabasesRobert Bira
 
Idc info brief-choosing_dbms_to_address_challenges_of_the_third-platform
Idc info brief-choosing_dbms_to_address_challenges_of_the_third-platformIdc info brief-choosing_dbms_to_address_challenges_of_the_third-platform
Idc info brief-choosing_dbms_to_address_challenges_of_the_third-platformRobert Bira
 
Building Smarter, Faster, and Scalable Data-Rich Application
Building Smarter, Faster, and Scalable Data-Rich ApplicationBuilding Smarter, Faster, and Scalable Data-Rich Application
Building Smarter, Faster, and Scalable Data-Rich ApplicationRobert Bira
 
Power behind what_matters-inter_systems
Power behind what_matters-inter_systemsPower behind what_matters-inter_systems
Power behind what_matters-inter_systemsRobert Bira
 
Qui dirige la transformation numérique
Qui dirige la transformation numériqueQui dirige la transformation numérique
Qui dirige la transformation numériqueRobert Bira
 
Gartner Magic Quadrant for Operational Database Management Systems
Gartner Magic Quadrant for Operational Database Management SystemsGartner Magic Quadrant for Operational Database Management Systems
Gartner Magic Quadrant for Operational Database Management SystemsRobert Bira
 
Data platform map
Data platform mapData platform map
Data platform mapRobert Bira
 
InterSystems advanced data technology
InterSystems advanced data technologyInterSystems advanced data technology
InterSystems advanced data technologyRobert Bira
 
InterSystems Caché a leader in Gartner MQ on Operational DBMS
InterSystems Caché a leader in Gartner MQ on Operational DBMSInterSystems Caché a leader in Gartner MQ on Operational DBMS
InterSystems Caché a leader in Gartner MQ on Operational DBMSRobert Bira
 

More from Robert Bira (16)

Une nouvelle plateforme de donnée pour vos applications transactionnelles et ...
Une nouvelle plateforme de donnée pour vos applications transactionnelles et ...Une nouvelle plateforme de donnée pour vos applications transactionnelles et ...
Une nouvelle plateforme de donnée pour vos applications transactionnelles et ...
 
InterSystems IRIS Data Platform : Machine learning on the way
InterSystems IRIS Data Platform : Machine learning on the wayInterSystems IRIS Data Platform : Machine learning on the way
InterSystems IRIS Data Platform : Machine learning on the way
 
InterSystems brochure
InterSystems brochureInterSystems brochure
InterSystems brochure
 
Infographie Retail
Infographie RetailInfographie Retail
Infographie Retail
 
IDC Executive Brief
IDC Executive Brief IDC Executive Brief
IDC Executive Brief
 
Utiliser InterSystems IRIS pour permettre la transformation digitale du Retail
Utiliser InterSystems IRIS pour permettre la transformation digitale du RetailUtiliser InterSystems IRIS pour permettre la transformation digitale du Retail
Utiliser InterSystems IRIS pour permettre la transformation digitale du Retail
 
Transformer le secteur du Retail grâce à une meilleure utilisation de l'IT
Transformer le secteur du Retail grâce à une meilleure utilisation de l'IT Transformer le secteur du Retail grâce à une meilleure utilisation de l'IT
Transformer le secteur du Retail grâce à une meilleure utilisation de l'IT
 
The valule of Multi-model Databases
The valule of Multi-model DatabasesThe valule of Multi-model Databases
The valule of Multi-model Databases
 
Idc info brief-choosing_dbms_to_address_challenges_of_the_third-platform
Idc info brief-choosing_dbms_to_address_challenges_of_the_third-platformIdc info brief-choosing_dbms_to_address_challenges_of_the_third-platform
Idc info brief-choosing_dbms_to_address_challenges_of_the_third-platform
 
Building Smarter, Faster, and Scalable Data-Rich Application
Building Smarter, Faster, and Scalable Data-Rich ApplicationBuilding Smarter, Faster, and Scalable Data-Rich Application
Building Smarter, Faster, and Scalable Data-Rich Application
 
Power behind what_matters-inter_systems
Power behind what_matters-inter_systemsPower behind what_matters-inter_systems
Power behind what_matters-inter_systems
 
Qui dirige la transformation numérique
Qui dirige la transformation numériqueQui dirige la transformation numérique
Qui dirige la transformation numérique
 
Gartner Magic Quadrant for Operational Database Management Systems
Gartner Magic Quadrant for Operational Database Management SystemsGartner Magic Quadrant for Operational Database Management Systems
Gartner Magic Quadrant for Operational Database Management Systems
 
Data platform map
Data platform mapData platform map
Data platform map
 
InterSystems advanced data technology
InterSystems advanced data technologyInterSystems advanced data technology
InterSystems advanced data technology
 
InterSystems Caché a leader in Gartner MQ on Operational DBMS
InterSystems Caché a leader in Gartner MQ on Operational DBMSInterSystems Caché a leader in Gartner MQ on Operational DBMS
InterSystems Caché a leader in Gartner MQ on Operational DBMS
 

Recently uploaded

Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyFrank van der Linden
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about usDynamic Netsoft
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 

Recently uploaded (20)

Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The Ugly
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about us
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 

Massive sacalabilitty with InterSystems IRIS Data Platform

  • 1. Massive Scalability With InterSystems IRIS Data Platform Technology Brief Introduction Faced with the enormous and ever-growing amounts of data being generated in the world today, software architects need to pay special attention to the scalability of their solutions. They must also design systems that can, when needed, handle many thousands of concurrent users. It’s not easy, but designing for massive scalability is an absolute necessity. Software architects have several options for designing scalable systems. They can scale vertically by using bigger machines with dozens of cores. They can use data distribution (replication) techniques to scale horizontally for increasing numbers of users. And they can scale data volume horizontally through the use of a data partitioning strategy. In practice, software architects will employ several of these techniques, trading off hardware costs, code complexity, and ease of deployment to suit their particular needs. This paper will discuss how InterSystems IRIS Data Platform™ supports vertical scalability and horizontal scalability of both user and data volumes. It will outline several options for distributing and partitioning data and/or user volume, giving scenarios in which each option would be particularly useful. Finally, this paper will talk about how InterSystems IRIS helps simplify the configuration and provisioning of distributed systems. Vertical Scaling Perhaps the simplest way to scale is to do so “vertically” — deploy on a bigger machine with more CPUs and memory. InterSystems IRIS supports parallel processing of SQL and includes technology for optimizing the use of CPUs in multi-core machines. However, there are practical limits to what can be achieved through vertical scaling alone. For one thing, even the largest available machine may not be able to handle the enormous data volumes and workloads required by modern applications. Also, “big iron” can be prohibitively expensive. Many organizations find it more cost-effective to buy, say, four 16-core servers than one 64-core machine. Capacity planning for single-server architectures can be difficult, especially for solutions that are likely to have widely varying workloads. Having the ability to handle peak loads may result in wasteful underutilization during off hours. On the other hand, having too few cores may cause performance to slow to a crawl during periods of high usage. In addition, increasing the capacity of a single server architecture implies buying an entire new machine. Adding capacity “on the fly” is impossible. In short, although it is important for software to leverage the full potential of the hardware on which it is deployed, vertical scalability alone is not enough to meet all but the most static workloads.
  • 2. Massive Scalability With InterSystems IRIS Data Platform 2 Technology Brief Horizontal Scaling For all of the reasons given above, most organizations seeking massive scalability will deploy on networked systems, scaling workloads and/or data volumes “horizontally” by distributing the work across multiple servers. Typically, each server in the network will be an affordable machine, but larger servers may be used if needed to take advantage of vertical scalability as well. Software architects will recognize that no two workloads are the same. Some modern applications may be accessed by hundreds of thousands of users concurrently, racking up very high numbers of small transactions per second. Others may only have a handful of users, but query petabytes worth of data. Both are very demanding workloads, but they require different approaches to scalability. We will start by considering each scenario separately. Horizontal Scaling of User Volume To support a huge number of concurrent users (or transactions), InterSystems IRIS relies on unique data caching technology called Enterprise Cache Protocol (ECP). Within a network of servers, one will be configured as the data server where data is persisted. The others will be configured as application servers. Each application server runs an instance of InterSystems IRIS and presents data to the application as though it were a local database. Data is not persisted on the application servers. They are there to provide cache and CPU processing power. User sessions are distributed among the application servers, typically through a load balancer, and queries are satisfied from the local application server cache, if possible. Application servers will only retrieve data from the data server if necessary. InterSystems IRIS automatically synchronizes data between all cluster participants. With the compute work taken care of by the application servers, the data server can be dedicated mostly to persisting transaction outcomes. Application servers can easily be added to, or removed from, the cluster as workloads vary. For example, in a retail use case, you may want to add a few application servers to deal with the exceptional load of Black Friday and switch them off again after the holiday season has finished. Application servers are most useful for applications where large numbers of transactions must be performed, but each transaction only affects a relatively small portion of the entire data set. Deployments that use application servers with ECP have been shown to support many thousands of concurrent users in a variety of industries. Figure 1: Workload Distribution with Enterprise Cache Protocol cach data I intelligent InterSystems IRIS workload distribution cache cache code code cache data
  • 3. Massive Scalability With InterSystems IRIS Data Platform 3   Horizontal Scaling of Data Volume When queries — usually analytic queries — must access a large amount of data, the “working dataset” that needs to be cached in order to support the query workload efficiently may exceed the memory capacity on a single machine. InterSystems IRIS provides a capability called sharding, which physically partitions large database tables across multiple server instances. Applications still access a single logical table on an instance designated as the shard master. The shard master decomposes incoming queries and sends them to the shard servers, each of which holds a distinct portion of the table data and associated indices. The shard servers process the shard-local queries in parallel, and send their results back to the shard server for aggregation. Data is partitioned among shard servers according to a shard key, which can be automatically managed by the system, or defined by the software architect based on selected table columns. Through the careful selection of shard keys, tables that are often joined can be co-sharded, so rows from those tables that would typically be joined together are stored on the same shard server, enabling the join to happen entirely local to each shard server, and thus maximizing parallelization and performance. As data volumes grow, additional shards can easily be added. Sharding is completely transparent to the application and to users. Not all tables need to be sharded. For example, in analytics applications, facts tables (e.g. orders in a retail scenario) are usually very large, and will be sharded. The much smaller dimensions tables (e.g. product, point of sale, etc.) will not be. Non- sharded tables are persisted on the shard master. If a query requires joins between sharded and non-sharded tables, or if data from two different shards must be joined, InterSystems IRIS uses a highly efficient ECP-based mechanism to correctly and efficiently satisfy the request. In these cases, InterSystems IRIS will only share between shards the rows that are needed, rather than broadcasting entire tables over the network, as many other technologies would. InterSystems IRIS transparently improves the efficiency and performance of big data query workloads through sharding, without limiting the types of queries that can be satisfied. The InterSystems IRIS architecture enables complex multi-table joins when querying distributed, partitioned data sets without requiring co-sharding, without replicating data, and without requiring entire tables to be broadcast across networks. Figure 2: Sharding with Intelligent Inter-Shard Communication Technology Brief InterSystems IRIS intelligent inter-shard communication with workload distribution cache cache shard master cache cache data data rows Shard Master cache cache data data InterSystems IRIS intelligent inter-shard communication S on cache code rows code code
  • 4. Massive Scalability With InterSystems IRIS Data Platform 4 Scaling Both User and Data Volumes Many modern solutions must simultaneously support both a high transaction rate (user volume) and analytics on large volumes of data. One example: a private wealth management application that provides dashboards summarizing clients’ portfolios and risk, in real time based on current market data. InterSystems IRIS enables such Hybrid Transactional and Analytical Processing (HTAP) applications by allowing application servers and sharding to be used in combination. Application servers can be added to the architecture pictured in Figure 2 to distribute the workload on the shard master. Workloads and data volumes can be scaled independently of each other, depending on the needs of the application.   When applications require the ultimate in scalability (for example, if a predictive model must score every record in a large table while new records are being ingested and queried at the same time) each individual data shard can act as the data server in an ECP model. We refer to the application servers that share the workloads on data shards as “query shards.” This, combined with the transparent mechanisms for ensuring high availability of an InterSystems IRIS cluster, provides solution architects with everything they need to satisfy their solution’s unique scalability and reliability requirements. The comparative performance and efficiency of the InterSystems IRIS approach to sharding has been demonstrated and documented in a benchmark test validated by a major technology analyst firm. (Read their report here).1 In tests of an actual mission critical financial services use case, InterSystems IRIS was shown to be faster than several highly specialized databases, while requiring less hardware and accessing more data. Technology Brief Figure 3: Sharding and Workload Distribution
  • 5. Massive Scalability With InterSystems IRIS Data Platform 5 Technology Brief Flexible Deployment with InterSystems Cloud Manager InterSystems IRIS gives software developers a great deal of flexibility when it comes to designing a highly efficient, scalable solution. But scalability may come at the cost of increased complexity, as additional servers, taking on a variety of roles, are added to the architecture. To simplify the provisioning and deployment of servers (whether physical or virtual) in a distributed architecture, InterSystems IRIS uses InterSystems Cloud Manager. InterSystems Cloud Manager (which is included with InterSystems IRIS) enables simple scripts to be used for configuring InterSystems IRIS containers as a data server, shard server, shard master, application server, etc. Containers can be deployed easily in public or private clouds. They are also easily decommissioned, so scalable architectures can be designed to grow or contract with fluctuating needs. Conclusion Massive scalability is a requirement for modern applications, particularly Hybrid Transactional and Analytical Processing applications that must handle very high workloads and data volumes simultaneously. InterSystems IRIS Data Platform gives software architects options for cost-effectively scaling their applications. It supports vertical scaling, application servers for horizontally scaling user volume, and a highly efficient approach to sharding for horizontally scaling data volume that eliminates the need for network broadcasts. All these technologies can be used independently or in combination to tailor a scalable architecture to an application’s specific requirements. For more information about InterSystems IRIS Data Platform, visit InterSystems.com/IRIS.
  • 6. Massive Scalability With InterSystems IRIS Data Platform 6 About InterSystems InterSystems is the engine behind the world’s most important applications. In healthcare, finance, government, and other sectors where lives and livelihoods are at stake, InterSystems is the power behind what matters™. Founded in 1978, InterSystems is a privately held company headquartered in Cambridge, Massachusetts (USA), with offices worldwide, and its software products are used daily by millions of people in more than 80 countries. For more information, visit InterSystems.com. 1 InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight, Kerry Dolan, Senior IT Validation Analyst, Enterprise Strategy Group, March 2018.
  • 7. Massive Scalability With InterSystems IRIS Data Platform 8 InterSystems Corporation • One Memorial Drive • Cambridge, MA 02142 • InterSystems.com • Tel: +1.617.621.0600 © 2018 InterSystems Corporation. All rights reserved. InterSystems IRIS Data Platform is a trademark of InterSystems Corporation. 010518