SlideShare a Scribd company logo
1 of 21
Download to read offline
© 2015 ligaDATA, Inc. All Rights Reserved.
Powering Real-time
Decisioning for
Financial &
Healthcare using
Open Source 
August 2015
Community @ http://Kamanja.org
2
© 2015 ligaDATA, Inc. All Rights Reserved.
In ’14 the bank embarked
on transforming how they
leverage their data using
Open Source & Big Data
technologies.
3
© 2015 ligaDATA, Inc. All Rights Reserved.
To achieve this goal with

the bank we needed to:
1.  Create a framework 

to adopt Open Source
Software
2.  Need a catalyst to
attract and retain 

the talent
© 2015 ligaDATA, Inc. All Rights Reserved.
 4
Marissa Meyer of Yahoo won’t have to go in front of the senate to explain
why 100,000 records were lost – Barbara Desoer of CitiBank would.
What is different about Financial Services?
ü  Regulatory requirements requires 100% data protection
ü  Security & Data governance
ü  Auditability
ü  Lineage
ü  ZERO data loss
ü  Integration with legacy ecosystem
ü  Skillset
Open Source in Financial Services

Good enough for Internet companies isn't good enough!
© 2015 ligaDATA, Inc. All Rights Reserved.
 5
A modified “Crossing the Chasm” view for OSS
OSS – Adoption Chasm
Why Financial Services have not adopted OSS more aggressively?
Creators Contributors Users
Creators

Technology
Organizations, Rich
resources, Solving a
problem, Creating a
competitive advantage
Contributors

Technology
Organizations, taking
a risk while Solving a
problem 
Users
Lower Technology
Skillset, Low risk
tolerance, Solving a
problem
© 2015 ligaDATA, Inc. All Rights Reserved.
 6
Establish the BOSS framework for the consumption and contribution to open
source software (OSS) at scale in the Bank
.
Bank Open Source Software (BOSS)
Contribution to OSS by
enhancing existing open
source projects,
documentation, fixes,
enhancements
Initiation of a new OSS
project, championing and
facilitating OSS community
development and
consumption
Evaluation & Consumption of
OSS
Maturing Capability
Consumption
Contribution
Bank Current Focus
Step Change
Pioneering Target
BOSS optimises Consumption, enables Contribution and Creation
•  Input from stakeholders, internal and external influenced BOSS framework definition
•  OSS advisory board to steer and drive
•  Pre-approved licenses types per use case (consumption and contribution)
•  Invest in enabling technology, GitHub, Black Duck, Sonatype
•  No new governance steps, leverage and streamline existing controls instead of creating new ones
Creation
© 2015 ligaDATA, Inc. All Rights Reserved.
 7
BOSS framework is designed based on guidance and feedback received from key
representatives within the Bank and from leading open source contributors and fellow banks
.
Technology
Internal
External
BOSS – Collective Thought Process
Retail
Investment
Cards
Legal
Risk
Security
Sourcing
Business Units
Control
Functions
Data
Design
Infra
© 2015 ligaDATA, Inc. All Rights Reserved.
 8
Millennial developers …
•  Grew up using OSS
•  Unaware of Closed Source
software
•  Want to engage, share and
contribute
Real-time using Kamanja was selected
as a capability big enough, important
enough to build a Center of Excellence
around it.
Attracting and Retaining talent
© 2015 ligaDATA, Inc. All Rights Reserved.
 9
Individual
Events
Decisioning,
Detection
In-context 

and online
Cross section

of events
Analytics, 

MI
Offline, 

Longer cycle
Deriving Decisions

from Big Data
BATCH
 REAL-TIME
© 2015 ligaDATA, Inc. All Rights Reserved.
 10
customer-centric product design
require Real-time decisions
Triggers
Scoring
Notifications
Alerts
Transactional Updates
Deriving an
Opportunity or Threat 
E N D - T O - E N D C A P A B I L I T Y 
Tracking & Analyzing
(processing)
Streams of Information

(real-time) 
About Things That
Happen (events)



Actions
Real-time

Decisions
11
© 2015 ligaDATA, Inc. All Rights Reserved.
LigaDATA introduced Kamanja – 

an open source real-time decisioning project,
hardened for Financial Services & Healthcare requirements and
scalable to IoT level data volumes enabling low latency use
cases.
Customer 

churn/

retention
Risk
Analysis
Customer
Contact
Cyber
Crime
Fraud
Security &
Compliance
Audit &
Governance
U S E C A S E S 
Marketing
Telephony
Interception
Real-Time
Offer
12
© 2015 ligaDATA, Inc. All Rights Reserved.
Uses of 

Real-Time Decisioning
Complex Event Processing (CEP)
•  A few to possibly 100’s of concurrent data streams
•  Apply rule logic, select, aggregate
•  Decide action on elements in stream
Enterprise Applications, During …
•  customer call or chat: recommendations to improve service
•  card transaction: offer credit increase
•  web application: pre-approval
•  web transaction: recommend other product(s)
13
© 2015 ligaDATA, Inc. All Rights Reserved.
Case Study of a Modeling Department
Monitor $80B of consumer bank transactions / year to detect
fraud (between 1,400 banks)
PAIN POINT:
~2 months to deploy
(model group was different from deployment group)
INDUSTRY REVIEW to answer:
•  How common is it to use many algorithms or tools in a project?
•  What is an easier way to deploy models?
14
© 2015 ligaDATA, Inc. All Rights Reserved.
http://www.kdnuggets.com/2015/06/data-mining-data-science-tools-associations.html
Independent use of tools
15
© 2015 ligaDATA, Inc. All Rights Reserved.
http://www.kdnuggets.com/2015/06/data-mining-data-science-tools-associations.html
Tools used in combination
16
© 2015 ligaDATA, Inc. All Rights Reserved.
Scoring
Engine
(Kamanja)
PMML Diagram

Predictive Modeling Markup Language
Training & test data
(batch)
Data
Mining
Tool File, Save As
PMML
PMML
File
PMML
Producer
PMML
FileScoring data
(real time streaming)
Output data has
new score field
Training Project Phase
Production Scoring Project Phase
Full model
specification
PMML Consumer
17
© 2015 ligaDATA, Inc. All Rights Reserved.
Given industry fragmentation,
PMML is a solution
PMML Producers (18 companies)
•  R (Rattle, PMML)
•  RapidMiner
•  KNIME
PMML Consumers (12 co)
•  Zementis
•  SAS
•  IBM SPSS
•  KNIME
•  Microstrategy
•  Kamanja
•  JPMML
•  Spark (MLlib) (Open Source)
•  Weka
•  SAS Enterprise Miner
PREDICTIVE
Naïve Bayes
Neural Net
Regression
Rules
Scorecard
Sequence
SVM
Time Series
Trees
DESCRIPTIVE / OTH
Association Rules
Cluster, K-Nearest Nb
Text Models
model ensembles &
composition
(i.e. Gradient Boosting)
© 2015 ligaDATA, Inc. All Rights Reserved.
 18
Real Time Computing
OSS Technology Stack 
Integration with Kamanja
Kamanja
(PMML/Java/Scala Consumer)
High level languages / abstractions
Compute
Fabric
Cloud, EC2
Internal Cloud
Security
Kerberos
Real Time
Streaming
Kafka,
MQ
Spark*
ligaDATA
Data Store
HBase,
Cassandra,
InfluxDB
HDFS
(Create
adaptors to
integrate
others)
Resource
Management
Zookeeper,
Yarn*,
Mesos*
High Level Languages /
Abstractions
MLlib* (PMML Producers)
© 2015 ligaDATA, Inc. All Rights Reserved.
 19
Performance

Characteristics
© 2015 ligaDATA, Inc. All Rights Reserved.
 19
Performance
•  Throughput of million messages/second
•  Uses commodity hardware
Scalability
•  Linear scalability -- horizontally
•  Data partitioning support
•  Runtime multi-model optimizations to
support thousands of models
•  Consistent performance on hundreds of
models and thousands of rules
Built for IoT
data volumes
© 2015 ligaDATA, Inc. All Rights Reserved.
 20
•  Clinicians (knowledge experts) develop heuristic based rule set models
•  The initial model was COPD (Chronic Obstructive Pulmonary Disease) risk
assessment
•  Support of referenced Beneficiary, HL7, Inpatient Claim, and Outpatient Claim
•  Models are expressed with a domain specific language (DSL) they developed
•  DSL models are transformed to PMML for Kamanja
•  Models consume current + prior related messages over “look back period”
Save the “assertions” of a patient in the database (beyond standard PMML)
“State” can evolve over time
•  The “Medical Company” plans to integrate the DSL with their ontology data
modeling effort
•  Goal is to generate new models as their “medical world” ontology evolves
Medical Company use
of Kamanja
© 2015 ligaDATA, Inc. All Rights Reserved.
Try out
© 2015 ligaDATA, Inc. All Rights Reserved.
CONFIDENTIAL 
Community @ http://Kamanja.org

More Related Content

What's hot

Open Source Data Management for Industry 4.0
Open Source Data Management for Industry 4.0Open Source Data Management for Industry 4.0
Open Source Data Management for Industry 4.0
DataWorks Summit
 
"Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr...
"Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr..."Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr...
"Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr...
Dataconomy Media
 

What's hot (20)

Transform Banking with Big Data and Automated Machine Learning 9.12.17
Transform Banking with Big Data and Automated Machine Learning 9.12.17Transform Banking with Big Data and Automated Machine Learning 9.12.17
Transform Banking with Big Data and Automated Machine Learning 9.12.17
 
Getting Your Supply Chain Back on Track with AI
Getting Your Supply Chain Back on Track with AIGetting Your Supply Chain Back on Track with AI
Getting Your Supply Chain Back on Track with AI
 
How to Apply Machine Learning with R, H20, Apache Spark MLlib or PMML to Real...
How to Apply Machine Learning with R, H20, Apache Spark MLlib or PMML to Real...How to Apply Machine Learning with R, H20, Apache Spark MLlib or PMML to Real...
How to Apply Machine Learning with R, H20, Apache Spark MLlib or PMML to Real...
 
Engineering with Open Source - Hyonjee Joo
Engineering with Open Source - Hyonjee JooEngineering with Open Source - Hyonjee Joo
Engineering with Open Source - Hyonjee Joo
 
Msst 2019 v4
Msst 2019 v4Msst 2019 v4
Msst 2019 v4
 
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache SparkPolymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
 
Digital Shift in Insurance: How is the Industry Responding with the Influx of...
Digital Shift in Insurance: How is the Industry Responding with the Influx of...Digital Shift in Insurance: How is the Industry Responding with the Influx of...
Digital Shift in Insurance: How is the Industry Responding with the Influx of...
 
Smooth Storage - A distributed storage system for managing structured time se...
Smooth Storage - A distributed storage system for managing structured time se...Smooth Storage - A distributed storage system for managing structured time se...
Smooth Storage - A distributed storage system for managing structured time se...
 
Mastering MapReduce: MapReduce for Big Data Management and Analysis
Mastering MapReduce: MapReduce for Big Data Management and AnalysisMastering MapReduce: MapReduce for Big Data Management and Analysis
Mastering MapReduce: MapReduce for Big Data Management and Analysis
 
Open Source Data Management for Industry 4.0
Open Source Data Management for Industry 4.0Open Source Data Management for Industry 4.0
Open Source Data Management for Industry 4.0
 
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
 
IoT-Enabled Predictive Maintenance
IoT-Enabled Predictive MaintenanceIoT-Enabled Predictive Maintenance
IoT-Enabled Predictive Maintenance
 
Ibm big data
Ibm big dataIbm big data
Ibm big data
 
Maximizing Business Value: Optimizing Technology Investment
Maximizing Business Value: Optimizing Technology InvestmentMaximizing Business Value: Optimizing Technology Investment
Maximizing Business Value: Optimizing Technology Investment
 
"Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr...
"Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr..."Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr...
"Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr...
 
Pivotal Digital Transformation Forum: Data Science
Pivotal Digital Transformation Forum: Data Science Pivotal Digital Transformation Forum: Data Science
Pivotal Digital Transformation Forum: Data Science
 
The Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyThe Five Markers on Your Big Data Journey
The Five Markers on Your Big Data Journey
 
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
 
Digital Decisioning for the New Decade - 2020 and Beyond
Digital Decisioning for the New Decade - 2020 and BeyondDigital Decisioning for the New Decade - 2020 and Beyond
Digital Decisioning for the New Decade - 2020 and Beyond
 

Viewers also liked

Viewers also liked (15)

Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
 
Modern Big Data Analytics Tools: An Overview
Modern Big Data Analytics Tools: An OverviewModern Big Data Analytics Tools: An Overview
Modern Big Data Analytics Tools: An Overview
 
Linked In Slides 2009 02 24 B
Linked In Slides 2009 02 24 BLinked In Slides 2009 02 24 B
Linked In Slides 2009 02 24 B
 
SFbayACM ACM Data Science Camp 2015 10 24
SFbayACM ACM Data Science Camp 2015 10 24SFbayACM ACM Data Science Camp 2015 10 24
SFbayACM ACM Data Science Camp 2015 10 24
 
The 360º Leader (Section 2 of 6)
The 360º Leader (Section 2 of 6)The 360º Leader (Section 2 of 6)
The 360º Leader (Section 2 of 6)
 
Big Data: tools and techniques for working with large data sets
Big Data: tools and techniques for working with large data setsBig Data: tools and techniques for working with large data sets
Big Data: tools and techniques for working with large data sets
 
Heuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient searchHeuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient search
 
The 360º Leader (Section 1 of 6)
The 360º Leader (Section 1 of 6)The 360º Leader (Section 1 of 6)
The 360º Leader (Section 1 of 6)
 
Using Deep Learning to do Real-Time Scoring in Practical Applications
Using Deep Learning to do Real-Time Scoring in Practical ApplicationsUsing Deep Learning to do Real-Time Scoring in Practical Applications
Using Deep Learning to do Real-Time Scoring in Practical Applications
 
Three case studies deploying cluster analysis
Three case studies deploying cluster analysisThree case studies deploying cluster analysis
Three case studies deploying cluster analysis
 
K-Means, its Variants and its Applications
K-Means, its Variants and its ApplicationsK-Means, its Variants and its Applications
K-Means, its Variants and its Applications
 
Application of Clustering in Data Science using Real-life Examples
Application of Clustering in Data Science using Real-life Examples Application of Clustering in Data Science using Real-life Examples
Application of Clustering in Data Science using Real-life Examples
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 
Cluster Analysis for Dummies
Cluster Analysis for DummiesCluster Analysis for Dummies
Cluster Analysis for Dummies
 
Cluster analysis for market segmentation
Cluster analysis for market segmentationCluster analysis for market segmentation
Cluster analysis for market segmentation
 

Similar to Powering Real­time Decision Engines in Finance and Healthcare using Open Source Software

THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0
THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0
THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0
Srini Alavala
 
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-indiaArtificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
Capgemini
 
Big data for product managers
Big data for product managersBig data for product managers
Big data for product managers
AIPMM Administration
 
[Webinar] - Using RPA to Accelerate the Benefits from Shared Services
[Webinar] - Using RPA to Accelerate the Benefits from Shared Services[Webinar] - Using RPA to Accelerate the Benefits from Shared Services
[Webinar] - Using RPA to Accelerate the Benefits from Shared Services
JK Tech
 

Similar to Powering Real­time Decision Engines in Finance and Healthcare using Open Source Software (20)

Stop the Blame Game with Increased Visibility of your Mobile-to-Mainframe IT ...
Stop the Blame Game with Increased Visibility of your Mobile-to-Mainframe IT ...Stop the Blame Game with Increased Visibility of your Mobile-to-Mainframe IT ...
Stop the Blame Game with Increased Visibility of your Mobile-to-Mainframe IT ...
 
There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?
 
Guiding Principles for the Low Code Revolution – Intuit QuickBase EMPOWER2015...
Guiding Principles for the Low Code Revolution – Intuit QuickBase EMPOWER2015...Guiding Principles for the Low Code Revolution – Intuit QuickBase EMPOWER2015...
Guiding Principles for the Low Code Revolution – Intuit QuickBase EMPOWER2015...
 
SOUG Day - autonomous what is next
SOUG Day - autonomous what is nextSOUG Day - autonomous what is next
SOUG Day - autonomous what is next
 
Big data for Telco: opportunity or threat?
Big data for Telco: opportunity or threat?Big data for Telco: opportunity or threat?
Big data for Telco: opportunity or threat?
 
THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0
THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0
THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0
 
Digital Reinvention by NRB
Digital Reinvention by NRBDigital Reinvention by NRB
Digital Reinvention by NRB
 
Incorporating cloud computing for enhanced communication v2
Incorporating cloud computing for enhanced communication v2Incorporating cloud computing for enhanced communication v2
Incorporating cloud computing for enhanced communication v2
 
The Data Axioms lecture-overview-big data-usama-9-2015
The Data Axioms lecture-overview-big data-usama-9-2015The Data Axioms lecture-overview-big data-usama-9-2015
The Data Axioms lecture-overview-big data-usama-9-2015
 
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-indiaArtificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
 
Vertica Analytics Database general overview
Vertica Analytics Database general overviewVertica Analytics Database general overview
Vertica Analytics Database general overview
 
Big data for product managers
Big data for product managersBig data for product managers
Big data for product managers
 
MODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service SelectionMODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service Selection
 
MODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service SelectionMODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service Selection
 
NoOps in a Serverless World
NoOps in a Serverless WorldNoOps in a Serverless World
NoOps in a Serverless World
 
[Webinar] - Using RPA to Accelerate the Benefits from Shared Services
[Webinar] - Using RPA to Accelerate the Benefits from Shared Services[Webinar] - Using RPA to Accelerate the Benefits from Shared Services
[Webinar] - Using RPA to Accelerate the Benefits from Shared Services
 
Tdwi march 2015 presentation
Tdwi march 2015 presentationTdwi march 2015 presentation
Tdwi march 2015 presentation
 
Deliver New Customer Experiences Through AI-enabled Chatbots
 Deliver New Customer Experiences Through AI-enabled Chatbots Deliver New Customer Experiences Through AI-enabled Chatbots
Deliver New Customer Experiences Through AI-enabled Chatbots
 
BIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in FinanceBIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in Finance
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
 

More from Greg Makowski

Understanding Hallucinations in LLMs - 2023 09 29.pptx
Understanding Hallucinations in LLMs - 2023 09 29.pptxUnderstanding Hallucinations in LLMs - 2023 09 29.pptx
Understanding Hallucinations in LLMs - 2023 09 29.pptx
Greg Makowski
 

More from Greg Makowski (6)

Understanding Hallucinations in LLMs - 2023 09 29.pptx
Understanding Hallucinations in LLMs - 2023 09 29.pptxUnderstanding Hallucinations in LLMs - 2023 09 29.pptx
Understanding Hallucinations in LLMs - 2023 09 29.pptx
 
Future of AI - 2023 07 25.pptx
Future of AI - 2023 07 25.pptxFuture of AI - 2023 07 25.pptx
Future of AI - 2023 07 25.pptx
 
A Successful Hiring Process for Data Scientists
A Successful Hiring Process for Data ScientistsA Successful Hiring Process for Data Scientists
A Successful Hiring Process for Data Scientists
 
Kdd 2019: Standardizing Data Science to Help Hiring
Kdd 2019:  Standardizing Data Science to Help HiringKdd 2019:  Standardizing Data Science to Help Hiring
Kdd 2019: Standardizing Data Science to Help Hiring
 
Tales from an ip worker in consulting and software
Tales from an ip worker in consulting and softwareTales from an ip worker in consulting and software
Tales from an ip worker in consulting and software
 
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
 

Recently uploaded

Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 

Recently uploaded (20)

Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 

Powering Real­time Decision Engines in Finance and Healthcare using Open Source Software

  • 1. © 2015 ligaDATA, Inc. All Rights Reserved. Powering Real-time Decisioning for Financial & Healthcare using Open Source August 2015 Community @ http://Kamanja.org
  • 2. 2 © 2015 ligaDATA, Inc. All Rights Reserved. In ’14 the bank embarked on transforming how they leverage their data using Open Source & Big Data technologies.
  • 3. 3 © 2015 ligaDATA, Inc. All Rights Reserved. To achieve this goal with
 the bank we needed to: 1.  Create a framework 
 to adopt Open Source Software 2.  Need a catalyst to attract and retain 
 the talent
  • 4. © 2015 ligaDATA, Inc. All Rights Reserved. 4 Marissa Meyer of Yahoo won’t have to go in front of the senate to explain why 100,000 records were lost – Barbara Desoer of CitiBank would. What is different about Financial Services? ü  Regulatory requirements requires 100% data protection ü  Security & Data governance ü  Auditability ü  Lineage ü  ZERO data loss ü  Integration with legacy ecosystem ü  Skillset Open Source in Financial Services
 Good enough for Internet companies isn't good enough!
  • 5. © 2015 ligaDATA, Inc. All Rights Reserved. 5 A modified “Crossing the Chasm” view for OSS OSS – Adoption Chasm Why Financial Services have not adopted OSS more aggressively? Creators Contributors Users Creators
 Technology Organizations, Rich resources, Solving a problem, Creating a competitive advantage Contributors
 Technology Organizations, taking a risk while Solving a problem Users Lower Technology Skillset, Low risk tolerance, Solving a problem
  • 6. © 2015 ligaDATA, Inc. All Rights Reserved. 6 Establish the BOSS framework for the consumption and contribution to open source software (OSS) at scale in the Bank . Bank Open Source Software (BOSS) Contribution to OSS by enhancing existing open source projects, documentation, fixes, enhancements Initiation of a new OSS project, championing and facilitating OSS community development and consumption Evaluation & Consumption of OSS Maturing Capability Consumption Contribution Bank Current Focus Step Change Pioneering Target BOSS optimises Consumption, enables Contribution and Creation •  Input from stakeholders, internal and external influenced BOSS framework definition •  OSS advisory board to steer and drive •  Pre-approved licenses types per use case (consumption and contribution) •  Invest in enabling technology, GitHub, Black Duck, Sonatype •  No new governance steps, leverage and streamline existing controls instead of creating new ones Creation
  • 7. © 2015 ligaDATA, Inc. All Rights Reserved. 7 BOSS framework is designed based on guidance and feedback received from key representatives within the Bank and from leading open source contributors and fellow banks . Technology Internal External BOSS – Collective Thought Process Retail Investment Cards Legal Risk Security Sourcing Business Units Control Functions Data Design Infra
  • 8. © 2015 ligaDATA, Inc. All Rights Reserved. 8 Millennial developers … •  Grew up using OSS •  Unaware of Closed Source software •  Want to engage, share and contribute Real-time using Kamanja was selected as a capability big enough, important enough to build a Center of Excellence around it. Attracting and Retaining talent
  • 9. © 2015 ligaDATA, Inc. All Rights Reserved. 9 Individual Events Decisioning, Detection In-context 
 and online Cross section
 of events Analytics, 
 MI Offline, 
 Longer cycle Deriving Decisions
 from Big Data BATCH REAL-TIME
  • 10. © 2015 ligaDATA, Inc. All Rights Reserved. 10 customer-centric product design require Real-time decisions Triggers Scoring Notifications Alerts Transactional Updates Deriving an Opportunity or Threat E N D - T O - E N D C A P A B I L I T Y Tracking & Analyzing (processing) Streams of Information
 (real-time) About Things That Happen (events) Actions Real-time
 Decisions
  • 11. 11 © 2015 ligaDATA, Inc. All Rights Reserved. LigaDATA introduced Kamanja – 
 an open source real-time decisioning project, hardened for Financial Services & Healthcare requirements and scalable to IoT level data volumes enabling low latency use cases. Customer 
 churn/
 retention Risk Analysis Customer Contact Cyber Crime Fraud Security & Compliance Audit & Governance U S E C A S E S Marketing Telephony Interception Real-Time Offer
  • 12. 12 © 2015 ligaDATA, Inc. All Rights Reserved. Uses of 
 Real-Time Decisioning Complex Event Processing (CEP) •  A few to possibly 100’s of concurrent data streams •  Apply rule logic, select, aggregate •  Decide action on elements in stream Enterprise Applications, During … •  customer call or chat: recommendations to improve service •  card transaction: offer credit increase •  web application: pre-approval •  web transaction: recommend other product(s)
  • 13. 13 © 2015 ligaDATA, Inc. All Rights Reserved. Case Study of a Modeling Department Monitor $80B of consumer bank transactions / year to detect fraud (between 1,400 banks) PAIN POINT: ~2 months to deploy (model group was different from deployment group) INDUSTRY REVIEW to answer: •  How common is it to use many algorithms or tools in a project? •  What is an easier way to deploy models?
  • 14. 14 © 2015 ligaDATA, Inc. All Rights Reserved. http://www.kdnuggets.com/2015/06/data-mining-data-science-tools-associations.html Independent use of tools
  • 15. 15 © 2015 ligaDATA, Inc. All Rights Reserved. http://www.kdnuggets.com/2015/06/data-mining-data-science-tools-associations.html Tools used in combination
  • 16. 16 © 2015 ligaDATA, Inc. All Rights Reserved. Scoring Engine (Kamanja) PMML Diagram
 Predictive Modeling Markup Language Training & test data (batch) Data Mining Tool File, Save As PMML PMML File PMML Producer PMML FileScoring data (real time streaming) Output data has new score field Training Project Phase Production Scoring Project Phase Full model specification PMML Consumer
  • 17. 17 © 2015 ligaDATA, Inc. All Rights Reserved. Given industry fragmentation, PMML is a solution PMML Producers (18 companies) •  R (Rattle, PMML) •  RapidMiner •  KNIME PMML Consumers (12 co) •  Zementis •  SAS •  IBM SPSS •  KNIME •  Microstrategy •  Kamanja •  JPMML •  Spark (MLlib) (Open Source) •  Weka •  SAS Enterprise Miner PREDICTIVE Naïve Bayes Neural Net Regression Rules Scorecard Sequence SVM Time Series Trees DESCRIPTIVE / OTH Association Rules Cluster, K-Nearest Nb Text Models model ensembles & composition (i.e. Gradient Boosting)
  • 18. © 2015 ligaDATA, Inc. All Rights Reserved. 18 Real Time Computing OSS Technology Stack Integration with Kamanja Kamanja (PMML/Java/Scala Consumer) High level languages / abstractions Compute Fabric Cloud, EC2 Internal Cloud Security Kerberos Real Time Streaming Kafka, MQ Spark* ligaDATA Data Store HBase, Cassandra, InfluxDB HDFS (Create adaptors to integrate others) Resource Management Zookeeper, Yarn*, Mesos* High Level Languages / Abstractions MLlib* (PMML Producers)
  • 19. © 2015 ligaDATA, Inc. All Rights Reserved. 19 Performance
 Characteristics © 2015 ligaDATA, Inc. All Rights Reserved. 19 Performance •  Throughput of million messages/second •  Uses commodity hardware Scalability •  Linear scalability -- horizontally •  Data partitioning support •  Runtime multi-model optimizations to support thousands of models •  Consistent performance on hundreds of models and thousands of rules Built for IoT data volumes
  • 20. © 2015 ligaDATA, Inc. All Rights Reserved. 20 •  Clinicians (knowledge experts) develop heuristic based rule set models •  The initial model was COPD (Chronic Obstructive Pulmonary Disease) risk assessment •  Support of referenced Beneficiary, HL7, Inpatient Claim, and Outpatient Claim •  Models are expressed with a domain specific language (DSL) they developed •  DSL models are transformed to PMML for Kamanja •  Models consume current + prior related messages over “look back period” Save the “assertions” of a patient in the database (beyond standard PMML) “State” can evolve over time •  The “Medical Company” plans to integrate the DSL with their ontology data modeling effort •  Goal is to generate new models as their “medical world” ontology evolves Medical Company use of Kamanja
  • 21. © 2015 ligaDATA, Inc. All Rights Reserved. Try out © 2015 ligaDATA, Inc. All Rights Reserved. CONFIDENTIAL Community @ http://Kamanja.org