SlideShare a Scribd company logo
Szabolcs Rozsnyai
July 2012




Business Process Insight
An Approach and Platform for the Discovery and Analysis of End-to-End Business
Processes




Szabolcs Rozsnyai, Geetika T. Lakshmanan, Vinod Muthusamy, Rania Khalaf and
Matthew J. Duftler
                                                                              © 2009 IBM Corporation
IBM Presentation Template Full Version


Agenda



 Introduction and Motivation
 BPI Life-Cycle
 Architecture
 Research Challenges
 Conclusion and Future Work




Source: If applicable, describe source origin

2                                               © 2009 IBM Corporation
Introduction and Motivation 1/2
       Understanding, managing and improving business processes in complex environments proves
        to be a significant challenge and has a severe impact on the organizations process maturity




    Organizational Challenges                                      Technical Challenges
    •Business processes                                            • Processes are not coordinated by one entity
       • can stretch across complex organizational silos and in    • Systems are loosely coupled, heterogeneous and distributed
         many cases even extends to customers                      • Business Process artifacts range from simple record entries to
       • are not necessarily complete or accurate                    complex events at various granularity levels
       • are heavily human-driven, require a lot of knowledge      • Business Process activities might be represented through
         and have a large number of exceptions                       multiple events
       • Are simplified to preserve a high degree of freedom              • Sometimes workflow engines emit events to mark the
       • Are often in the heads of individuals, groups or buried            start and end of an activity
         in application logic




3                                                                                                                   © 2009 IBM Corporation
Introduction and Motivation 2/2
 We propose a system to enable Process Intelligence from two perspectives
   – Analytics on historical data
        • to understand what, how, who and why aspects of end-to-end business process based on real-time and
          historical data
        • identify root causes of problems,
        • understand process deficiencies and
        • provides means to improve process performance

    – Analytics on real-time data
        • to increase the effectiveness of business operations, and managing operational risk
        • to identify and predict situations in order to react on them



             BPI platform is a software as a service (SaaS) enabled, collaborative system that realizes the end-to-end
                                                            BPI life-cycle.


                                      Process Intelligence

                                           BI        BAM          CEP         BPM


                                                Process Mining

       The platform allows users to manage a variety of data at different levels of granularity including raw captured events,
                             correlated instance traces, mined process models, and prediction alerts.


4                                                                                                                     © 2009 IBM Corporation
BPI Life-Cycle




5                © 2009 IBM Corporation
Architecture Overview




6                       © 2009 IBM Corporation
Architecture – Data Management



                                 • Volume and the complexity makes tracking and processing a
                                   difficult and resource intensive task

                                 • As data grows at a very high rate, tracking arbitrary artifacts for
                                   provenance purposes within large organizations is very costly
                                 •
                                   Storing, organizing, retrieving and analyzing the artifacts
                                   necessitate allocating large amount of computing resources
                                 •
                                   RDBMS requires trade-offs need to be made between the
                                   amount of captured data and the granularity levels
                                        • Aggregation vs. leaving out data
                                           both impact the potential for analytics




7                                                                                  © 2009 IBM Corporation
Architecture – Data Management




                                 • Cloud-based elastic storage (Hadoop/HBase)
                                        • Distributed column-oriented key-value storage
                                 • NoSQL but BPI API supports
                                        • a limited set of queries
                                        • Joins with constraint that has high selectivity
                                        • Secondary indexing
                                 • Allows to compose annotated graphs of relationships




8                                                                               © 2009 IBM Corporation
Architecture – Data Integration

                                  • Schema-less structure easily allows
                                      • to “dump” everything into data storage
                                      • following a LET (Load Extract Transform) paradigm in
                                        contrast to classical ETL approaches
                                      • RAW data is preserved
                                      • Attributes of interest are extracted based on deployed
                                        and configured transformers

                                      • Integration options:
                                         • Using ESB (especially for real-time processing)
                                         • Loading files that are following a defined XML
                                           schema




9                                                                                 © 2009 IBM Corporation
Architecture – Correlation Module
                                     • Correlation Discovery
                                         • Determines correlation rules that express how certain
                                           events are related to each other by combining a unique
                                           combination of statistics on event attributes
                                         • Applies graph reduction algorithms to reduce the
                                           number of correlation rules

                                    OrderToShipment :
                                    OrderReceived.OrderId = ShipmentCreated.OrderId,
                                    ShipmentCreated.ShipmentId = TransportStarted.ShipmentId,
                                    TransportStarted.TransportId = TransportEnded.TransportId




                                                How can I reduce the complexity for rules?

10                                                                                    © 2009 IBM Corporation
Architecture – Correlation Engine
                                    • Higher level aggregations can be created that include
                                      several lower level aggregation nodes using representation
                                      of correlations.
                                    • Statistics can be calculated over correlated events and
                                      updated every time new events enter a correlation
                                    • User can place queries for aggregates and drill-down based
                                      on his interests




                                                                                © 2009 IBM Corporation
Architecture – Process Aware Analytics
                                   • Pluggable analytics module for
                                         • Process mining
                                         • Process comparison
                                         • Predictive analytics

                                   • Process Mining
                                         • Algorithms can be plugged in (Alpha, Heuristics,
                                            Biased, …)
                                         • Results are transformed to a BPMN representation
                                         • Queries can be applied to mine subsets of traces to
                                            observe variations in the behavior




                                                                               © 2009 IBM Corporation
Architecture – Process Aware Analytics

                                   • Process Comparison
                                         • Tree-Based comparison returns a detailed diff-list of
                                            the process model
                                         • Visual Overlay returns a visual representation of
                                            how process models differ from each other




                                                                                © 2009 IBM Corporation
Architecture – Process Aware Analytics
                                   • Predictive Analytics
                                         • algorithms in BPI currently include decision trees and
                                            an instance-specific probabilistic process model




                                                                                © 2009 IBM Corporation
Research Challenges
BPI addresses several key challenges defined by the process mining manifesto *)

     C1 Finding, Merging, and Cleaning Event Data                   C4 - Dealing with Concept Drift
     When extracting event data suitable for process mining         The process may be changing while being analyzed.
     several challenges need to be addressed:                       Understanding such phenomena is of prime importance
                                                                    for the management of processes.
     • data may be distributed over a variety of sources,
     • event data may be incomplete,
     • an event log may contain outliers and
     • events at different level of granularity.


     C2 Dealing with Complex Event Logs Having Diverse
                                                                    C7 - Cross-Organizational Mining
     Characteristics
     Event logs may be extremely large making them difficult to     Some organizations work together to handle process
     handle whereas other event logs are so small that not enough   instances (e.g., supply chain partners) or organizations
     data is available to make reliable conclusions.                are executing essentially the same process while sharing
                                                                    experiences, knowledge, or a common infrastructure. The
                                                                    analysis of event logs originating from multiple
                                                                    organizations provides several challenges.


     C8 - Providing Operational Support
     Process mining is not restricted to off-line analysis and
     can also be used for online operational support. Three
     operational support activities can be identified: detect,
     predict, and recommend..



15                                                                                                                 © 2009 IBM Corporation
Future Work



 Scale vs. Query Expressiveness
   – Data management scales out on cost of query expressiveness
       • Experiments with relational-cloud hybrid models
 Parallelizing algorithms to scale-out




16                                                                © 2009 IBM Corporation
Thank You




17               © 2009 IBM Corporation

More Related Content

What's hot

Make Your Business More Flexible with Scalable Business Process Management So...
Make Your Business More Flexible with Scalable Business Process Management So...Make Your Business More Flexible with Scalable Business Process Management So...
Make Your Business More Flexible with Scalable Business Process Management So...
Perficient, Inc.
 
IDC Says, Don't Move To The Cloud
IDC Says, Don't Move To The CloudIDC Says, Don't Move To The Cloud
IDC Says, Don't Move To The Cloud
Novell
 
2012 ukdc shared services value prop growth day newbury
2012 ukdc shared services value prop growth day newbury2012 ukdc shared services value prop growth day newbury
2012 ukdc shared services value prop growth day newbury
bara2cls
 
PRESENTATION: Tips and Tricks for Government Agencies to Push the Limits of P...
PRESENTATION: Tips and Tricks for Government Agencies to Push the Limits of P...PRESENTATION: Tips and Tricks for Government Agencies to Push the Limits of P...
PRESENTATION: Tips and Tricks for Government Agencies to Push the Limits of P...
Adlib - The PDF Experts
 
Exadata meeting business challenges! - Doug Cackett
Exadata meeting business challenges! - Doug CackettExadata meeting business challenges! - Doug Cackett
Exadata meeting business challenges! - Doug Cackett
ORACLE USER GROUP ESTONIA
 
SugarCON partner presentation by IBM
SugarCON partner presentation by IBMSugarCON partner presentation by IBM
SugarCON partner presentation by IBM
Bevdewitt
 
Id105 fortify your ibm lotus notes and ibm lotus domino infrastructure agai...
Id105   fortify your ibm lotus notes and ibm lotus domino infrastructure agai...Id105   fortify your ibm lotus notes and ibm lotus domino infrastructure agai...
Id105 fortify your ibm lotus notes and ibm lotus domino infrastructure agai...
waukema
 
E Business
E BusinessE Business
E Business
Rizwan Qamar
 
Voith boosts productivity, cuts costs with IBM Power Systems and DB2
Voith boosts productivity, cuts costs with IBM Power Systems and DB2 Voith boosts productivity, cuts costs with IBM Power Systems and DB2
Voith boosts productivity, cuts costs with IBM Power Systems and DB2
IBM India Smarter Computing
 
Wall Street Technology
Wall Street TechnologyWall Street Technology
Wall Street Technology
Bharat Gera
 
PCTY 2012, Overvågning af forretningssystemer i et virtuelt miljø v. Hans Ped...
PCTY 2012, Overvågning af forretningssystemer i et virtuelt miljø v. Hans Ped...PCTY 2012, Overvågning af forretningssystemer i et virtuelt miljø v. Hans Ped...
PCTY 2012, Overvågning af forretningssystemer i et virtuelt miljø v. Hans Ped...
IBM Danmark
 
Enterprise Architecture
Enterprise ArchitectureEnterprise Architecture
Enterprise Architecture
Raman Kannan
 
Ibm log differentiators for strategic network planning 2011 v6
Ibm log differentiators for strategic network planning 2011 v6Ibm log differentiators for strategic network planning 2011 v6
Ibm log differentiators for strategic network planning 2011 v6
Artem Vinogradov
 
Dc architecture for_cloud
Dc architecture for_cloudDc architecture for_cloud
Dc architecture for_cloud
Alain Geenrits
 
Top 6 Reasons to Use a Distributed Data Grid
Top 6 Reasons to Use a Distributed Data GridTop 6 Reasons to Use a Distributed Data Grid
Top 6 Reasons to Use a Distributed Data Grid
ScaleOut Software
 
1667 making z rules work session
1667 making z rules work session1667 making z rules work session
1667 making z rules work session
nick_garrod
 
Cloud Computing -- Organizational Shift
Cloud Computing -- Organizational ShiftCloud Computing -- Organizational Shift
Cloud Computing -- Organizational Shift
Raman Kannan
 
Network Operations Managed Services (NOMS)
Network Operations Managed Services (NOMS)Network Operations Managed Services (NOMS)
Network Operations Managed Services (NOMS)
TMNG Global
 
Oracle Data Warehouse
Oracle Data WarehouseOracle Data Warehouse
Oracle Data Warehouse
DataminingTools Inc
 
Building a business intelligence architecture fit for the 21st century by Jon...
Building a business intelligence architecture fit for the 21st century by Jon...Building a business intelligence architecture fit for the 21st century by Jon...
Building a business intelligence architecture fit for the 21st century by Jon...
Mark Tapley
 

What's hot (20)

Make Your Business More Flexible with Scalable Business Process Management So...
Make Your Business More Flexible with Scalable Business Process Management So...Make Your Business More Flexible with Scalable Business Process Management So...
Make Your Business More Flexible with Scalable Business Process Management So...
 
IDC Says, Don't Move To The Cloud
IDC Says, Don't Move To The CloudIDC Says, Don't Move To The Cloud
IDC Says, Don't Move To The Cloud
 
2012 ukdc shared services value prop growth day newbury
2012 ukdc shared services value prop growth day newbury2012 ukdc shared services value prop growth day newbury
2012 ukdc shared services value prop growth day newbury
 
PRESENTATION: Tips and Tricks for Government Agencies to Push the Limits of P...
PRESENTATION: Tips and Tricks for Government Agencies to Push the Limits of P...PRESENTATION: Tips and Tricks for Government Agencies to Push the Limits of P...
PRESENTATION: Tips and Tricks for Government Agencies to Push the Limits of P...
 
Exadata meeting business challenges! - Doug Cackett
Exadata meeting business challenges! - Doug CackettExadata meeting business challenges! - Doug Cackett
Exadata meeting business challenges! - Doug Cackett
 
SugarCON partner presentation by IBM
SugarCON partner presentation by IBMSugarCON partner presentation by IBM
SugarCON partner presentation by IBM
 
Id105 fortify your ibm lotus notes and ibm lotus domino infrastructure agai...
Id105   fortify your ibm lotus notes and ibm lotus domino infrastructure agai...Id105   fortify your ibm lotus notes and ibm lotus domino infrastructure agai...
Id105 fortify your ibm lotus notes and ibm lotus domino infrastructure agai...
 
E Business
E BusinessE Business
E Business
 
Voith boosts productivity, cuts costs with IBM Power Systems and DB2
Voith boosts productivity, cuts costs with IBM Power Systems and DB2 Voith boosts productivity, cuts costs with IBM Power Systems and DB2
Voith boosts productivity, cuts costs with IBM Power Systems and DB2
 
Wall Street Technology
Wall Street TechnologyWall Street Technology
Wall Street Technology
 
PCTY 2012, Overvågning af forretningssystemer i et virtuelt miljø v. Hans Ped...
PCTY 2012, Overvågning af forretningssystemer i et virtuelt miljø v. Hans Ped...PCTY 2012, Overvågning af forretningssystemer i et virtuelt miljø v. Hans Ped...
PCTY 2012, Overvågning af forretningssystemer i et virtuelt miljø v. Hans Ped...
 
Enterprise Architecture
Enterprise ArchitectureEnterprise Architecture
Enterprise Architecture
 
Ibm log differentiators for strategic network planning 2011 v6
Ibm log differentiators for strategic network planning 2011 v6Ibm log differentiators for strategic network planning 2011 v6
Ibm log differentiators for strategic network planning 2011 v6
 
Dc architecture for_cloud
Dc architecture for_cloudDc architecture for_cloud
Dc architecture for_cloud
 
Top 6 Reasons to Use a Distributed Data Grid
Top 6 Reasons to Use a Distributed Data GridTop 6 Reasons to Use a Distributed Data Grid
Top 6 Reasons to Use a Distributed Data Grid
 
1667 making z rules work session
1667 making z rules work session1667 making z rules work session
1667 making z rules work session
 
Cloud Computing -- Organizational Shift
Cloud Computing -- Organizational ShiftCloud Computing -- Organizational Shift
Cloud Computing -- Organizational Shift
 
Network Operations Managed Services (NOMS)
Network Operations Managed Services (NOMS)Network Operations Managed Services (NOMS)
Network Operations Managed Services (NOMS)
 
Oracle Data Warehouse
Oracle Data WarehouseOracle Data Warehouse
Oracle Data Warehouse
 
Building a business intelligence architecture fit for the 21st century by Jon...
Building a business intelligence architecture fit for the 21st century by Jon...Building a business intelligence architecture fit for the 21st century by Jon...
Building a business intelligence architecture fit for the 21st century by Jon...
 

Viewers also liked

Large-Scale Distributed Storage System for Business Provenance - Cloud 2011
Large-Scale Distributed Storage System for Business Provenance - Cloud 2011Large-Scale Distributed Storage System for Business Provenance - Cloud 2011
Large-Scale Distributed Storage System for Business Provenance - Cloud 2011
Szabolcs Rozsnyai
 
Automated Correlation Discovery for Semi-Structured Business Processes
Automated Correlation Discovery for Semi-Structured Business ProcessesAutomated Correlation Discovery for Semi-Structured Business Processes
Automated Correlation Discovery for Semi-Structured Business Processes
Szabolcs Rozsnyai
 
Business Process Management and Virtual Worlds
Business Process Management and Virtual WorldsBusiness Process Management and Virtual Worlds
Business Process Management and Virtual Worlds
Ian Hughes / epredator
 
Business process modelling with sbi an example
Business process modelling with sbi an exampleBusiness process modelling with sbi an example
Business process modelling with sbi an example
Satyam Anand
 
Business Process Modeling Case Study
Business Process Modeling Case StudyBusiness Process Modeling Case Study
Business Process Modeling Case Study
Akash Gajjar
 
Supply chain excellence
Supply chain excellenceSupply chain excellence
Supply chain excellence
Keivan Zokaei
 
Business Process Management in Higher Education Institutions - an award winni...
Business Process Management in Higher Education Institutions - an award winni...Business Process Management in Higher Education Institutions - an award winni...
Business Process Management in Higher Education Institutions - an award winni...
Tomislav Rozman
 
The Forrester Wave BPM Suites 2013
The Forrester Wave BPM Suites 2013The Forrester Wave BPM Suites 2013
The Forrester Wave BPM Suites 2013
Luciano Gomes
 
Beyond Process Mining: Discovering Business Rules From Event Logs
Beyond Process Mining: Discovering Business Rules From Event LogsBeyond Process Mining: Discovering Business Rules From Event Logs
Beyond Process Mining: Discovering Business Rules From Event Logs
Marlon Dumas
 
Introduction to the BPM Lifecycle
Introduction to the BPM LifecycleIntroduction to the BPM Lifecycle
Introduction to the BPM Lifecycle
Michael zur Muehlen
 
Service Management with Odoo/OpenERP - Opendays 2014
Service Management with Odoo/OpenERP - Opendays 2014Service Management with Odoo/OpenERP - Opendays 2014
Service Management with Odoo/OpenERP - Opendays 2014
Daniel Reis
 
H&M Strategic Recommendations in Depth
H&M Strategic Recommendations in DepthH&M Strategic Recommendations in Depth
H&M Strategic Recommendations in Depth
Vasiliki Evangelou
 

Viewers also liked (12)

Large-Scale Distributed Storage System for Business Provenance - Cloud 2011
Large-Scale Distributed Storage System for Business Provenance - Cloud 2011Large-Scale Distributed Storage System for Business Provenance - Cloud 2011
Large-Scale Distributed Storage System for Business Provenance - Cloud 2011
 
Automated Correlation Discovery for Semi-Structured Business Processes
Automated Correlation Discovery for Semi-Structured Business ProcessesAutomated Correlation Discovery for Semi-Structured Business Processes
Automated Correlation Discovery for Semi-Structured Business Processes
 
Business Process Management and Virtual Worlds
Business Process Management and Virtual WorldsBusiness Process Management and Virtual Worlds
Business Process Management and Virtual Worlds
 
Business process modelling with sbi an example
Business process modelling with sbi an exampleBusiness process modelling with sbi an example
Business process modelling with sbi an example
 
Business Process Modeling Case Study
Business Process Modeling Case StudyBusiness Process Modeling Case Study
Business Process Modeling Case Study
 
Supply chain excellence
Supply chain excellenceSupply chain excellence
Supply chain excellence
 
Business Process Management in Higher Education Institutions - an award winni...
Business Process Management in Higher Education Institutions - an award winni...Business Process Management in Higher Education Institutions - an award winni...
Business Process Management in Higher Education Institutions - an award winni...
 
The Forrester Wave BPM Suites 2013
The Forrester Wave BPM Suites 2013The Forrester Wave BPM Suites 2013
The Forrester Wave BPM Suites 2013
 
Beyond Process Mining: Discovering Business Rules From Event Logs
Beyond Process Mining: Discovering Business Rules From Event LogsBeyond Process Mining: Discovering Business Rules From Event Logs
Beyond Process Mining: Discovering Business Rules From Event Logs
 
Introduction to the BPM Lifecycle
Introduction to the BPM LifecycleIntroduction to the BPM Lifecycle
Introduction to the BPM Lifecycle
 
Service Management with Odoo/OpenERP - Opendays 2014
Service Management with Odoo/OpenERP - Opendays 2014Service Management with Odoo/OpenERP - Opendays 2014
Service Management with Odoo/OpenERP - Opendays 2014
 
H&M Strategic Recommendations in Depth
H&M Strategic Recommendations in DepthH&M Strategic Recommendations in Depth
H&M Strategic Recommendations in Depth
 

Similar to Business Process Insight - SRII 2012

How to get cloud architecture and design right the first time 2012
How to get cloud architecture and design right the first time 2012How to get cloud architecture and design right the first time 2012
How to get cloud architecture and design right the first time 2012
David Linthicum
 
Network Sage™ Into To C Level V1.4
Network Sage™ Into To C Level V1.4Network Sage™ Into To C Level V1.4
Network Sage™ Into To C Level V1.4
ikirmer
 
Cloud computing sucess
Cloud computing sucess Cloud computing sucess
Cloud computing sucess
Anwar Bakhashwain
 
Ugif 04 2011 france ug04042011-jroy_part1
Ugif 04 2011   france ug04042011-jroy_part1Ugif 04 2011   france ug04042011-jroy_part1
Ugif 04 2011 france ug04042011-jroy_part1
UGIF
 
The Power of Two: Using IBM Standards Processing Engine for EDI Commerce or H...
The Power of Two: Using IBM Standards Processing Engine for EDI Commerce or H...The Power of Two: Using IBM Standards Processing Engine for EDI Commerce or H...
The Power of Two: Using IBM Standards Processing Engine for EDI Commerce or H...
Brian Wilson
 
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
Igor De Souza
 
Kognitio overview april 2013
Kognitio overview april 2013Kognitio overview april 2013
Kognitio overview april 2013
Kognitio
 
Using a private cloud to automate and govern enterprise development
Using a private cloud to automate and govern enterprise developmentUsing a private cloud to automate and govern enterprise development
Using a private cloud to automate and govern enterprise development
WSO2
 
Journey to the Programmable Data Center
Journey to the Programmable Data CenterJourney to the Programmable Data Center
Journey to the Programmable Data Center
Toby Weiss
 
Analyze This! Best Practices For Big And Fast Data
Analyze This! Best Practices For Big And Fast DataAnalyze This! Best Practices For Big And Fast Data
Analyze This! Best Practices For Big And Fast Data
EMC
 
AWS Partner Presentation - PetaByte Scale Computing on Amazon EC2 with BigDat...
AWS Partner Presentation - PetaByte Scale Computing on Amazon EC2 with BigDat...AWS Partner Presentation - PetaByte Scale Computing on Amazon EC2 with BigDat...
AWS Partner Presentation - PetaByte Scale Computing on Amazon EC2 with BigDat...
Amazon Web Services
 
Cloud nativecomputingtechnologysupportinghpc cognitiveworkflows
Cloud nativecomputingtechnologysupportinghpc cognitiveworkflowsCloud nativecomputingtechnologysupportinghpc cognitiveworkflows
Cloud nativecomputingtechnologysupportinghpc cognitiveworkflows
Yong Feng
 
Konsolider, optimer og automatiser dit servermiljø med IBM PureApplications S...
Konsolider, optimer og automatiser dit servermiljø med IBM PureApplications S...Konsolider, optimer og automatiser dit servermiljø med IBM PureApplications S...
Konsolider, optimer og automatiser dit servermiljø med IBM PureApplications S...
IBM Danmark
 
What is BI on Cloud
What is BI on CloudWhat is BI on Cloud
What is BI on Cloud
tdwiindia
 
An overview of modern scalable web development
An overview of modern scalable web developmentAn overview of modern scalable web development
An overview of modern scalable web development
Tung Nguyen
 
Fi nf068c73aef66f694f31a049aff3f4
Fi nf068c73aef66f694f31a049aff3f4Fi nf068c73aef66f694f31a049aff3f4
Fi nf068c73aef66f694f31a049aff3f4
Shawn D'souza
 
Kognitio feb 2013
Kognitio feb 2013Kognitio feb 2013
Kognitio feb 2013
Kognitio
 
Postgres Plus Cloud Database
Postgres Plus Cloud DatabasePostgres Plus Cloud Database
Postgres Plus Cloud Database
Gary Carter
 
ADV Slides: Data Pipelines in the Enterprise and Comparison
ADV Slides: Data Pipelines in the Enterprise and ComparisonADV Slides: Data Pipelines in the Enterprise and Comparison
ADV Slides: Data Pipelines in the Enterprise and Comparison
DATAVERSITY
 
Destroying Data Silos
Destroying Data SilosDestroying Data Silos
Destroying Data Silos
DataWorks Summit
 

Similar to Business Process Insight - SRII 2012 (20)

How to get cloud architecture and design right the first time 2012
How to get cloud architecture and design right the first time 2012How to get cloud architecture and design right the first time 2012
How to get cloud architecture and design right the first time 2012
 
Network Sage™ Into To C Level V1.4
Network Sage™ Into To C Level V1.4Network Sage™ Into To C Level V1.4
Network Sage™ Into To C Level V1.4
 
Cloud computing sucess
Cloud computing sucess Cloud computing sucess
Cloud computing sucess
 
Ugif 04 2011 france ug04042011-jroy_part1
Ugif 04 2011   france ug04042011-jroy_part1Ugif 04 2011   france ug04042011-jroy_part1
Ugif 04 2011 france ug04042011-jroy_part1
 
The Power of Two: Using IBM Standards Processing Engine for EDI Commerce or H...
The Power of Two: Using IBM Standards Processing Engine for EDI Commerce or H...The Power of Two: Using IBM Standards Processing Engine for EDI Commerce or H...
The Power of Two: Using IBM Standards Processing Engine for EDI Commerce or H...
 
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
 
Kognitio overview april 2013
Kognitio overview april 2013Kognitio overview april 2013
Kognitio overview april 2013
 
Using a private cloud to automate and govern enterprise development
Using a private cloud to automate and govern enterprise developmentUsing a private cloud to automate and govern enterprise development
Using a private cloud to automate and govern enterprise development
 
Journey to the Programmable Data Center
Journey to the Programmable Data CenterJourney to the Programmable Data Center
Journey to the Programmable Data Center
 
Analyze This! Best Practices For Big And Fast Data
Analyze This! Best Practices For Big And Fast DataAnalyze This! Best Practices For Big And Fast Data
Analyze This! Best Practices For Big And Fast Data
 
AWS Partner Presentation - PetaByte Scale Computing on Amazon EC2 with BigDat...
AWS Partner Presentation - PetaByte Scale Computing on Amazon EC2 with BigDat...AWS Partner Presentation - PetaByte Scale Computing on Amazon EC2 with BigDat...
AWS Partner Presentation - PetaByte Scale Computing on Amazon EC2 with BigDat...
 
Cloud nativecomputingtechnologysupportinghpc cognitiveworkflows
Cloud nativecomputingtechnologysupportinghpc cognitiveworkflowsCloud nativecomputingtechnologysupportinghpc cognitiveworkflows
Cloud nativecomputingtechnologysupportinghpc cognitiveworkflows
 
Konsolider, optimer og automatiser dit servermiljø med IBM PureApplications S...
Konsolider, optimer og automatiser dit servermiljø med IBM PureApplications S...Konsolider, optimer og automatiser dit servermiljø med IBM PureApplications S...
Konsolider, optimer og automatiser dit servermiljø med IBM PureApplications S...
 
What is BI on Cloud
What is BI on CloudWhat is BI on Cloud
What is BI on Cloud
 
An overview of modern scalable web development
An overview of modern scalable web developmentAn overview of modern scalable web development
An overview of modern scalable web development
 
Fi nf068c73aef66f694f31a049aff3f4
Fi nf068c73aef66f694f31a049aff3f4Fi nf068c73aef66f694f31a049aff3f4
Fi nf068c73aef66f694f31a049aff3f4
 
Kognitio feb 2013
Kognitio feb 2013Kognitio feb 2013
Kognitio feb 2013
 
Postgres Plus Cloud Database
Postgres Plus Cloud DatabasePostgres Plus Cloud Database
Postgres Plus Cloud Database
 
ADV Slides: Data Pipelines in the Enterprise and Comparison
ADV Slides: Data Pipelines in the Enterprise and ComparisonADV Slides: Data Pipelines in the Enterprise and Comparison
ADV Slides: Data Pipelines in the Enterprise and Comparison
 
Destroying Data Silos
Destroying Data SilosDestroying Data Silos
Destroying Data Silos
 

Recently uploaded

Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
DanBrown980551
 
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
zjhamm304
 
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
AlexanderRichford
 
What is an RPA CoE? Session 2 – CoE Roles
What is an RPA CoE?  Session 2 – CoE RolesWhat is an RPA CoE?  Session 2 – CoE Roles
What is an RPA CoE? Session 2 – CoE Roles
DianaGray10
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
LizaNolte
 
A Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's ArchitectureA Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's Architecture
ScyllaDB
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
christinelarrosa
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
Fwdays
 
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeckPoznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
FilipTomaszewski5
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
Pablo Gómez Abajo
 
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdfLee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
leebarnesutopia
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
From Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMsFrom Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMs
Sease
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Neo4j
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 
"What does it really mean for your system to be available, or how to define w...
"What does it really mean for your system to be available, or how to define w..."What does it really mean for your system to be available, or how to define w...
"What does it really mean for your system to be available, or how to define w...
Fwdays
 
Must Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during MigrationMust Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during Migration
Mydbops
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
christinelarrosa
 

Recently uploaded (20)

Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
 
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
 
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
 
What is an RPA CoE? Session 2 – CoE Roles
What is an RPA CoE?  Session 2 – CoE RolesWhat is an RPA CoE?  Session 2 – CoE Roles
What is an RPA CoE? Session 2 – CoE Roles
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
 
A Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's ArchitectureA Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's Architecture
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
 
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeckPoznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
 
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdfLee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
From Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMsFrom Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMs
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 
"What does it really mean for your system to be available, or how to define w...
"What does it really mean for your system to be available, or how to define w..."What does it really mean for your system to be available, or how to define w...
"What does it really mean for your system to be available, or how to define w...
 
Must Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during MigrationMust Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during Migration
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
 

Business Process Insight - SRII 2012

  • 1. Szabolcs Rozsnyai July 2012 Business Process Insight An Approach and Platform for the Discovery and Analysis of End-to-End Business Processes Szabolcs Rozsnyai, Geetika T. Lakshmanan, Vinod Muthusamy, Rania Khalaf and Matthew J. Duftler © 2009 IBM Corporation
  • 2. IBM Presentation Template Full Version Agenda  Introduction and Motivation  BPI Life-Cycle  Architecture  Research Challenges  Conclusion and Future Work Source: If applicable, describe source origin 2 © 2009 IBM Corporation
  • 3. Introduction and Motivation 1/2 Understanding, managing and improving business processes in complex environments proves to be a significant challenge and has a severe impact on the organizations process maturity Organizational Challenges Technical Challenges •Business processes • Processes are not coordinated by one entity • can stretch across complex organizational silos and in • Systems are loosely coupled, heterogeneous and distributed many cases even extends to customers • Business Process artifacts range from simple record entries to • are not necessarily complete or accurate complex events at various granularity levels • are heavily human-driven, require a lot of knowledge • Business Process activities might be represented through and have a large number of exceptions multiple events • Are simplified to preserve a high degree of freedom • Sometimes workflow engines emit events to mark the • Are often in the heads of individuals, groups or buried start and end of an activity in application logic 3 © 2009 IBM Corporation
  • 4. Introduction and Motivation 2/2  We propose a system to enable Process Intelligence from two perspectives – Analytics on historical data • to understand what, how, who and why aspects of end-to-end business process based on real-time and historical data • identify root causes of problems, • understand process deficiencies and • provides means to improve process performance – Analytics on real-time data • to increase the effectiveness of business operations, and managing operational risk • to identify and predict situations in order to react on them BPI platform is a software as a service (SaaS) enabled, collaborative system that realizes the end-to-end BPI life-cycle. Process Intelligence BI BAM CEP BPM Process Mining The platform allows users to manage a variety of data at different levels of granularity including raw captured events, correlated instance traces, mined process models, and prediction alerts. 4 © 2009 IBM Corporation
  • 5. BPI Life-Cycle 5 © 2009 IBM Corporation
  • 6. Architecture Overview 6 © 2009 IBM Corporation
  • 7. Architecture – Data Management • Volume and the complexity makes tracking and processing a difficult and resource intensive task • As data grows at a very high rate, tracking arbitrary artifacts for provenance purposes within large organizations is very costly • Storing, organizing, retrieving and analyzing the artifacts necessitate allocating large amount of computing resources • RDBMS requires trade-offs need to be made between the amount of captured data and the granularity levels • Aggregation vs. leaving out data  both impact the potential for analytics 7 © 2009 IBM Corporation
  • 8. Architecture – Data Management • Cloud-based elastic storage (Hadoop/HBase) • Distributed column-oriented key-value storage • NoSQL but BPI API supports • a limited set of queries • Joins with constraint that has high selectivity • Secondary indexing • Allows to compose annotated graphs of relationships 8 © 2009 IBM Corporation
  • 9. Architecture – Data Integration • Schema-less structure easily allows • to “dump” everything into data storage • following a LET (Load Extract Transform) paradigm in contrast to classical ETL approaches • RAW data is preserved • Attributes of interest are extracted based on deployed and configured transformers • Integration options: • Using ESB (especially for real-time processing) • Loading files that are following a defined XML schema 9 © 2009 IBM Corporation
  • 10. Architecture – Correlation Module • Correlation Discovery • Determines correlation rules that express how certain events are related to each other by combining a unique combination of statistics on event attributes • Applies graph reduction algorithms to reduce the number of correlation rules OrderToShipment : OrderReceived.OrderId = ShipmentCreated.OrderId, ShipmentCreated.ShipmentId = TransportStarted.ShipmentId, TransportStarted.TransportId = TransportEnded.TransportId How can I reduce the complexity for rules? 10 © 2009 IBM Corporation
  • 11. Architecture – Correlation Engine • Higher level aggregations can be created that include several lower level aggregation nodes using representation of correlations. • Statistics can be calculated over correlated events and updated every time new events enter a correlation • User can place queries for aggregates and drill-down based on his interests © 2009 IBM Corporation
  • 12. Architecture – Process Aware Analytics • Pluggable analytics module for • Process mining • Process comparison • Predictive analytics • Process Mining • Algorithms can be plugged in (Alpha, Heuristics, Biased, …) • Results are transformed to a BPMN representation • Queries can be applied to mine subsets of traces to observe variations in the behavior © 2009 IBM Corporation
  • 13. Architecture – Process Aware Analytics • Process Comparison • Tree-Based comparison returns a detailed diff-list of the process model • Visual Overlay returns a visual representation of how process models differ from each other © 2009 IBM Corporation
  • 14. Architecture – Process Aware Analytics • Predictive Analytics • algorithms in BPI currently include decision trees and an instance-specific probabilistic process model © 2009 IBM Corporation
  • 15. Research Challenges BPI addresses several key challenges defined by the process mining manifesto *) C1 Finding, Merging, and Cleaning Event Data C4 - Dealing with Concept Drift When extracting event data suitable for process mining The process may be changing while being analyzed. several challenges need to be addressed: Understanding such phenomena is of prime importance for the management of processes. • data may be distributed over a variety of sources, • event data may be incomplete, • an event log may contain outliers and • events at different level of granularity. C2 Dealing with Complex Event Logs Having Diverse C7 - Cross-Organizational Mining Characteristics Event logs may be extremely large making them difficult to Some organizations work together to handle process handle whereas other event logs are so small that not enough instances (e.g., supply chain partners) or organizations data is available to make reliable conclusions. are executing essentially the same process while sharing experiences, knowledge, or a common infrastructure. The analysis of event logs originating from multiple organizations provides several challenges. C8 - Providing Operational Support Process mining is not restricted to off-line analysis and can also be used for online operational support. Three operational support activities can be identified: detect, predict, and recommend.. 15 © 2009 IBM Corporation
  • 16. Future Work  Scale vs. Query Expressiveness – Data management scales out on cost of query expressiveness • Experiments with relational-cloud hybrid models  Parallelizing algorithms to scale-out 16 © 2009 IBM Corporation
  • 17. Thank You 17 © 2009 IBM Corporation