SlideShare a Scribd company logo
1 of 22
IBM Research - India, New Delhi, India‡
IBM TJ Watson Research Center, New York, USA†
   Free form diagramming tools (e.g., Visio, Powerpoint) are preferred in
    creation for initial process models
     Ease of use, Intuitiveness
     Ubiquity
     Doesn’t hinder your creativity

   Process modeling software (e.g., WBM, ARIS) create models with formal
    underpinnings
       Allow formal analysis, model checking
       Process Reuse
       Process Improvement
       Traceability with realized executable process

   Sound, automatic approach to convert process diagrams to formal process
    models is essential
     A bridge between the worlds of diagramming and formal modeling


          September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
   Challenges
     Ambiguities in diagrams
     Limitation of existing capabilities

   Approach
     Structure Inference
     Semantic Interpretation

   Empirical Study

   Related Work & Future directions
        September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Human can interpret different visual cues in
drawings to correctly resolve the structure and
semantics of the models, but machines cannot
do the same!


  September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Connectors not glued to shapes at their endpoints


                                          Missing
                                          Edge




                                          Missing
                                          Edge



                                          Missing
                                          Edge
Text annotations not explicitly part of any shape for node/edge
Same shape conveys multiple semantics




                           Same semantic conveyed in multiple shapes




September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
   Popular BPM tools such as Websphere Business
    Modeler, ARIS, Lombardi, Telelogic System Architect,
    have Visio import capabilities
   Create imprecise flow structure when faced with
    structural ambiguities
   Employ a simple mapping (fixed or pluggable) from a
    set of diagram shapes to a target set of process
    semantics to interpret semantics
     Such an approach cannot deal with under-specification
     Building an exhaustive mapping is painful in presence of
      over-specification

        September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Process                                           Diagram Parsing
Diagram                                             Attributes such as             Use format specific
                     Parse information
                                                 coordinates, dimensions,          SDKs or parse XML
                   about diagram shapes
                                                     text, geometry                     formats


Shapes &
Attributes
                                               Structure Inference
                   Precisely determine the          Deal with structural       Extract features for each
                    underlying flow graph              ambiguities                  node and edge
 Flow
 Graph

                                            Semantic Interpretation
                     Assign process semantics to every
Process            node and edge in the flow graph using
                                                                Supervised and unsupervised schemes
 Model                                                                 to train such a classifier
                             a trained classifier

      September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Process                                           Diagram Parsing
Diagram                                             Attributes such as             Use format specific
                     Parse information
                                                 coordinates, dimensions,          SDKs or parse XML
                   about diagram shapes
                                                     text, geometry                     formats


Shapes &
Attributes
                                               Structure Inference
                   Precisely determine the          Deal with structural       Extract features for each
                    underlying flow graph              ambiguities                  node and edge
 Flow
 Graph

                                            Semantic Interpretation
                     Assign process semantics to every
Process            node and edge in the flow graph using
                                                                Supervised and unsupervised schemes
 Model                                                                 to train such a classifier
                             a trained classifier

      September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
A                                         B A                                       B




                          C               D                        C                    D




September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
    Uses notion of connection
                                                                  points created at node – line
    SRC    SRC            SRC        SRC
                                     TGT
                                                                  and line – line intersections
    SRC    NEU            NEU
    C1     C2             C5         C8
A                                             B                  Assign direction to connection
          C3 TGT        UNK C6
                                                                  points
                         SRC
          SRC C4         TGT   C7                                Starting at connection points
                                                                  attached to nodes, propagate
            C               D                                     their directions along paths in
                                                                  which the directions are
                                                                  consistent and identifies the
                                                                  reached nodes
                                                                 Create edges if connection
                                                                  point at reached node has a
                                                                  different direction


          September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Process                                           Diagram Parsing
Diagram                                             Attributes such as             Use format specific
                     Parse information
                                                 coordinates, dimensions,          SDKs or parse XML
                   about diagram shapes
                                                     text, geometry                     formats


Shapes &
Attributes
                                               Structure Inference
                   Precisely determine the          Deal with structural       Extract features for each
                    underlying flow graph              ambiguities                  node and edge
 Flow
 Graph

                                            Semantic Interpretation
                     Assign process semantics to every
Process            node and edge in the flow graph using
                                                                Supervised and unsupervised schemes
 Model                                                                 to train such a classifier
                             a trained classifier

      September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
   Train a classifier to mimic human reasoning
    to decide process semantics
   Features used for classification:
     Relational: Indegree, Outdegree, Count of nodes
      contained within
     Geometric: Shape name, Count of horizontal,
      vertical, diagonal lines
     Textual: Count of cue words for every target entity


    September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Structure                             {Nodes, Edges}
        Flow                                                                    Annotated by
      Diagrams                           Inference
                                                                                  Features




              Classifier                     {Nodes, Edges}
                                          Annotated by Features
                                           + Process Semantic




Classifier establishes correspondence                                      An expert labels all nodes &
between the features and labels for                                        edges in the input set of
process semantics                                                          diagrams by their semantics

           September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Flow                         Structure                      {Nodes, Edges}                     Clusterer
Diagrams                       Inference                       Annotated by
                                                                 Features




                                                   Cluster A = Semantic X
       {Nodes, Edges}                                                                                          Cluster A
    Annotated by Features
     + Process Semantic                            Cluster B = Semantic Y                                      Cluster B



                                                                                                              Clusters
                                                                                                                have
                                                                    An expert looks at                        common
                                                                    exemplars from each                      semantics
           Classifier                                               cluster to label process
                                                                    semantic of the cluster


           September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
   Data Set: 185 Visio process diagrams created in real
    business-transformation projects
   Objective: Compare accuracy of our tool iDISCOVER
    and a popular modeling tool (called PMT for
    proprietary reasons)
   Method: Compare tool outputs with models created
    manually by human experts to measure precision &
    recall
   Precision =         |Actual ∩ Retrieved| , Recall = |Actual ∩ Retrieved|
                            |Retrieved|                      |Actual|

        September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Node                 96.93                   95.91                    70.44                   86.29
Edge                 93.26                   90.86                    63.43                   59.87




Dangling                   47 (100%)                     3 (14%)                        56%
Connector
Unlinked Labels            46 (39%)                      2 (3.7%)                       38%


Count of dangling connectors has a greater correlation with the edge recall of
(ρ = −0.48) than with the edge recall of           (ρ = −0.08).
            September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Our            (Overall Δ ≈30%) and           (Overall Δ ≈20%) for all process semantic classes
 •Accuracy is low only for scarce entities like Intermediate Events and Data Objects (together
are greater than that of
 less than 3% of the data set)
                 is almost as good as
 •Better results possible with a more equitable distribution of entities work almost
        Size of the training data need not be huge. Classification could
        as well with only a third of the dataset size


             September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
   Large body of work in the area of
    understanding line drawings and hand
    sketches (e.g., Futrelle, Gross, Barbu)
     Focus on identifying shape geometry
     Semantic interpretation follows directly from a
      fixed mapping between shape geometry and target
      semantics
     Visual Language theory prescribes geometry
      detection with grammar rules.
      September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
   More efficient modeling of textual cues
     Text is the only reliable feature in highly ambiguous
      scenarios

   Tracking spatial patterns of shapes and labels
    that emerge due to local styles
   Identification of higher-level relations (block
    structures) between model entities (e.g., sub-
    process, loop, and fork-merge)
   Extend to other diagram types
       September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
   Informal process diagrams contain structural and
    semantic ambiguities – need to be dealt with in order to
    discover precise formal models
   Existing capabilities are limited because:
       Do not resolve structural ambiguities
       Interpreting semantic based on shape name does not suffice

   Standard pattern-classification techniques can be
    successfully employed in interpreting process semantics if
    the feature space is carefully modeled to mimic human
    reasoning
     Unsupervised clustering can almost match supervised
        techniques in performance

          September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA

More Related Content

Similar to From Informal Process Diagrams To Formal Process Models

Software Design_Se lect16 btech
Software Design_Se lect16 btechSoftware Design_Se lect16 btech
Software Design_Se lect16 btechIIITA
 
BPM Standards - What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)
BPM Standards -  What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)BPM Standards -  What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)
BPM Standards - What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)Denis Gagné
 
Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...
Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...
Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...Sotiris Koussouris
 
Event-driven Model Transformations in Domain-specific Modeling Languages
Event-driven Model Transformations in Domain-specific Modeling LanguagesEvent-driven Model Transformations in Domain-specific Modeling Languages
Event-driven Model Transformations in Domain-specific Modeling LanguagesIstvan Rath
 
scenario planning at the city scale | Doug Walker
scenario planning at the city scale | Doug Walkerscenario planning at the city scale | Doug Walker
scenario planning at the city scale | Doug WalkerOpenPlans
 
Open source print quality software
Open source print quality softwareOpen source print quality software
Open source print quality softwareChristophe Mercier
 
PHX Session #5 : Architecture Without Big Design Up Front (Garibay)
PHX Session #5 : Architecture Without Big Design Up Front (Garibay)PHX Session #5 : Architecture Without Big Design Up Front (Garibay)
PHX Session #5 : Architecture Without Big Design Up Front (Garibay)Steve Lange
 
Industrial and Academic Experiences with a User Interaction Modeling Language...
Industrial and Academic Experiences with a User Interaction Modeling Language...Industrial and Academic Experiences with a User Interaction Modeling Language...
Industrial and Academic Experiences with a User Interaction Modeling Language...Marco Brambilla
 
Axel uhl sap@md-day2011
Axel uhl sap@md-day2011Axel uhl sap@md-day2011
Axel uhl sap@md-day2011MDDAY11
 
Advanced Excel
Advanced Excel Advanced Excel
Advanced Excel eduCBA
 
Star Ccm604 Features Final
Star Ccm604 Features FinalStar Ccm604 Features Final
Star Ccm604 Features FinalEd Erb
 
An interactive exploration environment for complex process design
An interactive exploration environment for complex process designAn interactive exploration environment for complex process design
An interactive exploration environment for complex process designEric Fraga
 
BA conf presentation 2010
BA conf presentation 2010BA conf presentation 2010
BA conf presentation 2010Julen Mohanty
 
01.egovFrame Training Book II
01.egovFrame Training Book II01.egovFrame Training Book II
01.egovFrame Training Book IIChuong Nguyen
 
4+1view architecture
4+1view architecture4+1view architecture
4+1view architecturedrewz lin
 
4+1view architecture
4+1view architecture4+1view architecture
4+1view architectureTot Bob
 
Software Architecture: views and viewpoints
Software Architecture: views and viewpointsSoftware Architecture: views and viewpoints
Software Architecture: views and viewpointsHenry Muccini
 
On the Assessment of Pointcut Design in Evolving Aspect-Oriented Software
On the Assessment of Pointcut Design in Evolving Aspect-Oriented SoftwareOn the Assessment of Pointcut Design in Evolving Aspect-Oriented Software
On the Assessment of Pointcut Design in Evolving Aspect-Oriented SoftwareRaffi Khatchadourian
 
Framework Engineering_Final
Framework Engineering_FinalFramework Engineering_Final
Framework Engineering_FinalYoungSu Son
 

Similar to From Informal Process Diagrams To Formal Process Models (20)

Software Design_Se lect16 btech
Software Design_Se lect16 btechSoftware Design_Se lect16 btech
Software Design_Se lect16 btech
 
BPM Standards - What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)
BPM Standards -  What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)BPM Standards -  What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)
BPM Standards - What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)
 
Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...
Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...
Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...
 
Event-driven Model Transformations in Domain-specific Modeling Languages
Event-driven Model Transformations in Domain-specific Modeling LanguagesEvent-driven Model Transformations in Domain-specific Modeling Languages
Event-driven Model Transformations in Domain-specific Modeling Languages
 
scenario planning at the city scale | Doug Walker
scenario planning at the city scale | Doug Walkerscenario planning at the city scale | Doug Walker
scenario planning at the city scale | Doug Walker
 
Open source print quality software
Open source print quality softwareOpen source print quality software
Open source print quality software
 
PHX Session #5 : Architecture Without Big Design Up Front (Garibay)
PHX Session #5 : Architecture Without Big Design Up Front (Garibay)PHX Session #5 : Architecture Without Big Design Up Front (Garibay)
PHX Session #5 : Architecture Without Big Design Up Front (Garibay)
 
Industrial and Academic Experiences with a User Interaction Modeling Language...
Industrial and Academic Experiences with a User Interaction Modeling Language...Industrial and Academic Experiences with a User Interaction Modeling Language...
Industrial and Academic Experiences with a User Interaction Modeling Language...
 
Axel uhl sap@md-day2011
Axel uhl sap@md-day2011Axel uhl sap@md-day2011
Axel uhl sap@md-day2011
 
Advanced Excel
Advanced Excel Advanced Excel
Advanced Excel
 
Star Ccm604 Features Final
Star Ccm604 Features FinalStar Ccm604 Features Final
Star Ccm604 Features Final
 
An interactive exploration environment for complex process design
An interactive exploration environment for complex process designAn interactive exploration environment for complex process design
An interactive exploration environment for complex process design
 
BA conf presentation 2010
BA conf presentation 2010BA conf presentation 2010
BA conf presentation 2010
 
01.egovFrame Training Book II
01.egovFrame Training Book II01.egovFrame Training Book II
01.egovFrame Training Book II
 
4+1view architecture
4+1view architecture4+1view architecture
4+1view architecture
 
4+1view architecture
4+1view architecture4+1view architecture
4+1view architecture
 
Software Architecture: views and viewpoints
Software Architecture: views and viewpointsSoftware Architecture: views and viewpoints
Software Architecture: views and viewpoints
 
On the Assessment of Pointcut Design in Evolving Aspect-Oriented Software
On the Assessment of Pointcut Design in Evolving Aspect-Oriented SoftwareOn the Assessment of Pointcut Design in Evolving Aspect-Oriented Software
On the Assessment of Pointcut Design in Evolving Aspect-Oriented Software
 
Software design
Software designSoftware design
Software design
 
Framework Engineering_Final
Framework Engineering_FinalFramework Engineering_Final
Framework Engineering_Final
 

More from Debdoot Mukherjee

Serving Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersServing Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersDebdoot Mukherjee
 
Determining QoS of WS-BPEL Compositions
Determining QoS of WS-BPEL CompositionsDetermining QoS of WS-BPEL Compositions
Determining QoS of WS-BPEL CompositionsDebdoot Mukherjee
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Debdoot Mukherjee
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Debdoot Mukherjee
 
Which Work-Item Updates Need Your Response?
Which Work-Item Updates Need Your Response?Which Work-Item Updates Need Your Response?
Which Work-Item Updates Need Your Response?Debdoot Mukherjee
 

More from Debdoot Mukherjee (6)

Serving Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersServing Information Needs of Knowledge Workers
Serving Information Needs of Knowledge Workers
 
Determining QoS of WS-BPEL Compositions
Determining QoS of WS-BPEL CompositionsDetermining QoS of WS-BPEL Compositions
Determining QoS of WS-BPEL Compositions
 
Scc talk
Scc talkScc talk
Scc talk
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
 
Which Work-Item Updates Need Your Response?
Which Work-Item Updates Need Your Response?Which Work-Item Updates Need Your Response?
Which Work-Item Updates Need Your Response?
 

Recently uploaded

TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 

Recently uploaded (20)

TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 

From Informal Process Diagrams To Formal Process Models

  • 1. IBM Research - India, New Delhi, India‡ IBM TJ Watson Research Center, New York, USA†
  • 2. Free form diagramming tools (e.g., Visio, Powerpoint) are preferred in creation for initial process models  Ease of use, Intuitiveness  Ubiquity  Doesn’t hinder your creativity  Process modeling software (e.g., WBM, ARIS) create models with formal underpinnings  Allow formal analysis, model checking  Process Reuse  Process Improvement  Traceability with realized executable process  Sound, automatic approach to convert process diagrams to formal process models is essential  A bridge between the worlds of diagramming and formal modeling September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 3. Challenges  Ambiguities in diagrams  Limitation of existing capabilities  Approach  Structure Inference  Semantic Interpretation  Empirical Study  Related Work & Future directions September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 4. Human can interpret different visual cues in drawings to correctly resolve the structure and semantics of the models, but machines cannot do the same! September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 5. Connectors not glued to shapes at their endpoints Missing Edge Missing Edge Missing Edge
  • 6. Text annotations not explicitly part of any shape for node/edge
  • 7. Same shape conveys multiple semantics Same semantic conveyed in multiple shapes September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 8. Popular BPM tools such as Websphere Business Modeler, ARIS, Lombardi, Telelogic System Architect, have Visio import capabilities  Create imprecise flow structure when faced with structural ambiguities  Employ a simple mapping (fixed or pluggable) from a set of diagram shapes to a target set of process semantics to interpret semantics  Such an approach cannot deal with under-specification  Building an exhaustive mapping is painful in presence of over-specification September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 9. Process Diagram Parsing Diagram Attributes such as Use format specific Parse information coordinates, dimensions, SDKs or parse XML about diagram shapes text, geometry formats Shapes & Attributes Structure Inference Precisely determine the Deal with structural Extract features for each underlying flow graph ambiguities node and edge Flow Graph Semantic Interpretation Assign process semantics to every Process node and edge in the flow graph using Supervised and unsupervised schemes Model to train such a classifier a trained classifier September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 10. Process Diagram Parsing Diagram Attributes such as Use format specific Parse information coordinates, dimensions, SDKs or parse XML about diagram shapes text, geometry formats Shapes & Attributes Structure Inference Precisely determine the Deal with structural Extract features for each underlying flow graph ambiguities node and edge Flow Graph Semantic Interpretation Assign process semantics to every Process node and edge in the flow graph using Supervised and unsupervised schemes Model to train such a classifier a trained classifier September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 11. A B A B C D C D September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 12. Uses notion of connection points created at node – line SRC SRC SRC SRC TGT and line – line intersections SRC NEU NEU C1 C2 C5 C8 A B  Assign direction to connection C3 TGT UNK C6 points SRC SRC C4 TGT C7  Starting at connection points attached to nodes, propagate C D their directions along paths in which the directions are consistent and identifies the reached nodes  Create edges if connection point at reached node has a different direction September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 13. Process Diagram Parsing Diagram Attributes such as Use format specific Parse information coordinates, dimensions, SDKs or parse XML about diagram shapes text, geometry formats Shapes & Attributes Structure Inference Precisely determine the Deal with structural Extract features for each underlying flow graph ambiguities node and edge Flow Graph Semantic Interpretation Assign process semantics to every Process node and edge in the flow graph using Supervised and unsupervised schemes Model to train such a classifier a trained classifier September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 14. Train a classifier to mimic human reasoning to decide process semantics  Features used for classification:  Relational: Indegree, Outdegree, Count of nodes contained within  Geometric: Shape name, Count of horizontal, vertical, diagonal lines  Textual: Count of cue words for every target entity September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 15. Structure {Nodes, Edges} Flow Annotated by Diagrams Inference Features Classifier {Nodes, Edges} Annotated by Features + Process Semantic Classifier establishes correspondence An expert labels all nodes & between the features and labels for edges in the input set of process semantics diagrams by their semantics September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 16. Flow Structure {Nodes, Edges} Clusterer Diagrams Inference Annotated by Features Cluster A = Semantic X {Nodes, Edges} Cluster A Annotated by Features + Process Semantic Cluster B = Semantic Y Cluster B Clusters have An expert looks at common exemplars from each semantics Classifier cluster to label process semantic of the cluster September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 17. Data Set: 185 Visio process diagrams created in real business-transformation projects  Objective: Compare accuracy of our tool iDISCOVER and a popular modeling tool (called PMT for proprietary reasons)  Method: Compare tool outputs with models created manually by human experts to measure precision & recall  Precision = |Actual ∩ Retrieved| , Recall = |Actual ∩ Retrieved| |Retrieved| |Actual| September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 18. Node 96.93 95.91 70.44 86.29 Edge 93.26 90.86 63.43 59.87 Dangling 47 (100%) 3 (14%) 56% Connector Unlinked Labels 46 (39%) 2 (3.7%) 38% Count of dangling connectors has a greater correlation with the edge recall of (ρ = −0.48) than with the edge recall of (ρ = −0.08). September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 19. Our (Overall Δ ≈30%) and (Overall Δ ≈20%) for all process semantic classes •Accuracy is low only for scarce entities like Intermediate Events and Data Objects (together are greater than that of less than 3% of the data set) is almost as good as •Better results possible with a more equitable distribution of entities work almost Size of the training data need not be huge. Classification could as well with only a third of the dataset size September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 20. Large body of work in the area of understanding line drawings and hand sketches (e.g., Futrelle, Gross, Barbu)  Focus on identifying shape geometry  Semantic interpretation follows directly from a fixed mapping between shape geometry and target semantics  Visual Language theory prescribes geometry detection with grammar rules. September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 21. More efficient modeling of textual cues  Text is the only reliable feature in highly ambiguous scenarios  Tracking spatial patterns of shapes and labels that emerge due to local styles  Identification of higher-level relations (block structures) between model entities (e.g., sub- process, loop, and fork-merge)  Extend to other diagram types September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 22. Informal process diagrams contain structural and semantic ambiguities – need to be dealt with in order to discover precise formal models  Existing capabilities are limited because:  Do not resolve structural ambiguities  Interpreting semantic based on shape name does not suffice  Standard pattern-classification techniques can be successfully employed in interpreting process semantics if the feature space is carefully modeled to mimic human reasoning  Unsupervised clustering can almost match supervised techniques in performance September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA