SlideShare a Scribd company logo
1 of 20
Download to read offline
FRAMEWORK FOR INTELLIGENT VIRTUAL ORGANIZATIONS (FIVO)




       Natural Language based
 Processing of Multilingual Contracts
for Virtual Organizations constitution
          Mikołaj Pastuszko, Bartosz Kryza, Renata Słota, Jacek Kitowski
       Institute of Computer Science, University of Science and Technology AGH
                                                               Kraków, POLAND
Agenda

Background of the problem

Goals and requirements of NLPN system

Architecture of NLPN system

Main processing flow in NLPN system

Technologies and tools used in NLPN system

Example of contract text analysis in NLPN system

Future development proposals for NLPN system
Problem introduction

Assumption
  Organizations
               own resources that are expected to be shared within Virtual
   Organization
  Conditions of cooperation are written down in form of the contract
   document


Problem
  Contracts
           are written in natural language (e.g. Polish)
  Automatization of the Virtual Organization management (FiVO) requires a
   formal and semantic form of the contract (ontology in OWL format)


Solution
  NLP-based  Negotiations (NLPN) System:
   Translating natural language based contracts to ontologies in OWL
   format
Concept of NLPN system
Goals and requirements

Support for multiple languages
  English and Polish as a starting point
  Easily extendable with support for another languages



Output ontology in OWL format           (FiVO requirement)
  Ontology   sturucture easily adjustable


Minimalization of human (supervisor) assistance


Flexible mapping between text phrases and ontology entities
  Human-readable     and easily editable Contract Dictionary


Modularity
  Easy   orchestration for various applications
Data flow in NLPN system
Modular architecture of NLPN system
Contract text analysis


1. Tokenization

2. Sentence Splitting

3. Morphological Analysis and POS Tagging

4. Named Entities Recognition
  ●
    Gazetteer

5. Contract Statemets Recognition
  ●
    Transducer + grammars
Technologies and tools

NLP tools
    GATE – General Architecture for Text Engineering
        Tokenizer
                                 ANNIE – A Nearly-New
        Gazetteer
                                 Information Extraction System
        OntoGazetteer
        JAPE Transducer
          JAPE grammars – Java Annotations Pattern Engine

    LanguageTool
      Sentence Splitter
        Part-of-Speech Tagger
        Disambiguator (tagger part)
        Supports 20 languages including Polish (Morfologik library)
Technologies and tools

Ontologies
    Jena Semantic Web Framework library
        Supports read and write in RDF/XML, N3 and N-Triples formats
        Provides API for OWL and RDF


Configuration files
    YAML format
    SnakeYAML library
Example: Contract text analysis

QoS Statements
Costa Rica Airlines should provide number of seats of Mercedes-Benz H6 equal to 54 and expected average velocity
greater than 60 km/h.
Security Statements
Tour Manager and Client should be able to book seats on Costa Rica Service.
Penalty Clauses
In case of violation of Acela D45 trainset sharing conditions a notification should be sent to John Smith.




Stwierdzenia QoS
Costa Rica Airlines będzie świadczyć ilość miejsc siedzących dla Mercedes-Benz H6 wynoszącą dokładnie 54 i
przewidywaną prędkość średnią ponad 60 km/h.
Stwierdzenia bezpieczeństwa
Tour Manager i Klient powinni być uprawnieni do rezerwowania miejsc poprzez Usługę Costa Rica.
Klauzule kar umownych
W przypadku niedotrzymania warunków świadczenia Acela D45 trainset powinno zostać wysłane powiadomienie do
Johna Smitha.
Tokenization
Sentence Splitting
Morphological Analysis and POS Tagging
Named Entities Recognition
Contract Statements Recognition
Contract Statements Recognition
Summary

NLPN system:
  Translates   natural language based contracts to formal and
   semantic form of ontologies
  Supports English and Polish

    Easily extendable with another languages
  Is modular

    Ease of use in various applications
  Is highly configurable

    Contract Dictionary (including its structure)
    Contract Ontology structure
    Contract Statements forms
    Configuration files for all components
  Has broad perspectives for future development →
Future development

Distributed Negotiations Environment
     Negotiations
      Console
 More
  statement
  forms
 Statistic
  approach
  algorithms
 Noise
  correction
  (typo etc.)
The End



    Thank you


mikolaj.pastuszko@gmail.com

More Related Content

Similar to CGW 2010 - NLPN

How AI can help you build better customer relationships?
How AI can help you build better customer relationships?How AI can help you build better customer relationships?
How AI can help you build better customer relationships?Knoldus Inc.
 
Crawford ubl200212
Crawford ubl200212Crawford ubl200212
Crawford ubl200212kcmani15
 
Google Cloud Platform Munich
Google Cloud Platform MunichGoogle Cloud Platform Munich
Google Cloud Platform MunichVMware Tanzu
 
Don Bailey eLearning sample
Don Bailey eLearning sampleDon Bailey eLearning sample
Don Bailey eLearning sampledonbailey
 
Crawford ubl200212
Crawford ubl200212Crawford ubl200212
Crawford ubl200212guestc93331f
 
Crawford U B L200212
Crawford U B L200212Crawford U B L200212
Crawford U B L200212gauravtest1
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212aaa111333
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212guest993a3d
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212guest993a3d
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212guest993a3d
 
Crawford U B L200212
Crawford U B L200212Crawford U B L200212
Crawford U B L200212guest993a3d
 
02 c a306-phillips_langtags
02 c a306-phillips_langtags02 c a306-phillips_langtags
02 c a306-phillips_langtagssuvo1111
 
Google Cloud Platform - Cloud-Native Roadshow Stuttgart
Google Cloud Platform - Cloud-Native Roadshow StuttgartGoogle Cloud Platform - Cloud-Native Roadshow Stuttgart
Google Cloud Platform - Cloud-Native Roadshow StuttgartVMware Tanzu
 
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...Facultad de Informática UCM
 
CSCorganization of programming languages
CSCorganization of programming languagesCSCorganization of programming languages
CSCorganization of programming languagesOluwafolakeOjo
 

Similar to CGW 2010 - NLPN (20)

How AI can help you build better customer relationships?
How AI can help you build better customer relationships?How AI can help you build better customer relationships?
How AI can help you build better customer relationships?
 
Crawford ubl200212
Crawford ubl200212Crawford ubl200212
Crawford ubl200212
 
Google Cloud Platform Munich
Google Cloud Platform MunichGoogle Cloud Platform Munich
Google Cloud Platform Munich
 
Don Bailey eLearning sample
Don Bailey eLearning sampleDon Bailey eLearning sample
Don Bailey eLearning sample
 
Crawford ubl200212
Crawford ubl200212Crawford ubl200212
Crawford ubl200212
 
Crawford U B L200212
Crawford U B L200212Crawford U B L200212
Crawford U B L200212
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212
 
Crawford U B L200212
Crawford U B L200212Crawford U B L200212
Crawford U B L200212
 
CLTL Software and Web Services
CLTL Software and Web Services CLTL Software and Web Services
CLTL Software and Web Services
 
DeepPavlov 2019
DeepPavlov 2019DeepPavlov 2019
DeepPavlov 2019
 
02 c a306-phillips_langtags
02 c a306-phillips_langtags02 c a306-phillips_langtags
02 c a306-phillips_langtags
 
Google Cloud Platform - Cloud-Native Roadshow Stuttgart
Google Cloud Platform - Cloud-Native Roadshow StuttgartGoogle Cloud Platform - Cloud-Native Roadshow Stuttgart
Google Cloud Platform - Cloud-Native Roadshow Stuttgart
 
voice browser
voice browservoice browser
voice browser
 
ACE Logo
ACE LogoACE Logo
ACE Logo
 
Tamil OCR using Tesseract OCR Engine
Tamil OCR using Tesseract OCR EngineTamil OCR using Tesseract OCR Engine
Tamil OCR using Tesseract OCR Engine
 
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...
 
CSCorganization of programming languages
CSCorganization of programming languagesCSCorganization of programming languages
CSCorganization of programming languages
 

Recently uploaded

Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 

Recently uploaded (20)

Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 

CGW 2010 - NLPN

  • 1. FRAMEWORK FOR INTELLIGENT VIRTUAL ORGANIZATIONS (FIVO) Natural Language based Processing of Multilingual Contracts for Virtual Organizations constitution Mikołaj Pastuszko, Bartosz Kryza, Renata Słota, Jacek Kitowski Institute of Computer Science, University of Science and Technology AGH Kraków, POLAND
  • 2. Agenda Background of the problem Goals and requirements of NLPN system Architecture of NLPN system Main processing flow in NLPN system Technologies and tools used in NLPN system Example of contract text analysis in NLPN system Future development proposals for NLPN system
  • 3. Problem introduction Assumption  Organizations own resources that are expected to be shared within Virtual Organization  Conditions of cooperation are written down in form of the contract document Problem  Contracts are written in natural language (e.g. Polish)  Automatization of the Virtual Organization management (FiVO) requires a formal and semantic form of the contract (ontology in OWL format) Solution  NLP-based Negotiations (NLPN) System: Translating natural language based contracts to ontologies in OWL format
  • 5. Goals and requirements Support for multiple languages  English and Polish as a starting point  Easily extendable with support for another languages Output ontology in OWL format (FiVO requirement)  Ontology sturucture easily adjustable Minimalization of human (supervisor) assistance Flexible mapping between text phrases and ontology entities  Human-readable and easily editable Contract Dictionary Modularity  Easy orchestration for various applications
  • 6. Data flow in NLPN system
  • 8. Contract text analysis 1. Tokenization 2. Sentence Splitting 3. Morphological Analysis and POS Tagging 4. Named Entities Recognition ● Gazetteer 5. Contract Statemets Recognition ● Transducer + grammars
  • 9. Technologies and tools NLP tools  GATE – General Architecture for Text Engineering  Tokenizer ANNIE – A Nearly-New  Gazetteer Information Extraction System  OntoGazetteer  JAPE Transducer  JAPE grammars – Java Annotations Pattern Engine  LanguageTool  Sentence Splitter  Part-of-Speech Tagger  Disambiguator (tagger part)  Supports 20 languages including Polish (Morfologik library)
  • 10. Technologies and tools Ontologies  Jena Semantic Web Framework library  Supports read and write in RDF/XML, N3 and N-Triples formats  Provides API for OWL and RDF Configuration files  YAML format  SnakeYAML library
  • 11. Example: Contract text analysis QoS Statements Costa Rica Airlines should provide number of seats of Mercedes-Benz H6 equal to 54 and expected average velocity greater than 60 km/h. Security Statements Tour Manager and Client should be able to book seats on Costa Rica Service. Penalty Clauses In case of violation of Acela D45 trainset sharing conditions a notification should be sent to John Smith. Stwierdzenia QoS Costa Rica Airlines będzie świadczyć ilość miejsc siedzących dla Mercedes-Benz H6 wynoszącą dokładnie 54 i przewidywaną prędkość średnią ponad 60 km/h. Stwierdzenia bezpieczeństwa Tour Manager i Klient powinni być uprawnieni do rezerwowania miejsc poprzez Usługę Costa Rica. Klauzule kar umownych W przypadku niedotrzymania warunków świadczenia Acela D45 trainset powinno zostać wysłane powiadomienie do Johna Smitha.
  • 18. Summary NLPN system:  Translates natural language based contracts to formal and semantic form of ontologies  Supports English and Polish  Easily extendable with another languages  Is modular  Ease of use in various applications  Is highly configurable  Contract Dictionary (including its structure)  Contract Ontology structure  Contract Statements forms  Configuration files for all components  Has broad perspectives for future development →
  • 19. Future development Distributed Negotiations Environment  Negotiations Console  More statement forms  Statistic approach algorithms  Noise correction (typo etc.)
  • 20. The End Thank you mikolaj.pastuszko@gmail.com