Personal World Absorber
Global Search + Indexing Concept

Vladimir Kryukov, Product Marketing Director
Vladimir Vodolazkiy, Ph.D., Software Architect

(C) ROSA - September, 2012

PWA Mission:
Extract information from information feeds: text, video, photo and other resources
Absorb only information relevant to the user’s interests
Store all extracted data in a personal, user-friendly information warehouse
Search based on the concept of an object
Comfortable representation and detailing of objects
Integration with third-party data processing software

Personal infosphere
Continuous data bank, shared by all of the user’s gadgets, available at home, at work, and in the car on the road
Facts and objects are grouped according to personal preferences
All relations between objects depend on time, space, personal alignment and the inspiration of the moment

Main tasks to be solved for the PWA prototype

Information aggregation
Indexing
Search engine
Representation engine
Personal cloud storage

Objects in PWA:
Texts - facts, fiction, notes, memoirs
Music - inspiration, relaxation
Speech notes - memoirs, notes, evidence
Images - facts, fiction, notes, memoirs
Video - all of the above
Data - calendars, lists, other info

PWA data aggregator subsystem

[Diagram components: Robots (search agents), Data Warehouse, Analyzer, Interpreter. Labelled flows: Actualisation, Search results, Request, Response]

Cluster model in PWA

Objects can be united into clusters
Clusters can be united into metaclusters, and so on

[Diagram: example cluster "My cat" with linked objects and clusters: Cat food purchase (06/15 - Whiskas Chicken 3 pcs); GPS necklace (Latitude, Longitude, Timestamp, Direction, Speed); Event list; Cat’s story (three items); Vaccination calendar; Photo (three items); Cat on the shelf; Movie; My cat on u-tube]

Object attributes:

Time attributes
- Date of creation
- Time interval (intervals) covered

Space attributes
- Where it was created
- Space position (positions) covered
- Movement alignment: speed, direction

Tags with description
- list of tags with a textual description of the object

Technical attributes
- file size
- underlying file/database access path
- codec to play
- charset coding
- etc.

Personal attitude
- Fact of the objective reality?
- Positive, negative relation
- relax, inspiration, information, memoir

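The five attribute groups above could be held in one record per object. The sketch below is only an illustration: every class name, field name, unit and value range is an assumption, since the deck names the attribute groups but gives no schema.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import Optional


@dataclass
class TimeAttributes:
    created: datetime
    # Time intervals the object covers, as (start, end) pairs.
    intervals: list = field(default_factory=list)


@dataclass
class SpaceAttributes:
    created_at: Optional[tuple] = None              # assumed (latitude, longitude)
    positions: list = field(default_factory=list)   # positions covered
    speed: Optional[float] = None                   # assumed unit: km/h
    direction: Optional[float] = None               # assumed unit: degrees


@dataclass
class TechnicalAttributes:
    file_size: int
    access_path: str
    codec: Optional[str] = None
    charset: Optional[str] = None


@dataclass
class PersonalAttitude:
    is_fact: Optional[bool] = None   # "Fact of the objective reality?"
    relation: float = 0.0            # assumed scale: -1.0 negative .. +1.0 positive
    purpose: Optional[str] = None    # relax, inspiration, information, memoir


@dataclass
class PwaObject:
    tags: dict                       # tag -> textual description
    time: TimeAttributes
    space: SpaceAttributes
    technical: TechnicalAttributes
    attitude: PersonalAttitude


# Hypothetical example object: a photo of the cat.
photo = PwaObject(
    tags={"cat": "My cat on the shelf"},
    time=TimeAttributes(created=datetime(2012, 6, 15, 18, 30)),
    space=SpaceAttributes(created_at=(55.75, 37.62)),
    technical=TechnicalAttributes(file_size=2_400_000, access_path="/photos/cat.jpg"),
    attitude=PersonalAttitude(is_fact=True, relation=1.0, purpose="memoir"),
)
```

Keeping the personal attitude separate from the technical attributes lets the same file carry a different attitude for each owner.
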
Object attributes:
Should be applicable to any kind of media
Should provide fast and reliable identification
Should be presented in human-readable form
Should take into account the uncertainty of the choice
Should be used in conjunction with the user’s personal profile

Frame of text-based tags with fuzzy weights
Conceptual thesaurus based on WordNet
Adaptive profile of the user’s interests

Tag set extraction

Common attribute extraction
Semantic meaning extraction
Formal/technical attributes

Resulting document description: a vector in the space of Base Terms

[Diagram: three levels of terms: Document Keywords/Nouns, Intermediate Terms, Base Terms]

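A minimal sketch of how keywords might be folded down to a vector in the Base Term space. The two mapping tables are hand-written, hypothetical stand-ins for the thesaurus, and the weights and the final normalisation are assumptions, not the authors' method.

```python
from collections import defaultdict

# Hypothetical thesaurus fragments (assumed for illustration only).
KEYWORD_TO_INTERMEDIATE = {
    "whiskas": {"cat food": 1.0},
    "kitten": {"cat": 1.0},
    "vaccination": {"veterinary care": 1.0},
}
INTERMEDIATE_TO_BASE = {
    "cat food": {"pet": 0.6, "shopping": 0.4},
    "cat": {"pet": 1.0},
    "veterinary care": {"pet": 0.5, "health": 0.5},
}


def propagate(weights, mapping):
    """Push term weights one level up through a weighted mapping."""
    out = defaultdict(float)
    for term, weight in weights.items():
        for target, share in mapping.get(term, {}).items():
            out[target] += weight * share
    return dict(out)


def describe(keywords):
    """Return a normalised vector over base terms for a list of keywords."""
    counts = defaultdict(float)
    for keyword in keywords:
        counts[keyword.lower()] += 1.0
    intermediate = propagate(counts, KEYWORD_TO_INTERMEDIATE)
    base = propagate(intermediate, INTERMEDIATE_TO_BASE)
    total = sum(base.values())
    return {term: w / total for term, w in base.items()} if total else {}


vector = describe(["Whiskas", "kitten", "vaccination"])
```

Here "pet" receives the largest weight because all three keywords contribute to it; keywords missing from the thesaurus are simply ignored.
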
Multiclassification

Each object can belong to several clusters at once
The content of the tag frame depends on the owner of the object

Example: Two girls read poems at the festival in kindergarten...

Object: Audio File MP3 (Mary, Helen)

Anna’s PWA (Ann - kindergartener), New Year Evening
  Where      Kindergarten #5
  When       26 Dec 2011
  Duration   20:00
  Who        Mary, Helen

Steve’s PWA (Steve - father of Mary)
  Where      Kindergarten #5
  When       26 Dec 2011
  Timing     00:00 - 12:30, 15:40 - 20:00
  Who        Mary

John’s PWA (John - father of Helen)
  Where      Kindergarten #5
  When       26 Dec 2011
  Timing     12:30 - 15:40
  Who        Helen

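The kindergarten example can be sketched as one object with a separate tag frame per owner. The storage layout, the file name and the reading of the timings as minutes:seconds within the recording are assumptions made for illustration.

```python
from datetime import timedelta


def mmss(text):
    """Parse 'MM:SS' (assumed meaning of the slide's timings) into a timedelta."""
    minutes, seconds = text.split(":")
    return timedelta(minutes=int(minutes), seconds=int(seconds))


AUDIO = "poems.mp3"  # hypothetical identifier of the shared audio file

# One frame per (object, owner): same file, different personal view.
FRAMES = {
    (AUDIO, "Anna"): {"where": "Kindergarten #5", "when": "26 Dec 2011",
                      "duration": "20:00", "who": ["Mary", "Helen"]},
    (AUDIO, "Steve"): {"where": "Kindergarten #5", "when": "26 Dec 2011",
                       "timing": [("00:00", "12:30"), ("15:40", "20:00")],
                       "who": ["Mary"]},
    (AUDIO, "John"): {"where": "Kindergarten #5", "when": "26 Dec 2011",
                      "timing": [("12:30", "15:40")],
                      "who": ["Helen"]},
}


def frame_for(obj, owner):
    """Return the tag frame that a given owner attaches to an object."""
    return FRAMES[(obj, owner)]


def covered(frame):
    """Total length of the fragments an owner's frame refers to."""
    return sum((mmss(end) - mmss(start) for start, end in frame.get("timing", [])),
               timedelta())


steve = covered(frame_for(AUDIO, "Steve"))
john = covered(frame_for(AUDIO, "John"))
whole = mmss(frame_for(AUDIO, "Anna")["duration"])
```

With the values from the slide, the fragments tagged by the two fathers add up to the full duration recorded in Anna's frame.
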
Hierarchy approach
PWA includes different kinds of objects
PWA should provide an estimate of similarity between different objects
Similar objects can be combined into clusters
Any object can belong to several clusters
Clusters can be united in the same way as the underlying objects
The hierarchy of objects/clusters should be self-organised

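One possible reading of these requirements, as a sketch: a simple threshold ("leader") clustering that lets an object join every cluster it resembles, and is then reused on the cluster centroids to form metaclusters. The algorithm, the cosine measure used here, the threshold value and the sample vectors are all assumptions; the deck does not name a clustering method.

```python
import math


def cosine(a, b):
    """Cosine similarity of two sparse tag vectors (dict: tag -> weight)."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def centroid(vectors):
    """Mean of several sparse vectors."""
    total = {}
    for vec in vectors:
        for tag, w in vec.items():
            total[tag] = total.get(tag, 0.0) + w
    return {tag: w / len(vectors) for tag, w in total.items()}


def cluster(items, threshold):
    """Overlapping clustering: an item joins every sufficiently similar cluster."""
    clusters = []
    for name, vec in items.items():
        joined = False
        for members in clusters:
            if cosine(vec, centroid([items[m] for m in members])) >= threshold:
                members.append(name)
                joined = True
        if not joined:
            clusters.append([name])  # start a new cluster with this item
    return clusters


# Hypothetical objects described by tag vectors.
objects = {
    "photo": {"cat": 1.0},
    "checkup": {"health": 1.0},
    "vaccination": {"cat": 0.7, "health": 0.7},
    "invoice": {"shopping": 1.0},
}

level1 = cluster(objects, threshold=0.6)

# Clusters are united the same way as objects: cluster their centroids.
centroids = {f"c{i}": centroid([objects[m] for m in members])
             for i, members in enumerate(level1)}
level2 = cluster(centroids, threshold=0.6)
```

The result depends on the order in which objects arrive, which is a known weakness of this kind of one-pass clustering.
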
Logical architecture of the indexing subsystem

[Diagram: layered architecture, listed from top to bottom, with the responsible role on the right]
End users: representation parameter tuning; appearance, set of content, time schedule; urgent message generation criteria; informational channels (feeds)
Knowledge engineer: warehouse for information extraction algorithms; data structures for task-oriented processing
Software engineer: common information extraction algorithms; thesaurus; intermediate processing layer, which converts primary databases into views adapted to the user; information gathering agents; primary database; «manual» data input
Software and OS: system software and servers (NNTP, WWW, SMTP, SQL etc.); interface to the external world (sensors, OCR, video, keyboard)

Empirical-based learning system

[Diagram components: Initial learning test corpus; Knowledge-base generation; Initial knowledge base; End-User System with Knowledge-base updater, Natural Language Processor and Actual knowledge base; Input text data; Processed data]

Information extraction approach

Input text:
  4Apr Dallas - Early last evening, a tornado swept through an area northwest of Dallas, causing extensive damage...

Syntax framing and markup:
  Early/adv last/adj evening/noun/time, a/det tornado/noun/weather swept/verb ...

Sentence analysis:
  Early last evening     adverb:time
  a tornado              noun group/subject
  swept                  verb group
  through an area        prep phrase:location
  northwest of Dallas    adverb:location
  causing                verb group
  extensive damage       noun group/object

Extraction:
  Sub phrase                  Data extracted
  tornado swept               event: tornado
  swept through an area       location: «area»
  area northwest of Dallas    location: «northwest of Dallas»
  causing extensive damage    effect: «damage»

Object merging:
  Early last evening, a tornado swept through an area northwest of Dallas...
  Witness confirmed that the twister...

Template creation:
  Event:  Tornado
  Date:   04/03/1997
  Time:   19:15

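The extraction step can be sketched as a few rules applied to the output of sentence analysis. The chunk list is copied from the slide; the rules themselves and the `head` helper are simplified assumptions and cover only this one sentence.

```python
# (phrase, role) pairs as produced by the sentence analysis stage on the slide.
CHUNKS = [
    ("Early last evening", "adverb:time"),
    ("a tornado", "noun group/subject"),
    ("swept", "verb group"),
    ("through an area", "prep phrase:location"),
    ("northwest of Dallas", "adverb:location"),
    ("causing", "verb group"),
    ("extensive damage", "noun group/object"),
]


def head(phrase):
    """Crude head-word guess: the last word of the phrase (an assumption)."""
    return phrase.split()[-1].lower()


def extract(chunks):
    """Fill a flat fact record from role-labelled chunks using simple rules."""
    facts = {"event": None, "time": None, "location": [], "effect": None}
    for i, (phrase, role) in enumerate(chunks):
        if role == "noun group/subject":
            facts["event"] = head(phrase)
        elif role == "adverb:time":
            facts["time"] = phrase
        elif role.endswith(":location"):
            facts["location"].append(phrase)
        elif (role == "noun group/object" and i > 0
              and chunks[i - 1][0] == "causing"):
            # The object of "causing" is treated as the effect of the event.
            facts["effect"] = head(phrase)
    return facts


facts = extract(CHUNKS)
```

The date and time in the final template are not derivable from these chunks alone; they would come from the dateline and the merging stage, which this sketch leaves out.
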
Search in the tag space
I am interested in Apple software, but is Apple interested in mine?
Asymmetric quasimetric to reflect personal preferences and relationships to objects
Cosine measure and Tanimoto-based equation to evaluate the similarity of two or more objects

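A short sketch of the two symmetric measures named above, plus one direction-dependent score. Cosine and the continuous Tanimoto coefficient are standard formulas; `coverage` is only an illustrative asymmetric score, not the authors' quasimetric, which the deck does not define. The sample vectors are hypothetical.

```python
import math


def dot(a, b):
    """Dot product of two sparse tag vectors (dict: tag -> weight)."""
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def cosine(a, b):
    """cos(a, b) = a.b / (|a| |b|)"""
    na, nb = math.sqrt(dot(a, a)), math.sqrt(dot(b, b))
    return dot(a, b) / (na * nb) if na and nb else 0.0


def tanimoto(a, b):
    """T(a, b) = a.b / (|a|^2 + |b|^2 - a.b)"""
    ab = dot(a, b)
    denom = dot(a, a) + dot(b, b) - ab
    return ab / denom if denom else 0.0


def coverage(a, b):
    """Share of a's total tag weight that b also carries; coverage(a, b) != coverage(b, a)."""
    total = sum(a.values())
    shared = sum(min(w, b.get(t, 0.0)) for t, w in a.items())
    return shared / total if total else 0.0


# Hypothetical tag vectors for the "Apple software" example.
me = {"apple": 1.0, "software": 1.0}
apple = {"apple": 1.0}

symmetric = (cosine(me, apple), tanimoto(me, apple))
me_to_apple = coverage(me, apple)
apple_to_me = coverage(apple, me)
```

Cosine and Tanimoto give the same value in both directions, while the coverage score differs depending on who is looking at whom, which is the effect the slide's question points at.
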
Representation
Various tasks require different representations
Both visual and audio output are required

[Diagram: search results are presented through three components]
Time Frame: document/file access in retrospective time order
Infosphere: cluster-based view of objects and relationships
Herald: audio output for the search results summary

Herald - text-to-speech synthesis system
Pro:
  Does not distract the user from the main activity
  Effectively utilizes the user’s «input channels»
Contra:
  Audio carries up to 10 times less information than visual output
  Requires special aggregation procedures to be developed

Herald - based on Flite library

The same quality as Festival due to static voices
No Scheme, no interpreter
Small, fast, portable run-time synthesizer
Fast synthesis start
Thread safe
Ideal for embedded systems

[Diagram components: Festival (open source C++ framework, Scheme-based internal scripting); FestVox (open source voice preparation subsystem); Flite (light-weight pure C version with precompiled voices for the runtime library); GStreamer (not yet)]

Summary
PWA technology development is in progress today
ROSA is ready to collaborate with OEMs to enable next-generation devices with intelligent data organisation and search
Developers are welcome to join the ROSA team in this project

Contacts: vladimir.kryukov@rosalab.ru, vladimir.vodolazkiy@rosalab.ru

Personal World Absorber - a tool to filter information garbage and boost user comfort.

  • 1. Personal World Absorber: Global Search + Indexing conception. Vladimir Kryukov, Product Marketing Director; Vladimir Vodolazkiy, Ph.D., Software Architect. (C) ROSA - September, 2012
  • 2. PWA Mission: Extract information from information feeds (textual, video, photo resources and so on). Absorb only information relevant to the user’s interests. Store all extracted data in a personal, user-friendly information warehouse. Search based on the object’s concept. Comfortable object representation and detailing. Integration with third-party data processing software.
  • 3. Personal infosphere: A continuous data bank shared by all of the user’s gadgets, available at home, at work, and in the car on the road. Facts and objects are grouped according to personal preferences. All relations between objects depend on time, space, personal alignment, and the inspiration of the moment.
  • 4. Main tasks to be solved for the PWA prototype: information aggregation; indexing; search engine; representation engine; personal cloud storage.
  • 5. Objects in PWA: Texts (facts, fiction, notes, memoirs); Music (inspiration, relaxation); Speech notes (memoirs, notes, evidence); Images (facts, fiction, notes, memoirs); Video (all of the above); Data (calendars, lists, other information).
  • 6. PWA data aggregator subsystem (diagram labels): Robots (search agents); Actualisation; Search results; Data Warehouse; Analyzer; Interpreter; Request; Response.
  • 7. Cluster model in PWA: Objects can be united into clusters. Clusters can be united into metaclusters, and so on. Example (diagram labels): My cat; Cat’s story (three clusters); GPS necklace (Latitude, Longitude, Timestamp, Direction, Speed); Cat food purchase (06/15 - Whiskas Chicken, 3 pcs); Event list; Vaccination calendar; Photo (three items); Movie; My cat on u-tube; Cat on the shelf.
  • 8. Object attributes: Time attributes (date of creation; time interval or intervals covered). Space attributes (where it was created; space position or positions covered; movement alignment: speed, direction). Tags with description (list of the tags with the textual description of the object). Technical attributes (file size; codec to play; charset coding; underlying file/database access path; etc.). Personal attitude (fact of the objective reality?; positive or negative relation; relax, inspiration, information, memoir).
  • 9. Object attributes should: be applicable to any kind of media; provide fast and reliable identification; be presented in human-readable form; take into account the uncertainty of the choice; be used in conjunction with the user’s personal profile. Frame of text-based tags with fuzzy weights; conceptual thesaurus based on WordNet; adaptive profile of the user’s interests.
  • 10. Tag set extraction (diagram labels): Document; Common attribute extraction; Formal/technical attributes; Semantic meaning extraction; Keywords/Nouns; Intermediate Terms; Base Terms. Resulting document description: a vector in the space of Base Terms.
  • 11. Multiclassification: Each object can belong to several clusters at once. The content of the tag frame depends on the owner of the object. Example: Two girls read poems at the festival in kindergarten... (Audio file, MP3; speakers: Mary, Helen). Anna’s PWA (Ann - kindergartener): Where: Kindergarten #5; When: 26 Dec 2011, New Year Evening; Duration: 20:00; Who: Mary, Helen. Steve’s PWA (Steve - father of Mary): Where: Kindergarten #5; When: 26 Dec 2011; Timing: 00:00 - 12:30, 15:40 - 20:00; Who: Mary. John’s PWA (John - father of Helen): Where: Kindergarten #5; When: 26 Dec 2011; Timing: 12:30 - 15:40; Who: Helen.
  • 12. Hierarchy approach: PWA includes different kinds of objects. PWA should provide an estimate of similarity between different objects. Similar objects can be combined into clusters. Any object can belong to several clusters. Clusters can be united in the same way as the underlying objects. The hierarchy of objects and clusters should be self-organised.
  • 13. Logical architecture of the indexing subsystem (diagram labels, top to bottom): Representation parameters tuning; Appearance, set of content, time schedule; Urgent message generation criteria; Informational channels (feeds); Warehouse for information extraction algorithms; Data structures for task-oriented processing; Common information extraction algorithms; Intermediate processing layer, which converts primary databases into user’s adopted views; Thesaurus; «Manual» data input; Information gathering agents; Primary database; Interface to external world; Sensors, OCR, Video, Keyboard; Software and OS; System software, servers (NNTP, WWW, SMTP, SQL etc.). Roles: End users; Knowledge engineer; Software engineer.
  • 14. Empirical-based learning system (diagram labels): Initial learning test corpus; Input text data; End-User System; Knowledge-base generation; Natural Language Processor; Knowledge-base updater; Initial knowledge base; Actual knowledge base; Processed data.
  • 15. Information extraction approach. Input: "4Apr Dallas - Early last evening, a tornado swept through an area northwest of Dallas, causing extensive damage..." Sentence analysis and markup: Early/adv last/adj evening/noun/time, a/det tornado/noun/weather swept/verb ... Syntax framing: "Early last evening" (adverb: time); "a tornado" (noun group/subject); "swept" (verb group); "through an area" (prep phrase: location); "northwest of Dallas" (adverb: location); "causing" (verb group); "extensive damage" (noun group/object). Extraction (sub phrase, data extracted): "tornado swept" (event: tornado); "tornado swept through an area" (location: «area»); "area northwest of Dallas" (location: «northwest of Dallas»); "causing extensive damage" (effect: «damage»). Object merging: "Early last evening, a tornado swept through an area northwest of Dallas... Witnesses confirmed that the twister..." Template creation: Event: Tornado; Date: 04/03/1997; Time: 19:15.
  • 16. Search in the tag space: "I am interested in Apple software, but is Apple interested in mine?" An asymmetric quasimetric reflects personal preferences and relationships to objects. The cosine measure and a Tanimoto-based equation evaluate the similarity of two or more objects.
  • 17. Representation: Various tasks require different representations. Both visual and audio output are required. Search results are presented through: Time Frame (document/file access in time-retrospective order); Infosphere (cluster-based view of objects and relationships); Herald (audio output for the search results summary).
  • 18. Herald, a text-to-speech synthesis system. Pro: does not distract the user from the main activity; effectively utilizes the user’s «input channels». Contra: audio carries up to 10 times less information than visual output; requires special aggregation procedures to be developed.
  • 19. Herald is based on the Flite library: the same quality as Festival due to static voices; no Scheme, no interpreter; small, fast, portable run-time synthesizer; fast synthesis start; thread safe; ideal for embedded systems; GStreamer - not yet. Diagram labels: Festival (open source C++ framework; Scheme-based internal scripting subsystem); FestVox (open source voice preparation); Flite (light-weight pure C version with precompiled voices for runtime); GStreamer library.
  • 20. Summary: PWA technology development is in progress today. ROSA is ready to collaborate with OEMs to enable next-generation devices with intelligent data organisation and search. Developers are welcome to join the ROSA team in this project. Contacts: vladimir.kryukov@rosalab.ru, vladimir.vodolazkiy@rosalab.ru
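
Slide 16 names the cosine measure, a Tanimoto-based equation, and an asymmetric quasimetric, but the deck gives no formulas. The following is a minimal Python sketch under stated assumptions, not the authors' implementation. It assumes each tag frame (the "vector in the space of Base Terms" from slide 10) is stored as a dictionary mapping a tag to a non-negative fuzzy weight. The function names and the sample tags are hypothetical. `directed_distance` is one simple quasimetric chosen for illustration; the slides do not say which one PWA uses.

```python
import math

def dot(a, b):
    """Inner product of two sparse tag vectors (dict: tag -> weight)."""
    return sum(w * b.get(tag, 0.0) for tag, w in a.items())

def cosine(a, b):
    """Cosine measure: symmetric, 1.0 for vectors pointing the same way."""
    norm_a, norm_b = math.sqrt(dot(a, a)), math.sqrt(dot(b, b))
    return dot(a, b) / (norm_a * norm_b) if norm_a and norm_b else 0.0

def tanimoto(a, b):
    """Tanimoto coefficient for real-valued vectors: a.b / (|a|^2 + |b|^2 - a.b)."""
    ab = dot(a, b)
    denom = dot(a, a) + dot(b, b) - ab
    return ab / denom if denom else 0.0

def directed_distance(a, b):
    """Illustrative quasimetric (an assumption): total weight in `a` not covered by `b`.

    It is zero from a vector to itself and obeys the triangle inequality,
    but directed_distance(a, b) generally differs from directed_distance(b, a).
    Assumes all weights are non-negative.
    """
    return sum(max(w - b.get(tag, 0.0), 0.0) for tag, w in a.items())

# Hypothetical tag frames echoing the slide's Apple example.
me = {"apple": 1.0, "software": 1.0}
apple = {"apple": 1.0}

similarity = cosine(me, apple)          # symmetric
overlap = tanimoto(me, apple)           # symmetric
me_to_apple = directed_distance(me, apple)
apple_to_me = directed_distance(apple, me)
```

In this example the two symmetric measures return the same value regardless of argument order, while `me_to_apple` and `apple_to_me` differ, which is the kind of asymmetry the slide's question about Apple points at.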
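
Slides 7, 11 and 12 state that an object can belong to several clusters at once and that clusters can be united into metaclusters in the same way as objects. One way to picture this is a membership index in which a member is either an object identifier or the name of another cluster. The sketch below is an assumption made for illustration only: the class `ClusterIndex`, its methods, and the sample identifiers are hypothetical and do not come from the deck.

```python
from collections import defaultdict

class ClusterIndex:
    """Maps a cluster name to its members (object ids or nested cluster names)."""

    def __init__(self):
        self._members = defaultdict(set)

    def add(self, cluster, member):
        """Put an object, or another cluster, into `cluster`."""
        self._members[cluster].add(member)

    def clusters_of(self, member):
        """Clusters that directly contain `member`."""
        return {c for c, ms in self._members.items() if member in ms}

    def leaves(self, cluster, _seen=None):
        """All objects reachable from `cluster`, descending into nested clusters."""
        seen = set() if _seen is None else _seen
        if cluster in seen:  # guard against cyclic nesting
            return set()
        seen.add(cluster)
        result = set()
        for member in self._members.get(cluster, ()):
            if member in self._members:   # member is itself a cluster
                result |= self.leaves(member, seen)
            else:                          # member is a plain object
                result.add(member)
        return result

# Hypothetical data loosely following the cat example on slide 7.
index = ClusterIndex()
index.add("Cat's story", "photo_on_shelf.jpg")
index.add("Photos", "photo_on_shelf.jpg")        # same object, second cluster
index.add("Purchases", "cat_food_receipt")
index.add("My cat", "Cat's story")               # metacluster of clusters
index.add("My cat", "Purchases")

photo_clusters = index.clusters_of("photo_on_shelf.jpg")
my_cat_objects = index.leaves("My cat")
```

Storing nested clusters and plain objects in the same member sets mirrors the statement on slide 12 that clusters are united the same way as the underlying objects.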