SlideShare a Scribd company logo
1 of 30
Semantically Driven Social Data
Aggregation Interfaces for Research 2.0
               Laurens De Vocht
                 Selver Softic
                 Martin Ebner
              Herbert Mühlburger

         http://www.semanticprofiling.net

                September 7, 2011
Agenda

 ‣Problem Statement
 ‣Social Semantic Web
 ‣Solution
 ‣Evaluation
 ‣Conclusion
                2
Problem Statement: Definitions
 Profiling
 “Inferring unobser vable information about users from
 observable information about them, that is their actions or their
 utterances.” (Zukerman and Albrecht, 2001)


 Semantic Analysis
 “A technique using semantic-based tools and ontologies in
 order to gain a deeper understanding of the information being
 stored and manipulated in an existing system” (McComb, 2004)



                                 3
Problem Statement: Research Question
Web users generate a massive
unstructured information flow




                                            ?


                         Who has scientific information
                                      relevant for me?
                          4
Problem Statement: Use Case
Connecting researchers based on shared scientific events
(conferences)
                         Scientific Profiling


                                                        Scientific
                  User Model              Event Model   Conferences
                                                        Resource


   Researchers




                               Profiler/
                               Analyzer


   Researcher
     (User)


                                  5
Social Semantic Web
                     Social Web                  Semantic Web
         Community of         (micro)blogging,
        researchers with          sharing,
           conference             tagging,
           experience           discussion
                                                   semi-structured
                                                     information
  Larger population of                                  system
   people interested in
                              (faceted) search
  scientific conferences
                                   engine




                             recommendation          clustered and
                                 engine              analyzed data




 Human process                                   Machine process
                                                       (Gruber, 2007)
                                  6
Social semantic Web
 ‣Hashtags as Identifiers
  ‣not always strong or consistent enough
  ‣properties of good hashtags formalized
  ‣helpful in assessment of valuable identifiers
                                    (Laniado and Mika, 2007)


 ‣Expert Search/Profiling with Linked Data
  ‣aggregate and analyze certain types of data
  ‣need to surpass limits of closed data sets
  ‣LOD delivers multi-purpose data
                                     (Stankovic et al., 2010)

                            7
Scope & Value of the Study

‣Bridging research areas
Human Computer-Interaction & Semantic Analysis
‣Integration
Social network data and linked open data
‣Framework driven methodology
based upon current state-of-the-art semantic tools
‣Evaluation: improved connectivity
proof-of-concept Research 2.0 application


                           8
Solution


 ‣Overview
 ‣Framework
 ‣Web Service
 ‣Client Application

                  9
Solution: Overview
Annotate Data from Social Networks


                      Community approved
                     ontologies: FOAF, SIOC


 Linked Open Data                     Applications




                              Scientific Profiling Framework



                       Connect People and Resources
                        that share Scientific Affinities
                        10
Solution: Overview
          Social                      Linked Open
                                                                    Output Format
         Networks                      Data Cloud



   Framework   Aggregate                      Interlink                    Publish


     Archived/Cached                                                   Scientific
                                      Linked Data                    Information
           Data            Annotate                       Analyse




                                         11
Solution: Overview
          Social                       Linked Open
                                                                       Output Format
         Networks                       Data Cloud



   Framework   Aggregate                       Interlink                      Publish


     Archived/Cached                                                      Scientific
                                       Linked Data                      Information
           Data            Annotate                        Analyse




                                         DBPedia                            JSON
           Twitter                       Colinda                          RDF (XML)
                                        GeoNames


               Aggregate                        Interlink                       Publish


                                          Semantic                         Scientific
          Grabeeter                                                      Profiling API
                           Annotate   Profiling Network       Analyse


                                          11
Solution: Grabeeter
= Twitter aggregation & archiving tool
(developed at TUGraz)




http://grabeeter.tugraz.at
                             12
Solution: Grabeeter
= Twitter aggregation & archiving tool
(developed at TUGraz)




http://grabeeter.tugraz.at
                             12
Solution: Framework Architecture
                                            Applications



                                                           Programming Interface


                                                    Analysis



                                                           High Level Queries


                 Extraction                       Interlinking


   SQL Queries        Triplification                        SPARQL Queries



            Grabeeter                 RDF Store




                                      13
Solution: Web Service

‣get User Profile
‣find People or Events given a User Profile
‣register a new User Profile
‣get Event Details

                     14
Solution: Web Service




                15
Solution: Web Service




                16
Solution: Web Service




                17
Solution: Web Service




                18
Solution: Web Service




                18
Evaluation


 ‣Approach
 ‣Usability
 ‣Usefulness
 ‣Discussion

               19
Evaluation: Approach


‣Test usability & usefulness
‣Web application: “Researcher Affinity Browser”
‣Using explicit evaluation questionnaire


                      20
Evaluation: Usability




                  21
Evaluation: Usefulness

 ‣Relevance
  Test users rate their search results
 ‣Satisfaction questionnaire
  Targeted questions about usefulness
  Allow comments on user interface



                      22
Evaluation: Usefulness
Relevant user percentage
                                   Number of users
                 0% (None)

             1-20% (A few)

 21-40% (Less than one half)

   41-60% (About one half)

61-80% (More than one half)

        81-99% (Almost all)

                 100% (All)

                               0       1             2   3   4



                                       23
Evaluation: Usefulness                                       Usefulness Questionnaire Results
                                   Concept Affinity

           Clear view of affinities between people

             Map & Plot combination understood

                     Deactivating filer fast enough

                       Activating filer fast enough

                           Never usability glitches

          Convention between views understood

Information display not overwhelming (confusing)

                     Relevant detailed person info

Shown details correspond with ‘real life’ activities

                   Enough relevant (new) persons

            Daily updating of information obvious

   Twitter data made more useful for researchers
                                                       1            2          3           4       5

                                                           24
Evaluation: Discussion
‣ Affinities exposed in an engaging way
‣ Positive match according to users
  Triggered by how many common entities?
  After investigation of suggested users?
‣ Reliability of person details hard to verify
‣ UI satisfaction user dependent
  ‣ What does the user expect from “Affinity Browser”?
  ‣ Test different scenarios to identify usage types?
                             25
Conclusion

‣ Framework supports social semantic-based applications
‣ Realized with current state-of-the-art technologies
‣ Interlinking with Linked Open Data Cloud enriches social network
  data
‣ Researcher Affinity Browser
  ‣ Exposes affinities between users
  ‣ User feedback affirms positively new view on social data
  ‣ Hash tags identified as conferences provide consistent links

                                 26
Future work

‣ Rank tags
 by importance, not just frequency of use

‣ Visualization
 improve viewing of links between users and entities

‣ Multiple Resources
 better reliability and more verification of data


                              27

More Related Content

Viewers also liked

Meet David - ETL / Informatica Consultant
Meet David - ETL / Informatica ConsultantMeet David - ETL / Informatica Consultant
Meet David - ETL / Informatica ConsultantDavid Hubbard
 
Etl with talend (data integeration)
Etl with talend (data integeration)Etl with talend (data integeration)
Etl with talend (data integeration)pomishra
 
Researcher Profiling based on Semantic Analysis in Social Networks
Researcher Profiling based on Semantic Analysis in Social NetworksResearcher Profiling based on Semantic Analysis in Social Networks
Researcher Profiling based on Semantic Analysis in Social NetworksLaurens De Vocht
 
A Visual Exploration Workflow as Enabler for the Exploitation of Linked Open ...
A Visual Exploration Workflow as Enabler for the Exploitation of Linked Open ...A Visual Exploration Workflow as Enabler for the Exploitation of Linked Open ...
A Visual Exploration Workflow as Enabler for the Exploitation of Linked Open ...Laurens De Vocht
 
Talend Community Use Group Bristol: Preparing your business for mastering dat...
Talend Community Use Group Bristol: Preparing your business for mastering dat...Talend Community Use Group Bristol: Preparing your business for mastering dat...
Talend Community Use Group Bristol: Preparing your business for mastering dat...KETL Limited
 
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked DataEffect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked DataLaurens De Vocht
 
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...Laurens De Vocht
 
OSLO: Open Standards for Linked Organizations
OSLO: Open Standards for Linked OrganizationsOSLO: Open Standards for Linked Organizations
OSLO: Open Standards for Linked OrganizationsLaurens De Vocht
 
Talend winter 2017 overview webinar
Talend winter 2017 overview webinarTalend winter 2017 overview webinar
Talend winter 2017 overview webinarJean-Michel Franco
 
Présentation de Talend Winter 2017
Présentation de Talend Winter 2017 Présentation de Talend Winter 2017
Présentation de Talend Winter 2017 Jean-Michel Franco
 
Simplifying Big Data ETL with Talend
Simplifying Big Data ETL with TalendSimplifying Big Data ETL with Talend
Simplifying Big Data ETL with TalendEdureka!
 
Talend Big Data Capabilities Overview
Talend Big Data Capabilities OverviewTalend Big Data Capabilities Overview
Talend Big Data Capabilities OverviewRajan Kanitkar
 
Open Source ETL using Talend Open Studio
Open Source ETL using Talend Open StudioOpen Source ETL using Talend Open Studio
Open Source ETL using Talend Open Studiosantosluis87
 
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine LearningData Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine LearningKai Wähner
 
Talend Open Studio Data Integration
Talend Open Studio Data IntegrationTalend Open Studio Data Integration
Talend Open Studio Data IntegrationRoberto Marchetto
 
ETL using Big Data Talend
ETL using Big Data Talend  ETL using Big Data Talend
ETL using Big Data Talend Edureka!
 

Viewers also liked (16)

Meet David - ETL / Informatica Consultant
Meet David - ETL / Informatica ConsultantMeet David - ETL / Informatica Consultant
Meet David - ETL / Informatica Consultant
 
Etl with talend (data integeration)
Etl with talend (data integeration)Etl with talend (data integeration)
Etl with talend (data integeration)
 
Researcher Profiling based on Semantic Analysis in Social Networks
Researcher Profiling based on Semantic Analysis in Social NetworksResearcher Profiling based on Semantic Analysis in Social Networks
Researcher Profiling based on Semantic Analysis in Social Networks
 
A Visual Exploration Workflow as Enabler for the Exploitation of Linked Open ...
A Visual Exploration Workflow as Enabler for the Exploitation of Linked Open ...A Visual Exploration Workflow as Enabler for the Exploitation of Linked Open ...
A Visual Exploration Workflow as Enabler for the Exploitation of Linked Open ...
 
Talend Community Use Group Bristol: Preparing your business for mastering dat...
Talend Community Use Group Bristol: Preparing your business for mastering dat...Talend Community Use Group Bristol: Preparing your business for mastering dat...
Talend Community Use Group Bristol: Preparing your business for mastering dat...
 
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked DataEffect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
 
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
 
OSLO: Open Standards for Linked Organizations
OSLO: Open Standards for Linked OrganizationsOSLO: Open Standards for Linked Organizations
OSLO: Open Standards for Linked Organizations
 
Talend winter 2017 overview webinar
Talend winter 2017 overview webinarTalend winter 2017 overview webinar
Talend winter 2017 overview webinar
 
Présentation de Talend Winter 2017
Présentation de Talend Winter 2017 Présentation de Talend Winter 2017
Présentation de Talend Winter 2017
 
Simplifying Big Data ETL with Talend
Simplifying Big Data ETL with TalendSimplifying Big Data ETL with Talend
Simplifying Big Data ETL with Talend
 
Talend Big Data Capabilities Overview
Talend Big Data Capabilities OverviewTalend Big Data Capabilities Overview
Talend Big Data Capabilities Overview
 
Open Source ETL using Talend Open Studio
Open Source ETL using Talend Open StudioOpen Source ETL using Talend Open Studio
Open Source ETL using Talend Open Studio
 
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine LearningData Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
 
Talend Open Studio Data Integration
Talend Open Studio Data IntegrationTalend Open Studio Data Integration
Talend Open Studio Data Integration
 
ETL using Big Data Talend
ETL using Big Data Talend  ETL using Big Data Talend
ETL using Big Data Talend
 

Recently uploaded

Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 

Recently uploaded (20)

Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 

Semantically Driven Social Data Aggregation Interfaces for Research 2.0

  • 1. Semantically Driven Social Data Aggregation Interfaces for Research 2.0 Laurens De Vocht Selver Softic Martin Ebner Herbert Mühlburger http://www.semanticprofiling.net September 7, 2011
  • 2. Agenda ‣Problem Statement ‣Social Semantic Web ‣Solution ‣Evaluation ‣Conclusion 2
  • 3. Problem Statement: Definitions Profiling “Inferring unobser vable information about users from observable information about them, that is their actions or their utterances.” (Zukerman and Albrecht, 2001) Semantic Analysis “A technique using semantic-based tools and ontologies in order to gain a deeper understanding of the information being stored and manipulated in an existing system” (McComb, 2004) 3
  • 4. Problem Statement: Research Question Web users generate a massive unstructured information flow ? Who has scientific information relevant for me? 4
  • 5. Problem Statement: Use Case Connecting researchers based on shared scientific events (conferences) Scientific Profiling Scientific User Model Event Model Conferences Resource Researchers Profiler/ Analyzer Researcher (User) 5
  • 6. Social Semantic Web Social Web Semantic Web Community of (micro)blogging, researchers with sharing, conference tagging, experience discussion semi-structured information Larger population of system people interested in (faceted) search scientific conferences engine recommendation clustered and engine analyzed data Human process Machine process (Gruber, 2007) 6
  • 7. Social semantic Web ‣Hashtags as Identifiers ‣not always strong or consistent enough ‣properties of good hashtags formalized ‣helpful in assessment of valuable identifiers (Laniado and Mika, 2007) ‣Expert Search/Profiling with Linked Data ‣aggregate and analyze certain types of data ‣need to surpass limits of closed data sets ‣LOD delivers multi-purpose data (Stankovic et al., 2010) 7
  • 8. Scope & Value of the Study ‣Bridging research areas Human Computer-Interaction & Semantic Analysis ‣Integration Social network data and linked open data ‣Framework driven methodology based upon current state-of-the-art semantic tools ‣Evaluation: improved connectivity proof-of-concept Research 2.0 application 8
  • 9. Solution ‣Overview ‣Framework ‣Web Service ‣Client Application 9
  • 10. Solution: Overview Annotate Data from Social Networks Community approved ontologies: FOAF, SIOC Linked Open Data Applications Scientific Profiling Framework Connect People and Resources that share Scientific Affinities 10
  • 11. Solution: Overview Social Linked Open Output Format Networks Data Cloud Framework Aggregate Interlink Publish Archived/Cached Scientific Linked Data Information Data Annotate Analyse 11
  • 12. Solution: Overview Social Linked Open Output Format Networks Data Cloud Framework Aggregate Interlink Publish Archived/Cached Scientific Linked Data Information Data Annotate Analyse DBPedia JSON Twitter Colinda RDF (XML) GeoNames Aggregate Interlink Publish Semantic Scientific Grabeeter Profiling API Annotate Profiling Network Analyse 11
  • 13. Solution: Grabeeter = Twitter aggregation & archiving tool (developed at TUGraz) http://grabeeter.tugraz.at 12
  • 14. Solution: Grabeeter = Twitter aggregation & archiving tool (developed at TUGraz) http://grabeeter.tugraz.at 12
  • 15. Solution: Framework Architecture Applications Programming Interface Analysis High Level Queries Extraction Interlinking SQL Queries Triplification SPARQL Queries Grabeeter RDF Store 13
  • 16. Solution: Web Service ‣get User Profile ‣find People or Events given a User Profile ‣register a new User Profile ‣get Event Details 14
  • 22. Evaluation ‣Approach ‣Usability ‣Usefulness ‣Discussion 19
  • 23. Evaluation: Approach ‣Test usability & usefulness ‣Web application: “Researcher Affinity Browser” ‣Using explicit evaluation questionnaire 20
  • 25. Evaluation: Usefulness ‣Relevance Test users rate their search results ‣Satisfaction questionnaire Targeted questions about usefulness Allow comments on user interface 22
  • 26. Evaluation: Usefulness Relevant user percentage Number of users 0% (None) 1-20% (A few) 21-40% (Less than one half) 41-60% (About one half) 61-80% (More than one half) 81-99% (Almost all) 100% (All) 0 1 2 3 4 23
  • 27. Evaluation: Usefulness Usefulness Questionnaire Results Concept Affinity Clear view of affinities between people Map & Plot combination understood Deactivating filer fast enough Activating filer fast enough Never usability glitches Convention between views understood Information display not overwhelming (confusing) Relevant detailed person info Shown details correspond with ‘real life’ activities Enough relevant (new) persons Daily updating of information obvious Twitter data made more useful for researchers 1 2 3 4 5 24
  • 28. Evaluation: Discussion ‣ Affinities exposed in an engaging way ‣ Positive match according to users Triggered by how many common entities? After investigation of suggested users? ‣ Reliability of person details hard to verify ‣ UI satisfaction user dependent ‣ What does the user expect from “Affinity Browser”? ‣ Test different scenarios to identify usage types? 25
  • 29. Conclusion ‣ Framework supports social semantic-based applications ‣ Realized with current state-of-the-art technologies ‣ Interlinking with Linked Open Data Cloud enriches social network data ‣ Researcher Affinity Browser ‣ Exposes affinities between users ‣ User feedback affirms positively new view on social data ‣ Hash tags identified as conferences provide consistent links 26
  • 30. Future work ‣ Rank tags by importance, not just frequency of use ‣ Visualization improve viewing of links between users and entities ‣ Multiple Resources better reliability and more verification of data 27