SiocLog: Providing IRC Discussion Logs as Linked Data  Tuukka Hastrup 1 , Uldis Bojars 2  and  John G. Breslin 2, 3 1  Uni...
Motivation <ul><li>IRC conversations are quite disconnected from the Web and even from other IRC channels and networks </l...
Motivation (2) <ul><li>SIOC provides a framework for linking social media contributions to other content and Linked Data r...
Background <ul><li>We will begin by introducing the various areas relevant to this system: </li></ul><ul><ul><li>IRC </li>...
Internet Relay Chat (IRC) <ul><li>Instant messaging / internet chat is a major form of social interaction online </li></ul...
Linked Data <ul><li>Building a “Web of Data” to enhance the current Web </li></ul><ul><li>Exposing, sharing and connecting...
Semantically-Interlinked Online Communities (SIOC) <ul><li>An effort from DERI, NUI Galway to discover how we can create /...
 
Some of the SIOC core ontology classes and properties
Some examples of where SIOC is already use (about 50 implementations / applications)
Web ID <ul><li>A Web ID is a web address that identifies a person as a Linked Data item </li></ul><ul><li>A Web ID should ...
Design
Mapping IRC identifiers to URIs on the Web <ul><li>irc://freenode </li></ul><ul><li>(IRC Network) </li></ul><ul><li>irc://...
Some of the internal and external links
Browsing the Linked Data
Creating a link between a user account on IRC and a personal profile <ul><li>Claiming a Web ID creates a link [black] betw...
Web IDs in SiocLog <ul><li>A Web ID can be claimed using mttlbot </li></ul><ul><li>Can claim using standard IRC services <...
Implementation <ul><li>2000 lines of Python source code </li></ul><ul><li>1000 lines of Zope/TAL HTML templates </li></ul>...
Implementation (2) <ul><li>IRC interface: </li></ul><ul><ul><li>Discussion logger / persona monitor on Twisted </li></ul><...
Finding the names of friends of an IRC persona with SPARQL <ul><li>semwebquery –sparql &quot;SELECT ?name WHERE { </li></u...
Validation <ul><li>291 chat personas on five channels </li></ul><ul><li>22,418 chat messages </li></ul><ul><li>51 chat per...
Future work <ul><li>Extend to instant messaging and private messaging </li></ul><ul><li>Study of IRC communities where use...
Acknowledgements <ul><li>We would like to thank Science Foundation Ireland for their support under grant  SFI/08/CE/I1380 ...
Summary <ul><li>IRC conversations are quite disconnected from the Web and even from other IRC channels and networks </li><...
Upcoming SlideShare
Loading in...5
×

SiocLog: Providing IRC Discussion Logs as Linked Data

21,710

Published on

Social Data on the Web Workshop at the International Semantic Web Conference / Washington, DC / 26th October 2009

Published in: Technology, Education
1 Comment
2 Likes
Statistics
Notes
No Downloads
Views
Total Views
21,710
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
15
Comments
1
Likes
2
Embeds 0
No embeds

No notes for slide

SiocLog: Providing IRC Discussion Logs as Linked Data

  1. 1. SiocLog: Providing IRC Discussion Logs as Linked Data Tuukka Hastrup 1 , Uldis Bojars 2 and John G. Breslin 2, 3 1 University of Jyväskylä, Finland 2 DERI, NUI Galway, Ireland 3 School of Engineering and Informatics, NUI Galway, Ireland
  2. 2. Motivation <ul><li>IRC conversations are quite disconnected from the Web and even from other IRC channels and networks </li></ul><ul><li>Often there is valuable and needed information in an IRC chat that cannot be linked to people, topics or events, or in general referenced from elsewhere </li></ul><ul><li>This may be useful to people who do not use IRC, by those on other networks, or simply by people who leave and rejoin a channel </li></ul>
  3. 3. Motivation (2) <ul><li>SIOC provides a framework for linking social media contributions to other content and Linked Data resources, and IRC can become part of that framework </li></ul><ul><li>We also need mechanisms to link the IRC contributions to the people who made them, hence the use of Web ID </li></ul>
  4. 4. Background <ul><li>We will begin by introducing the various areas relevant to this system: </li></ul><ul><ul><li>IRC </li></ul></ul><ul><ul><li>Linked Data </li></ul></ul><ul><ul><li>SIOC </li></ul></ul><ul><ul><li>Web ID </li></ul></ul>
  5. 5. Internet Relay Chat (IRC) <ul><li>Instant messaging / internet chat is a major form of social interaction online </li></ul><ul><li>It is often disconnected from the Web: </li></ul><ul><ul><li>Due to the different protocols involved </li></ul></ul><ul><ul><li>Due to its real-time nature / lack of persistent storage </li></ul></ul><ul><li>IRC was one of the earliest chat systems </li></ul><ul><li>It has an important role amongst open-source communities, web communities, and even geeks! </li></ul><ul><ul><li>Hundreds of thousands of users online at any time </li></ul></ul>
  6. 6. Linked Data <ul><li>Building a “Web of Data” to enhance the current Web </li></ul><ul><li>Exposing, sharing and connecting data about things via dereferenceable URIs </li></ul><ul><li>Linking datasets together that were not previously connected, for example: </li></ul><ul><ul><li>Music and people </li></ul></ul><ul><ul><li>Real-world things and places </li></ul></ul><ul><li>The Linking Open Data (LOD) effort aims to link various open datasets together (DBpedia, GeoNames, etc.) </li></ul>
  7. 7. Semantically-Interlinked Online Communities (SIOC) <ul><li>An effort from DERI, NUI Galway to discover how we can create / establish ontologies on the Semantic Web </li></ul><ul><li>Goal of the SIOC ontology is to address interoperability issues on the (Social) Web </li></ul><ul><li>http://sioc-project.org/ </li></ul><ul><li>SIOC has been adopted in a framework of 50 applications or modules deployed on over 400 sites </li></ul><ul><li>Various domains: Web 2.0, enterprise information integration, HCLS, e-government </li></ul>
  8. 9. Some of the SIOC core ontology classes and properties
  9. 10. Some examples of where SIOC is already use (about 50 implementations / applications)
  10. 11. Web ID <ul><li>A Web ID is a web address that identifies a person as a Linked Data item </li></ul><ul><li>A Web ID should also lead to a document with more information about that person (e.g. FOAF, other RDF) </li></ul><ul><li>For more information, see the definition in this paper: </li></ul><ul><ul><li>Ching-Man Au Yeung, Ilaria Liccardi, Kanghao Lu, Oshani Seneviratne, Tim Berners-Lee, “ Decentralization: The Future of Online Social Networking ”, W3C Workshop on Future of Social Networking </li></ul></ul>
  11. 12. Design
  12. 13. Mapping IRC identifiers to URIs on the Web <ul><li>irc://freenode </li></ul><ul><li>(IRC Network) </li></ul><ul><li>irc://freenode/%23 channel </li></ul><ul><li>(Channel) </li></ul><ul><li>No identifier </li></ul><ul><li>(Message) </li></ul><ul><li>irc://freenode/ persona ,isuser </li></ul><ul><li>(Chat Persona) </li></ul><ul><li>http://irc.sioc-project.org/#freenode </li></ul><ul><li>http://irc.sioc-project.org/ channel #channel </li></ul><ul><li>http://irc.sioc-project.org/ channel /0000-00-00 #00:00:00.00 </li></ul><ul><li>http://irc.sioc-project.org/users/ persona #user </li></ul>
  13. 14. Some of the internal and external links
  14. 15. Browsing the Linked Data
  15. 16. Creating a link between a user account on IRC and a personal profile <ul><li>Claiming a Web ID creates a link [black] between a user account (a sioc:User that created a sioc:Post in a sioct:ChatChannel) and a person (foaf:Person) </li></ul><ul><li>The person can manually verify this: </li></ul><ul><ul><li>By pointing back to the sioc:User from their foaf:Person definition [grey] </li></ul></ul>
  16. 17. Web IDs in SiocLog <ul><li>A Web ID can be claimed using mttlbot </li></ul><ul><li>Can claim using standard IRC services </li></ul><ul><li>/msg nickserv </li></ul><ul><li>set property webid SomeWebID </li></ul>
  17. 18. Implementation <ul><li>2000 lines of Python source code </li></ul><ul><li>1000 lines of Zope/TAL HTML templates </li></ul><ul><li>Twisted, SimpleTAL and Redland libraries </li></ul><ul><li>Four major components: </li></ul><ul><ul><li>IRC interface, data analysis, data integration, Web </li></ul></ul>
  18. 19. Implementation (2) <ul><li>IRC interface: </li></ul><ul><ul><li>Discussion logger / persona monitor on Twisted </li></ul></ul><ul><li>Data analysis: </li></ul><ul><ul><li>Process logs, a filters pipeline, sinks for stats / output </li></ul></ul><ul><li>Data integration: </li></ul><ul><ul><li>Queries for external Linked Data (personal profiles) </li></ul></ul><ul><li>Web interface: </li></ul><ul><ul><li>Requests via CGI, publishes as HTML and RDF </li></ul></ul>
  19. 20. Finding the names of friends of an IRC persona with SPARQL <ul><li>semwebquery –sparql &quot;SELECT ?name WHERE { </li></ul><ul><li>?person foaf:holdsAccount </li></ul><ul><li><http://irc.sioc-project.org/users/melvster#user> . </li></ul><ul><li>?person foaf:knows ?friend . </li></ul><ul><li>?friend foaf:name ?name . }&quot; </li></ul>
  20. 21. Validation <ul><li>291 chat personas on five channels </li></ul><ul><li>22,418 chat messages </li></ul><ul><li>51 chat personas have associated Web IDs claimed using mttlbot (2/3) or nickserv (1/3) </li></ul><ul><ul><li>44 of those have a valid associated RDF document </li></ul></ul><ul><li>Scalable (projected 4 million triples in 10 years) </li></ul><ul><li>SiocLog data being consumed by the “Towards linked sensor data for Hackystat” project </li></ul><ul><li>SiocLog interfaces to FOAF Me for new profile creation </li></ul>
  21. 22. Future work <ul><li>Extend to instant messaging and private messaging </li></ul><ul><li>Study of IRC communities where users and content are distributed across channels and networks </li></ul>
  22. 23. Acknowledgements <ul><li>We would like to thank Science Foundation Ireland for their support under grant SFI/08/CE/I1380 (Líon 2) </li></ul><ul><li>Thanks also to Benja Fallenstein and Dan Brickley for their insights </li></ul>
  23. 24. Summary <ul><li>IRC conversations are quite disconnected from the Web and even from other IRC channels and networks </li></ul><ul><li>Often there is valuable and needed information in an IRC chat that cannot be linked to people, topics or events, or in general referenced from elsewhere </li></ul><ul><li>SIOC provides a framework for interlinking social media to other content and Linked Data, and IRC has been integrated as a part of that framework </li></ul><ul><li>We also used mechanisms to link IRC contributions to the people who made them via Web ID and FOAF </li></ul>
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×