Your SlideShare is downloading. ×
0
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
SiocLog: Providing IRC Discussion Logs as Linked Data
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

SiocLog: Providing IRC Discussion Logs as Linked Data

21,654

Published on

Social Data on the Web Workshop at the International Semantic Web Conference / Washington, DC / 26th October 2009

Social Data on the Web Workshop at the International Semantic Web Conference / Washington, DC / 26th October 2009

Published in: Technology, Education
1 Comment
2 Likes
Statistics
Notes
No Downloads
Views
Total Views
21,654
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
15
Comments
1
Likes
2
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. SiocLog: Providing IRC Discussion Logs as Linked Data Tuukka Hastrup 1 , Uldis Bojars 2 and John G. Breslin 2, 3 1 University of Jyväskylä, Finland 2 DERI, NUI Galway, Ireland 3 School of Engineering and Informatics, NUI Galway, Ireland
  • 2. Motivation <ul><li>IRC conversations are quite disconnected from the Web and even from other IRC channels and networks </li></ul><ul><li>Often there is valuable and needed information in an IRC chat that cannot be linked to people, topics or events, or in general referenced from elsewhere </li></ul><ul><li>This may be useful to people who do not use IRC, by those on other networks, or simply by people who leave and rejoin a channel </li></ul>
  • 3. Motivation (2) <ul><li>SIOC provides a framework for linking social media contributions to other content and Linked Data resources, and IRC can become part of that framework </li></ul><ul><li>We also need mechanisms to link the IRC contributions to the people who made them, hence the use of Web ID </li></ul>
  • 4. Background <ul><li>We will begin by introducing the various areas relevant to this system: </li></ul><ul><ul><li>IRC </li></ul></ul><ul><ul><li>Linked Data </li></ul></ul><ul><ul><li>SIOC </li></ul></ul><ul><ul><li>Web ID </li></ul></ul>
  • 5. Internet Relay Chat (IRC) <ul><li>Instant messaging / internet chat is a major form of social interaction online </li></ul><ul><li>It is often disconnected from the Web: </li></ul><ul><ul><li>Due to the different protocols involved </li></ul></ul><ul><ul><li>Due to its real-time nature / lack of persistent storage </li></ul></ul><ul><li>IRC was one of the earliest chat systems </li></ul><ul><li>It has an important role amongst open-source communities, web communities, and even geeks! </li></ul><ul><ul><li>Hundreds of thousands of users online at any time </li></ul></ul>
  • 6. Linked Data <ul><li>Building a “Web of Data” to enhance the current Web </li></ul><ul><li>Exposing, sharing and connecting data about things via dereferenceable URIs </li></ul><ul><li>Linking datasets together that were not previously connected, for example: </li></ul><ul><ul><li>Music and people </li></ul></ul><ul><ul><li>Real-world things and places </li></ul></ul><ul><li>The Linking Open Data (LOD) effort aims to link various open datasets together (DBpedia, GeoNames, etc.) </li></ul>
  • 7. Semantically-Interlinked Online Communities (SIOC) <ul><li>An effort from DERI, NUI Galway to discover how we can create / establish ontologies on the Semantic Web </li></ul><ul><li>Goal of the SIOC ontology is to address interoperability issues on the (Social) Web </li></ul><ul><li>http://sioc-project.org/ </li></ul><ul><li>SIOC has been adopted in a framework of 50 applications or modules deployed on over 400 sites </li></ul><ul><li>Various domains: Web 2.0, enterprise information integration, HCLS, e-government </li></ul>
  • 8.  
  • 9. Some of the SIOC core ontology classes and properties
  • 10. Some examples of where SIOC is already use (about 50 implementations / applications)
  • 11. Web ID <ul><li>A Web ID is a web address that identifies a person as a Linked Data item </li></ul><ul><li>A Web ID should also lead to a document with more information about that person (e.g. FOAF, other RDF) </li></ul><ul><li>For more information, see the definition in this paper: </li></ul><ul><ul><li>Ching-Man Au Yeung, Ilaria Liccardi, Kanghao Lu, Oshani Seneviratne, Tim Berners-Lee, “ Decentralization: The Future of Online Social Networking ”, W3C Workshop on Future of Social Networking </li></ul></ul>
  • 12. Design
  • 13. Mapping IRC identifiers to URIs on the Web <ul><li>irc://freenode </li></ul><ul><li>(IRC Network) </li></ul><ul><li>irc://freenode/%23 channel </li></ul><ul><li>(Channel) </li></ul><ul><li>No identifier </li></ul><ul><li>(Message) </li></ul><ul><li>irc://freenode/ persona ,isuser </li></ul><ul><li>(Chat Persona) </li></ul><ul><li>http://irc.sioc-project.org/#freenode </li></ul><ul><li>http://irc.sioc-project.org/ channel #channel </li></ul><ul><li>http://irc.sioc-project.org/ channel /0000-00-00 #00:00:00.00 </li></ul><ul><li>http://irc.sioc-project.org/users/ persona #user </li></ul>
  • 14. Some of the internal and external links
  • 15. Browsing the Linked Data
  • 16. Creating a link between a user account on IRC and a personal profile <ul><li>Claiming a Web ID creates a link [black] between a user account (a sioc:User that created a sioc:Post in a sioct:ChatChannel) and a person (foaf:Person) </li></ul><ul><li>The person can manually verify this: </li></ul><ul><ul><li>By pointing back to the sioc:User from their foaf:Person definition [grey] </li></ul></ul>
  • 17. Web IDs in SiocLog <ul><li>A Web ID can be claimed using mttlbot </li></ul><ul><li>Can claim using standard IRC services </li></ul><ul><li>/msg nickserv </li></ul><ul><li>set property webid SomeWebID </li></ul>
  • 18. Implementation <ul><li>2000 lines of Python source code </li></ul><ul><li>1000 lines of Zope/TAL HTML templates </li></ul><ul><li>Twisted, SimpleTAL and Redland libraries </li></ul><ul><li>Four major components: </li></ul><ul><ul><li>IRC interface, data analysis, data integration, Web </li></ul></ul>
  • 19. Implementation (2) <ul><li>IRC interface: </li></ul><ul><ul><li>Discussion logger / persona monitor on Twisted </li></ul></ul><ul><li>Data analysis: </li></ul><ul><ul><li>Process logs, a filters pipeline, sinks for stats / output </li></ul></ul><ul><li>Data integration: </li></ul><ul><ul><li>Queries for external Linked Data (personal profiles) </li></ul></ul><ul><li>Web interface: </li></ul><ul><ul><li>Requests via CGI, publishes as HTML and RDF </li></ul></ul>
  • 20. Finding the names of friends of an IRC persona with SPARQL <ul><li>semwebquery –sparql &quot;SELECT ?name WHERE { </li></ul><ul><li>?person foaf:holdsAccount </li></ul><ul><li><http://irc.sioc-project.org/users/melvster#user> . </li></ul><ul><li>?person foaf:knows ?friend . </li></ul><ul><li>?friend foaf:name ?name . }&quot; </li></ul>
  • 21. Validation <ul><li>291 chat personas on five channels </li></ul><ul><li>22,418 chat messages </li></ul><ul><li>51 chat personas have associated Web IDs claimed using mttlbot (2/3) or nickserv (1/3) </li></ul><ul><ul><li>44 of those have a valid associated RDF document </li></ul></ul><ul><li>Scalable (projected 4 million triples in 10 years) </li></ul><ul><li>SiocLog data being consumed by the “Towards linked sensor data for Hackystat” project </li></ul><ul><li>SiocLog interfaces to FOAF Me for new profile creation </li></ul>
  • 22. Future work <ul><li>Extend to instant messaging and private messaging </li></ul><ul><li>Study of IRC communities where users and content are distributed across channels and networks </li></ul>
  • 23. Acknowledgements <ul><li>We would like to thank Science Foundation Ireland for their support under grant SFI/08/CE/I1380 (Líon 2) </li></ul><ul><li>Thanks also to Benja Fallenstein and Dan Brickley for their insights </li></ul>
  • 24. Summary <ul><li>IRC conversations are quite disconnected from the Web and even from other IRC channels and networks </li></ul><ul><li>Often there is valuable and needed information in an IRC chat that cannot be linked to people, topics or events, or in general referenced from elsewhere </li></ul><ul><li>SIOC provides a framework for interlinking social media to other content and Linked Data, and IRC has been integrated as a part of that framework </li></ul><ul><li>We also used mechanisms to link IRC contributions to the people who made them via Web ID and FOAF </li></ul>

×