@twitter Mining #Microblogs Using #SemanticTechnologies
Selver Softic, Martin Ebner, Herbert Mühlburger, Thomas Altmann, Behnam Taraghi
Web 2.0: a well-known story
  • Web 2.0 technologies brought users closer to the Web
  • Wikis, blogs, forums …
  • Podcasts, RSS, XML …
  • … then users started to generate content …
  Source: http://mediabistro.com
From Web to Social Web
  • Result = a vast amount of information: text, pictures, audio, videos …
  • Communication, networking, exchange of data
  • The Web became more personal
  • Cultural, geographical and social borders disappeared
  Source: http://www.ignitesocialmedia.com
Social Media Boom!
Social sites are data silos (source: www.pidgintech.com)
But still disconnected? (source: www.pidgintech.com)
Data is still captured in a walled garden!
Statements
  • The Social Web relies on users and the communication among them
  • While communicating, users produce or consume content
  • Social sites are data silos, rich in a variety of information
  • This information could be interesting for: monitoring of trends, advertising, statistics, reputation, news broadcasting, tagging …
  • This data is captured in a walled garden!
Questions
  • How can this data be used to gain more useful insights?
  • What are the advantages of online (offline) search on such data, and how can the data be reached in a uniform way?
  • Is it possible to structure, connect and expose the data so that humans and machines can use it more efficiently?
  • What would an architecture for this look like?
Social Web Trends
  • Microblogging
  • Social bookmarking
  • Social networking
  • Social marketing
  • Sharing photos, videos …
  Source: http://socialwebresearch.com
Microblogs
  Microblogs
  • Used for communication, publishing and information exchange
  • Simple to process
  • Information generated by many different users
  • Social user relations
  • Tripartite communication structure
  • Variety of information
  • No boundaries of culture, location or technology (mobile users)
  Twitter
  • Most popular
  • Large amount of data
  • But limited
  • According to http://an.kaist.ac.kr/traces/WWW2010.html: 41.7 million user profiles, 1.47 billion social relations, 4,262 trending topics, and 106 million tweets
Semantic aspects and Twitter
  Twitter
  • User relations
  • Tweets as short information artefacts
  • Communication with tripartite pattern
  • Time-related information
  Vocabularies
  • SIOC, FOAF, Dublin Core
Linked Data and Twitter
  • Twitter contains information on: people, organisations, locations, trends …
  • The LOD Cloud contains billions of triples about: geolocations, data about science, government, common knowledge, persons, news …
  Vocabularies
  • MOAT, CommonTag
Architecture model
Acquisition - Grabeeter
Grabeeter
  • Search in your tweets
  • Filter your tweets by date
  • Search in your tweets offline using the Grabeeter client
  • Filter your tweets offline using the Grabeeter client
  • Grabeeter provides an API
Triplification Module
  Diagram labels: Author, Date, Content, Receiver; Triplifier; RDF Store
  <tweet url="http://grabeeter.tugraz.at/tweet/199272"
         text="Sitting in Prater #vienna, launch party. Nice"
         screen_name="selvers"
         created="2010-08-19"
         twitterUrl="http://twitter.com/selvers/status/21606926237"/>
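The step from a Grabeeter tweet record to the fields named on this slide (author, date, content, receiver) can be sketched in Python as follows. This is a minimal illustration using only the standard library, not the authors' Triplifier. The function name `parse_tweet`, the dictionary keys, and the treatment of @mentions as receivers are assumptions made for the example.

```python
import re
import xml.etree.ElementTree as ET

# Sample record copied from the slide.
TWEET_XML = (
    '<tweet url="http://grabeeter.tugraz.at/tweet/199272" '
    'text="Sitting in Prater #vienna, launch party. Nice" '
    'screen_name="selvers" created="2010-08-19" '
    'twitterUrl="http://twitter.com/selvers/status/21606926237"/>'
)


def parse_tweet(xml_text):
    """Extract author, date, content, receivers and hashtags from one <tweet> element."""
    el = ET.fromstring(xml_text)
    text = el.get("text", "")
    return {
        "author": el.get("screen_name"),
        "date": el.get("created"),
        "content": text,
        # Assumption: receivers are the @mentions found in the text.
        "receivers": re.findall(r"@(\w+)", text),
        "hashtags": re.findall(r"#(\w+)", text),
        "grabeeter_url": el.get("url"),
        "twitter_url": el.get("twitterUrl"),
    }


tweet = parse_tweet(TWEET_XML)
print(tweet["author"], tweet["date"], tweet["hashtags"])
```

The sample tweet contains no @mention, so its receiver list is empty; the hashtag list is what a later interlinking step would work from.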
Triplification Module
  @prefix foaf: <http://xmlns.com/foaf/0.1/> .
  @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
  @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
  @prefix owl: <http://www.w3.org/2002/07/owl#> .
  @prefix sioc: <http://rdfs.org/sioc/ns#> .
  @prefix sioct: <http://rdfs.org/sioc/types#> .
  @prefix dcterms: <http://purl.org/dc/terms/> .

  # The tweet itself
  <http://twitter.com/selvers/status/21606926237>
      rdf:type sioct:MicroblogPost ;
      sioc:content "Sitting in Prater #vienna, launch party. Nice" ;
      sioc:has_creator <http://twitter.com/selvers/> ;
      foaf:maker <http://grabeteer.tugraz.at/foaf/selvers/> ;
      dcterms:created "2010-08-19" ;
      owl:sameAs <http://grabeeter.tugraz.at/tweet/199272> .

  # The author and the people the author knows
  <http://twitter.com/selvers/>
      rdf:type foaf:Person ;
      foaf:name "Selver Softic" ;
      foaf:depiction <http://a0.twimg.com/profile_images/905118560/f9e4b6eba.13070201_3_normal.jpg> ;
      foaf:knows <http://twitter.com/hmuehlburger/> ;
      foaf:knows <http://twitter.com/mhausenblas/> ;
      foaf:knows <http://twitter.com/mebner/> .
  …
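How such statements might be generated from a parsed tweet can be sketched as below. This is a hedged illustration and not the authors' Triplifier: the helper names `turtle_literal` and `tweet_to_turtle` and the input dictionary layout are assumptions, prefix declarations are left out, and the identity link to the Grabeeter record is written with `owl:sameAs`, the standard OWL property for that purpose.

```python
def turtle_literal(value):
    """Quote a string as a Turtle literal, escaping backslashes and double quotes."""
    return '"' + value.replace("\\", "\\\\").replace('"', '\\"') + '"'


def tweet_to_turtle(tweet):
    """Return Turtle statements describing one tweet (prefix declarations omitted)."""
    # Assumption: the creator URI is derived from the screen name.
    creator = "http://twitter.com/%s/" % tweet["author"]
    lines = [
        "<%s> rdf:type sioct:MicroblogPost ;" % tweet["twitter_url"],
        "    sioc:content %s ;" % turtle_literal(tweet["content"]),
        "    sioc:has_creator <%s> ;" % creator,
        "    dcterms:created %s ;" % turtle_literal(tweet["date"]),
        "    owl:sameAs <%s> ." % tweet["grabeeter_url"],
    ]
    return "\n".join(lines)


# Values taken from the slide; the dictionary keys are hypothetical.
tweet = {
    "author": "selvers",
    "date": "2010-08-19",
    "content": "Sitting in Prater #vienna, launch party. Nice",
    "twitter_url": "http://twitter.com/selvers/status/21606926237",
    "grabeeter_url": "http://grabeeter.tugraz.at/tweet/199272",
}
turtle = tweet_to_turtle(tweet)
print(turtle)
```

String formatting keeps the sketch dependency-free; a production pipeline would more likely build the graph with an RDF library and let it handle serialization and escaping.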
Interlinking Module
  • Hashtags (people, organisations, locations)
  • MOAT, CommonTag
  • Later: NLP-processed content, SILK Framework

  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX foaf: <http://xmlns.com/foaf/0.1/>
  PREFIX sioc: <http://rdfs.org/sioc/ns#>
  PREFIX sioct: <http://rdfs.org/sioc/types#>

  SELECT ?post ?content ?maker ?name
  WHERE {
    ?post rdf:type sioct:MicroblogPost ;
          foaf:maker ?maker ;
          sioc:content ?content .
    ?maker foaf:name ?name .
    FILTER(regex(?content, "#vienna"))
  }

  Classifier
    tag:tagName "vienna" ;
    moat:tagMeaning <http://dbpedia.org/resource/Vienna> ;
    tag:taggedResource <http://twitter.com/selvers/status/2160692623>
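The interlinking idea on this slide, selecting posts by hashtag and attaching a Linked Data meaning to the tag, can be approximated in plain Python. This is only a sketch under stated assumptions: `TAG_MEANINGS` is a hypothetical lookup table (only its "vienna" entry comes from the slide), the second post is made up, and `find_posts` merely imitates the regex FILTER in memory rather than querying an RDF store.

```python
import re

# Hypothetical lookup table from hashtag to a Linked Data resource.
# Only the "vienna" entry comes from the slide; a real system would consult
# a tag-meaning service or a Linked Data lookup service instead.
TAG_MEANINGS = {
    "vienna": "http://dbpedia.org/resource/Vienna",
}


def find_posts(posts, hashtag):
    """Imitate FILTER(regex(?content, "#vienna")): keep posts whose content matches."""
    pattern = re.compile(re.escape("#" + hashtag))
    return [p for p in posts if pattern.search(p["content"])]


def interlink(post):
    """Yield (tag name, meaning URI, tagged resource) for each hashtag with a known meaning."""
    for tag in re.findall(r"#(\w+)", post["content"]):
        meaning = TAG_MEANINGS.get(tag.lower())
        if meaning is not None:
            yield (tag.lower(), meaning, post["url"])


posts = [
    {"url": "http://twitter.com/selvers/status/21606926237",
     "content": "Sitting in Prater #vienna, launch party. Nice"},
    {"url": "http://example.org/status/1",  # made-up second post
     "content": "No hashtags here"},
]
matches = find_posts(posts, "vienna")
links = [link for post in matches for link in interlink(post)]
print(links)
```

Each resulting tuple corresponds to one tag name, tag meaning and tagged resource statement of the kind shown under "Classifier".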
Analysis
Conclusions & Outlook
  • Current state-of-the-art technologies suffice to realise the proposed architecture paradigm
  • Interlinking with the LOD Cloud (Tweet-O-Sphere)
  • Involving NLP methods
  • Sentiment classification
  • (Re)tagging of tweets
  • Providing a SPARQL endpoint + lookup service as research interface
  • Social Semantic Web apps
Questions?

Swap2010: Twitter mining using Semantic Web technologies and Linked Data
