SlideShare a Scribd company logo
SocialSensor: Sensing User Generated Input for
Improved Media Discovery and Experience
Topic discovery in tweeter streams




Dr. Yiannis Kompatsiaris, Project Coordinator
International Workshop on Search Computing, Brussels, 25 September 2012
Overview

•   Motivation
•   Objectives
•   Architecture
•   Use Cases and Requirements
    – News
    – Infotainment
• Research Activities and Results
    – Topic Discovery in Tweeter Streams
• Conclusions


                                 #2
What is SocialSensor?

• 3-year FP7 European Integrated Project
   – http://www.socialsensor.eu

• Members: CERTH, ATC (Greece), Deutsche Welle,
  University Koblenz, Research Center for Artificial
  Intelligence (Germany), The City University London,
  Alcatel – Lucent Bell Labs, JCP Consult (France),
  University of Klagenfurt (Austria), IBM Israel, Yahoo
  Iberia

• 1 year into the project (Development of user
  requirements, use case scenarios, architecture study and
  implementation and first R&D prototypes)


                                  #3
Motivation: Social Networks as Sensors
•    Social Networks is a data source with an
     extremely dynamic nature that reflects
     events and the evolution of community
     focus (user’s interests)
•    Transform individually rare but
     collectively frequent media to meaningful
     topics, events, points of
     interest, emotional states and social
     connections
•    Mine the data and their relations and
     exploit them in the right context
•    Scalable mining and indexing approaches
     taking into account the content and social
     context of social networks
Relevant Applications
Xin Jin, Andrew Gallagher, Liangliang Cao, Jiebo Luo, and
Jiawei Han. The wisdom of social multimedia:
using flickr for prediction and
forecast, International conference on Multimedia (MM
'10). ACM.




                                                                      Federal Emergency Management Agency plans to
                                                                      engage the public more in disaster response by
                                                                      sharing data and leveraging reports from mobile
                                                                      phones and social media


                                                            “…if you're more than 100 km away from the epicenter
                                                                [of an earthquake] you can read about the quake on
                                                                twitter before it hits you…”



                                                                       5
Objective
     SocialSensor quickly surfaces trusted and relevant material
                 from social media – with context.

                                Social media mining
                               Aggregation & indexing


                               content            time


                       usage             DySCO           location


                        social context           behaviour




Massive social media            Personalised access
and unstructured web            Ad-hoc P2P networks                 News - Infotainment
The SocialSensor Vision

  SocialSensor quickly surfaces trusted and relevant
      material from social media – with context.

•“quickly”: in real time
•“surfaces”: automatically discovers, clusters and searches
•“trusted”: automatic support in verification process
•“relevant”: to the users, personalized
•“material”: any material (text, image, audio, video =
multimedia), aggregated with other sources (e.g. web)
•“social media”: across all relevant social media platforms
•“with context”: location, time, sentiment, influence

                                  #7
Conceptual Architecture and Main components
• Real time dynamic topic              Public        In-project
  and event clustering                  Data         Data
• Trend, popularity                  DATA COLLECTION / CRAWLING
  and sentiment analysis
• Calculate trust/influence                     STORAGE

  scores around people                MINING              INDEXING
• Personalized search,
                                    SEARCH & RECOMMENDATION
  access & presentation
  based on social network                       SEMANTIC MIDDLEWARE
  interactions
                                    USER MODELLING & PRESENTATION
• Semantic enrichment
  and discovery of services

                              #10
“Social media is transforming the way we do journalism”
(New York Times)




         “Social media is the key place for emerging stories –
         internationally, nationally, locally” (BBC)



“It has changed the way we do
news”(MSN)



                                                   Source: picture alliance / dpa

                                 #12
#13
“It’s really hard to find the nuggets of useful stuff
           in an ocean of content” (BBC)




“Things that aren’t relevant crowd out the content
you are looking for” (MSN)



                            “The filters aren’t configurable
                            enough” (CNN)



                                                           Source: Getty
  Images
                                      #14
Verification was simpler in the past...




Source: Frank Grätz


                            #15
An example: BBC Verification Procedure: Arab
Spring Coverage
• Referencing locations against maps and existing images
  from, in particular, geo-located ones.
• Working with our colleagues in BBC Arabic and BBC
  Monitoring to ascertain that accents and language are
  correct for the location.
• Searching for the original source of the
  upload/sequences as an indicator of date.
• Examining weather reports and shadows to confirm that
  the conditions shown fit with the claimed date and time.
• Maintaining lists of previously verified material to act as
  reference for colleagues covering the stories.
• Checking scenery, weaponry, vehicles and licence plates
  against those known for the given country.

                               #16
Infotainment
• Events with large numbers
  of visitors
• Thessaloniki International
  Film Festival
   – 80,000 viewers / 100,000
     visitors in 10 days
   – 150 films, 350 screenings
• Fete de la Musique Berlin
   – 100,000 visitors every year
• Discovery and presentation
  of relevant aggregated
  social media (e.g. film
  ratings from tweets)



                                   #18
User Requirements
• Social media aggregation
• Sentiment analysis for
  screenings and events
• Real-time check-in
  heatmaps
• AR 3D interfaces with
  RT information layers
                                   30
• Social media-based film          25

  recommendations                  20
                                   15                                             Positive
                                   10                                             Negative
• Smart location-based              5                                             Interest

  recommendations                   0
                                   -5   Fri Mar Fri Mar Fri Mar Fri Mar Fri Mar
                                           06      07      08      09      10




                             #19
ThessFest
• Thessaloniki                 ThessFest
  International Film
  Festival
• Support
  twitter/comment
  usage within the app
• Ratings and
  comments per film
• Feedback
  aggregation
   – Votes
   – Tweets
• Real-time feedback
  to the organisation
  and visitors


                         #20
ThessFest

• Gather “realistic” user requirements
• Early showcase and evaluation of SocialSensor
  technologies in real-world event scale
• Engage users and create an informed user basis
• TDF14: 9-18 March 2012
   – 400+ users
   – 6500+ user sessions
   – positive response to social media
• Next version
   – Updated features
   – Android version

                                #21
Topic Discovery in Tweeter Streams
Giorgos Petkos, Symeon Papadopoulos, Yiannis Kompatsiaris,
Carlos Martin, David Corney, Ryan Skraba, Luca Aiello, Alex Jaimes, Yosi Mass




                                            #22
Problem Formulation

• Detect trending topics in a stream of Tweets
• Topics are represented as sets of keywords that
  capture the essence of an emerging news story
   – Ex: “ramires”, “goal”, “1-0”, “chelsea”, “score”
      in the case of a soccer match
• Due to evolving nature of social interest, topics are
  extracted per timeslot (e.g. per minute, hour, etc.)
   – in contrast to previous work on topic detection, we are not
     only interested in a posteriori detecting topics in a static
     corpus, but in detecting them at the time they are
     discussed

                                 #23
Background

Two basic classes of methods to discover topics:
• Feature-pivot (Feat-p):
    1. Select important (trending) terms in a stream
    2. Cluster them with related terms into topics
•   Document-pivot (Doc-p):
    1. Cluster documents (tweets) into groups based on
       similarity
    2. Extract most characteristic terms for each group




                                 #24
Proposed Methods
• Doc-p based on Locality Sensitive Hashing (LSH)
   – Efficient stream clustering  cluster filtering  term selection
• Graph-based feat-p approach
   – Term selection  term graph creation based on co-occurrence 
     graph clustering
• Feat-p based on frequent pattern detection
   – Parallel FP-Growth algorithm (Hadoop implementation)  frequent
     pattern ranking
• “Soft” frequent itemset mining
   – Greedy search for incrementally detecting maximal term sets of co-
     occurrence
• DF-IDFt approach
   – Select n-grams based on burstiness (increase in frequency relative to
     recent past)  hierarchical clustering of n-grams

                                      #25
Leveraging Influence
• Exploiting social context in the analysis procedure
• Assumption: Tweets coming from influential users
  are expected to be more relevant / less noisy
• Extract a first set of topics with previous TD methods
• Identify a set of influencers for each of those topics
   – Create content citation graph [nodes: users, edges:
     retweets], term profile for each edge based on RT text
   – Term- and topic-based subgraphs
   – Influence computation based on random walks
     (distributed computation on top of Hadoop)
• Filter original stream of Tweets taking into account
  only Tweets coming from influencers

                                #26
Evaluation

• Two datasets
   – Super Tuesday (ST): event in presidential nomination race for US
     Republican party [3.5M tweets, average 131 tweets/min]
   – FA Cup (FA): Chelsea vs. Liverpool [444K tweets, av. 148 tweets/min]
• Manual ground truth construction of set of interesting topics
   – Source: mainstream media reports
   – Representation: Set of topics (keywords + short headline) per timeslot
     [1-hour timeslot for ST, 1-minute for FA]
• Metrics
   – Topic recall (number of found target topics) [TRec]
   – Keyword-level recall [KRec] & precision [KPrec]
• Baseline
   – Latent Dirichlet Allocation (LDA)

                                         #27
Evaluation




             #28
Results

                         FA Cup                   Super Tuesday

                 TRec      KRec   KPrec   TRec        KRec        KPrec

      LDA        92.3%    14.8%   71.4%   27.3%      22.4%        40.4%

     Doc-p       53.8%    12.4%   46.4%   27.3%      11.6%        33.8%


 Graph-based     84.6%     9.6%   62.5%   18.2%       7.6%        33.1%


  Freq pattern   53.8%    30.3%   35.7%   18.2%      23.3%        17.6%


   Soft freq     84.6%    16.3%   58.9%   40.9%      13.0%        39.0%
   itemset

    DF-IDFt      92.3%    18.0%   57.1%   45.5%      16.9%        41.9%




                                    #29
Evaluation
FA cup:
•Time: 17:16, Ramires scores for Chelsea.
    – goal, 1-0, ramires, #cfc
•Time: 18:56, Andy Caroll takes a header but Cech saves Chelsea.
    – #facupfinal, saved, carroll, claiming, header, cech, @chelseafc, #cfcwembley
Super Tuesday:
•Time: 19:00: ABC/NBC/CNN project Newt Gingrich as the winner of the
Georgia Primary
    – georgia, newt, wins, primary, republican, breaking, gingrich
•Time: 19:00, Newt Gingrich says “Thank you Georgia! It is gratifying to win
my home state so decisively to launch our March Momentum”
    – @newtgingrich, win, georgia, launch, #marchmo, gratifying, march, momentum




                                                #30
Mobile browsing of large image
collections on a smart phone




                  #31
Mobile photo browsing
   & tagging on the go
                                HIERARCHICAL
CITY PROFILE MINING                            OWN IMAGE TAGGING
                              PHOTO BROWSING
Areas: cluster geotagging
information (BIRCH)
Image clustering:
community detection
(SCAN) on image similarity
graphs
Cluster processing:                                  CONTEXTUAL TAG
- classify landmarks-events                         RECOMMENDATION
- extract titles and tags


              ClustTour
                                       #32
City profile creation




Community detection on
 image similarity graphs




                           #33
Hierarchical exploration through mobile
Top-level spatial clustering
(areas)                        Adaptive marker clustering


                                                        Scrollable photo clusters




                                          #34
Conclusions
• Great interest in both use cases
   • In news social media have transformed both news generation and
     consumption
• Social media data mining can provide interesting results in
  many applications
• Not all data always available (e.g. User queries, fb)
   – Infrastructure, Policy issues
• Technical challenges
   – Fusion (multi-modality, context), real-time, noise, big data,
     aggregation (web, Linked Open Data)
• Applications challenges
   • User engagement, privacy, copyright, commercialization
Thank you!

More Related Content

What's hot

Sj terp emerging tech radar
Sj terp emerging tech radarSj terp emerging tech radar
Sj terp emerging tech radar
SaraJayneTerp
 
Driving Innovation with Knowledge Sharing and Open Data
Driving Innovation with Knowledge Sharing and Open DataDriving Innovation with Knowledge Sharing and Open Data
Driving Innovation with Knowledge Sharing and Open Data
Jeanne Holm
 
Computing for Human Experience: Sensors, Perception, Semantics, Social Comput...
Computing for Human Experience: Sensors, Perception, Semantics, Social Comput...Computing for Human Experience: Sensors, Perception, Semantics, Social Comput...
Computing for Human Experience: Sensors, Perception, Semantics, Social Comput...
Amit Sheth
 
Making & Shaping the News - technical artifacts and the interplay of journali...
Making & Shaping the News - technical artifacts and the interplay of journali...Making & Shaping the News - technical artifacts and the interplay of journali...
Making & Shaping the News - technical artifacts and the interplay of journali...
Nele Heise
 
Socialmediapublicsavety
SocialmediapublicsavetySocialmediapublicsavety
SocialmediapublicsavetyFrank Smilda
 
Netnography online course part 1 of 3 17 november 2016
Netnography online course part 1 of 3 17 november 2016Netnography online course part 1 of 3 17 november 2016
Netnography online course part 1 of 3 17 november 2016
suresh sood
 
Using Behaviour Analysis to Detect Cultural Aspects in Social Web Systems
Using Behaviour Analysis to Detect Cultural Aspects in Social Web SystemsUsing Behaviour Analysis to Detect Cultural Aspects in Social Web Systems
Using Behaviour Analysis to Detect Cultural Aspects in Social Web Systems
Matthew Rowe
 

What's hot (8)

Sj terp emerging tech radar
Sj terp emerging tech radarSj terp emerging tech radar
Sj terp emerging tech radar
 
Driving Innovation with Knowledge Sharing and Open Data
Driving Innovation with Knowledge Sharing and Open DataDriving Innovation with Knowledge Sharing and Open Data
Driving Innovation with Knowledge Sharing and Open Data
 
VRCs & Technology
VRCs & TechnologyVRCs & Technology
VRCs & Technology
 
Computing for Human Experience: Sensors, Perception, Semantics, Social Comput...
Computing for Human Experience: Sensors, Perception, Semantics, Social Comput...Computing for Human Experience: Sensors, Perception, Semantics, Social Comput...
Computing for Human Experience: Sensors, Perception, Semantics, Social Comput...
 
Making & Shaping the News - technical artifacts and the interplay of journali...
Making & Shaping the News - technical artifacts and the interplay of journali...Making & Shaping the News - technical artifacts and the interplay of journali...
Making & Shaping the News - technical artifacts and the interplay of journali...
 
Socialmediapublicsavety
SocialmediapublicsavetySocialmediapublicsavety
Socialmediapublicsavety
 
Netnography online course part 1 of 3 17 november 2016
Netnography online course part 1 of 3 17 november 2016Netnography online course part 1 of 3 17 november 2016
Netnography online course part 1 of 3 17 november 2016
 
Using Behaviour Analysis to Detect Cultural Aspects in Social Web Systems
Using Behaviour Analysis to Detect Cultural Aspects in Social Web SystemsUsing Behaviour Analysis to Detect Cultural Aspects in Social Web Systems
Using Behaviour Analysis to Detect Cultural Aspects in Social Web Systems
 

Viewers also liked

Measuring User Influence in Twitter
Measuring User Influence in TwitterMeasuring User Influence in Twitter
Measuring User Influence in Twitter
augustodefranco .
 
Bye, Bye Research. Hello Data Mining! Tapping into Social Media
Bye, Bye Research. Hello Data Mining! Tapping into Social MediaBye, Bye Research. Hello Data Mining! Tapping into Social Media
Bye, Bye Research. Hello Data Mining! Tapping into Social Media
Alterian
 
Processing Large Complex Data
Processing Large Complex DataProcessing Large Complex Data
Processing Large Complex Data
Yiannis Kompatsiaris
 
Vision about Social Networks Content Exploitation (EC Concertation meeting)
Vision about Social Networks Content Exploitation (EC Concertation meeting)Vision about Social Networks Content Exploitation (EC Concertation meeting)
Vision about Social Networks Content Exploitation (EC Concertation meeting)
Yiannis Kompatsiaris
 
The DemaWare Service-Oriented AAL Platform for People with Dementia
The DemaWare Service-Oriented AAL Platform for People with DementiaThe DemaWare Service-Oriented AAL Platform for People with Dementia
The DemaWare Service-Oriented AAL Platform for People with Dementia
Yiannis Kompatsiaris
 
DATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEY
DATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEYDATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEY
DATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEY
ijdkp
 
Improve My City: App for Citizens Reporting Issues in Municipalities – Regions
Improve My City: App for Citizens Reporting Issues in Municipalities – RegionsImprove My City: App for Citizens Reporting Issues in Municipalities – Regions
Improve My City: App for Citizens Reporting Issues in Municipalities – Regions
Yiannis Kompatsiaris
 
Text mining of Social Network Data for Business Intelligence - iLabs camp
Text mining of Social Network Data for Business Intelligence - iLabs campText mining of Social Network Data for Business Intelligence - iLabs camp
Text mining of Social Network Data for Business Intelligence - iLabs camp
Ankit Sharma
 
Understanding Data Mining in the Social Media Marketing Age
Understanding Data Mining in the Social Media Marketing AgeUnderstanding Data Mining in the Social Media Marketing Age
Understanding Data Mining in the Social Media Marketing Age
Sherman Mohr Jr.
 
Social Media Verification Challenges, Approaches and Applications
Social Media Verification  Challenges, Approaches and ApplicationsSocial Media Verification  Challenges, Approaches and Applications
Social Media Verification Challenges, Approaches and Applications
Yiannis Kompatsiaris
 
Social Media Analytics: The Value Proposition
Social Media Analytics: The Value PropositionSocial Media Analytics: The Value Proposition
Social Media Analytics: The Value Proposition
Content Savvy
 
Data Mining in Business
Data Mining in BusinessData Mining in Business
Data Mining in Business
hamidics786
 
Social media mining and multimedia analysis research and applications
Social media mining and multimedia analysis research and applicationsSocial media mining and multimedia analysis research and applications
Social media mining and multimedia analysis research and applications
Yiannis Kompatsiaris
 
Social Media Analytics for Graph-Based Event Detection
Social Media Analytics for Graph-Based Event DetectionSocial Media Analytics for Graph-Based Event Detection
Social Media Analytics for Graph-Based Event Detection
Yiannis Kompatsiaris
 
Production model lifecycle management 2016 09
Production model lifecycle management 2016 09Production model lifecycle management 2016 09
Production model lifecycle management 2016 09
Greg Makowski
 
Campaign Analytics & Optimization
Campaign Analytics & OptimizationCampaign Analytics & Optimization
Campaign Analytics & Optimization
George Krasadakis
 
Social Media Mining: An Introduction
Social Media Mining: An IntroductionSocial Media Mining: An Introduction
Social Media Mining: An Introduction
Ali Abbasi
 
Mining Social media for Product Development & Marketing Insights
Mining Social media for Product Development & Marketing InsightsMining Social media for Product Development & Marketing Insights
Mining Social media for Product Development & Marketing Insights
juliannacole
 
SocialSensor Project: Sensing User Generated Input for Improved Media Discove...
SocialSensor Project: Sensing User Generated Input for Improved Media Discove...SocialSensor Project: Sensing User Generated Input for Improved Media Discove...
SocialSensor Project: Sensing User Generated Input for Improved Media Discove...
Yiannis Kompatsiaris
 

Viewers also liked (20)

Measuring User Influence in Twitter
Measuring User Influence in TwitterMeasuring User Influence in Twitter
Measuring User Influence in Twitter
 
Bye, Bye Research. Hello Data Mining! Tapping into Social Media
Bye, Bye Research. Hello Data Mining! Tapping into Social MediaBye, Bye Research. Hello Data Mining! Tapping into Social Media
Bye, Bye Research. Hello Data Mining! Tapping into Social Media
 
Processing Large Complex Data
Processing Large Complex DataProcessing Large Complex Data
Processing Large Complex Data
 
Vision about Social Networks Content Exploitation (EC Concertation meeting)
Vision about Social Networks Content Exploitation (EC Concertation meeting)Vision about Social Networks Content Exploitation (EC Concertation meeting)
Vision about Social Networks Content Exploitation (EC Concertation meeting)
 
The DemaWare Service-Oriented AAL Platform for People with Dementia
The DemaWare Service-Oriented AAL Platform for People with DementiaThe DemaWare Service-Oriented AAL Platform for People with Dementia
The DemaWare Service-Oriented AAL Platform for People with Dementia
 
DATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEY
DATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEYDATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEY
DATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEY
 
Improve My City: App for Citizens Reporting Issues in Municipalities – Regions
Improve My City: App for Citizens Reporting Issues in Municipalities – RegionsImprove My City: App for Citizens Reporting Issues in Municipalities – Regions
Improve My City: App for Citizens Reporting Issues in Municipalities – Regions
 
Text mining of Social Network Data for Business Intelligence - iLabs camp
Text mining of Social Network Data for Business Intelligence - iLabs campText mining of Social Network Data for Business Intelligence - iLabs camp
Text mining of Social Network Data for Business Intelligence - iLabs camp
 
Understanding Data Mining in the Social Media Marketing Age
Understanding Data Mining in the Social Media Marketing AgeUnderstanding Data Mining in the Social Media Marketing Age
Understanding Data Mining in the Social Media Marketing Age
 
Social Media Verification Challenges, Approaches and Applications
Social Media Verification  Challenges, Approaches and ApplicationsSocial Media Verification  Challenges, Approaches and Applications
Social Media Verification Challenges, Approaches and Applications
 
Social Media Analytics: The Value Proposition
Social Media Analytics: The Value PropositionSocial Media Analytics: The Value Proposition
Social Media Analytics: The Value Proposition
 
Data Mining in Business
Data Mining in BusinessData Mining in Business
Data Mining in Business
 
Social media mining and multimedia analysis research and applications
Social media mining and multimedia analysis research and applicationsSocial media mining and multimedia analysis research and applications
Social media mining and multimedia analysis research and applications
 
Social Media Analytics for Graph-Based Event Detection
Social Media Analytics for Graph-Based Event DetectionSocial Media Analytics for Graph-Based Event Detection
Social Media Analytics for Graph-Based Event Detection
 
Production model lifecycle management 2016 09
Production model lifecycle management 2016 09Production model lifecycle management 2016 09
Production model lifecycle management 2016 09
 
Business Friendly Data Mining
Business Friendly Data MiningBusiness Friendly Data Mining
Business Friendly Data Mining
 
Campaign Analytics & Optimization
Campaign Analytics & OptimizationCampaign Analytics & Optimization
Campaign Analytics & Optimization
 
Social Media Mining: An Introduction
Social Media Mining: An IntroductionSocial Media Mining: An Introduction
Social Media Mining: An Introduction
 
Mining Social media for Product Development & Marketing Insights
Mining Social media for Product Development & Marketing InsightsMining Social media for Product Development & Marketing Insights
Mining Social media for Product Development & Marketing Insights
 
SocialSensor Project: Sensing User Generated Input for Improved Media Discove...
SocialSensor Project: Sensing User Generated Input for Improved Media Discove...SocialSensor Project: Sensing User Generated Input for Improved Media Discove...
SocialSensor Project: Sensing User Generated Input for Improved Media Discove...
 

Similar to Socialsensor project overview and topic discovery in tweeter streams

Citizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and ApplicationsCitizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and Applications
Amit Sheth
 
From Research to Applications: What Can We Extract with Social Media Sensing?
From Research to Applications: What Can We Extract with Social Media Sensing?From Research to Applications: What Can We Extract with Social Media Sensing?
From Research to Applications: What Can We Extract with Social Media Sensing?
Yiannis Kompatsiaris
 
Analytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the DatasphereAnalytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the Datasphere
J T "Tom" Johnson
 
Sentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and WhySentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and WhyDavide Feltoni Gurini
 
Lida change-reference-abels
Lida change-reference-abelsLida change-reference-abels
Lida change-reference-abelsfpehar
 
Twitris - Web Information System 2011 Course
Twitris - Web Information System 2011 Course Twitris - Web Information System 2011 Course
Twitris - Web Information System 2011 Course
Ashutosh Jadhav
 
Visual Information Analysis for Crisis and Natural Disasters Management and R...
Visual Information Analysis for Crisis and Natural Disasters Management and R...Visual Information Analysis for Crisis and Natural Disasters Management and R...
Visual Information Analysis for Crisis and Natural Disasters Management and R...
Yiannis Kompatsiaris
 
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slidesMining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Michael Mathioudakis
 
Lecture4 Social Web
Lecture4 Social Web Lecture4 Social Web
Lecture4 Social Web
Marieke van Erp
 
Knowledge Management and Open Data for Innovation
Knowledge Management and Open Data for InnovationKnowledge Management and Open Data for Innovation
Knowledge Management and Open Data for Innovation
Jeanne Holm
 
Big and Small Web Data
Big and Small Web DataBig and Small Web Data
Big and Small Web Data
Marieke Guy
 
Myths and challenges in knowledge extraction and analysis from human-generate...
Myths and challenges in knowledge extraction and analysis from human-generate...Myths and challenges in knowledge extraction and analysis from human-generate...
Myths and challenges in knowledge extraction and analysis from human-generate...
Marco Brambilla
 
Methods and Tools for Facilitating Social Participation
Methods and Tools for Facilitating Social ParticipationMethods and Tools for Facilitating Social Participation
Methods and Tools for Facilitating Social Participation
Uniq UI: Usability, UX, and UI design, consulting and training
 
histoGraph: a case study in Digital Humanities
histoGraph: a case study in Digital HumanitieshistoGraph: a case study in Digital Humanities
histoGraph: a case study in Digital Humanities
CUbRIK Project
 
Social Media Crawling and Mining Seminar (Motivation Part)
Social Media Crawling and Mining Seminar (Motivation Part)Social Media Crawling and Mining Seminar (Motivation Part)
Social Media Crawling and Mining Seminar (Motivation Part)Yiannis Kompatsiaris
 
Exploring social theory through enterprise social media (muller, ibm research)
Exploring social theory through enterprise social media (muller, ibm research)Exploring social theory through enterprise social media (muller, ibm research)
Exploring social theory through enterprise social media (muller, ibm research)
Michael Muller
 
Enabling Smarter Cities through Internet of Things, Web of Data & Citizen Par...
Enabling Smarter Cities through Internet of Things, Web of Data & Citizen Par...Enabling Smarter Cities through Internet of Things, Web of Data & Citizen Par...
Enabling Smarter Cities through Internet of Things, Web of Data & Citizen Par...
Diego López-de-Ipiña González-de-Artaza
 
ODC BarCamp 2013 - Introduction to Data Journalism
ODC BarCamp 2013 - Introduction to Data JournalismODC BarCamp 2013 - Introduction to Data Journalism
ODC BarCamp 2013 - Introduction to Data Journalism
Open Development Cambodia
 
Text Mining : Experience
Text Mining : ExperienceText Mining : Experience
Text Mining : Experience
Boonlert Aroonpiboon
 
COSMOS
COSMOSCOSMOS
COSMOSNSMNSS
 

Similar to Socialsensor project overview and topic discovery in tweeter streams (20)

Citizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and ApplicationsCitizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and Applications
 
From Research to Applications: What Can We Extract with Social Media Sensing?
From Research to Applications: What Can We Extract with Social Media Sensing?From Research to Applications: What Can We Extract with Social Media Sensing?
From Research to Applications: What Can We Extract with Social Media Sensing?
 
Analytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the DatasphereAnalytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the Datasphere
 
Sentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and WhySentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and Why
 
Lida change-reference-abels
Lida change-reference-abelsLida change-reference-abels
Lida change-reference-abels
 
Twitris - Web Information System 2011 Course
Twitris - Web Information System 2011 Course Twitris - Web Information System 2011 Course
Twitris - Web Information System 2011 Course
 
Visual Information Analysis for Crisis and Natural Disasters Management and R...
Visual Information Analysis for Crisis and Natural Disasters Management and R...Visual Information Analysis for Crisis and Natural Disasters Management and R...
Visual Information Analysis for Crisis and Natural Disasters Management and R...
 
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slidesMining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
 
Lecture4 Social Web
Lecture4 Social Web Lecture4 Social Web
Lecture4 Social Web
 
Knowledge Management and Open Data for Innovation
Knowledge Management and Open Data for InnovationKnowledge Management and Open Data for Innovation
Knowledge Management and Open Data for Innovation
 
Big and Small Web Data
Big and Small Web DataBig and Small Web Data
Big and Small Web Data
 
Myths and challenges in knowledge extraction and analysis from human-generate...
Myths and challenges in knowledge extraction and analysis from human-generate...Myths and challenges in knowledge extraction and analysis from human-generate...
Myths and challenges in knowledge extraction and analysis from human-generate...
 
Methods and Tools for Facilitating Social Participation
Methods and Tools for Facilitating Social ParticipationMethods and Tools for Facilitating Social Participation
Methods and Tools for Facilitating Social Participation
 
histoGraph: a case study in Digital Humanities
histoGraph: a case study in Digital HumanitieshistoGraph: a case study in Digital Humanities
histoGraph: a case study in Digital Humanities
 
Social Media Crawling and Mining Seminar (Motivation Part)
Social Media Crawling and Mining Seminar (Motivation Part)Social Media Crawling and Mining Seminar (Motivation Part)
Social Media Crawling and Mining Seminar (Motivation Part)
 
Exploring social theory through enterprise social media (muller, ibm research)
Exploring social theory through enterprise social media (muller, ibm research)Exploring social theory through enterprise social media (muller, ibm research)
Exploring social theory through enterprise social media (muller, ibm research)
 
Enabling Smarter Cities through Internet of Things, Web of Data & Citizen Par...
Enabling Smarter Cities through Internet of Things, Web of Data & Citizen Par...Enabling Smarter Cities through Internet of Things, Web of Data & Citizen Par...
Enabling Smarter Cities through Internet of Things, Web of Data & Citizen Par...
 
ODC BarCamp 2013 - Introduction to Data Journalism
ODC BarCamp 2013 - Introduction to Data JournalismODC BarCamp 2013 - Introduction to Data Journalism
ODC BarCamp 2013 - Introduction to Data Journalism
 
Text Mining : Experience
Text Mining : ExperienceText Mining : Experience
Text Mining : Experience
 
COSMOS
COSMOSCOSMOS
COSMOS
 

More from Yiannis Kompatsiaris

AI4Media - European Leadership in Human-Centred Trustworthy AI session
AI4Media - European Leadership in Human-Centred Trustworthy AI sessionAI4Media - European Leadership in Human-Centred Trustworthy AI session
AI4Media - European Leadership in Human-Centred Trustworthy AI session
Yiannis Kompatsiaris
 
Social media mining for sensing and responding to real-world trends and events
Social media mining for sensing and responding to real-world trends and eventsSocial media mining for sensing and responding to real-world trends and events
Social media mining for sensing and responding to real-world trends and events
Yiannis Kompatsiaris
 
Sensor Based Ambient Assisted Living
Sensor Based Ambient Assisted LivingSensor Based Ambient Assisted Living
Sensor Based Ambient Assisted Living
Yiannis Kompatsiaris
 
Dem@care Project Short Overview
Dem@care Project Short OverviewDem@care Project Short Overview
Dem@care Project Short Overview
Yiannis Kompatsiaris
 
"Μια πόλη από το μέλλον": Πως ο πολίτης μπορεί να γίνει συμμέτοχος μέσω της χ...
"Μια πόλη από το μέλλον": Πως ο πολίτης μπορεί να γίνει συμμέτοχος μέσω της χ..."Μια πόλη από το μέλλον": Πως ο πολίτης μπορεί να γίνει συμμέτοχος μέσω της χ...
"Μια πόλη από το μέλλον": Πως ο πολίτης μπορεί να γίνει συμμέτοχος μέσω της χ...
Yiannis Kompatsiaris
 
Τεχνικές Αναγνώρισης Προτύπων και Μηχανικής Μάθησης για Εφαρμογές Ανάλυσης Πο...
Τεχνικές Αναγνώρισης Προτύπων και Μηχανικής Μάθησης για Εφαρμογές Ανάλυσης Πο...Τεχνικές Αναγνώρισης Προτύπων και Μηχανικής Μάθησης για Εφαρμογές Ανάλυσης Πο...
Τεχνικές Αναγνώρισης Προτύπων και Μηχανικής Μάθησης για Εφαρμογές Ανάλυσης Πο...
Yiannis Kompatsiaris
 
Άνοια στο σπίτι: Τεχνολογίες για παρακολούθηση από απόσταση και ανεξάρτητη δ...
 Άνοια στο σπίτι: Τεχνολογίες για παρακολούθηση από απόσταση και ανεξάρτητη δ... Άνοια στο σπίτι: Τεχνολογίες για παρακολούθηση από απόσταση και ανεξάρτητη δ...
Άνοια στο σπίτι: Τεχνολογίες για παρακολούθηση από απόσταση και ανεξάρτητη δ...
Yiannis Kompatsiaris
 
Introduction for the Summer School on Social Media Modeling and Search 2012
Introduction for the Summer School on Social Media Modeling and Search 2012Introduction for the Summer School on Social Media Modeling and Search 2012
Introduction for the Summer School on Social Media Modeling and Search 2012Yiannis Kompatsiaris
 

More from Yiannis Kompatsiaris (8)

AI4Media - European Leadership in Human-Centred Trustworthy AI session
AI4Media - European Leadership in Human-Centred Trustworthy AI sessionAI4Media - European Leadership in Human-Centred Trustworthy AI session
AI4Media - European Leadership in Human-Centred Trustworthy AI session
 
Social media mining for sensing and responding to real-world trends and events
Social media mining for sensing and responding to real-world trends and eventsSocial media mining for sensing and responding to real-world trends and events
Social media mining for sensing and responding to real-world trends and events
 
Sensor Based Ambient Assisted Living
Sensor Based Ambient Assisted LivingSensor Based Ambient Assisted Living
Sensor Based Ambient Assisted Living
 
Dem@care Project Short Overview
Dem@care Project Short OverviewDem@care Project Short Overview
Dem@care Project Short Overview
 
"Μια πόλη από το μέλλον": Πως ο πολίτης μπορεί να γίνει συμμέτοχος μέσω της χ...
"Μια πόλη από το μέλλον": Πως ο πολίτης μπορεί να γίνει συμμέτοχος μέσω της χ..."Μια πόλη από το μέλλον": Πως ο πολίτης μπορεί να γίνει συμμέτοχος μέσω της χ...
"Μια πόλη από το μέλλον": Πως ο πολίτης μπορεί να γίνει συμμέτοχος μέσω της χ...
 
Τεχνικές Αναγνώρισης Προτύπων και Μηχανικής Μάθησης για Εφαρμογές Ανάλυσης Πο...
Τεχνικές Αναγνώρισης Προτύπων και Μηχανικής Μάθησης για Εφαρμογές Ανάλυσης Πο...Τεχνικές Αναγνώρισης Προτύπων και Μηχανικής Μάθησης για Εφαρμογές Ανάλυσης Πο...
Τεχνικές Αναγνώρισης Προτύπων και Μηχανικής Μάθησης για Εφαρμογές Ανάλυσης Πο...
 
Άνοια στο σπίτι: Τεχνολογίες για παρακολούθηση από απόσταση και ανεξάρτητη δ...
 Άνοια στο σπίτι: Τεχνολογίες για παρακολούθηση από απόσταση και ανεξάρτητη δ... Άνοια στο σπίτι: Τεχνολογίες για παρακολούθηση από απόσταση και ανεξάρτητη δ...
Άνοια στο σπίτι: Τεχνολογίες για παρακολούθηση από απόσταση και ανεξάρτητη δ...
 
Introduction for the Summer School on Social Media Modeling and Search 2012
Introduction for the Summer School on Social Media Modeling and Search 2012Introduction for the Summer School on Social Media Modeling and Search 2012
Introduction for the Summer School on Social Media Modeling and Search 2012
 

Recently uploaded

GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
Rohit Gautam
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 

Recently uploaded (20)

GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 

Socialsensor project overview and topic discovery in tweeter streams

  • 1. SocialSensor: Sensing User Generated Input for Improved Media Discovery and Experience Topic discovery in tweeter streams Dr. Yiannis Kompatsiaris, Project Coordinator International Workshop on Search Computing, Brussels, 25 September 2012
  • 2. Overview • Motivation • Objectives • Architecture • Use Cases and Requirements – News – Infotainment • Research Activities and Results – Topic Discovery in Tweeter Streams • Conclusions #2
  • 3. What is SocialSensor? • 3-year FP7 European Integrated Project – http://www.socialsensor.eu • Members: CERTH, ATC (Greece), Deutsche Welle, University Koblenz, Research Center for Artificial Intelligence (Germany), The City University London, Alcatel – Lucent Bell Labs, JCP Consult (France), University of Klagenfurt (Austria), IBM Israel, Yahoo Iberia • 1 year into the project (Development of user requirements, use case scenarios, architecture study and implementation and first R&D prototypes) #3
  • 4. Motivation: Social Networks as Sensors • Social Networks is a data source with an extremely dynamic nature that reflects events and the evolution of community focus (user’s interests) • Transform individually rare but collectively frequent media to meaningful topics, events, points of interest, emotional states and social connections • Mine the data and their relations and exploit them in the right context • Scalable mining and indexing approaches taking into account the content and social context of social networks
  • 5. Relevant Applications Xin Jin, Andrew Gallagher, Liangliang Cao, Jiebo Luo, and Jiawei Han. The wisdom of social multimedia: using flickr for prediction and forecast, International conference on Multimedia (MM '10). ACM. Federal Emergency Management Agency plans to engage the public more in disaster response by sharing data and leveraging reports from mobile phones and social media “…if you're more than 100 km away from the epicenter [of an earthquake] you can read about the quake on twitter before it hits you…” 5
  • 6. Objective SocialSensor quickly surfaces trusted and relevant material from social media – with context. Social media mining Aggregation & indexing content time usage DySCO location social context behaviour Massive social media Personalised access and unstructured web Ad-hoc P2P networks News - Infotainment
  • 7. The SocialSensor Vision SocialSensor quickly surfaces trusted and relevant material from social media – with context. •“quickly”: in real time •“surfaces”: automatically discovers, clusters and searches •“trusted”: automatic support in verification process •“relevant”: to the users, personalized •“material”: any material (text, image, audio, video = multimedia), aggregated with other sources (e.g. web) •“social media”: across all relevant social media platforms •“with context”: location, time, sentiment, influence #7
  • 8. Conceptual Architecture and Main components • Real time dynamic topic Public In-project and event clustering Data Data • Trend, popularity DATA COLLECTION / CRAWLING and sentiment analysis • Calculate trust/influence STORAGE scores around people MINING INDEXING • Personalized search, SEARCH & RECOMMENDATION access & presentation based on social network SEMANTIC MIDDLEWARE interactions USER MODELLING & PRESENTATION • Semantic enrichment and discovery of services #10
  • 9. “Social media is transforming the way we do journalism” (New York Times) “Social media is the key place for emerging stories – internationally, nationally, locally” (BBC) “It has changed the way we do news”(MSN) Source: picture alliance / dpa #12
  • 10. #13
  • 11. “It’s really hard to find the nuggets of useful stuff in an ocean of content” (BBC) “Things that aren’t relevant crowd out the content you are looking for” (MSN) “The filters aren’t configurable enough” (CNN) Source: Getty Images #14
  • 12. Verification was simpler in the past... Source: Frank Grätz #15
  • 13. An example: BBC Verification Procedure: Arab Spring Coverage • Referencing locations against maps and existing images from, in particular, geo-located ones. • Working with our colleagues in BBC Arabic and BBC Monitoring to ascertain that accents and language are correct for the location. • Searching for the original source of the upload/sequences as an indicator of date. • Examining weather reports and shadows to confirm that the conditions shown fit with the claimed date and time. • Maintaining lists of previously verified material to act as reference for colleagues covering the stories. • Checking scenery, weaponry, vehicles and licence plates against those known for the given country. #16
  • 14. Infotainment • Events with large numbers of visitors • Thessaloniki International Film Festival – 80,000 viewers / 100,000 visitors in 10 days – 150 films, 350 screenings • Fete de la Musique Berlin – 100,000 visitors every year • Discovery and presentation of relevant aggregated social media (e.g. film ratings from tweets) #18
  • 15. User Requirements • Social media aggregation • Sentiment analysis for screenings and events • Real-time check-in heatmaps • AR 3D interfaces with RT information layers 30 • Social media-based film 25 recommendations 20 15 Positive 10 Negative • Smart location-based 5 Interest recommendations 0 -5 Fri Mar Fri Mar Fri Mar Fri Mar Fri Mar 06 07 08 09 10 #19
  • 16. ThessFest • Thessaloniki ThessFest International Film Festival • Support twitter/comment usage within the app • Ratings and comments per film • Feedback aggregation – Votes – Tweets • Real-time feedback to the organisation and visitors #20
  • 17. ThessFest • Gather “realistic” user requirements • Early showcase and evaluation of SocialSensor technologies in real-world event scale • Engage users and create an informed user basis • TDF14: 9-18 March 2012 – 400+ users – 6500+ user sessions – positive response to social media • Next version – Updated features – Android version #21
  • 18. Topic Discovery in Tweeter Streams Giorgos Petkos, Symeon Papadopoulos, Yiannis Kompatsiaris, Carlos Martin, David Corney, Ryan Skraba, Luca Aiello, Alex Jaimes, Yosi Mass #22
  • 19. Problem Formulation • Detect trending topics in a stream of Tweets • Topics are represented as sets of keywords that capture the essence of an emerging news story – Ex: “ramires”, “goal”, “1-0”, “chelsea”, “score” in the case of a soccer match • Due to evolving nature of social interest, topics are extracted per timeslot (e.g. per minute, hour, etc.) – in contrast to previous work on topic detection, we are not only interested in a posteriori detecting topics in a static corpus, but in detecting them at the time they are discussed #23
  • 20. Background Two basic classes of methods to discover topics: • Feature-pivot (Feat-p): 1. Select important (trending) terms in a stream 2. Cluster them with related terms into topics • Document-pivot (Doc-p): 1. Cluster documents (tweets) into groups based on similarity 2. Extract most characteristic terms for each group #24
  • 21. Proposed Methods • Doc-p based on Locality Sensitive Hashing (LSH) – Efficient stream clustering  cluster filtering  term selection • Graph-based feat-p approach – Term selection  term graph creation based on co-occurrence  graph clustering • Feat-p based on frequent pattern detection – Parallel FP-Growth algorithm (Hadoop implementation)  frequent pattern ranking • “Soft” frequent itemset mining – Greedy search for incrementally detecting maximal term sets of co- occurrence • DF-IDFt approach – Select n-grams based on burstiness (increase in frequency relative to recent past)  hierarchical clustering of n-grams #25
  • 22. Leveraging Influence • Exploiting social context in the analysis procedure • Assumption: Tweets coming from influential users are expected to be more relevant / less noisy • Extract a first set of topics with previous TD methods • Identify a set of influencers for each of those topics – Create content citation graph [nodes: users, edges: retweets], term profile for each edge based on RT text – Term- and topic-based subgraphs – Influence computation based on random walks (distributed computation on top of Hadoop) • Filter original stream of Tweets taking into account only Tweets coming from influencers #26
  • 23. Evaluation • Two datasets – Super Tuesday (ST): event in presidential nomination race for US Republican party [3.5M tweets, average 131 tweets/min] – FA Cup (FA): Chelsea vs. Liverpool [444K tweets, av. 148 tweets/min] • Manual ground truth construction of set of interesting topics – Source: mainstream media reports – Representation: Set of topics (keywords + short headline) per timeslot [1-hour timeslot for ST, 1-minute for FA] • Metrics – Topic recall (number of found target topics) [TRec] – Keyword-level recall [KRec] & precision [KPrec] • Baseline – Latent Dirichlet Allocation (LDA) #27
  • 24. Evaluation #28
  • 25. Results FA Cup Super Tuesday TRec KRec KPrec TRec KRec KPrec LDA 92.3% 14.8% 71.4% 27.3% 22.4% 40.4% Doc-p 53.8% 12.4% 46.4% 27.3% 11.6% 33.8% Graph-based 84.6% 9.6% 62.5% 18.2% 7.6% 33.1% Freq pattern 53.8% 30.3% 35.7% 18.2% 23.3% 17.6% Soft freq 84.6% 16.3% 58.9% 40.9% 13.0% 39.0% itemset DF-IDFt 92.3% 18.0% 57.1% 45.5% 16.9% 41.9% #29
  • 26. Evaluation FA cup: •Time: 17:16, Ramires scores for Chelsea. – goal, 1-0, ramires, #cfc •Time: 18:56, Andy Caroll takes a header but Cech saves Chelsea. – #facupfinal, saved, carroll, claiming, header, cech, @chelseafc, #cfcwembley Super Tuesday: •Time: 19:00: ABC/NBC/CNN project Newt Gingrich as the winner of the Georgia Primary – georgia, newt, wins, primary, republican, breaking, gingrich •Time: 19:00, Newt Gingrich says “Thank you Georgia! It is gratifying to win my home state so decisively to launch our March Momentum” – @newtgingrich, win, georgia, launch, #marchmo, gratifying, march, momentum #30
  • 27. Mobile browsing of large image collections on a smart phone #31
  • 28. Mobile photo browsing & tagging on the go HIERARCHICAL CITY PROFILE MINING OWN IMAGE TAGGING PHOTO BROWSING Areas: cluster geotagging information (BIRCH) Image clustering: community detection (SCAN) on image similarity graphs Cluster processing: CONTEXTUAL TAG - classify landmarks-events RECOMMENDATION - extract titles and tags ClustTour #32
  • 29. City profile creation Community detection on image similarity graphs #33
  • 30. Hierarchical exploration through mobile Top-level spatial clustering (areas) Adaptive marker clustering Scrollable photo clusters #34
  • 31. Conclusions • Great interest in both use cases • In news social media have transformed both news generation and consumption • Social media data mining can provide interesting results in many applications • Not all data always available (e.g. User queries, fb) – Infrastructure, Policy issues • Technical challenges – Fusion (multi-modality, context), real-time, noise, big data, aggregation (web, Linked Open Data) • Applications challenges • User engagement, privacy, copyright, commercialization