Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Decomposing discussion forums using user roles


Published on

  • who will win this game? get free picks and predictions. ▲▲▲
    Are you sure you want to  Yes  No
    Your message goes here
  • Be the first to like this

Decomposing discussion forums using user roles

  1. 1. Decomposing discussion forums using user roles Jeffrey Chan & Conor Hayes<br />Friday seminar<br />8/20/2010<br />Presenter: AstaZelenkauskaite<br />
  2. 2. Feature-based profiling<br />Users roles are identified by the features (indicators) to profile user behavior<br />Visualization techniques<br />Downside: used only for small-scale studies<br />Proposed solution: soc net analysis<br />Ego-network analysis and the out-degree distribution<br />
  3. 3. Data<br /> – the largest discussion board in Ireland<br />596 forums<br />75400 users <br />244850 threads <br />4.3 mln posts<br />
  4. 4. Forums<br />
  5. 5. Optimal number of clusters<br />
  6. 6. Analysis<br />Weighted directed graph <br />Ego-net graph – reply graph (multi-edge graph)<br />20 forums from 01/07/2006 – 31/12/2006.<br />
  7. 7. Features<br />Initially 50 features, redundant eliminated<br />Structural features (as communication btw users)<br />Unweighted directed graphs<br />From interaction with their neighbors<br />Reciprocity features<br />Persistence features<br />Popularity features<br />Initialization features<br />
  8. 8. Structural features (operationalization)<br />From interaction with their neighbors<br />Reciprocity features<br />% of bi-directional neighbors (represents the % of the neighbors of a user where there is both in and out edges – they have replied to each other).<br />Persistence features<br />The length of the conversations a user typically engages in (mean and sd of the posts per thread).<br />Popularity features<br />Ratio of a users’ in-neighbors (% of in-degree) # of replies<br />% of the posts where there is at least one reply to the user.<br />Initialization features<br />Initiated % of msgsby a user.<br />
  9. 9. User role discovery approach<br />Data cleaning<br />Filtering out low-degree, low posting users<br />User grouping<br />Via number of neighbors<br />
  10. 10. User roles<br />Joining conversationalists <br />the ones who do not initiate but post replies<br />Taciturns<br />Low reciprocity (rarely get involved into two-way communication)<br />Elitists<br />Low % of neighbors w/ two-way communication<br />Supporters<br />Middle range of the statistics of all features<br />Popular participant<br />Do not initiate many threads but get involved with a large percentage of users of a forum<br />Grunts<br />Similar to taciturns, relatively high levels of reciprocity.<br />Ignored<br />Extremely low % posts being replied to (not very popular)<br />
  11. 11. clusters<br />
  12. 12.
  13. 13. Results: Forum composition<br />Some forums are distinctively different from the others (eg. personal issues)<br />Difference in grouping by conversationalists vstaciturns<br />Some topics determine certain composition <br />
  14. 14. Discussion<br />Is it impossible to assess the ‘success of functioning’ from the composition of the group?<br />