Decomposing discussion forums using user roles	Jeffrey Chan & Conor Hayes<br />Friday seminar<br />8/20/2010<br />Presente...
Feature-based profiling<br />Users roles are identified by the features (indicators) to profile user behavior<br />Visuali...
Data<br />Boards.ie – the largest discussion board in Ireland<br />596 forums<br />75400 users <br />244850 threads <br />...
Forums<br />
Optimal number of clusters<br />
Analysis<br />Weighted directed graph <br />Ego-net graph – reply graph (multi-edge graph)<br />20 forums from 01/07/2006 ...
Features<br />Initially 50 features, redundant eliminated<br />Structural features (as communication btw users)<br />Unwei...
Structural features (operationalization)<br />From interaction with their neighbors<br />Reciprocity features<br />% of bi...
User role discovery approach<br />Data cleaning<br />Filtering out low-degree, low posting users<br />User grouping<br />V...
User roles<br />Joining conversationalists <br />the ones who do not initiate but post replies<br />Taciturns<br />Low rec...
clusters<br />
Results: Forum composition<br />Some forums are distinctively different from the others (eg. personal issues)<br />Differe...
Discussion<br />Is it impossible to assess the ‘success of functioning’ from the composition of the group?<br />
Upcoming SlideShare
Loading in …5
×

Decomposing discussion forums using user roles

10,554 views

Published on

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
10,554
On SlideShare
0
From Embeds
0
Number of Embeds
10
Actions
Shares
0
Downloads
1
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Decomposing discussion forums using user roles

  1. 1. Decomposing discussion forums using user roles Jeffrey Chan & Conor Hayes<br />Friday seminar<br />8/20/2010<br />Presenter: AstaZelenkauskaite<br />
  2. 2. Feature-based profiling<br />Users roles are identified by the features (indicators) to profile user behavior<br />Visualization techniques<br />Downside: used only for small-scale studies<br />Proposed solution: soc net analysis<br />Ego-network analysis and the out-degree distribution<br />
  3. 3. Data<br />Boards.ie – the largest discussion board in Ireland<br />596 forums<br />75400 users <br />244850 threads <br />4.3 mln posts<br />
  4. 4. Forums<br />
  5. 5. Optimal number of clusters<br />
  6. 6. Analysis<br />Weighted directed graph <br />Ego-net graph – reply graph (multi-edge graph)<br />20 forums from 01/07/2006 – 31/12/2006.<br />
  7. 7. Features<br />Initially 50 features, redundant eliminated<br />Structural features (as communication btw users)<br />Unweighted directed graphs<br />From interaction with their neighbors<br />Reciprocity features<br />Persistence features<br />Popularity features<br />Initialization features<br />
  8. 8. Structural features (operationalization)<br />From interaction with their neighbors<br />Reciprocity features<br />% of bi-directional neighbors (represents the % of the neighbors of a user where there is both in and out edges – they have replied to each other).<br />Persistence features<br />The length of the conversations a user typically engages in (mean and sd of the posts per thread).<br />Popularity features<br />Ratio of a users’ in-neighbors (% of in-degree) # of replies<br />% of the posts where there is at least one reply to the user.<br />Initialization features<br />Initiated % of msgsby a user.<br />
  9. 9. User role discovery approach<br />Data cleaning<br />Filtering out low-degree, low posting users<br />User grouping<br />Via number of neighbors<br />
  10. 10. User roles<br />Joining conversationalists <br />the ones who do not initiate but post replies<br />Taciturns<br />Low reciprocity (rarely get involved into two-way communication)<br />Elitists<br />Low % of neighbors w/ two-way communication<br />Supporters<br />Middle range of the statistics of all features<br />Popular participant<br />Do not initiate many threads but get involved with a large percentage of users of a forum<br />Grunts<br />Similar to taciturns, relatively high levels of reciprocity.<br />Ignored<br />Extremely low % posts being replied to (not very popular)<br />
  11. 11. clusters<br />
  12. 12.
  13. 13. Results: Forum composition<br />Some forums are distinctively different from the others (eg. personal issues)<br />Difference in grouping by conversationalists vstaciturns<br />Some topics determine certain composition <br />
  14. 14. Discussion<br />Is it impossible to assess the ‘success of functioning’ from the composition of the group?<br />

×