Analysis Report of Greek Blogosphere by DataMine.it

1,016 views
947 views

Published on

Sync.gr run an extensive survey on greek blogosphere. Here you may find an extended data mining analysis on the results, provided by http://DataMine.it

Published in: Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,016
On SlideShare
0
From Embeds
0
Number of Embeds
16
Actions
Shares
0
Downloads
30
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Analysis Report of Greek Blogosphere by DataMine.it

  1. 1. Analysis Report Prepared for: Nikos Drandakis, www.sync.gr Prepared by: Eleutheria Kanavou, Data Engineer, datamine.it January 17th, 2009 Report number: 000-0007 datamine.it 14 Meletiou Vasileiou Str 11 745 Athens, Greece T +30 6937 122 065 go@datamine.it http://datamine.it
  2. 2. Executive Summary Objective The hereby report summarizes the results of the extended data mining analysis performed for Nikos Drandakis, www.sync.gr. The initial data provided regards a survey on bloggers in Greece and their characteristics, which served as the input for a bunch of advanced methodologies and algorithms run to reveal underlying structure and patterns that reside as latent across the data. The paragraphs to follow include, among others, a careful selection of the most significant out of these results, in terms of relevance, consistency and accuracy. The results are presented in a comprehensible and easily digestible format, ready to support decision making processes. Goals The analysis performed served a single goal: To extensively study the given data set in order to search for and find out the most important of the rules and patterns hidden within the data. The study, eventually, contributes the shaping of these patterns into usable knowledge, while putting focus on the given variables of specific interest. Means The tools and approaches used for extracting the underlying patterns out of the available data set lie in the conjunction of Artificial Intelligence / Machine Learning and Statistics, an area commonly called Data Mining. The datamine.it team leverages on extended research experience on the topic to utilize state-of-the-art tools and techniques and provide you with the most insightful of the results, while yet in an absolutely familiar way. Outcomes Among the vast number of results occurred and the most significant out of them to be appeared throughout the report, a sneak peek of the insights gained is provided here: • The most popular issues covered in the Greek blogosphere are strongly associated with the general characteristics of a blogger. • Bloggers customs are determined by their age, their education and their professional occupation. • The use of social web services, such as ‘twitter’, ‘delicious’, ‘facebook’ etc, is related to blogging. The totality of contents of this report by DataMine.it is distributed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 License. Analysis Report 000-0007 2
  3. 3. Table of Contents The context 4 Data, in general 4 Data Mining, in general 4 Data Mine.it, in specific 4 The content 5 Analysis of the data set 5 The analysis 8 Introduction 8 Best rules discovered 8 General outcomes 16 Appendix I: Questionnaire 17 Performed questions 17 Appendix II: Data set attributes 21 Description of data set attributes 21 Appendix III: Rules discovered 28 List of significant rules discovered 28 Contact Information 36 Analysis Report 000-0007 3
  4. 4. The context Data, in general Data stands as the least biased input to decision making, the purest source of insights and knowledge. Today, data is generated, stored and used at an unprecedented rate and volume. Typical tools available to interpret data generated by commonly used tools and techniques such as statistical reports and surveys cannot respond efficiently to the hurdles today's volume of data and required in-depth analysis pose. Datamine.it presents a solution to this problem. Data Mining, in general Where classical approaches prove to be ineffective of the scale, speed and simplicity needed, artificial intelligence comes to join statistics and provide the much needed solution. That solution is Data Mining. You can visualize data mining as a process of searching for treasure buried in the sand or digging up rock to mine for gold - thus 'mining' -, but the tools we use do it in a truly systematic and efficient way. In our case, the rock stands for data and the gold are the insights and knowledge hidden within the data set. That said, a miner with a mattock in his hand is a very rough way to conceptualize the complexity and state-of-the-art of the processes executed. A diverse and extended set of exploration and filtering algorithms, next to a variety of learning and meta-learning techniques, were utilized, optimized and evaluated, while the problem is a computationally intensive one and demands a highly customized approach. Data Mine.it, in specific The paragraphs to follow aim at providing insight on the patterns that emerge from the extended -in both width and depth- data mining analysis of the given data set. A bunch of sophisticated machine learning algorithms were run and fine-tuned by one or more datamine.it engineers to end up on extracting outcomes and patterns that make perfect sense for your dataset and really provide you with insights you never imagined before, or never thought them as being well proven; we like to call it quot;a tale of discovery, from your data to the report on handquot;. What’s more, rest assured we've worked really hard to separate the wheat from the chaff, all the peculiar terminology included. And if you were used to concern a pie chart or a histogram as the most insightful thing you could expect from a data analysis, get ready to be astonished on the pages to follow. Analysis Report 000-0007 4
  5. 5. The content Analysis of the data set The initial dataset consisted of 31 attributes (you may visualize it as the number of ‘questions performed’, see Appendix I) and 919 instances (the number of ‘samples collected’). For the sake of our analysis and clarity of the results, we dismantled these 31 questions into 108 attributes, the analytical description of which is provided in Appendix II, while Table 1 that follows gives a very sneak peek of them. Description Quantity attributes 108 nominal 108 numeric 0 target 48 instances 919 missing 0 uniques 0 Table 1: Data set at a glance Let's take a deeper view. Table 2 provides the titles of all attributes, which consist the data set. These are referred here to provide you with a broader view of the data in focus that are potentially utilized in the results of the following pages. Again, you may find a more detailed description of the submitted attributes in Appendix II. # Name # Name # Name 1 q101_platform_blogspot 37 q805_about_music 73 q1703_greekprob_humanrights 2 q102_platform_worpress 38 q806_about_news 74 q1704_greekprob_education 3 q103_platform_livespaces 39 q807_about_art_culture 75 q1705_greekprob_insurance 4 q104_platform_pathfinder 40 q808_about_media 76 q1706_greekprob_externalpolicy 5 q105_platform_other 41 q809_about_sport 77 q1707_greekprob_corruption 6 q301_com_forum 42 q810_about_gaming 78 q1708_greekprob_culture 7 q302_com_chatroom 43 q811_about_gossip 79 q1709_greekprob_technology 8 q303_com_socnet 44 q812_about_science 80 q17010_greekprob_healthsys 9 q304_com_other 45 q813_about_religion 81 q18_advert 10 q401_char_blog 46 q814_about_health 82 q19_advert_benef 11 q402_char_blog_other 47 q815_about_other 83 q2001_affect_career_known 12 q501_media_internet 48 q9_anonymity 84 q2002_affect_career_cv 13 q502_media_tv 49 q1001_anonymity_you 85 q2003_affect_career_prestige 14 q503_media_radio 50 q1002_anonymity_you_other 86 q2004_affect_career_changedep 15 q504_media_newspapers 51 q1101_why_blog_opinion 87 q2005_affect_career_leftjob Analysis Report 000-0007 5
  6. 6. # Name # Name # Name 16 q505_media_magazines 52 q1102_why_blog_share_thoughts 88 q2006_affect_career_lostjob 17 q601_socweb_facebook 53 q1103_why_blog_contacts 89 q2007_affect_career_noneofthem 18 q602_socweb_youtube 54 q1104_why_blog_inform_ff 90 q2008_affect_career_other 19 q603_socweb_twitter 55 q1105_why_blog_known 91 q2101_invitation_event 20 q604_socweb_frienfeed 56 q1106_why_blog_money_career 92 q2102_invitation_discussion 21 q605_socweb_delicious 57 q1107_why_blog_cv 93 q2103_invitation_campaign 22 q606_socweb_Greader 58 q1108_why_blog_customers 94 q2104_invitation_press 23 q607_socweb_lastfm 59 q1201_success_personal_satisf 95 q2105_invitation_tv/radio 24 q608_socweb_flickr 60 q1202_success_comments 96 q2106_invitation_speaker 25 q609_socweb_linkedln 61 q1203_success_visitors 97 q22_time 26 q610_socweb_buzz 62 q1204_success_links 98 q23_unique_visitors 27 q611_socweb_netvibes 63 q1205_success_users 99 q2401_occupation 28 q612_socweb_myspace 64 q1206_success_mediareport 100 q2402_occupation_other 29 q613_socweb_mogulus 65 q1207_success_money 101 q25_occupation_about 30 q614_socweb_ustream 66 q1208_success_leads 102 q26_education 31 q615_socweb_other 67 q13_satisfied 103 q27_age 32 q7_num_blogs 68 q14_blog_journal 104 q28_annual 33 q801_about_personal 69 q15_copyright 105 q2901_home 34 q802_about_political 70 q16_more_time 106 q2902_home_other 35 q803_about_social 71 q1701_greekprob_economy 107 q30_home_greece 36 q804_about_tech 72 q1702_greekprob_enviroment 108 q31_sex Table 2: Titles of attributes in use As the target for the analysis served 48 attributes from those listed in Table 2. In other words, the analysis performed attempt to extract relationships and insights of all other attributes in regard to each one of the 48 attributes at a time. The distributions of all the attributes are given in the Appendix II. Due to the sample’s complexity and size, various advanced filtering techniques were repeatedly utilized to firstly rank these attributes according to their correlation and informational value in regards to the analysis target, and then put focus on the ones that matter the most. All attributes are ranked according to their informational value to describe the target attribute. An example of such a ranking is provided with attribute quot;agequot; serving as the target. Table 4 presents the 10 most valuable out of these, as occurred by such a process, while Table 5 contributes the ones of least informational value. Analysis Report 000-0007 6
  7. 7. # Name 1 q2401_occupation 2 q28_annual 3 q26_education 4 q25_occupation_about 5 q503_media_radio 6 q505_media_magazines 7 q16_more_time 8 q502_media_tv 9 q601_socweb_facebook 10 q1708_greekprob_culture Table 4: Attributes of most informational value # Name 1 q2104_invitation_press 2 q2007_affect_career_noneofthem 3 q31_sex 4 q2105_invitation_tv/radio 5 q2102_invitation_discussion 6 q610_socweb_buzz 7 q2103_invitation_campaign 8 q2101_invitation_event 9 q103_platform_livespaces 10 q606_socweb_Greader Table 5: Attributes of low informational value Given the rough description of the submitted data set and the analysis framework deployed before, the next paragraph stands as the core of this report, moving to the actual results of the knowledge discovery process. Analysis Report 000-0007 7
  8. 8. The analysis Introduction As referred above, the analysis performed utilized an extended variety of advanced data mining techniques and machine learning algorithms, next to the outcomes of the data set’s analysis, to finally extract the best and brightest of its latent pat- terns. Significant effort was also put into transforming these patterns and analysis results into some direct, tangible and easi- ly comprehensible outcomes. Best rules discovered The pages to follow describe in words and figures the most significant out of the rules discovered, in other words the most distinguishable of the patterns emerged out of the extensive mining processes performed. Each pattern is also described by the number of cases that validates it across the data set, as well as its success rate. Apart from the rules presented here, Appendix III provides an extended list of (less or more) significant rules discovered, essentially contributing to the formation and understanding of the latent knowledge in the given data set. Rule 1: If q806_about_news=yes and q811_about_gossip=yes then q808_about_media=yes (39 occurred instances supported this rule, while 7 did not) (85% success) Rule 1 indicates that if the blog deals with news and gossip, then it also includes media topics at a rate of 85%. Analysis Report 000-0007 8
  9. 9. Rule 2: If q603_socweb_twitter=yes and q605_socweb_delicious=yes then q804_about_tech=yes (69.0/13.0) (84% success) Rule 2 suggests, with a certainty of 84%, that if the blogger often uses ‘twitter’ and ‘delicious’ in the social web, then he sees about technology subjects in his blog. Rule 3: If q803_about_social=yes and q806_about_news=yes then q802_about_political=yes (247.0/49.0) (83% success) This rule provides the insight that if the respondent deals with social issues and news in his blog, then he is expected to also write about political topics, with an accuracy of 85%. Rule 4: If q27_age=<18 and q502_media_tv=2-3/week then q615_socweb_other=hi5 (4.0/1.0) (80% success) Rule 4 suggests that 80% of the teenagers bloggers, who get informed from television 2-3 times per week, often use the social web service ‘Hi5’. Analysis Report 000-0007 9
  10. 10. Rule 5: If q603_socweb_twitter=yes and q605_socweb_delicious=yes then q608_socweb_flickr=yes (73.0/23.0) (76% success) Rule 5 indicates that if the blogger uses both ‘twitter’ and ‘delicious’, then he also uses ‘flickr’ at a rate 76% between the respondents. Rule 6: If q603_socweb_twitter=yes then q606_socweb_Greader=yes (148.0/47.0) (76% success) Hence, if the blogger uses only the social web service ‘twitter’ then he often uses the service ‘google reader’ with a certainty of 76%. Rule 7: If q612_socweb_myspace=yes and q1001_anonymity_you=nickname then q805_about_music=yes (49.0/16.0) (75% success) Rule 7 indicates that the respondents who often use the web service ‘my space’ and participate in the blog using nicknames deal with music in their blogs. Analysis Report 000-0007 10
  11. 11. Rule 8: If q601_socweb_facebook=no and q602_socweb_youtube=no and q615_socweb_other=no then q606_socweb_Google reader=yes (46.0/15.0) (75% success) Rule 8 suggests that ‘google reader’ is also preferred by the 75% of the bloggers who don’t often use ‘facebook’ and ‘you- tube’ or another social web service. Rule 9: If q603_socweb_twitter=yes and q609_socweb_linkedln=yes then q604_socweb_frienfeed=yes (48.0/16.0) (75% success) Another important association between social web services occurs from rule 8. That is, 75% of the bloggers who often use ‘twitter’ and ‘linkedln’ also use ‘friendfeed’. Rule 10: If q27_age=36-45 and q28_annual=no_answer then q601_socweb_facebook=no (46.0/15.0) (75% success) People 36-45 years old with unknown annual income do not often use the social web service ‘facebook’. Analysis Report 000-0007 11
  12. 12. Rule 11: If q803_about_social=yes and q31_sex=male then q802_about_political=yes (192.0/69.0) (74% success) Rule 11 suggests that male respondents, who handle social issues in their blogs handle, also write about political subjects. Rule 12: If q1101_why_blog_opinion=definitely and q26_education=tech_inst then q27_age=36-45 (26.0/9.0) (74% success) This rule indicates that graduates of technical educational institutes who use the blogs in order to express their opinion are typically 36-45 years old. Rule 13: If q803_about_social=yes and q401_char_blog=informative then q806_about_news=yes (90.0/34.0) (73% success) Rule 13 suggests that 73% of the bloggers, who deal with social issues and characterize their blogs as informative, also provide news coverage. Analysis Report 000-0007 12
  13. 13. Rule 14: If q804_about_tech=yes and q608_socweb_flickr=yes then q1001_anonymity_you=realname (65.0/27.0) (71% success) If a blog concerns technology and its owner is also a flickr user, then he is expected to sign with his real name. Rule 15: If q303_com_socnet=yes and q302_com_chatroom=no then q301_com_forum=no (172.0/69.0) (71% success) 71% of the bloggers who participated in social nets (like ‘facebook’, ‘my space’, etc) before blogging, but not in chat rooms, they neither participated in online forums. Rule 16: If q610_socweb_buzz=yes then q606_socweb_Google reader=yes (47.0/20.0) (70%success) Rule 16 suggests, with a certainty of 70%, that bloggers who often use the web service 'buzz', are also expected to utilize the ‘google reader’ web service. Analysis Report 000-0007 13
  14. 14. Rule 17: If q1205_success_users=enough and q804_about_tech=yes then q606_socweb_Google reader=yes (52.0/22.0) (70% success) Among the responders who consider the number of readers as an important success metric and also blog about technolo- gy, 70% of them also use ‘google reader’. Rule 18: If q16_more_time=always and q502_media_tv=everyday then q27_age=36-45 (84.0/37.0) (69% success) Rule 18 suggests that bloggers who spend extra time on the issues that they blog about -apart from blogging- and get in- formed by television on a daily basis, are expected to be 36-45 years old. Analysis Report 000-0007 14
  15. 15. Rule 19: If q805_about_music=yes and q803_about_social=yes then q807_about_art_culture=yes (103.0/49.0) (68% success) Finally, 68% of the respondents, who write about music and social issues, are expected to also blog on subjects related to culture and the arts. Again, the rules demonstrated here consist a small part of the best rules found, all of which are available in Appendix II. Analysis Report 000-0007 15
  16. 16. General outcomes The extended analysis performed and the numbers of results presented in the previous pages, as long as in the Appendix III, clearly shaped out a number of outcomes, the most significant of which are also deployed hereby: • If the blog deals with news and gossip, typically it also refers to media. • If the blogger often uses ‘twitter’ and ‘delicious’ in the social web then he sees about technology subjects in his blog. • 85% of the respondents who deal with social issues and news in their blogs are also concerned about political top- ics. • The social web service ‘google reader’ is preferred by the 75% of the bloggers who don’t often use ‘facebook’ and ‘youtube’. • Moreover, bloggers who often use the web service ‘buzz’ prefer the ‘google reader’. • 75% of the bloggers who often use ‘twitter’ and ‘linkedln’ use also ‘friendfeed’. • Male respondents who handle social issues in their blogs handle also and political matters. • 68% of the blogs which are about music and social issues deal also with art and culture subjects. While the results found are presented at full extent in the Appendixes below (including the list of the full name and description of the questions posed and answers submitted, the attributes analytical description and plots, most valuable -information wise- attributes and a really big list of rules extracted), it is by now clear that the on hand analysis has contributed deep in- sights, yet simple descriptions, on the patterns and knowledge that were lying unveiled through the submitted data set. This tale of discovery, from your data to the report on hand, seemed to reach its end, at least on the part of maximizing the value of your data input. We do believe you’ll come to validate this, while we continuously remain at your request for shaping the next episode of your data tales. Analysis Report 000-0007 16
  17. 17. Appendix I: Questionnaire Performed questions The list of the performed questions and the corresponding possible answers are listed in the following table. # Question Answers 1 q1: Which blogging platform do you use? q11: Blogger (Blogspot.com) q12: Wordpress q 13: Live Spaces q14: Pathfinder (pblogs.gr) q15: other (define) 2 q2: Date q2: Date 3 q3: Before you started blogging, were you active in another online q31: online Forum q32: chat rooms/IRC channels community? q33: social networks (i.e Facebook, Myspace) q34: other (define) 4 q4: Would you consider your blog as (pick one) personnal, informative, professional, corporate, other 5 q5: Which media do you use to get informed? q51: Internet q52: TV q53:radio q54: newspapers q55: magazines 6 q6: Which of the following services of the social web do you often use? q61: Facebook q62: YouTube q63: Twitter q64: Friendfeed q65: delicious q66: Google Reader q67: Last.fm q68: Flickr q69: LinkedIn q610: Buzz (Reality Tape) q611: Netvibes q612: MySpace q613: Mogulus q614: Ustream.tv q615: other (define) 7 q7: In how many blogs do you frequently contribute? one, two, three ,four, five, more than Analysis Report 000-0007 17
  18. 18. # Question Answers five 8 q8: On which of the following topics do you blog about? q81: personal q82: politics q83: social q84: technology q85: music q86: news q87: arts & culture q88: Media q89: sports q810: Gaming q811: gossip q812: science q813:religion q814: health q815: other(define) 9 q9: Should anonymity / aliasing of bloggers be protected? yes, no 10 q10: How do you handle anonymity in your blog? use real name, use nickname, anonymous, use nickname/anonymous but thinking to use real name, other 11 q11: Why do you blog? q111: to voice my opinion q112: to share my knowledge and experiences q113: to get in contact with people with similar interests q114: to inform friends and relatives on my activities q115: to become known in traditional media q116: to earn some money or start a career in writing q117: to improve my resume q118: to gain new customers in my work 12 q12: How do you track your blogs' success? q121: for personal satisfaction q122: by number of comments q123: by number of visitors q124: by number of links from other blogs q125: by number of RSS subscribers q126: by media references q127: money I earn q128:customers/leads won 13 q13: How satisfied are you with the content of blogs in Greece? none, little, enough, very, no opinion 14 q14: Do you consider blogs as a form of journalism? yes, no, sometimes, don't know 15 q15: When you are using intellectual content of others (news, photos, Yes, I always check the license to use it, videos etc), do you consider intellectual rights? I use no matter of the license, but I refer to the source, No, I'm not paying attention and I use it if I want to, Analysis Report 000-0007 18
  19. 19. # Question Answers I never use content produced by others. 16 q16: Do you spend extra time on the news or opinions you deploy? never, sometimes, often, very often, always, don’t answer 17 q17: Assess the important topics to be solved in Greece nowadays q171: economy-evolvement q172: environment q173: personal/social rights q174: education q175: insurance/pension q176: external dangers q177: corruption q178: culture development q179: technological development q1710: health system 18 q18: Do you have advertisements in your blog? yes, no 19 q19: (if q18=yes) What is your monthly revenue? not important, <50€, <100€, <250€, <500€, <750€, <1000€, <2500€, >2500€, don’t answer 20 q20: How was your professional life affected by blogging? q201:get known in my sector because of my blog q202: use my blog in my cv or attract employers q203:gain credits in my company because of blogging q204: changed sector or company because of blogging q205: left my job for blogging q206: lost my job or fall in disgrace because of the stuff i write in my blog q207: nothing of the above q208:other (define) 21 q21: Were you invited in any of the following because of your blogging q211: group meeting q212: round table discussion activity? q213: campaign supporter q214: article/interview in press q215: participation in radio broadcasting/tv shows q216: participation as a speaker or panel member in exhibitions/conferences 22 q22: How many hours per week do you invest in blogging? <1h,1-3h,3-5h,5-10h,10-20h,>20h 23 q23: How many unique visitors do you have per month in your blog? <1000, <5000, <10000, <20000, <30000, <50000, <75000, <100000, <250000, <500000, >500000, don’t answer 24 q24: Your profession private servant, civil servant, free- lancer, businessman, unemployed, pupil/student, retired, other 25 q25: Area of profession media, technology, banking, insurances, education, art/culture, Analysis Report 000-0007 19
  20. 20. # Question Answers services, tourism, health, constructions, real estate, industry, transports, nutrition, entertainment, agriculture, public sector, DEKO, retail trade, nongovernmental organization, other 26 q26: Education high school, middle school, technical institute, university, master, PhD 27 q 27: age <18, 18-25, 26-35, 36-45, 46-55, 56- 65, >65 28 q28: Annual income <10000€, <20000€, <40000€, <60000€, >60000€, don’t answer 29 q29: Place of permanent residence Greece, EU, USA, other 30 q30: (if in greece) town of permanent residence Attica, Thessaloniki, other 31 q 31: sex male, female Analysis Report 000-0007 20
  21. 21. Appendix II: Data set attributes Description of data set attributes The list of attributes of the given data set is provided here. # Name Type Values Missing Distinct Unique 1 q101_platform_blogspot nominal {0,1}~{no,yes} 7 2 0 2 q102_platform_worpress nominal {0,1}~{no,yes} 7 2 0 3 q103_platform_livespaces nominal {0,1}~{no,yes} 7 2 0 4 q104_platform_pathfinder nominal {0,1}~{no,yes} 7 2 0 {0,drupal,joomla,musicheaven, 5 q105_platform_other nominal 7 7 0 selfhosted,Google,yahoo360} 6 q301_com_forum nominal {0,1}~{no,yes} 105 2 0 7 q302_com_chatroom nominal {0,1}~{no,yes} 105 2 0 8 q303_com_socnet nominal {0,1}~{no,yes} 105 2 0 {0,no,BBS,mailinglists,youtube, 9 q304_com_other nominal twitter,MSN,newsgroups,USENET, 105 11 0 Pathfinder,hi5} {1,2,3,4,-1} 10 q401_char_blog nominal ~{personnal,informative,professional, 0 5 0 corporative,other} 11 q402_char_blog_other nominal {0,political,entertaining} 0 3 0 {0,1,2,3,4,5,6,7}~{no,everyday, 4-6 times/week,2-3 times/week, 12 q501_media_internet nominal 0 8 0 1 time/week,1 time/2weeks, 1 time/month,less often} {0,1,2,3,4,5,6,7}~{no,everyday, 4-6 times/week,2-3 times/week, 13 q502_media_tv nominal 0 8 0 1 time/week,1 time/2weeks, 1 time/month,less often} {0,1,2,3,4,5,6,7}~{no,everyday, 4-6 times/week,2-3 times/week, 14 q503_media_radio nominal 0 8 0 1 time/week,1 time/2weeks, 1 time/month,less often} Analysis Report 000-0007 21
  22. 22. # Name Type Values Missing Distinct Unique {0,1,2,3,4,5,6,7}~{no,everyday, 4-6 times/week,2-3 times/week, 15 q504_media_newspapers nominal 0 8 0 1 time/week,1 time/2weeks, 1 time/month,less often} {0,1,2,3,4,5,6,7}~{no,everyday, 4-6 times/week,2-3 times/week, 16 q505_media_magazines nominal 0 8 0 1 time/week,1 time/2weeks, 1 time/month,less often} 17 q601_socweb_facebook nominal {0,1}~{no,yes} 20 2 0 18 q602_socweb_youtube nominal {0,1}~{no,yes} 20 2 0 19 q603_socweb_twitter nominal {0,1}~{no,yes} 20 2 0 20 q604_socweb_frienfeed nominal {0,1}~{no,yes} 20 2 0 21 q605_socweb_delicious nominal {0,1}~{no,yes} 20 2 0 22 q606_socweb_Google reader nominal {0,1}~{no,yes} 20 2 0 23 q607_socweb_lastfm nominal {0,1}~{no,yes} 20 2 0 24 q608_socweb_flickr nominal {0,1}~{no,yes} 20 2 0 25 q609_socweb_linkedln nominal {0,1}~{no,yes} 20 2 0 26 q610_socweb_buzz nominal {0,1}~{no,yes} 20 2 0 27 q611_socweb_netvibes nominal {0,1}~{no,yes} 20 2 0 28 q612_socweb_myspace nominal {0,1}~{no,yes} 20 2 0 29 q613_socweb_mogulus nominal {0,1}~{no,yes} 20 2 0 30 q614_socweb_ustream nominal {0,1}~{no,yes} 20 2 0 {0,sync,hi5,plurk,digg,stixoi,picasa, 31 q615_socweb_other nominal 20 8 1 none} {1,2,3,4,5,6}~{one,two,three,four, 32 q7_num_blogs nominal 0 6 0 five,more than five} 33 q801_about_personal nominal {0,1}~{no,yes} 5 2 0 34 q802_about_political nominal {0,1}~{no,yes} 5 2 0 35 q803_about_social nominal {0,1}~{no,yes} 5 2 0 36 q804_about_tech nominal {0,1}~{no,yes} 5 2 0 37 q805_about_music nominal {0,1}~{no,yes} 5 2 0 38 q806_about_news nominal {0,1}~{no,yes} 5 2 0 39 q807_about_art_culture nominal {0,1}~{no,yes} 5 2 0 40 q808_about_media nominal {0,1}~{no,yes} 5 2 0 41 q809_about_sport nominal {0,1}~{no,yes} 5 2 0 42 q810_about_gaming nominal {0,1}~{no,yes} 5 2 0 43 q811_about_gossip nominal {0,1}~{no,yes} 5 2 0 44 q812_about_science nominal {0,1}~{no,yes} 5 2 0 45 q813_about_religion nominal {0,1}~{no,yes} 5 2 0 46 q814_about_health nominal {0,1}~{no,yes} 5 2 0 Analysis Report 000-0007 22
  23. 23. # Name Type Values Missing Distinct Unique 47 q815_about_other nominal {0,environment,education,travel} 5 4 0 48 q9_anonymity nominal {1,2}~{yes,no} 9 2 0 {1,2,3,4,-1} ~{realname,nickname,anonymous, 49 q1001_anonymity_you nominal 1 5 0 nickname/anonymous_but_thinking_ use_realname,other} 50 q1002_anonymity_you_other nominal {0,both,front_name,pseudo_known} 1 4 0 51 q1101_why_blog_opinion nominal {1,2,3,4}~{no,little,enough,definetely} 0 4 0 52 q1102_why_blog_share_thought nominal {1,2,3,4}~{no,little,enough,definetely} 0 4 0 53 q1103_why_blog_contacts nominal {1,2,3,4}~{no,little,enough,definetely} 0 4 0 54 q1104_why_blog_inform_ff nominal {1,2,3,4}~{no,little,enough,definetely} 0 4 0 55 q1105_why_blog_known nominal {1,2,3,4}~{no,little,enough,definetely} 0 4 0 56 q1106_why_blog_money_career nominal {1,2,3,4}~{no,little,enough,definetely} 0 4 0 57 q1107_why_blog_cv nominal {1,2,3,4}~{no,little,enough,definetely} 0 4 0 58 q1108_why_blog_customers nominal {1,2,3,4}~{no,little,enough,definetely} 0 4 0 59 q1201_success_personal_satisf nominal {1,2,3,4}~{no,little,enough,definetely} 15 4 0 60 q1202_success_comments nominal {1,2,3,4}~{no,little,enough,definetely} 30 4 0 61 q1203_success_visitors nominal {1,2,3,4}~{no,little,enough,definetely} 25 4 0 62 q1204_success_links nominal {1,2,3,4}~{no,little,enough,definetely} 31 4 0 63 q1205_success_users nominal {1,2,3,4}~{no,little,enough,definetely} 54 4 0 64 q1206_success_mediareport nominal {1,2,3,4}~{no,little,enough,definetely} 51 4 0 65 q1207_success_money nominal {1,2,3,4}~{no,little,enough,definetely} 55 4 0 66 q1208_success_leads nominal {1,2,3,4}~{no,little,enough,definetely} 53 4 0 {1,2,3,4,5}~{none,little,enough,very, 67 q13_satisfied nominal 0 5 0 no_opinion} {1,2,3,4}~ 68 q14_blog_journal nominal 0 4 0 {yes,no,sometimes, don't_know} 69 q15_copyright nominal {1,2,3,4} 2 4 0 {1,2,3,4,5,6}~{never,sometimes, 70 q16_more_time nominal 0 6 0 often,very_often,always,no_answer} {1,2,3,4,5}~{none,little,very, 71 q1701_greekprob_economy nominal 0 5 0 absolutely,no_opinion} {1,2,3,4,5}~{none,little,very, 72 q1702_greekprob_enviroment nominal 0 5 0 absolutely,no_opinion} {1,2,3,4,5}~{none,little,very, 73 q1703_greekprob_humanrights nominal 0 5 0 absolutely,no_opinion} 74 q1704_greekprob_education nominal 0 5 0 {1,2,3,4,5}~{none,little,very, Analysis Report 000-0007 23
  24. 24. # Name Type Values Missing Distinct Unique absolutely,no_opinion} {1,2,3,4,5}~{none,little,very, 75 q1705_greekprob_insurance nominal 0 5 0 absolutely,no_opinion} {1,2,3,4,5}~{none,little,very, 76 q1706_greekprob_externalpolicy nominal 0 5 0 absolutely,no_opinion} {1,2,3,4,5}~{none,little,very, 77 q1707_greekprob_corruption nominal 0 5 0 absolutely,no_opinion} {1,2,3,4,5}~{none,little,very, 78 q1708_greekprob_culture nominal 0 5 0 absolutely,no_opinion} {1,2,3,4,5}~{none,little,very, 79 q1709_greekprob_technology nominal 0 5 0 absolutely,no_opinion} {1,2,3,4,5}~{none,little,very, 80 q17010_greekprob_healthsys nominal 0 5 0 absolutely,no_opinion} 81 q18_advert nominal {1,2}~{yes,no} 0 2 0 {0,1,2,3,4,5,6,7,8,9,-1} ~{not_important,<50€,<100€,<250€, 82 q19_advert_benef nominal 705 11 2 <500€,<750€,<1000€,<2500€, >2500€,no_answer} 83 q2001_affect_career_known nominal {1,0}~{yes,no} 1 2 0 84 q2002_affect_career_cv nominal {1,0}~{yes,no} 1 2 0 85 q2003_affect_career_prestige nominal {1,0}~{yes,no} 1 2 0 86 q2004_affect_career_changedep nominal {1,0}~{yes,no} 1 2 0 87 q2005_affect_career_leftjob nominal {1,0}~{yes,no} 1 2 0 88 q2006_affect_career_lostjob nominal {1,0}~{yes,no} 1 2 0 89 q2007_affect_career_noneofthem nominal {1,0}~{yes,no} 1 2 0 {0,nothing,contacts,new_job, 90 q2008_affect_career_other nominal 1 5 0 share_opinions} 91 q2101_invitation_event nominal {1,0}~{yes,no} 460 2 0 92 q2102_invitation_discussion nominal {1,0}~{yes,no} 460 2 0 93 q2103_invitation_campaign nominal {1,0}~{yes,no} 460 2 0 94 q2104_invitation_press nominal {1,0}~{yes,no} 460 2 0 95 q2105_invitation_tv/radio nominal {1,0}~{yes,no} 460 2 0 96 q2106_invitation_speaker nominal {1,0}~{yes,no} 460 2 0 {1,2,3,4,5,6}~{<1h,1-3h,3-5h, 97 q22_time nominal 0 6 0 5-10h,10-20h,>20h} Analysis Report 000-0007 24
  25. 25. # Name Type Values Missing Distinct Unique {1,2,3,4,5,6,7,8,9,10,11,12}~{<1000, <5000,<10000,<20000,<30000,<50 98 q23_unique_visitors nominal 0 12 1 000,<75000,<100000,<250000, <500000,>500000,no_answer} {1,2,3,4,5,6,7,-1} ~{private_servant,civil_servant,free- 99 q2401_occupation nominal 6 8 0 lancer,businessman,unemployed, pupil/student,retired,other} 100 q2402_occupation_other nominal {0,journalist,educator,employee} 6 4 0 {1,2,3,4,5,6,7,8,9,10,11,12,13,14, 15,16,17,18,19,20,-1} ~{media,technology,banking, insurances,education,art/culture, 47 101 q25_occupation_about nominal services,tourism,health,constructions 21 0 ,real_estate,industry,transports, nutrition,entertainment,agriculture, public_sector,DEKO,retail_trade, non_governmental_org,other} {1,2,3,4,5,6}~{high_school, 102 q26_education nominal middle_school, 0 6 0 tech_inst,bachelor,master,phd} {1,2,3,4,5,6,7,-1}~{<18,18-25, 103 q27_age nominal 0 8 0 26-35,36-45,46-55,56-65,>65} {1,2,3,4,5,6}~{<10000€,<20000€, 104 q28_annual nominal <40000€,<60000€,>60000€, 0 6 0 no_answer} 105 q2901_home nominal {1,2,3,-1}~{greece,EU,USA,other} 3 4 0 106 q2902_home_other nominal {0,australia,canada,serbia} 3 4 1 107 q30_home_greece nominal {attiki,thessaloniki,other} 232 3 0 108 q31_sex nominal {1,2}~{male,female} 1 2 0 Table x: Analytical description of data set attributes Analysis Report 000-0007 25
  26. 26. Analysis Report 000-0007 26
  27. 27. Figure x: Visualization of the data set’s distribution, according to variable ‘sex’. Analysis Report 000-0007 27
  28. 28. Appendix III: Rules discovered List of significant rules discovered Apart from the most significant rules that were referred to in the analysis section and out of the huge bulk of rules that were found during the study of the given data set, a number of other rules are definitely worth or mentioning. These are referred to in the Table XX that follows. # Rule 1 If q101_platform_blogspot = 0 and q1206_success_mediareport = 2 and q2401_occupation = 1 then q22_time=3 (16.0/5.0) 2 If q102_platform_worpress = 1 then q101_platform_blogspot=0 (245.0/19.0) If q102_platform_worpress = 1 and q1203_success_visitors = 2 and q808_about_media = 0 then q606_socweb_Google reader=1 3 (19.0/4.0) 4 If q102_platform_worpress = 1 and q28_annual = 1 and q1702_greekprob_enviroment = 3 then q7_num_blogs=2 (12.0/0.0) 5 If q102_platform_worpress = 1 and q1205_success_users = 3 and q22_time = 6 then q603_socweb_twitter=1 (5.0/0.0) 6 If q103_platform_livespaces = 1 then q101_platform_blogspot=0 (8.0/2.0) 7 If q104_platform_pathfinder = 1 then q101_platform_blogspot=0 (24.0/3.0) 8 If q105_platform_other = selfhosted then q101_platform_blogspot=0 (7.0/0.0) 9 If q105_platform_other = drupal then q101_platform_blogspot=0 (3.0/0.0) 10 If q105_platform_other = musicheaven then q101_platform_blogspot=0 (2.0/0.0) 11 If q302_com_chatroom = 1 and q303_com_socnet = 0 and q802_about_political = 1 then q301_com_forum=0 (63.0/23.0) 12 If q303_com_socnet = 1 and q302_com_chatroom = 0 then q301_com_forum=0 (172.0/69.0) 13 If q303_com_socnet = 1 and q301_com_forum = 1 then q302_com_chatroom=1 (151.0/69.0) 14 If q303_com_socnet = 1 and q607_socweb_lastfm = 1 and q606_socweb_Google reader = 0 then q612_socweb_myspace=1 15 If q304_com_other = 0 and q301_com_forum = 0 and q303_com_socnet = 0 then q302_com_chatroom=1 (88.0/25.0) 16 If q304_com_other = no then q301_com_forum=0 (83.0/0.0) 17 If q304_com_other = no and q502_media_tv = 6 then q501_media_internet=7 (2.0/0.0) 18 If q304_com_other = no and q2106_invitation_speaker = 1 then q615_socweb_other=sync (3.0/1.0) 19 If q401_char_blog = 2 then q801_about_personal=0 (250.0/70.0) 20 If q401_char_blog = 3 then q801_about_personal=0 (43.0/9.0) 21 If q401_char_blog = -1 and q1706_greekprob_externalpolicy = 3 then q801_about_personal=0 (10.0/1.0) 22 If q401_char_blog = -1 and q1205_success_users = 2 and q402_char_blog_other = 0 then q1001_anonymity_you=1 (14.0/3.0) 23 If q402_char_blog_other = political then q401_char_blog=-1 (29.0/3.0) 24 If q402_char_blog_other = entertaining then q401_char_blog=-1 (17.0/2.0) 25 If q501_media_internet = 2 and q27_age = 2 and q1203_success_visitors = 3 then q505_media_magazines=5 (8.0/0.0) If q502_media_tv = 2 and q1206_success_mediareport = 3 and q2102_invitation_discussion = 1 then q808_about_media=1 26 (7.0/1.0) 27 If q502_media_tv = 0 and q2102_invitation_discussion = 0 then q504_media_newspapers=0 (5.0/2.0) 28 If q502_media_tv = 2 and q31_sex = 2 and q1709_greekprob_technology = 4 then q504_media_newspapers=5 (14.0/5.0) 29 If q502_media_tv = 1 and q505_media_magazines = 1 then q504_media_newspapers=1 (19.0/2.0) Analysis Report 000-0007 28
  29. 29. # Rule 30 If q502_media_tv = 0 and q1709_greekprob_technology = 3 then q505_media_magazines=0 (9.0/1.0) 31 If q503_media_radio = 0 and q9_anonymity = 2 then q502_media_tv=0 (5.0/2.0) 32 If q503_media_radio = 5 and q25_occupation_about = 6 then q502_media_tv=5 (6.0/2.0) 33 If q503_media_radio = 5 and q25_occupation_about = 10 then q502_media_tv=4 (3.0/0.0) 34 If q503_media_radio = 0 and q2401_occupation = 2 then q505_media_magazines=0 (9.0/1.0) 35 If q504_media_newspapers = 1 and q2003_affect_career_prestige = 1 then q505_media_magazines=1 (12.0/5.0) 36 If q504_media_newspapers = 0 then q505_media_magazines=0 (42.0/8.0) If q504_media_newspapers = 6 and q1706_greekprob_externalpolicy = 1 and q1103_why_blog_contacts = 4 then 37 q502_media_tv=6 (3.0/0.0) 38 If q504_media_newspapers = 0 and q301_com_forum = 0 then q503_media_radio=0 (4.0/0.0) 39 If q504_media_newspapers = 7 and q28_annual = 6 and q15_copyright = 2 then q503_media_radio=7 (13.0/3.0) If q504_media_newspapers = 7 and q1102_why_blog_share_thoughts = 3 and q806_about_news = 0 then q503_media_radio=7 40 (25.0/9.0) 41 If q504_media_newspapers = 7 and q30_home_greece = other and q804_about_tech = 1 then q503_media_radio=7 (8.0/2.0) If q504_media_newspapers = 3 and q1708_greekprob_culture = 2 and q2102_invitation_discussion = 0 then 42 q505_media_magazines=3 (9.0/2.0) 43 If q504_media_newspapers = 1 and q1104_why_blog_inform_ff = 3 then q505_media_magazines=3 (6.0/1.0) 44 If q504_media_newspapers = 2 and q503_media_radio = 3 then q505_media_magazines=4 (17.0/7.0) 45 If q504_media_newspapers = 6 and q801_about_personal = 0 then q505_media_magazines=6 (25.0/12.0) 46 If q504_media_newspapers = 1 and q2101_invitation_event = 0 then q802_about_political=1 (16.0/2.0) 47 If q505_media_magazines = 0 then q504_media_newspapers=0 (60.0/26.0) 48 If q505_media_magazines = 0 then q503_media_radio=0 (60.0/23.0) 49 If q505_media_magazines = 0 then q502_media_tv=0 (60.0/27.0) 50 If q505_media_magazines = 0 and q602_socweb_youtube = 0 and q503_media_radio = 1 then q501_media_internet=0 (4.0/1.0) 51 If q505_media_magazines = 0 and q303_com_socnet = 1 and q7_num_blogs = 2 then q501_media_internet=5 (4.0/1.0) 52 If q505_media_magazines = 2 and q27_age = 2 then q501_media_internet=7 (7.0/3.0) 53 If q505_media_magazines = 5 and q1702_greekprob_enviroment = 2 then q503_media_radio=7 (3.0/0.0) 54 If q505_media_magazines = 5 and q7_num_blogs = 5 then q504_media_newspapers=5 (6.0/1.0) 55 If q505_media_magazines = 7 and q503_media_radio = 7 then q504_media_newspapers=7 (59.0/21.0) If q505_media_magazines = 7 and q802_about_political = 0 and q1102_why_blog_share_thoughts = 4 then 56 q504_media_newspapers=7 (49.0/21.0) 57 If q505_media_magazines = 4 and q16_more_time = 4 then q7_num_blogs=2 (27.0/8.0) 58 If q505_media_magazines = 3 and q102_platform_worpress = 1 then q504_media_newspapers=3 (20.0/7.0) 59 If q505_media_magazines = 7 and q1201_success_personal_satisf = 1 and q13_satisfied = 3 then q26_education=2 (7.0/0.0) 60 If q505_media_magazines = 0 and q802_about_political = 1 and q1102_why_blog_share_thoughts = 3 then q27_age=5 (8.0/1.0) 61 If q505_media_magazines = 0 and q25_occupation_about = -1 and q501_media_internet = 1 then q27_age=5 (4.0/0.0) If q505_media_magazines = 4 and q1205_success_users = 4 and q1208_success_leads = 1 then q30_home_greece=thessaloniki 62 (12.0/3.0) If q601_socweb_facebook = 1 and q612_socweb_myspace = 1 and q1703_greekprob_humanrights = 3 then q303_com_socnet=1 63 (34.0/6.0) Analysis Report 000-0007 29
  30. 30. # Rule If q601_socweb_facebook = 0 and q602_socweb_youtube = 0 and q615_socweb_other = 0 then q606_socweb_Google reader=1 64 (46.0/15.0) 65 If q601_socweb_facebook = 1 and q302_com_chatroom = 1 and q301_com_forum = 1 then q303_com_socnet=1 (95.0/34.0) If q601_socweb_facebook = 1 and q301_com_forum = 0 and q304_com_other = 0 and q302_com_chatroom = 0 then 66 q303_com_socnet=1 (92.0/13.0) 67 If q601_socweb_facebook = 0 and q7_num_blogs = 3 and q1705_greekprob_insurance = 3 then q22_time=5 (10.0/3.0) 68 If q601_socweb_facebook = 0 and q807_about_art_culture = 1 and q1702_greekprob_enviroment = 3 then q22_time=4 (25.0/11.0) 69 If q601_socweb_facebook = 0 and q25_occupation_about = 5 and q1204_success_links = 2 then q27_age=5 (10.0/2.0) If q603_socweb_twitter = 1 and q2105_invitation_tv/radio = 1 and q805_about_music = 1 then q613_socweb_mogulus=1 70 (10.0/4.0) If q603_socweb_twitter = 1 and q1104_why_blog_inform_ff = 4 and q1203_success_visitors = 3 then q503_media_radio=5 71 (6.0/1.0) 72 If q603_socweb_twitter = 1 and q609_socweb_linkedln = 1 then q604_socweb_frienfeed=1 (48.0/16.0) If q603_socweb_twitter = 1 and q605_socweb_delicious = 1 and q2104_invitation_press = 1 then q604_socweb_frienfeed=1 73 (6.0/2.0) If q603_socweb_twitter = 1 and q604_socweb_frienfeed = 1 and q801_about_personal = 1 then q605_socweb_delicious=1 74 (31.0/5.0) 75 If q603_socweb_twitter = 1 and q23_unique_visitors = 1 and q804_about_tech = 1 then q605_socweb_delicious=1 (25.0/8.0) 76 If q603_socweb_twitter = 1 then q606_socweb_Google reader=1 (148.0/47.0) 77 If q603_socweb_twitter = 1 and q605_socweb_delicious = 1 then q608_socweb_flickr=1 (73.0/23.0) 78 If q603_socweb_twitter = 1 and q503_media_radio = 1 then q608_socweb_flickr=1 (15.0/5.0) 79 If q603_socweb_twitter = 1 and q604_socweb_frienfeed = 1 and q808_about_media = 1 then q609_socweb_linkedln=1 (8.0/2.0) 80 If q603_socweb_twitter = 1 and q605_socweb_delicious = 1 then q804_about_tech=1 (69.0/13.0) 81 If q604_socweb_frienfeed = 1 then q603_socweb_twitter=1 (70.0/12.0) 82 If q604_socweb_frienfeed = 1 and q1107_why_blog_cv = 4 then q614_socweb_ustream=1 (6.0/2.0) If q604_socweb_frienfeed = 1 and q1101_why_blog_opinion = 2 and q2103_invitation_campaign = 1 then 83 q615_socweb_other=digg (4.0/1.0) 84 If q605_socweb_delicious = 1 and q804_about_tech = 1 then q603_socweb_twitter=1 (38.0/18.0) If q605_socweb_delicious = 1 and q2101_invitation_event = 1 and q1201_success_personal_satisf = 3 then 85 q604_socweb_frienfeed=1 (9.0/3.0) If q605_socweb_delicious = 1 and q2003_affect_career_prestige = 1 and q1101_why_blog_opinion = 1 then 86 q611_socweb_netvibes=1 (4.0/0.0) If q605_socweb_delicious = 1 and q503_media_radio = 1 and q1206_success_mediareport = 2 then q611_socweb_netvibes=1 87 (5.0/1.0) If q606_socweb_Google reader = 1 and q605_socweb_delicious = 1 and q1101_why_blog_opinion = 3 then 88 q609_socweb_linkedln=1 (16.0/5.0) 89 If q607_socweb_lastfm = 1 and q1104_why_blog_inform_ff = 2 and q606_socweb_Google reader = 0 then q22_time=1 (15.0/4.0) Analysis Report 000-0007 30
  31. 31. # Rule 90 If q608_socweb_flickr = 1 and q26_education = 6 then q7_num_blogs=3 (14.0/6.0) 91 If q609_socweb_linkedln = 1 and q7_num_blogs = 1 then q804_about_tech=1 (13.0/2.0) 92 If q610_socweb_buzz = 1 then q606_socweb_Google reader=1 (47.0/20.0) 93 If q610_socweb_buzz = 1 and q1105_why_blog_known = 3 then q611_socweb_netvibes=1 (5.0/2.0) If q611_socweb_netvibes = 1 and q807_about_art_culture = 1 and q810_about_gaming = 0 then q605_socweb_delicious=1 94 (8.0/1.0) 95 If q612_socweb_myspace = 1 and q14_blog_journal = 3 and q1101_why_blog_opinion = 4 then q303_com_socnet=1 (15.0/4.0) 96 If q612_socweb_myspace = 1 and q802_about_political = 0 and q805_about_music = 1 then q607_socweb_lastfm=1 (49.0/22.0) 97 If q612_socweb_myspace = 1 and q1001_anonymity_you = 2 then q805_about_music=1 (49.0/16.0) 98 If q613_socweb_mogulus = 1 and q607_socweb_lastfm = 1 then q614_socweb_ustream=1 (4.0/1.0) 99 If q615_socweb_other = none then q602_socweb_youtube=0 (13.0/0.0) 100 If q7_num_blogs = 4 and q812_about_science = 1 then q505_media_magazines=6 (8.0/1.0) 101 If q7_num_blogs = 6 and q401_char_blog = -1 then q22_time=6 (13.0/6.0) 102 If q801_about_personal = 0 and q806_about_news = 1 and q605_socweb_delicious = 0 then q401_char_blog=2 (103.0/28.0) 103 If q801_about_personal = 0 and q303_com_socnet = 1 then q401_char_blog=2 (81.0/32.0) 104 If q801_about_personal = 0 and q27_age = 2 then q401_char_blog=2 (19.0/4.0) 105 If q801_about_personal = 0 and q806_about_news = 1 then q401_char_blog=2 (132.0/53.0) 106 If q801_about_personal = 0 and q303_com_socnet = 1 then q401_char_blog=2 (77.0/31.0) 107 If q801_about_personal = 1 and q808_about_media = 1 then q805_about_music=1 (52.0/20.0) 108 If q802_about_political = 0 and q1207_success_money = 2 and q608_socweb_flickr = 0 then q303_com_socnet=1 (11.0/2.0) 109 If q802_about_political = 0 and q801_about_personal = 0 then q803_about_social=0 (227.0/51.0) 110 If q802_about_political = 0 and q805_about_music = 0 then q803_about_social=0 (135.0/56.0) 111 If q802_about_political = 1 and q808_about_media = 1 and q401_char_blog = 2 then q806_about_news=1 (39.0/1.0) 112 If q802_about_political = 1 and q807_about_art_culture = 1 and q808_about_media = 1 then q806_about_news=1 (50.0/8.0) 113 If q802_about_political = 1 and q505_media_magazines = 6 and q23_unique_visitors = 1 then q806_about_news=1 (26.0/8.0) 114 If q803_about_social = 1 and q806_about_news = 1 then q802_about_political=1 (247.0/49.0) 115 If q803_about_social = 1 and q31_sex = 1 then q802_about_political=1 (192.0/69.0) 116 If q803_about_social = 0 and q504_media_newspapers = 3 and q28_annual = 1 then q505_media_magazines=3 (13.0/5.0) If q803_about_social = 1 and q27_age = 5 and q9_anonymity = 1 and q806_about_news = 0 then q601_socweb_facebook=0 117 (35.0/8.0) 118 If q803_about_social = 1 and q401_char_blog = 2 then q806_about_news=1 (90.0/34.0) 119 If q803_about_social = 1 and q503_media_radio = 6 and q2103_invitation_campaign = 0 then q813_about_religion=1 (10.0/3.0) 120 If q804_about_tech = 1 and q811_about_gossip = 1 and q1206_success_mediareport = 1 then q810_about_gaming=1 (14.0/6.0) If q804_about_tech = 1 and q1101_why_blog_opinion = 2 and q1707_greekprob_corruption = 2 then q810_about_gaming=1 121 (7.0/2.0) 122 If q804_about_tech = 0 and q28_annual = 5 then q504_media_newspapers=1 (13.0/6.0) 123 If q804_about_tech = 1 and q608_socweb_flickr = 1 then q1001_anonymity_you=1 (65.0/27.0) 124 If q805_about_music = 1 and q610_socweb_buzz = 1 and q503_media_radio = 1 then q612_socweb_myspace=1 (13.0/4.0) 125 If q805_about_music = 0 and q401_char_blog = -1 and q501_media_internet = 1 then q801_about_personal=0 (60.0/4.0) 126 If q805_about_music = 1 and q2101_invitation_event = 1 and q607_socweb_lastfm = 1 then q22_time=4 (12.0/3.0) 127 If q805_about_music = 1 and q1708_greekprob_culture = 4 then q807_about_art_culture=1 (187.0/41.0) 128 If q805_about_music = 1 and q803_about_social = 1 then q807_about_art_culture=1 (103.0/49.0) Analysis Report 000-0007 31
  32. 32. # Rule 129 If q805_about_music = 1 and q806_about_news = 1 and q503_media_radio = 2 then q809_about_sport=1 (23.0/8.0) 130 If q805_about_music = 0 and q1205_success_users = 1 and q16_more_time = 3 then q30_home_greece=other (16.0/4.0) If q805_about_music = 0 and q1206_success_mediareport = 2 and q806_about_news = 1 then q30_home_greece=other 131 (29.0/12.0) 132 If q806_about_news = 1 and q811_about_gossip = 1 and q1001_anonymity_you = 2 then q809_about_sport=1 (17.0/2.0) 133 If q806_about_news = 1 and q811_about_gossip = 1 then q808_about_media=1 (39.0/7.0) 134 If q806_about_news = 1 and q804_about_tech = 1 and q2007_affect_career_noneofthem = 0 then q808_about_media=1 (27.0/9.0) 135 If q806_about_news = 1 and q25_occupation_about = 1 and q812_about_science = 1 then q808_about_media=1 (6.0/0.0) 136 If q806_about_news = 1 and q804_about_tech = 1 then q812_about_science=1 (86.0/39.0) 137 If q806_about_news = 1 and q505_media_magazines = 0 then q813_about_religion=1 (12.0/5.0) If q807_about_art_culture = 1 and q1706_greekprob_externalpolicy = 3 and q1205_success_users = 3 then 138 q505_media_magazines=6 (13.0/3.0) 139 If q807_about_art_culture = 1 and q1205_success_users = 3 and q1203_success_visitors = 3 then q7_num_blogs=2 (28.0/10.0) 140 If q807_about_art_culture = 1 and q803_about_social = 1 then q805_about_music=1 (278.0/89.0) 141 If q808_about_media = 1 and q1201_success_personal_satisf = 4 and q804_about_tech = 0 then q502_media_tv=2 (43.0/21.0) 142 If q808_about_media = 1 and q809_about_sport = 1 and q30_home_greece = attiki then q811_about_gossip=1 (27.0/13.0) 143 If q808_about_media = 1 and q2002_affect_career_cv = 1 and q101_platform_blogspot = 1 then q811_about_gossip=1 (8.0/3.0) 144 If q808_about_media = 1 and q1205_success_users = 2 and q31_sex = 2 then q811_about_gossip=1 (4.0/0.0) 145 If q810_about_gaming = 1 then q804_about_tech=1 (21.0/7.0) 146 If q812_about_science = 1 and q301_com_forum = 1 then q804_about_tech=1 (110.0/42.0) 147 If q812_about_science = 1 and q608_socweb_flickr = 1 and q101_platform_blogspot = 0 then q806_about_news=1 (19.0/4.0) 148 If q812_about_science = 1 and q601_socweb_facebook = 0 and q801_about_personal = 1 then q813_about_religion=1 (37.0/16.0) 149 If q812_about_science = 1 and q806_about_news = 1 and q303_com_socnet = 1 then q814_about_health=1 (42.0/13.0) 150 If q812_about_science = 1 and q806_about_news = 1 and q502_media_tv = 1 then q814_about_health=1 (30.0/14.0) 151 If q812_about_science = 1 and q28_annual = 5 and q2104_invitation_press = 1 then q26_education=6 (7.0/0.0) 152 If q813_about_religion = 1 and q804_about_tech = 1 then q812_about_science=1 (25.0/4.0) 153 If q814_about_health = 1 and q2105_invitation_tv/radio = 1 and q401_char_blog = 2 then q505_media_magazines=2 (10.0/4.0) 154 If q814_about_health = 1 then q807_about_art_culture=1 (80.0/28.0) 155 If q814_about_health = 1 then q812_about_science=1 (110.0/31.0) 156 If q814_about_health = 1 and q1204_success_links = 3 and q13_satisfied = 2 then q22_time=5 (9.0/1.0) 157 If q815_about_other = education and q505_media_magazines = 6 then q401_char_blog=-1 (4.0/0.0) 158 If q9_anonymity = 2 then q1001_anonymity_you=1 (100.0/19.0) 159 If q1001_anonymity_you = 1 and q13_satisfied = 2 and q7_num_blogs = 2 then q9_anonymity=2 (28.0/12.0) 160 If q1001_anonymity_you = 1 and q1107_why_blog_cv = 4 and q30_home_greece = attiki then q9_anonymity=2 (10.0/3.0) 161 If q1001_anonymity_you = -1 and q31_sex = 2 then q1002_anonymity_you_other=front_name (15.0/4.0) 162 If q1001_anonymity_you = -1 and q401_char_blog = 2 then q1002_anonymity_you_other=both (7.0/1.0) 163 If q1001_anonymity_you = -1 and q101_platform_blogspot = 0 then q1002_anonymity_you_other=both (6.0/2.0) 164 If q1001_anonymity_you = -1 then q1002_anonymity_you_other=pseudo_known (8.0/3.0) 165 If q1002_anonymity_you_other = both then q1001_anonymity_you=-1 (12.0/0.0) 166 If q1002_anonymity_you_other = front_name then q1001_anonymity_you=-1 (11.0/0.0) 167 If q1002_anonymity_you_other = pseudo_known then q1001_anonymity_you=-1 (16.0/7.0) 168 If q1101_why_blog_opinion = 4 and q805_about_music = 1 and q1001_anonymity_you = -1 then q401_char_blog=-1 (6.0/1.0) Analysis Report 000-0007 32
  33. 33. # Rule 169 If q1101_why_blog_opinion = 3 and q1203_success_visitors = 4 and q19_advert_benef = 1 then q7_num_blogs=2 (14.0/2.0) 170 If q1101_why_blog_opinion = 4 and q26_education = 3 then q27_age=4 (26.0/9.0) 171 If q1104_why_blog_inform_ff = 4 and q503_media_radio = 4 then q502_media_tv=5 (3.0/0.0) 172 If q1106_why_blog_money_career = 2 and q502_media_tv = 7 and q401_char_blog = 2 then q615_socweb_other=digg (2.0/0.0) If q1108_why_blog_customers = 4 and q2105_invitation_tv/radio = 0 and q302_com_chatroom = 0 then q401_char_blog=3 173 (14.0/3.0) 174 If q1108_why_blog_customers = 3 and q804_about_tech = 1 and q806_about_news = 0 then q401_char_blog=3 (8.0/3.0) 175 If q1108_why_blog_customers = 4 and q505_media_magazines = 6 and q303_com_socnet = 0 then q26_education=3 (7.0/0.0) 176 If q1202_success_comments = 1 and q502_media_tv = 7 and q1205_success_users = 1 then q503_media_radio=7 (9.0/2.0) 177 If q1202_success_comments = 4 and q1708_greekprob_culture = 2 and q301_com_forum = 1 then q27_age=2 (7.0/2.0) 178 If q1203_success_visitors = 4 and q2105_invitation_tv/radio = 1 and q7_num_blogs = 6 then q22_time=6 (12.0/3.0) If q1203_success_visitors = 1 and q2007_affect_career_noneofthem = 0 and q606_socweb_Google reader = 1 then 179 q401_char_blog=-1 (5.0/0.0) 180 If q1204_success_links = 3 and q14_blog_journal = 1 and q606_socweb_Google reader = 1 then q503_media_radio=3 (31.0/13.0) 181 If q1204_success_links = 3 and q27_age = 2 and q16_more_time = 3 then q503_media_radio=3 (16.0/6.0) 182 If q1205_success_users = 4 and q26_education = 2 and q1703_greekprob_humanrights = 4 then q503_media_radio=7 (14.0/6.0) 183 If q1205_success_users = 3 and q804_about_tech = 1 then q606_socweb_Google reader=1 (52.0/22.0) 184 If q1205_success_users = 2 and q503_media_radio = 7 and q30_home_greece = attiki then q7_num_blogs=3 (11.0/3.0) 185 If q1208_success_leads = 3 and q503_media_radio = 5 then q101_platform_blogspot=0 (2.0/0.0) 186 If q13_satisfied = 3 and q1104_why_blog_inform_ff = 3 and q605_socweb_delicious = 1 then q505_media_magazines=5 (5.0/0.0) 187 If q13_satisfied = 5 and q304_com_other = no then q501_media_internet=7 (3.0/1.0) 188 If q13_satisfied = 3 and q501_media_internet = 3 then q503_media_radio=2 (20.0/9.0) 189 If q13_satisfied = 2 and q805_about_music = 1 then q7_num_blogs=2 (62.0/29.0) 190 If q13_satisfied = 1 and q15_copyright = 3 then q27_age=7 (4.0/1.0) 191 If q14_blog_journal = 1 and q1208_success_leads = 3 then q22_time=5 (5.0/1.0) 192 If q14_blog_journal = 2 and q1706_greekprob_externalpolicy = 1 and q7_num_blogs = 2 then q13_satisfied=2 (11.0/2.0) 193 If q14_blog_journal = 2 and q17010_greekprob_healthsys = 3 and q1701_greekprob_economy = 4 then q13_satisfied=2 (17.0/5.0) 194 If q16_more_time = 1 and q1103_why_blog_contacts = 2 then q501_media_internet=5 (3.0/1.0) 195 If q16_more_time = 5 and q805_about_music = 0 and q601_socweb_facebook = 0 then q27_age=4 (81.0/27.0) 196 If q16_more_time = 5 and q502_media_tv = 1 then q27_age=4 (84.0/37.0) 197 If q1703_greekprob_humanrights = 3 and q22_time = 3 and q505_media_magazines = 4 then q503_media_radio=4 (6.0/1.0) If q1703_greekprob_humanrights = 4 and q1706_greekprob_externalpolicy = 1 and q1202_success_comments = 3 then 198 q601_socweb_facebook=0 (27.0/10.0) 199 If q1703_greekprob_humanrights = 4 and q501_media_internet = 3 then q26_education=3 (18.0/7.0) 200 If q1704_greekprob_education = 3 and q505_media_magazines = 6 and q26_education = 2 then q503_media_radio=4 (7.0/1.0) 201 If q1705_greekprob_insurance = 5 and q2102_invitation_discussion = 0 then q26_education=1 (3.0/1.0) If q1706_greekprob_externalpolicy = 2 and q1703_greekprob_humanrights = 2 and q804_about_tech = 0 then 202 q30_home_greece=other (18.0/3.0) 203 If q1708_greekprob_culture = 4 and q1203_success_visitors = 1 and q812_about_science = 0 then q502_media_tv=3 (23.0/9.0) If q1709_greekprob_technology = 2 and q1702_greekprob_enviroment = 4 and q304_com_other = 0 then q503_media_radio=2 204 (24.0/11.0) 205 If q1709_greekprob_technology = 2 and q1208_success_leads = 3 then q815_about_other=environment (2.0/0.0) Analysis Report 000-0007 33
  34. 34. # Rule 206 If q1709_greekprob_technology = 3 and q304_com_other = no then q30_home_greece=other (13.0/4.0) 207 If q18_advert = 1 and q1207_success_money = 4 then q401_char_blog=2 (7.0/1.0) If q2001_affect_career_known = 1 and q1108_why_blog_customers = 3 and q1709_greekprob_technology = 4 then 208 q401_char_blog=3 (8.0/2.0) 209 If q2001_affect_career_known = 1 and q805_about_music = 0 then q1001_anonymity_you=1 (60.0/11.0) 210 If q2007_affect_career_noneofthem = 0 and q1108_why_blog_customers = 4 then q2401_occupation=3 (31.0/9.0) 211 If q2104_invitation_press = 1 and q1105_why_blog_known = 3 then q22_time=3 (6.0/0.0) 212 If q2104_invitation_press = 1 and q25_occupation_about = 6 and q802_about_political = 1 then q2401_occupation=-1 (6.0/2.0) If q2106_invitation_speaker = 1 and q1201_success_personal_satisf = 3 and q1001_anonymity_you = 1 then q7_num_blogs=3 213 (14.0/4.0) 214 If q22_time = 5 and q15_copyright = 1 and q13_satisfied = 3 then q503_media_radio=2 (19.0/7.0) 215 If q22_time = 6 and q2104_invitation_press = 1 and q1107_why_blog_cv = 1 then q7_num_blogs=6 (24.0/9.0) 216 If q23_unique_visitors = 12 and q16_more_time = 2 and q601_socweb_facebook = 0 then q501_media_internet=3 (6.0/2.0) 217 If q2401_occupation = 7 then q27_age=5 (6.0/3.0) 218 If q2401_occupation = 6 and q303_com_socnet = 1 then q27_age=2 (61.0/5.0) 219 If q2401_occupation = 6 and q1203_success_visitors = 4 then q27_age=2 (26.0/3.0) 220 If q2401_occupation = 6 and q1103_why_blog_contacts = 4 and q1203_success_visitors = 3 then q27_age=2 (11.0/0.0) 221 If q2401_occupation = 5 and q7_num_blogs = 2 then q27_age=2 (4.0/0.0) 222 If q2401_occupation = 6 and q401_char_blog = -1 and q602_socweb_youtube = 1 then q27_age=2 (4.0/0.0) 223 If q2401_occupation = 7 and q1701_greekprob_economy = 3 then q27_age=6 (4.0/1.0) 224 If q2401_occupation = 6 and q26_education = 1 then q27_age=1 (3.0/0.0) 225 If q2401_occupation = 6 and q26_education = 2 and q809_about_sport = 1 then q27_age=1 (7.0/2.0) 226 If q2401_occupation = 6 and q401_char_blog = 2 then q27_age=2 (11.0/4.0) 227 If q25_occupation_about = 2 and q7_num_blogs = 2 and q801_about_personal = 0 then q503_media_radio=3 (21.0/9.0) 228 If q25_occupation_about = 6 and q502_media_tv = 5 then q503_media_radio=5 (6.0/2.0) If q25_occupation_about = 2 and q2007_affect_career_noneofthem = 0 and q303_com_socnet = 0 then q605_socweb_delicious=1 229 (15.0/6.0) 230 If q25_occupation_about = 2 and q2007_affect_career_noneofthem = 0 then q804_about_tech=1 (14.0/4.0) 231 If q25_occupation_about = 2 and q808_about_media = 1 then q804_about_tech=1 (22.0/7.0) 232 If q25_occupation_about = 1 and q27_age = 3 and q807_about_art_culture = 1 then q808_about_media=1 (7.0/1.0) If q25_occupation_about = 10 and q608_socweb_flickr = 1 and q1707_greekprob_corruption = 4 then 233 q1002_anonymity_you_other=pseudo_known (5.0/2.0) 234 If q25_occupation_about = 20 then q22_time=4 (10.0/4.0) 235 If q25_occupation_about = 17 then q2401_occupation=2 (17.0/1.0) 236 If q25_occupation_about = 9 and q7_num_blogs = 2 then q2401_occupation=2 (16.0/7.0) 237 If q25_occupation_about = 5 and q28_annual = 3 then q2401_occupation=2 (25.0/6.0) 238 If q25_occupation_about = 14 then q26_education=2 (10.0/3.0) 239 If q25_occupation_about = 5 and q7_num_blogs = 3 and q1103_why_blog_contacts = 3 then q26_education=6 (6.0/1.0) 240 If q26_education = 4 and q1101_why_blog_opinion = 2 and q16_more_time = 4 then q608_socweb_flickr=1 (13.0/3.0) If q26_education = 5 and q1201_success_personal_satisf = 2 and q609_socweb_linkedln = 1 then q614_socweb_ustream=1 241 (4.0/1.0) Analysis Report 000-0007 34
  35. 35. # Rule 242 If q26_education = 6 and q1001_anonymity_you = 1 then q812_about_science=1 (20.0/8.0) 243 If q26_education = 3 and q1202_success_comments = 1 and q809_about_sport = 0 then q30_home_greece=other (14.0/2.0) 244 If q27_age = 1 and q502_media_tv = 3 then q615_socweb_other=hi5 (4.0/1.0) 245 If q27_age = 3 and q2901_home = 2 and q14_blog_journal = 1 then q504_media_newspapers=6 (10.0/3.0) 246 If q27_age = 5 and q502_media_tv = 0 then q505_media_magazines=0 (2.0/0.0) 247 If q27_age = 4 and q28_annual = 6 then q601_socweb_facebook=0 (46.0/15.0) 248 If q27_age = 2 then q2401_occupation=6 (161.0/46.0) 249 If q27_age = 1 then q2401_occupation=6 (17.0/0.0) 250 If q27_age = 5 and q1204_success_links = 2 then q2401_occupation=3 (29.0/8.0) 251 If q27_age = 1 then q26_education=2 (14.0/3.0) 252 If q27_age = 3 and q502_media_tv = 2 and q803_about_social = 0 then q26_education=5 (33.0/12.0) 253 If q27_age = 3 and q1705_greekprob_insurance = 3 and q7_num_blogs = 1 then q26_education=5 (30.0/12.0) 254 If q28_annual = 1 and q601_socweb_facebook = 1 and q804_about_tech = 1 then q27_age=2 (31.0/12.0) 255 If q28_annual = 3 and q605_socweb_delicious = 1 and q608_socweb_flickr = 1 then q604_socweb_frienfeed=1 (3.0/0.0) 256 If q28_annual = 6 and q610_socweb_buzz = 1 and q504_media_newspapers = 3 then q810_about_gaming=1 (3.0/0.0) 257 If q28_annual = 2 and q25_occupation_about = 5 then q2401_occupation=2 (36.0/13.0) 258 If q28_annual = 5 and q805_about_music = 1 and q102_platform_worpress = 0 then q27_age=5 (5.0/0.0) Table xx: Extended list of significant rules discovered Analysis Report 000-0007 35
  36. 36. Contact Information This report was prepared by Eleutheria Kanavou, data engineer. You may contact her directly at eleutheria@datamine.it. This report was prepared for Nikos Drandakis, www.sync.gr. datamine.it 14 Meletiou Vasileiou Str 11 745 Athens, Greece T +30 6937 122 065 go@datamine.it http://datamine.it The totality of contents of this report by DataMine.it is distributed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 License. Analysis Report 000-0007 36

×