SlideShare a Scribd company logo
Recommending Communities at the Regional & City Level
John P. Verostek
markITview Research
Cambridge, MA, USA
{john} @markitview.com
ABSTRACT
An exploratory study was undertaken to compare the
community ecosystems of Boston, Silicon Valley, and New
York City. The motivation was to understand what new
groups could be recommended to improve the community
landscapes. In particular, business and technology Meetup
groups were studied with respect to type, number, size; and
the interconnectedness across groups. The core data for the
study was obtained via Meetup’s API which enables access
to community information. Using a trained, manually
coded subset of titles and topic tags, machine learning was
performed to categorize groups. Summary statistics were
then produced and cities compared. For example, Boston’s
tech Meetup foundation was built upon programming
languages such as PhP and Java. In contrast, New York
City historically has cultivated a stronger business
networking culture; however this type of group has been
less pronounced in Boston. Next, social network analysis
was utilized to identify key community groups via
centrality measures and to identify “nearest neighbors” for
certain groups. The content-based machine learning output
was combined with the social network analysis to create a
tiered, hybrid recommendation system. Lessons learned
from the study would appear generalizable to cities beyond
the three initial covered. For example, an economic advisor
could recreate the community evolution of a city, and
perform comparative regional analysis to then develop new
recommendation for community growth. The next project
steps were to move from ‘proof of concept’ towards
developing a prototype community recommendation
system.
Author Keywords
Meetup.com, community, recommendations, social network
analysis, economic development, machine learning
ACM Classification Keywords
H.3.1 [Information Storage and Retrieval]: Content
Analysis and Indexing.
INTRODUCTION
One research stream within economic development has
been knowledge diffusion (or spillover) within a region. In
1994, Saxenian analyzed the electronic industries in Silicon
Valley and Boston’s Route 128, and found that informal
networks helped Silicon Valley with economic growth [1].
Powell, et al, studied the biotechnology industry using
network connectivity as a lens for which to understand
knowledge transfer [2]. In general, these and other pre-
Internet studies of knowledge flow relied upon qualitative
research and quantitative measures such as scientific
journal citations, patents, and company financials.
In the past decade, online social media, networks, and
communities have drawn much more interest than their
offline counterparts. Traditional face-to-face communities,
whose history was illustrated in the Robert Putnam’s book
Bowling Alone [3], have received less attention from a
business standpoint or academic research perspective.
Furthermore, the line between offline and online has
become blurred such as with Meetup.com straddling both
worlds. A notable past study covering this platform was
Weinberg and Williams analysis of Howard Dean’s 2004
presidential run and his utilization of Meetup [4]. At that
time Meetup.com was in its infancy having launched
shortly after 9/11 in 2002.
More recently Meetup has seen accelerated growth
especially in New York City (its home town), Silicon
Valley, and also in London, Washington DC, Chicago,
Boston, Toronto, Los Angeles among several others.
Groups were started by organizers who each work with
their members to hold events. Communities exist across a
wide variety of interests often reflecting preexisting local
activities. However, this study focuses on business and
technology which has been a popular topic in Cambridge,
MA (and in particular Kendall Square near MIT).
Nonetheless, the findings appear generalizable across many
types of groups. Additionally, Meetup cross-collaboration
has occurred naturally among groups, i.e. co-hosting events
or cross-promoting. However, a city or regional-level
perspective was chosen for this research in order to
understand broader community dynamics. After these
ecosystems have been described, then recommendations
could be made to create, connect, or expand existing groups
(or clusters of groups) to positively impact a community.
Workshop CSCW 2012, July 16–20, 2012, Montreal, Canada.
In conjuction with UMAP 2012.
Hence, today the evolution and acceleration of online social
media platforms (such as LinkedIN and Meetup), have
made visible previously unavailable data on communities.
Regional community ecosystems can be more easily
compared to understand similarities and differences.
The paper was organized as follows: first the primary
research objectives were described; followed by a
walkthrough of the data sources and necessary extensions to
make the raw data useful for further analysis. Since this
project has been a work-in-progress, preliminary results
were then presented. This short paper concluded with near-
term plans for next research and development steps.
PROPOSED RESEARCH APPROACH
The primary objective of this exploratory and descriptive
research has been to develop a community recommendation
system. The research was conducted across three inter-
related dimensions which were represented as dyads:
 Community-Community: The goal was recommend to
existing Meetup groups potential opportunities for
collaboration with related local tech and business
groups. This approach of finding similar communities
was applied to groups in a home city; as well as other
cities. An objective of these of recommendations was
to further integrate existing communities. Groups
share common members such that collaboration among
groups may be beneficial such as with cross-pollination
of ideas and knowledge. This objective was achieved
primarily by understanding group-member overlap
utilizing social network dynamics.
 City-City: The goal was to recommend new groups for
a region. The recommendations were based on groups
in other cities; but not present locally. For example,
the Boston Predictive Analytics1
was formed after
discovery of NYC Predictive Analytics2
. Similarly,
Hacks & Hackers3
, a journalism meets software
development group, has been adopted in several cities
through the efforts of individuals who had learned of
this group’s presence on Meetup.
 Member-City: The goal was to recommend Meetup
groups based upon aggregate interests of members of a
local community. Please note that Meetup currently
recommends existing groups to members based upon
individual interests; therefore the focus here was to
recommend new groups based on unmet interests.
Among the above three dyads (or levels of analysis) there
existed a strong probability that the recommendations from
each may conflict. For example, another city may have a
strong industry presence that seldom exist in other places,
1
http://www.meetup.com/Boston-Predictive-Analytics/
2
http://www.meetup.com/NYC-Predictive-Analytics/
3
http://meetupbos.hackshackers.com/
e.g. the uniqueness of the movie industry in Los Angeles.
Conversely, a group like Hacks/Hackers given its
journalism dimension would seem applicable to many
cities. The above highlights that expansion of this project
to include more city-specific attributes might improve the
recommendations.
DATA SOURCES AND PREPARATION
Meetup’s API includes several different methods that cover
cities, groups, members, interests (or topics), events, and
rsvp information. Foremost, Meetup.com API’s was
utilized to pull state and city information on community
demographics for the United States.4
Next, community
group and member information including their associated
“interests” were pulled for Silicon Valley, New York City,
and Boston. These cities were chosen given their strong
technology entrepreneurship leadership, high Meetup
adoption, and as Cambridge, MA has been the home of the
researcher for several years. A geographic factor by proxy
of zip code was used to help identify regional boundaries.
A key to classifying the groups was utilizing topic tags;
however this data exhibits a very long-tail such that it was
necessary to create main categories and sub-categories in
order to then enable regional comparisons and develop
recommendations. Categories were created by manually
coding over a thousand Meetup groups. Main categories
created included technology and business dimensions; as
well as travel, dining, music, outdoors, recreation, fitness,
photography, singles, parenting, religion, et al.
Using machine learning classification techniques, thousands
of groups from Boston, New York City, and Silicon Valley
were then categorized across over thirty main categories.
Specifically, R was used for data pulls, data management,
and analysis. The library, RTextTools5
, was utilized for
text processing and machine learning. This library contains
well outlined steps for performing supervised learning.
Multiple learning approaches were available including
SVM, Maximum Entropy, Neural Networks, and others.
Several iterations were performed towards identifying the
best approach; as well as providing feedback for additional
manual coding in order to improve model accuracy.
Subsequent analysis of the technology and business groups
revealed patterns, particularly city differences, pertaining to
the types of groups. Therefore, technology and business
categories were further sub-segmented. For example,
technology topics included software development, mobile,
cloud, etc. Business categories covered professional
networking, entrepreneurship, careers, jobs, marketing &
4
Many thanks to Vipin Sachdeva of IBM Cambridge for
helping with data pulls, and for showing me better ways of
performing this step.
5
http://www.rtexttools.com/
sales, and industry-specific verticals, such as healthcare and
clean energy.
These new categories enabled recommendations to occur at
multiple levels: category, sub-category, and specific group.
For example, for a particular city a recommendation might
be to create more “business” groups; or perhaps to develop
a sub-category-level ‘thematic’ group such as “startups”; or
be more specific, and recommend a community group
pertaining to “healthcare information technology startups”.
Meetup.com member profile also includes fields for
complementary social media platforms including Twitter,
LinkedIN, and Facebook. However, on Meetup these social
media fields, as well as member interests; were optional
fields thereby reducing the available sample.
With respect to the different levels of inquiry additional
data processing was necessary towards setting the stage for
further analysis:
 Community-Community: Group and member data was
transformed into group-member dyads; also known as
an edgelist. An R package, called ‘tnet’ developed by
Tore Opsahl6
was utilized to find edge weights as
determined by member overlap between two groups.
The output from this R library enabled bipartite social
network analysis to quantify the overlap among groups
(using group-member dyads). Gephi, a social network
software package, was then utilized to perform analysis
to derive network characteristics such as “centrality”.7
Gephi also was used to create data visualizations.
Hierarchical cluster analysis was also performed to
under the structure a region’s ecosystem.
 City-City: Groups across cities were compared using
text-based machine learning to ascertain the similarity
or dissimilarity of groups. The text dataset consisted of
group names with up to fifteen topic tags. Where
applicable, the social graphs of city-groups were also
compared to make more specific recommendations.
 Member-City: Meetup member interests were pulled
by using their API; as well as extracting the interests
specified by organizers for their groups. Meetup
members were not limited to a set number of topics
such that some had listed dozens upon dozens of
interests. A common Meetup tag language simplified
the analysis between members and groups. LinkedIN
groups and skills were found often times to be visible
on profiles. Please note that the initial exploratory
research for LinkedIN was a subset of its members;
specifically people who were also on Meetup’s
platform.
6
http://toreopsahl.com/
7
http://gephi.org/
PRELIMINARY RESULTS AND DISCUSSION
Longitudinal analysis on community formation was also
performed and showed that over the past few years
Meetup.com has seen accelerated growth; especially for
technology and business groups. Analysis at the category
level showed Silicon Valley to have the highest percentage
of technology and business groups; whereas Boston showed
to have a lower than average percentage of business groups.
NYC had a slightly higher than average amount or both. At
the sub-category, or thematic level, Boston showed to have
a propensity of programming language groups, and a lower
number of business networking groups.
Among the three cities, the most in-depth analysis was
performed on Boston, and so these initial results will focus
on this region. With respect to different recommendation
components the preliminary findings included:
 Community-Community: The largest Boston groups
have been software development and programming
language. However, social network analysis revealed
that a NewTech8
group, comparatively a smaller and
younger group, was a key group based on its centrality
in the network. Geographic analysis was helpful to
study why many small entrepreneurial groups were less
connected to the core technology center (Boston has a
well-known hub-and-spoke layout). The addition of zip
code based data confirmed that suburban groups were
less connected.
 City-City: Although a final goal was to have graph
matching performed across entire regions, initial
analysis was limited to sub-graphs, i.e. micro-clusters
of communities. Supervised learning techniques were
used find the nearest counterparts of groups in Boston
versus those in Silicon Valley and NYC. For example,
Silicon Valley did not have a “predictive analytics”
group by name; though several interesting groups were
discovered including “Graph Database Group” and
“Big Data Analytics: Mobile, Social and Web”. The
“SV Business Intelligence” was the closest match.
 Member-City: Members included in the exploratory
sample were Boston tech and business members who in
their Meetup profile had provided a LinkedIN address
AND listed their interests. In aggregate, the most
frequently desired interests displayed were similar for
both Meetup and LinkedIN. That is, member identities
were consistent between platforms. The most desired
themes included networking, startups, web, and social
media (Meetup groups exist for each). Two of the
highest unmet needs for Boston were for Perl and XML
which for each the magnitude of interest was
moderately strong. Next research steps will include
sampling a broader selection of LinkedIN members.
8
http://www.meetup.com/Boston_New_Technology/
CONCLUSION
New data on community groups from Meetup.com has
enabled more granular analysis for understanding how to
both efficiently and effectively manage communities. Open
questions remain as to the impact of these informal
communities on local economic performance. However
this phase of research was aimed at quantifying, and
identifying new communities that may serve as initial
recommendations, or inputs, to discussions on community
development.
ACKNOWLEDGMENTS
I thank Vipin Sachdeva and Ravi Garg for enlightening
discussions on computer science and machine learning;
Tore Opsahl for creating the R package ‘tnet’, and the
RTextTools team for creating a fantastic R library!
REFERENCES
1. Saxenian, A. (1994). Regional Advantage: Culture and
Competition in Silicon Valley and Route 128
(Cambridge, MA: Harvard University Press)
2. Powell, W., K.W. Koput, & L. Smith-Doerr (1996),
Interorganizational collaboration and the locus of
innovation: networks of learning in biotechnology,
Administrative Science Quarterly 42 (1): 116-145
3. Putnam, Robert D. (2000). Bowling Alone: The
Collapse and Revival of American Community. New
York: Simon & Schuster.
4. Weinberg, B. D., Williams, C. B. (2006). The 2004 US
Presidential campaign: Impact of hybrid offline &
online 'meetup' communities. Journal of Direct, Data
and Digital Marketing Practice, 8 (1 (July)), 46-57.

More Related Content

What's hot

Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
Premsankar Chakkingal
 
Structure of Communication and Narrative Construction of Social Movements wit...
Structure of Communication and Narrative Construction of Social Movements wit...Structure of Communication and Narrative Construction of Social Movements wit...
Structure of Communication and Narrative Construction of Social Movements wit...
Mario Orefice, University of Urbino
 
“무형의 대학”(The New Invisible College) 저자 C. Wagner 교수 초청특강
“무형의 대학”(The New Invisible College) 저자 C. Wagner 교수 초청특강“무형의 대학”(The New Invisible College) 저자 C. Wagner 교수 초청특강
“무형의 대학”(The New Invisible College) 저자 C. Wagner 교수 초청특강
Han Woo PARK
 
2009 - Connected Action - Marc Smith - Social Media Network Analysis
2009 - Connected Action - Marc Smith - Social Media Network Analysis2009 - Connected Action - Marc Smith - Social Media Network Analysis
2009 - Connected Action - Marc Smith - Social Media Network Analysis
Marc Smith
 
2009-Social computing-Analyzing social media networks
2009-Social computing-Analyzing social media networks2009-Social computing-Analyzing social media networks
2009-Social computing-Analyzing social media networks
Marc Smith
 
Social Network Analysis for Competitive Intelligence
Social Network Analysis for Competitive IntelligenceSocial Network Analysis for Competitive Intelligence
Social Network Analysis for Competitive Intelligence
August Jackson
 
Social Media Mining - Chapter 2 (Graph Essentials)
Social Media Mining - Chapter 2 (Graph Essentials)Social Media Mining - Chapter 2 (Graph Essentials)
Social Media Mining - Chapter 2 (Graph Essentials)
SocialMediaMining
 
Introduction to complex systems and social network analysis
Introduction to complex systems and social network analysisIntroduction to complex systems and social network analysis
Introduction to complex systems and social network analysis
BRAC University Computer Club
 
2007-JOSS-Visualizing the signatures of social roles in online discussion groups
2007-JOSS-Visualizing the signatures of social roles in online discussion groups2007-JOSS-Visualizing the signatures of social roles in online discussion groups
2007-JOSS-Visualizing the signatures of social roles in online discussion groups
Marc Smith
 
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
BAINIDA
 
Community Evolution in the Digital Space and Creation of Social Information C...
Community Evolution in the Digital Space and Creation of SocialInformation C...Community Evolution in the Digital Space and Creation of SocialInformation C...
Community Evolution in the Digital Space and Creation of Social Information C...
Saptarshi Ghosh
 
Social Networks and Social Capital
Social Networks and Social CapitalSocial Networks and Social Capital
Social Networks and Social Capital
Giorgos Cheliotis
 
Citizens Speak Out:Public e-Engagement Experienceof Slovakia
Citizens Speak Out:Public e-Engagement Experienceof SlovakiaCitizens Speak Out:Public e-Engagement Experienceof Slovakia
Citizens Speak Out:Public e-Engagement Experienceof Slovakia
Кафедра міжнародної інформації РІС
 
Jasist11
Jasist11Jasist11
Jasist11
svennus
 
Practical Applications for Social Network Analysis in Public Sector Marketing...
Practical Applications for Social Network Analysis in Public Sector Marketing...Practical Applications for Social Network Analysis in Public Sector Marketing...
Practical Applications for Social Network Analysis in Public Sector Marketing...
Mike Kujawski
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
Patti Anklam
 
Social Network Theory & Analysis
Social Network Theory & Analysis Social Network Theory & Analysis
Social Network Theory & Analysis
Susan Chesley Fant
 
Social network analysis & Big Data - Telecommunications and more
Social network analysis & Big Data - Telecommunications and moreSocial network analysis & Big Data - Telecommunications and more
Social network analysis & Big Data - Telecommunications and more
Wael Elrifai
 
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
Marc Smith
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)
theijes
 

What's hot (20)

Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
 
Structure of Communication and Narrative Construction of Social Movements wit...
Structure of Communication and Narrative Construction of Social Movements wit...Structure of Communication and Narrative Construction of Social Movements wit...
Structure of Communication and Narrative Construction of Social Movements wit...
 
“무형의 대학”(The New Invisible College) 저자 C. Wagner 교수 초청특강
“무형의 대학”(The New Invisible College) 저자 C. Wagner 교수 초청특강“무형의 대학”(The New Invisible College) 저자 C. Wagner 교수 초청특강
“무형의 대학”(The New Invisible College) 저자 C. Wagner 교수 초청특강
 
2009 - Connected Action - Marc Smith - Social Media Network Analysis
2009 - Connected Action - Marc Smith - Social Media Network Analysis2009 - Connected Action - Marc Smith - Social Media Network Analysis
2009 - Connected Action - Marc Smith - Social Media Network Analysis
 
2009-Social computing-Analyzing social media networks
2009-Social computing-Analyzing social media networks2009-Social computing-Analyzing social media networks
2009-Social computing-Analyzing social media networks
 
Social Network Analysis for Competitive Intelligence
Social Network Analysis for Competitive IntelligenceSocial Network Analysis for Competitive Intelligence
Social Network Analysis for Competitive Intelligence
 
Social Media Mining - Chapter 2 (Graph Essentials)
Social Media Mining - Chapter 2 (Graph Essentials)Social Media Mining - Chapter 2 (Graph Essentials)
Social Media Mining - Chapter 2 (Graph Essentials)
 
Introduction to complex systems and social network analysis
Introduction to complex systems and social network analysisIntroduction to complex systems and social network analysis
Introduction to complex systems and social network analysis
 
2007-JOSS-Visualizing the signatures of social roles in online discussion groups
2007-JOSS-Visualizing the signatures of social roles in online discussion groups2007-JOSS-Visualizing the signatures of social roles in online discussion groups
2007-JOSS-Visualizing the signatures of social roles in online discussion groups
 
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
 
Community Evolution in the Digital Space and Creation of Social Information C...
Community Evolution in the Digital Space and Creation of SocialInformation C...Community Evolution in the Digital Space and Creation of SocialInformation C...
Community Evolution in the Digital Space and Creation of Social Information C...
 
Social Networks and Social Capital
Social Networks and Social CapitalSocial Networks and Social Capital
Social Networks and Social Capital
 
Citizens Speak Out:Public e-Engagement Experienceof Slovakia
Citizens Speak Out:Public e-Engagement Experienceof SlovakiaCitizens Speak Out:Public e-Engagement Experienceof Slovakia
Citizens Speak Out:Public e-Engagement Experienceof Slovakia
 
Jasist11
Jasist11Jasist11
Jasist11
 
Practical Applications for Social Network Analysis in Public Sector Marketing...
Practical Applications for Social Network Analysis in Public Sector Marketing...Practical Applications for Social Network Analysis in Public Sector Marketing...
Practical Applications for Social Network Analysis in Public Sector Marketing...
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
 
Social Network Theory & Analysis
Social Network Theory & Analysis Social Network Theory & Analysis
Social Network Theory & Analysis
 
Social network analysis & Big Data - Telecommunications and more
Social network analysis & Big Data - Telecommunications and moreSocial network analysis & Big Data - Telecommunications and more
Social network analysis & Big Data - Telecommunications and more
 
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)
 

Similar to Recommending Communities at the Regional and City Level

Social Networking Sites: An Academic Approach to Revenue Generation
Social Networking Sites: An Academic Approach to Revenue GenerationSocial Networking Sites: An Academic Approach to Revenue Generation
Social Networking Sites: An Academic Approach to Revenue Generation
Kristelle Siarza
 
Jx2517481755
Jx2517481755Jx2517481755
Jx2517481755
IJERA Editor
 
Jx2517481755
Jx2517481755Jx2517481755
Jx2517481755
IJERA Editor
 
Smac technical report
Smac technical reportSmac technical report
Smac technical report
Naga Pradeep
 
A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...
A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...
A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...
IOSR Journals
 
2_Doc5_2.pdf
2_Doc5_2.pdf2_Doc5_2.pdf
2_Doc5_2.pdf
Ashishpal342837
 
Social Media Influence Analysis using Data Science Techniques
Social Media Influence Analysis using Data Science TechniquesSocial Media Influence Analysis using Data Science Techniques
Social Media Influence Analysis using Data Science Techniques
Muhammad Bilal
 
Online social networking
Online social networkingOnline social networking
Online social networking
Gurudutt Reddy
 
SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdf
BalasundaramSr
 
1999 ACM SIGCHI - Counting on Community in Cyberspace
1999   ACM SIGCHI - Counting on Community in Cyberspace1999   ACM SIGCHI - Counting on Community in Cyberspace
1999 ACM SIGCHI - Counting on Community in Cyberspace
Marc Smith
 
Social networking-overview
Social networking-overviewSocial networking-overview
Social networking-overview
sakshicherry
 
nm
nmnm
Fuzzy AndANN Based Mining Approach Testing For Social Network Analysis
Fuzzy AndANN Based Mining Approach Testing For Social Network AnalysisFuzzy AndANN Based Mining Approach Testing For Social Network Analysis
Fuzzy AndANN Based Mining Approach Testing For Social Network Analysis
IJERA Editor
 
Researching Social Media – Big Data and Social Media Analysis
Researching Social Media – Big Data and Social Media AnalysisResearching Social Media – Big Data and Social Media Analysis
Researching Social Media – Big Data and Social Media Analysis
Farida Vis
 
Literature Review of Information Behaviour on Social Media
Literature Review of Information Behaviour on Social MediaLiterature Review of Information Behaviour on Social Media
Literature Review of Information Behaviour on Social Media
David Thompson
 
An Empirical Study On IMDb And Its Communities Based On The Network Of Co-Rev...
An Empirical Study On IMDb And Its Communities Based On The Network Of Co-Rev...An Empirical Study On IMDb And Its Communities Based On The Network Of Co-Rev...
An Empirical Study On IMDb And Its Communities Based On The Network Of Co-Rev...
Rick Vogel
 
Suazo, martínez & elgueta english version
Suazo, martínez & elgueta english versionSuazo, martínez & elgueta english version
Suazo, martínez & elgueta english version
2011990
 
A review for the online social networks literature
A review for the online social networks literatureA review for the online social networks literature
A review for the online social networks literature
Alexander Decker
 
A review for the online social networks literature
A review for the online social networks literatureA review for the online social networks literature
A review for the online social networks literature
Alexander Decker
 
POLITICAL OPINION ANALYSIS IN SOCIAL NETWORKS: CASE OF TWITTER AND FACEBOOK
POLITICAL OPINION ANALYSIS IN SOCIAL NETWORKS: CASE OF TWITTER AND FACEBOOKPOLITICAL OPINION ANALYSIS IN SOCIAL NETWORKS: CASE OF TWITTER AND FACEBOOK
POLITICAL OPINION ANALYSIS IN SOCIAL NETWORKS: CASE OF TWITTER AND FACEBOOK
IJwest
 

Similar to Recommending Communities at the Regional and City Level (20)

Social Networking Sites: An Academic Approach to Revenue Generation
Social Networking Sites: An Academic Approach to Revenue GenerationSocial Networking Sites: An Academic Approach to Revenue Generation
Social Networking Sites: An Academic Approach to Revenue Generation
 
Jx2517481755
Jx2517481755Jx2517481755
Jx2517481755
 
Jx2517481755
Jx2517481755Jx2517481755
Jx2517481755
 
Smac technical report
Smac technical reportSmac technical report
Smac technical report
 
A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...
A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...
A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...
 
2_Doc5_2.pdf
2_Doc5_2.pdf2_Doc5_2.pdf
2_Doc5_2.pdf
 
Social Media Influence Analysis using Data Science Techniques
Social Media Influence Analysis using Data Science TechniquesSocial Media Influence Analysis using Data Science Techniques
Social Media Influence Analysis using Data Science Techniques
 
Online social networking
Online social networkingOnline social networking
Online social networking
 
SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdf
 
1999 ACM SIGCHI - Counting on Community in Cyberspace
1999   ACM SIGCHI - Counting on Community in Cyberspace1999   ACM SIGCHI - Counting on Community in Cyberspace
1999 ACM SIGCHI - Counting on Community in Cyberspace
 
Social networking-overview
Social networking-overviewSocial networking-overview
Social networking-overview
 
nm
nmnm
nm
 
Fuzzy AndANN Based Mining Approach Testing For Social Network Analysis
Fuzzy AndANN Based Mining Approach Testing For Social Network AnalysisFuzzy AndANN Based Mining Approach Testing For Social Network Analysis
Fuzzy AndANN Based Mining Approach Testing For Social Network Analysis
 
Researching Social Media – Big Data and Social Media Analysis
Researching Social Media – Big Data and Social Media AnalysisResearching Social Media – Big Data and Social Media Analysis
Researching Social Media – Big Data and Social Media Analysis
 
Literature Review of Information Behaviour on Social Media
Literature Review of Information Behaviour on Social MediaLiterature Review of Information Behaviour on Social Media
Literature Review of Information Behaviour on Social Media
 
An Empirical Study On IMDb And Its Communities Based On The Network Of Co-Rev...
An Empirical Study On IMDb And Its Communities Based On The Network Of Co-Rev...An Empirical Study On IMDb And Its Communities Based On The Network Of Co-Rev...
An Empirical Study On IMDb And Its Communities Based On The Network Of Co-Rev...
 
Suazo, martínez & elgueta english version
Suazo, martínez & elgueta english versionSuazo, martínez & elgueta english version
Suazo, martínez & elgueta english version
 
A review for the online social networks literature
A review for the online social networks literatureA review for the online social networks literature
A review for the online social networks literature
 
A review for the online social networks literature
A review for the online social networks literatureA review for the online social networks literature
A review for the online social networks literature
 
POLITICAL OPINION ANALYSIS IN SOCIAL NETWORKS: CASE OF TWITTER AND FACEBOOK
POLITICAL OPINION ANALYSIS IN SOCIAL NETWORKS: CASE OF TWITTER AND FACEBOOKPOLITICAL OPINION ANALYSIS IN SOCIAL NETWORKS: CASE OF TWITTER AND FACEBOOK
POLITICAL OPINION ANALYSIS IN SOCIAL NETWORKS: CASE OF TWITTER AND FACEBOOK
 

Recently uploaded

UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdfAI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
Techgropse Pvt.Ltd.
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 

Recently uploaded (20)

UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdfAI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 

Recommending Communities at the Regional and City Level

  • 1. Recommending Communities at the Regional & City Level John P. Verostek markITview Research Cambridge, MA, USA {john} @markitview.com ABSTRACT An exploratory study was undertaken to compare the community ecosystems of Boston, Silicon Valley, and New York City. The motivation was to understand what new groups could be recommended to improve the community landscapes. In particular, business and technology Meetup groups were studied with respect to type, number, size; and the interconnectedness across groups. The core data for the study was obtained via Meetup’s API which enables access to community information. Using a trained, manually coded subset of titles and topic tags, machine learning was performed to categorize groups. Summary statistics were then produced and cities compared. For example, Boston’s tech Meetup foundation was built upon programming languages such as PhP and Java. In contrast, New York City historically has cultivated a stronger business networking culture; however this type of group has been less pronounced in Boston. Next, social network analysis was utilized to identify key community groups via centrality measures and to identify “nearest neighbors” for certain groups. The content-based machine learning output was combined with the social network analysis to create a tiered, hybrid recommendation system. Lessons learned from the study would appear generalizable to cities beyond the three initial covered. For example, an economic advisor could recreate the community evolution of a city, and perform comparative regional analysis to then develop new recommendation for community growth. The next project steps were to move from ‘proof of concept’ towards developing a prototype community recommendation system. Author Keywords Meetup.com, community, recommendations, social network analysis, economic development, machine learning ACM Classification Keywords H.3.1 [Information Storage and Retrieval]: Content Analysis and Indexing. INTRODUCTION One research stream within economic development has been knowledge diffusion (or spillover) within a region. In 1994, Saxenian analyzed the electronic industries in Silicon Valley and Boston’s Route 128, and found that informal networks helped Silicon Valley with economic growth [1]. Powell, et al, studied the biotechnology industry using network connectivity as a lens for which to understand knowledge transfer [2]. In general, these and other pre- Internet studies of knowledge flow relied upon qualitative research and quantitative measures such as scientific journal citations, patents, and company financials. In the past decade, online social media, networks, and communities have drawn much more interest than their offline counterparts. Traditional face-to-face communities, whose history was illustrated in the Robert Putnam’s book Bowling Alone [3], have received less attention from a business standpoint or academic research perspective. Furthermore, the line between offline and online has become blurred such as with Meetup.com straddling both worlds. A notable past study covering this platform was Weinberg and Williams analysis of Howard Dean’s 2004 presidential run and his utilization of Meetup [4]. At that time Meetup.com was in its infancy having launched shortly after 9/11 in 2002. More recently Meetup has seen accelerated growth especially in New York City (its home town), Silicon Valley, and also in London, Washington DC, Chicago, Boston, Toronto, Los Angeles among several others. Groups were started by organizers who each work with their members to hold events. Communities exist across a wide variety of interests often reflecting preexisting local activities. However, this study focuses on business and technology which has been a popular topic in Cambridge, MA (and in particular Kendall Square near MIT). Nonetheless, the findings appear generalizable across many types of groups. Additionally, Meetup cross-collaboration has occurred naturally among groups, i.e. co-hosting events or cross-promoting. However, a city or regional-level perspective was chosen for this research in order to understand broader community dynamics. After these ecosystems have been described, then recommendations could be made to create, connect, or expand existing groups (or clusters of groups) to positively impact a community. Workshop CSCW 2012, July 16–20, 2012, Montreal, Canada. In conjuction with UMAP 2012.
  • 2. Hence, today the evolution and acceleration of online social media platforms (such as LinkedIN and Meetup), have made visible previously unavailable data on communities. Regional community ecosystems can be more easily compared to understand similarities and differences. The paper was organized as follows: first the primary research objectives were described; followed by a walkthrough of the data sources and necessary extensions to make the raw data useful for further analysis. Since this project has been a work-in-progress, preliminary results were then presented. This short paper concluded with near- term plans for next research and development steps. PROPOSED RESEARCH APPROACH The primary objective of this exploratory and descriptive research has been to develop a community recommendation system. The research was conducted across three inter- related dimensions which were represented as dyads:  Community-Community: The goal was recommend to existing Meetup groups potential opportunities for collaboration with related local tech and business groups. This approach of finding similar communities was applied to groups in a home city; as well as other cities. An objective of these of recommendations was to further integrate existing communities. Groups share common members such that collaboration among groups may be beneficial such as with cross-pollination of ideas and knowledge. This objective was achieved primarily by understanding group-member overlap utilizing social network dynamics.  City-City: The goal was to recommend new groups for a region. The recommendations were based on groups in other cities; but not present locally. For example, the Boston Predictive Analytics1 was formed after discovery of NYC Predictive Analytics2 . Similarly, Hacks & Hackers3 , a journalism meets software development group, has been adopted in several cities through the efforts of individuals who had learned of this group’s presence on Meetup.  Member-City: The goal was to recommend Meetup groups based upon aggregate interests of members of a local community. Please note that Meetup currently recommends existing groups to members based upon individual interests; therefore the focus here was to recommend new groups based on unmet interests. Among the above three dyads (or levels of analysis) there existed a strong probability that the recommendations from each may conflict. For example, another city may have a strong industry presence that seldom exist in other places, 1 http://www.meetup.com/Boston-Predictive-Analytics/ 2 http://www.meetup.com/NYC-Predictive-Analytics/ 3 http://meetupbos.hackshackers.com/ e.g. the uniqueness of the movie industry in Los Angeles. Conversely, a group like Hacks/Hackers given its journalism dimension would seem applicable to many cities. The above highlights that expansion of this project to include more city-specific attributes might improve the recommendations. DATA SOURCES AND PREPARATION Meetup’s API includes several different methods that cover cities, groups, members, interests (or topics), events, and rsvp information. Foremost, Meetup.com API’s was utilized to pull state and city information on community demographics for the United States.4 Next, community group and member information including their associated “interests” were pulled for Silicon Valley, New York City, and Boston. These cities were chosen given their strong technology entrepreneurship leadership, high Meetup adoption, and as Cambridge, MA has been the home of the researcher for several years. A geographic factor by proxy of zip code was used to help identify regional boundaries. A key to classifying the groups was utilizing topic tags; however this data exhibits a very long-tail such that it was necessary to create main categories and sub-categories in order to then enable regional comparisons and develop recommendations. Categories were created by manually coding over a thousand Meetup groups. Main categories created included technology and business dimensions; as well as travel, dining, music, outdoors, recreation, fitness, photography, singles, parenting, religion, et al. Using machine learning classification techniques, thousands of groups from Boston, New York City, and Silicon Valley were then categorized across over thirty main categories. Specifically, R was used for data pulls, data management, and analysis. The library, RTextTools5 , was utilized for text processing and machine learning. This library contains well outlined steps for performing supervised learning. Multiple learning approaches were available including SVM, Maximum Entropy, Neural Networks, and others. Several iterations were performed towards identifying the best approach; as well as providing feedback for additional manual coding in order to improve model accuracy. Subsequent analysis of the technology and business groups revealed patterns, particularly city differences, pertaining to the types of groups. Therefore, technology and business categories were further sub-segmented. For example, technology topics included software development, mobile, cloud, etc. Business categories covered professional networking, entrepreneurship, careers, jobs, marketing & 4 Many thanks to Vipin Sachdeva of IBM Cambridge for helping with data pulls, and for showing me better ways of performing this step. 5 http://www.rtexttools.com/
  • 3. sales, and industry-specific verticals, such as healthcare and clean energy. These new categories enabled recommendations to occur at multiple levels: category, sub-category, and specific group. For example, for a particular city a recommendation might be to create more “business” groups; or perhaps to develop a sub-category-level ‘thematic’ group such as “startups”; or be more specific, and recommend a community group pertaining to “healthcare information technology startups”. Meetup.com member profile also includes fields for complementary social media platforms including Twitter, LinkedIN, and Facebook. However, on Meetup these social media fields, as well as member interests; were optional fields thereby reducing the available sample. With respect to the different levels of inquiry additional data processing was necessary towards setting the stage for further analysis:  Community-Community: Group and member data was transformed into group-member dyads; also known as an edgelist. An R package, called ‘tnet’ developed by Tore Opsahl6 was utilized to find edge weights as determined by member overlap between two groups. The output from this R library enabled bipartite social network analysis to quantify the overlap among groups (using group-member dyads). Gephi, a social network software package, was then utilized to perform analysis to derive network characteristics such as “centrality”.7 Gephi also was used to create data visualizations. Hierarchical cluster analysis was also performed to under the structure a region’s ecosystem.  City-City: Groups across cities were compared using text-based machine learning to ascertain the similarity or dissimilarity of groups. The text dataset consisted of group names with up to fifteen topic tags. Where applicable, the social graphs of city-groups were also compared to make more specific recommendations.  Member-City: Meetup member interests were pulled by using their API; as well as extracting the interests specified by organizers for their groups. Meetup members were not limited to a set number of topics such that some had listed dozens upon dozens of interests. A common Meetup tag language simplified the analysis between members and groups. LinkedIN groups and skills were found often times to be visible on profiles. Please note that the initial exploratory research for LinkedIN was a subset of its members; specifically people who were also on Meetup’s platform. 6 http://toreopsahl.com/ 7 http://gephi.org/ PRELIMINARY RESULTS AND DISCUSSION Longitudinal analysis on community formation was also performed and showed that over the past few years Meetup.com has seen accelerated growth; especially for technology and business groups. Analysis at the category level showed Silicon Valley to have the highest percentage of technology and business groups; whereas Boston showed to have a lower than average percentage of business groups. NYC had a slightly higher than average amount or both. At the sub-category, or thematic level, Boston showed to have a propensity of programming language groups, and a lower number of business networking groups. Among the three cities, the most in-depth analysis was performed on Boston, and so these initial results will focus on this region. With respect to different recommendation components the preliminary findings included:  Community-Community: The largest Boston groups have been software development and programming language. However, social network analysis revealed that a NewTech8 group, comparatively a smaller and younger group, was a key group based on its centrality in the network. Geographic analysis was helpful to study why many small entrepreneurial groups were less connected to the core technology center (Boston has a well-known hub-and-spoke layout). The addition of zip code based data confirmed that suburban groups were less connected.  City-City: Although a final goal was to have graph matching performed across entire regions, initial analysis was limited to sub-graphs, i.e. micro-clusters of communities. Supervised learning techniques were used find the nearest counterparts of groups in Boston versus those in Silicon Valley and NYC. For example, Silicon Valley did not have a “predictive analytics” group by name; though several interesting groups were discovered including “Graph Database Group” and “Big Data Analytics: Mobile, Social and Web”. The “SV Business Intelligence” was the closest match.  Member-City: Members included in the exploratory sample were Boston tech and business members who in their Meetup profile had provided a LinkedIN address AND listed their interests. In aggregate, the most frequently desired interests displayed were similar for both Meetup and LinkedIN. That is, member identities were consistent between platforms. The most desired themes included networking, startups, web, and social media (Meetup groups exist for each). Two of the highest unmet needs for Boston were for Perl and XML which for each the magnitude of interest was moderately strong. Next research steps will include sampling a broader selection of LinkedIN members. 8 http://www.meetup.com/Boston_New_Technology/
  • 4. CONCLUSION New data on community groups from Meetup.com has enabled more granular analysis for understanding how to both efficiently and effectively manage communities. Open questions remain as to the impact of these informal communities on local economic performance. However this phase of research was aimed at quantifying, and identifying new communities that may serve as initial recommendations, or inputs, to discussions on community development. ACKNOWLEDGMENTS I thank Vipin Sachdeva and Ravi Garg for enlightening discussions on computer science and machine learning; Tore Opsahl for creating the R package ‘tnet’, and the RTextTools team for creating a fantastic R library! REFERENCES 1. Saxenian, A. (1994). Regional Advantage: Culture and Competition in Silicon Valley and Route 128 (Cambridge, MA: Harvard University Press) 2. Powell, W., K.W. Koput, & L. Smith-Doerr (1996), Interorganizational collaboration and the locus of innovation: networks of learning in biotechnology, Administrative Science Quarterly 42 (1): 116-145 3. Putnam, Robert D. (2000). Bowling Alone: The Collapse and Revival of American Community. New York: Simon & Schuster. 4. Weinberg, B. D., Williams, C. B. (2006). The 2004 US Presidential campaign: Impact of hybrid offline & online 'meetup' communities. Journal of Direct, Data and Digital Marketing Practice, 8 (1 (July)), 46-57.