In online social media platforms, users can express their ideas by posting original content or by adding comments and responses to existing posts, thus generating virtual discussions and conversations. Studying these conversations is essential for understanding the online communication behavior of users. This study proposes a novel approach to retrieve popular patterns on online conversations using network-based analysis. The analysis consists of two main stages: intent analysis and network generation. Users’ intention is detected using keyword-based categorization of posts and comments, integrated with classification through Naïve Bayes and Support Vector Machine algorithms for uncategorized comments. A continuous human-in-the-loop approach further improves the keyword-based classification. To build and understand communication patterns among the users, we build conversation graphs starting from the hierarchical structure of posts and comments, using a directed multigraph network. The experiments categorize 90% comments with 98% accuracy on a real social media dataset. The model then identifies relevant patterns in terms of shape and content; and finally determines the relevance and frequency of the patterns. Results show that the most popular online discussion patterns obtained from conversation graphs resemble real-life interactions and communication.
Social network analysis for modeling & tuning social media websiteEdward B. Rockower
Social Network Analysis of a Professional Online Social Media Collaboration Community. Tuning and optimizing based on observed social network dynamics and user behavior.
Community analysis using graph representation learning on social networksMarco Brambilla
In a world more and more connected, new and complex interaction
patterns can be extracted in the communication between people.
This is extremely valuable for brands that can better understand
the interests of users and the trends on social media to better target
their products. In this paper, we aim to analyze the communities
that arise around commercial brands on social networks to understand
the meaning of similarity, collaboration, and interaction
among users.We exploit the network that builds around the brands
by encoding it into a graph model.We build a social network graph,
considering user nodes and friendship relations; then we compare
it with a heterogeneous graph model, where also posts and hashtags
are considered as nodes and connected to the different node
types; we finally build also a reduced network, generated by inducing
direct user-to-user connections through the intermediate
nodes (posts and hashtags). These different variants are encoded
using graph representation learning, which generates a numerical
vector for each node. Machine learning techniques are applied to
these vectors to extract valuable insights for each user and for the
communities they belong to. In the paper, we report on our experiments
performed on an emerging fashion brand on Instagram, and
we show that our approach is able to discriminate potential customers
for the brand, and to highlight meaningful sub-communities
composed by users that share the same kind of content on social
networks.
Data Cleaning for social media knowledge extractionMarco Brambilla
Social media platforms let users share their opinions through textual or multimedia content. In many settings, this becomes a valuable source of knowledge that can be exploited for specific business objectives. Brands and companies often ask to monitor social media as sources for understanding the stance, opinion, and sentiment of their customers, audience and potential audience. This is crucial for them because it let them understand the trends and future commercial and marketing opportunities.
However, all this relies on a solid and reliable data collection phase, that grants that all the analyses, extractions and predictions are applied on clean, solid and focused data. Indeed, the typical topic-based collection of social media content performed through keyword-based search typically entails very noisy results.
We recently implemented a simple study aiming at cleaning the data collected from social content, within specific domains or related to given topics of interest. We propose a basic method for data cleaning and removal of off-topic content based on supervised machine learning techniques, i.e. classification, over data collected from social media platforms based on keywords regarding a specific topic. We define a general method for this and then we validate it through an experiment of data extraction from Twitter, with respect to a set of famous cultural institutions in Italy, including theaters, museums, and other venues.
For this case, we collaborated with domain experts to label the dataset, and then we evaluated and compared the performance of classifiers that are trained with different feature extraction strategies.
Iterative knowledge extraction from social networks. The Web Conference 2018Marco Brambilla
Knowledge in the world continuously evolves, and ontologies are largely incomplete, especially regarding data belonging to the so-called long tail. We propose a method for discovering emerging knowledge by extracting it from social content. Once initialized by domain experts, the method is capable of finding relevant entities by means of a mixed syntactic-semantic method. The method uses seeds, i.e. prototypes of emerging entities provided by experts, for generating candidates; then, it associates candidates to feature vectors built by using terms occurring in their social content and ranks the candidates by using their distance from the centroid of seeds, returning the top candidates. Our method can run iteratively, using the results as new seeds.
In this paper we address the following research questions: (1) How does the reconstructed domain knowledge evolve if the candidates of one extraction are recursively used as seeds (2) How does the reconstructed domain knowledge spread geographically (3) Can the method be used to inspect the past, present, and future of knowledge (4) Can the method be used to find emerging knowledge?.
This work was presented at The Web Conference 2018, MSM workshop.
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Xiaohan Zeng
The advent of the social networks has completely changed our daily life. The deluge of data collected on Social Network Services (SNS) and recent developments in complex network theory have enabled many marvelous predictive analysis, which tells us many amazing stories.
Why do we often feel that "the world is so small?" Is the six-degree separation purely imagination or based on mathematical insights? Why are there just a few rockstars who enjoy extreme popularity while most of us stay unknown to the world? When science meets coffee shop knowledge, things are bound to be intriguing.
I will first briefly describe what social networks are, in the mathematical sense. Then I will introduce some ways to extract characteristics of networks, and how these analyses can explain many anecdotes in our life. Finally, I'll show an example of what we can learn from social network analysis, based on data from Groupon.
An introduction in the world of Social Network Analysis and a view on how this may help learning networks. History, data collection and several analysis techniques are shown.
Social network analysis for modeling & tuning social media websiteEdward B. Rockower
Social Network Analysis of a Professional Online Social Media Collaboration Community. Tuning and optimizing based on observed social network dynamics and user behavior.
Community analysis using graph representation learning on social networksMarco Brambilla
In a world more and more connected, new and complex interaction
patterns can be extracted in the communication between people.
This is extremely valuable for brands that can better understand
the interests of users and the trends on social media to better target
their products. In this paper, we aim to analyze the communities
that arise around commercial brands on social networks to understand
the meaning of similarity, collaboration, and interaction
among users.We exploit the network that builds around the brands
by encoding it into a graph model.We build a social network graph,
considering user nodes and friendship relations; then we compare
it with a heterogeneous graph model, where also posts and hashtags
are considered as nodes and connected to the different node
types; we finally build also a reduced network, generated by inducing
direct user-to-user connections through the intermediate
nodes (posts and hashtags). These different variants are encoded
using graph representation learning, which generates a numerical
vector for each node. Machine learning techniques are applied to
these vectors to extract valuable insights for each user and for the
communities they belong to. In the paper, we report on our experiments
performed on an emerging fashion brand on Instagram, and
we show that our approach is able to discriminate potential customers
for the brand, and to highlight meaningful sub-communities
composed by users that share the same kind of content on social
networks.
Data Cleaning for social media knowledge extractionMarco Brambilla
Social media platforms let users share their opinions through textual or multimedia content. In many settings, this becomes a valuable source of knowledge that can be exploited for specific business objectives. Brands and companies often ask to monitor social media as sources for understanding the stance, opinion, and sentiment of their customers, audience and potential audience. This is crucial for them because it let them understand the trends and future commercial and marketing opportunities.
However, all this relies on a solid and reliable data collection phase, that grants that all the analyses, extractions and predictions are applied on clean, solid and focused data. Indeed, the typical topic-based collection of social media content performed through keyword-based search typically entails very noisy results.
We recently implemented a simple study aiming at cleaning the data collected from social content, within specific domains or related to given topics of interest. We propose a basic method for data cleaning and removal of off-topic content based on supervised machine learning techniques, i.e. classification, over data collected from social media platforms based on keywords regarding a specific topic. We define a general method for this and then we validate it through an experiment of data extraction from Twitter, with respect to a set of famous cultural institutions in Italy, including theaters, museums, and other venues.
For this case, we collaborated with domain experts to label the dataset, and then we evaluated and compared the performance of classifiers that are trained with different feature extraction strategies.
Iterative knowledge extraction from social networks. The Web Conference 2018Marco Brambilla
Knowledge in the world continuously evolves, and ontologies are largely incomplete, especially regarding data belonging to the so-called long tail. We propose a method for discovering emerging knowledge by extracting it from social content. Once initialized by domain experts, the method is capable of finding relevant entities by means of a mixed syntactic-semantic method. The method uses seeds, i.e. prototypes of emerging entities provided by experts, for generating candidates; then, it associates candidates to feature vectors built by using terms occurring in their social content and ranks the candidates by using their distance from the centroid of seeds, returning the top candidates. Our method can run iteratively, using the results as new seeds.
In this paper we address the following research questions: (1) How does the reconstructed domain knowledge evolve if the candidates of one extraction are recursively used as seeds (2) How does the reconstructed domain knowledge spread geographically (3) Can the method be used to inspect the past, present, and future of knowledge (4) Can the method be used to find emerging knowledge?.
This work was presented at The Web Conference 2018, MSM workshop.
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Xiaohan Zeng
The advent of the social networks has completely changed our daily life. The deluge of data collected on Social Network Services (SNS) and recent developments in complex network theory have enabled many marvelous predictive analysis, which tells us many amazing stories.
Why do we often feel that "the world is so small?" Is the six-degree separation purely imagination or based on mathematical insights? Why are there just a few rockstars who enjoy extreme popularity while most of us stay unknown to the world? When science meets coffee shop knowledge, things are bound to be intriguing.
I will first briefly describe what social networks are, in the mathematical sense. Then I will introduce some ways to extract characteristics of networks, and how these analyses can explain many anecdotes in our life. Finally, I'll show an example of what we can learn from social network analysis, based on data from Groupon.
An introduction in the world of Social Network Analysis and a view on how this may help learning networks. History, data collection and several analysis techniques are shown.
Social Network Analysis (SNA) and its implications for knowledge discovery in...ACMBangalore
Social Network Analysis (SNA) and its implications for knowledge discovery in Informal Networks- Talk by Dr Jai Ganesh, SETLabs, Infosys at Search and Social Platforms tutorial, as part of Compute 2009, ACM Bangalore
Social Network Analysis power point presentation Ratnesh Shah
Basics of social network analysis,Application and also explain interesting study done by facebook , twitter, youtube and many more social media network ,slide contains some of interesting study to get knowledge about online social network analysis.
How to conduct a social network analysis: A tool for empowering teams and wor...Jeromy Anglim
Slides and details available at: http://jeromyanglim.blogspot.com/2009/10/how-to-conduct-social-network-analysis.html
A talk on using social network analysis as a team development tool.
LAK13 Tutorial Social Network Analysis 4 Learning Analyticsgoehnert
Slides of the tutorial "Computational Methods and Tools for Social Network Analysis Networked Learning Communities" at the LAK 2013 in Leuven.
Tutorial Homepage:
http://snatutoriallak2013.ku.de/index.php/SNA_tutorial_at_LAK_2013
Conference Homepage:
http://lakconference2013.wordpress.com/
2010 Catalyst Conference - Trends in Social Network AnalysisMarc Smith
Review of trends related to social network analysis in the enterprise. Presented at the 2010 Catalyst Conference in San Diego, CA july 29, 2010. Presented with Mike Gotta, Gartner Group.
2013 NodeXL Social Media Network AnalysisMarc Smith
Social media network analysis and visualization with NodeXL - the network overview discovery and exploration add-in for Excel. Map Twitter, Facebook, email, blogs, and the web with a point and click interface within the familiar spreadsheet.
Social network analysis [SNA] is the mapping and measuring of relationships and flows between people, groups, organizations, computers, URLs, and other connected information/knowledge entities. SNA provides both a visual and a mathematical analysis of human relationships.
Social Network Analysis (SNA) and its implications for knowledge discovery in...ACMBangalore
Social Network Analysis (SNA) and its implications for knowledge discovery in Informal Networks- Talk by Dr Jai Ganesh, SETLabs, Infosys at Search and Social Platforms tutorial, as part of Compute 2009, ACM Bangalore
Social Network Analysis power point presentation Ratnesh Shah
Basics of social network analysis,Application and also explain interesting study done by facebook , twitter, youtube and many more social media network ,slide contains some of interesting study to get knowledge about online social network analysis.
How to conduct a social network analysis: A tool for empowering teams and wor...Jeromy Anglim
Slides and details available at: http://jeromyanglim.blogspot.com/2009/10/how-to-conduct-social-network-analysis.html
A talk on using social network analysis as a team development tool.
LAK13 Tutorial Social Network Analysis 4 Learning Analyticsgoehnert
Slides of the tutorial "Computational Methods and Tools for Social Network Analysis Networked Learning Communities" at the LAK 2013 in Leuven.
Tutorial Homepage:
http://snatutoriallak2013.ku.de/index.php/SNA_tutorial_at_LAK_2013
Conference Homepage:
http://lakconference2013.wordpress.com/
2010 Catalyst Conference - Trends in Social Network AnalysisMarc Smith
Review of trends related to social network analysis in the enterprise. Presented at the 2010 Catalyst Conference in San Diego, CA july 29, 2010. Presented with Mike Gotta, Gartner Group.
2013 NodeXL Social Media Network AnalysisMarc Smith
Social media network analysis and visualization with NodeXL - the network overview discovery and exploration add-in for Excel. Map Twitter, Facebook, email, blogs, and the web with a point and click interface within the familiar spreadsheet.
Social network analysis [SNA] is the mapping and measuring of relationships and flows between people, groups, organizations, computers, URLs, and other connected information/knowledge entities. SNA provides both a visual and a mathematical analysis of human relationships.
Current trends of opinion mining and sentiment analysis in social networkseSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
Activating Research Collaboratories with Collaboration PatternsCommunitySense
This presentation explains how collaborative communities require evolving socio-technical systems. Collaboration patterns are important to design these systems and capture lessons learnt. The role of librarians as collaboration pattern stewards and collaborative working system architects is outlined.
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...inventionjournals
The problem of web search time complexity and accuracy has been visited in many research papers, and the authors discussed many approaches to improve the search performance. Still the approaches does not produce any noticeable improvement and struggles with more time complexity as well. To overcome the issues identified, an efficient multi mode conceptual clustering algorithm has been discussed in this paper, which identifies the similar interested user groups by clustering their search context according to different conceptual queries. Identified user groups are shared with the related conceptual queries and their results to reduce the time complexity. The multi mode conceptual clustering, performs grouping of search queries and users according to number of users and their search pattern. The concept of search is identified by using Natural language processing methods and the web logs produced by the default web search engines. The author designed a dedicated web interface to collect the web log about the user search and the same data has been used to cluster the social groups according to number of conceptual queries. The search results has been shared between the users of identified social groups which reduces the search time complexity and improves the efficiency of web search in better manner
IJERA (International journal of Engineering Research and Applications) is International online, ... peer reviewed journal. For more detail or submit your article, please visit www.ijera.com
IJERA (International journal of Engineering Research and Applications) is International online, ... peer reviewed journal. For more detail or submit your article, please visit www.ijera.com
Knowledge graph-based method for solutions detection and evaluation in an on...IJECEIAES
Online communities are a real medium for human experiences sharing. They contain rich knowledge of lived situations and experiences that can be used to support decision-making process and problem-solving. This work presents an approach for extracting, representing, and evaluating components of problem-solving knowledge shared in online communities. Few studies have tackled the issue of knowledge extraction and its usefulness evaluation in online communities. In this study, we propose a new approach to detect and evaluate best solutions to problems discussed by members of online communities. Our approach is based on knowledge graph technology and graphs theory enabling the representation of knowledge shared by the community and facilitating its reuse. Our process of problem-solving knowledge extraction in online communities (PSKEOC) consists of three phases: problems and solutions detection and classification, knowledge graph constitution and finally best solutions evaluation. The experimental results are compared to the World Health Organization (WHO) model chapter about Infant and young child feeding and show that our approach succeed to extract and reveal important problem-solving knowledge contained in online community’s conversations. Our proposed approach leads to the construction of an experiential knowledge graph as a representation of the constructed knowledge base in the community studied in this paper.
Integrated expert recommendation model for online communitiesst02IJwest
Online communities have become vital places for Web 2.0 users to share knowledg
e and experiences.
Recently, finding expertise user in community has become an important research issue. This paper
proposes a novel cascaded model for expert recommendation using aggregated knowledge extracted from
enormous contents and social network fe
atures. Vector space model is used to compute the relevance of
published content with respect
to a specific query while PageRank
algorithm is applied to rank candidate
experts. The experimental results sho
w that the proposed model is
an effective recommen
dation which can
guarantee that the most candidate experts are both highly relevant to the specific queries and highly
influential in corresponding areas
Hierarchical Transformers for User Semantic Similarity - ICWE 2023Marco Brambilla
We discuss the use of hierarchical transformers for user semantic similarity in the context of analyzing users' behavior and profiling social media users. The objectives of the research include finding the best model for computing semantic user similarity, exploring the use of transformer-based models, and evaluating whether the embeddings reflect the desired similarity concept and can be used for other tasks.
We use a large dataset of Twitter users and apply an automatic labeling approach. The dataset consists of English tweets posted in November and December 2020, totaling about 27GB of compressed data. Preprocessing steps include filtering out short texts, cleaning user connections, and selecting a benchmark set of users for evaluation.
Since Transformer architectures are known to work well on short text, we cannot use them on extensive collections of tweets describing the activity of a user. Therefore, we propose a hierarchical structure of transformer models to be used first on tweets and then on their aggregations.
The models used in the study include hierarchical transformers, and the tweet embeddings are obtained using four Transformer-based models: RoBERTa2, BERTweet3, Sentence BERT4, and Twitter4SSE5. The researchers test different techniques for processing tweet embeddings to generate accurate user embeddings, including mean pooling, recurrence over BERT (RoBERT), and transformer over BERT (ToBERT).
The evaluation of the models is done on a set of 5,000 users, comparing user similarities with 30 other candidate users, 5 of which are considered similar and 25 considered dissimilar. The evaluation metrics used include mean average precision (MAP), mean reciprocal rank (MRR) at 10, and normalized discounted cumulative gain (nDCG).
The optimization process involves selecting a loss function and using the AdamW optimizer with specific hyperparameters. The results show that the hierarchical approach with a Stage-1 Twitter4SSE model and a Stage-2 Transformer model performs the best among the alternatives.
In conclusion, the research provides a large unbiased dataset for user similarity analysis, presents a hierarchical language model optimized for accurate user similarity computation, and validates the models' performance on similarity tasks, with potential applications to related problems.
The future work includes investigating the impact of time and topic drift on the models' performance.
Exploring the Bi-verse.A trip across the digital and physical ecospheresMarco Brambilla
The Web and social media are the environments where people post their content, opinions, activities, and resources. Therefore, a considerable amount of user-generated content is produced every day for a wide variety of purposes. On the other side, people live their everyday life immersed in the physical world, where society, economy, politics and personal relations continuously evolve. These two opposite and complementary environment are today fully integrated: they reflect each other and they interact with each other in a stronger and stronger way.
Exploring and studying content and data coming from both environments offers a great opportunity to understand the ever evolving modern society, in terms of topics of interest, events, relations, and behaviour.
In this speech I will discuss through business cases and socio-political scenarios how we can extract insights and understand reality by combining and analyzing data from the digital and physical world, so as to reach a better overall picture of reality itself. Along this path, we need to keep into account that reality is complex and varies in time, space and along many other dimensions, including societal and economic variables. The speech highlights the main challenges that need to be addressed and outlines some data science strategies that can be applied to tackle these specific challenges.
This slide deck has been presented as a keynote speech at WISE 2022 in Biarritz, France.
Trigger.eu: Cocteau game for policy making - introduction and demoMarco Brambilla
COCTEAU stands for "Co-Creating the European Union".
It's a project supported by the European Union whose objective is to involve citizens to cooperate alongside policy makers, contributing to build a better future.
Generation of Realistic Navigation Paths for Web Site Testing using RNNs and ...Marco Brambilla
A large audience of users and typically a long time frame are needed to produce sensible and useful log data, making it an expensive task.
To address this limit, we propose a method that focuses on the generation of REALISTIC NAVIGATIONAL PATHS, i.e., web logs .
Our approach is extremely relevant because it can at the same time tackle the problem of lack of publicly available data about web navigation logs, and also be adopted in industry for AUTOMATIC GENERATION OF REALISTIC TEST SETTINGS of Web sites yet to be deployed.
The generation has been implemented using deep learning methods for generating more realistic navigation activities, namely
Recurrent Neural Network, which are very well suited to temporally evolving data
Generative Adversarial Network: neural networks aimed at generating new data, such as images or text, very similar to the original ones and sometimes indistinguishable from them, that have become increasingly popular in recent years.
We run experiments using open data sets of weblogs as training, and we run tests for assessing the performance of the methods. Results in generating new weblog data are quite good with respect to the two evaluation metrics adopted (BLEU and Human evaluation).
Our study is described in detail in the paper published at ICWE 2020 – International Conference on Web Engineering with DOI: 10.1007/978-3-030-50578-3. It’s available online on the Springer Web site.
Analyzing rich club behavior in open source projectsMarco Brambilla
The network of collaborations in an open source project can reveal relevant emergent properties that influence its prospects of success.
In this work, we analyze open source projects to determine whether they exhibit a rich-club behavior, i.e., a phenomenon where contributors with a high number of collaborations (i.e., strongly connected within the collaboration network)
are likely to cooperate with other well-connected individuals. The presence or absence of a rich-club has an impact on the sustainability and robustness of the project.
For this analysis, we build and study a dataset with the 100 most popular projects in GitHub, exploiting connectivity patterns in the graph structure of collaborations that arise from commits, issues and pull requests. Results show that rich-club behavior is present in all the projects, but only few of them have an evident club structure. We compute coefficients both for single source graphs and the overall interaction graph, showing that rich-club behavior varies across different layers of software development. We provide possible explanations of our results, as well as implications for further analysis.
Analysis of On-line Debate on Long-Running Political Phenomena.The Brexit C...Marco Brambilla
In this study, we demonstrate that the computational social science is important to understand people behavior in political phenomena, and based on the long-running Brexit debate analysis on Twitter, we predict the public stance, discussion topics, and we measure the involvement of automated accounts and politicians’ social media accounts.
Driving Style and Behavior Analysis based on Trip Segmentation over GPS Info...Marco Brambilla
Over one billion cars interact with each other on the road every day. Each driver has his own driving style, which could impact safety, fuel economy and road congestion. Knowledge about the driving style of the driver could be used to encourage ``better" driving behaviour through immediate feedback
while driving, or by scaling auto insurance rates based on the aggressiveness of the driving style.
In this work we report on our study of driving behaviour profiling based on unsupervised data mining methods. The main goal is to detect the different driving behaviours, and thus to cluster drivers with similar behaviour.
This paves the way to new business models related to the driving sector, such as Pay-How-You-Drive insurance
policies and car rentals.
Driver behavioral characteristics are studied by collecting information from GPS sensors on the cars and by applying three different analysis approaches (DP-means, Hidden Markov Models, and Behavioural Topic Extraction) to the contextual scene detection problems on car trips, in order to detect different
behaviour along each trip. Subsequently, drivers are clustered in similar profiles based on that and the results are compared with a human-defined groundtruth on drivers classification. The proposed framework is tested on a real dataset containing sampled car signals. While the different approaches show relevant differences in trip segment classification, the coherence of the final driver clustering results is surprisingly high.
Myths and challenges in knowledge extraction and analysis from human-generate...Marco Brambilla
For centuries, science (in German "Wissenschaft") has aimed to create ("schaften") new knowledge ("Wissen") from the observation of physical phenomena, their modelling, and empirical validation. Recently, a new source of knowledge has emerged: not (only) the physical world any more, but the virtual world, namely the Web with its ever-growing stream of data materialized in the form of social network chattering, content produced on demand by crowds of people, messages exchanged among interlinked devices in the Internet of Things. The knowledge we may find there can be dispersed, informal, contradicting, unsubstantiated and ephemeral today, while already tomorrow it may be commonly accepted. The challenge is once again to capture and create knowledge that is new, has not been formalized yet in existing knowledge bases, and is buried inside a big, moving target (the live stream of online data). The myth is that existing tools (spanning fields like semantic web, machine learning, statistics, NLP, and so on) suffice to the objective. While this may still be far from true, some existing approaches are actually addressing the problem and provide preliminary insights into the possibilities that successful attempts may lead to.
The talk explores the mixed realistic-utopian domain of knowledge extraction and reports on some tools and cases where digital and physical world have brought together for better understanding our society.
Harvesting Knowledge from Social Networks: Extracting Typed Relationships amo...Marco Brambilla
Knowledge bases like DBpedia, Yago or Google's Knowledge
Graph contain huge amounts of ontological knowledge harvested from
(semi-)structured, curated data sources, such as relational databases or
XML and HTML documents. Yet, the Web is full of knowledge that is
not curated and/or structured and, hence, not easily indexed, for ex-
ample social data. Most work so far in this context has been dedicated
to the extraction of entities, i.e., people, things or concepts. This poster
describes our work toward the extraction of relationships among entities.
The objective is reconstructing a typed graph of entities and relation-
ships to represent the knowledge contained in social data, without the
need for a-priori domain knowledge. The experiments with real datasets
show promising performance across a variety of domains.
The key distinguishing
feature of the work is its focus on highly unstructured social data (tweets and
Facebook posts) without reliable grammar structures. Traditional relation extraction approaches supervised , semi-supervised or unsupervised,
commonly assume the availability of grammatically correct language corpora.
Model-driven Development of User Interfaces for IoT via Domain-specific Comp...Marco Brambilla
Internet of Things technologies and applications are evolving and continuously gaining traction in all fields and environments, including homes, cities, services, industry and commercial enterprises. However, still many problems need to be addressed. For instance, the
IoT vision is mainly focused on the technological and infrastructure aspect, and on the management and analysis of the huge amount of generated data, while so far the development of front-end and user interfaces for
IoT has not played a relevant role in research. On the contrary, user interfaces in the IoT ecosystem they can play a key role in the acceptance of solutions by final adopters. In this paper we present a model-driven approach to the design of IoT interfaces, by defining a specific visual design language and design patterns for IoT\ applications, and we show them at work. The language we propose is defined as an extension of the OMG standard language called IFML.
A Model-Based Method for Seamless Web and Mobile Experience. Splash 2016 conf.Marco Brambilla
Consumer-centered software applications nowadays are required
to be available both as mobile and desktop versions.
However, the app design is frequently made only for one of
the two (i.e., mobile first or web first) while missing an appropriate
design for the other (which, in turn, simply mimics
the interaction of the first one). This results into poor quality
of the interaction on one or the other platform. Current solutions
would require different designs, to be realized through
different design methods and tools, and that may require to
double development and maintenance costs.
In order to mitigate such an issue, this paper proposes a
novel approach that supports the design of both web and mobile
applications at once. Starting from a unique requirement
and business specification, where web– and mobile–specific
aspects are captured through tagging, we derive a platform independent
design of the system specified in IFML. This
model is subsequently refined and detailed for the two platforms,
and used to automatically generate both the web and
mobile versions. If more precise interactions are needed for
the mobile part, a blending with MobML, a mobile-specific
modeling language, is devised. Full traceability of the relations
between artifacts is granted.
The Web Science course focuses on the study of large-scale socio-technical systems associated with the World Wide Web. It considers the relationship between people and technology, the ways that society and technology complement one another and the way they impact on broader society. These analyses are inherently associated with Big Data management issues.
The course is organised in four parts.
1. Syntax
In the first part, the course introduces the basis of content analysis. If focuses on the syntactic aspects, covering the fundamentals of natural language processing and text mining. It describes the structure and typical characteristics of the different web sources, spanning search results, social media contents, social network structures, Web APIs, and so on. It also provides an overview of the basic Web analysis techniques applied in Web search and Web recommendation.
2. Semantics
In the second part, the course presents semantic technologies. These technologies are very important nowadays because they allow to treat the "variety" dimension of Big Data, i.e., they enable integration of multiple and diverse sources of information, which is typical on the modern Web platform. Covered topics include:
- RDF - a flexible data model to represent heterogeneous data
- OWL - a flexible ontological language to model heterogeneous data sources
- SPARQL - a query language for RDF.
It shows how to put all the pieces together in order to achieve interoperability among heterogeneous information sources
3. Time
The third part covers the realm of temporal-dependent data. The topics covered here allow to treat the "velocity" dimension of Big Data. It shows the importance for many Big Data analysis scenarios to process data stream, coming for instance from Internet of Things (IoT) and Social Media sources; and it describes how to apply semantic and syntactic techniques in the context of time-dependent information. For instance, it shows how to extend RDF to model RDF streams, how to extend SPARQL to continuously process RDF streams and how to reason on those RDF Streams
4. Applications
In the fourth part, the course focuses on specific application scenarios and presents the typical settings and problems where the presented techniques can be applied. This part discusses settings such as: big data analysis for smart cities; data analytics for brand monitoring (marketing) and event monitoring; data analysis for trend detection and user engagement; and so on.
On the Quest for Changing Knowledge. Capturing emerging entities from social ...Marco Brambilla
Massive data integration technologies have been recently used to produce very large ontologies. However, knowledge in the world continuously evolves, and ontologies are largely incomplete for what concerns low-frequency data, belonging to the so-called long tail.
Socially produced content is an excellent source for discovering emerging knowledge: it is huge, and immediately
reflects the relevant changes which hide emerging entities. Thus, we propose a method for
discovering emerging entities by extracting them from social content.
We start from a purely-syntactic method as a baseline, and we propose a semantics-based method based on entity recognition and DBpedia. The method associates candidate entities to feature vectors, built
from social content by using co-occurrence, and then extracts the emerging entities by using feature similarity measures.
Once instrumented by experts through very simple initialization, the method is capable of finding emerging
entities and extracting their relevant relationships to given types; the method can be
continuously or periodically iterated, using the already identified emerged knowledge as new starting point.
We validate our method by applying it to a set of diverse domain-specific application scenarios, spanning fashion, literature, exhibitions and so on. We show the approach at work and we demonstrate its effectiveness on datasets with different characterization in terms of coverage, dynamics and size.
Studying Multicultural Diversity of Cities and Neighborhoods through Social M...Marco Brambilla
Cities are growing as melting pots of people with different culture, religion, and language. In this paper, through multilingual analysis of Twitter contents shared within a city, we analyze the prevalent language in the different neighborhoods of the city and we compare the results with census data, in order to highlight any parallelisms or discrepancies between the two data sources. We show that the officially identified neighborhoods are actually representing significantly different communities and that the use of the social media as a data source helps to detect those weak signals that are not captured from traditional data.
Model driven software engineering in practice book - Chapter 9 - Model to tex...Marco Brambilla
Slides for the mdse-book.com chapter 9 - Model-to-text transformations.
Complete set of slides now available:
Chapter 1 - http://www.slideshare.net/mbrambil/modeldriven-software-engineering-in-practice-chapter-1-introduction
Chapter 2 - http://www.slideshare.net/mbrambil/modeldriven-software-engineering-in-practice-chapter-2-mdse-principles
Chapter 3 - http://www.slideshare.net/jcabot/model-driven-software-engineering-in-practice-chapter-3-mdse-use-cases
Chapter 4 - http://www.slideshare.net/jcabot/modeldriven-software-engineering-in-practice-chapter-4
Chapter 5 - http://www.slideshare.net/mbrambil/modeldriven-software-engineering-in-practice-chapter-5-integration-of-modeldriven-in-development-processes
Chapter 6 - http://www.slideshare.net/jcabot/mdse-bookslideschapter6
Chapter 7 - http://www.slideshare.net/mbrambil/model-driven-software-engineering-in-practice-book-chapter-7-developing-your-own-modeling-language
Chapter 8 - http://www.slideshare.net/jcabot/modeldriven-software-engineering-in-practice-chapter-8-modeltomodel-transformations
Chapter 9 - https://www.slideshare.net/mbrambil/model-driven-software-engineering-in-practice-book-chapter-9-model-to-text-transformations-and-code-generation
Chapter 10 - http://www.slideshare.net/jcabot/mdse-bookslideschapter10managingmodels
This book discusses how approaches based on modeling can improve the daily practice of software professionals. This is known as Model-Driven Software Engineering (MDSE) or, simply, Model-Driven Engineering (MDE).
MDSE practices have proved to increase efficiency and effectiveness in software development. MDSE adoption in the software industry is foreseen to grow exponentially in the near future, e.g., due to the convergence of software development and business analysis.
This book is an agile and flexible tool to introduce you to the MDE and MDSE world, thus allowing you to quickly understand its basic principles and techniques and to choose the right set of MDE instruments for your needs so that you can start to benefit from MDE right away.
The book is organized into two main parts.
The first part discusses the foundations of MDSE in terms of basic concepts (i.e., models and transformations), driving principles, application scenarios and current standards, like the wellknown MDA initiative proposed by OMG (Object Management Group) as well as the practices on how to integrate MDE in existing development processes.
The second part deals with the technical aspects of MDSE, spanning from the basics on when and how to build a domain-specific modeling language, to the description of Model-to-Text and Model-to-Model transformations, and the tools that support the management of MDE projects.
The book covers a wide set of introductory and technical topics, spanning MDE at large, definitions and orientation in the MD* world, metamodeling, domain specific languages, model transformations, reverse engineering, OMG's MDA, UML, OCL, A
Model driven software engineering in practice book - chapter 7 - Developing y...Marco Brambilla
Slides for the mdse-book.com - Chapter 7: Developing Your Own Modeling Language.
Complete set of slides now available:
Chapter 1 - http://www.slideshare.net/mbrambil/modeldriven-software-engineering-in-practice-chapter-1-introduction
Chapter 2 - http://www.slideshare.net/mbrambil/modeldriven-software-engineering-in-practice-chapter-2-mdse-principles
Chapter 3 - http://www.slideshare.net/jcabot/model-driven-software-engineering-in-practice-chapter-3-mdse-use-cases
Chapter 4 - http://www.slideshare.net/jcabot/modeldriven-software-engineering-in-practice-chapter-4
Chapter 5 - http://www.slideshare.net/mbrambil/modeldriven-software-engineering-in-practice-chapter-5-integration-of-modeldriven-in-development-processes
Chapter 6 - http://www.slideshare.net/jcabot/mdse-bookslideschapter6
Chapter 7 - http://www.slideshare.net/mbrambil/model-driven-software-engineering-in-practice-book-chapter-7-developing-your-own-modeling-language
Chapter 8 - http://www.slideshare.net/jcabot/modeldriven-software-engineering-in-practice-chapter-8-modeltomodel-transformations
Chapter 9 - https://www.slideshare.net/mbrambil/model-driven-software-engineering-in-practice-book-chapter-9-model-to-text-transformations-and-code-generation
Chapter 10 - http://www.slideshare.net/jcabot/mdse-bookslideschapter10managingmodels
This book discusses how approaches based on modeling can improve the daily practice of software professionals. This is known as Model-Driven Software Engineering (MDSE) or, simply, Model-Driven Engineering (MDE).
MDSE practices have proved to increase efficiency and effectiveness in software development. MDSE adoption in the software industry is foreseen to grow exponentially in the near future, e.g., due to the convergence of software development and business analysis.
This book is an agile and flexible tool to introduce you to the MDE and MDSE world, thus allowing you to quickly understand its basic principles and techniques and to choose the right set of MDE instruments for your needs so that you can start to benefit from MDE right away.
The first part discusses the foundations of MDSE in terms of basic concepts (i.e., models and transformations), driving principles, application scenarios and current standards, like the wellknown MDA initiative proposed by OMG (Object Management Group) as well as the practices on how to integrate MDE in existing development processes.
The second part deals with the technical aspects of MDSE, spanning from the basics on when and how to build a domain-specific modeling language, to the description of Model-to-Text and Model-to-Model transformations, and the tools that support the management of MDE projects.
The book covers the MD* world, metamodeling, domain specific languages, model transformations, reverse engineering, OMG's MDA, UML, OCL, ATL, QVT, MOF, Eclipse, EMF, GMF, TCS, xText.
Grow Your Reddit Community Fast.........SocioCosmos
Sociocosmos helps you gain Reddit followers quickly and easily. Build your community and expand your influence.
https://www.sociocosmos.com/product-category/reddit/
Buy Pinterest Followers, Reactions & Repins Go Viral on Pinterest with Socio...SocioCosmos
Get more Pinterest followers, reactions, and repins with Sociocosmos, the leading platform to buy all kinds of Pinterest presence. Boost your profile and reach a wider audience.
https://www.sociocosmos.com/product-category/pinterest/
Enhance your social media strategy with the best digital marketing agency in Kolkata. This PPT covers 7 essential tips for effective social media marketing, offering practical advice and actionable insights to help you boost engagement, reach your target audience, and grow your online presence.
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...AJHSSR Journal
ABSTRACT: In the Malaysian context, small and medium enterprises (SMEs) experience a significant
burden of workplace accidents. A consensus among scholars attributes a substantial portion of these incidents to
human factors, particularly unsafe behaviors. This study, conducted in Malaysia's northern region, specifically
targeted Safety and Health/Human Resource professionals within the manufacturing sector of SMEs. We
gathered a robust dataset comprising 107 responses through a meticulously designed self-administered
questionnaire. Employing advanced partial least squares-structural equation modeling (PLS-SEM) techniques
with SmartPLS 3.2.9, we rigorously analyzed the data to scrutinize the intricate relationship between safety
behavior and safety performance. The research findings unequivocally underscore the palpable and
consequential impact of safety behavior variables, namely safety compliance and safety participation, on
improving safety performance indicators such as accidents, injuries, and property damages. These results
strongly validate research hypotheses. Consequently, this study highlights the pivotal significance of cultivating
safety behavior among employees, particularly in resource-constrained SME settings, as an essential step toward
enhancing workplace safety performance.
KEYWORDS :Safety compliance, safety participation, safety performance, SME
“To be integrated is to feel secure, to feel connected.” The views and experi...AJHSSR Journal
ABSTRACT: Although a significant amount of literature exists on Morocco's migration policies and their
successes and failures since their implementation in 2014, there is limited research on the integration of subSaharan African children into schools. This paperis part of a Ph.D. research project that aims to fill this gap. It
reports the main findings of a study conducted with migrant children enrolled in two public schools in Rabat,
Morocco, exploring how integration is defined by the children themselves and identifying the obstacles that they
have encountered thus far. The following paper uses an inductive approach and primarily focuses on the
relationships of children with their teachers and peers as a key aspect of integration for students with a migration
background. The study has led to several crucial findings. It emphasizes the significance of speaking Colloquial
Moroccan Arabic (Darija) and being part of a community for effective integration. Moreover, it reveals that the
use of Modern Standard Arabic as the language of instruction in schools is a source of frustration for students,
indicating the need for language policy reform. The study underlines the importanceof considering the
children‟s agency when being integrated into mainstream public schools.
.
KEYWORDS: migration, education, integration, sub-Saharan African children, public school
Your Path to YouTube Stardom Starts HereSocioCosmos
Skyrocket your YouTube presence with Sociocosmos' proven methods. Gain real engagement and build a loyal audience. Join us now.
https://www.sociocosmos.com/product-category/youtube/
Unlock TikTok Success with Sociocosmos..SocioCosmos
Discover how Sociocosmos can boost your TikTok presence with real followers and engagement. Achieve your social media goals today!
https://www.sociocosmos.com/product-category/tiktok/
Surat Digital Marketing School is created to offer a complete course that is specifically designed as per the current industry trends. Years of experience has helped us identify and understand the graduate-employee skills gap in the industry. At our school, we keep up with the pace of the industry and impart a holistic education that encompasses all the latest concepts of the Digital world so that our graduates can effortlessly integrate into the assigned roles.
This is the place where you become a Digital Marketing Expert.
Multilingual SEO Services | Multilingual Keyword Research | Filosemadisonsmith478075
Multilingual SEO services are essential for businesses aiming to expand their global presence. They involve optimizing a website for search engines in multiple languages, enhancing visibility, and reaching diverse audiences. Filose offers comprehensive multilingual SEO services designed to help businesses optimize their websites for search engines in various languages, enhancing their global reach and market presence. These services ensure that your content is not only translated but also culturally and contextually adapted to resonate with local audiences.
Visit us at -https://www.filose.com/
1. Marco Brambilla, Alireza Javadian Sabet,
and Amin Endah Sulistiawati
Conversation Graphs in
Online Social Media
2. Table of Content
Introduction
State-of-the-art in Conversation Graphs on Social Media
Rsearch Questions and Contributions
Case study
Proposed methodology
Intent Analysis
Network Generation
Pattern Identification
Conclusion
Refrences
3. Introduction
The emergence of social media (SM) has profoundly changed the perspective
of communication, which resulted in a revolution in the way people interact
with each other [1].
In online SM platforms, users can express their ideas by posting original
content or by adding comments and responses to existing posts, thus
generating virtual discussions and conversations.
For reasons such as: interacting within the inner circle of friendship,
entertainment purposes, subscribing to news, knowledge sharing purpose on
online learning, and Q&A platforms [2],[3].
Many companies adopt SM to utilize this growing trend to gain business
values [4].
4. State-of-the-art in Conversation Graphs on Social Media
Ning et al. [5] utilize graph analysis to better support Q&A systems.
Aumayr et al. [6] explore classification methods for recovering the reply
structures in forum threads.
Cogan et al. [7] propose a method to reconstruct complete conversations
around initial tweets.
Zayats and Ostendorf [8] predict the popularity of comments on Reddit
discussions.
Kumar et al. [9] propose a mathematical model for the generation of basic
conversation structure to explore the model humans follow during online
conversations.
Aragon et al. [10] investigate the impact of threading the messages instead of
linearly displaying them.
5. Research Questions
RQ1: How to define appropriate graph models representing conversations on
SM?
RQ2: How to reconstruct complex conversations structures when they are not
explicitly tracked by the social network platforms?
RQ3: How to assign author intentions to posts and comments in the
conversations?
RQ4: How to identify recurrent discussion patterns in conversation graphs?
6. Contributions
A graph-based view on the discussions between social media contributors.
Retrieve popular patterns on online conversations.
Intent Analysis
Network Generation
Pattern Identification
7. Case Study: YourExpo2015 Game Challenge*
Long-running Live Event [11]
* http://www.socialmediaexpo2015.com/yourexpo/
15,000 Photos 600,000 Actions
100,000 Comments 80,000 Participants
12. Sentiment Analysis vs. Intent Analysis
General sentiment analysis: Positive, Negative, and Neutral.
Intent analysis: provides more understanding of the communication patterns
of the online users.
20. Comment category distribution (min 30 comments)
Distribution of comment categories on conversations having minimum 30 comments
(number of comments)
21. Comment category distribution (7-29 comments)
Distribution of comment categories on conversations having number of comments between 7 and 29
(number of comments)
26. Reply time in thank thank thank positive pattern
27. Number of user in top conversation patterns
The number of users that join the top conversation patterns
3 Nodes 4 Nodes
28. Conclusion
By identifying the relationships among all comments on an SM post:
Retrieve the discussions
Construct the conversation graphs
By mining the constructed conversation graphs:
Identify the popular conversation patterns
Provide interesting insights into the SM users’ preferences and behaviors
Useful for designing conversational agents
29. References
[1] Qualman, Erik. Socialnomics: How social media transforms the way we live and do business. John Wiley & Sons, 2012.
[2] Al-Atabi, Mushtak, and Jennifer DeBoer. "Teaching entrepreneurship using massive open online course (MOOC)." Technovation 34.4
(2014): 261-264.
[3] Vasilescu, Bogdan, et al. "How social Q&A sites are changing knowledge sharing in open source software communities." Proceedings
of the 17th ACM conference on Computer supported cooperative work & social computing. 2014.
[4] Dong, John Qi, and Weifang Wu. "Business value of social media technologies: Evidence from online user innovation communities."
The Journal of Strategic Information Systems 24.2 (2015): 113-127.
[5] Yang, Jaewon, Julian McAuley, and Jure Leskovec. "Community detection in networks with node attributes." 2013 IEEE 13th
international conference on data mining. IEEE, 2013.
[6] Aumayr, Erik, Jeffrey Chan, and Conor Hayes. "Reconstruction of threaded conversations in online discussion forums." Proceedings of
the International AAAI Conference on Web and Social Media. Vol. 5. No. 1. 2011.
[7] Cogan, Peter, et al. "Reconstruction and analysis of twitter conversation graphs." Proceedings of the First ACM International Workshop
on Hot Topics on Interdisciplinary Social Networks Research. 2012.
[8] Zayats, Victoria, and Mari Ostendorf. "Conversation modeling on reddit using a graph-structured lstm." Transactions of the Association
for Computational Linguistics 6 (2018): 121-132.
[9] Kumar, Ravi, Mohammad Mahdian, and Mary McGlohon. "Dynamics of conversations." Proceedings of the 16th ACM SIGKDD
international conference on Knowledge discovery and data mining. 2010.
[10] Aragón, Pablo, Vicenç Gómez, and Andreaks Kaltenbrunner. "To thread or not to thread: The impact of conversation threading on
online discussion." Proceedings of the International AAAI Conference on Web and Social Media. Vol. 11. No. 1. 2017.
[11] Brambilla, Marco, Alireza Javadian Sabet, and Marjan Hosseini. "The role of social media in long-running live events: The case of the
Big Four fashion weeks dataset." Data in Brief 35 (2021): 106840.
Hello,
My name is Alireza Javadian Sabet from Politecnico di Milano and I am presenting the work entitled “Conversation Graphs in Online Social Media” on behalf of my co-authors Marco Brambilla and Amin Endah Sulistiawati.
In this presentation, after the introduction, I will discuss briefly the state-of-the-art in conversation graphs on social media.
Then, I will explain the research questions and contributions of this work.
I will continue the presentation by a discussion on the case study, after which I will detail the proposed methodology and the results of the study.
The emergence of social media has profoundly changed the perspective of communication, which resulted in a revolution in the way people interact with each other.
In online social media platforms, users can express their ideas by posting original content or by adding comments and responses to existing posts, thus generating virtual discussions and conversations.
Reasons of people relying on social media platforms include, but are not limited to, interacting within the inner circle of friendship, entertainment purposes, or subscribing to news.
Also as presented in various work, the use of social media is evolving widely for knowledge sharing purpose on online learning and Question Answering platforms.
Moreover, many companies adopt social media to utilize this growing trend to gain business values.
In the following, I list some of the state-of-the-art in Conversation Graphs on Social Media.
Utilizing graph analysis to better support QA systems.
Exploring classification methods for recovering the reply structures in forum threads.
Reconstructing complete conversations around initial tweets.
Predicting the popularity of comments on Reddit discussions.
Generation of basic conversation structure to explore the model that humans follow during online conversations.
And,
Investigating the impact of threading the messages instead of linearly displaying them.
The research questions addressed in this work are as follows:
First, How to define appropriate graph models representing conversations on Social Media?
Second: How to reconstruct complex conversations structures when they are not explicitly tracked by the social network platforms?
Third: How to assign author intentions to posts and comments in the conversations?
And the last research question is How to identify recurrent discussion patterns in conversation graphs?
The proposal of this work is to offer a graph-based view on the discussions between social media contributors and to retrieve popular patterns on online conversations using network-based analysis.
The proposed solution consists of three main stages: intent analysis, network generation, and pattern identification.
We tested the proposed methodology on a real long-running live event, i.e., a game challenge developed for EXPO Milano 2015.
The game was based on Instagram posts, which are tagged by specific hashtags published every week by the event.
During the whole challenge cycle of nine weeks, we collected a large dataset containing more than 15,000 photos and 600,000 actions, including near 100,000 comments, thanks to the engagement of more than 80,000 participants.
Lets start with the “Intent Analysis”
Users’ intention described within the posts and comments is detected using the following pipeline.
Initially, a list of category names is defined using popular keywords based on a set of bag-of-words.
When the label names are set, keyword-based classification is performed to put a class label on each social media comment representing its meaning.
Naïve Bayes and Support Vector Machine algorithms are then employed to improve the categorization process on the remaining uncategorized comments.
A continuous human-in-the-loop approach further improves the keyword-based classification.
The method categorizes 90% of the comments with 98% accuracy on the case study.
The following table presents the bag of words in the form of their base as well as the number of occurrences.
Observing words represented in the bold form is interesting where each of them represents a different intention which will be discussed in the following slide.
With a subjective assumption, we conclude that the suitable categories for Instagram contents associated to the case study data are as follows:
thank, congratulation, agreement, positive, invitation, food, greeting, question, hashtag, and other.
The hashtag category denotes the type of comments that only contain words started with hash # that may intend to specific information.
The other category appoints to Instagram comments, which cannot be assigned to any other class.
The reason for selecting those 10 categories, instead of a general sentiment analysis composed of positive, negative, and neutral is because we perform analysis on the data from social media challenge that has engaged a significant number of users.
In this study, we want to determine their intention and opinion about the game.
We expect with more categories would come the better understanding.
In the following slides I will describe the “Network Generation” step of the proposed methodology.
To understand the users' communication patterns, we build conversation graphs starting from the hierarchical structure of posts and comments, using a directed multigraph network composed of 462,000 nodes and 1.5 Million edges.
The figure on the left presents a general social network design representing relationships among all components, such as posts, users, comments, locations, etc.
The figure on the right displays a graph illustration of a post on social media.
The path destination is needed, for instance, to describe the relationship between comment nodes within a conversation and to track which comment's sequence.
This is the reason for designing a directed multigraph for this study.
Meanwhile, a multigraph is selected since there are possibly multiple edges connecting two nodes.
Attributes of each node and edge from the graph depict the information needed for our analysis.
Finally, the generated graph is stored in a graph file to be used for the analysis.
The figure on the left visualizes 3 posts, shown as the blue centres of the clusters, and associated relevant nodes that are: users, comments, hashtags, locations, and so on.
The figure on the right shows the results of the intent analysis over the conversation about a specific photo.
A reply edge connects one comment to one or more comments.
These relationships portray the opinion exchange between the users.
The figure on the left visualizes 3 posts, shown as the blue centres of the clusters, and associated relevant nodes that are: users, comments, hashtags, locations, and so on.
The figure on the right shows the results of the intent analysis over the conversation about a specific photo.
A reply edge connects one comment to one or more comments.
These relationships portray the opinion exchange between the users.
In the rest of the presentation, I will present some of the results that we obtained.
The experiment is performed on 15,343 Instagram’s photos related to the case study.
The presented table shows the statistical analysis of the collection of all comments and retrieved conversations.
The analysis consists of the number of comments for each photo, the number of conversation retrieved per photo, and the number of comments (i.e., the membership) for each conversation.
As we can see the number of comments ranges from 0 to 328. If we exclude photos with no comment, the average number of comments is 7.
If we include a comment with no relationship with other comments, the maximum number of conversations extracted in all photos is 177.
On average, the size of the conversation is 2 nodes.
From all conversations in all photos, we obtain that the most extended conversation is a conversation with the highest size (i.e., 93 nodes).
The following figure displays the number of conversations that occurred in all posts.
X-axis is the conversation size and, Y- axis indicates the number of conversations in each conversation size.
Please note that the y-axis in not scaled.
A single comment that does not have a relation with any comment, has the highest frequency.
Conversations composed of 2 nodes are the most prevalent among all conversations.
As we can see, the frequency decreases gradually as the size of the conversation increases and most of the long conversations only occur once.
Since the purpose of this work is to understand Social Media's communication behaviors related to the challenge, we are interested in studying long conversations in popular photos.
Thus, we first perform our analysis on the photos with at least 30 comments.
The following plot describes the spread of intent categories.
As we can see positive and thank comments dominate all conversations.
Two other intent classes that appear almost in all variations of conversation size are greeting and question types.
Comments with invitation and agreement intention are slightly expressed in most conversations, whereas congratulation comments are only mentioned in some discussions. And as expected, thank is not stated in solo conversations, which is most likely in a real discussion.
Additionally, hashtag comments generally appear in single comment.
In longer discussions, users participate in the challenge generally talk about compliments, gratitude, and salutation.
Considering such online conversations, by investigating the figure, one might conclude that by increasing the conversation size, the portion of most of the categories will be dominated by a fewer number of categories.
In the end, Food is the third significant topic mostly carried out in discussions; however, it is barely mentioned in large conversations.
The second type of conversation analysis is described using all photos that have comments between 7 and 29.
In this analysis, the same as previous analysis thank and positive categories dominated the overall conversations.
The main difference with respect to the previous analysis is that the discussions related to the food category have been increased.
Similar to the previous analysis, agreement, congratulation, and invitation categories appear in low frequency.
The variety in the number of comments for each conversation drives another idea in the time analysis.
We would like to know if the time and length of conversation are correlated or not.
This figure displays the number of conversations with respect to the size (i.e., number of comments) and duration (i.e., elapsed time).
The calculation is done by subtracting the latest posted comment time and the first comment time.
Durations range from less than 5 minutes until longer than 1 week.
We were expecting that smaller conversations would takes less time than longer ones.
However, the result contradicts our assumption in general.
According to the figure, we can conclude that mostly smaller discussions possibly have a longer duration.
Conversations with 2 to 10 comments have all ranges of duration, while conversations composed of more than 10 comments tend to narrow the duration.
It shows that long discussions with conversation size greater than 10 do not take a duration of less than 15 minutes.
It is clearly stated that users involved in the discussion need time to write a comment reply.
Another proof states that longer conversations do not take more than 1 day to end the discussion.
For instance, a conversation that involves 93 comments takes time between 12 and 24 hours.
In conclusion, the small discussions can take a longer time to finish, while more extended conversations lean to finish discussion within 24 hours.
The next analysis is to identify relevant patterns in terms of content.
Results show that the most popular online discussion patterns obtained from conversation graphs resemble real-life interactions and communication.
The table illustrates a heat matrix that details the occurrences for each combination of intents in the case of two-node patterns (i.e., one comment followed by another).
The matrix columns represent the intent of the initial comment, while the rows represent the intent of a comment that replies to the initial one.
As we expected, the results indicate that the most popular pattern created in two nodes is:
thank → positive; in other words, a gratitude action is generally expressed after a compliment.
Similar rational behaviors which frequently happened are:
thank → thank,
positive → positive,
positive → greeting,
thank → invitation and so on.
These virtual characters imitate real-world communication manners.
It also reveals less popular combinations that most likely do not happen in direct communication, such as expressing agreement after a congratulation comment or congratulation after someone saying an invitation or even asking a question to someone who gives congratulation.
The other less possible pattern is hashtag comment used to reply to any other types of comments.
In conclusion, with combinations of all intention labels on the two linked comments, we can obtain digital communication behavior that similarly adopts real-life conversation.
Both most and least popular patterns are likely to happen also in daily communication.
Therefore, in the next stage of our analysis, we want to know how far we can expand the length of conversation paths.
The expansion method is initiated by selecting the most popular patterns.
In this case, we select intent combinations that have more than 1,000 occurrences.
They are including:
thank → positive,
positive → positive,
thank → thank,
thank → food
and
thank → greeting.
As we can see, the results show that the top pattern is: thank → thank → positive.
It replicates direct communication when a person says a compliment comment to their partner, then the partner replies to express their gratefulness, afterward, most likely, the first person replies with another gratitude comment.
Other popular patterns are reasonable as well. However, the number of occurrences decrease significantly from the most popular one.
From the retrieved patterns, we select top ones composed of 3 and 4 nodes to perform temporal analysis and analyze the number of users involved in the discussions.
In the first analysis, our idea is to find how long a user takes time to write a reply comment.
We pick thank → thank → positive pattern that has 1,254 occurrences in the whole conversation graphs.
This figure displays the diversity of reply times.
The first part of the chart shows the time needed for the last comment to reply the previous one and the second part is duration of the second comment to reply the first comment.
We observe that the reply time from the second comment to the first one mostly takes less than 5 minutes; as well as the period needed for the third comment to answer the second one.
However, some users need more than 1 week to reply to a comment.
On average, the time needed for the second comment to reply to the first one is from 12 to 24 hours, and the required period of the third comment to answer the second one is between 6 and 12 hours.
The second analysis is applied to the top pattern with 4 nodes:
thank → thank → thank → positive.
The result shows that the time needed for the second comment to reply the first comment varies in the range of 5 minutes to more than a week.
However, in other cases, for the third comment to answer the second one and the fourth comment to react to the third comment, the period taken is generally less than 5 minutes.
On average, the second comment takes 6 to 12 hours to respond to the previous comment.
The third comment requires 30 minutes to 1 hour to answer the second comment, and the fourth comment needs 3 to 6 hours to react to the third comment.
Another thing that interests us is how many users are involved in the conversations.
We analyze the top patterns with 3 nodes as well as 4 nodes and sum up the number of users that join the discussion.
Overall, two users participate in the conversations, and in some cases, 3 and 4 users have taken part in the discussions.
In conclusion, it is a natural behavior that when a compliment is presented at the beginning of the talk, and the following responses are all gratitude expressions, the number of people who talk is two.
This generally indicates that the first person expresses a positive opinion, then the second person says their gratitude, afterwards, the first person responds the comment, and so on.
In conclusion, by identifying the relationships among all comments on a Social Media post, the proposed methodology retrieves the discussions and constructs the conversation graphs.
Also, by mining the constructed conversation graphs, the method identifies the popular conversation patterns.
From one side, the patterns provide interesting insights into the Social Media users' preferences and behavior, on the other side, they can be used for designing conversational agents.
In this slide you can find the list of references discussed in this presentation.
Thanks for the time and attention.
Please do not hesitate to contact us for any question and suggestion.