ISTS
Abstract:
Social network services (SNS) are prevalent and have become important
communication platforms in daily life. Our algorithms process collections of
short posts on a specific topic from the well-known site Twitter and create short
summaries of these collections. The goal is to produce summaries that are
similar to what a human would write for the same collection of users' posts.
To this end, we model a novel incremental clustering problem for comment stream
summarization on SNS. Moreover, we propose the IncreSTS algorithm, which can
incrementally update clustering results with the latest incoming comments in real
time. Furthermore, we design an at-a-glance visualization interface to help users
easily and rapidly get an overview summary. From extensive experimental results
and a real case demonstration, we verify that IncreSTS possesses the advantages of
high efficiency, high scalability, and better handling of outliers.
Introduction:
A social network is a social structure made up of a set of social actors (such
as individuals or organizations) and a set of the dyadic ties between these actors.
The social network perspective provides a set of methods for analyzing the
structure of whole social entities as well as a variety of theories explaining the
patterns observed from the structures.
Due to the popularity and convenience of these platforms, celebrities,
corporations, and organizations also set up social pages to interact with their fans
and the public. Note that for each message, users are able to express their opinions
by forwarding it, giving it a like, and leaving comments on it. Not only is the
quantity of comments large, but the generation rate is also remarkably high.
It is unnecessary, and almost impossible, for users to go over the whole comment
list of each message. However, we may still want to know what the discussion
participants are talking about and what their opinions are. Moreover, celebrities
and corporations have a strong interest in understanding how their fans and
customers react to certain topics and content. With these motivations, we are
inspired to develop an advanced summarization technique targeting
comment streams in social network services. Numerous studies and systems have
proposed techniques and mechanisms to generate various types of summaries of
comment streams. One major category aims to extract representative and
significant comments from messy discussions.
Our proposed system does not focus on traditional comment streams that
usually express more complete information, such as the discussion on products or
movies. We target comment streams in SNS, which are written in a short text
style with casual language.
For each social message, our main objective is to cluster comments with
similar content together and generate a concise opinion summary for this message.
We want to discover how many different group opinions exist and provide an
overview of each group so that users can understand them easily and rapidly.
For instance, when Lady Gaga uploads a photo to an SNS, her fans may leave
hundreds or thousands of comments within a short period of time.
Some of them may say that she is very beautiful, while another group of fans
may think that the outfit is too weird. Still others may specifically discuss
her hairstyle in the photo.
Our goal is to develop an efficient and effective technique to identify the
clusters of these comments. Note that this problem is clearly different from
existing research and possesses numerous unique characteristics and challenges.
We explore the problem of incremental short text summarization on comment
streams from social network services. We model this problem as an incremental
clustering task and propose the Incremental Short Text Summarization (IncreSTS)
algorithm to discover the top-k clusters, which cover the different groups of
opinions towards one social message.
For each comment cluster, important and common terms are extracted to
construct a key-term cloud. This key-term cloud provides an at-a-glance
presentation from which users can easily and rapidly understand the main points
of the similar comments in a cluster. Moreover, representative comments in each
group are also identified.
Existing System:
In existing social network services, when a user uploads a photo, her fans may
leave hundreds or thousands of comments within a short period of time.
Some of them may say that she is very beautiful, while another group of fans
may think that the outfit is too weird.
Still others may specifically discuss her hairstyle in the photo.
Therefore, our goal is to develop an efficient and effective technique to identify
the clusters of these comments.
The quantity of comments may increase at a high rate right after a social
message is published. Moreover, different users may request the summary result at
any moment. For these reasons, an incremental approach is preferable: it can
immediately generate a summary based on the current comment stream and so meet
the real-time needs of this application.
The comments in social network services are usually short, and users widely
make use of informal and unstructured text that contains acronyms, shortened
words, etc.
This phenomenon increases the difficulty of determining the similarity
between comments. On the other hand, it is worth mentioning that instead of
emphasizing the quality of clustering, the most crucial point of this task is to
produce a general summary promptly so that users can easily get an overview of a
comment stream. This problem can therefore be modeled as a
clustering task. However, traditional clustering methods have several inherent
restrictions and cannot be directly applied here.
The computational complexity of existing methods is high, and they cannot
be straightforwardly adapted to satisfy the incremental need. Moreover, many
clustering algorithms require the number of desired clusters to be defined in
advance, which is unreasonable in this problem. In addition, there will be a lot
of outliers in a comment stream, meaning that without a good strategy for
selecting initial cluster centers, existing methods may be prone to poor results.
Disadvantages:
o Determining the similarity between comments is difficult.
o Cluster identification is difficult.
o Computational complexity is high.
o Due to the sparse information contained in each comment, these
approaches are not suitable for short text summarization,
especially when the number of comments is not large.
Problem Definition:
The problem is to discover the top-k groups such that the comments in the same
group express similar opinions while the comments belonging to different groups
express diverse points of view. Once a message is posted on SNS, users can leave
comments immediately and the number of comments may rise quickly and continuously.
Readers are usually unwilling to go over the whole list of comments, but
they may request to see the summary at any moment. We adopt the term vector
model, and therefore each comment is transformed into a set of n-gram terms by
the NLP module. Since informal and unstructured texts are widely used on SNS, we
also apply some heuristics to enhance the quality of the n-gram terms so that
they better represent each comment.
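The transformation of a comment into a set of n-gram terms can be sketched as follows. This is only an illustration: the source does not specify the tokenizer or the maximum n, so the regular expression, the lowercasing step, and the `max_n=2` default are assumptions, and `ngram_terms` is a hypothetical helper name.

```python
import re

def ngram_terms(comment, max_n=2):
    """Return the set of word n-grams (n = 1..max_n) found in a comment.

    Assumption: words are lowercase runs of letters, digits and apostrophes.
    """
    words = re.findall(r"[a-z0-9']+", comment.lower())
    terms = set()
    for n in range(1, max_n + 1):
        # Slide a window of n consecutive words over the comment.
        for i in range(len(words) - n + 1):
            terms.add(" ".join(words[i:i + n]))
    return terms

print(sorted(ngram_terms("She is beautiful!")))
```

Representing each comment as a set keeps the later similarity computation simple, because shared terms can be found with a set intersection.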
In such a context, whenever a request is received, the proposed IncreSTS
algorithm will efficiently produce the top-k groups of opinions in real time.
Repeatedly carrying out the complete clustering task is infeasible
due to its high complexity. For this reason, we design the IncreSTS algorithm in
an incremental manner, meaning that the clustering result of the previous phase
is reused to generate the current summary with the newly incoming comments.
For the visualization interface, representative terms are extracted to form
a key-term cloud for each group. Thus, users are provided with a concise,
informative, and at-a-glance presentation that helps them easily comprehend the
main points of the responses to one message on social network services.
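One simple way to pick the terms for a key-term cloud is to rank terms by how many comments in the group contain them. The source does not state which scoring IncreSTS uses, so this ranking and the helper name `key_terms` are assumptions made for illustration only.

```python
from collections import Counter

def key_terms(cluster, top_n=3):
    """Rank terms by the number of comments in the cluster that contain them.

    `cluster` is a list of term sets, one set per comment.
    """
    doc_freq = Counter(term for comment in cluster for term in set(comment))
    # Sort by descending frequency, then alphabetically so ties are deterministic.
    ranked = sorted(doc_freq.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:top_n]

group = [{"beautiful", "so"}, {"beautiful", "dress"}, {"beautiful", "so", "photo"}]
print(key_terms(group))
```

The counts returned with each term could be used to scale the font size of that term in the cloud.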
Proposed System:
We explore the problem of incremental short text summarization on comment
streams from social network services. We model this problem as an incremental
clustering task and propose the Incremental Short Text Summarization algorithm to
discover the top-k clusters, which cover the different groups of opinions towards
one social message. For each comment cluster, important and common terms are
extracted to construct a key-term cloud. This key-term cloud provides an
at-a-glance presentation from which users can easily and rapidly understand the
main points of the similar comments in a cluster.
Representative comments in each group are also identified. Our
objective is to generate an informative, concise, and impressive interface that
helps users get an overview understanding without reading all comments. Our
proposed system depends on a fully incremental algorithm that is almost
parameter-free and can handle the outlier problem.
The most significant advantage of our algorithm is its high efficiency:
it can generate clustering results with the latest incoming comments in
real time. These capabilities meet the needs of comment stream
summarization on social network services.
To verify the effectiveness of the IncreSTS algorithm, we collect real comment
streams from Facebook and conduct extensive experiments with comparative
methods to show the strength of our approach. We formulate a novel incremental
clustering problem based on the requirements of comment stream summarization
on SNS.
We propose the IncreSTS algorithm, which can incrementally update clustering
results with the latest incoming comments in real time. We design an at-a-glance
presentation, which is concise, informative, and impressive, to help users easily
and rapidly get an overview understanding of a comment stream.
Social network services are not restricted to well-known social websites
such as Facebook and Twitter; Web services that provide interaction
functionality for users can generally be included. For each word, punctuation
removal is applied to eliminate unnecessary punctuation marks
attached to the word. Moreover, we develop a heuristic process of redundant
character removal, designed for restoring words on SNS, where a casual language
style is commonly used.
Users often emphasize emotion by repeating characters in a word. This
phenomenon makes it difficult to correctly identify
the original words. To cope with this problem,
we finally examine each word to find out whether any character
appears consecutively more than three times. If this situation is detected,
the appended characters are regarded as redundant, and only one character is
retained.
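The two heuristics above, punctuation removal and redundant character removal, can be sketched as follows. The helper names are hypothetical, and "more than three times" is read here as a run of four or more identical characters; the source does not say how shorter runs or punctuation inside a word should be treated, so those are left untouched.

```python
import re
import string

def strip_punctuation(word):
    """Remove punctuation marks attached to the start or end of a word."""
    return word.strip(string.punctuation)

def collapse_repeats(word):
    """Collapse a run of more than three identical characters to one character."""
    # (.) captures a character; \1{3,} requires at least three further copies.
    return re.sub(r"(.)\1{3,}", r"\1", word)

def normalize(comment):
    """Apply both heuristics to every whitespace-separated word of a comment."""
    words = (collapse_repeats(strip_punctuation(w)) for w in comment.split())
    return [w for w in words if w]  # drop words that were pure punctuation

print(normalize("sooooo beautiful!!!"))
```

Note that a word such as "coool" is left unchanged under this reading, because the character is repeated only three times.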
Advantages:
o High efficiency.
o High scalability.
o Better handling of outliers.
o Justifies the practicability of Incremental Short Text Summarization
on the target problem.
Algorithm:
Input:  C: the set of clusters from the previous clustering result
        vnew: the newly incoming comment
        θr: the radius threshold
Output: the top-k clusters, i.e. the k clusters that contain the most comments

Ca = {Ci | Ci ∈ C and dis(vnew, Ci) is not infinite};
Cb = {Cj | Cj ∈ Ca and dis(vnew, Cj) < θr};
if Cb is not empty
    Add vnew into Cadded, the cluster in Cb that has the most comments;
    Initialize Cchanged = Ø;
    for each element Ci of Ca where Ci ≠ Cadded
        for each comment vj in Ci
            if dis(vj, Cadded) < θr
                Add vj into Cadded;
                Exclude vj from Ci;
                Cchanged = Cchanged ∪ {Ci};
    for each element Ci of Cchanged
        while V = {vj | vj ∈ Ci and dis(vj, Ci) ≥ θr} is not empty
            Exclude all elements in V from Ci;
            Try to add each comment in V into other clusters,
            from large to small sizes;
else
    Form a new cluster Cnew with the comment vnew;
    C = C ∪ {Cnew};
Output the top-k clusters in C, i.e. those that contain the most comments;
End
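The update step above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. The source does not define dis() or the cluster center, so here the center is assumed to be the terms found in at least half of a cluster's comments, and dis() is assumed to be the Jaccard distance to that center (infinite when no term is shared). Starting a new cluster for a comment that fits no other cluster is also an assumption.

```python
import math
from collections import Counter

def center(cluster):
    """Assumed cluster center: terms found in at least half of the comments."""
    counts = Counter(term for comment in cluster for term in comment)
    return {term for term, n in counts.items() if 2 * n >= len(cluster)}

def dis(comment, cluster):
    """Assumed distance: Jaccard distance to the center, infinite if no shared term."""
    ctr = center(cluster)
    shared = comment & ctr
    if not shared:
        return math.inf
    return 1.0 - len(shared) / len(comment | ctr)

def incre_sts(clusters, v_new, theta_r, k):
    """Insert v_new into `clusters` (modified in place) and return the top-k clusters."""
    c_a = [c for c in clusters if dis(v_new, c) != math.inf]
    c_b = [c for c in c_a if dis(v_new, c) < theta_r]
    if c_b:
        c_added = max(c_b, key=len)        # cluster in Cb with the most comments
        c_added.append(v_new)
        changed = []
        for c_i in c_a:
            if c_i is c_added:
                continue
            for v_j in list(c_i):          # iterate over a copy: c_i shrinks as we go
                if dis(v_j, c_added) < theta_r:
                    c_added.append(v_j)
                    c_i.remove(v_j)
                    if not any(c_i is c for c in changed):
                        changed.append(c_i)
        for c_i in changed:
            while True:
                far = [v for v in c_i if dis(v, c_i) >= theta_r]
                if not far:
                    break
                for v in far:
                    c_i.remove(v)
                for v in far:
                    others = sorted((c for c in clusters if c is not c_i),
                                    key=len, reverse=True)
                    target = next((c for c in others if dis(v, c) < theta_r), None)
                    if target is None:
                        clusters.append([v])   # assumption: start a new cluster
                    else:
                        target.append(v)
        clusters[:] = [c for c in clusters if c]   # drop clusters emptied by the moves
    else:
        clusters.append([v_new])
    return sorted(clusters, key=len, reverse=True)[:k]

# Usage: feed comments (as term sets) one at a time.
clusters = []
stream = [{"so", "beautiful"}, {"beautiful", "dress"},
          {"weird", "outfit"}, {"so", "beautiful", "photo"}]
for comment in stream:
    top = incre_sts(clusters, frozenset(comment), theta_r=0.7, k=2)
print([len(c) for c in top])
```

Because only the clusters that share a term with the new comment are examined, each update touches a small part of the previous result instead of re-clustering the whole stream.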
Conclusion:
To enable comment stream summarization on SNS, we
model a novel incremental clustering problem and propose the algorithm
Incremental Short Text Summarization, which can incrementally update clustering
results with the latest incoming comments in real time. With the output of
IncreSTS, we design a visualization interface consisting of basic information,
key-term clouds, and representative comments. This at-a-glance presentation helps
users easily and rapidly get an overview understanding of a comment stream. From
extensive experimental results and a real case demonstration, we verify that
IncreSTS possesses the advantages of high efficiency, high scalability, and better
handling of outliers, which justifies the practicability of IncreSTS on the target
problem.
Future Enhancement:
Given the high popularity of social network services, the number of comments for
a specific message may increase very quickly, and users will request to view the
summary of comments at any time. Moreover, since new messages appear
continuously, users generally view the summary of a specific message only once
and will not go back to browse the updated summary in the future. In such a
context, to immediately produce the latest top-k clusters, we propose the IncreSTS
algorithm, which has the capability of incremental update.

More Related Content

Viewers also liked

IST´s (Infecções Sexualmente Transmissíveis)
IST´s (Infecções Sexualmente Transmissíveis)IST´s (Infecções Sexualmente Transmissíveis)
IST´s (Infecções Sexualmente Transmissíveis)António Matias
 
Infecções Sexualmente Transmissíveis
Infecções Sexualmente TransmissíveisInfecções Sexualmente Transmissíveis
Infecções Sexualmente Transmissíveiseborges
 
Doenças Sexualmente Transmissíveis
Doenças Sexualmente TransmissíveisDoenças Sexualmente Transmissíveis
Doenças Sexualmente Transmissíveislipe98
 
2015 Upload Campaigns Calendar - SlideShare
2015 Upload Campaigns Calendar - SlideShare2015 Upload Campaigns Calendar - SlideShare
2015 Upload Campaigns Calendar - SlideShareSlideShare
 
What to Upload to SlideShare
What to Upload to SlideShareWhat to Upload to SlideShare
What to Upload to SlideShareSlideShare
 
Getting Started With SlideShare
Getting Started With SlideShareGetting Started With SlideShare
Getting Started With SlideShareSlideShare
 

Viewers also liked (8)

IST´s (Infecções Sexualmente Transmissíveis)
IST´s (Infecções Sexualmente Transmissíveis)IST´s (Infecções Sexualmente Transmissíveis)
IST´s (Infecções Sexualmente Transmissíveis)
 
Infecções Sexualmente Transmissíveis
Infecções Sexualmente TransmissíveisInfecções Sexualmente Transmissíveis
Infecções Sexualmente Transmissíveis
 
Doenças Sexualmente Transmissíveis
Doenças Sexualmente TransmissíveisDoenças Sexualmente Transmissíveis
Doenças Sexualmente Transmissíveis
 
DST
DSTDST
DST
 
Build Features, Not Apps
Build Features, Not AppsBuild Features, Not Apps
Build Features, Not Apps
 
2015 Upload Campaigns Calendar - SlideShare
2015 Upload Campaigns Calendar - SlideShare2015 Upload Campaigns Calendar - SlideShare
2015 Upload Campaigns Calendar - SlideShare
 
What to Upload to SlideShare
What to Upload to SlideShareWhat to Upload to SlideShare
What to Upload to SlideShare
 
Getting Started With SlideShare
Getting Started With SlideShareGetting Started With SlideShare
Getting Started With SlideShare
 

Similar to Ists

A novel incremental clustering for information extraction from social networks
A novel incremental clustering for information extraction from social networksA novel incremental clustering for information extraction from social networks
A novel incremental clustering for information extraction from social networkseSAT Journals
 
A novel incremental clustering for information extraction from social networks
A novel incremental clustering for information extraction from social networksA novel incremental clustering for information extraction from social networks
A novel incremental clustering for information extraction from social networkseSAT Journals
 
IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...
IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...
IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...IEEEFINALYEARSTUDENTPROJECTS
 
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...IEEEFINALYEARSTUDENTPROJECT
 
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...IEEEMEMTECHSTUDENTSPROJECTS
 
IRJET- Finding Related Forum Posts through Intention-Based Segmentation
IRJET-  	  Finding Related Forum Posts through Intention-Based SegmentationIRJET-  	  Finding Related Forum Posts through Intention-Based Segmentation
IRJET- Finding Related Forum Posts through Intention-Based SegmentationIRJET Journal
 
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...inventionjournals
 
Vol 7 No 1 - November 2013
Vol 7 No 1 - November 2013Vol 7 No 1 - November 2013
Vol 7 No 1 - November 2013ijcsbi
 
APPLYING OPINION MINING TO ORGANIZE WEB OPINIONS
APPLYING OPINION MINING TO ORGANIZE WEB OPINIONSAPPLYING OPINION MINING TO ORGANIZE WEB OPINIONS
APPLYING OPINION MINING TO ORGANIZE WEB OPINIONSIJCSEA Journal
 
Framework for Product Recommandation for Review Dataset
Framework for Product Recommandation for Review DatasetFramework for Product Recommandation for Review Dataset
Framework for Product Recommandation for Review Datasetrahulmonikasharma
 
Co extracting opinion targets and opinion words from online reviews based on ...
Co extracting opinion targets and opinion words from online reviews based on ...Co extracting opinion targets and opinion words from online reviews based on ...
Co extracting opinion targets and opinion words from online reviews based on ...redpel dot com
 
IRJET- Sentimental Prediction of Users Perspective through Live Streaming : T...
IRJET- Sentimental Prediction of Users Perspective through Live Streaming : T...IRJET- Sentimental Prediction of Users Perspective through Live Streaming : T...
IRJET- Sentimental Prediction of Users Perspective through Live Streaming : T...IRJET Journal
 
Product Analyst Advisor
Product Analyst AdvisorProduct Analyst Advisor
Product Analyst AdvisorIRJET Journal
 
Running head DEPRESSION PREDICTION DRAFT1DEPRESSION PREDICTI.docx
Running head DEPRESSION PREDICTION DRAFT1DEPRESSION PREDICTI.docxRunning head DEPRESSION PREDICTION DRAFT1DEPRESSION PREDICTI.docx
Running head DEPRESSION PREDICTION DRAFT1DEPRESSION PREDICTI.docxhealdkathaleen
 
EffectiveCrowdSourcingForProductFeatureIdeation v18
EffectiveCrowdSourcingForProductFeatureIdeation v18EffectiveCrowdSourcingForProductFeatureIdeation v18
EffectiveCrowdSourcingForProductFeatureIdeation v18Karthikeyan Rajasekharan
 

Similar to Ists (20)

A novel incremental clustering for information extraction from social networks
A novel incremental clustering for information extraction from social networksA novel incremental clustering for information extraction from social networks
A novel incremental clustering for information extraction from social networks
 
A novel incremental clustering for information extraction from social networks
A novel incremental clustering for information extraction from social networksA novel incremental clustering for information extraction from social networks
A novel incremental clustering for information extraction from social networks
 
Notes on mining social media updated
Notes on mining social media updatedNotes on mining social media updated
Notes on mining social media updated
 
IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...
IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...
IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...
 
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
 
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
 
IRJET- Finding Related Forum Posts through Intention-Based Segmentation
IRJET-  	  Finding Related Forum Posts through Intention-Based SegmentationIRJET-  	  Finding Related Forum Posts through Intention-Based Segmentation
IRJET- Finding Related Forum Posts through Intention-Based Segmentation
 
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...
 
Mood
MoodMood
Mood
 
Vol 7 No 1 - November 2013
Vol 7 No 1 - November 2013Vol 7 No 1 - November 2013
Vol 7 No 1 - November 2013
 
APPLYING OPINION MINING TO ORGANIZE WEB OPINIONS
APPLYING OPINION MINING TO ORGANIZE WEB OPINIONSAPPLYING OPINION MINING TO ORGANIZE WEB OPINIONS
APPLYING OPINION MINING TO ORGANIZE WEB OPINIONS
 
Framework for Product Recommandation for Review Dataset
Framework for Product Recommandation for Review DatasetFramework for Product Recommandation for Review Dataset
Framework for Product Recommandation for Review Dataset
 
Co extracting opinion targets and opinion words from online reviews based on ...
Co extracting opinion targets and opinion words from online reviews based on ...Co extracting opinion targets and opinion words from online reviews based on ...
Co extracting opinion targets and opinion words from online reviews based on ...
 
IRJET- Sentimental Prediction of Users Perspective through Live Streaming : T...
IRJET- Sentimental Prediction of Users Perspective through Live Streaming : T...IRJET- Sentimental Prediction of Users Perspective through Live Streaming : T...
IRJET- Sentimental Prediction of Users Perspective through Live Streaming : T...
 
Mingle spot project
Mingle spot projectMingle spot project
Mingle spot project
 
Mingle spot project
Mingle spot  project Mingle spot  project
Mingle spot project
 
Hacking UX
Hacking UXHacking UX
Hacking UX
 
Product Analyst Advisor
Product Analyst AdvisorProduct Analyst Advisor
Product Analyst Advisor
 
Running head DEPRESSION PREDICTION DRAFT1DEPRESSION PREDICTI.docx
Running head DEPRESSION PREDICTION DRAFT1DEPRESSION PREDICTI.docxRunning head DEPRESSION PREDICTION DRAFT1DEPRESSION PREDICTI.docx
Running head DEPRESSION PREDICTION DRAFT1DEPRESSION PREDICTI.docx
 
EffectiveCrowdSourcingForProductFeatureIdeation v18
EffectiveCrowdSourcingForProductFeatureIdeation v18EffectiveCrowdSourcingForProductFeatureIdeation v18
EffectiveCrowdSourcingForProductFeatureIdeation v18
 

Recently uploaded

Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupJonathanParaisoCruz
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitolTechU
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerunnathinaik
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 

Recently uploaded (20)

Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized Group
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 

Ists

  • 1. ISTS Abstract: Social network services (SNS) are prevalent and have become important communication platforms in daily life. Our algorithms process collections of short posts on a specific topic from the well-known site Twitter and create short summaries of them. The goal is to produce summaries similar to what a human would deliver from the same collection of users' posts. We therefore model a novel incremental clustering problem for comment stream summarization on SNS. Moreover, we propose the IncreSTS algorithm, which can incrementally update clustering results with the latest incoming comments in real time. Furthermore, we design an at-a-glance visualization interface to help users easily and rapidly get an overview summary. From extensive experimental results and a real case demonstration, we verify that IncreSTS possesses the advantages of high efficiency, high scalability, and better handling of outliers.
  • 2. Introduction: A social network is a social structure made up of a set of social actors (such as individuals or organizations) and a set of the dyadic ties between these actors. The social network perspective provides a set of methods for analyzing the structure of whole social entities as well as a variety of theories explaining the patterns observed in these structures. Due to the popularity and convenience of these platforms, celebrities, corporations, and organizations also set up social pages to interact with their fans and the public. For each message, users can express their opinions by forwarding it, giving it a like, and leaving comments on it. Not only is the quantity of comments large, but the generation rate is also remarkably high. It is unnecessary, and almost impossible, for users to go over the whole comment list of each message. However, we may still want to know what the participants are talking about and what their opinions are. Moreover, celebrities and corporations have a strong interest in understanding how their fans and customers react to certain topics and content. With these motivations, we are inspired to develop an advanced summarization technique targeting comment streams in social network services. Numerous studies and systems have proposed techniques and mechanisms to generate various types of summaries on
  • 3. comment streams. One major category aims to extract representative and significant comments from messy discussions. Our proposed system does not focus on traditional comment streams that usually express more complete information, such as discussions of products or movies. We target comment streams in SNS, which are written in a short text style with casual language. For each social message, our main objective is to cluster comments with similar content together and generate a concise opinion summary for the message. We want to discover how many different group opinions exist and provide an overview of each group so that users can understand them easily and rapidly. For instance, when Lady Gaga uploads a photo to an SNS, hundreds or thousands of comments are left by her fans within a short period of time. Some of them may say that she is very beautiful, while another group of fans may think that the outfit is too weird. Others may discuss the hairstyle in the photo in particular. Our goal is to develop an efficient and effective technique to identify the clusters of these comments. Note that this problem is clearly different from existing research and possesses numerous unique characteristics and challenges. We explore the problem of incremental short text summarization on comment streams from social network services. We model this problem as an incremental
  • 4. clustering task and propose the Incremental Short Text Summarization (IncreSTS) algorithm to discover the top-k clusters, which represent different groups of opinions towards one social message. For each comment cluster, important and common terms are extracted to construct a key-term cloud. This key-term cloud provides an at-a-glance presentation from which users can easily and rapidly understand the main points of similar comments in a cluster. Moreover, representative comments in each group are also identified.
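The key-term cloud described above can be illustrated with a small Python sketch. The slides do not say how terms are scored, so this sketch assumes a simple rule: a term is more important the more comments in the cluster contain it. The function name `key_terms` and the parameter `top_n` are hypothetical and chosen only for illustration.

```python
from collections import Counter


def key_terms(cluster_comments, top_n=5):
    """Rank the terms of one cluster for a key-term cloud.

    cluster_comments: iterable of term collections, one per comment.
    Assumption (not from the slides): a term's weight is the number of
    comments in the cluster that contain it.
    """
    counts = Counter()
    for terms in cluster_comments:
        # set() ensures a term is counted once per comment.
        counts.update(set(terms))
    # most_common returns (term, weight) pairs, heaviest first.
    return counts.most_common(top_n)


if __name__ == "__main__":
    cluster = [
        {"so", "beautiful"},
        {"beautiful", "dress"},
        {"beautiful", "so"},
    ]
    for term, weight in key_terms(cluster, top_n=2):
        print(term, weight)
```

The returned weights could drive the font size of each term in the cloud; any other scoring rule can be swapped in without changing the interface.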
  • 5. Existing System: When a user uploads a photo to a social network service, hundreds or thousands of comments may be left by fans within a short period of time. Some may say that the person in the photo is very beautiful, while another group of fans may think that the outfit is too weird. Others may discuss the hairstyle in the photo in particular. Therefore, our goal is to develop an efficient and effective technique to identify the clusters of these comments. The quantity of comments may increase at a high rate right after a social message is published. Moreover, different users may request the summary at any moment. For these reasons, an incremental approach is preferable: it can immediately generate a summary based on the current comment stream and so meet the real-time needs of this application. Comments on social network services are usually short, and users widely use informal and unstructured text that contains acronyms, shortened words, etc. This increases the difficulty of determining the similarity between comments. On the other hand, it is worth mentioning that instead of emphasizing the quality of clustering, the most crucial point of this task is to
  • 6. produce a general summary promptly so that users can easily get an overview of a comment stream. This problem can be modeled as a clustering task. However, traditional clustering methods have several inherent restrictions and cannot be directly applied here. The computational complexity of existing methods is high, and they cannot be straightforwardly adapted to satisfy the incremental need. Moreover, many clustering algorithms require the number of desired clusters to be defined in advance, which is unreasonable for this problem. In addition, a comment stream contains many outliers, so without a good strategy for selecting initial cluster centers, existing methods may be prone to poor results. Disadvantages:
      o Determining the similarity between comments is difficult.
      o Cluster identification is difficult.
      o Computational complexity is high.
      o Because each comment contains sparse information, these approaches are not suitable for short text summarization, especially when the number of comments is not large.
  • 7. Problem Definition: The goal is to discover top-k groups such that the comments in the same group express similar opinions while comments belonging to different groups express diverse points of view. Once a message is posted on SNS, users can leave comments immediately, and the number of comments may rise quickly and continuously. Readers are usually unwilling to go over the whole list of comments, but they may request the summary at any moment. We adopt the term vector model, so each comment is transformed into a set of n-gram terms by the NLP module. Since informal and unstructured texts are widely used on SNS, we also apply some heuristics to improve the quality of the n-gram terms so that they better represent each comment. In such a context, whenever a request is received, the proposed IncreSTS algorithm efficiently produces the top-k groups of opinions in real time. Repeatedly carrying out the complete clustering task is infeasible due to its high complexity. For this reason, we design the IncreSTS algorithm in an incremental manner, meaning that the clustering result of the previous phase is leveraged to generate the current summary with newly incoming comments. For the visualization interface, representative terms are extracted to form a key-term cloud for each group. Thus, users are provided with a concise, informative, and at-a-glance presentation that helps them easily comprehend the main points of the responses to one message on social network services.
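The transformation of a comment into n-gram terms can be sketched in Python as follows. This is a minimal illustration, not the authors' NLP module. It assumes word-level unigrams and bigrams, lowercasing, and whitespace tokenization, none of which the slides specify. The two cleaning steps follow the heuristics named in the Proposed System slide: punctuation attached to a word is removed, and a character repeated more than three times in a row is collapsed to a single character. The names `clean_word`, `ngram_terms`, and `max_n` are hypothetical.

```python
import re
import string

# A character followed by three or more copies of itself,
# i.e. a run of more than three identical characters.
_REPEATED = re.compile(r"(.)\1{3,}")


def clean_word(word):
    """Strip attached punctuation, then collapse long character runs."""
    word = word.strip(string.punctuation)
    # Keep only one character of each redundant run.
    return _REPEATED.sub(r"\1", word)


def ngram_terms(comment, max_n=2):
    """Return the set of word n-grams (n = 1..max_n) of a comment.

    Assumptions: lowercasing and whitespace tokenization.
    """
    words = [clean_word(w) for w in comment.lower().split()]
    words = [w for w in words if w]  # drop tokens that were only punctuation
    terms = set()
    for n in range(1, max_n + 1):
        for i in range(len(words) - n + 1):
            terms.add(" ".join(words[i:i + n]))
    return terms


if __name__ == "__main__":
    print(sorted(ngram_terms("Sooooo beautiful!!!")))
```

Note that the collapsing rule is lossy: a stretched "goooood" becomes "god" rather than "good", so a dictionary lookup after collapsing may be worth considering.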
  • 8. Proposed System: We explore the problem of incremental short text summarization on comment streams from social network services. We model this problem as an incremental clustering task and propose the Incremental Short Text Summarization (IncreSTS) algorithm to discover the top-k clusters, which represent different groups of opinions towards one social message. For each comment cluster, important and common terms are extracted to construct a key-term cloud. This key-term cloud provides an at-a-glance presentation from which users can easily and rapidly understand the main points of similar comments in a cluster. Representative comments in each group are also identified. Our objective is to generate an informative, concise, and impressive interface that helps users get an overview without reading all comments. Our proposed system depends on a fully incremental algorithm that is almost parameter-free and can handle the outlier problem. The most significant advantage of our algorithm is its high efficiency: it can generate clustering results with the latest incoming comments in real time. These capabilities meet the needs of comment stream summarization on social network services. To verify the effectiveness of the IncreSTS algorithm, we collect real comment streams from Facebook and conduct extensive experiments against comparative methods to show the strength of our approach. We model a novel incremental
  • 9. clustering problem based on the requirements of comment stream summarization on SNS. We propose the IncreSTS algorithm, which can incrementally update clustering results with the latest incoming comments in real time. We design an at-a-glance presentation, which is concise, informative, and impressive, to help users easily and rapidly get an overview of a comment stream. Social network services are not restricted to well-known social websites such as Facebook and Twitter; Web services that provide interaction functionality for users can generally be included. For each word, punctuation removal is applied to eliminate unnecessary punctuation marks attached to the word. Moreover, we develop a heuristic process of redundant character removal, designed to restore words on SNS. Casual language is commonly used on SNS, and users often emphasize emotion by repeating characters in a word. This prevents the original words from being identified correctly. To cope with this problem, we examine each word to find out whether any character appears consecutively more than three times. If so, the appended characters are regarded as redundant, and only one character is retained. Advantages: o High efficiency. o High scalability and better handling of outliers.
  • 10. o Justifies the practicability of Incremental Short Text Summarization on the target problem. Algorithm:
      Input:  C, the set of clusters from the previous clustering result
              vnew, the newly incoming comment
              θr, the radius threshold
      Output: the top-k clusters, i.e. the k clusters with the most comments
      Ca = {Ci | Ci ∈ C and dis(vnew, Ci) is not infinite};
      Cb = {Cj | Cj ∈ Ca and dis(vnew, Cj) < θr};
      if Cb is not empty
          Add vnew into Cadded, the cluster in Cb with the most comments;
          Initialize Cchanged = Ø;
          for each element Ci of Ca where Ci ≠ Cadded
              for each comment vj in Ci
                  if dis(vj, Cadded) < θr
                      Add vj into Cadded;
                      Exclude vj from Ci;
                      Cchanged = Cchanged ∪ {Ci};
          for each element Ci of Cchanged
              while V = {vj | vj ∈ Ci and dis(vj, Ci) ≥ θr} is not empty
                  Exclude all elements in V from Ci;
                  Try to add each comment in V into other clusters, from the largest to the smallest;
      else
          Form a new cluster Cnew with the comment vnew;
          C = C ∪ {Cnew};
  • 11. Output the top-k clusters in C, i.e. the k clusters with the most comments;
      End
      Conclusion: To enable comment stream summarization on SNS, we model a novel incremental clustering problem and propose the Incremental Short Text Summarization (IncreSTS) algorithm, which can incrementally update clustering results with the latest incoming comments in real time. With the output of IncreSTS, we design a visualization interface consisting of basic information, key-term clouds, and representative comments. This at-a-glance presentation helps users easily and rapidly get an overview of a comment stream. From extensive experimental results and a real case demonstration, we verify that IncreSTS possesses the advantages of high efficiency, high scalability, and better handling of outliers, which justifies the practicability of IncreSTS on the target problem.
  • 12. Future Enhancement: Given the high popularity of social network services, the number of comments for a specific message may increase very quickly, and users may request the summary of comments at any time. Moreover, since new messages appear continuously, users generally view the summary of a specific message only once and do not go back to browse the updated summary later. In such a context, to immediately produce the latest top-k clusters, we propose the IncreSTS algorithm, which has the capability of incremental update.
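The update step listed on the Algorithm slide can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the authors' implementation. The slides do not define dis or the cluster center, so the sketch assumes that the center is the set of terms used by at least half of a cluster's comments and that dis is the Jaccard distance between a comment's terms and that center, infinite when no term is shared. It also assumes that a comment which fits no other cluster during reassignment starts a new cluster, and that emptied clusters are dropped. The names `Cluster`, `dis`, `_reassign`, and `incre_sts_update` are hypothetical.

```python
import math
from collections import Counter


class Cluster:
    """A group of comments; each comment is a frozenset of terms."""

    def __init__(self, comments=()):
        self.comments = list(comments)

    def center(self):
        # Assumption: terms used by at least half of the comments.
        counts = Counter(t for c in self.comments for t in c)
        return {t for t, n in counts.items() if 2 * n >= len(self.comments)}


def dis(comment, cluster):
    """Assumed distance: Jaccard distance to the center, inf if disjoint."""
    center = cluster.center()
    shared = comment & center
    if not shared:
        return math.inf
    return 1.0 - len(shared) / len(comment | center)


def _reassign(v, source, clusters, theta_r):
    """Try other clusters from largest to smallest; else open a new one."""
    for c in sorted(clusters, key=lambda c: len(c.comments), reverse=True):
        if c is not source and dis(v, c) < theta_r:
            c.comments.append(v)
            return
    clusters.append(Cluster([v]))  # assumption: fallback not given in slides


def incre_sts_update(clusters, v_new, theta_r, k):
    """Insert v_new into clusters (in place) and return the k largest."""
    c_a = [c for c in clusters if dis(v_new, c) != math.inf]
    c_b = [c for c in c_a if dis(v_new, c) < theta_r]
    if c_b:
        c_added = max(c_b, key=lambda c: len(c.comments))
        c_added.comments.append(v_new)
        changed = []
        # Pull comments from related clusters that now fit c_added.
        for c_i in c_a:
            if c_i is c_added:
                continue
            for v_j in list(c_i.comments):
                if dis(v_j, c_added) < theta_r:
                    c_added.comments.append(v_j)
                    c_i.comments.remove(v_j)
                    if c_i not in changed:
                        changed.append(c_i)
        # Clusters that lost comments may no longer cover the rest.
        for c_i in changed:
            while True:
                far = [v for v in c_i.comments if dis(v, c_i) >= theta_r]
                if not far:
                    break
                for v in far:
                    c_i.comments.remove(v)
                for v in far:
                    _reassign(v, c_i, clusters, theta_r)
    else:
        clusters.append(Cluster([v_new]))
    clusters[:] = [c for c in clusters if c.comments]  # drop emptied clusters
    return sorted(clusters, key=lambda c: len(c.comments), reverse=True)[:k]
```

To process a stream, start from an empty list and call `incre_sts_update` once per incoming comment; the list carries the previous clustering result between calls, which is what makes the procedure incremental.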