4. Problem Statement
• Internet has now become a social media platform where people get every type of
information and share their views about it.
• Due to increase in popularity of web forms and blogs , sifting through all the data posted
their manually is impossible.
• The size of post and number of comments is increasing day by day and hence most of
the material found is not reliable.
• Sometime these views initiate a chain reaction of comments giving rise to illegal and
suspicions activities.
• propagation of copyrighted movies, aggressive messages, gambling and propaganda
against government.
• So their should be a system to check these view and comments to highlight those
platforms where illegal actions are performed or not.
5. Scope of Project
• Data Crawling from blogs
• Stemming algorithm to get root words
• Suspicious Activity detection technique
7. Significance
• Process is straightforward
• Natural Language processing is used
• Custom-based dictionary to cover every keyword
• Easy to use
8. Reasons
• Increased illegal activity on blogs and forums
• No regulatory mechanism
• Data is thought to be unimportant
• Increased user interest in blogs
• Manual checking of blogs is impossible
9. Complications
• Crawling data is not allowed
• Natural language Processing
• Getting root words from comments
Suspicious