The document describes the Twitter ISA (Intelligent Synthesis and Real Time Response) system. The Twitter ISA analyzes social media streams in real-time to identify events. It uses machine learning models to classify tweets by topic, like identifying traffic-related tweets. The system also evaluates different event detection techniques using sentiment analysis, active users, and social graphs. Additionally, it develops methods to distinguish event hashtags from noisy meme hashtags using classifiers. Evaluations show the Twitter ISA can accurately perform these tasks but that a 1% sample stream has limitations compared to the 10% garden hose sample for applications needing more detail.
This document discusses the use of mobile devices for accessing chemistry information and applications. It outlines some current chemistry applications available on mobile platforms, including eBooks, educational tools, multimedia resources, structure drawing tools, and apps from scientific publishers. The document also looks at the future of mobile chemistry apps, predicting growth in cloud-based paper management apps, tablet optimization, and online prediction algorithms and databases accessed through application programming interfaces.
This document describes a plagiarism checker system that uses the Levenshtein algorithm to detect plagiarism in text documents. The system allows users to import paragraphs to check for plagiarism against a database of stored paragraphs. It analyzes the imported text and ranks how similar it is to texts in the database using the Levenshtein algorithm, which measures the minimum number of single-character edits needed to change one word into another. The system aims to efficiently detect plagiarism in medium to large volumes of student work with less human effort than other plagiarism checkers.
Maltego is open source data mining software that maps relationships between nodes and connections. The document discusses using Maltego to analyze relationships in social networks by entering search terms and mapping hub users with many followers who connect different networks. This allows identifying important people to follow on social media to stay informed about topics like monoclonal antibodies in areas like science news.
Mobilizing science – chemistry in our handsDocumentStory
This document discusses the current and future state of mobile chemistry applications. It describes how smartphones and tablets can access electronic books, databases, and tools for tasks like balancing equations, drawing chemical structures, and looking up molecular properties. It envisions these applications becoming more integrated with online databases and predictive algorithms hosted in the cloud. The future may see more standardized APIs and services rather than dedicated apps, and tablets optimizing interfaces for their form factor. Overall, mobile devices are expanding access to chemistry resources anytime, anywhere.
Slides of session I presensented to my folks at University of Calgary on research paper on Mudflow and Flowdroid.
Links given below:
https://www.st.cs.uni-saarland.de/appmining/mudflow/
https://www.google.ca/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&uact=8&ved=0ahUKEwik583ola7XAhUX6WMKHQYXCnoQFggoMAA&url=https%3A%2F%2Fblogs.uni-paderborn.de%2Fsse%2Ftools%2Fflowdroid%2F&usg=AOvVaw1t13BQnA07LA9FA3O5wNvN
This is a presentation I gave at the Special Libraries Association 2010 conference in New Orleans regarding "Mobile Chemistry". What is the status of chemistry on mobile devices? How do Apps match up against mobile browser based approaches? What is the future of tablets/pads versus phones for accessing content? A review of chemistry apps, online chemistry databases, ChemSpider, Mobile ChemSpider, ChemSpider SyntheticPages
Automated eSource systems provide several benefits over traditional paper-based data collection methods:
1. eSource allows for timely data review by principal investigators and safety reviewers through real-time data quality checks and faster data availability.
2. It reduces errors by eliminating unnecessary data duplication and transcription errors, while also providing an accurate audit trail.
3. Remote monitoring is easier with eSource, which can promote real-time electronic data entry during subject visits and ensure data accuracy and completeness.
The document describes the Twitter ISA (Intelligent Synthesis and Real Time Response) system. The Twitter ISA analyzes social media streams in real-time to identify events. It uses machine learning models to classify tweets by topic, like identifying traffic-related tweets. The system also evaluates different event detection techniques using sentiment analysis, active users, and social graphs. Additionally, it develops methods to distinguish event hashtags from noisy meme hashtags using classifiers. Evaluations show the Twitter ISA can accurately perform these tasks but that a 1% sample stream has limitations compared to the 10% garden hose sample for applications needing more detail.
This document discusses the use of mobile devices for accessing chemistry information and applications. It outlines some current chemistry applications available on mobile platforms, including eBooks, educational tools, multimedia resources, structure drawing tools, and apps from scientific publishers. The document also looks at the future of mobile chemistry apps, predicting growth in cloud-based paper management apps, tablet optimization, and online prediction algorithms and databases accessed through application programming interfaces.
This document describes a plagiarism checker system that uses the Levenshtein algorithm to detect plagiarism in text documents. The system allows users to import paragraphs to check for plagiarism against a database of stored paragraphs. It analyzes the imported text and ranks how similar it is to texts in the database using the Levenshtein algorithm, which measures the minimum number of single-character edits needed to change one word into another. The system aims to efficiently detect plagiarism in medium to large volumes of student work with less human effort than other plagiarism checkers.
Maltego is open source data mining software that maps relationships between nodes and connections. The document discusses using Maltego to analyze relationships in social networks by entering search terms and mapping hub users with many followers who connect different networks. This allows identifying important people to follow on social media to stay informed about topics like monoclonal antibodies in areas like science news.
Mobilizing science – chemistry in our handsDocumentStory
This document discusses the current and future state of mobile chemistry applications. It describes how smartphones and tablets can access electronic books, databases, and tools for tasks like balancing equations, drawing chemical structures, and looking up molecular properties. It envisions these applications becoming more integrated with online databases and predictive algorithms hosted in the cloud. The future may see more standardized APIs and services rather than dedicated apps, and tablets optimizing interfaces for their form factor. Overall, mobile devices are expanding access to chemistry resources anytime, anywhere.
Slides of session I presensented to my folks at University of Calgary on research paper on Mudflow and Flowdroid.
Links given below:
https://www.st.cs.uni-saarland.de/appmining/mudflow/
https://www.google.ca/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&uact=8&ved=0ahUKEwik583ola7XAhUX6WMKHQYXCnoQFggoMAA&url=https%3A%2F%2Fblogs.uni-paderborn.de%2Fsse%2Ftools%2Fflowdroid%2F&usg=AOvVaw1t13BQnA07LA9FA3O5wNvN
This is a presentation I gave at the Special Libraries Association 2010 conference in New Orleans regarding "Mobile Chemistry". What is the status of chemistry on mobile devices? How do Apps match up against mobile browser based approaches? What is the future of tablets/pads versus phones for accessing content? A review of chemistry apps, online chemistry databases, ChemSpider, Mobile ChemSpider, ChemSpider SyntheticPages
Automated eSource systems provide several benefits over traditional paper-based data collection methods:
1. eSource allows for timely data review by principal investigators and safety reviewers through real-time data quality checks and faster data availability.
2. It reduces errors by eliminating unnecessary data duplication and transcription errors, while also providing an accurate audit trail.
3. Remote monitoring is easier with eSource, which can promote real-time electronic data entry during subject visits and ensure data accuracy and completeness.
Analysis of Malware Infected Systems & Classification with Gradient-boosted T...Darshan Gorasiya
Analysis of Malware Infected Systems with MapReduce, Pig, Hive, SparkSQL & Classification with Spark MLlib Gradient-boosted Tree on Big Data Platform (Hadoop)
The document discusses challenges in selecting tools for social media research and analyzing collected data. It notes that the tools available constrain the research methodology, and that tools are constantly changing or becoming outdated. Researchers must tackle a lack of transparency in how tools operate. Data collection requires cleaning before analysis. The conclusions emphasize that the methodological design depends on the tools' affordances and involves trial and error. Thorough documentation is essential given tools' instability. The document also discusses workarounds for Twitter's API limitation on accessing older tweets.
Democratizing Data Science: Balancing Flexibility and Usability for Scientifi...PerkinElmer Informatics
The document discusses Democratizing Data Science by balancing flexibility and usability for scientific applications. It describes the challenges of scientific data including large volumes of varied and complex data from many sources. It outlines a platform called Signals Apps that aims to bridge the gap between raw data and results by providing a flexible yet usable system. Signals Apps includes a variety of domain-specific apps, customizable app creation, workflows, automatic data handling, and visualization capabilities to simplify complex analysis.
Aileen Novero is a software engineer and full stack developer based in Cambridge, MA. She has a Master's in Computer Science from Northeastern University and a BA in Classical Studies from Smith College. Her experience includes roles at Novartis, Dana Farber Cancer Institute, and Vertex Pharmaceuticals, where she has worked on web application development, database design, and scientific computing projects using technologies like Python, JavaScript, Angular, Flask, and SQL. She also has experience in clinical research, having worked on oncology trials and databases at Massachusetts General Hospital and Beth Israel Deaconess Medical Center.
This document discusses software as a research object and the importance of research software. Some key points:
- Many researchers rely on software for their work but few have formal software training. Software is integral to modern research.
- Studies have found low reproducibility in scientific publications due to issues with unavailable software and code. Proper documentation and sharing of research software is needed.
- The Software Sustainability Institute aims to cultivate better, more sustainable research software to enable world-class research. They provide training, community support, and advocate for improved software practices and policies.
- Culture change is needed to incentivize sharing of research software and code. Mechanisms are emerging to properly credit software contributions and cite
The document discusses how to create an integrated lab by using an electronic lab notebook (ELN) as a central system that integrates with other lab tools and data sources. It notes that a professor spent too much time copying data between spreadsheets and would benefit from a fully integrated system. Such a system using an ELN as the central source of truth can automatically pull in raw data from instruments and push it to analysis tools, saving time and improving traceability. Inventory can also be tracked to automate reagent reordering. The document recommends starting with an ELN and further integrating additional sources over time.
This document presents a proof of concept for using Twitter data to conduct syndromic surveillance for public health monitoring. It analyzed tweets containing the keyword "measles" between 2014-2015 and found 1,408 relevant tweets. The number of tweets mentioning measles was compared to confirmed measles cases from a national surveillance system, showing potential for Twitter data as an early warning system. However, limitations include using a single keyword and the free Twitter API. Future work proposed improving data collection, applying machine learning techniques, and validating tweets with other health data sources.
IRJET- Sentiment Analysis on Twitter Posts using HadoopIRJET Journal
This document discusses a study that performed sentiment analysis on Twitter posts about elections using Apache Hadoop. The study collected tweets related to an upcoming election through the Twitter API. It then used Hadoop tools like HDFS, MapReduce, Hive, and NiFi to process and analyze the large amount of unstructured Twitter data. Specifically, it extracted sentiment information from the tweets like polarity (positive or negative) and subject to understand public opinions about different political parties and issues discussed in the tweets.
Topic:
Effective Visualizations that will aid in minimizing the spread of infectious diseases
Group members:
Lamar Munoz, Michael Brockenbrough, Neisha Sadhnani
Classification of Sentiment Analysis on Tweets Based on Techniques from Machi...IRJET Journal
This document discusses techniques for sentiment analysis on tweets using machine learning. It examines using a lexicon-based approach with SentiWordNet to identify the sentiment polarity of tweets as positive, negative, or neutral. The document outlines collecting Twitter data using APIs, preprocessing the data by removing hashtags and part-of-speech tagging. It then proposes a model using features extracted from SentiWordNet to classify tweet sentiment and discusses experimenting with unigrams and bigrams as features for the machine learning algorithms.
IRJET- Improved Real-Time Twitter Sentiment Analysis using ML & Word2VecIRJET Journal
This document discusses improving real-time Twitter sentiment analysis using machine learning and Word2Vec. It begins by introducing sentiment analysis and its importance for product analysis and business. Next, it describes extracting Twitter data using APIs, preprocessing it, and applying machine learning algorithms like Naive Bayes, logistic regression, and decision trees to classify tweets as expressing positive, negative or neutral sentiment. It also reviews literature on using techniques like linguistic analysis and ensemble models to improve sentiment analysis from social media sources.
The document provides an overview and user guide for the Open Drug Discovery Teams (ODDT) mobile app. ODDT is a free iOS app that aggregates open science data from various online sources on topics like rare diseases and chemistry. It allows users to browse topics, endorse or disapprove documents, and view molecule structures. The guide describes how to use the app, contribute data through Twitter hashtags, and get involved in rare disease communities through the app. The goal is to facilitate collaboration and data sharing in open science.
Researching Social Media – Big Data and Social Media AnalysisFarida Vis
Researching Social Media – Big Data and Social Media Analysis, presentation for the Social Media for Researchers: A Sheffield Universities Social Media Symposium, 23 September 2014
Social Media Data as a Public Data Resource Capable of Impacting Healthcare @...Mark Silverberg
Social Health Insight's presentation about social media and the HHS/ASPR now trending challenge at the University of Arizona's Eller INSITE Big Data in Healthcare Symposium on 10/17/2016 in Tucson, AZ
http://insite.eller.arizona.edu/bigdata/
A Baseline Based Deep Learning Approach of Live Tweetsijtsrd
In this scenario social media plays a vital role in influencing the life of people. Twitter , Facebook, Instagram etc are the major social media platforms . They act as a platform for users to raise their opinions on things and events around them. Twitter is one such micro blogging site that allows the user to tweet 6000 tweets per day each of 280 characters long. Data analyst rely on this data to reach conclusion on the events happening around and also to rate a product. But due to massive volume of reviews the analysts find it difficult to go through them and reach at conclusions. In order to solve this problem we adopt the method of sentiment analysis. Sentiment analysis is an approach to classify the sentiment of user reviews, documents etc in terms of positive good , negative bad , neutral surprise . I suggest an enhanced twitter sentiment analysis that retrieves data based on a baseline in a particular pre defined time span and performs sentiment analysis using Textblob . This scheme differs from the traditional and existing one which performs sentiment analysis on pre saved data by performing sentiment analysis on real time data fetched via Twitter API . Thereby providing a much recent and relevant conclusion. Anjana Jimmington ""A Baseline Based Deep Learning Approach of Live Tweets"" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-4 , June 2019, URL: https://www.ijtsrd.com/papers/ijtsrd23918.pdf
Paper URL: https://www.ijtsrd.com/computer-science/other/23918/a-baseline-based-deep-learning-approach-of-live-tweets/anjana-jimmington
The evolution of research on social mediaFarida Vis
This document summarizes a presentation on the evolution of social media research. It discusses how early research focused on analyzing small datasets from platforms like Flickr and YouTube. Over time, as Twitter grew in popularity, researchers began analyzing larger Twitter datasets containing millions of tweets. However, these datasets still had limitations due to biases in the data available through APIs. The document also discusses critiques of "Big Data" approaches and the need for research to be question-driven rather than just analyzing available data. It emphasizes understanding the limitations of datasets and being transparent about how the data was collected and potential biases.
IRJET- A Survey on Mining of Tweeter Data for Predicting User BehaviorIRJET Journal
This document discusses mining and analyzing social media data using big data techniques to predict user behavior. It proposes using tools like Hadoop and HDFS to capture trends in areas like drug abuse from large amounts of Twitter data. A framework is presented that involves gathering Twitter data using APIs, applying big data mining techniques, and using the results for more sophisticated analysis applications to help address issues like monitoring public health. Challenges around managing large social media datasets are also discussed.
This document provides a review of techniques, tools, and platforms for analyzing social media data. It discusses the types of social media data and formats available, as well as tools for accessing, cleaning, analyzing, and visualizing social media data. Some key challenges of social media research are the restricted access to comprehensive data sources, lack of tools for in-depth analysis without programming, and need for large data storage and computing facilities to support research at scale. The document provides a methodology and critique of current approaches and outlines requirements to better support social media research.
Poster presentation in 3rd big data conclave at vit chennai on 20th april 2017Rohit Desai
Title:Leveraging Big Data and Social Sensors For Predicting Epidemic Disease Outbreak
Event & Venue:3rd Big Data Conclave at VIT Chennai on 20th & 21st April 2017
Project Conducted Under Guide:Dr.Sweetlin Hemalatha Professor at VIT Chennai
IRJET - Suicidal Text Detection using Machine LearningIRJET Journal
This document presents research on detecting suicidal text using machine learning techniques. The researchers collected Twitter posts and used data preprocessing, word embeddings and a Convolutional Neural Network model to classify tweets as suicidal or non-suicidal. They found that the CNN model achieved the best performance compared to other classifiers. The system could help identify at-risk individuals by analyzing their social media posts and inform organizations to provide support. Future work involves using location data from tweets to better target help to those in need.
Cheminformatics Workflows Using Mobile Apps for Drug DiscoverySean Ekins
This document discusses using mobile apps for drug discovery workflows. It describes how mobile devices are revolutionizing computing through new user interfaces and availability anywhere. Several examples are given of simple app workflows for tasks like looking up structures, running searches, and sharing data. The document advocates for mobile and cloud-based approaches to replace desktop-based cheminformatics workflows. This could make specialized tasks more accessible and collaboration easier. The potential for mobile apps to transform existing software vendors is also noted. The majority of the document focuses on examples of developing mobile apps to enable drug discovery for tuberculosis.
Analysis of Malware Infected Systems & Classification with Gradient-boosted T...Darshan Gorasiya
Analysis of Malware Infected Systems with MapReduce, Pig, Hive, SparkSQL & Classification with Spark MLlib Gradient-boosted Tree on Big Data Platform (Hadoop)
The document discusses challenges in selecting tools for social media research and analyzing collected data. It notes that the tools available constrain the research methodology, and that tools are constantly changing or becoming outdated. Researchers must tackle a lack of transparency in how tools operate. Data collection requires cleaning before analysis. The conclusions emphasize that the methodological design depends on the tools' affordances and involves trial and error. Thorough documentation is essential given tools' instability. The document also discusses workarounds for Twitter's API limitation on accessing older tweets.
Democratizing Data Science: Balancing Flexibility and Usability for Scientifi...PerkinElmer Informatics
The document discusses Democratizing Data Science by balancing flexibility and usability for scientific applications. It describes the challenges of scientific data including large volumes of varied and complex data from many sources. It outlines a platform called Signals Apps that aims to bridge the gap between raw data and results by providing a flexible yet usable system. Signals Apps includes a variety of domain-specific apps, customizable app creation, workflows, automatic data handling, and visualization capabilities to simplify complex analysis.
Aileen Novero is a software engineer and full stack developer based in Cambridge, MA. She has a Master's in Computer Science from Northeastern University and a BA in Classical Studies from Smith College. Her experience includes roles at Novartis, Dana Farber Cancer Institute, and Vertex Pharmaceuticals, where she has worked on web application development, database design, and scientific computing projects using technologies like Python, JavaScript, Angular, Flask, and SQL. She also has experience in clinical research, having worked on oncology trials and databases at Massachusetts General Hospital and Beth Israel Deaconess Medical Center.
This document discusses software as a research object and the importance of research software. Some key points:
- Many researchers rely on software for their work but few have formal software training. Software is integral to modern research.
- Studies have found low reproducibility in scientific publications due to issues with unavailable software and code. Proper documentation and sharing of research software is needed.
- The Software Sustainability Institute aims to cultivate better, more sustainable research software to enable world-class research. They provide training, community support, and advocate for improved software practices and policies.
- Culture change is needed to incentivize sharing of research software and code. Mechanisms are emerging to properly credit software contributions and cite
The document discusses how to create an integrated lab by using an electronic lab notebook (ELN) as a central system that integrates with other lab tools and data sources. It notes that a professor spent too much time copying data between spreadsheets and would benefit from a fully integrated system. Such a system using an ELN as the central source of truth can automatically pull in raw data from instruments and push it to analysis tools, saving time and improving traceability. Inventory can also be tracked to automate reagent reordering. The document recommends starting with an ELN and further integrating additional sources over time.
This document presents a proof of concept for using Twitter data to conduct syndromic surveillance for public health monitoring. It analyzed tweets containing the keyword "measles" between 2014-2015 and found 1,408 relevant tweets. The number of tweets mentioning measles was compared to confirmed measles cases from a national surveillance system, showing potential for Twitter data as an early warning system. However, limitations include using a single keyword and the free Twitter API. Future work proposed improving data collection, applying machine learning techniques, and validating tweets with other health data sources.
IRJET- Sentiment Analysis on Twitter Posts using HadoopIRJET Journal
This document discusses a study that performed sentiment analysis on Twitter posts about elections using Apache Hadoop. The study collected tweets related to an upcoming election through the Twitter API. It then used Hadoop tools like HDFS, MapReduce, Hive, and NiFi to process and analyze the large amount of unstructured Twitter data. Specifically, it extracted sentiment information from the tweets like polarity (positive or negative) and subject to understand public opinions about different political parties and issues discussed in the tweets.
Topic:
Effective Visualizations that will aid in minimizing the spread of infectious diseases
Group members:
Lamar Munoz, Michael Brockenbrough, Neisha Sadhnani
Classification of Sentiment Analysis on Tweets Based on Techniques from Machi...IRJET Journal
This document discusses techniques for sentiment analysis on tweets using machine learning. It examines using a lexicon-based approach with SentiWordNet to identify the sentiment polarity of tweets as positive, negative, or neutral. The document outlines collecting Twitter data using APIs, preprocessing the data by removing hashtags and part-of-speech tagging. It then proposes a model using features extracted from SentiWordNet to classify tweet sentiment and discusses experimenting with unigrams and bigrams as features for the machine learning algorithms.
IRJET- Improved Real-Time Twitter Sentiment Analysis using ML & Word2VecIRJET Journal
This document discusses improving real-time Twitter sentiment analysis using machine learning and Word2Vec. It begins by introducing sentiment analysis and its importance for product analysis and business. Next, it describes extracting Twitter data using APIs, preprocessing it, and applying machine learning algorithms like Naive Bayes, logistic regression, and decision trees to classify tweets as expressing positive, negative or neutral sentiment. It also reviews literature on using techniques like linguistic analysis and ensemble models to improve sentiment analysis from social media sources.
The document provides an overview and user guide for the Open Drug Discovery Teams (ODDT) mobile app. ODDT is a free iOS app that aggregates open science data from various online sources on topics like rare diseases and chemistry. It allows users to browse topics, endorse or disapprove documents, and view molecule structures. The guide describes how to use the app, contribute data through Twitter hashtags, and get involved in rare disease communities through the app. The goal is to facilitate collaboration and data sharing in open science.
Researching Social Media – Big Data and Social Media AnalysisFarida Vis
Researching Social Media – Big Data and Social Media Analysis, presentation for the Social Media for Researchers: A Sheffield Universities Social Media Symposium, 23 September 2014
Social Media Data as a Public Data Resource Capable of Impacting Healthcare @...Mark Silverberg
Social Health Insight's presentation about social media and the HHS/ASPR now trending challenge at the University of Arizona's Eller INSITE Big Data in Healthcare Symposium on 10/17/2016 in Tucson, AZ
http://insite.eller.arizona.edu/bigdata/
A Baseline Based Deep Learning Approach of Live Tweetsijtsrd
In this scenario social media plays a vital role in influencing the life of people. Twitter , Facebook, Instagram etc are the major social media platforms . They act as a platform for users to raise their opinions on things and events around them. Twitter is one such micro blogging site that allows the user to tweet 6000 tweets per day each of 280 characters long. Data analyst rely on this data to reach conclusion on the events happening around and also to rate a product. But due to massive volume of reviews the analysts find it difficult to go through them and reach at conclusions. In order to solve this problem we adopt the method of sentiment analysis. Sentiment analysis is an approach to classify the sentiment of user reviews, documents etc in terms of positive good , negative bad , neutral surprise . I suggest an enhanced twitter sentiment analysis that retrieves data based on a baseline in a particular pre defined time span and performs sentiment analysis using Textblob . This scheme differs from the traditional and existing one which performs sentiment analysis on pre saved data by performing sentiment analysis on real time data fetched via Twitter API . Thereby providing a much recent and relevant conclusion. Anjana Jimmington ""A Baseline Based Deep Learning Approach of Live Tweets"" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-4 , June 2019, URL: https://www.ijtsrd.com/papers/ijtsrd23918.pdf
Paper URL: https://www.ijtsrd.com/computer-science/other/23918/a-baseline-based-deep-learning-approach-of-live-tweets/anjana-jimmington
The evolution of research on social mediaFarida Vis
This document summarizes a presentation on the evolution of social media research. It discusses how early research focused on analyzing small datasets from platforms like Flickr and YouTube. Over time, as Twitter grew in popularity, researchers began analyzing larger Twitter datasets containing millions of tweets. However, these datasets still had limitations due to biases in the data available through APIs. The document also discusses critiques of "Big Data" approaches and the need for research to be question-driven rather than just analyzing available data. It emphasizes understanding the limitations of datasets and being transparent about how the data was collected and potential biases.
IRJET- A Survey on Mining of Tweeter Data for Predicting User BehaviorIRJET Journal
This document discusses mining and analyzing social media data using big data techniques to predict user behavior. It proposes using tools like Hadoop and HDFS to capture trends in areas like drug abuse from large amounts of Twitter data. A framework is presented that involves gathering Twitter data using APIs, applying big data mining techniques, and using the results for more sophisticated analysis applications to help address issues like monitoring public health. Challenges around managing large social media datasets are also discussed.
This document provides a review of techniques, tools, and platforms for analyzing social media data. It discusses the types of social media data and formats available, as well as tools for accessing, cleaning, analyzing, and visualizing social media data. Some key challenges of social media research are the restricted access to comprehensive data sources, lack of tools for in-depth analysis without programming, and need for large data storage and computing facilities to support research at scale. The document provides a methodology and critique of current approaches and outlines requirements to better support social media research.
Poster presentation in 3rd big data conclave at vit chennai on 20th april 2017Rohit Desai
Title:Leveraging Big Data and Social Sensors For Predicting Epidemic Disease Outbreak
Event & Venue:3rd Big Data Conclave at VIT Chennai on 20th & 21st April 2017
Project Conducted Under Guide:Dr.Sweetlin Hemalatha Professor at VIT Chennai
IRJET - Suicidal Text Detection using Machine LearningIRJET Journal
This document presents research on detecting suicidal text using machine learning techniques. The researchers collected Twitter posts and used data preprocessing, word embeddings and a Convolutional Neural Network model to classify tweets as suicidal or non-suicidal. They found that the CNN model achieved the best performance compared to other classifiers. The system could help identify at-risk individuals by analyzing their social media posts and inform organizations to provide support. Future work involves using location data from tweets to better target help to those in need.
Cheminformatics Workflows Using Mobile Apps for Drug DiscoverySean Ekins
This document discusses using mobile apps for drug discovery workflows. It describes how mobile devices are revolutionizing computing through new user interfaces and availability anywhere. Several examples are given of simple app workflows for tasks like looking up structures, running searches, and sharing data. The document advocates for mobile and cloud-based approaches to replace desktop-based cheminformatics workflows. This could make specialized tasks more accessible and collaboration easier. The potential for mobile apps to transform existing software vendors is also noted. The majority of the document focuses on examples of developing mobile apps to enable drug discovery for tuberculosis.
A MODEL BASED ON SENTIMENTS ANALYSIS FOR STOCK EXCHANGE PREDICTION - CASE STU...csandit
Predicting the behavior of shares in the stock market is a complex problem, that involves variables not always known and can undergo various influences, from the collective emotion to high-profile news. Such volatility, can represent considerable financial losses for investors. In order to anticipate such changes in the market, it has been proposed various mechanisms to try to predict the behavior of an asset in the stock market, based on previously existing information.
Such mechanisms include statistical data only, without considering the collective feeling. This article, is going to use natural language processing algorithms (LPN) to determine the collective mood on assets and later with the help of the SVM algorithm to extract patterns in an attempt to predict the active behavior. Nevertheless it is important to note that such approach is not intended to be the main factor in the decision making process, but rather an aid tool, which combined with other information, can provide higher accuracy for the solution of this problem
A MODEL BASED ON SENTIMENTS ANALYSIS FOR STOCK EXCHANGE PREDICTION - CASE STU...cscpconf
Predicting the behavior of shares in the stock market is a complex problem, that involves variables not always known and can undergo various influences, from the collective emotion to the high-profile news. Such volatility can represent considerable financial losses for investors. In order to anticipate such changes in the market, it has been proposed various mechanisms to try to predict the behavior of an asset in the stock market, based on previously existing information. Such mechanisms include statistical data only, without considering the collective feeling. This article is going to use natural language processing algorithms (LPN) to determine the collective mood on assets and later with the help of the SVM algorithm to extract patterns in an
attempt to predict the active behavior. Nevertheless it is important to note that such approach is not intended to be the main factor in the decision making process, but rather an aid tool, which combined with other information, can provide higher accuracy for the solution of this problem.
Detection and Analysis of Twitter Trending Topics via Link-Anomaly DetectionIJERA Editor
This paper involves two approaches for finding the trending topics in social networks that is key-based approach and link-based approach. In conventional key-based approach for topics detection have mainly focus on frequencies of (textual) words. We propose a link-based approach which focuses on posts reflected in the mentioning behavior of hundreds users. The anomaly detection in the twitter data set is carried out by retrieving the trend topics from the twitter in a sequential manner by using some API and corresponding user for training, then computed anomaly score is aggregated from different users. Further the aggregated anomaly score will be feed into change-point analysis or burst detection at the pinpoint, in order to detect the emerging topics. We have used the real time twitter account, so results are vary according to the tweet trends made. The experiment shows that proposed link-based approach performs even better than the keyword-based approach.
IRJET - Twitter Sentiment Analysis using Machine LearningIRJET Journal
This document summarizes a research paper on Twitter sentiment analysis using machine learning. It describes extracting tweets on a topic, cleaning the data, extracting features, building a logistic regression model to classify tweets as positive, negative or neutral sentiment, and validating the model. The goal is to analyze public sentiment from Twitter data, which has applications in marketing, product feedback, and other areas.
Make a query regarding a topic of interest and come to know the sentiment for the day in pie-chart or for the week in form of line-chart for the tweets gathered from twitter.com
Similar to Syndromic surveillance - Health Analytics on Social Media (20)
Mental Health and well-being Presentation. Exploring innovative approaches and strategies for enhancing mental well-being. Discover cutting-edge research, effective strategies, and practical methods for fostering mental well-being.
Get Covid Testing at Fit to Fly PCR TestNX Healthcare
A Fit-to-Fly PCR Test is a crucial service for travelers needing to meet the entry requirements of various countries or airlines. This test involves a polymerase chain reaction (PCR) test for COVID-19, which is considered the gold standard for detecting active infections. At our travel clinic in Leeds, we offer fast and reliable Fit to Fly PCR testing, providing you with an official certificate verifying your negative COVID-19 status. Our process is designed for convenience and accuracy, with quick turnaround times to ensure you receive your results and certificate in time for your departure. Trust our professional and experienced medical team to help you travel safely and compliantly, giving you peace of mind for your journey.
The facial nerve, also known as cranial nerve VII, is one of the 12 cranial nerves originating from the brain. It's a mixed nerve, meaning it contains both sensory and motor fibres, and it plays a crucial role in controlling various facial muscles, as well as conveying sensory information from the taste buds on the anterior two-thirds of the tongue.
This particular slides consist of- what is hypotension,what are it's causes and it's effect on body, risk factors, symptoms,complications, diagnosis and role of physiotherapy in it.
This slide is very helpful for physiotherapy students and also for other medical and healthcare students.
Here is the summary of hypotension:
Hypotension, or low blood pressure, is when the pressure of blood circulating in the body is lower than normal or expected. It's only a problem if it negatively impacts the body and causes symptoms. Normal blood pressure is usually between 90/60 mmHg and 120/80 mmHg, but pressures below 90/60 are generally considered hypotensive.
The best massage spa Ajman is Chandrima Spa Ajman, which was founded in 2023 and is exclusively for men 24 hours a day. As of right now, our parent firm has been providing massage services to over 50,000+ clients in Ajman for the past 10 years. It has about 8+ branches. This demonstrates that Chandrima Spa Ajman is among the most reasonably priced spas in Ajman and the ideal place to unwind and rejuvenate. We provide a wide range of Spa massage treatments, including Indian, Pakistani, Kerala, Malayali, and body-to-body massages. Numerous massage techniques are available, including deep tissue, Swedish, Thai, Russian, and hot stone massages. Our massage therapists produce genuinely unique treatments that generate a revitalized sense of inner serenely by fusing modern techniques, the cleanest natural substances, and traditional holistic therapists.
As Mumbai's premier kidney transplant and donation center, L H Hiranandani Hospital Powai is not just a medical facility; it's a beacon of hope where cutting-edge science meets compassionate care, transforming lives and redefining the standards of kidney health in India.
We are one of the top Massage Spa Ajman Our highly skilled, experienced, and certified massage therapists from different corners of the world are committed to serving you with a soothing and relaxing experience. Luxuriate yourself at our spas in Sharjah and Ajman, which are indeed enriched with an ambiance of relaxation and tranquility. We could confidently claim that we are one of the most affordable Spa Ajman and Sharjah as well, where you can book the massage session of your choice for just 99 AED at any time as we are open 24 hours a day, 7 days a week.
Visit : https://massagespaajman.com/
Call : 052 987 1315
At Apollo Hospital, Lucknow, U.P., we provide specialized care for children experiencing dehydration and other symptoms. We also offer NICU & PICU Ambulance Facility Services. Consult our expert today for the best pediatric emergency care.
For More Details:
Map: https://cutt.ly/BwCeflYo
Name: Apollo Hospital
Address: Singar Nagar, LDA Colony, Lucknow, Uttar Pradesh 226012
Phone: 08429021957
Opening Hours: 24X7
The Importance of Black Women Understanding the Chemicals in Their Personal C...bkling
Certain chemicals, such as phthalates and parabens, can disrupt the body's hormones and have significant effects on health. According to data, hormone-related health issues such as uterine fibroids, infertility, early puberty and more aggressive forms of breast and endometrial cancers disproportionately affect Black women. Our guest speaker, Jasmine A. McDonald, PhD, an Assistant Professor in the Department of Epidemiology at Columbia University in New York City, discusses the scientific reasons why Black women should pay attention to specific chemicals in their personal care products, like hair care, and ways to minimize their exposure.
Chandrima Spa Ajman is one of the leading Massage Center in Ajman, which is open 24 hours exclusively for men. Being one of the most affordable Spa in Ajman, we offer Body to Body massage, Kerala Massage, Malayali Massage, Indian Massage, Pakistani Massage Russian massage, Thai massage, Swedish massage, Hot Stone Massage, Deep Tissue Massage, and many more. Indulge in the ultimate massage experience and book your appointment today. We are confident that you will leave our Massage spa feeling refreshed, rejuvenated, and ready to take on the world.
Visit : https://massagespaajman.com/
Call : 052 987 1315
2. What is Social Media?
Social media refers to the means of interactions among people in
which they create, share, and/or exchange information and ideas in
virtual communities and networks¹.
1 Tufts university, Boston, U.S.A.
2 Social Media Examiner: 2014 Social Media Marketing Industry Report
3. What is Twitter?
1 Wikipedia
2 PEW Research Centre: January 2014
3 DMR: March 2014
4 Twitter Terms of Service as of 24/7/14
Some Twitter statistics
• 1 billion users registered
• 255 million users/month
• 100 million users per day ³
4. Commercial / Government
clients• This is a proof of concept (POC)
• The POC is not used to monitor actual diseases
• This POC is an experimental application at ESR to understand the
validity of the approach
Social Media (Twitter) Analytics overview
7. Social Media (Twitter) – Visualisation / Front-End
All
keywords
Interactive
map
(Bing)
Basic stats
by location
Tweets
Current,
active
keywords
8. Social Media (Twitter) – Visualisation / Front-End
Select
keyword
Measles
Zoom into
WLG
Basic stats
by location
Measles
Tweets
Current,
active
keywords
9. Social Media (Twitter) – Visualisation / Front-End
Select
Measles
to drive the
dashboard
Measles
Tweets in
Jun : 125
Jul : 196
Measles
Tweets in
WLG : 73
10. Social Media (Twitter) – Visualisation / Front-End
4th of July
Apple
devices are
used most
5 am busy
tweeting…
936 of 10,412
Tweets in
July of
71,719 total
collected
Tweets so far
11. Social Media (Twitter) – Visualisation / Front-End
Measles in
NZ
Strong,
negative
sentiment in
Jan, Jun/Jul
12. Challenges and learnings
- Protect against threads from the cloud
vs. open to the cloud (Twitter)
- Re-configure ESR firewall to allow
connection to Twitter API¹
- Patch BI application with new Python
version to accommodate ESR’s
firewall set-up
- Lots of back & forth, complex, but
eventually fixed issues
1 Application Programming Interface
13. Challenges and learnings
- Learn Twitter API, its settings,
parameters and its limits
- Free TwitterSearch API is time-boxed
and limited in volume
- API 1.1, see https://dev.twitter.com
- Alternatively use interface which
exactly mimics API but with user
interaction, see
https://twitter.com/search-advanced
14. Challenges and learnings
- Learn Python (programming) and
integrate into BI/DW application servers
- Execute Python from within the
ETL-engine¹
- Handle the messaging between
ETL-engine and Twitter API
- Error handling, administer
tokens, messages, counters,
looping etc.
1 Extract, Transform and Loading
15. Challenges and learnings
- Use BI / DW technology to analyse
unstructured (text) data
- Categorise tweets based on a
dictionary and use the Entity
Extraction Transform (EET) of
Data Services (i.e. group
variations of spelling of a
keyword)
- Perform sentiment analysis on
Tweets using EET module
(I am awfully sick = strong, I
am hardly ever sick = weak)
16. Challenges and learnings
- Test new visualisation technology:
Microsoft PowerPivot / PowerView in
Office2013
- Great opportunity to test a different
Business Intelligence / visualisation
tool (ESR uses Business Objects from
SAP for reporting)
- Integration of Bing/maps; no need of
a GIS system at ESR. Out-of-the-box
(provided connected to the internet
to reach Bing server).
17. Probabilityofatweetbeingcorrelatedtoa“realfact”
100 %
0 %
Time
? %
Unverified
Twitter
data
Competitor
ESR
Inject
Health Line
data into ESR
Twitter data
Inject volume related data (count of disease in point of time and location).
Inject
Health Stats
data into ESR
Twitter data
Inject
LIMS
data into ESR
Twitter data
Inject
Sentinel
data into ESR
Twitter data
Inject
Epidemiology
data into ESR
Twitter data
Competitor
ESR
A research letter on the
internet flu tracker in the
scientific journal Nature,
written by officials from
Google and the US
Centres for Disease
Control and Prevention,
says there is a close
correlation between the
rates of doctor visits for
flu symptoms, and the
use of flu-like search
terms. NZ Herald 23/7/14
Google Flu Tracker has overestimated
for 100 of the 108 weeks starting
from August, 2011, according to
the Science analysis, by researchers
at Northeastern University and
Harvard University. source: motherboard.vice.com
VISION - Big Data complements traditional methods
Calibrate social media data with verified and trusted data to identify valid tweets
Risk Opportunity
Increase probability of
success of a campaign
based on Social Media data.
Increase confidence to
invest money in to the right
actions.
Increase confidence in
Social Media data as it’s
verified against other,
trusted sources.
Google ??
Flu Tracker
NZ
18. Would I do the Social Media project again?
• Yes!
• ESR has data to complement Social Media and increase data
quality (verify Tweets).
• Try below tools first and see what benefit they offer…
• PlusOne Social
http://plusonesocial.com
• Microsoft Excel Twitter add-in:
http://www.microsoft.com/en-
us/download/details.aspx?id=26213
Others
• http://www.razorsocial.com/free-twitter-analytics/
• http://www.socialmediaexaminer.com/6-twitter-
analytics-tools/