The document describes FluCast, a mobile app that provides risk assessment and forecasts of seasonal flu. The app uses data from sources like CDC, weather reports, and social media to predict flu trends up to 3 weeks in advance with 90% accuracy. It is targeted at epidemiologists and other officials. The app allows users to view forecasts, access a news stream of flu information from RSS/Twitter, and receive outbreak alerts via notifications. It was developed using tools like Flask and PhoneGap with a focus on easy sharing of data and a clean user interface. Future directions may include offline access, additional data sources, and expanded analytics.
We may be nowhere near eradicating Dengue, but we may be able to prevent it more effectively. Through Time-Series Modeling and geospatial mapping, Team Flex was able to predict and forecast cases and deaths up to 4 months and identify potential dengue hotspots in selected cities of the CALABARZON region.
The global threat of disease outbreaks is real and it is felt more than ever now as the world struggles to contain the spread of a certain virus. But before it even created a pandemic, there is already another disease that threatens our existence, perhaps has been doing so for the longest time – Dengue.
Dengue affects thousands of lives each year and continues to be a major public health problem in the Philippines.
In the last year alone, the Philippines experienced the worst dengue outbreak since 2012 as reported cases reached beyond epidemic thresholds. On top of that, the delayed reporting of official case and death counts makes it even more difficult to pinpoint heavily dengue affected areas early on and initiate a targeted public health response. To address this problem, Team Flex members, Janine Padilla, Mox Ballo, and Rache Melendres, developed a publicly accessible web application
that can be used by concerned government agencies and public health officials to predict the spread of dengue and visualize potential breeding sites of mosquitoes.
Social Media Data as a Public Data Resource Capable of Impacting Healthcare @...Mark Silverberg
Social Health Insight's presentation about social media and the HHS/ASPR now trending challenge at the University of Arizona's Eller INSITE Big Data in Healthcare Symposium on 10/17/2016 in Tucson, AZ
http://insite.eller.arizona.edu/bigdata/
Why zika, malaria and ebola should fear analyticsRiaktr
Human travel is the largest cause of epidemic spread, but there is little data available to understand and monitor those moves. Telecom data help solve this issue, as it is unique in terms of size, granularity and mobility insights. This presentation will show you how analytics tools based on telecom data enable aid workers to make smarter decisions, take prompter action and eventually save more lives.
Real Impact Analytics (RIA) has developed an important knowledge on societal issues, epidemic risk flows in particular. It has a proven track record in Zambia for dealing with Malaria and in Western Africa for Ebola, where it supported UNICEF’s action.
Health Information Technology for Disease Surveillance & Response: H7N9 as ...Jen-Hsiang Chuang
I will introduce the applications of information technology for disease surveillance & response in Chinese Taipei and use H7N9 as an example for illustration.
We may be nowhere near eradicating Dengue, but we may be able to prevent it more effectively. Through Time-Series Modeling and geospatial mapping, Team Flex was able to predict and forecast cases and deaths up to 4 months and identify potential dengue hotspots in selected cities of the CALABARZON region.
The global threat of disease outbreaks is real and it is felt more than ever now as the world struggles to contain the spread of a certain virus. But before it even created a pandemic, there is already another disease that threatens our existence, perhaps has been doing so for the longest time – Dengue.
Dengue affects thousands of lives each year and continues to be a major public health problem in the Philippines.
In the last year alone, the Philippines experienced the worst dengue outbreak since 2012 as reported cases reached beyond epidemic thresholds. On top of that, the delayed reporting of official case and death counts makes it even more difficult to pinpoint heavily dengue affected areas early on and initiate a targeted public health response. To address this problem, Team Flex members, Janine Padilla, Mox Ballo, and Rache Melendres, developed a publicly accessible web application
that can be used by concerned government agencies and public health officials to predict the spread of dengue and visualize potential breeding sites of mosquitoes.
Social Media Data as a Public Data Resource Capable of Impacting Healthcare @...Mark Silverberg
Social Health Insight's presentation about social media and the HHS/ASPR now trending challenge at the University of Arizona's Eller INSITE Big Data in Healthcare Symposium on 10/17/2016 in Tucson, AZ
http://insite.eller.arizona.edu/bigdata/
Why zika, malaria and ebola should fear analyticsRiaktr
Human travel is the largest cause of epidemic spread, but there is little data available to understand and monitor those moves. Telecom data help solve this issue, as it is unique in terms of size, granularity and mobility insights. This presentation will show you how analytics tools based on telecom data enable aid workers to make smarter decisions, take prompter action and eventually save more lives.
Real Impact Analytics (RIA) has developed an important knowledge on societal issues, epidemic risk flows in particular. It has a proven track record in Zambia for dealing with Malaria and in Western Africa for Ebola, where it supported UNICEF’s action.
Health Information Technology for Disease Surveillance & Response: H7N9 as ...Jen-Hsiang Chuang
I will introduce the applications of information technology for disease surveillance & response in Chinese Taipei and use H7N9 as an example for illustration.
Generation of infectious disease alerts through the use of geolocationjournalBEEI
In recent years, there have been several cases of global epidemics such as influenza B or Ebola. In these cases, several factors are key to limit the effects of the epidemic and avoid contagion. Between of them is the speed of knowing which persons are infected, which persons has been in contact with any infected person or know what the focus of the epidemic. In general, obtaining this information requires a process of research among the first affected that can be slow and complicated. This article describes a tool that aims to generate alerts when there are data about an epidemic, and notify all persons who could be exposed to contagion and prevent new infections occurs.
Real-Time Biosurveillance Program Pilot - India & Sri LankaNuwan Waidyanatha
The Biosurv program was tailored for a range of functions. Its main objective program was the rapid detection and
notification of any possible health outbreak using cutting edge information processing technology. The
mHealthSurvey application takes a few seconds to enter each patient's disease information. This rich dataset is sent over the existing commercial GPRS channels to
a centralized database. With such techniques, the
incoming health data can be automatically monitored for unusual changes in the numbers of reported disease
cases. The same data is also used to characterize statistical relationships between all available combinations of reported genders, locations, ages, symptoms and signs, etc., even if the number of such combinations is
prohibitively large for humans to process. That enables epidemiologists to pin down a potential outbreak of, for
instance, a gastrointestinal disease among children living in the Southwestern suburbs of the city, before it
spreads to other areas or to other demographic groups. T-Cube Web Interface (TCWI) and its underlying disease
outbreak detection algorithms are capable of reducing time-intensive calculations involved in such analyses from
hours or days down to as quick as turning on a light switch.
Any meaningful analysis of data depends entirely on the quality of data that is captured. And the method to capture data, which is still quite common in fieldwork, is the paper form.
Even where technology is deployed, the process is brought with vulnerabilities making the data either stale or inaccurate. Mobile viz, using smart devices to replace incumbent methods promises to solve all problems in a single shot.
Ideafarms's Heathwatch is a prototype solution that shows the power of this idea in the area of Disease Surveillance and Proactive Management of Epidemics.
Application for Diagnosis of Detection of Pneumonia Disease for Rural Contextijtsrd
Pneumonia is a respiratory infection resulting in inflammation of the lungs. The causes of this infectious disease could be attributed to viruses, bacteria or fungi. One of the many ways of detecting the disease is by a chest X ray of the patient. The rural population in developing nations have limited access to doctors, medical diagnostic facilities, and hospitals. Hence, diagnosis is delayed resulting in adverse consequences. This paper is an attempt to design and develop a iOS Platform based application app for the preliminary detection of pneumonia using X ray images. The app is based on machine learning which identifies pneumonia, using a chest X ray image of a patient and uses ML Model integration. Sejal Khanna | Jagjeet Gandhi "Application for Diagnosis of Detection of Pneumonia Disease for Rural Context" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-5 | Issue-2 , February 2021, URL: https://www.ijtsrd.com/papers/ijtsrd38163.pdf Paper Url: https://www.ijtsrd.com/computer-science/other/38163/application-for-diagnosis-of-detection-of-pneumonia-disease-for-rural-context/sejal-khanna
With the spread of social media platforms and the proliferation of misleading news, misinformation
detection within microblogging platforms has become a real challenge. During the Covid-19 pandemic,
many fake news and rumors were broadcasted and shared daily on social media. In order to filter out these
fake news, many works have been done on misinformation detection using machine learning and sentiment
analysis in the English language. However, misinformation detection research in the Arabic language on
social media is limited. This paper introduces a misinformation verification system for Arabic COVID-19
related news using an Arabic rumors dataset on Twitter. We explored the dataset and prepared it using
multiple phases of preprocessing techniques before applying different machine learning classification
algorithms combined with a semantic analysis method. The model was applied on 3.6k annotated tweets
achieving 93% best overall accuracy of the model in detecting misinformation. We further build another
dataset of Covid-19 related claims in Arabic to examine how our model performs with this new set of
claims. Results show that the combination of machine learning techniques and linguistic analysis achieves
the best scores reaching 92% best accuracy in detecting the veracity of sentences of the new dataset.
COMBINING MACHINE LEARNING AND SEMANTIC ANALYSIS FOR EFFICIENT MISINFORMATION...ijcsit
With the spread of social media platforms and the proliferation of misleading news, misinformation
detection within microblogging platforms has become a real challenge. During the Covid-19 pandemic,
many fake news and rumors were broadcasted and shared daily on social media. In order to filter out these
fake news, many works have been done on misinformation detection using machine learning and sentiment
analysis in the English language. However, misinformation detection research in the Arabic language on
social media is limited. This paper introduces a misinformation verification system for Arabic COVID-19
related news using an Arabic rumors dataset on Twitter. We explored the dataset and prepared it using
multiple phases of preprocessing techniques before applying different machine learning classification
algorithms combined with a semantic analysis method. The model was applied on 3.6k annotated tweets
achieving 93% best overall accuracy of the model in detecting misinformation. We further build another
dataset of Covid-19 related claims in Arabic to examine how our model performs with this new set of
claims. Results show that the combination of machine learning techniques and linguistic analysis achieves
the best scores reaching 92% best accuracy in detecting the veracity of sentences of the new dataset.
Data Visualisation and Interactive Mapping to Support Response to Disease Out...UN Global Pulse
From January – May 2015, a typhoid outbreak occurred in Uganda. Pulse Lab Kampala was invited to join the National Task Force in response to the outbreak. In coordination with WHO, and in collaboration with the Ministry of Health, Pulse Lab Kampala produced a series of data visualisations to support the early response to the disease. Visualisations of weekly reports from health centres were produced with interactive maps at district, sub-county and individual health facility level. The visualisations allowed decision making for the allocation of medicine, medical personnel and health centres, as well as targeting training areas.
Cite as: "Data Visualisation and Interactive Mapping to Support Response to Disease Outbreak”, Global Pulse Project Series no. 21, 2015
In this Project we analyses the spread and impact of the novel coronavirus pandemic which has taken the world by storm with its rapid growth. In this we like to develop a machine learning model in Python to analyze what has been its impact so far and analyze the outbreak of COVID 19 across various regions, visualize them using charts and tables, and predict the number of upcoming confirmed cases.
Disease outbreak detection, monitoring and notification systems are increasingly gaining popularity since
these systems are designed to assess threats to public health and disease outbreaks are becoming
increasingly common world-wide. A variety of systems are in use around the world, with coverage of
national, international and global disease outbreaks. These systems use different taxonomies and
classifications for the detection and prioritization of potential disease outbreaks. In this paper, we study
and analyze the current disease outbreak systems. Subsequently, we extract features and functions of
typical and generic disease outbreak systems. We then propose a generic model for disease outbreak
notification systems. Our effort is directed towards standardizing the design process for typical disease
outbreak systems.
Healthcare apps for Nokia X and Nokia Asha phones present a great opportunity to help improve the lives of millions of users around the world. In this webinar, we’ll discuss the fundamentals of mobile healthcare apps and give you an overview of the opportunities and challenges involved in developing such apps for Nokia phones. We’ll open the webinar with an introduction to the fundamentals of mobile healthcare, an overview of opportunities and challenges of developing apps for wellness and mobile health, and the role of mobile platforms in enabling health and healthcare apps. Then we will dig into specific techniques available when developing such apps for Nokia X and Nokia Asha platforms.We’ll demonstrate different approaches available to developers targeting the two platforms by examining a Blood Pressure Diary app, which is implemented for Nokia X and Nokia Asha. As part of that discussion we’ll show how to retrieve heart-rate data from medical devices using Bluetooth technology.
The Internet of Things means not just that computing devices have connectivity to the cloud but that they themselves are connected to each other, and therefore that novel applications can be developed in this rich ecosystem. One area for development is linking quantified self wearable sensors with automotive sensors for applications including Fatigue Detection, Real-time Parking and Assistance, Anger/Stress Reduction, Keyless Authentication, and DIY Diagnostics.
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Generation of infectious disease alerts through the use of geolocationjournalBEEI
In recent years, there have been several cases of global epidemics such as influenza B or Ebola. In these cases, several factors are key to limit the effects of the epidemic and avoid contagion. Between of them is the speed of knowing which persons are infected, which persons has been in contact with any infected person or know what the focus of the epidemic. In general, obtaining this information requires a process of research among the first affected that can be slow and complicated. This article describes a tool that aims to generate alerts when there are data about an epidemic, and notify all persons who could be exposed to contagion and prevent new infections occurs.
Real-Time Biosurveillance Program Pilot - India & Sri LankaNuwan Waidyanatha
The Biosurv program was tailored for a range of functions. Its main objective program was the rapid detection and
notification of any possible health outbreak using cutting edge information processing technology. The
mHealthSurvey application takes a few seconds to enter each patient's disease information. This rich dataset is sent over the existing commercial GPRS channels to
a centralized database. With such techniques, the
incoming health data can be automatically monitored for unusual changes in the numbers of reported disease
cases. The same data is also used to characterize statistical relationships between all available combinations of reported genders, locations, ages, symptoms and signs, etc., even if the number of such combinations is
prohibitively large for humans to process. That enables epidemiologists to pin down a potential outbreak of, for
instance, a gastrointestinal disease among children living in the Southwestern suburbs of the city, before it
spreads to other areas or to other demographic groups. T-Cube Web Interface (TCWI) and its underlying disease
outbreak detection algorithms are capable of reducing time-intensive calculations involved in such analyses from
hours or days down to as quick as turning on a light switch.
Any meaningful analysis of data depends entirely on the quality of data that is captured. And the method to capture data, which is still quite common in fieldwork, is the paper form.
Even where technology is deployed, the process is brought with vulnerabilities making the data either stale or inaccurate. Mobile viz, using smart devices to replace incumbent methods promises to solve all problems in a single shot.
Ideafarms's Heathwatch is a prototype solution that shows the power of this idea in the area of Disease Surveillance and Proactive Management of Epidemics.
Application for Diagnosis of Detection of Pneumonia Disease for Rural Contextijtsrd
Pneumonia is a respiratory infection resulting in inflammation of the lungs. The causes of this infectious disease could be attributed to viruses, bacteria or fungi. One of the many ways of detecting the disease is by a chest X ray of the patient. The rural population in developing nations have limited access to doctors, medical diagnostic facilities, and hospitals. Hence, diagnosis is delayed resulting in adverse consequences. This paper is an attempt to design and develop a iOS Platform based application app for the preliminary detection of pneumonia using X ray images. The app is based on machine learning which identifies pneumonia, using a chest X ray image of a patient and uses ML Model integration. Sejal Khanna | Jagjeet Gandhi "Application for Diagnosis of Detection of Pneumonia Disease for Rural Context" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-5 | Issue-2 , February 2021, URL: https://www.ijtsrd.com/papers/ijtsrd38163.pdf Paper Url: https://www.ijtsrd.com/computer-science/other/38163/application-for-diagnosis-of-detection-of-pneumonia-disease-for-rural-context/sejal-khanna
With the spread of social media platforms and the proliferation of misleading news, misinformation
detection within microblogging platforms has become a real challenge. During the Covid-19 pandemic,
many fake news and rumors were broadcasted and shared daily on social media. In order to filter out these
fake news, many works have been done on misinformation detection using machine learning and sentiment
analysis in the English language. However, misinformation detection research in the Arabic language on
social media is limited. This paper introduces a misinformation verification system for Arabic COVID-19
related news using an Arabic rumors dataset on Twitter. We explored the dataset and prepared it using
multiple phases of preprocessing techniques before applying different machine learning classification
algorithms combined with a semantic analysis method. The model was applied on 3.6k annotated tweets
achieving 93% best overall accuracy of the model in detecting misinformation. We further build another
dataset of Covid-19 related claims in Arabic to examine how our model performs with this new set of
claims. Results show that the combination of machine learning techniques and linguistic analysis achieves
the best scores reaching 92% best accuracy in detecting the veracity of sentences of the new dataset.
COMBINING MACHINE LEARNING AND SEMANTIC ANALYSIS FOR EFFICIENT MISINFORMATION...ijcsit
With the spread of social media platforms and the proliferation of misleading news, misinformation
detection within microblogging platforms has become a real challenge. During the Covid-19 pandemic,
many fake news and rumors were broadcasted and shared daily on social media. In order to filter out these
fake news, many works have been done on misinformation detection using machine learning and sentiment
analysis in the English language. However, misinformation detection research in the Arabic language on
social media is limited. This paper introduces a misinformation verification system for Arabic COVID-19
related news using an Arabic rumors dataset on Twitter. We explored the dataset and prepared it using
multiple phases of preprocessing techniques before applying different machine learning classification
algorithms combined with a semantic analysis method. The model was applied on 3.6k annotated tweets
achieving 93% best overall accuracy of the model in detecting misinformation. We further build another
dataset of Covid-19 related claims in Arabic to examine how our model performs with this new set of
claims. Results show that the combination of machine learning techniques and linguistic analysis achieves
the best scores reaching 92% best accuracy in detecting the veracity of sentences of the new dataset.
Data Visualisation and Interactive Mapping to Support Response to Disease Out...UN Global Pulse
From January – May 2015, a typhoid outbreak occurred in Uganda. Pulse Lab Kampala was invited to join the National Task Force in response to the outbreak. In coordination with WHO, and in collaboration with the Ministry of Health, Pulse Lab Kampala produced a series of data visualisations to support the early response to the disease. Visualisations of weekly reports from health centres were produced with interactive maps at district, sub-county and individual health facility level. The visualisations allowed decision making for the allocation of medicine, medical personnel and health centres, as well as targeting training areas.
Cite as: "Data Visualisation and Interactive Mapping to Support Response to Disease Outbreak”, Global Pulse Project Series no. 21, 2015
In this Project we analyses the spread and impact of the novel coronavirus pandemic which has taken the world by storm with its rapid growth. In this we like to develop a machine learning model in Python to analyze what has been its impact so far and analyze the outbreak of COVID 19 across various regions, visualize them using charts and tables, and predict the number of upcoming confirmed cases.
Disease outbreak detection, monitoring and notification systems are increasingly gaining popularity since
these systems are designed to assess threats to public health and disease outbreaks are becoming
increasingly common world-wide. A variety of systems are in use around the world, with coverage of
national, international and global disease outbreaks. These systems use different taxonomies and
classifications for the detection and prioritization of potential disease outbreaks. In this paper, we study
and analyze the current disease outbreak systems. Subsequently, we extract features and functions of
typical and generic disease outbreak systems. We then propose a generic model for disease outbreak
notification systems. Our effort is directed towards standardizing the design process for typical disease
outbreak systems.
Healthcare apps for Nokia X and Nokia Asha phones present a great opportunity to help improve the lives of millions of users around the world. In this webinar, we’ll discuss the fundamentals of mobile healthcare apps and give you an overview of the opportunities and challenges involved in developing such apps for Nokia phones. We’ll open the webinar with an introduction to the fundamentals of mobile healthcare, an overview of opportunities and challenges of developing apps for wellness and mobile health, and the role of mobile platforms in enabling health and healthcare apps. Then we will dig into specific techniques available when developing such apps for Nokia X and Nokia Asha platforms.We’ll demonstrate different approaches available to developers targeting the two platforms by examining a Blood Pressure Diary app, which is implemented for Nokia X and Nokia Asha. As part of that discussion we’ll show how to retrieve heart-rate data from medical devices using Bluetooth technology.
The Internet of Things means not just that computing devices have connectivity to the cloud but that they themselves are connected to each other, and therefore that novel applications can be developed in this rich ecosystem. One area for development is linking quantified self wearable sensors with automotive sensors for applications including Fatigue Detection, Real-time Parking and Assistance, Anger/Stress Reduction, Keyless Authentication, and DIY Diagnostics.
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Techniques to optimize the pagerank algorithm usually fall in two categories. One is to try reducing the work per iteration, and the other is to try reducing the number of iterations. These goals are often at odds with one another. Skipping computation on vertices which have already converged has the potential to save iteration time. Skipping in-identical vertices, with the same in-links, helps reduce duplicate computations and thus could help reduce iteration time. Road networks often have chains which can be short-circuited before pagerank computation to improve performance. Final ranks of chain nodes can be easily calculated. This could reduce both the iteration time, and the number of iterations. If a graph has no dangling nodes, pagerank of each strongly connected component can be computed in topological order. This could help reduce the iteration time, no. of iterations, and also enable multi-iteration concurrency in pagerank computation. The combination of all of the above methods is the STICD algorithm. [sticd] For dynamic graphs, unchanged components whose ranks are unaffected can be skipped altogether.
The affect of service quality and online reviews on customer loyalty in the E...
FluCast - Android Application to Forecast Flu
1. FluCast
Risk assessment and Forecast of
seasonal flu in real time
YIXIN HU, SANDEEP SHANTHARAM AND LASIFU TA
Data Sciences & Analytics Group, Computational & Statistical Analytics Division, National Security Directorate
PNNL-SA-104651
2. FluCast - Mobile App for Risk and Forecast of Flu
Why Flu?
Infectious respiratory disease causing Seasonal Epidemic
$71-167 billion per year spent on “Flu” in the U.S.
Transmission rate between humans is high
Keywords of flu symptoms are expressed on Social Media
Rich and Varied data sources for disease analysis
August 6, 2014 2
3. FluCast - Mobile App for Risk and Forecast of Flu
Need for Mobile App
Target Customers
- Epidemiologists – primary
- Transport / School / Company Officials – secondary
Customer Needs
- Decrease time to action
- Enhance collaboration
- Access to varied data sources
- Targeted flu news
- Mobile analytics app
August 6, 2014 3
4. FluCast - Mobile App for Risk and Forecast of Flu
Flu Forecast
August 6, 2014 4
5. FluCast - Mobile App for Risk and Forecast of Flu
Flu Forecast – Approach
Prediction with data from historical flu cases, weather and social media.
ARIMA Model for flu prediction with Time-Series data.
Forecasting three weeks ahead of CDC.
Estimated flu cases with accuracy about 90%
August 6, 2014 5
6. FluCast - Mobile App for Risk and Forecast of Flu
Flu Forecast – Approach
August 6, 2014 6
H1N1
7. FluCast - Mobile App for Risk and Forecast of Flu
Flu Forecast – Implementation
Automated data retrieval from CDC and Programmatically access
weather and twitter data from NOAA and Twitter
Flask web framework for backend REST API with Mongo Database
suited for time series data
Jquery Mobile to build the HTML5 App
PhoneGap framework for transformation
August 6, 2014 7
8. FluCast - Mobile App for Risk and Forecast of Flu
Flu Forecast – UX Design
August 6, 2014 8
9. FluCast - Mobile App for Risk and Forecast of Flu
Flu Forecast – UX Design
August 6, 2014 9
10. FluCast - Mobile App for Risk and Forecast of Flu
Data Sharing
August 6, 2014 10
11. FluCast - Mobile App for Risk and Forecast of Flu
Data Sharing – Approach
Information Sharing and Collaboration between Epidemiologists
Sharing data within a group of trusted individuals
Communication between the analysts over E-mail
Sharing public domain information to Social Network
August 6, 2014 11
12. FluCast - Mobile App for Risk and Forecast of Flu
Data Sharing – Implementation
Sharing the screenshot of the active window and data in tabular form
Information sharing through the phone’s email application
Ability to create groups with the contacts on the analyst’s phone
Sharing public level data to Social Networks like Facebook, Twitter
August 6, 2014 12
13. FluCast - Mobile App for Risk and Forecast of Flu
Data Sharing – UX Design
August 6, 2014 13
14. FluCast - Mobile App for Risk and Forecast of Flu
Flu Stream
August 6, 2014 14
15. FluCast - Mobile App for Risk and Forecast of Flu
Flu Stream – Approach
News stream from RSS feeds of flu related websites like CDC and WHO
Twitter real-time stream with geo-location for tweets related to Flu
Stay updated with the news from the RSS and Twitter in real time
Share the News article and Tweets to Social Network
August 6, 2014 15
16. FluCast - Mobile App for Risk and Forecast of Flu
Flu Stream – Implementation
Flu Stream is updated every hour for both RSS and Twitter Stream
Items in News stream and Twitter stream are geotagged to allow analysts
identify the location associated
Geotagging by the list of keywords based on states and usage of natural
language processing to tokenize the keywords
Sharing to all third party apps with the phones in-built share function
August 6, 2014 16
17. FluCast - Mobile App for Risk and Forecast of Flu
Flu Stream – UX Design
August 6, 2014 17
18. FluCast - Mobile App for Risk and Forecast of Flu
Flu Stream – UX Design
August 6, 2014 18
19. FluCast - Mobile App for Risk and Forecast of Flu
Outbreak Alert
August 6, 2014 19
20. FluCast - Mobile App for Risk and Forecast of Flu
Outbreak Alert – Approach
Real-time monitoring of Twitter stream for the flu symptom mentions
Exponential Smoothing Model used in calculating the threshold value
Push Notification to the analyst, for possible outbreaks on a daily basis
August 6, 2014 20
21. FluCast - Mobile App for Risk and Forecast of Flu
Outbreak Alert – Approach
August 6, 2014 21
H1N1
22. FluCast - Mobile App for Risk and Forecast of Flu
Outbreak Alert – Implementation
Live data stream from twitter allows for real-time analysis
Tweets are automatically geotagged as the data is streamed
Push notification to the analyst based on the twitter mentions being
higher than threshold value for high risk
Push Notification to the device with related information gathered from
twitter for Outbreak Conditions
August 6, 2014 22
23. FluCast - Mobile App for Risk and Forecast of Flu
Outbreak Alert – UX Design
August 6, 2014 23
24. FluCast - Mobile App for Risk and Forecast of Flu
Competition
August 6, 2014 24
25. FluCast - Mobile App for Risk and Forecast of Flu
Benefit
August 6, 2014 25
26. FluCast - Mobile App for Risk and Forecast of Flu
Future Direction
July 22, 2014 26
Option for robust prediction models
Offline access to the app
Subtyping Flu
Data source from multiple social networks
Better Twitter Data Analysis
Analyst reporting
27. FluCast - Mobile App for Risk and Forecast of Flu
Acknowledgement
August 6, 2014 27
Technical Staff
Harisson, Joshua J
Dowling, Chase P
Henry, Michael J
Franklin, Lyndsey R
Lancaster, Mary
Managerial Staff
Spence, Christine N
Noonan, Christine F
Cowley, Wendy E
Corley, Courtney D
Didier, Brett T
28. FluCast - Mobile App for Risk and Forecast of Flu
Yixin Hu – Data Analyst
Columbia University
Android Team One
Lasifu Ta – UX Designer/ Front-end
University of Washington
Sandeep Shantharam – Developer
Indiana University Purdue University,
Indianapolis
Thank you!
August 6, 2014 28
Editor's Notes
“Hello Everyone, Today we are going to present our application for the Biosurvelliance Mobile App Competition.”
“Our Mobile App – Flucast – is an app to assess risk and forecast the seasonal flu in real time for USA.”
“We are the interns from Android Team One – from Data Science and Analytics Group”
Before going into the details of the Mobile App, let me - list the main reason for choosing Flu;
Flu is an annually recurring disease characterized by the prevalence of outbreaks.
In USA alone, the cost of influenza epidemics to the economy is $ 71-167 billion per year.
Rate of flu transmission is high, so early detection of the epidemic will help in reducing the cost of expenditure for the Health agencies
Keywords associated with Flu are observed in feeds of social media which is a real-time source for disease analysis.
Various data sources like weather are correlated with flu cases which help in risk assessment and risk prediction of the disease.
The Apps - Primary Target Customers are Epidemiologists who are responsible solely to monitor the disease outbreaks.
In addition to the Primary, The Secondary Target Customers would be Airport officials, School Directors, company executives.
In order to better understand the customer’s need, we conducted interviews with Mary Lancaster, an epidemiologist at PNNL. With her feedbacks and suggestions – we have compiled the following list of customer needs -
Tool or Technology to Help in decreasing the time to action which will help stop the outbreak of influenza.
Enhanced collaboration between the analysts involved in the early decision making for flu.
Simultaneous Access to the different data sources like weather, social media and ILI cases to study the correlation.
Targeted news from flu related websites and also access to flu related social media banter.
An Analysis tool that is mobile enough to change certain parameters that help in decision making.
App Flow for Flu Forecast
--In flu forecast feature, we predict seasonal flu based on the data from ILINET flu cases, which is the influenza-like-illness weekly report, and weather data consisting of precipitation, temperature and wind speed. In addition, we are trying to integrate the social media mentions containing flu related keywords.
--The forecast time period is 3 weeks ahead of the CDC report on the ILINET cases.
--In the prediction analysis, we would be using ARIMA Model. In the final risk forecast, data from weather and social media were included into the analysis.
-- Forecasted estimate of flu cases with an error rate of 11,18 and 24% for three weeks of prediction
In addition, we provide 95% confidence interval for the number of flu cases.
Region level data is ILINET cases and State Level Cases are calculated
Scheduled Automation of data retrieval for CDC cases and Google Flu Trends.
Programmatically access weather and twitter data for analysis.
Flask Python Web framework built for REST API to interface with Database which also suits the python data analysis framework that we have built
Mongo Database to store time series data.
Angularjs for the data model and Jquery Mobile for the interface design to build HTML5 App.
Phonegap framework to convert HTML5 App to Android Native App. Helping us build the app for multiple platforms like Android, iOs and Windows phone with a single codebase.
App Flow for Data Sharing
--With interesting analysis scenario, the app provides an ability to share the information in the form of screenshots and tabular data form to other analyst to enable collaboration.
-- The app provides an ability to create groups based on the needs of the analyst.
--The conversation between different analyst through the phone’s email application, thus enabling the analyst not take extra effort to use our app.
--The App will be a convenient platform for the analyst to communicate and collaborate for flu disease.
Ability to share every screen with a screenshot of the active window and simultaneously data in a tabular form for the analysts own analysis
Without the app being a broker for communication by allowing the analysts to continue the communication over email
Creating groups based on the contacts will enable the analyst to easily import the contacts from their phones
App flu for Flu Stream
Flu stream allows the app users to be updated with the flu related news through different websites from CDC, WHO and other flu related websites.
Real time Twitter stream of data with geo-location of the tweet, with the tweets being filtered for flu related keywords.
Allow the analyst to read the complete article for the flu in the mobile app.
In addition, the ability to share the article on third party apps on the android phone.
- These Flu stream is updated every hour and categorized with time so users could keep track of the latest flu situation.
- The sharing to third party apps features is implemented by the android in-built share function.
Flu News and Tweets from the twitter are geotagged to identify the location of the news and help the analyst the threat region for the particular news item.
Using elastic-search database to store the tweets, with its built in twitter river to store 1% of twitter feed data that is streamed real time to all the developers.
Geo-Tagging of the flu news and tweets are done by tokenizing the summary of the article and twitter users location by Natural Language processors in Python. Twitter users profile location is found to be 80% accurate prediction of the tweet location. Geotagging is at the level of states.
Monitoring twitter for real-time stream of data, which is updated every day and monitored for any unpredictable changes in the flu patterns.
Threshold value for Outbreak alert is calculated from the historical tweets data on a weekly basis with exponential smoothing model.
Push notification to notify the analyst about any potential outbreaks in the region concerned with the flu mentions crossing the high threshold value set for the region or state.
Real time monitoring of twitter from the twitter real-time data stream allows to data analysis in real time enabling changes to the number of flu mentions to alert the analysts about any potential outbreak.
Tweets are geo-tagged in real-time and outbreak is based on the threshold value and pushed to the user through Google Cloud Messaging
Push Notification would contain related information about the region associated with the potential outbreak and link to tweets that are associated with the analysis
Columbia Prediction of Infectious Diseases : Influenza Forecasts (Web App);
SickWeather (iOS App);
FluSpotter ( Facebook App);
Fludar ( Web App)
By researching about similar competitors in the area which could help epidemiology experts; we observed a general lack of features -
-- Their lack of mobility hinders users ability to use tools on the move.
-- They don’t have features that enable sharing between individuals or in a group.
-- They do not use the real-time data for risk assessment or notify outbreaks. Their data sources are very limited, either no weather or social media data.
-- Many of these apps are just information visualization with no analysis involved in the final presentation.
Yixin Hu
Columbia University
Lasifu Ta
University of Washington
Sandeep Shantharam
Indiana University Purdue University, Indianapolis