This document discusses forecasting models for dengue outbreaks in San Juan, Puerto Rico. The team analyzed historical dengue case data along with environmental factors like temperature and humidity. They developed autoregressive integrated moving average (ARIMA) models to forecast quarterly dengue cases over a 3-year period. The best model included factors for 6 outbreak periods and an AR(15) term. It produced plausible forecasts with a mean absolute percentage error of 43.42%. Recommendations include targeted insecticide use and increased awareness during peak seasons.
We may be nowhere near eradicating Dengue, but we may be able to prevent it more effectively. Through Time-Series Modeling and geospatial mapping, Team Flex was able to predict and forecast cases and deaths up to 4 months and identify potential dengue hotspots in selected cities of the CALABARZON region.
The global threat of disease outbreaks is real and it is felt more than ever now as the world struggles to contain the spread of a certain virus. But before it even created a pandemic, there is already another disease that threatens our existence, perhaps has been doing so for the longest time – Dengue.
Dengue affects thousands of lives each year and continues to be a major public health problem in the Philippines.
In the last year alone, the Philippines experienced the worst dengue outbreak since 2012 as reported cases reached beyond epidemic thresholds. On top of that, the delayed reporting of official case and death counts makes it even more difficult to pinpoint heavily dengue affected areas early on and initiate a targeted public health response. To address this problem, Team Flex members, Janine Padilla, Mox Ballo, and Rache Melendres, developed a publicly accessible web application
that can be used by concerned government agencies and public health officials to predict the spread of dengue and visualize potential breeding sites of mosquitoes.
Professor Aboul Ella hassanien publications related to COVID-19 and Emerging Technologies such as AI, Machine Learning, Drones, Blockchain, IoT, Big Data
Artificial intelligence to fight against covid19saritamathania
Artificial intelligence (AI) and machine learning are playing a significant role in understanding and addressing the crisis caused by COVID-19. The technology mimic human intelligence and ingest great volumes of data to quickly chart patterns and identify insights.
One example is when BenevolentAI, a global leader in the development and application of artificial intelligence for drug discovery, took just few days to find that Baricitinib (a drug currently approved for rheumatoid arthritis, owned by Eli Lilly) is a strongest candidate and can be a potential treatment for COVID-19 patients.
This accelerated the clinical trials of #Baricitinib and Eli Lilly (a giant American Pharmaceutical company) has already commenced phase III clinical trials of Baricitinib to treat COVID-19.
Few more names include Deepmind, ImmunoPrecise, Insilico, healx, Imperial College, Tech Mahindra, and Deargen. Some Indian companies include NIRAMAI, Staqu, Qure.AI, Tech Mahindra, and DiyCam.
We may be nowhere near eradicating Dengue, but we may be able to prevent it more effectively. Through Time-Series Modeling and geospatial mapping, Team Flex was able to predict and forecast cases and deaths up to 4 months and identify potential dengue hotspots in selected cities of the CALABARZON region.
The global threat of disease outbreaks is real and it is felt more than ever now as the world struggles to contain the spread of a certain virus. But before it even created a pandemic, there is already another disease that threatens our existence, perhaps has been doing so for the longest time – Dengue.
Dengue affects thousands of lives each year and continues to be a major public health problem in the Philippines.
In the last year alone, the Philippines experienced the worst dengue outbreak since 2012 as reported cases reached beyond epidemic thresholds. On top of that, the delayed reporting of official case and death counts makes it even more difficult to pinpoint heavily dengue affected areas early on and initiate a targeted public health response. To address this problem, Team Flex members, Janine Padilla, Mox Ballo, and Rache Melendres, developed a publicly accessible web application
that can be used by concerned government agencies and public health officials to predict the spread of dengue and visualize potential breeding sites of mosquitoes.
Professor Aboul Ella hassanien publications related to COVID-19 and Emerging Technologies such as AI, Machine Learning, Drones, Blockchain, IoT, Big Data
Artificial intelligence to fight against covid19saritamathania
Artificial intelligence (AI) and machine learning are playing a significant role in understanding and addressing the crisis caused by COVID-19. The technology mimic human intelligence and ingest great volumes of data to quickly chart patterns and identify insights.
One example is when BenevolentAI, a global leader in the development and application of artificial intelligence for drug discovery, took just few days to find that Baricitinib (a drug currently approved for rheumatoid arthritis, owned by Eli Lilly) is a strongest candidate and can be a potential treatment for COVID-19 patients.
This accelerated the clinical trials of #Baricitinib and Eli Lilly (a giant American Pharmaceutical company) has already commenced phase III clinical trials of Baricitinib to treat COVID-19.
Few more names include Deepmind, ImmunoPrecise, Insilico, healx, Imperial College, Tech Mahindra, and Deargen. Some Indian companies include NIRAMAI, Staqu, Qure.AI, Tech Mahindra, and DiyCam.
Using Mobile Data and Airtime Credit Purchases to Estimate Food Security - Pr...UN Global Pulse
In this study, mobile phone activity data was combined with remote sensing data to understand how people communicated during severe flooding in the Mexican state of Tabasco in 2009, in order to explore ways that mobile data can be used to improve disaster response. By comparing the mobile data with official population census data, the representativeness of the research was validated. The results of the study showed that the patterns of mobile phone activity in affected locations during and after the floods could be used as indicators of (1) flooding impact on infrastructure and population and (2) public awareness of the disaster. These early results demonstrated the value of a public-private partnership on using mobile data to accurately indicate flooding impacts in Tabasco, thus improving early warning and crisis management.
For more classes visit
www.snaptutorial.com
HSA 535 Week 1 Discussion 1 -
CDC and BMA" Please respond to the following:
From the first two (2) e-Activities, give a synopsis of the various challenges facing health care professionals, and
A look at two different Datasets (infection data & mobility data to make some predictions about Corona Virus. The main takeaways:
1. Without a vaccine Corona is here to stay for 18 months till herd immunity. We need to have cyclical lockdowns of 2 weeks lockdown 6 weeks opening.
2. The structure of a city dictates whether a lockdown works or not. Rural and Nature heavy cities like Utah can't follow the same strategy like NY or Manhattan.
ANALYSIS OF COVID-19 IN THE UNITED STATES USING MACHINE LEARNINGmlaij
The unprecedented outbreak of COVID-19 also known as the coronavirus has caused a pandemic like none
ever seen before this century. Its impact has been massive on a global level. The deadly virus has
commanded nations around the world to increase their efforts to fight against the spread of the virus after
the stress it has put on resources. With the number of new cases increasing day by day around the world,
the objective of this paper is to contribute towards the analysis of the virus by leveraging machine learning
models to understand its behavior and predict future patterns in the United States (US) based on data
obtained from the COVID-19 Tracking Project.
Analysis of Covid-19 in the United States using Machine Learningmlaij
The unprecedented outbreak of COVID-19 also known as the coronavirus has caused a pandemic like none ever seen before this century. Its impact has been massive on a global level. The deadly virus has commanded nations around the world to increase their efforts to fight against the spread of the virus after the stress it has put on resources. With the number of new cases increasing day by day around the world, the objective of this paper is to contribute towards the analysis of the virus by leveraging machine learning models to understand its behavior and predict future patterns in the United States (US) based on data obtained from the COVID-19 Tracking Project.
Epidemic Alert System: A Web-based Grassroots ModelIJECEIAES
Most web-based disease surveillance systems that give epidemic alerts are based on very large and unstructured data from various news sources, social media and online queries that are parsed by complex algorithms. This has the tendency to generate results that are so diverse and non-specific. When considered along with the fact that there are no existing standards for mining and analyzing data from the internet, the results or decisions reached based on internet sources have been classified as low-quality. This paper proposes a web-based grassroots epidemic alert system that is based on data collected specifically from primary health centers, hospitals and registered laboratories. It takes a more traditional approach to indicator-based disease surveillance as a step towards standardizing web-based disease surveillance. It makes use of a threshold value that is based on the third quartile (75 th percentile) to determine the need to trigger the alarm for the onset of an epidemic. It also includes, for deeper analysis, demographic information.
How a U.S. COVID-19 Data Registry Fuels Global ResearchHealth Catalyst
In addition to driving COVID-19 understanding within the United States, a national disease registry is informing research beyond U.S. borders. Clinicians with the Singapore Ministry of Healthcare Office for Healthcare Transformation (MOHT) have used Health Catalyst Touchstone® COVID-19 data to develop a machine learning tool that helps predict the likelihood of COVID-19 mortality. With this national data set that leverages deep aggregated EHR data, the MOHT accessed the research-grade data it needed to build a machine-learning algorithm that predicts risk of death from COVID-19. The registry-informed prediction model was accurate enough to stand up to comparisons in the published literature and promises to help inform vaccine research and, ultimately, allocation of vaccines within populations.
In this Project we analyses the spread and impact of the novel coronavirus pandemic which has taken the world by storm with its rapid growth. In this we like to develop a machine learning model in Python to analyze what has been its impact so far and analyze the outbreak of COVID 19 across various regions, visualize them using charts and tables, and predict the number of upcoming confirmed cases.
COVID-19 data configuration and statistical analysisAnshJAIN50
The following report aims to identify the primary factors influencing the spread of Covid-19. To do this, I have analyzed the rate of spread in MEDCs and LEDCs - countries differing significantly in development. MEDCs, being more economically developed, tend to have superior healthcare, higher life expectancy, and generally better infrastructure, contrasting with LEDCs. This report aims to understand whether the characteristics of MEDCs and LEDCs can significantly impact the rate of spread of Covid-19, as well as more obscure factors that could have a greater impact than previously thought. In this report we will be examining 3 different MEDCs and LEDCs to develop a clear conclusion on whether we believe a country's development correlates to the rate of spread of Covid-19.
Using Mobile Data and Airtime Credit Purchases to Estimate Food Security - Pr...UN Global Pulse
In this study, mobile phone activity data was combined with remote sensing data to understand how people communicated during severe flooding in the Mexican state of Tabasco in 2009, in order to explore ways that mobile data can be used to improve disaster response. By comparing the mobile data with official population census data, the representativeness of the research was validated. The results of the study showed that the patterns of mobile phone activity in affected locations during and after the floods could be used as indicators of (1) flooding impact on infrastructure and population and (2) public awareness of the disaster. These early results demonstrated the value of a public-private partnership on using mobile data to accurately indicate flooding impacts in Tabasco, thus improving early warning and crisis management.
For more classes visit
www.snaptutorial.com
HSA 535 Week 1 Discussion 1 -
CDC and BMA" Please respond to the following:
From the first two (2) e-Activities, give a synopsis of the various challenges facing health care professionals, and
A look at two different Datasets (infection data & mobility data to make some predictions about Corona Virus. The main takeaways:
1. Without a vaccine Corona is here to stay for 18 months till herd immunity. We need to have cyclical lockdowns of 2 weeks lockdown 6 weeks opening.
2. The structure of a city dictates whether a lockdown works or not. Rural and Nature heavy cities like Utah can't follow the same strategy like NY or Manhattan.
ANALYSIS OF COVID-19 IN THE UNITED STATES USING MACHINE LEARNINGmlaij
The unprecedented outbreak of COVID-19 also known as the coronavirus has caused a pandemic like none
ever seen before this century. Its impact has been massive on a global level. The deadly virus has
commanded nations around the world to increase their efforts to fight against the spread of the virus after
the stress it has put on resources. With the number of new cases increasing day by day around the world,
the objective of this paper is to contribute towards the analysis of the virus by leveraging machine learning
models to understand its behavior and predict future patterns in the United States (US) based on data
obtained from the COVID-19 Tracking Project.
Analysis of Covid-19 in the United States using Machine Learningmlaij
The unprecedented outbreak of COVID-19 also known as the coronavirus has caused a pandemic like none ever seen before this century. Its impact has been massive on a global level. The deadly virus has commanded nations around the world to increase their efforts to fight against the spread of the virus after the stress it has put on resources. With the number of new cases increasing day by day around the world, the objective of this paper is to contribute towards the analysis of the virus by leveraging machine learning models to understand its behavior and predict future patterns in the United States (US) based on data obtained from the COVID-19 Tracking Project.
Epidemic Alert System: A Web-based Grassroots ModelIJECEIAES
Most web-based disease surveillance systems that give epidemic alerts are based on very large and unstructured data from various news sources, social media and online queries that are parsed by complex algorithms. This has the tendency to generate results that are so diverse and non-specific. When considered along with the fact that there are no existing standards for mining and analyzing data from the internet, the results or decisions reached based on internet sources have been classified as low-quality. This paper proposes a web-based grassroots epidemic alert system that is based on data collected specifically from primary health centers, hospitals and registered laboratories. It takes a more traditional approach to indicator-based disease surveillance as a step towards standardizing web-based disease surveillance. It makes use of a threshold value that is based on the third quartile (75 th percentile) to determine the need to trigger the alarm for the onset of an epidemic. It also includes, for deeper analysis, demographic information.
How a U.S. COVID-19 Data Registry Fuels Global ResearchHealth Catalyst
In addition to driving COVID-19 understanding within the United States, a national disease registry is informing research beyond U.S. borders. Clinicians with the Singapore Ministry of Healthcare Office for Healthcare Transformation (MOHT) have used Health Catalyst Touchstone® COVID-19 data to develop a machine learning tool that helps predict the likelihood of COVID-19 mortality. With this national data set that leverages deep aggregated EHR data, the MOHT accessed the research-grade data it needed to build a machine-learning algorithm that predicts risk of death from COVID-19. The registry-informed prediction model was accurate enough to stand up to comparisons in the published literature and promises to help inform vaccine research and, ultimately, allocation of vaccines within populations.
In this Project we analyses the spread and impact of the novel coronavirus pandemic which has taken the world by storm with its rapid growth. In this we like to develop a machine learning model in Python to analyze what has been its impact so far and analyze the outbreak of COVID 19 across various regions, visualize them using charts and tables, and predict the number of upcoming confirmed cases.
COVID-19 data configuration and statistical analysisAnshJAIN50
The following report aims to identify the primary factors influencing the spread of Covid-19. To do this, I have analyzed the rate of spread in MEDCs and LEDCs - countries differing significantly in development. MEDCs, being more economically developed, tend to have superior healthcare, higher life expectancy, and generally better infrastructure, contrasting with LEDCs. This report aims to understand whether the characteristics of MEDCs and LEDCs can significantly impact the rate of spread of Covid-19, as well as more obscure factors that could have a greater impact than previously thought. In this report we will be examining 3 different MEDCs and LEDCs to develop a clear conclusion on whether we believe a country's development correlates to the rate of spread of Covid-19.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on the notifications, alerts, and approval requests using Slack for Bonterra Impact Management. The solutions covered in this webinar can also be deployed for Microsoft Teams.
Interested in deploying notification automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
1. University of Connecticut (MS-BAPM) Data Mining and Business Intelligence
1
Dengue Endemic
Forecasting for San Juan
Team 2:
▪ Saurav Gupta
▪ Sasidhar Konda
▪ Ankita Paunikar
▪ Parmod Rathee
▪ Huixian Wang
1. Executive Summary:
2. University of Connecticut (MS-BAPM) Data Mining and Business Intelligence
2
Dengue is the major cause of death and illness in Puerto Rico. There are around 4001 million
people infected every year in the world and worse yet, Dengue virus can be transmitted by
mosquito bites very quickly and for now, there is no effective vaccines to prevent the spread of
this disease. Therefore, it is extremely meaningful to analyze the historical data of Dengue
endemic, and use forecast the disease outbreaks in the future. The objective of our project is to
determine the relationship between environmental factors, such as temperature and humidity, and
the amount of disease cases, and to examine other intervention factors which might affect the
spread of Dengue disease. Through the process of model selection and analysis of forecasting
results, we are able to provide the public and governmental health services with relatively accurate
information by implementing our model. This paper reflects our conclusions based on our findings
of times series models and recommendations to better prepare for future Dengue endemic
outbreaks within limited resources.
2. Statement of the Problem:
Historical surveillance data is supported San Juan, Puerto Rico. The data include weekly
laboratory-confirmed and serotype-specific cases for the location. Environmental data (like
temperature and humidity) from weather stations, satellites, and climate models are also provided.2
Forecasted model will be able to answer following key points.
A. Timing of peak incidence, i.e when the highest incidence of dengue occurs during July
and October every year.
3. University of Connecticut (MS-BAPM) Data Mining and Business Intelligence
3
B. Maximum Quarterly incidence, the number of dengue cases reported during the quarter
when incidence peaks, is during July 2008.
3. Background:
From the starting, as team, we had the consensus that we are going to do the forecasting. When
we’re looking for the datasets for this project we had verity of datasets to choose from. We had a
dataset to forecast the sales of the shampoo to the dataset where we should forecast the price of a
stock. We selected Dengue dataset as it is still an ongoing struggle of the state and to utilize our
analytics learning to something beyond business domain. To familiarize ourselves with the dataset
we read articles and documents which are listed in the references section. They provided us a great
deal of clarity as there were few biological variables in the dataset. We understand the concern of
US government in dengue control as they are devoting resources for the containment of these
endemic. On the official website of the Centers for Diseases Control and Prevention we found that
“Travel-associated dengue infections occur and several dengue outbreaks have been detected in
the continental United States, most dengue cases among U.S. citizens occur because of endemic
transmission in some U.S. territories, such as Puerto Rico”.3
4. Methodology:
Forecasting models are based on an infectious disease – dengue cases data collected by the Centers
for Disease Control and Prevention (CDC), which include satellite precipitation, humidity and
temperature from 1990 to 2007. Dengue cases forecasting models were developed using
autoregressive integrated moving average models and produced quarterly forecast over a 3-year
forecasting period.
4. University of Connecticut (MS-BAPM) Data Mining and Business Intelligence
4
Event Description: In Puerto Rico, the 1994 water shortage most affected the agricultural sector,
with losses estimated at more than $94 million. Production was off by more than 50 percent for
vegetables like plantains, a staple in Puerto Rican cuisine whose price has doubled. Tourism did
not suffer, government officials say, but hotels, hospitals and other commercial customers spent
thousands of dollars a month on water to stay open. And health officials said water stored in open
containers everywhere bred mosquitoes and was a factor in the worst outbreak of dengue fever
since the 1960's.4 Dengue is endemic to Puerto Rico, which sees 3,000 to 9,000 cases in non-
epidemic years. The worst epidemics since 1990 saw 24,700 cases in 1994, 17,000 in 1998, and
10,508 in 2007. During the most recent epidemic in 2007, half of the cases were hospitalized, and
one third were hemorrhagic.5
Environmental data: Satellite-derived environmental data were obtained from National Oceanic
and Atmospheric Administration, which gave weekly average temperature (in Kelvin) and average
specific humidity (g/kg)
5. Results:
The Quarterly total cases we modeled using Point: 7 + Point: 18 + Point: 19 + Point: 34 + Point: 35 +
Point: 62 + AR(15) model. Although temperature and humidity are not significant regressors to
dengue quarterly cases, the several explosive outbreaks of dengue endemic in San Juan were all
occurred in seasons when both temperature and humidity are higher than normal days.
6. Conclusions and recommendations:
5. University of Connecticut (MS-BAPM) Data Mining and Business Intelligence
5
Environmental data such as temperature, precipitation, humidity, and vegetation could be better used
to improve the accuracy of dengue predictions.
Based on our analysis we would give following recommendations:
When analyzing characteristics of weather condition of peak transmission seasons, we could
apply proper insecticides to water storage areas during rain seasons, preventing mosquitoes
from accessing egg-laying habitats.
Create awareness poll in San Juan about Dengue specifically before June and October month
since it is peak Dengue period every year
As per the forecast for upcoming period government should set up mobile medical units.
Create awareness about clean and hygienic environment among citizens.
APPENDIX:
6. University of Connecticut (MS-BAPM) Data Mining and Business Intelligence
6
Time Series
Prediction Errors
Autocorrelation Plots
Stationary test probability
The series graph does not suggest any strong
trend or seasonality, however, there are at least
four major outbreaks corresponding to those
peaks in the plot. A plot of the predicted values
of the models and the original data is displayed.
The prediction errors appear to be random.
The autocorrelation plots of the residuals are
displayed and none of the spikes are significantly
different from zero. These plots confirm that the
model explains all the significant autocorrelation
that was in the original data.
The white noise tests indicate that there is no
significant autocorrelation in the residuals.
Residuals also pass tests for stationarity.
7. University of Connecticut (MS-BAPM) Data Mining and Business Intelligence
7
Parameter Estimates
Statistics Fit
Forecast
Forecast dataset
The autoregressive lag12 and lag15 are not
significantly different from zero. Except these
two lags, all the other AR coefficients are
significant at the 10% level. This model should
not be disqualified because statistical significance
can be misleading and large p-values for
estimates are not enough to disqualify a model.
The RMSE for this model is 116.86, which is
superior than other models we tried. MAPE can be
interpreted as: the forecasts average a 43.42%
error.
The forecasts look plausible based on a visual
inspection of the historical data.
Evidence suggests that this is the best model we
found and we may accept this model since its
performance is superior than any other model we
tried.
8. University of Connecticut (MS-BAPM) Data Mining and Business Intelligence
8
Developed Models
7. References
1. Official CDC website.
https://www.cdc.gov/dengue/index.html
2. Official website of National Oceanic and Atmospheric Administration.
http://dengueforecasting.noaa.gov/
3. Official CDC website.
https://www.cdc.gov/dengue/about/inpuerto.html
4. New York times article from 1995.
http://www.nytimes.com/1995/01/23/us/taps-go-dry-as-puerto-rico-copes-with-drought.html
5. World Health Organization
http://www.who.int/mediacentre/factsheets/fs117/en/
6. Puerto Rico Declares Dengue Epidemic
http://www.healthmap.org/site/diseasedaily/article/puerto-rico-declares-dengue-epidemic-101812
The screen capture above shows all models
during quarter 34, 19, 7, 18, 62, and 35, the
total cases of Dengue disease spike up, so we
added 6 interventions into the model. Also,
AR (15) is implied based on previous
analysis.