The document discusses a hybrid technique for associative classification of heart diseases using data mining. It summarizes existing classification and association rule mining algorithms applied to heart disease data. The author aims to improve accuracy by generating classification association rules efficiently and integrating classification with association rule mining. The proposed approach is implemented in Weka to extract rules from a heart disease dataset using Apriori and FP-Growth algorithms. The rules are used to classify patients and evaluate the performance compared to other methods.
HEALTH PREDICTION ANALYSIS USING DATA MININGAshish Salve
Data mining techniques are used for a variety of applications. In healthcare industry, datamining plays an important
role in predicting diseases. For detecting a disease number of tests should be required from the patient. But using data
mining technique the number of tests can be reduced. This reduced test plays an important role in time and performance.
This report analyses data mining techniques which can be used for predicting different types of diseases. This report reviewed
the research papers which mainly concentrate on predicting various disease
We are predicting Heart Disease by Taking 14 Medical Parameters as an inputs through 2 data Minning Techniques(Decision Tree(Faster) And KNN neighbour Algorithms(Slower)).
And Visualizing The dataset.If the output 1 then it means Higher Chances of getting Heart Attack ,if 0 then it means Less chances of Heart Attack.
This describes the techniques that are used for prediction of heart diseases using the concept of data mining.It states about IHPDS(Intelligent Heart Disease Prediction System)
Heart is the most important organ of a human body. It circulates oxygen and other vital nutrients through blood to different parts of the body and helps in the metabolic activities. Apart from this it also helps in removal of the metabolic wastes. Thus, even minor problems in heart can affect the whole organism. Researchers are diverting a lot of data analysis work for assisting the doctors to predict the heart problem. So, an analysis of the data related to different health problems and its functioning can help in predicting with a certain probability for the wellness of this organ. In this paper we have analysed the different prescribed data of 1094 patients from different parts of India. Using this data, we have built a model which gets trained using this data and tries to predict whether a new out-of-sample data has a probability of having any heart attack or not. This model can help in decision making along with the doctor to treat the patient well and creating a transparency between the doctor and the patient. In the validation set of the data, it’s not only the accuracy that the model has to take care, rather the True Positive Rate and False-Negative Rate along with the AUC-ROC helps in building/fixing the algorithm inside the model.
HEALTH PREDICTION ANALYSIS USING DATA MININGAshish Salve
Data mining techniques are used for a variety of applications. In healthcare industry, datamining plays an important
role in predicting diseases. For detecting a disease number of tests should be required from the patient. But using data
mining technique the number of tests can be reduced. This reduced test plays an important role in time and performance.
This report analyses data mining techniques which can be used for predicting different types of diseases. This report reviewed
the research papers which mainly concentrate on predicting various disease
We are predicting Heart Disease by Taking 14 Medical Parameters as an inputs through 2 data Minning Techniques(Decision Tree(Faster) And KNN neighbour Algorithms(Slower)).
And Visualizing The dataset.If the output 1 then it means Higher Chances of getting Heart Attack ,if 0 then it means Less chances of Heart Attack.
This describes the techniques that are used for prediction of heart diseases using the concept of data mining.It states about IHPDS(Intelligent Heart Disease Prediction System)
Heart is the most important organ of a human body. It circulates oxygen and other vital nutrients through blood to different parts of the body and helps in the metabolic activities. Apart from this it also helps in removal of the metabolic wastes. Thus, even minor problems in heart can affect the whole organism. Researchers are diverting a lot of data analysis work for assisting the doctors to predict the heart problem. So, an analysis of the data related to different health problems and its functioning can help in predicting with a certain probability for the wellness of this organ. In this paper we have analysed the different prescribed data of 1094 patients from different parts of India. Using this data, we have built a model which gets trained using this data and tries to predict whether a new out-of-sample data has a probability of having any heart attack or not. This model can help in decision making along with the doctor to treat the patient well and creating a transparency between the doctor and the patient. In the validation set of the data, it’s not only the accuracy that the model has to take care, rather the True Positive Rate and False-Negative Rate along with the AUC-ROC helps in building/fixing the algorithm inside the model.
Existing model uses structured data to predict the patients of either high risk or low risk.
But for a complex disease, structured data is not a good way to describe the disease.
We propose a new convolutional neural network (CNN)-based multimodal disease risk prediction algorithm using structured and unstructured data from hospital.
In this paper, we mainly focus on the risk prediction of cerebral infarction.
Prediction of Heart Disease using Machine Learning Algorithms: A Surveyrahulmonikasharma
According to recent survey by WHO organisation 17.5 million people dead each year. It will increase to 75 million in the year 2030[1].Medical professionals working in the field of heart disease have their own limitation, they can predict chance of heart attack up to 67% accuracy[2], with the current epidemic scenario doctors need a support system for more accurate prediction of heart disease. Machine learning algorithm and deep learning opens new door opportunities for precise predication of heart attack. Paper provideslot information about state of art methods in Machine learning and deep learning. An analytical comparison has been provided to help new researches’ working in this field.
PREDICTION OF DIABETES MELLITUS USING MACHINE LEARNING TECHNIQUESIAEME Publication
Diabetes mellitus is a common disease caused by a set of metabolic ailments
where the sugar stages over drawn-out period is very high. It touches diverse organs
of the human body which therefore harm a huge number of the body's system, in
precise the blood strains and nerves. Early prediction in such disease can be exact
and save human life. To achieve the goal, this research work mainly discovers
numerous factors associated to this disease using machine learning techniques.
Machine learning methods provide effectual outcome to extract knowledge by building
predicting models from diagnostic medical datasets together from the diabetic
patients. Quarrying knowledge from such data can be valuable to predict diabetic
patients. In this research, six popular used machine learning techniques, namely
Random Forest (RF), Logistic Regression (LR), Naive Bayes (NB), C4.5 Decision
Tree (DT), K-Nearest Neighbor (KNN), and Support Vector Machine (SVM) are
compared in order to get outstanding machine learning techniques to forecast diabetic
mellitus. Our new outcome shows that Support Vector Machine (SVM) achieved
higher accuracy compared to other machine learning techniques.
Brain tumor classification using artificial neural network on mri imageseSAT Journals
Abstract
In this paper, an attempt has been made to summarize the multi-resolution transformation and the different classifiers useful to
analyze the brain tumor using MRI. X-ray, MRI, Ultrasound etc. are different techniques used to scan brain tumor images.
Radiologist prefers MRI to get detail information about tumor to help him diagnoses. In this paper we have used MRI of brain
tumor for analysis. We have used Digital image processing tool for detection of the tumor. The identification, detection and
classification of brain tumor have been done by extracting features from MRI with the help of wavelet transformation. The MRI of
brain tumor is classified into two categories normal and abnormal brain. In this work Digital image processing has been used as
a tool for getting clear and exact details about tumor in earlier stages. This helps the physicians and practitioners for diagnoses.
Key word – Brain tumor, Wavelet transform, segmentation.
Bias: It is the amount by which Machine Learning (ML) model predictions differ from the actual value of the target.
Variance: It is the amount by which the ML model prediction would change if we estimate it using different training datasets.
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISijcsit
The Healthcare industry contains big and complex data that may be required in order to discover fascinating pattern of diseases & makes effective decisions with the help of different machine learning techniques. Advanced data mining techniques are used to discover knowledge in database and for medical
research. This paper has analyzed prediction systems for Diabetes, Kidney and Liver disease using more
number of input attributes. The data mining classification techniques, namely Support Vector Machine(SVM) and Random Forest (RF) are analyzed on Diabetes, Kidney and Liver disease database. The performance of these techniques is compared, based on precision, recall, accuracy, f_measure as well
as time. As a result of study the proposed algorithm is designed using SVM and RF algorithm and the experimental result shows the accuracy of 99.35%, 99.37 and 99.14 on diabetes, kidney and liver disease respectively.
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBaseijtsrd
The early prognosis of cardiovascular diseases can aid in making decisions to lifestyle changes in high risk patients and in turn reduce their complications. Research has attempted to pinpoint the most influential factors of heart disease as well as accurately predict the overall risk using homogenous data mining techniques. Recent research has delved into amalgamating these techniques using approaches such as hybrid data mining algorithms. This paper proposes a rule based model to compare the accuracies of applying rules to the individual results of logistic regression on the Cleveland Heart Disease Database in order to present an accurate model of predicting heart disease. K. Sandhya Rani | M. Sai Chaitanya | G. Sai Kiran"A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-2 | Issue-3 , April 2018, URL: http://www.ijtsrd.com/papers/ijtsrd11402.pdf http://www.ijtsrd.com/computer-science/data-miining/11402/a-heart-disease-prediction-model-using-logistic-regression-by-cleveland-database/k-sandhya-rani
With the increasingly connected world revolving around the revolution of internet and new technologies like mobiles, smartphones, and tablets, and with the wide usage of wireless technologies, the information security risks have increased. Both individuals and organizations are under regular attacks for commercial or non-commercial gains. The objectives of such attacks may be to take revenge, malign the reputation of a competitor organization, understand the strategies and sensitive information about the competitor, simply have fun of exploiting the vulnerabilities. Hence, the need to protect information assets and ensure information security receives adequate attention.
In this session, I will discuss how AI and Machine Learning can be applied in detecting, predicting and preventing cyber security/information security vulnerabilities and what are the benefits of using Machine Learning and AI. We also touch upon some of the tools available to perform the same.
Psdot 14 using data mining techniques in heartZTech Proje
FINAL YEAR IEEE PROJECTS,
EMBEDDED SYSTEMS PROJECTS,
ENGINEERING PROJECTS,
MCA PROJECTS,
ROBOTICS PROJECTS,
ARM PIC BASED PROJECTS, MICRO CONTROLLER PROJECTS Z Technologies, Chennai
Existing model uses structured data to predict the patients of either high risk or low risk.
But for a complex disease, structured data is not a good way to describe the disease.
We propose a new convolutional neural network (CNN)-based multimodal disease risk prediction algorithm using structured and unstructured data from hospital.
In this paper, we mainly focus on the risk prediction of cerebral infarction.
Prediction of Heart Disease using Machine Learning Algorithms: A Surveyrahulmonikasharma
According to recent survey by WHO organisation 17.5 million people dead each year. It will increase to 75 million in the year 2030[1].Medical professionals working in the field of heart disease have their own limitation, they can predict chance of heart attack up to 67% accuracy[2], with the current epidemic scenario doctors need a support system for more accurate prediction of heart disease. Machine learning algorithm and deep learning opens new door opportunities for precise predication of heart attack. Paper provideslot information about state of art methods in Machine learning and deep learning. An analytical comparison has been provided to help new researches’ working in this field.
PREDICTION OF DIABETES MELLITUS USING MACHINE LEARNING TECHNIQUESIAEME Publication
Diabetes mellitus is a common disease caused by a set of metabolic ailments
where the sugar stages over drawn-out period is very high. It touches diverse organs
of the human body which therefore harm a huge number of the body's system, in
precise the blood strains and nerves. Early prediction in such disease can be exact
and save human life. To achieve the goal, this research work mainly discovers
numerous factors associated to this disease using machine learning techniques.
Machine learning methods provide effectual outcome to extract knowledge by building
predicting models from diagnostic medical datasets together from the diabetic
patients. Quarrying knowledge from such data can be valuable to predict diabetic
patients. In this research, six popular used machine learning techniques, namely
Random Forest (RF), Logistic Regression (LR), Naive Bayes (NB), C4.5 Decision
Tree (DT), K-Nearest Neighbor (KNN), and Support Vector Machine (SVM) are
compared in order to get outstanding machine learning techniques to forecast diabetic
mellitus. Our new outcome shows that Support Vector Machine (SVM) achieved
higher accuracy compared to other machine learning techniques.
Brain tumor classification using artificial neural network on mri imageseSAT Journals
Abstract
In this paper, an attempt has been made to summarize the multi-resolution transformation and the different classifiers useful to
analyze the brain tumor using MRI. X-ray, MRI, Ultrasound etc. are different techniques used to scan brain tumor images.
Radiologist prefers MRI to get detail information about tumor to help him diagnoses. In this paper we have used MRI of brain
tumor for analysis. We have used Digital image processing tool for detection of the tumor. The identification, detection and
classification of brain tumor have been done by extracting features from MRI with the help of wavelet transformation. The MRI of
brain tumor is classified into two categories normal and abnormal brain. In this work Digital image processing has been used as
a tool for getting clear and exact details about tumor in earlier stages. This helps the physicians and practitioners for diagnoses.
Key word – Brain tumor, Wavelet transform, segmentation.
Bias: It is the amount by which Machine Learning (ML) model predictions differ from the actual value of the target.
Variance: It is the amount by which the ML model prediction would change if we estimate it using different training datasets.
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISijcsit
The Healthcare industry contains big and complex data that may be required in order to discover fascinating pattern of diseases & makes effective decisions with the help of different machine learning techniques. Advanced data mining techniques are used to discover knowledge in database and for medical
research. This paper has analyzed prediction systems for Diabetes, Kidney and Liver disease using more
number of input attributes. The data mining classification techniques, namely Support Vector Machine(SVM) and Random Forest (RF) are analyzed on Diabetes, Kidney and Liver disease database. The performance of these techniques is compared, based on precision, recall, accuracy, f_measure as well
as time. As a result of study the proposed algorithm is designed using SVM and RF algorithm and the experimental result shows the accuracy of 99.35%, 99.37 and 99.14 on diabetes, kidney and liver disease respectively.
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBaseijtsrd
The early prognosis of cardiovascular diseases can aid in making decisions to lifestyle changes in high risk patients and in turn reduce their complications. Research has attempted to pinpoint the most influential factors of heart disease as well as accurately predict the overall risk using homogenous data mining techniques. Recent research has delved into amalgamating these techniques using approaches such as hybrid data mining algorithms. This paper proposes a rule based model to compare the accuracies of applying rules to the individual results of logistic regression on the Cleveland Heart Disease Database in order to present an accurate model of predicting heart disease. K. Sandhya Rani | M. Sai Chaitanya | G. Sai Kiran"A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-2 | Issue-3 , April 2018, URL: http://www.ijtsrd.com/papers/ijtsrd11402.pdf http://www.ijtsrd.com/computer-science/data-miining/11402/a-heart-disease-prediction-model-using-logistic-regression-by-cleveland-database/k-sandhya-rani
With the increasingly connected world revolving around the revolution of internet and new technologies like mobiles, smartphones, and tablets, and with the wide usage of wireless technologies, the information security risks have increased. Both individuals and organizations are under regular attacks for commercial or non-commercial gains. The objectives of such attacks may be to take revenge, malign the reputation of a competitor organization, understand the strategies and sensitive information about the competitor, simply have fun of exploiting the vulnerabilities. Hence, the need to protect information assets and ensure information security receives adequate attention.
In this session, I will discuss how AI and Machine Learning can be applied in detecting, predicting and preventing cyber security/information security vulnerabilities and what are the benefits of using Machine Learning and AI. We also touch upon some of the tools available to perform the same.
Psdot 14 using data mining techniques in heartZTech Proje
FINAL YEAR IEEE PROJECTS,
EMBEDDED SYSTEMS PROJECTS,
ENGINEERING PROJECTS,
MCA PROJECTS,
ROBOTICS PROJECTS,
ARM PIC BASED PROJECTS, MICRO CONTROLLER PROJECTS Z Technologies, Chennai
Heart Disease Prediction Using Data Mining TechniquesIJRES Journal
There are huge amounts of data in the medical industry which is not processed properly and hence cannot be used effectively in making decisions. We can use data mining techniques to mine these patterns and relationships. This research has developed a prototype Heart Disease Prediction using data mining techniques, namely Neural Network, K-Means Clustering and Frequent Item Set Generation. Using medical profiles such as age, sex, blood pressure and blood sugar it can predict the likelihood patients getting a heart disease. It enables significant knowledge, e.g. patterns, relationships between medical factors related to heart disease to be established. Performance of these techniques is compared through sensitivity, specificity and accuracy. It has been observed that Artificial Neural Networks outperform K Means clustering in all the parameters i.e. Sensitivity, Specificity and Accuracy.
Smart health prediction using data mining by customsoftCustom Soft
CustomSoft India based software company developed wonderful software for Smart Health prediction using Data Mining for its esteemd Ckients from US, UK, Canada, Singapore, South Africa based clients
Health Prediction System - an Artificial Intelligence Project 2015Maruf Abdullah (Rion)
Health Prediction System - An Artificial Intelligence Project 2015
The project aimed to build a fully functional system in order to achieve the efficiency in faster heath treatment and online consultation system. The overall mission of system development is to make the primary treatment quickly and easily complete the Online Consultation System.
In machine learning, support vector machines (SVMs, also support vector networks[1]) are supervised learning models with associated learning algorithms that analyze data and recognize patterns, used for classification and regression analysis. The basic SVM takes a set of input data and predicts, for each given input, which of two possible classes forms the output, making it a non-probabilistic binary linear classifier.
A comparative analysis of classification techniques on medical data setseSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
10 Ways Your Boss Kills Employee MotivationOfficevibe
It’s so hard to have engaged employees. It’s such a delicate thing to try and get right because employees can be fragile.
As a manager, you have to do everything in your power to make sure employees are happy and engaged at all times.
Usually, the problem is the boss, and not things like the company, mission statement, or co-workers.
If you know that your boss is the biggest problem, there are ten things that they do to kill motivation. If you’re a manager and you’re reading this, make sure you avoid these mistakes to ensure that your employees are engaged during work.
The secret to good leadership is to be authentic. Be honest with your staff.
Read more on Officevibe blog:
https://www.officevibe.com/blog/10-kill-employee-motivation
like us on Facebook!:
www.facebook.com/officevibe
In healthcare sector, data are enormous and diverse because it contains a data of different types and getting knowledge from these data is crucial. So to get that knowledge, data mining techniques may be utilized to mine knowledge by building models from healthcare dataset. At present, the classification of heart diseases patients has been a demanding research confront for many researchers. For building a classification model for a these patient, we used four different classification algorithms such as NaiveBayes, MultilayerPerceptron, RandomForest and DecisionTable. The intention behind this work is to classify that whether a patient is tested positive or tested negative for heart diseases, based on some diagnostic measurements integrated into the dataset.
A Survey on Heart Disease Prediction Techniquesijtsrd
Heart disease is the main reason for a huge number of deaths in the world over the last few decades and has evolved as the most life threatening disease. The health care industry is found to be rich in information. So, there is a need to discover hidden patterns and trends in them. For this purpose, data mining techniques can be applied to extract the knowledge from the large sets of data. Many researchers, in recent times have been using several machine learning techniques for predicting the heart related diseases as it can predict the disease effectively. Even though a machine learning technique proves to be effective in assisting the decision makers, still there is a scope for developing an accurate and efficient system to diagnose and predict the heart diseases thereby helping doctors with ease of work. This paper presents a survey of various techniques used for predicting heart disease and reviews their performance. G. Niranjana | Dr I. Elizabeth Shanthi "A Survey on Heart Disease Prediction Techniques" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-5 | Issue-2 , February 2021, URL: https://www.ijtsrd.com/papers/ijtsrd38349.pdf Paper Url: https://www.ijtsrd.com/computer-science/other/38349/a-survey-on-heart-disease-prediction-techniques/g-niranjana
Ascendable Clarification for Coronary Illness Prediction using Classification...ijtsrd
Coronary disease is predicted by classification technique. The data mining tool WEKA has been exploited for implementing Naïve Bayes classifier. Proposed work is trapped with a specific end goal to enhance the execution of models. For improving the classification accuracy Naïve Bayes is combined with Bagging and Attribute Selection. Trial results demonstrated a critical change over in the current Naïve Bayes classifier. This approach enhances the classification accuracy and reduces computational time. D. Haripriya | Dr. M. Lovelin Ponn Felciah "Ascendable Clarification for Coronary Illness Prediction using Classification Mining and Feature Selection Performances" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-5 , August 2019, URL: https://www.ijtsrd.com/papers/ijtsrd26690.pdfPaper URL: https://www.ijtsrd.com/computer-science/data-miining/26690/ascendable-clarification-for-coronary-illness-prediction-using-classification-mining-and-feature-selection-performances/d-haripriya
StarCompliance is a leading firm specializing in the recovery of stolen cryptocurrency. Our comprehensive services are designed to assist individuals and organizations in navigating the complex process of fraud reporting, investigation, and fund recovery. We combine cutting-edge technology with expert legal support to provide a robust solution for victims of crypto theft.
Our Services Include:
Reporting to Tracking Authorities:
We immediately notify all relevant centralized exchanges (CEX), decentralized exchanges (DEX), and wallet providers about the stolen cryptocurrency. This ensures that the stolen assets are flagged as scam transactions, making it impossible for the thief to use them.
Assistance with Filing Police Reports:
We guide you through the process of filing a valid police report. Our support team provides detailed instructions on which police department to contact and helps you complete the necessary paperwork within the critical 72-hour window.
Launching the Refund Process:
Our team of experienced lawyers can initiate lawsuits on your behalf and represent you in various jurisdictions around the world. They work diligently to recover your stolen funds and ensure that justice is served.
At StarCompliance, we understand the urgency and stress involved in dealing with cryptocurrency theft. Our dedicated team works quickly and efficiently to provide you with the support and expertise needed to recover your assets. Trust us to be your partner in navigating the complexities of the crypto world and safeguarding your investments.
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
As Europe's leading economic powerhouse and the fourth-largest hashtag#economy globally, Germany stands at the forefront of innovation and industrial might. Renowned for its precision engineering and high-tech sectors, Germany's economic structure is heavily supported by a robust service industry, accounting for approximately 68% of its GDP. This economic clout and strategic geopolitical stance position Germany as a focal point in the global cyber threat landscape.
In the face of escalating global tensions, particularly those emanating from geopolitical disputes with nations like hashtag#Russia and hashtag#China, hashtag#Germany has witnessed a significant uptick in targeted cyber operations. Our analysis indicates a marked increase in hashtag#cyberattack sophistication aimed at critical infrastructure and key industrial sectors. These attacks range from ransomware campaigns to hashtag#AdvancedPersistentThreats (hashtag#APTs), threatening national security and business integrity.
🔑 Key findings include:
🔍 Increased frequency and complexity of cyber threats.
🔍 Escalation of state-sponsored and criminally motivated cyber operations.
🔍 Active dark web exchanges of malicious tools and tactics.
Our comprehensive report delves into these challenges, using a blend of open-source and proprietary data collection techniques. By monitoring activity on critical networks and analyzing attack patterns, our team provides a detailed overview of the threats facing German entities.
This report aims to equip stakeholders across public and private sectors with the knowledge to enhance their defensive strategies, reduce exposure to cyber risks, and reinforce Germany's resilience against cyber threats.
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Techniques to optimize the pagerank algorithm usually fall in two categories. One is to try reducing the work per iteration, and the other is to try reducing the number of iterations. These goals are often at odds with one another. Skipping computation on vertices which have already converged has the potential to save iteration time. Skipping in-identical vertices, with the same in-links, helps reduce duplicate computations and thus could help reduce iteration time. Road networks often have chains which can be short-circuited before pagerank computation to improve performance. Final ranks of chain nodes can be easily calculated. This could reduce both the iteration time, and the number of iterations. If a graph has no dangling nodes, pagerank of each strongly connected component can be computed in topological order. This could help reduce the iteration time, no. of iterations, and also enable multi-iteration concurrency in pagerank computation. The combination of all of the above methods is the STICD algorithm. [sticd] For dynamic graphs, unchanged components whose ranks are unaffected can be skipped altogether.
2. Table of Contents
Ø Introduction
Ø Motivation
Ø Data Mining
Ø Classification
Ø Association
Ø Heart Disease Database
Ø Literature Survey
Ø Problem Formulation
Ø Objectives
Ø Present Work
Ø Result and Discussion
Ø Conclusion
Ø Future Scope
Ø References
3. Motivation
Ø Accumulation of huge data-sets in the field of
Engineering and Biomedical Science.
Ø Ability to extract hidden and useful knowledge from
large databases.
Ø Need to development intelligent and cost effective
decision support system.
Ø How to teach the people to ignore the irrelevant
data.
Ø The greatest problem of today is to get optimal
outcome of irrelevant data.
4. Data Mining
Ø Data mining computational process of finding
patterns in large data sets including methods at the
intersection of machine learning, artificial
intelligence, statistics and database systems.
Ø The main focus of data mining process is to obtain
information from the data and converted it into an
knowledgeable and reasonable structure for further
use.
6. Classification
Classification is the problem of identifying to which of
a set of categories a new observation belongs, on the
basis of a training set of data containing observations
(or instances) whose category membership is known.
7. Association
Association learning method for discovering interesting
relations between variables in large databases. It is
intended to identify strong rules discovered in
databases using different measures of interestingness.
For example, the rule :
{onions, potatoes} => {burger}.
8. Example : Heart diseases Dataset
ID age Gender Chest pain
Blood
pressure
diagnosis
1
63
male
typ_angina
High
No
2
67
male
asympt
very_high
Yes
3
67
male
asympt
high
Yes
4
37
male
non_anginal
high
No
5
41
female
atyp_angina
high
No
6
56
male
atyp_angina
high
No
7
62
female
asympt
high
Yes
8
57
female
asympt
high
No
9
63
male
asympt
high
Yes
10
53
male
asympt
high
Yes
11
57
male
asympt
high
No
12
56
female
atyp_angina
high
No
13
56
male
non_anginal
high
Yes
14
44
male
atyp_angina
high
No
10. Result new prediction ?
age gender Chest pain Blood
pressure
diagnosis
52
male
non_anginal
very_high
11. Classifiers
Ø ZeroR : There is no predictability, it is useful for determining a baseline
performance as a benchmark for other classification methods.
Ø OneR : Classification rules based on the value of a single predictor, that generates
one rule for each predictor in the data.
Ø NaiveBayes: Bayes rule is implemented or assigned to make easier to evaluate
prior from a probability model. it handles condition of some missing entries in data.
Ø J48: It creates a binary tree, With this technique, a tree is constructed to model the
classification process.
Ø IBk (k nearest neighbour): The nearest neighbor algorithm categorise a given
instance depend on a set of already categorise the training set by measuring the
distance to the closed instances
12. Association Methods
Ø Aprior Algorithm: Find rules that will predict the
occurrence of an item based on the occurrences of
other items in the transaction.
Ø FP-Growth Algorithm: Allows frequent discovery
without candidate itemset generation. Extracts
frequent itemsets form the FP-tree. Follow Divide
and conquer approach.
13. Heart Disease Database
Sr. No.
Attributes
Description
Values
1
age
Age in years
Continuous
2
gender
Male or female
1 = Male,
0 = female
3
cp
Chest pain type
1 = typical type,
2 = typical type angina,
3 = non-angina pain,
4 = asymptomatic
4
thestbps
Resting blood pres-
sure
Continuous value in mm hg
5
chol
Serum cholesterol
Continuous value in mm/dl
6
thalach
Maximum heart rate
achieved
Continuous value
7
fbs
Fasting blood sugar
1 =>120 mg/dl,
0 =<120 mg/dl
14. Continue…
8
Restecg
Resting electro-
graphic results
0 = normal,
1 = having ST-T wave abnormal,
2 = left ventricular hypertrophy
9
exang
Exercise induced
angina
0 = no 1 = yes
10
oldpeak
ST depression
induced by exercise
relative to rest
Continuous value
11
slope
Slope of the peak
exercise ST segment
1 = unsloping,
2 = flat,
3 = downsloping
12
ca
Number of major
vessels colored by
floursopy
0 - 3 value
13
thal
Defect type
3 = normal,
6 = fixed,
7 = reversible defect
14
Diagnosis
Heart disease Predi-
cation
Value 1: no heart disease
Value 0: has heart disease
15. Literature Survey
Ø Liao et al. [3] author report about data mining techniques and application,
development through a survey of literature, form 2000 to 2011. Paper surveys
three areas of data mining research: knowledge types, analysis types, and
architecture types. A discussion deals with future progress in social science and
Engineering methodologies implement data mining techniques and the development
of applications in problem- oriented
Ø Liu et al. [4] presented an associative classification, to integrate classification rules
and association rule mining. The integration is done by focusing on mining a special
subset of association rules whose consequent parts are restricted to the classification
class labels, called Class Association Rules (CARs). This algorithm first generates all
the association rules and then selects a small set of rules to form the classifiers.
When predicting the class label for a coming sample, the best rule is chosen.
16. Continue…
Ø The first association rule mining algorithm was the Apriori algorithm [5] developed
by Agrawal, and swami. The Apriori algorithm generates the candidate item sets in
one pass through only the item sets with large support in the previous pass, without
considering the transactions in the database.
Ø Palaniappan and Awang [6] developed a prototype Intelligent Heart Disease
Prediction System (IHDPS) using data mining techniques, namely, Decision Trees,
Nave Bayes and Neural Network. Results show that each technique has its unique
strength in realizing the objectives of the defined mining goals. IHDPS can answer
complex what if queries which traditional decision support systems cannot. Using
medical profiles such as age, gender, blood pressure and blood sugar it can predict
the likelihood of patients getting a heart disease. IHDPS is Web-based, user-
friendly, scalable, reliable and expandable. It is implemented on the .NET platform.
17. Continue…
Ø Srinivas et al. [7] presented Application of Data Mining Technique in Healthcare and
Prediction of Heart Attacks. The potential use of classification based data mining techniques
such as Rule based, Decision tree, Nave Bayes and Artificial Neural Network to the massive
Volume of healthcare data. Tanagra data mining tool was used for exploratory data analysis,
machine learning and statistical learning algorithms. The training data set consists of 3000
instances with14 different attributes.
Ø Shouman et al. [8] proposed k-means clustering with the decision tree method to predict the
heart disease. In their work they suggested several centroid selection methods for k- means
clustering to increase efficiency. The 13 input attributes were collected from Cleveland Clinic
Foundation Heart disease data set. For the random attribute and random row methods, ten
runs were executed and the average and best for each method were calculated. In Addition,
integrating k-means clustering and decision tree could achieve higher accuracy than the
paging algorithm in the diagnosis of heart disease patients. The accuracy achieved was
83.9% by the enabler method with two clusters.
The algorithm used
Accuracy
Time taken
Naive Bayes
52.33%
609ms
Decision list
52%
719ms
K-NN
45.67%
1000ms
18. Summary and Gaps Identified
Ø Implementation of different methods like NaiveBayes, Decision tree and
Neural, K-nearest, Artificial Neural Network etc, is done on heart disease
dataset.
Ø The performance of the classifiers is evaluated and their results are
analysed.
Ø Maximum accuracy achieved according to the survey is 83.9% using K-
means clustering with decision tree.
Ø The classification methods does not provide better accuracy and
experimental results.
Ø Integration of associative classification is not yet implemented on heart
diseases data set.
19. Problem Formulation
Ø Accuracy of heart data diseases is only calculate on basis of classification
methods.
Ø Accuracy of corrected classified instances is less to predict heart diseases.
Ø Association and classification suffers from inefficiency due to the fact that it
often generates a very large number of insignificant rules.
Ø Most of the associative classification algorithms adopt the exhaustive search
method to discover the rules and require multiple passes over the
database.
Ø They find frequent items in one phase and generate the rules in a separate
phase consuming more resources such as storage and processing time.
20. Objectives
Ø To propose a technique that can generate
Classification Association Rules (CARs) efficiently for
heart diseases prediction.
Ø Perform evaluation of proposed approach.
Ø Comparative analysis of proposed method with
other state-of-the-art techniques
21. Present Work
The Present Work has been implemented using data mining tool Weka .
Implementation steps are listed below :
1. Review of the classification and association rule generation methods.
2. Understanding the existing algorithm of classification.
3. Study the existing methods of Classification and association to predict heart
diseases.
4. Understanding the heart disease data set attributes used in predication.
5. Study ARFF file format standard of representing datasets.
6. Preparing data set for implementation of association algorithm
22.
23. Continue…
7. Implement association algorithm like Aprior and FP growth on prepared
data set.
8. Select the best 10 rules for each associate algorithm.
9. Make classes and extract training data sets bases on different rules.
10. Implement classification algorithms on extracted training data set.
11. Compared the performance and accuracy of corrected classified instances
of classification methods.
12. Construct a system based on high performance and better accuracy of
classification meth- ods.
26. Sample Data form of Heart Disease Prediction
Online Available : http://gndec.ac.in/~jagdeepmalhi/ihdps/
27. Sample Data of Heart Disease Prediction for Risk Level: No
28. Sample Data of Heart Disease Prediction for Risk Level: Low
29. Sample Data of Heart Disease Prediction for Risk Level: High
30. Results and Discussion
The Evaluation of results is done on bases of two
categories.
Ø Compare the different parameters like time taken,
Correctly/Incorrectly classified instances, Kappa statistic
value, mean absolute error and root mean squared
error rate of different classifier with Aprior and FP-
Growth association algorithm.
Ø Compare the accuracy evaluated by different authors
on the heart disease dataset.
31. Continue…
Comparison of different classifiers using Aprior association
algorithm on heart diseases dataset.
Classifiers
Time
Taken (In
seconds)
Correctly
Classified
I n s t a n c e s
(%)
Incorrectly
Classified
I n s t a n c e s
(%)
Kappa
statistic
Mean
absolute
error
Root mean
squared
error
ZeroR
0.001
67.2
32.79
0
0.441
0.470
OneR
0.01
97.31
2.6
0.94
0.027
0.164
J48
0.04
97.85
2.15
0.951
0.031
0.143
IBk
0.003
99.19
0.81
0.982
0.010
0.090
NaiveBayes
0.01
97.58
2.42
0.946
0.023
0.137
32. Continue…
Comparison of different classifiers using FP- Growth
association algorithm on heart diseases dataset.
Classifiers
Time
Taken (In
seconds)
Correctly
Classified
Instances
(%)
Incorrectly
Classified
Instances
(%)
Kappa
statistic
Mean
absolute
error
Root mean
squared
error
ZeroR
0.001
85.67
14.33
0
0.247
0.350
OneR
0.005
92.55
7.45
0.649
0.075
0.273
J48
0.01
96.56
3.44
0.859
0.056
0.185
IBk
0.001
94.84
5.16
0.779
0.053
0.227
NaiveBayes
0.003
97.55
7.45
0.711
0.088
0.265
33. Continue…
Comparison of Aprior and FP-Growth association
algorithms heart diseases dataset
Association
Algorithms
ZeroR
accuracy
OneR
accuracy
J48
accuracy
IBk
accuracy
NaiveBayes
accuracy
Aprior
67.2
97.31
97.85
99.19
97.58
FP-Growth
85.67
92.55
96.56
94.84
97.55
34. Continue…
Comparison of results evaluated by different authors
on the heart disease dataset.
Author /Year Technique Accuracy (%)
Cheung 2001 [11] NaiveBayes 81.48
Polat and Sahan et al. 2007 [12] K-Nearest Neighbor 87.00
Shouman and Turner et al. 2012 [13] Decision tree 84.10
Das and Turkoglu et al. 2009 [14] K-Nearest Neighbor 97.40
Tu and Shin et al. 2009 [15] J4.8 Decision Tree 78.90
Proposed Method 2014 IBk with Aprior Algorithm 99.19
35. Conclusion
Ø The development of a hybrid technique for implementation
of associative classification is done on heart diseases
dataset to predict more accurate results.
Ø Dataset is implement on weka environment and compared
the performance of different classifier after apply
association algorithm.
Ø Results show that IBk (k Nearest Neighbor) with Aprior
associative algorithms shows better results than others.
Ø Compare the results of different classifiers with proposed
implementation methods.
Ø Finally develop Intelligent Heart Diseases Prediction System
(IHDPS) for end user to check the risk of heart diseases.
36. Future Scope
Ø In future work plan to reduce numbers of attributes
and to determine the attribute which contribute
towards the diagnosis of heart disease.
Ø Additional Data Mining techniques can be
incorporated to provide better results.
Ø There is a need to build a system where every
human can check the risk of heart diseases using
minimum recourses and parameters.
Ø Parameters like processing time, resources and
memory used can be further enhanced.
37. References
1) U. Fayyad, G. Piatetsky-Shapiro, and P. Smyth, “Data mining to knowledge discovery in
databases,” American Association for Artificial Intelligence, vol. 17, no. 3, pp. 37–54, 1996.
2) D. Aha. (1988, July) Heart disease databases. [Online]. Available: http://repository.seasr.
org/Datasets/UCI/arff/heart-c.arff.
3) S. H. Liao, P. H. Chu, and P. Y. Hsiao, “Data mining techniques and applications - a decade
review from 2000 to 2011,” Elsevier Expert Systems with Applications, vol. 39, no. 1, pp. 11
303–11 311, 2012.
4) B. Liu, W. Hsu, and Y. Ma, “Integrating classification and association rule mining,” In
Knowledge Discovery and Data Mining, New York, vol. 2, pp. 80–86, 1998.
5) R. Agrawal and R. Srikant, “Fast algorithms for mining association rules,” in VLDB, Santi-
ago, Chile, September 1994, pp. 487–499.
6) S.Palaniappan and R.Awang, “Intelligent heart disease prediction system using data mining
techniques,” in IEEE/ACS International Conference, Doha, 2008, pp. 108–115.
7) K. Srinivas, B. K. Rani, and D. A. Govrdhan, “Application of data mining techniques in
healthcare and prediction of heart attacks,” International Journal on Computer Science and
Engineering, vol. 2, no. 2, pp. 250–255, 2011.
38. Continue …
8) M. Shouman, T. Turner, and R. Stocker, “Integrating decision tree and k-means clustering with different
initial centroid selection methods in the diagnosis of heart disease patients,” in Proceedings of the
International Conference on Data Mining, 2012.
10) J. Singh, H. Singh, and A. Kamra, “Recent trends in data mining: A review,” in Proceeding of 3rd
International Conference on Biomedical Engineering and Assistive Technologies, Chandigarh, India, 2014,
pp. 138–144.
11) N.Cheung, “Machine learning techniques for medical analysis,” B.Sc. Thesis, School of Information
Technology and Electrical Engineering, University of Queenland, 2001.
12) K. Polat, S. Sahan, and S. Gunes, “Automatic detection of heart disease using an artifi- cial immune
recognition system (airs) with fuzzy resource allocation mechanism and k-nn (nearest neighbor) based
weighting preprocessing,” Expert Systems with Applications, pp. 625–663, 2007.
13) M. Shouman, T. Turner, and R. Stocker, “Applying k-nearest neighbor in diagnosing heart disease
patients,” International Journal of Information and Education Technology, vol. 2, no. 3, pp. 220–223, June
2012.
14) R. Das, I. Turkoglu, and A. Sengur, “Effective diagnosis of heart disease through neural networks
ensembles,” Expert Systems with Applications, Elsevier, pp. 7675–7680, 2009.
15) M. C. Tu, D. Shin, and D. Shin, “Effective diagnosis of heart disease through bagging approach,” in
Proceeding of 2nd International Conference on Biomedical Engineering and Informatics. Seoul, South Korea:
IEEE, October 2009, pp. 1–4.