Hello Folks, Anupama here, Presenting on behalf of my team for our internship project - Patients Condition Classification Using Drug Reviews. for that, we use python and machine learning algorithms and models.
2. Contents
• Business Objective
• Project Architecture
• Data Collection & Details
• Exploratory Data Analysis
• Visualizations
• Sentimental Analysis
• Model Building
• Model Evaluation
• Depolyment
3. Business Objective:
• This is a sample dataset which consists of 161297 drug name, condition
reviews and ratings from different patients and our goal is to examine how
patients are feeling using the drugs their positive and negative experiences
so that we can recommend him a suitable drug. By analysing the reviews,
we can understand the drug effectiveness and its side effects.
• The dataset provides patient reviews on specific drugs along with related
conditions and a 10 star patient rating reflecting overall patient
satisfaction. So in this dataset, we can see many patients conditions but we
will focus only on the below, classify the below conditions from the
patients reviews
a) Depression
b) High Blood Pressure
c) Diabetes, Type 2
4. Project Architecture / Project Flow
Business
Understanding
Data
Collection
Data
Preparation
Exploratory
Data Analysis
Model
Evaluation
Model
Deployment
6. Data Collection & Details
• 161297 Rows and 7 Columns
Attribute Information:
• 1. DrugName (categorical): name of drug
• 2. condition (categorical): name of condition
• 3. review (text): patient review
• 4. rating (numerical): 10 star patient rating
• 5. date (date): date of review entry
• 6. usefulCount (numerical): number of users who found review
useful
39. How Challenges Overcome
• As we are not much aware of the Medicines we have done a lot of
research to find out the drugs review history.
• We as a team worked so hard to get the knowledge on the previous
year gold price data.
• Which helps us to do the project more effectively.
40. Deciding Model Building Technique
• As we tried many model building techniques every model don’t have
such a significant difference in the output
• We are little bit worried about the output results that we got.
• But we again overcame this as a team, Everyone has worked really
hard on this part and we finally build a model that best suits the data