SlideShare a Scribd company logo
Power
Consumption
Prediction
Develop a robust tool for optimizing energy
usage and efficient production of power
Presented by : Narra Yugendra
Introduction
• The Power sector is most important, a necessary sector and is very well influenced
by technological advancements, changing consumer preferences, and a competitive
market.
• Power Consumption, which is the phenomenon of users consuming the provided
electricity, poses unique challenges and opportunities. When the power
consumption is altered, it can seriously affect on production and transmission of
electricity.
• Machine learning, with its predictive capabilities, offers a transformative approach
to understanding and mitigating the challenges posed by changes in power
consumption.
Power Domain And Importance
The Power Sector is a rapidly growing sector and it's not shy to present its own set of distinct
challenges and opportunities.
I chose Power Domain for my Capstone Project because :
• Efficiency of Supply: The efficient supply of power to consumers create a stable
system of operations which enables system to produce to power based on the
demand of the consumer.
• Costs matters : The cost of supply ,transmission and distribution of power to
consumers should be optimized so the providers don't experience losses .
• Chain of operation : Different production units hydro, thermal, wind are used. So as
per the demand these units should be made operational.
• Tech is always changing: New tech stuff is always popping up, especially in Power
industry. Figuring out how to use these tech innovations to help consumers is part of
the adventure.
Problem Statement &
Explanation
Electricity
Production from
Various Units
Sub Station
Zone_1
Zone_2
Zone_3
Weather conditions, and potential emission diffusion metrics :
1. Temperature
2. Humidity
3. Wind Speed
4. Diffuse flows
5. General Diffuse flows
Dataset Information
Here are the key details about the dataset used in this project:
• Number of records: Our dataset comprises a robust collection of
data, consisting of 52,416 records. Each record represents a
unique entry, contributing to the richness and depth of our analysis.
• Features/Columns: The dataset is characterized by a diverse set
of features, each providing valuable insights into climatic
conditions, flows of water, and power consumptions in various
zones. In total, there are 9 features/columns that form the basis of
our predictive modeling.
• Source of the Data: We have partnered with a leading Moroccan
renewable energy company committed to providing efficient and
sustainable energy solutions. They want to develop a robust tool
for optimizing energy usage in Agadir, a critical region for their
operations.
Columns/Features
• Datetime
• Temperature
• Humidity
• Wind Speed
• General Diffuse Flows
• Diffuse Flows
• PowerConsumption_Zone1
• PowerConsumption_Zone2
• PowerConsumption_Zone3
Exploratory Data Analysis (EDA)
• Exploring the data allowed us to gain a comprehensive overview of
the data's structure. It uncovered potential patterns, helped us
identify key trends and get essential insights from the dataset.
• Throughout the EDA process, we analyzed the distribution of
individual features, investigated correlations, and explored any
inherent relationships between variables.
• Visualizations also played a crucial role in providing a clear
representation of the data, offering insights into customer behavior
and identifying the factors that may contribute to customer churn.
Exploratory Data Analysis (EDA)
1. First, we made sure there were no Null values and Duplicates in the dataset. And luckily, there weren't
any. Our dataset was clean to begin with.
2. There are some columns that don't provide any useful information and hence they won't contribute
much to the predictions. Therefore, we will drop the following columns during Preprocessing : Datetime,
PowerConsumption_Zone1, PowerConsumption_Zone2.
3. The target variable, PowerConsumption_Zone3 exhibits Continuousness.
4. The independent variables Temperature, Humidity, Windspeed, General Diffuse Flows, Diffuse Flows
also exhibits Continuousness.
Visualizations
0
5000
10000
15000
20000
25000
30000
35000
40000
45000
50000
8.93
10.4
11.43
12.26
13.06
13.62
14.16
14.69
15.23
15.74
16.21
16.76
17.46
18.14
18.79
19.4
19.95
20.43
20.89
21.36
21.91
22.56
23.25
23.97
24.67
25.53
26.4
27.55
29.49
Power
Consumption
in
Zone3
Sorted Temperature
PowerConsumption_Zone3 Vs. Temperature
0
5000
10000
15000
20000
25000
30000
35000
40000
45000
50000
11.34
35.41
41.95
46.09
49.42
52.25
54.64
56.85
58.75
60.48
62.14
63.62
65.07
66.51
67.87
69.23
70.6
71.9
73.4
74.9
76.4
78
79.5
81.1
82.4
83.8
84.9
86
87
88.1
89.7
Power
Consumption
in
Zone3
Humidity
PowerConsumption_Zone3 VS Sorted Humidity
PowerConsumption_Zone3
0
10000
20000
30000
40000
50000
0.05
0.068
0.07
0.071
0.073
0.074
0.076
0.078
0.08
0.081
0.082
0.083
0.084
0.085
0.087
0.09
0.278
4.904
4.908
4.911
4.914
4.916
4.918
4.919
4.92
4.922
4.924
Power
Consumption
in
Zone3
Sorted Windspeed
PowerConsumption_Zone3 VS Sorted
WindSpeed
PowerConsumption_Zone3
Visualizations
0
10000
20000
30000
40000
50000
0.004
0.033
0.04
0.044
0.051
0.055
0.059
0.062
0.066
0.073
0.077
0.084
0.091
0.11
1.412
12.28
41.76
76.4
114.7
160.1
212
271.9
337.3
407.8
475.4
547.8
640.5
732
823
Power
Consumption
in
Zone3
Sorted General Diffuse Flows
PowerConsumption_Zone3 VS Sorted Genral
Diffuse Flows
PowerConsumption_Zone3
0
10000
20000
30000
40000
50000
0.011
0.078
0.089
0.096
0.104
0.111
0.115
0.122
0.126
0.133
0.141
0.148
0.159
0.182
1.253
10.88
31.56
41.98
52
62.71
75.1
90.1
105.3
127.9
154.5
187.9
228.5
286.8
391.8
Power
Consumption
in
Zone3
Sorted Diffuse Flows
PowerConsumption_Zone3 VS Sorted Diffuse
Flows
PowerConsumption_Zone3
These above Visualizations of various weather conditions, and potential emission
diffusion metrics Vs. Power Consumption in Zone 3 observes that these are
Independent variables with respect to the power consumption.
Visualizations
Upon inspecting the heatmap, we can see that there is strong positive correlation
observed among the columns PowerConsumption_Zone1, PowerConsumption_Zone2,
PowerConsumption_Zone3 . As a result, PowerConsumption_Zone1,
PowerConsumption_Zone2 will be dropped.
Preprocessing
• First, “Datetime” , “PowerConsumption_Zone1” and “PowerConsumption_Zone2”
columns were dropped as they didn’t provide any useful information for our
predictions
• we made sure there were no Null values and Duplicates in the dataset. And luckily,
there weren't any. Our dataset was clean to begin with.
Splitting the data into X and y
• Now, we partition the dataset into two components: X and y.
• The variable X encompasses all independent variables, representing the features
that contribute to our predictions.
• On the other hand, y encapsulates the dependent variable or target variable, serving
as the outcome we aim to predict.
Train-Test Split
• We then split the dataset into training data and testing data.
• We'll now split the dataset into training and testing data. We will do an 80:20
split, so our test size will be set to 0.2.
• We will take Random State as 42. This will guarantee the reproducibility of
our results across different runs.
Minmax Scaler
• We used Minmax Scaler to normalize the features of the dataset.
• This ensured that the consistency between the features of the dataset was
maintained.
• MinMax Scaler scales the data so that it is in the range of [0, 1].
Applying Machine
Learning Algorithms
This Power Consumption prediction problem we have here is a Continuous Regression problem.
Models used:
• Linear Regression : Linear regression is a quiet and the simplest statistical regression method used for predictive
analysis in machine learning. Linear regression shows the linear relationship between the independent(predictor)
variable I, and the dependent(output) variable
• Decision Tree Regression : Decision tree regression observes features of an object and trains a model in the
structure of a tree to predict data in the future to produce meaningful continuous output. In the context of Power
Consumption prediction, it observes the features of independent variables and trains the model.
• Random Forest Regression : Random forest regression is a supervised learning algorithm and bagging technique
that uses an ensemble learning method for regression in machine learning. The trees in random forests run in
parallel, meaning there is no interaction between these trees while building the trees.
• Gradient Boost Regression : Gradient boosting regression trees are based on the idea of an ensemble method
derived from a decision tree. The decision tree uses a tree structure. Starting from tree root, branching according to
the conditions and heading toward the leaves, the goal leaf is the prediction result.
Evaluation Metrics
Model r2 Score Mse Error Rmse Error
Linear 0.29 0.017910 0.133830
DTR 0.45 0.014113 0.118798
RTR 0.71 0.007249 0.085141
GBR 0.33 0.016745 0.129402
Model Selection
and Considerations
• Random Forest Regression outperforms
Linear Regression, Decision tree Regression
and Gradient Boost Regression in all metrics,
demonstrating higher r2 Score, Lower mse and
rmse error. It seems to be a promising model for
our task.
• Based on the provided metrics, Random Forest
Regression stands out as the best-performing
model overall.
• Hence, we will go with Random Forest
Regression as our final model as it is quite
evident that it predicts best for our Power
Consumption prediction model.
Conclusion
• With the help of several insights, patterns and trends in our data, we’ve used Machine Learning to
predict the power consumption in zone3.
• This project offers significant benefits to electricity providers:
• By predicting power consumption, Electricity providers can adopt proactive measures to produce
power at the required rate. This involves proper electricity production , less transmission losses
and proper supply to consumers.
• By focusing efforts on consumption at a high rate, Electricity providers can streamline operations,
reduce production costs, and improve overall efficiency.
• Understanding the factors influencing power consumption enables providers to efficiently supply
the power to meet individual needs. This level of personalization fosters stronger consumer
relationships, increases efficiency in supply of electricity without losses.
Predicting Power Consumption for a Greener Tomorrow: Machine Learning Project Presentation

More Related Content

Similar to Predicting Power Consumption for a Greener Tomorrow: Machine Learning Project Presentation

MODERN SMART GRIDS AND LEVERAGING SMART METER DATA.pptx
MODERN SMART GRIDS AND LEVERAGING SMART METER DATA.pptxMODERN SMART GRIDS AND LEVERAGING SMART METER DATA.pptx
MODERN SMART GRIDS AND LEVERAGING SMART METER DATA.pptx
Jasmeet939104
 
Optimal Capacitor Placement for IEEE 14 bus system using Genetic Algorithm
Optimal Capacitor Placement for IEEE 14 bus system using Genetic AlgorithmOptimal Capacitor Placement for IEEE 14 bus system using Genetic Algorithm
Optimal Capacitor Placement for IEEE 14 bus system using Genetic Algorithm
AM Publications
 
Voltage_Stability_Analysis_With DG NEW (1).pptx
Voltage_Stability_Analysis_With DG NEW (1).pptxVoltage_Stability_Analysis_With DG NEW (1).pptx
Voltage_Stability_Analysis_With DG NEW (1).pptx
rameshss
 

Similar to Predicting Power Consumption for a Greener Tomorrow: Machine Learning Project Presentation (20)

MODERN SMART GRIDS AND LEVERAGING SMART METER DATA.pptx
MODERN SMART GRIDS AND LEVERAGING SMART METER DATA.pptxMODERN SMART GRIDS AND LEVERAGING SMART METER DATA.pptx
MODERN SMART GRIDS AND LEVERAGING SMART METER DATA.pptx
 
Optimal Capacitor Placement for IEEE 14 bus system using Genetic Algorithm
Optimal Capacitor Placement for IEEE 14 bus system using Genetic AlgorithmOptimal Capacitor Placement for IEEE 14 bus system using Genetic Algorithm
Optimal Capacitor Placement for IEEE 14 bus system using Genetic Algorithm
 
Stand alone-pv-hybrid-systems barcelona05 excellent model
Stand alone-pv-hybrid-systems barcelona05 excellent modelStand alone-pv-hybrid-systems barcelona05 excellent model
Stand alone-pv-hybrid-systems barcelona05 excellent model
 
BDE SC3.3 Workshop - Wind Farm Monitoring and advanced analytics
 BDE SC3.3 Workshop - Wind Farm Monitoring and advanced analytics  BDE SC3.3 Workshop - Wind Farm Monitoring and advanced analytics
BDE SC3.3 Workshop - Wind Farm Monitoring and advanced analytics
 
Voltage_Stability_Analysis_With DG NEW (1).pptx
Voltage_Stability_Analysis_With DG NEW (1).pptxVoltage_Stability_Analysis_With DG NEW (1).pptx
Voltage_Stability_Analysis_With DG NEW (1).pptx
 
A Review on Energy Consumption Monitoring and Analysis System
A Review on Energy Consumption Monitoring and Analysis SystemA Review on Energy Consumption Monitoring and Analysis System
A Review on Energy Consumption Monitoring and Analysis System
 
Optimal Capacitor Placement in Distribution System using Fuzzy Techniques
Optimal Capacitor Placement in Distribution System using Fuzzy TechniquesOptimal Capacitor Placement in Distribution System using Fuzzy Techniques
Optimal Capacitor Placement in Distribution System using Fuzzy Techniques
 
aknoor presentation.pptx lt switch gear.
aknoor presentation.pptx lt switch gear.aknoor presentation.pptx lt switch gear.
aknoor presentation.pptx lt switch gear.
 
C04111114
C04111114C04111114
C04111114
 
Digital energy meter
Digital energy meter Digital energy meter
Digital energy meter
 
Low-power Innovative techniques for Wearable Computing
Low-power Innovative techniques for Wearable ComputingLow-power Innovative techniques for Wearable Computing
Low-power Innovative techniques for Wearable Computing
 
Use of Qualitative and Quantitative Data in Sectoral Energy Performance Bench...
Use of Qualitative and Quantitative Data in Sectoral Energy Performance Bench...Use of Qualitative and Quantitative Data in Sectoral Energy Performance Bench...
Use of Qualitative and Quantitative Data in Sectoral Energy Performance Bench...
 
IRJET- Smart and Efficient Load Control System using Dynamic Tariff
IRJET- Smart and Efficient Load Control System using Dynamic TariffIRJET- Smart and Efficient Load Control System using Dynamic Tariff
IRJET- Smart and Efficient Load Control System using Dynamic Tariff
 
IRJET- Voltage Stability, Loadability and Contingency Analysis with Optimal I...
IRJET- Voltage Stability, Loadability and Contingency Analysis with Optimal I...IRJET- Voltage Stability, Loadability and Contingency Analysis with Optimal I...
IRJET- Voltage Stability, Loadability and Contingency Analysis with Optimal I...
 
Roadmap for distribution loss reduction.. a step by step approach
Roadmap  for distribution loss reduction.. a step by step approachRoadmap  for distribution loss reduction.. a step by step approach
Roadmap for distribution loss reduction.. a step by step approach
 
Bg4201393395
Bg4201393395Bg4201393395
Bg4201393395
 
Module1-Power-System-operation and-control
Module1-Power-System-operation and-controlModule1-Power-System-operation and-control
Module1-Power-System-operation and-control
 
Test different neural networks models for forecasting of wind,solar and energ...
Test different neural networks models for forecasting of wind,solar and energ...Test different neural networks models for forecasting of wind,solar and energ...
Test different neural networks models for forecasting of wind,solar and energ...
 
2. power system losses evaluation
2. power system losses evaluation2. power system losses evaluation
2. power system losses evaluation
 
Demand side management
Demand side managementDemand side management
Demand side management
 

More from Boston Institute of Analytics

More from Boston Institute of Analytics (20)

Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
Solar production with K means clustering
Solar production with K means clusteringSolar production with K means clustering
Solar production with K means clustering
 
Demystifying Salaries: A Data Science Approach to Predicting Salary Ranges
Demystifying Salaries: A Data Science Approach to Predicting Salary RangesDemystifying Salaries: A Data Science Approach to Predicting Salary Ranges
Demystifying Salaries: A Data Science Approach to Predicting Salary Ranges
 
Machine Learning for Accident Severity Prediction
Machine Learning for Accident Severity PredictionMachine Learning for Accident Severity Prediction
Machine Learning for Accident Severity Prediction
 
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeCredit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
 
Sensing the Future: Anomaly Detection and Event Prediction in Sensor Networks
Sensing the Future: Anomaly Detection and Event Prediction in Sensor NetworksSensing the Future: Anomaly Detection and Event Prediction in Sensor Networks
Sensing the Future: Anomaly Detection and Event Prediction in Sensor Networks
 
Predictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesPredictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting Techniques
 
Unveiling the Market: Predicting House Prices with Data Science
Unveiling the Market: Predicting House Prices with Data ScienceUnveiling the Market: Predicting House Prices with Data Science
Unveiling the Market: Predicting House Prices with Data Science
 
Beyond Thumbs Up/Down: Using AI to Analyze Movie Reviews
Beyond Thumbs Up/Down: Using AI to Analyze Movie ReviewsBeyond Thumbs Up/Down: Using AI to Analyze Movie Reviews
Beyond Thumbs Up/Down: Using AI to Analyze Movie Reviews
 
Unveiling the Patterns: A Cluster Analysis of NYC Shootings
Unveiling the Patterns: A Cluster Analysis of NYC ShootingsUnveiling the Patterns: A Cluster Analysis of NYC Shootings
Unveiling the Patterns: A Cluster Analysis of NYC Shootings
 
Enhancing Cybersecurity: An In-depth Analysis of Travelblog.org
Enhancing Cybersecurity: An In-depth Analysis of Travelblog.orgEnhancing Cybersecurity: An In-depth Analysis of Travelblog.org
Enhancing Cybersecurity: An In-depth Analysis of Travelblog.org
 
Exploring Web Security Threats: A Practical Study on SQL Injection and CSRF
Exploring Web Security Threats: A Practical Study on SQL Injection and CSRFExploring Web Security Threats: A Practical Study on SQL Injection and CSRF
Exploring Web Security Threats: A Practical Study on SQL Injection and CSRF
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning Approach
 
Detecting Credit Card Fraud: An AI-driven Approach
Detecting Credit Card Fraud: An AI-driven ApproachDetecting Credit Card Fraud: An AI-driven Approach
Detecting Credit Card Fraud: An AI-driven Approach
 
Predicting House Prices: A Machine Learning Approach
Predicting House Prices: A Machine Learning ApproachPredicting House Prices: A Machine Learning Approach
Predicting House Prices: A Machine Learning Approach
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Decoding Loan Approval with Predictive Modeling in Action Discovering Weaknes...
Decoding Loan Approval with Predictive Modeling in Action Discovering Weaknes...Decoding Loan Approval with Predictive Modeling in Action Discovering Weaknes...
Decoding Loan Approval with Predictive Modeling in Action Discovering Weaknes...
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
NLP Based project presentation: Analyzing Automobile Prices
NLP Based project presentation: Analyzing Automobile PricesNLP Based project presentation: Analyzing Automobile Prices
NLP Based project presentation: Analyzing Automobile Prices
 

Recently uploaded

一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
MAQIB18
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 

Recently uploaded (20)

一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDB
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 

Predicting Power Consumption for a Greener Tomorrow: Machine Learning Project Presentation

  • 1.
  • 2. Power Consumption Prediction Develop a robust tool for optimizing energy usage and efficient production of power Presented by : Narra Yugendra
  • 3. Introduction • The Power sector is most important, a necessary sector and is very well influenced by technological advancements, changing consumer preferences, and a competitive market. • Power Consumption, which is the phenomenon of users consuming the provided electricity, poses unique challenges and opportunities. When the power consumption is altered, it can seriously affect on production and transmission of electricity. • Machine learning, with its predictive capabilities, offers a transformative approach to understanding and mitigating the challenges posed by changes in power consumption.
  • 4. Power Domain And Importance The Power Sector is a rapidly growing sector and it's not shy to present its own set of distinct challenges and opportunities. I chose Power Domain for my Capstone Project because : • Efficiency of Supply: The efficient supply of power to consumers create a stable system of operations which enables system to produce to power based on the demand of the consumer. • Costs matters : The cost of supply ,transmission and distribution of power to consumers should be optimized so the providers don't experience losses . • Chain of operation : Different production units hydro, thermal, wind are used. So as per the demand these units should be made operational. • Tech is always changing: New tech stuff is always popping up, especially in Power industry. Figuring out how to use these tech innovations to help consumers is part of the adventure.
  • 5. Problem Statement & Explanation Electricity Production from Various Units Sub Station Zone_1 Zone_2 Zone_3 Weather conditions, and potential emission diffusion metrics : 1. Temperature 2. Humidity 3. Wind Speed 4. Diffuse flows 5. General Diffuse flows
  • 6. Dataset Information Here are the key details about the dataset used in this project: • Number of records: Our dataset comprises a robust collection of data, consisting of 52,416 records. Each record represents a unique entry, contributing to the richness and depth of our analysis. • Features/Columns: The dataset is characterized by a diverse set of features, each providing valuable insights into climatic conditions, flows of water, and power consumptions in various zones. In total, there are 9 features/columns that form the basis of our predictive modeling. • Source of the Data: We have partnered with a leading Moroccan renewable energy company committed to providing efficient and sustainable energy solutions. They want to develop a robust tool for optimizing energy usage in Agadir, a critical region for their operations. Columns/Features • Datetime • Temperature • Humidity • Wind Speed • General Diffuse Flows • Diffuse Flows • PowerConsumption_Zone1 • PowerConsumption_Zone2 • PowerConsumption_Zone3
  • 7. Exploratory Data Analysis (EDA) • Exploring the data allowed us to gain a comprehensive overview of the data's structure. It uncovered potential patterns, helped us identify key trends and get essential insights from the dataset. • Throughout the EDA process, we analyzed the distribution of individual features, investigated correlations, and explored any inherent relationships between variables. • Visualizations also played a crucial role in providing a clear representation of the data, offering insights into customer behavior and identifying the factors that may contribute to customer churn.
  • 8. Exploratory Data Analysis (EDA) 1. First, we made sure there were no Null values and Duplicates in the dataset. And luckily, there weren't any. Our dataset was clean to begin with. 2. There are some columns that don't provide any useful information and hence they won't contribute much to the predictions. Therefore, we will drop the following columns during Preprocessing : Datetime, PowerConsumption_Zone1, PowerConsumption_Zone2. 3. The target variable, PowerConsumption_Zone3 exhibits Continuousness. 4. The independent variables Temperature, Humidity, Windspeed, General Diffuse Flows, Diffuse Flows also exhibits Continuousness.
  • 9. Visualizations 0 5000 10000 15000 20000 25000 30000 35000 40000 45000 50000 8.93 10.4 11.43 12.26 13.06 13.62 14.16 14.69 15.23 15.74 16.21 16.76 17.46 18.14 18.79 19.4 19.95 20.43 20.89 21.36 21.91 22.56 23.25 23.97 24.67 25.53 26.4 27.55 29.49 Power Consumption in Zone3 Sorted Temperature PowerConsumption_Zone3 Vs. Temperature 0 5000 10000 15000 20000 25000 30000 35000 40000 45000 50000 11.34 35.41 41.95 46.09 49.42 52.25 54.64 56.85 58.75 60.48 62.14 63.62 65.07 66.51 67.87 69.23 70.6 71.9 73.4 74.9 76.4 78 79.5 81.1 82.4 83.8 84.9 86 87 88.1 89.7 Power Consumption in Zone3 Humidity PowerConsumption_Zone3 VS Sorted Humidity PowerConsumption_Zone3 0 10000 20000 30000 40000 50000 0.05 0.068 0.07 0.071 0.073 0.074 0.076 0.078 0.08 0.081 0.082 0.083 0.084 0.085 0.087 0.09 0.278 4.904 4.908 4.911 4.914 4.916 4.918 4.919 4.92 4.922 4.924 Power Consumption in Zone3 Sorted Windspeed PowerConsumption_Zone3 VS Sorted WindSpeed PowerConsumption_Zone3
  • 10. Visualizations 0 10000 20000 30000 40000 50000 0.004 0.033 0.04 0.044 0.051 0.055 0.059 0.062 0.066 0.073 0.077 0.084 0.091 0.11 1.412 12.28 41.76 76.4 114.7 160.1 212 271.9 337.3 407.8 475.4 547.8 640.5 732 823 Power Consumption in Zone3 Sorted General Diffuse Flows PowerConsumption_Zone3 VS Sorted Genral Diffuse Flows PowerConsumption_Zone3 0 10000 20000 30000 40000 50000 0.011 0.078 0.089 0.096 0.104 0.111 0.115 0.122 0.126 0.133 0.141 0.148 0.159 0.182 1.253 10.88 31.56 41.98 52 62.71 75.1 90.1 105.3 127.9 154.5 187.9 228.5 286.8 391.8 Power Consumption in Zone3 Sorted Diffuse Flows PowerConsumption_Zone3 VS Sorted Diffuse Flows PowerConsumption_Zone3 These above Visualizations of various weather conditions, and potential emission diffusion metrics Vs. Power Consumption in Zone 3 observes that these are Independent variables with respect to the power consumption.
  • 11. Visualizations Upon inspecting the heatmap, we can see that there is strong positive correlation observed among the columns PowerConsumption_Zone1, PowerConsumption_Zone2, PowerConsumption_Zone3 . As a result, PowerConsumption_Zone1, PowerConsumption_Zone2 will be dropped.
  • 12. Preprocessing • First, “Datetime” , “PowerConsumption_Zone1” and “PowerConsumption_Zone2” columns were dropped as they didn’t provide any useful information for our predictions • we made sure there were no Null values and Duplicates in the dataset. And luckily, there weren't any. Our dataset was clean to begin with. Splitting the data into X and y • Now, we partition the dataset into two components: X and y. • The variable X encompasses all independent variables, representing the features that contribute to our predictions. • On the other hand, y encapsulates the dependent variable or target variable, serving as the outcome we aim to predict.
  • 13. Train-Test Split • We then split the dataset into training data and testing data. • We'll now split the dataset into training and testing data. We will do an 80:20 split, so our test size will be set to 0.2. • We will take Random State as 42. This will guarantee the reproducibility of our results across different runs. Minmax Scaler • We used Minmax Scaler to normalize the features of the dataset. • This ensured that the consistency between the features of the dataset was maintained. • MinMax Scaler scales the data so that it is in the range of [0, 1].
  • 14. Applying Machine Learning Algorithms This Power Consumption prediction problem we have here is a Continuous Regression problem. Models used: • Linear Regression : Linear regression is a quiet and the simplest statistical regression method used for predictive analysis in machine learning. Linear regression shows the linear relationship between the independent(predictor) variable I, and the dependent(output) variable • Decision Tree Regression : Decision tree regression observes features of an object and trains a model in the structure of a tree to predict data in the future to produce meaningful continuous output. In the context of Power Consumption prediction, it observes the features of independent variables and trains the model. • Random Forest Regression : Random forest regression is a supervised learning algorithm and bagging technique that uses an ensemble learning method for regression in machine learning. The trees in random forests run in parallel, meaning there is no interaction between these trees while building the trees. • Gradient Boost Regression : Gradient boosting regression trees are based on the idea of an ensemble method derived from a decision tree. The decision tree uses a tree structure. Starting from tree root, branching according to the conditions and heading toward the leaves, the goal leaf is the prediction result.
  • 15. Evaluation Metrics Model r2 Score Mse Error Rmse Error Linear 0.29 0.017910 0.133830 DTR 0.45 0.014113 0.118798 RTR 0.71 0.007249 0.085141 GBR 0.33 0.016745 0.129402
  • 16. Model Selection and Considerations • Random Forest Regression outperforms Linear Regression, Decision tree Regression and Gradient Boost Regression in all metrics, demonstrating higher r2 Score, Lower mse and rmse error. It seems to be a promising model for our task. • Based on the provided metrics, Random Forest Regression stands out as the best-performing model overall. • Hence, we will go with Random Forest Regression as our final model as it is quite evident that it predicts best for our Power Consumption prediction model.
  • 17. Conclusion • With the help of several insights, patterns and trends in our data, we’ve used Machine Learning to predict the power consumption in zone3. • This project offers significant benefits to electricity providers: • By predicting power consumption, Electricity providers can adopt proactive measures to produce power at the required rate. This involves proper electricity production , less transmission losses and proper supply to consumers. • By focusing efforts on consumption at a high rate, Electricity providers can streamline operations, reduce production costs, and improve overall efficiency. • Understanding the factors influencing power consumption enables providers to efficiently supply the power to meet individual needs. This level of personalization fosters stronger consumer relationships, increases efficiency in supply of electricity without losses.