In this project, we use a statistical multiple regression to study the impact of eight various predictors (relative compactness, surface area, wall area, roof area, overall height, orientation, glazing area, glazing area distribution) to estimate the cooling load energy efficiency of residential buildings. We try to analyze and visualize the effect of each predictor with each of the response variable using different classical statistical analysis tools used in describing linear models, in such a way so that we can find out the most strongly related predictor variables. Before starting all of this, we used the idea of model selection by stepwise regression technique and compare the AIC of these models and identified a better model between all of them. Then, we compare a classical linear regression approach by simulations on 768 diverse residential buildings show that we can predict CL with low mean absolute error. By using ANOVA we determined variation in the different residuals. Also, we used non constant variance test to verify it. Furthermore, we check leverage and influence points as well as outliers as well as determined cook distance for influential points. By taking box cox transformation and weights, we also introduced WLS technique to fit the model for better results and did all type of important analysis to understand the energy efficiency. Finally, we should 5-fold cross validation to verify our model.
Source of used data:
The dataset was created by Angeliki Xifara (angxifara '@' gmail.com, Civil/Structural Engineer) and was processed by Athanasios Tsanas (tsanasthanasis '@' gmail.com, Oxford Centre for Industrial and Applied Mathematics, University of Oxford, UK).
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
Numerical study of natural convection in an enclosed square cavity using cons...eSAT Journals
Abstract In the present study, Constrained Interpolated Profile (CIP) method was used to simulate the natural convection heat transfer and fluid flow in an enclosed square cavity with differentially heated side walls. The fundamental idea of this method is to solve the advection phase equation with CIP method and the non-advection phase equation is calculated with finite difference method. CIPNSE is applied to predict the temperature and velocity profiles in a square cavity for various Rayleigh number: Ra=103, 104 and 105. The streamline and isotherms obtained under these conditions were then compared with those published in literature and found a good agreement. Keywords: Constrained Interpolated Profile (CIP), Finite Difference Method (FDM) and Lattice Boltzmann Method (LBM), Natural Convection, Square Cavity, Stream-Function Vorticity
Numerical simulation of marangoni driven boundary layer flow over a flat plat...eSAT Journals
Abstract
A numerical algorithm is presented for studying Marangoni convection flow over a flat plate with an imposed temperature
distribution. Plate temperature varies with x in the following prescribed manner: where A and k are constants.
By means of similarity transformation, the original nonlinear partial differential equations of flow are transformed to a pair of
nonlinear ordinary differential equations. Subsequently they are reduced to a first order system and integrated using Newton
Raphson and adaptive Runge-Kutta methods. The computer codes are developed for this numerical analysis in Matlab
environment. Velocity profiles for various values of k, and temperature profiles for various Prandtl number and k are illustrated
graphically. The results of the present simulation are then compared with the previous works available in literature with good
agreement.
Keywords: Matlab, Marangoni Convection, Numerical Simulation, Surface Tension, Flat Plate.
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
Numerical study of natural convection in an enclosed square cavity using cons...eSAT Journals
Abstract In the present study, Constrained Interpolated Profile (CIP) method was used to simulate the natural convection heat transfer and fluid flow in an enclosed square cavity with differentially heated side walls. The fundamental idea of this method is to solve the advection phase equation with CIP method and the non-advection phase equation is calculated with finite difference method. CIPNSE is applied to predict the temperature and velocity profiles in a square cavity for various Rayleigh number: Ra=103, 104 and 105. The streamline and isotherms obtained under these conditions were then compared with those published in literature and found a good agreement. Keywords: Constrained Interpolated Profile (CIP), Finite Difference Method (FDM) and Lattice Boltzmann Method (LBM), Natural Convection, Square Cavity, Stream-Function Vorticity
Numerical simulation of marangoni driven boundary layer flow over a flat plat...eSAT Journals
Abstract
A numerical algorithm is presented for studying Marangoni convection flow over a flat plate with an imposed temperature
distribution. Plate temperature varies with x in the following prescribed manner: where A and k are constants.
By means of similarity transformation, the original nonlinear partial differential equations of flow are transformed to a pair of
nonlinear ordinary differential equations. Subsequently they are reduced to a first order system and integrated using Newton
Raphson and adaptive Runge-Kutta methods. The computer codes are developed for this numerical analysis in Matlab
environment. Velocity profiles for various values of k, and temperature profiles for various Prandtl number and k are illustrated
graphically. The results of the present simulation are then compared with the previous works available in literature with good
agreement.
Keywords: Matlab, Marangoni Convection, Numerical Simulation, Surface Tension, Flat Plate.
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
Recent developments in the field of reduced order modeling - and in particular, active subspace construction - have made it possible to efficiently approximate complex models by constructing low-order response surfaces based upon a small subspace of the original high dimensional parameter space. These methods rely upon the fact that the response tends to vary more prominently in a few dominant directions defined by linear combinations of the original inputs, allowing for a rotation of the coordinate axis and a consequent transformation of the parameters. In this talk, we discuss a gradient free active subspace algorithm that is feasible for high dimensional parameter spaces where finite-difference techniques are impractical. We illustrate an initialized gradient-free active subspace algorithm for a neutronics example implemented with SCALE6.1.
A New Two-Dimensional Analytical Model of Small Geometry GaAs MESFETIJMERJOURNAL
ABSTRACT : In this paper, a simple and exact analytical model for Small Geometry GaAs MESFET is developed to determine the potential distribution along the channel of the device. The model is based on the exact solution of two-dimensional Poisson’s equation in the depletion region under the gate. Then the obtained model is used to study the channel potential and threshold voltage of the device. Using the analytical model, the effect of the device parameter and bias conditions on performance of the device is investigated. The obtained results are graphically exhibited and discussed. In order to verification of the analytical results, TCAD device simulator is used and good accordance is observed.
Exploiting Hierarchy in the Abstraction-Based Verification of Statecharts Usi...Akos Hajdu
Presentation of our paper at the 14th International Workshop on Formal Engineering approaches to Software Components and Architectures (FESCA 2017). Uppsala, Sweden
Shortcut Design Method for Multistage Binary Distillation via MS-ExceIJERA Editor
Multistage distillation is most widely used industrial method for separating chemical mixtures with high energy consumptions especially when relative volatility of key components is lower than 1.5. The McCabe Thiele is considered to be the simplest and perhaps most instructive method for the conceptual design of binary distillation column which is still widely used, mainly for quick preliminary calculations. In this present work, we provide a numerical solution to a McCabe-Thiele method to find out theoretical number of stages for ideal and non-ideal binary system, reflux ratio, condenser duty, reboiler duty, each plate composition inside the column. Each and every point related to McCabe-Thiele in MS-Excel to give quick column dimensions are discussed in details
A Visual Guide to Design of Experiments using Quantum XLRamon Balisnomo
An introductory course on developing transfer functions through Design of Experiment (DOE) using the statistical software Quantum XL (QXL). The presentation's purpose is to: (1) give practical advice on the trade-off between the required number of experiments and the accuracy of the transfer function; (2) showcase the user interface for Quantum XL, which integrates DOE and Monte Carlo seamlessly in one package.
Selection and Validation of Mathematical Models of Power Converters using Rap...IJECEIAES
This paper presents a methodology based on two interrelated rapid prototyping pro- cesses in order to find the best correspondence between theoretical, simulated, and experimental results of a power converter controlled by a digital PWM. The method supplements rapid control prototyping (RCP) with effective math tools to quickly select and validate models of a controlled system. We show stability analysis of the classical and two modified buck converter models controlled by zero average dynamics (ZAD) and fixed-point induction control (FPIC). The methodology consists of obtaining the mathematical representation of power converters with the controllers and the Lyapunov Exponents (LEs). Besides, the theoretical results are compared with the simulated and experimental results by means of one- and two-parameter bifurcation diagrams. The responses of the three models are compared by changing the parameter (K ) of the ZAD and the parameter (N) of the FPIC. The results show that the stability zones, periodic orbits, periodic bands, and chaos are obtained for the three models, finding more similarities between theoretical, simulated, and experimental tests with the third model of the buck converter with ZAD and FPIC as it considers more parameters related to the losses in different elements of the system. Additionally, the intervals of the chaos are obtained by using the LEs and validated by numerical and experimental tests. s
IJERA (International journal of Engineering Research and Applications) is International online, ... peer reviewed journal. For more detail or submit your article, please visit www.ijera.com
IJERA (International journal of Engineering Research and Applications) is International online, ... peer reviewed journal. For more detail or submit your article, please visit www.ijera.com
Combinational logic circuit timing analysis is an important issue that all designers need to
address. The present paper presents a simple and compact analysis procedure. We follow the
guidelines drawn by previous methods, but we shall define new time-dependent logic variables
that help us improve their efficiency. By using the methodology suggested, we shall replace a
very laborious technique (pure delay circuit + time constants method) with a simpler procedure
that can pinpoint the specific conditions for a logic circuit’s anomalous behaviour within a few
simple steps. Considering the logic function implemented the methodology presented will
require analysis of only a limited number of situations/combinations to determine the presence
of an anomalous behaviour. When anomalous behaviour is identified, the methodology provides
a clear timing description
Conjugate Gradient for Normal Equations and PreconditioningFahad B. Mostafa
Between many ideas of solving linear systems, each technique has its own merits and demerits. To handle large linear system, it is particularly important to use a convenient method for reducing convergence time and obtaining better outputs. In this project we will use Steepest Decent (SD) method and Conjugate Gradient (CG) method for solving linear system with high dimensions. There are many other techniques to solve such systems, for example, simple Gaussian elimination or Cholesky Factorization, however these methods are unsuitable to handle large systems. On the other hand, SD and CG methods are quite familiar to solve sparse system of linear equations. Moreover, from many existed technique, normal equation is one of the convenient ways to solve big non-invertible system. However, this technique is more ill conditioned, but it plays a good role for some specific methods. We introduce preconditioning of the system so that condition number is close to one. For optimizing convex quadratic functions, CG can be used for optimal results with fast convergence. Then, we apply preconditioned Conjugate Gradient method with normal equation on the modified system. In this study, we compared three different methods (LS with SD, GMRES with Arnoldi’s iteration and PCGNE). PCGNE is the main aim to show in this project. We basically use many plots and tables to show the better method. Finally, we use a block preconditioning technique known as Block Conjugate Gradient algorithms for least squares for some better and faster convergence.
Statistical multivariate analysis to infer the presence breast cancerFahad B. Mostafa
The primary aim of this multivariate analysis is to show statistical significance of many statistical technique to analysis multivariate data. To do this we start with exploratory study to develop and assess a prediction model which can potentially be used as a biomarker of breast cancer, based on anthropometric data and parameters which can be gathered in routine blood analysis of 116 women. To conduct this process, we will plot the sample data and show the type of distribution it follows. Main aim of this research is to reduce dimensionality using eigen decomposition of data matrix. To perform it we use the most useful PCA method. Finally, we want to find some hypothesis tests for finding the normality assumption, equal mean and covariance test, as well as simultaneous confidence interval for our data sets. Moreover, to predict breast cancer we used logistic regression model as well as confusion matrix to show how confuse our model.
More Related Content
Similar to Multiple linear regression for energy efficacy of residential buildings
Recent developments in the field of reduced order modeling - and in particular, active subspace construction - have made it possible to efficiently approximate complex models by constructing low-order response surfaces based upon a small subspace of the original high dimensional parameter space. These methods rely upon the fact that the response tends to vary more prominently in a few dominant directions defined by linear combinations of the original inputs, allowing for a rotation of the coordinate axis and a consequent transformation of the parameters. In this talk, we discuss a gradient free active subspace algorithm that is feasible for high dimensional parameter spaces where finite-difference techniques are impractical. We illustrate an initialized gradient-free active subspace algorithm for a neutronics example implemented with SCALE6.1.
A New Two-Dimensional Analytical Model of Small Geometry GaAs MESFETIJMERJOURNAL
ABSTRACT : In this paper, a simple and exact analytical model for Small Geometry GaAs MESFET is developed to determine the potential distribution along the channel of the device. The model is based on the exact solution of two-dimensional Poisson’s equation in the depletion region under the gate. Then the obtained model is used to study the channel potential and threshold voltage of the device. Using the analytical model, the effect of the device parameter and bias conditions on performance of the device is investigated. The obtained results are graphically exhibited and discussed. In order to verification of the analytical results, TCAD device simulator is used and good accordance is observed.
Exploiting Hierarchy in the Abstraction-Based Verification of Statecharts Usi...Akos Hajdu
Presentation of our paper at the 14th International Workshop on Formal Engineering approaches to Software Components and Architectures (FESCA 2017). Uppsala, Sweden
Shortcut Design Method for Multistage Binary Distillation via MS-ExceIJERA Editor
Multistage distillation is most widely used industrial method for separating chemical mixtures with high energy consumptions especially when relative volatility of key components is lower than 1.5. The McCabe Thiele is considered to be the simplest and perhaps most instructive method for the conceptual design of binary distillation column which is still widely used, mainly for quick preliminary calculations. In this present work, we provide a numerical solution to a McCabe-Thiele method to find out theoretical number of stages for ideal and non-ideal binary system, reflux ratio, condenser duty, reboiler duty, each plate composition inside the column. Each and every point related to McCabe-Thiele in MS-Excel to give quick column dimensions are discussed in details
A Visual Guide to Design of Experiments using Quantum XLRamon Balisnomo
An introductory course on developing transfer functions through Design of Experiment (DOE) using the statistical software Quantum XL (QXL). The presentation's purpose is to: (1) give practical advice on the trade-off between the required number of experiments and the accuracy of the transfer function; (2) showcase the user interface for Quantum XL, which integrates DOE and Monte Carlo seamlessly in one package.
Selection and Validation of Mathematical Models of Power Converters using Rap...IJECEIAES
This paper presents a methodology based on two interrelated rapid prototyping pro- cesses in order to find the best correspondence between theoretical, simulated, and experimental results of a power converter controlled by a digital PWM. The method supplements rapid control prototyping (RCP) with effective math tools to quickly select and validate models of a controlled system. We show stability analysis of the classical and two modified buck converter models controlled by zero average dynamics (ZAD) and fixed-point induction control (FPIC). The methodology consists of obtaining the mathematical representation of power converters with the controllers and the Lyapunov Exponents (LEs). Besides, the theoretical results are compared with the simulated and experimental results by means of one- and two-parameter bifurcation diagrams. The responses of the three models are compared by changing the parameter (K ) of the ZAD and the parameter (N) of the FPIC. The results show that the stability zones, periodic orbits, periodic bands, and chaos are obtained for the three models, finding more similarities between theoretical, simulated, and experimental tests with the third model of the buck converter with ZAD and FPIC as it considers more parameters related to the losses in different elements of the system. Additionally, the intervals of the chaos are obtained by using the LEs and validated by numerical and experimental tests. s
IJERA (International journal of Engineering Research and Applications) is International online, ... peer reviewed journal. For more detail or submit your article, please visit www.ijera.com
IJERA (International journal of Engineering Research and Applications) is International online, ... peer reviewed journal. For more detail or submit your article, please visit www.ijera.com
Combinational logic circuit timing analysis is an important issue that all designers need to
address. The present paper presents a simple and compact analysis procedure. We follow the
guidelines drawn by previous methods, but we shall define new time-dependent logic variables
that help us improve their efficiency. By using the methodology suggested, we shall replace a
very laborious technique (pure delay circuit + time constants method) with a simpler procedure
that can pinpoint the specific conditions for a logic circuit’s anomalous behaviour within a few
simple steps. Considering the logic function implemented the methodology presented will
require analysis of only a limited number of situations/combinations to determine the presence
of an anomalous behaviour. When anomalous behaviour is identified, the methodology provides
a clear timing description
Similar to Multiple linear regression for energy efficacy of residential buildings (20)
Conjugate Gradient for Normal Equations and PreconditioningFahad B. Mostafa
Between many ideas of solving linear systems, each technique has its own merits and demerits. To handle large linear system, it is particularly important to use a convenient method for reducing convergence time and obtaining better outputs. In this project we will use Steepest Decent (SD) method and Conjugate Gradient (CG) method for solving linear system with high dimensions. There are many other techniques to solve such systems, for example, simple Gaussian elimination or Cholesky Factorization, however these methods are unsuitable to handle large systems. On the other hand, SD and CG methods are quite familiar to solve sparse system of linear equations. Moreover, from many existed technique, normal equation is one of the convenient ways to solve big non-invertible system. However, this technique is more ill conditioned, but it plays a good role for some specific methods. We introduce preconditioning of the system so that condition number is close to one. For optimizing convex quadratic functions, CG can be used for optimal results with fast convergence. Then, we apply preconditioned Conjugate Gradient method with normal equation on the modified system. In this study, we compared three different methods (LS with SD, GMRES with Arnoldi’s iteration and PCGNE). PCGNE is the main aim to show in this project. We basically use many plots and tables to show the better method. Finally, we use a block preconditioning technique known as Block Conjugate Gradient algorithms for least squares for some better and faster convergence.
Statistical multivariate analysis to infer the presence breast cancerFahad B. Mostafa
The primary aim of this multivariate analysis is to show statistical significance of many statistical technique to analysis multivariate data. To do this we start with exploratory study to develop and assess a prediction model which can potentially be used as a biomarker of breast cancer, based on anthropometric data and parameters which can be gathered in routine blood analysis of 116 women. To conduct this process, we will plot the sample data and show the type of distribution it follows. Main aim of this research is to reduce dimensionality using eigen decomposition of data matrix. To perform it we use the most useful PCA method. Finally, we want to find some hypothesis tests for finding the normality assumption, equal mean and covariance test, as well as simultaneous confidence interval for our data sets. Moreover, to predict breast cancer we used logistic regression model as well as confusion matrix to show how confuse our model.
This project work is concerned with the study of Runge-Kutta method of higher order and to apply in solving initial and boundary value problems for ordinary as well as partial differential equations. The derivation of fourth order and sixth order Runge-Kutta method have been done firstly. After that, Fortran 90/95 code has been written for particular problems. Numerical results have been obtained for various problems. The main focus has been given on sixth order Runge-Kutta method. Exact and approximate results have been obtained and shown in tubular and graphical form.
This project work is concerned with the study of Runge-Kutta method of higher order and to apply in solving initial and boundary value problems for ordinary as well as partial differential equations. The derivation of fourth order and sixth order Runge-Kutta method have been done firstly. After that, Fortran 90/95 code has been written for particular problems. Numerical results have been obtained for various problems. The main focus has been given on sixth order Runge-Kutta method. Exact and approximate results have been obtained and shown in tubular and graphical form
Finite Difference method to solve Combined effects of viscous dissipation, radiation and heat generation on unsteady non-Newtonian fluid along a vertically stretched surface
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
Multiple linear regression for energy efficacy of residential buildings
1. FAHAD BIN MOSTAFA
TEXAS TECH UNIVERSITY
MAY 05, 2020
Texas Tech University
Multiple Linear Regression for
Cooling load efficiency of residential buildings
DEPARTMENT OF MATHEMATICS AND STATISTICS, TEXAS TECH UNIVERSITY 1
3. Background
Primary aim of this regression analysis is to show statistical
significance of many statistical technique to analysis cooling load of
buildings.
Perform energy analysis using 12 different building shapes simulated
in Ecotect. The buildings differ with respect to the glazing area, the
glazing area distribution, and the orientation, amongst other parameters.
Simulate various settings as functions of the afore-mentioned
characteristics to obtain 768 building shapes.
Dataset comprises 768 samples and 8 features, aiming to predict two
real valued responses. However, we only work with one response.
DEPARTMENT OF MATHEMATICS AND STATISTICS, TEXAS TECH UNIVERSITY 3
4. DEPARTMENT OF MATHEMATICS AND STATISTICS, TEXAS TECH UNIVERSITY 4
Variables Descriptions
𝑋1 Relative compactness
𝑋2 Surface area
𝑋3 Wall area
𝑋4 Roof area
𝑋5 Overall Hight
𝑋6 Orientation
𝑋7 Glazing area
𝑋8 Glazing area distribution
𝑌𝐶𝐿 Cooling Load
Nomenclature of predictors and response
5. Methodology
DEPARTMENT OF MATHEMATICS AND STATISTICS, TEXAS TECH UNIVERSITY 5
The multiple linear regression model is
𝑌𝑖~ 𝑁 β 0+ X 𝑖1β 1+ . . . + X 𝑖𝑝β 𝑝 , 𝜎2
1
Using Matrix Form, β 𝑜𝑙𝑠 = (𝑋 𝑇 𝑋)−1 𝑋 𝑇 𝑦
Hat matrix, 𝐻 = 𝑋(𝑋 𝑇 𝑋)−1 𝑋 𝑇
𝐻 involves the weights ℎ𝑖𝑖; 𝑖 = 1 … 𝑛 depends on predictors.
Cook distance, 𝐷𝑖 =
𝑒𝑖
2
𝑝𝑠2 [
ℎ𝑖𝑖
1−ℎ𝑖𝑖
2]
Box Cox Transformation, gλ y =
𝑦λ−1
λ
; ; λ ≠ 0
log λ ; λ = 0
(2)
The weighted least squares estimate, β 𝑊𝐿𝑆 = (𝑋 𝑇
𝑊𝑋)−1
𝑋 𝑇
𝑊𝑦
K-fold cross validation: Comparing RMSE with model RMSE
8. Based on AIC why not adj-R-square!!!
DEPARTMENT OF MATHEMATICS AND STATISTICS, TEXAS TECH UNIVERSITY 8
Table 03: summary statistics of the model 3
Coefficients Estimates Std. Error Pr(> |t|)
Intercept 97.336561 20.754252 3.24e-06
Relative compactness -70.787707 11.219992 4.76e-10
Surface area -0.088245 0.018620 < 2e-16
Overall height 4.283843 0.368557 1.15e-09
Wall area 0.044682 0.007249 1.15e-09
Orientation 0.121510 0.103269 0.24
Glazing area 14.817971 0.867239 < 2e-16
9. Result for Model 2
DEPARTMENT OF MATHEMATICS AND STATISTICS, TEXAS TECH UNIVERSITY 9
Table 05: Analysis of Variance Table (ANOVA)
Source DF Sum of Square Mean Square F value Pr(>F)
Relative compactness 1 27931.9 27931.9 2726.885 < 2.2e−16
Surface area 1 8254.2 8254.2 805.823 < 2.2e−16
Overall height 1 22046.5 22046.5 2152.309 < 2.2e−16
Wall area 1 389.0 389.0 37.973 1.162e−10
Glazing area 1 2988.9 2988.9 291.797 < 2.2e−16
Residuals 762 7805.3 10.2
Total 767
For this multiple linear regression, we have
𝐻0: 𝛽1 = 𝛽2 = 𝛽3 = 𝛽4 = 𝛽5 = 0
𝐻1: 𝐴𝑡 𝑙𝑒𝑎𝑠𝑡 𝑜𝑛𝑒 𝛽 𝑖𝑠 𝑛𝑜𝑡 𝑧𝑒𝑟𝑜
The null hypothesis claims that there is no significant correlation at all. That is, all of the
coefficients are zero and none of the variables belong in the model.
Test of heteroscedasticity for the model can be done by the following test
𝐻0: 𝜎2′
𝑠 𝑎𝑟𝑒 𝑒𝑞𝑢𝑎𝑙
Chi-square = 187.5252, Df = 1, p = 2.22 𝑒−16
. At
5% level of significance we can say that we do have
sufficient evidence to reject null hypothesis. So,
variances are not equal. So, it has a problem with
heteroscedasticity.
10. Result for Model 2
DEPARTMENT OF MATHEMATICS AND STATISTICS, TEXAS TECH UNIVERSITY 10
11. Exploring Cooling Load
DEPARTMENT OF MATHEMATICS AND STATISTICS, TEXAS TECH UNIVERSITY 11
λ=-0.8 and is extremely close to the maximum, which suggests a transformation of the form
𝐶𝑜𝑜𝑙𝑖𝑛𝑔_𝐿𝑜𝑎𝑑λ
− 1
λ
=
𝐶𝑜𝑜𝑙𝑖𝑛𝑔_𝐿𝑜𝑎𝑑−0.8
− 1
−0.8
12. Result for WLS model
DEPARTMENT OF MATHEMATICS AND STATISTICS, TEXAS TECH UNIVERSITY 12
Table 06: summary for WLS with box cox transformed response
Coefficients Estimates Std. Error Pr(>|t|)
Intercept 9.424e−01
6.312e−02
< 2e−16
Relative compactness 1.902e−02
3.416e−02
0.5777
Surface area 7.122e−05
5.674e−05 0.20977
Overall height 1.878e−02
1.13e−03
< 2e−16
Wall area 7.904e−05
2.236e−05 1.16e−09
Glazing area 5.863e−02
2.638e−03
< 2e−16
In the WLS model the Residual standard
error: 0.009118 on 762 degrees of freedom,
Multiple R-squared: 0.9137, Adjusted R-
squared: 0.9131.
F-statistic: 1613 on 5 and 762 DF, p −
value: < 2.2e−16
Table 05: Analysis of Variance Table (ANOVA) for WLS model
Source DF Sum of Square Mean Square F value Pr(>F)
Relative compactness 1 0.33606 0.33606 4042.453 < 2.2e−16
Surface area 1 0.04954 0.04954 595.887 < 2.2e−16
Overall height 1 0.24296 0.24296 2922.628 < 2.2e−16
Wall area 1 0.00103 0.00103 12.448 0.0004434
Glazing area 1 0.04106 0.04106 493.969 < 2.2e−16
Residuals 762 0.06335 0.00008
Total 767
13. Result for Model 2
DEPARTMENT OF MATHEMATICS AND STATISTICS, TEXAS TECH UNIVERSITY 13
15. Output of final WLS model
DEPARTMENT OF MATHEMATICS AND STATISTICS, TEXAS TECH UNIVERSITY 15
16. Result for WLS model
DEPARTMENT OF MATHEMATICS AND STATISTICS, TEXAS TECH UNIVERSITY 16
Table 06 some extreme values from WLS model diagnosis
Data points Standardized Residual Hat CookD
22 -0.1372486 0.01772626 5.672951𝑒−05
24 -0.2019629 0.01773586 1.229031𝑒−04
45 -2.8854279 0.01386842 1.932885𝑒−02
48 -2.9225801 0.01387279 1.983057𝑒−02
19. Conclusion
rate of change with respect to surface area, overall height, wall area and grazing area has a
positive effect on cooling load
however wall area as well as surface area is numerically small in case of rate of change.
MLR is not a good model to predict cooling load because we are loosing important predictors.
Although cross validation verified a good fit.
Elastic net could be a better model because we can use two different penalties with
regularization parameters which avail from CV.
DEPARTMENT OF MATHEMATICS AND STATISTICS, TEXAS TECH UNIVERSITY 19