Introduction to simple and multiple linear regression.
https://issuu.com/arbrylledisonparmodules/docs/archi203_par_report_multiple_and_simple_linear_reg
The document provides an introduction to regression analysis and performing regression using SPSS. It discusses key concepts like dependent and independent variables, assumptions of regression like linearity and homoscedasticity. It explains how to calculate regression coefficients using the method of least squares and how to perform regression analysis in SPSS, including selecting variables and interpreting the output.
The document presents the results of a simple linear regression analysis conducted by a black belt to predict the number of calls answered (dependent variable) based on staffing levels (independent variable) using data collected over 240 samples in a call center. The regression equation found 83.4% of the variation in calls answered was explained by staffing levels. Notable outliers and leverage points were identified that could impact the strength of the predicted relationship between calls answered and staffing.
Regression analysis is a statistical technique used to estimate the relationships between variables. It allows one to predict the value of a dependent variable based on the value of one or more independent variables. The document discusses simple linear regression, where there is one independent variable, as well as multiple linear regression which involves two or more independent variables. Examples of linear relationships that can be modeled using regression analysis include price vs. quantity, sales vs. advertising, and crop yield vs. fertilizer usage. The key methods for performing regression analysis covered in the document are least squares regression and regressions based on deviations from the mean.
Multiple regression allows researchers to use several independent variables simultaneously to predict a continuous dependent variable. It fits a mathematical equation to the data that describes the overall relationship between the dependent variable and independent variables. The equation can be used to predict the dependent variable value based on the values of the independent variables. The technique is useful for social science research where phenomena are influenced by multiple causal factors.
Introduction to Probability and Probability DistributionsJezhabeth Villegas
This document provides an overview of teaching basic probability and probability distributions to tertiary level teachers. It introduces key concepts such as random experiments, sample spaces, events, assigning probabilities, conditional probability, independent events, and random variables. Examples are provided for each concept to illustrate the definitions and computations. The goal is to explain the necessary probability foundations for teachers to understand sampling distributions and assessing the reliability of statistical estimates from samples.
This document provides an overview of regression analysis. It defines regression analysis as a predictive modeling technique used to investigate relationships between dependent and independent variables. It describes simple linear regression as involving one independent variable and one dependent variable, with the goal of finding the best fitting straight line through the data points. An example is provided to demonstrate how to conduct a simple linear regression to predict population in the year 2005 based on population data from previous years.
Discrete probability distribution (complete)ISYousafzai
This document discusses discrete random variables. It begins by defining a random variable as a function that assigns a numerical value to each outcome of an experiment. There are two types of random variables: discrete and continuous. Discrete random variables have a countable set of possible values, while continuous variables can take any value within a range. Examples of discrete variables include the number of heads in a coin flip and the total value of dice. The document then discusses how to describe the probabilities associated with discrete random variables using lists, histograms, and probability mass functions.
Introduces and explains the use of multiple linear regression, a multivariate correlational statistical technique. For more info, see the lecture page at http://goo.gl/CeBsv. See also the slides for the MLR II lecture http://www.slideshare.net/jtneill/multiple-linear-regression-ii
The document provides an introduction to regression analysis and performing regression using SPSS. It discusses key concepts like dependent and independent variables, assumptions of regression like linearity and homoscedasticity. It explains how to calculate regression coefficients using the method of least squares and how to perform regression analysis in SPSS, including selecting variables and interpreting the output.
The document presents the results of a simple linear regression analysis conducted by a black belt to predict the number of calls answered (dependent variable) based on staffing levels (independent variable) using data collected over 240 samples in a call center. The regression equation found 83.4% of the variation in calls answered was explained by staffing levels. Notable outliers and leverage points were identified that could impact the strength of the predicted relationship between calls answered and staffing.
Regression analysis is a statistical technique used to estimate the relationships between variables. It allows one to predict the value of a dependent variable based on the value of one or more independent variables. The document discusses simple linear regression, where there is one independent variable, as well as multiple linear regression which involves two or more independent variables. Examples of linear relationships that can be modeled using regression analysis include price vs. quantity, sales vs. advertising, and crop yield vs. fertilizer usage. The key methods for performing regression analysis covered in the document are least squares regression and regressions based on deviations from the mean.
Multiple regression allows researchers to use several independent variables simultaneously to predict a continuous dependent variable. It fits a mathematical equation to the data that describes the overall relationship between the dependent variable and independent variables. The equation can be used to predict the dependent variable value based on the values of the independent variables. The technique is useful for social science research where phenomena are influenced by multiple causal factors.
Introduction to Probability and Probability DistributionsJezhabeth Villegas
This document provides an overview of teaching basic probability and probability distributions to tertiary level teachers. It introduces key concepts such as random experiments, sample spaces, events, assigning probabilities, conditional probability, independent events, and random variables. Examples are provided for each concept to illustrate the definitions and computations. The goal is to explain the necessary probability foundations for teachers to understand sampling distributions and assessing the reliability of statistical estimates from samples.
This document provides an overview of regression analysis. It defines regression analysis as a predictive modeling technique used to investigate relationships between dependent and independent variables. It describes simple linear regression as involving one independent variable and one dependent variable, with the goal of finding the best fitting straight line through the data points. An example is provided to demonstrate how to conduct a simple linear regression to predict population in the year 2005 based on population data from previous years.
Discrete probability distribution (complete)ISYousafzai
This document discusses discrete random variables. It begins by defining a random variable as a function that assigns a numerical value to each outcome of an experiment. There are two types of random variables: discrete and continuous. Discrete random variables have a countable set of possible values, while continuous variables can take any value within a range. Examples of discrete variables include the number of heads in a coin flip and the total value of dice. The document then discusses how to describe the probabilities associated with discrete random variables using lists, histograms, and probability mass functions.
Introduces and explains the use of multiple linear regression, a multivariate correlational statistical technique. For more info, see the lecture page at http://goo.gl/CeBsv. See also the slides for the MLR II lecture http://www.slideshare.net/jtneill/multiple-linear-regression-ii
The document discusses simple linear regression. It defines key terms like regression equation, regression line, slope, intercept, residuals, and residual plot. It provides examples of using sample data to generate a regression equation and evaluating that regression model. Specifically, it shows generating a regression equation from bivariate data, checking assumptions visually through scatter plots and residual plots, and interpreting the slope as the marginal change in the response variable from a one unit change in the explanatory variable.
- The document discusses simple linear regression analysis and how to use it to predict a dependent variable (y) based on an independent variable (x).
- Key points covered include the simple linear regression model, estimating regression coefficients, evaluating assumptions, making predictions, and interpreting results.
- Examples are provided to demonstrate simple linear regression analysis using data on house prices and sizes.
This document provides an introduction to correlation and regression analysis. It defines correlation as a measure of the association between two variables and regression as using one variable to predict another. The key aspects covered are:
- Calculating correlation using Pearson's correlation coefficient r to measure the strength and direction of association between variables.
- Performing simple linear regression to find the "line of best fit" to predict a dependent variable from an independent variable.
- Using a TI-83 calculator to graphically display scatter plots of data and calculate the regression equation and correlation coefficient.
- Regression analysis is a statistical tool used to examine relationships between variables and can help predict future outcomes. It allows one to assess how the value of a dependent variable changes as the value of an independent variable is varied.
- Simple linear regression involves one independent variable, while multiple regression can include any number of independent variables. Regression analysis outputs include coefficients, residuals, and measures of fit like the R-squared value.
- An example uses home size and price data from 10 houses to generate a linear regression equation predicting that price increases by around $110 for each additional square foot. This model explains 58% of the variation in home prices.
Applications of regression analysis - Measurement of validity of relationshipRithish Kumar
This document provides a summary of regression analysis in 9 steps: 1) Specify dependent and independent variables, 2) Check for linearity with scatter plots, 3) Transform variables if nonlinear, 4) Estimate the regression model, 5) Test the model fit with R2, 6) Perform a joint hypothesis test of the coefficients, 7) Test individual coefficients, 8) Check for violations of assumptions like autocorrelation and heteroscedasticity, 9) Interpret the intercept and slope coefficients. Regression analysis is used to determine relationships between variables and estimate how changes in independents impact dependents.
To get a copy of the slides for free Email me at: japhethmuthama@gmail.com
You can also support my PhD studies by donating a 1 dollar to my PayPal.
PayPal ID is japhethmuthama@gmail.com
This presentation describes the application of regression analysis in research, testing assumptions involved in it and understanding the outputs generated in the analysis.
This document discusses multiple linear regression analysis performed using SAS. It begins by outlining the assumptions of linear regression, including a linear relationship between variables, normality, no multicollinearity, and homoscedasticity. It then explains that multiple linear regression attempts to model the relationship between multiple explanatory variables and a response variable by fitting a linear equation to observed data. The document goes on to describe the regression analysis process, model selection, interpretation of outputs like R-squared and p-values, and evaluation of diagnostics like autocorrelation. It concludes by listing the predictor variables selected by the stepwise regression model and interpreting their parameter estimates.
This presentation introduces regression analysis. It discusses key concepts such as dependent and independent variables, simple and multiple regression, and linear and nonlinear regression models. It also covers different types of regression including simple linear regression, cross-sectional vs time series data, and methods for building regression models like stepwise regression and forward/backward selection. Examples are provided to demonstrate calculating regression equations using the least squares method and computing deviations from mean values.
This chapter summary covers simple linear regression models. Key topics include determining the simple linear regression equation, measures of variation such as total, explained, and unexplained sums of squares, assumptions of the regression model including normality, homoscedasticity and independence of errors. Residual analysis is discussed to examine linearity and assumptions. The coefficient of determination, standard error of estimate, and Durbin-Watson statistic are also introduced.
Bernoullis Random Variables And Binomial Distributionmathscontent
Bernoulli and binomial random variables are used to model success/failure experiments. A Bernoulli variable represents a single trial with outcomes success (1) and failure (0). A binomial variable counts the number of successes in n independent Bernoulli trials. The probability of x successes in n trials is given by the binomial distribution. Its mean and variance can be derived. The moment generating function of the binomial distribution helps compute moments like variance.
Regression analysis is a statistical technique for investigating relationships between variables. Simple linear regression defines a relationship between two variables (X and Y) using a best-fit straight line. Multiple regression extends this to model relationships between a dependent variable Y and multiple independent variables (X1, X2, etc.). Regression coefficients are estimated to define the regression equation, and R-squared and the standard error can be used to assess the goodness of fit of the regression model to the data. Regression analysis has applications in pharmaceutical experimentation such as analyzing standard curves for drug analysis.
The document discusses multiple linear regression and partial correlation. It explains that multiple regression allows one to analyze the unique contribution of predictor variables to an outcome variable after accounting for the effects of other predictor variables. Partial correlation similarly examines the relationship between two variables while controlling for a third, but only considers two variables, whereas multiple regression examines the effects of multiple predictor variables simultaneously. Examples are given comparing the correlation between height and weight with and without controlling for other relevant variables like gender, age, exercise habits, etc.
This document provides an introduction to basic statistics and regression analysis. It defines regression as relating to or predicting one variable based on another. Regression analysis is useful for economics and business. The document outlines the objectives of understanding simple linear regression, regression coefficients, and merits and demerits of regression analysis. It describes types of regression including simple and multiple regression. Key concepts explained in more detail include regression lines, regression equations, regression coefficients, and the difference between correlation and regression. Examples are provided to demonstrate calculating regression equations using different methods.
This document presents a presentation on regression analysis submitted to Dr. Adeel. It includes:
- An introduction to regression analysis and its uses in measuring relationships between variables and making predictions.
- Methods for studying regression including graphically, algebraically using least squares, and deviations from means.
- An example calculating regression equations using data on students' grades and scores through least squares and deviations from means.
- Conclusion that the regression equations match those obtained through other common methods.
The document discusses simple linear regression analysis. It provides definitions and formulas for simple linear regression, including that the regression equation is y = a + bx. An example is shown of using the stepwise method to determine if there is a significant relationship between number of absences (x) and grades (y) for students. The analysis finds a significant negative relationship, meaning more absences correlated with lower grades. Formulas are provided for calculating the slope, intercept, and testing significance of the regression model.
Covariance is a measure of how two random variables change together, taking any value from -∞ to +∞. Covariance can be affected by changing the units of the variables. Correlation is a scaled version of covariance that indicates the strength of the relationship between two variables on a scale of -1 to 1. Unlike covariance, correlation is not affected by changes in the location or scale of the variables and provides a standardized measure of their relationship. Correlation is therefore preferred over covariance as a measure of the relationship between two variables.
This document provides an overview of simple linear regression. It defines regression as determining the statistical relationship between variables where changes in one variable depend on changes in another. Regression analysis is used for prediction and exploring relationships between dependent and independent variables. The key aspects covered include:
- Dependent variables change due to independent variables.
- Lines of regression show the relationship between the variables.
- The method of least squares is used to determine the line of best fit that minimizes the error between predicted and actual values.
- Linear regression models take the form of y = a + bx and are used for tasks like prediction and determining impact of independent variables.
Simple Linear Regression: Step-By-StepDan Wellisch
This presentation was made to our meetup group found here.: https://www.meetup.com/Chicago-Technology-For-Value-Based-Healthcare-Meetup/ on 9/26/2017. Our group is focused on technology applied to healthcare in order to create better healthcare.
Urbinsight is a next-generation data analysis platform designed for participatory mapping and planning processes. Its goal is to provide engaged cities and citizens with the necessary tools and technology to affect the resiliency and sustainability of their cities and settlements in a positive way. Evolved from earlier mapping methods that we pioneered in the San Francisco Bay Area in the 2000s, the project was first launched as the Ecocitizen World Map in 2014 and has since morphed into the much larger Urbinsight platform.
This document provides an overview of an introduction to machine learning course, including:
- A description of the course content which covers Python programming, data visualization, supervised learning algorithms, regression, and unsupervised learning.
- An example of predicting bike share usage at different stations and the importance of understanding the problem and data.
- Guidance on exploring and visualizing data in Python to gain insights before applying machine learning algorithms.
The document discusses simple linear regression. It defines key terms like regression equation, regression line, slope, intercept, residuals, and residual plot. It provides examples of using sample data to generate a regression equation and evaluating that regression model. Specifically, it shows generating a regression equation from bivariate data, checking assumptions visually through scatter plots and residual plots, and interpreting the slope as the marginal change in the response variable from a one unit change in the explanatory variable.
- The document discusses simple linear regression analysis and how to use it to predict a dependent variable (y) based on an independent variable (x).
- Key points covered include the simple linear regression model, estimating regression coefficients, evaluating assumptions, making predictions, and interpreting results.
- Examples are provided to demonstrate simple linear regression analysis using data on house prices and sizes.
This document provides an introduction to correlation and regression analysis. It defines correlation as a measure of the association between two variables and regression as using one variable to predict another. The key aspects covered are:
- Calculating correlation using Pearson's correlation coefficient r to measure the strength and direction of association between variables.
- Performing simple linear regression to find the "line of best fit" to predict a dependent variable from an independent variable.
- Using a TI-83 calculator to graphically display scatter plots of data and calculate the regression equation and correlation coefficient.
- Regression analysis is a statistical tool used to examine relationships between variables and can help predict future outcomes. It allows one to assess how the value of a dependent variable changes as the value of an independent variable is varied.
- Simple linear regression involves one independent variable, while multiple regression can include any number of independent variables. Regression analysis outputs include coefficients, residuals, and measures of fit like the R-squared value.
- An example uses home size and price data from 10 houses to generate a linear regression equation predicting that price increases by around $110 for each additional square foot. This model explains 58% of the variation in home prices.
Applications of regression analysis - Measurement of validity of relationshipRithish Kumar
This document provides a summary of regression analysis in 9 steps: 1) Specify dependent and independent variables, 2) Check for linearity with scatter plots, 3) Transform variables if nonlinear, 4) Estimate the regression model, 5) Test the model fit with R2, 6) Perform a joint hypothesis test of the coefficients, 7) Test individual coefficients, 8) Check for violations of assumptions like autocorrelation and heteroscedasticity, 9) Interpret the intercept and slope coefficients. Regression analysis is used to determine relationships between variables and estimate how changes in independents impact dependents.
To get a copy of the slides for free Email me at: japhethmuthama@gmail.com
You can also support my PhD studies by donating a 1 dollar to my PayPal.
PayPal ID is japhethmuthama@gmail.com
This presentation describes the application of regression analysis in research, testing assumptions involved in it and understanding the outputs generated in the analysis.
This document discusses multiple linear regression analysis performed using SAS. It begins by outlining the assumptions of linear regression, including a linear relationship between variables, normality, no multicollinearity, and homoscedasticity. It then explains that multiple linear regression attempts to model the relationship between multiple explanatory variables and a response variable by fitting a linear equation to observed data. The document goes on to describe the regression analysis process, model selection, interpretation of outputs like R-squared and p-values, and evaluation of diagnostics like autocorrelation. It concludes by listing the predictor variables selected by the stepwise regression model and interpreting their parameter estimates.
This presentation introduces regression analysis. It discusses key concepts such as dependent and independent variables, simple and multiple regression, and linear and nonlinear regression models. It also covers different types of regression including simple linear regression, cross-sectional vs time series data, and methods for building regression models like stepwise regression and forward/backward selection. Examples are provided to demonstrate calculating regression equations using the least squares method and computing deviations from mean values.
This chapter summary covers simple linear regression models. Key topics include determining the simple linear regression equation, measures of variation such as total, explained, and unexplained sums of squares, assumptions of the regression model including normality, homoscedasticity and independence of errors. Residual analysis is discussed to examine linearity and assumptions. The coefficient of determination, standard error of estimate, and Durbin-Watson statistic are also introduced.
Bernoullis Random Variables And Binomial Distributionmathscontent
Bernoulli and binomial random variables are used to model success/failure experiments. A Bernoulli variable represents a single trial with outcomes success (1) and failure (0). A binomial variable counts the number of successes in n independent Bernoulli trials. The probability of x successes in n trials is given by the binomial distribution. Its mean and variance can be derived. The moment generating function of the binomial distribution helps compute moments like variance.
Regression analysis is a statistical technique for investigating relationships between variables. Simple linear regression defines a relationship between two variables (X and Y) using a best-fit straight line. Multiple regression extends this to model relationships between a dependent variable Y and multiple independent variables (X1, X2, etc.). Regression coefficients are estimated to define the regression equation, and R-squared and the standard error can be used to assess the goodness of fit of the regression model to the data. Regression analysis has applications in pharmaceutical experimentation such as analyzing standard curves for drug analysis.
The document discusses multiple linear regression and partial correlation. It explains that multiple regression allows one to analyze the unique contribution of predictor variables to an outcome variable after accounting for the effects of other predictor variables. Partial correlation similarly examines the relationship between two variables while controlling for a third, but only considers two variables, whereas multiple regression examines the effects of multiple predictor variables simultaneously. Examples are given comparing the correlation between height and weight with and without controlling for other relevant variables like gender, age, exercise habits, etc.
This document provides an introduction to basic statistics and regression analysis. It defines regression as relating to or predicting one variable based on another. Regression analysis is useful for economics and business. The document outlines the objectives of understanding simple linear regression, regression coefficients, and merits and demerits of regression analysis. It describes types of regression including simple and multiple regression. Key concepts explained in more detail include regression lines, regression equations, regression coefficients, and the difference between correlation and regression. Examples are provided to demonstrate calculating regression equations using different methods.
This document presents a presentation on regression analysis submitted to Dr. Adeel. It includes:
- An introduction to regression analysis and its uses in measuring relationships between variables and making predictions.
- Methods for studying regression including graphically, algebraically using least squares, and deviations from means.
- An example calculating regression equations using data on students' grades and scores through least squares and deviations from means.
- Conclusion that the regression equations match those obtained through other common methods.
The document discusses simple linear regression analysis. It provides definitions and formulas for simple linear regression, including that the regression equation is y = a + bx. An example is shown of using the stepwise method to determine if there is a significant relationship between number of absences (x) and grades (y) for students. The analysis finds a significant negative relationship, meaning more absences correlated with lower grades. Formulas are provided for calculating the slope, intercept, and testing significance of the regression model.
Covariance is a measure of how two random variables change together, taking any value from -∞ to +∞. Covariance can be affected by changing the units of the variables. Correlation is a scaled version of covariance that indicates the strength of the relationship between two variables on a scale of -1 to 1. Unlike covariance, correlation is not affected by changes in the location or scale of the variables and provides a standardized measure of their relationship. Correlation is therefore preferred over covariance as a measure of the relationship between two variables.
This document provides an overview of simple linear regression. It defines regression as determining the statistical relationship between variables where changes in one variable depend on changes in another. Regression analysis is used for prediction and exploring relationships between dependent and independent variables. The key aspects covered include:
- Dependent variables change due to independent variables.
- Lines of regression show the relationship between the variables.
- The method of least squares is used to determine the line of best fit that minimizes the error between predicted and actual values.
- Linear regression models take the form of y = a + bx and are used for tasks like prediction and determining impact of independent variables.
Simple Linear Regression: Step-By-StepDan Wellisch
This presentation was made to our meetup group found here.: https://www.meetup.com/Chicago-Technology-For-Value-Based-Healthcare-Meetup/ on 9/26/2017. Our group is focused on technology applied to healthcare in order to create better healthcare.
Urbinsight is a next-generation data analysis platform designed for participatory mapping and planning processes. Its goal is to provide engaged cities and citizens with the necessary tools and technology to affect the resiliency and sustainability of their cities and settlements in a positive way. Evolved from earlier mapping methods that we pioneered in the San Francisco Bay Area in the 2000s, the project was first launched as the Ecocitizen World Map in 2014 and has since morphed into the much larger Urbinsight platform.
This document provides an overview of an introduction to machine learning course, including:
- A description of the course content which covers Python programming, data visualization, supervised learning algorithms, regression, and unsupervised learning.
- An example of predicting bike share usage at different stations and the importance of understanding the problem and data.
- Guidance on exploring and visualizing data in Python to gain insights before applying machine learning algorithms.
Beyond Design & Construction: Greening Your School Through Sustainable Operat...YRG sustainability
This document discusses how to green school operations through sustainable practices. It recommends evaluating energy usage, encouraging alternative transportation, conserving water and improving waste diversion. Specific strategies include conducting energy audits, providing bike racks, installing efficient fixtures and composting. The document emphasizes establishing clear goals and policies, ongoing monitoring and benchmarking performance. Sustainable operations require engaging occupants and making sustainable behaviors fun and socially rewarding.
The document summarizes a thesis that evaluated a social visualization tool called Stepgreen.org, which allows users to commit to green actions and view personal and social savings. The thesis conducted a study comparing users who saw only personal feedback versus social feedback showing community performance. Users seeing social feedback fulfilled more actions, suggesting social visualization can motivate sustainable behavior. Future work could explore competitive elements and applying this approach to other collective goals like voting, education, and healthcare.
JakartaOne Livestream CN4J: Bringing Reactive to Enterprise DevelopersJakarta_EE
This document discusses bringing reactive programming to enterprise developers. It defines reactive as responding to stimuli without blocking. Reactive programming uses non-blocking I/O and callbacks to build concurrent and distributed systems that are responsive, message-driven, and resilient. The document advocates using reactive techniques with messaging systems like Kafka to build distributed applications and services that can handle failures and elastic workloads. It presents the Quarkus framework as unifying imperative and reactive programming for Java developers.
This document provides an overview of basic concepts for environmental impact assessment (EIA). It defines key terms like "impacts" and "baseline situation." The EIA process begins with understanding the proposed activity and its development objective. It then involves screening the activity to determine the appropriate level of analysis. A preliminary assessment may be conducted for activities of moderate or unknown risk to identify impacts and necessary mitigation measures. For very high risk activities, a full EIA study is required, which involves more detailed analysis of alternatives and impacts. Public participation is an important part of transparent and effective EIA.
The document discusses water demand forecasting and population forecasting methods. It describes calculating total annual water demand, average daily flow rates, and per capita demand. It also outlines factors that affect per capita demand and reasons for selecting a design period. The document then discusses various population forecasting methods like arithmetic increase, geometric increase, incremental increase, and graphical methods. It provides formulas and explanations for different curve fitting techniques to extrapolate population projections, including linear, geometric, parabolic, modified exponential, Gompertz, and logistic curves.
A Practical Guide to Sustainability in MeetingsEIBTM
Tamara Kennedy-Hill from GMIC outlines best practice steps that can be taken to make events more sustainable. These steps can be delivered using the framework APEX/ASTM.
Products are used by people to live their lives and are made by companies through economic activities that have impacts throughout their supply chain and after use. Accounting only considers financial impacts, but a product's full life cycle almost always has significant negative environmental and social impacts that are not accounted for, such as resource depletion and pollution. Impact accounting seeks to incorporate these externalities by tracking changes to financial, human, natural, and man-made capital in order to give a more complete picture of a product's true costs and value.
This document discusses multi-dimensional impact accounting and how it can be used to better account for the impacts of products and economic activities. It describes how traditional financial accounting only considers monetary impacts, while ignoring impacts on other capitals like human, natural, and social capital. The document proposes a framework to adjust financial accounts by adding positive impacts not accounted for, and deducting negative externalities. It suggests impact accounting could be applied at each stage of a supply chain to aggregate impacts. The goal is to provide more transparency about total life cycle impacts to help consumers make informed choices that factor in environmental and social considerations, not just price.
A Review of Solar PV Benefit and Cost StudiesJohn Farrell
A marvelous presentation on the many complicated factors involved in calculating the value of solar to an electric utility. Presented on 9/20/13 by Lena Hansen and Virginia Lacy of the Rocky Mountain Institute to a Value of Solar Workshop hosted by the Division of Energy Resources of the Minnesota Department of Commerce. Part 1 of the stakeholder process for establishing the state's value of solar methodology for utilities.
This document discusses cost-benefit analysis for forestry projects. It begins by defining appraisal and outlining the nature of forestry projects, which have long production periods as trees are both the production unit and product. Common objectives of forestry projects include improving economic efficiency, social conditions, stability, and the environment. The stages of cost-benefit analysis are then outlined, including defining the issue, identifying options and costs/benefits, adjusting for future values, assessing risks, distributional impacts, and using techniques like net present value to evaluate projects. Environmental impacts are an important consideration in cost-benefit analysis to account for externalities. Sensitivity analysis is also recommended to assess how sensitive results are to changes in key parameters.
The document describes the ABCD model for program evaluation. The model has four components: A) beneficiaries or clients of the program, B) the program itself, C) effects or outcomes of the program, and D) social impact. An example study is described that uses the ABCD model to evaluate a Botika ng Barangay (BnB) or community drugstore program. The study assesses the program's contributions to community health, savings, and living standards. The evaluation finds positive effects including reduced health expenditures and improved health outcomes.
AEIOU Framework:Towards “Laws of Service” Across Time-Space-ScaleHaluk Demirkan
This document presents a framework for service science called AEIOU (Abstract-Entity-Interaction-Outcome-Universals). It discusses entities as actors and service systems, types of service system entities and structures. It also addresses questions in service science about entities, interactions and outcomes over time and space. Universities are presented as complex service systems important to service science.
This document summarizes research into how dimensionality affects variable interaction in large-scale optimization problems (LSOPs). The researchers estimated correlation between variable pairs in 86 test problems by adapting a covariance matrix. They found that correlation decays with increasing dimensions, such that problems with 100+ dimensions show weak correlation and 500+ dimensions show nearly null correlation. This suggests LSOPs can be efficiently optimized by exploiting directions along axes rather than exploratory diagonal moves, due to their weak variable correlation in practice. Further studies will propose lighter covariance adaptation and algorithms exploiting the weak correlation between variable pairs.
Presentation by Grant Young at Design Thinking Sydney meetup, Feb 2016. Looks at some of the differences in applying common UX, design thinking and lean startup methods in a for-purpose context. Touches on defining value, flearning, engaging stakeholders, behaviour change, metrics and traction.
The document summarizes the regression discontinuity method used to evaluate the impact of Morocco's National Human Development Initiative (INDH) poverty reduction program. Key points:
- INDH targeted communities with poverty rates over 30% for additional funding. This threshold was used to compare outcomes just above and below the cutoff in a regression discontinuity design.
- Panel survey data from 2008, 2011, and 2013 was used to analyze economic outcomes like income, consumption, and assets at the household level around the threshold.
- Regression models found INDH caused a 12.5% increase in consumption in 2008 and 20.7% in 2011, but no significant effects on income or assets.
- The analysis is
This is a final training report of the training course provided by MOCC, World Bank.
Simplify the ISO 14001 concept to explain how a company in the business sector can act to the global warming with limited resources.
A slideshow about the ongoing sustainability initiative at International School Manila as presented to the East Asian Council of Overseas Schools (EARCOS) teachers conference 2013
Similar to Statistics - Simple Linear and Multiple Linear Regression (20)
References:
Asq.org (n.d.). What are stakeholders? Quality Resources
Lecciones, A. (2021). Green Cities. Hex Talks
Narayanaswami, P., Gronseth, G., Dubinsky, R., Penfold-Murray, R., Cox, J., Bever, C., Martins, Y., Rheaume, C., Shouse, D., & Getchius, T. (2015). The impact of social media on dissemination and implementation of clinical practice guidelines: A longitudinal observational study. Journal of Medical Internet Research, 17(8), 1-12. https://doi.org/10.2196/jmir.4414
Philippine Statistics Authority (2021). Causes of deaths in the Philippines (preliminary): January to December 2020. Press Releases
Online TDM Encyclopedia (2017). Walking and cycling encouragement: Strategies that encourage people to use non-motorized transportation. Victoria Transport Policy Institute
Online TDM Encyclopedia (2019). Automobile Dependency. Victoria Transport Policy Institute
United Nations Foundation (n.d.). Sustainable Development Goals
World Health Organization (n.d.). Air Pollution. Health Topics
Yazid, M. & Ladim, M. (2015). Urban design and active transport. International Journal of Engineering and and Advanced Technology, 4(3), 132-135.
This document discusses mixed methods research and data collection methods. It defines mixed methods research as combining qualitative and quantitative research approaches. Qualitative research aims to understand people and contexts through methods like interviews, while quantitative research tests hypotheses and looks at cause and effect through collected statistics. The document also outlines various primary data collection methods like surveys, case studies, questionnaires, and interviews. It describes data coding, cleaning, and analysis as important processes for organizing and summarizing collected information.
The document discusses the negative effects of unplanned urban sprawl, including overcrowding, environmental degradation, and health issues. It notes that unplanned urban sprawl can lead to overcrowding and congestion in cities as population grows. This overcrowding, especially in poor urban communities, increases risks of communicable diseases and mental health issues. Unplanned sprawl also contributes to environmental problems like pollution, natural resource depletion, and the urban heat island effect. The document recommends integrating more urban green spaces into development to help address these issues.
Managing spaces and visual resources environmental issues in urban designBryll Edison Par
The document discusses managing visual resources and spaces. It defines visual resources as the natural and built features that give an area its visual character, such as vegetation, water features, landmarks, and human modifications to the landscape. It describes the process of visual resource management, which involves inventorying, designating objectives, and planning to minimize impacts on scenic values. Key visual resources in Metro Manila are identified, such as Rizal Park, the Las Piñas-Parañaque Critical Habitat and Ecotourism Area wetlands, and the Ninoy Aquino Parks wildlife center.
This document provides an overview of ethnography as a qualitative research methodology. It defines ethnography as the systematic study of people and cultures from the point of view of the subject. Ethnography involves direct observation and interaction with participants in their natural environment through methods such as interviews and surveys. It requires skills such as interpretative agility, impartiality, and cultural sensitivity. The document outlines the history, key features, advantages, and disadvantages of ethnographic research and provides guidance on its applications and effective conduct.
The nipa hut as a green building by bryll edison parBryll Edison Par
The document provides an overview of the history and characteristics of the nipa hut, which was traditionally used as shelter by indigenous Filipinos before colonization. It describes the key parts of the nipa hut, including the stilts, living space, walls, windows, and roof. The nipa hut had a lightweight and open design that allowed maximum airflow for cooling, and used natural local materials like bamboo, wood, and nipa thatch. While some permanent structures were introduced during Spanish colonial rule, the nipa hut design continued to influence Filipino architecture and remains an important part of cultural heritage today.
The document discusses principles of tropical architecture and design through two case studies - the Belarocca Island Resort in the Philippines and a house in Maui, Hawaii. It outlines how these projects utilize passive design elements like orientation, ventilation, shading and natural materials to promote thermal comfort without mechanical cooling. Key strategies include maximizing air flow, removing hot air via convection currents, and using vegetation for shade and fresh air.
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, ai, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead, Prasad and Procure.FYI's Co-Found
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...sameer shah
"Join us for STATATHON, a dynamic 2-day event dedicated to exploring statistical knowledge and its real-world applications. From theory to practice, participants engage in intensive learning sessions, workshops, and challenges, fostering a deeper understanding of statistical methodologies and their significance in various fields."
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfGetInData
Recently we have observed the rise of open-source Large Language Models (LLMs) that are community-driven or developed by the AI market leaders, such as Meta (Llama3), Databricks (DBRX) and Snowflake (Arctic). On the other hand, there is a growth in interest in specialized, carefully fine-tuned yet relatively small models that can efficiently assist programmers in day-to-day tasks. Finally, Retrieval-Augmented Generation (RAG) architectures have gained a lot of traction as the preferred approach for LLMs context and prompt augmentation for building conversational SQL data copilots, code copilots and chatbots.
In this presentation, we will show how we built upon these three concepts a robust Data Copilot that can help to democratize access to company data assets and boost performance of everyone working with data platforms.
Why do we need yet another (open-source ) Copilot?
How can we build one?
Architecture and evaluation
Learn SQL from basic queries to Advance queriesmanishkhaire30
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
Global Situational Awareness of A.I. and where its headedvikram sood
You can see the future first in San Francisco.
Over the past year, the talk of the town has shifted from $10 billion compute clusters to $100 billion clusters to trillion-dollar clusters. Every six months another zero is added to the boardroom plans. Behind the scenes, there’s a fierce scramble to secure every power contract still available for the rest of the decade, every voltage transformer that can possibly be procured. American big business is gearing up to pour trillions of dollars into a long-unseen mobilization of American industrial might. By the end of the decade, American electricity production will have grown tens of percent; from the shale fields of Pennsylvania to the solar farms of Nevada, hundreds of millions of GPUs will hum.
The AGI race has begun. We are building machines that can think and reason. By 2025/26, these machines will outpace college graduates. By the end of the decade, they will be smarter than you or I; we will have superintelligence, in the true sense of the word. Along the way, national security forces not seen in half a century will be un-leashed, and before long, The Project will be on. If we’re lucky, we’ll be in an all-out race with the CCP; if we’re unlucky, an all-out war.
Everyone is now talking about AI, but few have the faintest glimmer of what is about to hit them. Nvidia analysts still think 2024 might be close to the peak. Mainstream pundits are stuck on the wilful blindness of “it’s just predicting the next word”. They see only hype and business-as-usual; at most they entertain another internet-scale technological change.
Before long, the world will wake up. But right now, there are perhaps a few hundred people, most of them in San Francisco and the AI labs, that have situational awareness. Through whatever peculiar forces of fate, I have found myself amongst them. A few years ago, these people were derided as crazy—but they trusted the trendlines, which allowed them to correctly predict the AI advances of the past few years. Whether these people are also right about the next few years remains to be seen. But these are very smart people—the smartest people I have ever met—and they are the ones building this technology. Perhaps they will be an odd footnote in history, or perhaps they will go down in history like Szilard and Oppenheimer and Teller. If they are seeing the future even close to correctly, we are in for a wild ride.
Let me tell you what we see.
The Building Blocks of QuestDB, a Time Series Databasejavier ramirez
Talk Delivered at Valencia Codes Meetup 2024-06.
Traditionally, databases have treated timestamps just as another data type. However, when performing real-time analytics, timestamps should be first class citizens and we need rich time semantics to get the most out of our data. We also need to deal with ever growing datasets while keeping performant, which is as fun as it sounds.
It is no wonder time-series databases are now more popular than ever before. Join me in this session to learn about the internal architecture and building blocks of QuestDB, an open source time-series database designed for speed. We will also review a history of some of the changes we have gone over the past two years to deal with late and unordered data, non-blocking writes, read-replicas, or faster batch ingestion.
Analysis insight about a Flyball dog competition team's performanceroli9797
Insight of my analysis about a Flyball dog competition team's last year performance. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main
Statistics - Simple Linear and Multiple Linear Regression
1. PRESENTED BY |
Advanced Quantitative Research in the Designed and Built Environment
Simple Linear and Multiple Linear Regression
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
BRYLL EDISON C. PAR
A. INTRODUCTION TO SIMPLE LINEAR REGRESSION
B. HOW TO PERFORM LINEAR REGRESSION
C. MULTIPLE REGRESSION
2. PRESENTED BY |
PART 1 – Introduction to Simple Linear Regression
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
BRYLL EDISON C. PAR
3. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: https://www.scribbr.com/statistics/simple-linear-regression/
IMAGE REFERENCES: From left-analyticsvidhya.com; Nwaogazie (2017)
Regression models describe the relationship between variables by fitting a line to the observed data.
Linear regression models use a straight line, while logistic and nonlinear regression models use a
curved line. Regression allows you to estimate how a dependent variable changes as the independent
variable(s) change.
FIGURE 1 – LINEAR REGRESSION MODEL FIGURE 2 – LOGISTIC AND NONLINEAR REGRESSION MODEL
4. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
Linear regression attempts to model the relationship between two variables by fitting a linear equation to
observed data. One variable is considered to be an explanatory variable, and the other is considered to be
a dependent variable.
SOURCE: stat.yale.edu
IMAGE REFERENCES: From left-Scribbr; Auerkari, e at.,(2017); sphweb.bumc.bu.edu
5. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Objectives of Linear Regression
To establish if there is a relationship between two
variables:
- More specifically, establish if there is a statistically
significant relationship between the two.
- Examples: Income and spending, wage and gender,
and student height and exam scores.
Forecast new observations:
- Can we use what we know about the relationship to
forecast unobserved values?
- Examples: What will our sales be for the next quarter?
What will the ROI of a new store opening be contingent
on store attributes.
Variable Roles The Magic
• Dependent Variable – Denoted by “y”
• Independent Variable – Denoted by “x”
Slope-intercept form
• y = a+bx
• y = mx+b
Linear Equation in Statistics
y = β0 + β1x
where:
β0 = Intercept/constant value
β1 = slope of x
6. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Note: We call it “linear equation” because the equation represents a
straight line in a bi-dimensional plot
Change in intercept
Change in slope
Slope-intercept form
• y = a+bx
• y = mx+b
7. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Simple Linear Regression Error Term
Linear Regression Model Linear Regression Equation
y = β0 + β1x + ε
where:
y = Dependent Variable
x = Independent Variable
β0 = Intercept/constant value
β1 = Coefficient/slope of x
ε = error term
y = β0 + β1x
where:
y = Dependent Variable
x = Independent Variable
β0 = Intercept/constant value
β1 = Coefficient/slope of x
Note: There is no error term since the error is assumed to be zero
8. PRESENTED BY |
PART 2 – How to Perform Linear Regression
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
BRYLL EDISON C. PAR
9. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
Step 1. Compile the observations/true value on a table in the Microsoft Excel program and
save it as a CSV. File.
10. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
11. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
Step 2. Review the linear regression model and identify the independent as well as the
dependent variable.
12. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
Linear Regression Model Explanation
y = β0 + β1x + ε
where:
y = Dependent Variable
x = Independent Variable
β0 = Intercept/constant value
β1 = Coefficient/slope of x
ε = error term
CONSUMPTION = β0 + β1 INCOME + ε
where:
y = Consumption
x = Income
β0 = Intercept/constant value
β1 = Coefficient/slope of x
ε = error term
Assumption: Income explains consumption
13. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com; Gretl Tutorial 1: Simple Linear Regression by dataminingincae Found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
Step 3. Find the coefficients of the constant and the independent variable
In this case an open-source statistical package will be used (Gretl software)
The software may be downloaded on this link: http://gretl.sourceforge.net/win32/
14. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com; Gretl Tutorial 1: Simple Linear Regression by dataminingincae Found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
Open Gretl software Click the file tab and hover to open data and user file
15. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com; Gretl Tutorial 1: Simple Linear Regression by dataminingincae Found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
A window will pop up. Locate your csv file from your pc and
then click open
After you click open, you will be redirected to this window
16. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com; Gretl Tutorial 1: Simple Linear Regression by dataminingincae Found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
Select both the dependent and independent variable (income
and consumption) and then click the Beta icon below
Choose consumption by clicking the blue arrow pointing right
17. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com; Gretl Tutorial 1: Simple Linear Regression by dataminingincae Found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
Choose income by clicking the green arrow pointing right
below the blue arrow then click OK
A window will pop up showing the summary of the data you
need
18. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com; Gretl Tutorial 1: Simple Linear Regression by dataminingincae Found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
Gretl is an open-source statistical
package, mainly for econometrics.
Econometrics is the science or field of
knowledge that analyses data with
statistical models to test hypothesis and
reach conclusions.
y = 48.77 + 0.85 x + ε
Consumption = 48.77 + 0.85 income + ε
19. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com; Gretl Tutorial 1: Simple Linear Regression by dataminingincae Found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
Step 4. Forecast using gretl. Please follow the instructions
20. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com; Gretl Tutorial 1: Simple Linear Regression by dataminingincae Found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
Hover on the analysis tab and click forecast and wait for
another window to pop up
Check the value of the forecast range and click OK
21. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com; Gretl Tutorial 1: Simple Linear Regression by dataminingincae Found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
A summary of the predictions, standards error and the intervals will open as well as the “Forecast evaluation statistics using 40
observations” not shown in this figure. The graph will also pop up as shown on the next slide
22. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
Estimated vs. Actual Values
23. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com; Gretl Tutorial 1: Simple Linear Regression by dataminingincae Found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
Step 5. Finalize result and proceed with conclusion
24. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Example: Family’s consumption of a given product (Relationship between the family’s income and
the consumption)
y = 48.77 + 0.85 x + ε
Consumption = 48.77 + 0.85 income + ε
Estimated model of consumption
48.77 = Interpreted consumption of a family with 0 income
0.85 = Marginal effect of one unit increase of income on consumption
x = It doesn’t have an intuitive interpretation meaning that in most cases we will actually be ignoring it.
Conclusion:
Income will grow 0.85 for every unit increase in income. Ex: A family's income is 50 dollars more.
0.85x
(0.85)(50 dollars) = 42.5 dollars
It means that for every 50 dollars of income a family earns more per week, the consumption will grow on average
an expected of 42.5 dollars.
25. PRESENTED BY |
PART 3 – Multiple Regression
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
BRYLL EDISON C. PAR
26. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: https://www.scribbr.com/statistics/multiple-linear-regression/
Multiple linear regression is used to estimate the relationship between two or more independent
variables and one dependent variable.
IMAGE REFERENCES: From left – Jacome (2016); scribbr
27. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Objectives of Multiple Linear Regression
You can use multiple linear regression when you want to
know:
• How strong the relationship is between two or more
independent variables and one dependent variable (e.g.
how rainfall, temperature, and amount of fertilizer added
affect crop growth)
You can use multiple linear regression when you want to
know:
• The value of the dependent variable at a certain value
of the independent variables (e.g. the expected yield of
a crop at certain levels of rainfall, temperature, and
fertilizer addition).
Variable Roles The Magic
• Dependent Variable – Denoted by “y”
• Independent Variable – Denoted by “x”
Slope-intercept form
• y = a+bx
• y = mx+b
Multiple Linear Regression Model
y = β0 + β1X1 + β2X2 + … βpXp + ε
where:
β0 = Intercept/constant value
β1 = slope of x
28. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Multiple Regression Key Concept
Simple linear regression
(One to one relationship)
DV
IV
Multiple regression
(Many to one relationship)
DV
IV IV
IV IV
… or more
Note: Adding more independent variables to a multiple regression procedure does not mean the regression will be
“better” or offer better predictions; in fact, it can make things worse. This is called “Overfitting”
The addition of more independent variables creates more relationships among them. So not only the independent
variable, but they are also potentially related to each other. When this happen, it is called “Multicollinearity”
29. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Multicollinearity – the independent variables are correlated with each other.
The ideal is for all the independent variables to be correlated with the dependent variable but not with each other.
DV
IV
IV
IV
IV
Check for the relationship between each
independent variable and the dependent variable.
Consider all the relationships between each
independent variables.
Multiple regression
(Many to one relationship)
In tis example. 10 relationships should be considered.
4 relationships between IV and DV and another and 6
relationships between IV and IV
Note: The more Independent variable added the
relationships become numerous. Some independent
variables, or set of independent variables, are better at
predicting the dependent variable than others. Some
contribute nothing.
The ideal is for all the independent variables to be
correlated with the dependent variable but not with each
other.
30. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Facts
• Multiple regression is an extension of simple linear
regression.
• Two or more independent variable is used to
predict/explain the variance in one dependent variable.
• Two problems may arise:
• 1. Overfitting: is caused by adding too many
independent variables; they account for more variance
but add nothing to the model
• 2. Multicollinearity: happens when some/all the
independent variables are correlated with eachother.
• In multiple regression, each coefficient is interpreted as
the estimated change in y corresponding to a one unit
change in a variable, when all other variables are held
constant.
y = β0 + β1X1 + β2X2 + … βpXp + ε
Multiple Linear Regression Model
Sum of Linear Parameters Error Term
Multiple Linear Regression Equation
y = β0 + β1X1 + β2X2 + … βpXp
Error Term is assumed to be zero
Estimated Multiple Linear
Regression Equation
ŷ = b0 + b1X1 + b2X2 + … bpXp
b1, b2,…bp are the estimates of β1, β2,…βp
31. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
EXAMPLE 1: ESTIMATED MULTIPLE REGRESSION
EQUATION
Ŷ = 5.344 + 0.025 X1 + 0.234 X2 – 0.529 X3
(Standard form of a multiple regression equation)
ŷ = b0 + b1X1 + b2X2 + … bpXp
(Estimated multiple regression equation)
Estimates of a multiple regression model
Variables: X1, X2, and X3
Coefficients: 0.025, 0.234, and -0.529
Intercept: 5.344
EXAMPLE 2: INTERPRETTING COEFFICIENTS
Ŷ = 27 + 9 X1 + 12 X2
(Standard form of a multiple regression equation)
X1 = Capital Investments (1000 usd)
X2 = Marketing Expenditures (1000 usd)
Ŷ = Predicted Exam Score (1000 usd)
Note: In multiple regression, each coefficient is
interpreted as the estimated change in y
corresponding to a one unit change in a variable, when
all other variables are held constant.
In this example, 9000 usd is an estimate of the
expected increase in sales y, corresponding to a 1000
usd increase in capital investment (X1) when
marketing expenditure (X2) are held constant.
32. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Multiple Regression Pre-work/Data Preparation
1. Generate a list of potential variables; independent(s) and dependent.
2. Collect data on the variables.
3. Check the relationships between each independent variable and the dependent variable using scatterplots and
correlations.
4. Check the relationships between independent variables using scatterplots and correlations.
5. (Optional) Conduct simple linear regression for each independent and dependent pair.
6. Use the non-redundant independent variables in the analysis to find the best fitting model.
7. Use the best fitting model to make predictions about the dependent variable.
33. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Sample Problem: Regional Delivery Service
RDS Data and Variable Naming
To conduct your analysis, you take a random sample of 10 past trips and record four pieces of information for each
trip: 10 Total miles traveled, 2) number of deliveries, 3) the daily gas price, and 4) total travel time in hours.
Miles Traveled, (X1) Number of Deliveries, (X2) Gas Price, (X3) Travel Time (Hours), (y)
89 4 3.84 7
66 1 3.19 5.4
78 3 3.78 6.6
111 6 3.89 7.4
44 1 3.57 4.8
77 3 3.57 6.4
80 3 3.03 7
66 2 3.51 5.6
109 5 3.54 7.3
76 3 3.25 6.4
34. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Sample Problem: Regional Delivery Service (Sketching out relationships)
Independent Variables
Miles
Traveled,
(X1)
Number of
Deliveries,
(X2)
Gas Price,
(X3)
Travel
Time, (y)
Dependent Variable
6 Relationships Should be Analyzed
35. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Sample Problem: Regional Delivery Service – Dependent Variable vs Independent Variable Scatterplots (Using
Gretl application)
X1 vs. y X2 vs. y X3 vs. y
R squared = 0.862
P Value (F) = 0.000
R squared = 0.840
P Value (F) = 0.000
R squared = 0.071
P Value (F) = 0.455
36. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Sample Problem: Regional Delivery Service – Multicollinearity Scatterplots (Using Gretl application)
X1 vs. y X2 vs. y X3 vs. y
R squared = 0.914
P Value (F) = 0.000
R squared = 0.127
P Value (F) = 0.313
R squared = 0.248
P Value (F) = 0.143
37. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Sample Problem: Regional Delivery Service – Correlation Summary
Correlation analysis confirms the conclusion reached by visual examination of the scatterplots
Redundant multicollinear variables
• Miles Travelled and Number of Deliveries are both highly correlated with each other and therefore are redundant;
only one should be used in the multiple regression analysis.
Non-contributing variables
• Gas price is not correlated with the dependent variable and should be excluded.
Note: For education purposes, all three relationships will be retained.
38. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Sample Problem: Regional Delivery Service – Individual Summary Output
Travel Time (y) vs. Miles Travelled (X1)
Ŷ = 3.186 + 0.0403 (Miles Travelled)
Ŷ = 3.186 + 0.0403 X1
An increase in 1 mile will increase delivery
time by 0.0403 hours
39. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Sample Problem: Regional Delivery Service – Individual Summary Output
Travel Time (y) vs. Number of Deliveries (X2)
Ŷ = 4.845 + 0.498 (Number of Deliveries)
Ŷ = 4.845 + 0.498 X2
An increase in 1 delivery will increase
delivery time by 0.498 hours
40. Advanced Quantitative Research in the Designed and Built Environment
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
INTRODUCTION TO SIMPLE LINEAR REGRESSION | HOW TO PERFORM LINEAR REGRESSION | MULTIPLE REGRESSION
SOURCE: Introduction to Simple Linear Regression by dataminingincae found at YouTube.com
Sample Problem: Regional Delivery Service – Individual Summary Output
Travel Time (y) vs. Gas Price (X3)
Ŷ = 3.536 + 0.811 (Gas Price)
Ŷ = 3.536 + 0.811 X3
Gas Price is not a variable that contributes
to travel time. No need to explore this value.
41. PRESENTED BY |
Advanced Quantitative Research in the Designed and Built Environment
Simple Linear and Multiple Linear Regression
UNIVERSITY OF THE PHILIPPINES – DILIMAN | INTEGRATED GRADUATE PROGRAM (IGP) | URBAN DESIGN STUDIO LAB
BRYLL EDISON C. PAR
A. INTRODUCTION TO SIMPLE LINEAR REGRESSION
B. HOW TO PERFORM LINEAR REGRESSION
C. MULTIPLE REGRESSION