This document discusses the key assumptions of linear regression models:
(1) linearity between dependent and independent variables,
(2) independent and normally distributed errors,
(3) homoscedasticity or constant variance of errors.
It describes how to check these assumptions by examining residual plots and statistical tests, and provides examples of what abnormal patterns may indicate violations of assumptions. Remedies discussed include transforming variables to achieve linearity and normality or working with smaller intervals of data to address heteroscedasticity.
Guided notes covering background material on Statistics for IB Biology. This content was formerly Topic 1 but it is no longer a formal topic for the new 2016 syllabus.
Guided notes covering background material on Statistics for IB Biology. This content was formerly Topic 1 but it is no longer a formal topic for the new 2016 syllabus.
Data Science - Part IV - Regression Analysis & ANOVADerek Kane
This lecture provides an overview of linear regression analysis, interaction terms, ANOVA, optimization, log-level, and log-log transformations. The first practical example centers around the Boston housing market where the second example dives into business applications of regression analysis in a supermarket retailer.
Cancer cell metabolism: special Reference to Lactate PathwayAADYARAJPANDEY1
Normal Cell Metabolism:
Cellular respiration describes the series of steps that cells use to break down sugar and other chemicals to get the energy we need to function.
Energy is stored in the bonds of glucose and when glucose is broken down, much of that energy is released.
Cell utilize energy in the form of ATP.
The first step of respiration is called glycolysis. In a series of steps, glycolysis breaks glucose into two smaller molecules - a chemical called pyruvate. A small amount of ATP is formed during this process.
Most healthy cells continue the breakdown in a second process, called the Kreb's cycle. The Kreb's cycle allows cells to “burn” the pyruvates made in glycolysis to get more ATP.
The last step in the breakdown of glucose is called oxidative phosphorylation (Ox-Phos).
It takes place in specialized cell structures called mitochondria. This process produces a large amount of ATP. Importantly, cells need oxygen to complete oxidative phosphorylation.
If a cell completes only glycolysis, only 2 molecules of ATP are made per glucose. However, if the cell completes the entire respiration process (glycolysis - Kreb's - oxidative phosphorylation), about 36 molecules of ATP are created, giving it much more energy to use.
IN CANCER CELL:
Unlike healthy cells that "burn" the entire molecule of sugar to capture a large amount of energy as ATP, cancer cells are wasteful.
Cancer cells only partially break down sugar molecules. They overuse the first step of respiration, glycolysis. They frequently do not complete the second step, oxidative phosphorylation.
This results in only 2 molecules of ATP per each glucose molecule instead of the 36 or so ATPs healthy cells gain. As a result, cancer cells need to use a lot more sugar molecules to get enough energy to survive.
Unlike healthy cells that "burn" the entire molecule of sugar to capture a large amount of energy as ATP, cancer cells are wasteful.
Cancer cells only partially break down sugar molecules. They overuse the first step of respiration, glycolysis. They frequently do not complete the second step, oxidative phosphorylation.
This results in only 2 molecules of ATP per each glucose molecule instead of the 36 or so ATPs healthy cells gain. As a result, cancer cells need to use a lot more sugar molecules to get enough energy to survive.
introduction to WARBERG PHENOMENA:
WARBURG EFFECT Usually, cancer cells are highly glycolytic (glucose addiction) and take up more glucose than do normal cells from outside.
Otto Heinrich Warburg (; 8 October 1883 – 1 August 1970) In 1931 was awarded the Nobel Prize in Physiology for his "discovery of the nature and mode of action of the respiratory enzyme.
WARNBURG EFFECT : cancer cells under aerobic (well-oxygenated) conditions to metabolize glucose to lactate (aerobic glycolysis) is known as the Warburg effect. Warburg made the observation that tumor slices consume glucose and secrete lactate at a higher rate than normal tissues.
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Sérgio Sacani
We characterize the earliest galaxy population in the JADES Origins Field (JOF), the deepest
imaging field observed with JWST. We make use of the ancillary Hubble optical images (5 filters
spanning 0.4−0.9µm) and novel JWST images with 14 filters spanning 0.8−5µm, including 7 mediumband filters, and reaching total exposure times of up to 46 hours per filter. We combine all our data
at > 2.3µm to construct an ultradeep image, reaching as deep as ≈ 31.4 AB mag in the stack and
30.3-31.0 AB mag (5σ, r = 0.1” circular aperture) in individual filters. We measure photometric
redshifts and use robust selection criteria to identify a sample of eight galaxy candidates at redshifts
z = 11.5 − 15. These objects show compact half-light radii of R1/2 ∼ 50 − 200pc, stellar masses of
M⋆ ∼ 107−108M⊙, and star-formation rates of SFR ∼ 0.1−1 M⊙ yr−1
. Our search finds no candidates
at 15 < z < 20, placing upper limits at these redshifts. We develop a forward modeling approach to
infer the properties of the evolving luminosity function without binning in redshift or luminosity that
marginalizes over the photometric redshift uncertainty of our candidate galaxies and incorporates the
impact of non-detections. We find a z = 12 luminosity function in good agreement with prior results,
and that the luminosity function normalization and UV luminosity density decline by a factor of ∼ 2.5
from z = 12 to z = 14. We discuss the possible implications of our results in the context of theoretical
models for evolution of the dark matter halo mass function.
Seminar of U.V. Spectroscopy by SAMIR PANDASAMIR PANDA
Spectroscopy is a branch of science dealing the study of interaction of electromagnetic radiation with matter.
Ultraviolet-visible spectroscopy refers to absorption spectroscopy or reflect spectroscopy in the UV-VIS spectral region.
Ultraviolet-visible spectroscopy is an analytical method that can measure the amount of light received by the analyte.
Richard's entangled aventures in wonderlandRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Sérgio Sacani
Since volcanic activity was first discovered on Io from Voyager images in 1979, changes
on Io’s surface have been monitored from both spacecraft and ground-based telescopes.
Here, we present the highest spatial resolution images of Io ever obtained from a groundbased telescope. These images, acquired by the SHARK-VIS instrument on the Large
Binocular Telescope, show evidence of a major resurfacing event on Io’s trailing hemisphere. When compared to the most recent spacecraft images, the SHARK-VIS images
show that a plume deposit from a powerful eruption at Pillan Patera has covered part
of the long-lived Pele plume deposit. Although this type of resurfacing event may be common on Io, few have been detected due to the rarity of spacecraft visits and the previously low spatial resolution available from Earth-based telescopes. The SHARK-VIS instrument ushers in a new era of high resolution imaging of Io’s surface using adaptive
optics at visible wavelengths.
A brief information about the SCOP protein database used in bioinformatics.
The Structural Classification of Proteins (SCOP) database is a comprehensive and authoritative resource for the structural and evolutionary relationships of proteins. It provides a detailed and curated classification of protein structures, grouping them into families, superfamilies, and folds based on their structural and sequence similarities.
Nutraceutical market, scope and growth: Herbal drug technologyLokesh Patil
As consumer awareness of health and wellness rises, the nutraceutical market—which includes goods like functional meals, drinks, and dietary supplements that provide health advantages beyond basic nutrition—is growing significantly. As healthcare expenses rise, the population ages, and people want natural and preventative health solutions more and more, this industry is increasing quickly. Further driving market expansion are product formulation innovations and the use of cutting-edge technology for customized nutrition. With its worldwide reach, the nutraceutical industry is expected to keep growing and provide significant chances for research and investment in a number of categories, including vitamins, minerals, probiotics, and herbal supplements.
This presentation explores a brief idea about the structural and functional attributes of nucleotides, the structure and function of genetic materials along with the impact of UV rays and pH upon them.
1. SLR Model Assumptions
SPSS-Model Check
Introduced by
Dr. Nermin Osman
Assistant Lecturer of Medical Statistics,
MRI, Alexandria University
United Nations System Staff College
Intern-UNITAR
6. Four principal assumptions which justify the
use of linear regression models for purposes
of prediction
(1) linearity of the relationship between dependent and
independent variables
(2) Independence of the errors (no serial correlation)
(3) homoscedasticity (constant variance) of the errors
(a) versus time
(b) versus the predictions (or versus any independent variable)
(4) normality of the error distribution.
7. Other Potential assumption violations
Multicollinearity: X variables that are nearly linear
combinations of other X variables in the equation
Outliers/ Influentials: apparent non-normality by a few data
points
9. N.B. Errors and residuals
The error of an observed value is the deviation of
the observed value from the true value (for
example, a population mean)
The residual of an observed value is the difference
between the observed value and
the estimated value of the quantity of interest (for
example, a sample mean).
11. Violations of linearity: How to detect?
Nonlinearity is usually most evident in a plot of:
◦ Residuals versus predicted values (Ŷ) (The points should be
symmetrically distributed around a horizontal line )
12.
13.
14.
15.
16.
17. Abnormal Patterns in Residual Plots
Figures a). and b). suggest
non-linear relationship
between X and Y.
Fig. c). Suggests
autocorrelation.
Fig. d). Suggests variance is
not the same since the
spread of Y values is far
greater for larger values of
X.
31. Violation of normality:
How to fix?
violations of normality often arise either because :
◦ (a) the distributions of the dependent and/or independent variables are
themselves significantly non-normal,
◦ (b) the linearity assumption is violated.
In both cases, a nonlinear transformation of variables might cure both
problems.
43. What to do about Non-Normality
Transform data
◦ positive skew/ high kurotosis log transform
◦ negative skew/ low kurotosis - - power X square
43
47. Heteroscedasticity
This assumption is a about heteroscedasticity of
the residuals
◦ Hetero=different
◦ Scedastic = scattered
◦ we want our data to be homoscedastic
◦ (Funnelling in / Funnelling Out)
47
48. Effects of Violations of homoscedasticity
make it difficult to depict the true standard deviation of the
forecast errors, usually resulting in confidence intervals that are
too wide or too narrow.
E.g. if the variance of the errors is increasing over time,
confidence intervals for out-of-sample predictions will tend to be
unrealistically narrow.
Heteroscedasticity may also have the effect of giving too much
weight to small subset of the data (namely the subset where the
error variance was largest) when estimating coefficients.
49. Why is heteroscedasticity a problem?
Heteroscedasticity does not give us biased estimates of
the coefficients--however, it does make the standard
errors of the estimates unreliable.
Due to the aforementioned problem, t-tests cannot be
trusted. We run the risk of rejecting a null hypothesis that
should not be rejected.
50. Heteroscedasticity:
How to detect?
look at plots of residuals versus time and residuals versus predicted
value,
You might also want to plot residuals versus some of the independent
variables.)
51.
52.
53.
54.
55.
56. Heteroscedasticity:
How to fix?
a simple fix would be to work with shorter intervals of data
in which variance is more nearly constant.
Heteroscedasticity can also be a byproduct of a significant
violation of the linearity and/or independence assumptions,
in which case it may also be fixed as a byproduct of fixing
those problems.
57. What to do about Non-Normality
Transform data
◦ positive skew – log transform
◦ negative skew - - power X square
57