This PhD thesis examines using conformal predictors to estimate air pollution concentrations. Ordinary kriging and ridge regression confidence machine (RRCM) models are used to model nitrogen dioxide and particulate matter levels in Barcelona. RRCM provides prediction intervals rather than point estimates. Different kernels, including Gaussian and polynomial, are tested in the RRCM approach. Results show that kernel methods can improve upon the default linear model, with Gaussian generally performing best. Conformal predictors provide valid confidence levels for air pollution estimates.
Recent Trend in Pharmaceutical Chemistry Sagar ghotekarSagar Ghotekar
Recent Trend in Pharmaceutical Chemistry which are turning it into Green Chemistry
Organic chemistry is the base of pharmacy, without chemistry a pharmacist can not understand the chemical formulae of the drugs and the design of new drugs. In this way a pharmacist can not practice the field of pharmacy without a plentiful knowledge about chemistry.
“ Chemistry has an important role to play in achieving a sustainable civilization on earth.”
Working in the pharmaceutical industry is challenging. The graduate needs to have both interpersonal and interpersonal skills to work with other colleagues. In addition to a bachelor's degree, graduates have to be trained in Good Laboratory Practices (GLP) and Good Manufacturing Practices (GMP).
Introductory PPT on Metal Carbonyls having its' classification,structure and applications.This is a basic level PPT specially prepared for UG/PG Chemistry students.
Crack CSIR UGC NET chemical science - Study Planshekhar suman
Here are few tips on How you can Crack CSIR UGC NET chemical science in first attempt. Study plan for CSIR UGC NET chemical science has been discussed.
The fabrication methodology of a composite part depends mainly on three factors:
(i) the characteristics of matrices and reinforcements,
(ii) the shapes, sizes and engineering details of products, and
(iii) end uses.
The composite products are too many and cover a very wide domain of applications ranging from an engine valve to an aircraft wing.
The fabrication technique varies from one product to the other.
Recent Trend in Pharmaceutical Chemistry Sagar ghotekarSagar Ghotekar
Recent Trend in Pharmaceutical Chemistry which are turning it into Green Chemistry
Organic chemistry is the base of pharmacy, without chemistry a pharmacist can not understand the chemical formulae of the drugs and the design of new drugs. In this way a pharmacist can not practice the field of pharmacy without a plentiful knowledge about chemistry.
“ Chemistry has an important role to play in achieving a sustainable civilization on earth.”
Working in the pharmaceutical industry is challenging. The graduate needs to have both interpersonal and interpersonal skills to work with other colleagues. In addition to a bachelor's degree, graduates have to be trained in Good Laboratory Practices (GLP) and Good Manufacturing Practices (GMP).
Introductory PPT on Metal Carbonyls having its' classification,structure and applications.This is a basic level PPT specially prepared for UG/PG Chemistry students.
Crack CSIR UGC NET chemical science - Study Planshekhar suman
Here are few tips on How you can Crack CSIR UGC NET chemical science in first attempt. Study plan for CSIR UGC NET chemical science has been discussed.
The fabrication methodology of a composite part depends mainly on three factors:
(i) the characteristics of matrices and reinforcements,
(ii) the shapes, sizes and engineering details of products, and
(iii) end uses.
The composite products are too many and cover a very wide domain of applications ranging from an engine valve to an aircraft wing.
The fabrication technique varies from one product to the other.
Editor: Eng. Mohamadreza Govahi
Mentor: Dr. Ehsan Borhani
Date of Presentation: Apr 2016, Semnan PN Univeristy
*Contents
~Introduction to MMCs
~Introduction to Aluminum MMCs (AMMCs)
~Ceramic Reinforcements in AMMCs
~Types and Morphology of Reinforcements
~Aluminum Nano-composites
~Producing Methods
~Comparison in Different Procedures
~Reviews of some Experiments And Researches
If we mix a group of oils with insect repellent nature with some of the pungent smell of plants and the formation of a homogeneous mixture of them will be able to repel insects with the preservation of the environment and the security of the human.
Seminar on tribological behaviour of alumina reinfoeced composite material na...Sidharth Adhikari
THIS SEMINAR IS ON TRIBOLOGY BEHAVIOR OF ALUMINA REINFOCED COMPOSITE MATERIAL AND BRAKE DISK MATERIAL
MTECH SECOND SEMESTER SEMINAR ,CENTRE FOR ADVANCE POST-GRADUATE STUDIES,BPUT,ROURKELA
Moving Target Detection Using CA, SO and GO-CFAR detectors in Nonhomogeneous ...mathsjournal
systems in complex situations. A fundamental problem in radar systems is to automatically detect targets while maintaining a
desired constant false alarm probability. This work studies two detection approaches, the first with a fixed threshold and the
other with an adaptive one. In the latter, we have learned the three types of detectors CA, SO, and GO-CFAR. This research
aims to apply intelligent techniques to improve detection performance in a nonhomogeneous environment using standard
CFAR detectors. The objective is to maintain the false alarm probability and enhance target detection by combining
intelligent techniques. With these objectives in mind, implementing standard CFAR detectors is applied to nonhomogeneous
environment data. The primary focus is understanding the reason for the false detection when applying standard CFAR
detectors in a nonhomogeneous environment and how to avoid it using intelligent approaches.
Moving Target Detection Using CA, SO and GO-CFAR detectors in Nonhomogeneous ...mathsjournal
Modernization of radar technology and improved signal processing techniques are necessary to improve detection systems in complex situations. A fundamental problem in radar systems is to automatically detect targets while maintaining a
desired constant false alarm probability. This work studies two detection approaches, the first with a fixed threshold and the
other with an adaptive one. In the latter, we have learned the three types of detectors CA, SO, and GO-CFAR. This research
aims to apply intelligent techniques to improve detection performance in a nonhomogeneous environment using standard
CFAR detectors. The objective is to maintain the false alarm probability and enhance target detection by combining
intelligent techniques. With these objectives in mind, implementing standard CFAR detectors is applied to nonhomogeneous
environment data. The primary focus is understanding the reason for the false detection when applying standard CFAR
detectors in a nonhomogeneous environment and how to avoid it using intelligent approaches
Improvement of Anomaly Detection Algorithms in Hyperspectral Images Using Dis...sipij
Recently anomaly detection (AD) has become an important application for target detection in hyperspectral remotely sensed images. In many applications, in addition to high accuracy of detection we need a fast and reliable algorithm as well. This paper presents a novel method to improve the performance of current AD algorithms. The proposed method first calculates Discrete Wavelet Transform (DWT) of every pixel vector of image using Daubechies4 wavelet. Then, AD algorithm performs on four bands of “Wavelet transform” matrix which are the approximation of main image. In this research some benchmark AD algorithms including Local RX, DWRX and DWEST have been implemented on Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) hyperspectral datasets. Experimental results demonstrate significant improvement of runtime in proposed method. In addition, this method improves the accuracy of AD algorithms because of DWT’s power in extracting approximation coefficients of signal, which contain the main behaviour of signal, and abandon the redundant information in hyperspectral image data.
The remote sensing working group has investigated methodology for atmospheric remotesensing retrievals, which are mathematical and computational procedures for inferring the state of the atmosphere from remote sensing observations. Satellite data with fine spatial and temporal
resolution present opportunities to combine information across satellite pixels using spatiotemporal statistical modeling. We present examples of this approach at the process level of a hierarchical model, with a nonlinear radiative transfer model incorporated into the likelihood. In
this framework, we assess the impact of various statistical properties on the relative performance of a multi-pixel retrieval strategy versus an operational one-at-a-time approach. The prospect of adopting the approach is illustrated in the context of estimating atmospheric carbon dioxide concentration with data from NASA's Orbiting Carbon Observatory-2 (OCO-2).
Boosting CED Using Robust Orientation Estimationijma
n this paper, Coherence Enhancement Diffusion (CED) is boosted feeding external orientation using new
robust orientation estimation. In CED, proper scale selection is very important as the gradient vector at
that scale reflects the orientation of local ridge. For this purpose a new scheme is proposed in which pre
calculated orientation, by using local and integration scales. From the experiments it is found the proposed
scheme is working much better in noisy environment as compared to the traditional Coherence
Enhancement Diffusion
Path Loss Prediction by Robust Regression Methodsijceronline
International Journal of Computational Engineering Research (IJCER) is dedicated to protecting personal information and will make every reasonable effort to handle collected information appropriately. All information collected, as well as related requests, will be handled as carefully and efficiently as possible in accordance with IJCER standards for integrity and objectivity.
A statistical approach to spectrum sensing using bayes factor and p-ValuesIJECEIAES
The sensing methods with multiple receive antennas in the Cognitive Radio (CR) device, provide a promising solution for reducing the error rates in the detection of the Primary User (PU) signal. The received Signal to Noise Ratio at the CR receiver is enhanced using the diversity combiners. This paper proposes a statistical approach based on minimum Bayes factors and p-Values as diversity combiners in the spectrum sensing scenario. The effect of these statistical measures in sensing the spectrum in a CR environment is investigated. Through extensive Monte Carlo simulations it is shown that this novel statistical approach based on Bayes factors provides a promising solution to combine the test statistics from multiple receiver antennas and can be used as an alternative to the conventional hypothesis testing methods for spectrum sensing. The Bayesian results provide more accurate results when measuring the strength of the evidence against the hypothesis.
Calculation of solar radiation by using regression methodsmehmet şahin
Abstract. In this study, solar radiation was estimated at 53 location over Turkey with
varying climatic conditions using the Linear, Ridge, Lasso, Smoother, Partial least, KNN
and Gaussian process regression methods. The data of 2002 and 2003 years were used to
obtain regression coefficients of relevant methods. The coefficients were obtained based on
the input parameters. Input parameters were month, altitude, latitude, longitude and landsurface
temperature (LST).The values for LST were obtained from the data of the National
Oceanic and Atmospheric Administration Advanced Very High Resolution Radiometer
(NOAA-AVHRR) satellite. Solar radiation was calculated using obtained coefficients in
regression methods for 2004 year. The results were compared statistically. The most
successful method was Gaussian process regression method. The most unsuccessful method
was lasso regression method. While means bias error (MBE) value of Gaussian process
regression method was 0,274 MJ/m2, root mean square error (RMSE) value of method was
calculated as 2,260 MJ/m2. The correlation coefficient of related method was calculated as
0,941. Statistical results are consistent with the literature. Used the Gaussian process
regression method is recommended for other studies.
Quite often in experimental work, many situations arise where some observations are lost or become
unavailable due to some accidents or cost constraints. When there are missing observations, some
desirable design properties like orthogonality,rotatability and optimality can be adversely affected. Some
attention has been given, in literature, to investigating the prediction capability of response surface
designs; however, little or no effort has been devoted to investigating same for such designs when some
observations are missing. This work therefore investigates the impact of a single missing observation of the
various design points: factorial, axial and center points, on the estimation and predictive capability of
Central Composite Designs (CCDs). It was observed that for each of the designs considered, precision of
model parameter estimates and the design prediction properties were adversely affected by the missing
observations and that the largest loss in precision of parameters corresponds to a missing factorial point.
Boosting ced using robust orientation estimationijma
In this paper, Coherence Enhancement Diffusion (CED) is boosted feeding external orientation using new
robust orientation estimation. In CED, proper scale selection is very important as the gradient vector at
that scale reflects the orientation of local ridge. For this purpose a new scheme is proposed in which pre
calculated orientation, by using local and integration scales. From the experiments it is found the proposed
scheme is working much better in noisy environment as compared to the traditional Coherence
Enhancement Diffusion
Composite Analysis of Phase Resolved Partial Discharge Patterns using Statist...IJMER
International Journal of Modern Engineering Research (IJMER) is Peer reviewed, online Journal. It serves as an international archival forum of scholarly research related to engineering and science education.
International Journal of Modern Engineering Research (IJMER) covers all the fields of engineering and science: Electrical Engineering, Mechanical Engineering, Civil Engineering, Chemical Engineering, Computer Engineering, Agricultural Engineering, Aerospace Engineering, Thermodynamics, Structural Engineering, Control Engineering, Robotics, Mechatronics, Fluid Mechanics, Nanotechnology, Simulators, Web-based Learning, Remote Laboratories, Engineering Design Methods, Education Research, Students' Satisfaction and Motivation, Global Projects, and Assessment…. And many more.
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...gerogepatton
Deep learning provide successful applications in many fields. Recently, machines learning are involved for oceans remote sensing applications. In this study, we use and compare about eight (8) deep learning estimators
for retrieval of a mainly pigment of phytoplankton. Depending on the water case and the multiple instruments simultaneously observing the earth on a variety of platforms, several algorithm are used to estimate the chlolophyll-a from marine eflectance.By using a long-term multi-sensor time-series of satellite ocean-colour data, as MODIS, SeaWifs, VIIRS, MERIS, etc…, we make a unique deep network model able to establish a relationship between sea surface reflectance and chlorophyll-a from any measurement satellite sensor over West
Africa. These data fusion take into account the bias between case water and instruments. We construct several chlorophyll-a concentration prediction deep learning based models, compare them and therefore use the best for our study. Results obtained for accuracy training and test are quite good. The mean absolute error are very low and vary between 0,07 to 0,13 mg/m3.
OPTIMIZATION OF MANUFACTURE OF FIELDEFFECT HETEROTRANSISTORS WITHOUT P-NJUNCT...ijrap
It has been recently shown, that manufacturing p-n-junctions, field-effect and bipolar transistors, thyristors
in a multilayer structure by diffusion or ion implantation under condition of optimization of dopant and/or
radiation defects leads to increasing of sharpness of p-n-junctions (both single p-n-junctions and p-njunctions,
which include into their system). In this situation one can also obtain increasing of homogeneity
of dopant in doped area. In this paper we consider manufacturing a field-effect heterotransistor without pn-
junction. Optimization of technological process with using inhomogeneity of heterostructure give us
possibility to manufacture the transistors as more compact.
Cell hole identification in carcinogenic segment using Geodesic Methodology: ...Soumen Santra
Indian Economic Association organized the 106th Annual conference at University of Delhi.
This ppt awarded as Best research paper in the theme of Research (including education) Data and Artificial Intelligence for development.
Similar to Olga Ivina PhD thesis presentation short (20)
Cell hole identification in carcinogenic segment using Geodesic Methodology: ...
Olga Ivina PhD thesis presentation short
1. Conformal prediction of air pollution concentrations for
the Barcelona Metropolitan Region
PhD Thesis summary
Olga Ivina
University of Girona
GRECS research group
CIBER de Epidemiolog´ y la Salud P´blica
ıa u
November 22, 2012
1 / 42
2. Outline
Introduction
Air pollution and its effects
Air pollution exposure assessment
Conformal predictors for air pollution problem
Objectives
Methods and data
Kriging
Conformal predictors
Computing
Data
Results
Ordinary kriging and RRCM models in default setting
Kernelisation: a Gaussian kernel
Kernelisation: other kernels
Comparison of models
Discussion
Conclusion
Conformal predictors and geostatistics
Future research
2 / 42
3. Air pollution and its effects
Introduction
Air pollutant is a problem of growing concern all over the world.
There exists great body of scientific evidence of hazardous effect of air
pollution on people’s health and well-being, as well as on general
ecological condition of our planet.
In people: association with adverse health outcomes - both in adults and
in children. Children are specially susceptible to pollution. They get
affected from the very first stages of their lives and on. Linked outcomes
(to name a few):
- preterm birth and low birth weight
- asthma aggravation, cough and bronchitis
- allergies: hay fever, rhinitis, ...
- excess risk of mortality
3 / 42
4. Air pollution and its effects - 2
Introduction
Adults are influenced by pollution as well. In them, pollution is linked to
both long-term and short-term health effects (to name a few):
- respiratory: COPD, asthma, chronic bronchitis
- lung cancer
- cardiovascular morbidity
- mortality: cancer, all-cause, cardiopulmonary, non-accidental,...
Special factors of impact: SES and geographical location of a person.
4 / 42
5. Air pollution and its effects - 3
Introduction
Global air pollution map produced by Envisat’s SCIAMACHY.
Authors: S. Beirle, U. Platt and T. Wagner, University of Heidelberg’s Institute for Environmental Physics.
5 / 42
6. Air pollution and its effects - 4
Introduction
The main contributor to air pollution in urban areas is traffic. Two -
”criteria” - traffic-related air pollutants are taken up in this study:
- nitrogen dioxide (NO2)
- particulate matter PM10
NO2 effects:
short-term: respiratory effects and asthma aggravation
long-term: risk of coronary heart disease and fatal events
PM10 effects:
short-term: aggravation of respiratory and cardiovascular diseases,
premature death, ...
long-term: development of heart and lung diseases, premature
death,...
6 / 42
7. Air pollution exposure assessment
Introduction
Problem: direct measurements of pollution not always available.
There exists a large number of models aimed t predict pollution at a given
spot. The main classes are:
- proximity models
- geostatistical models
- land use regression (LUR) models
- dispersion models
- integrated meteorological emission (IME) models
- hybrid models
7 / 42
8. Conformal predictors for air pollution problem
Introduction
Problem: nowadays existing methods for air pollution exposure
assessment may lack confidence in predictions.
In order to tackle this problem, this research suggests making use of a
newly developed approach that is conformal predictors. A conformal
predictor is a “confidence predictor”, where the level of confidence for
prediction is introduced ad hoc. This prediction is always valid - provided
by definition of conformal predictor.
8 / 42
9. Conformal predictors for air pollution problem - 2
Introduction
A conformal predictor is defined by some nonconformity measure, and it
has two major desiderata:
- validity of predictions
- efficiency of preditions
Conformal predictors are flexible: they can be based upon almost any
underlying statistical algorithm.
In air pollution modeling, if a regression-based algorithm is taken up, such
as LUR or kriging, regression residuals serve as a nonconformity measure.
9 / 42
10. Objectives
This dissertation has two major objectives:
1 To demonstrate the capacity of conformal predictors as a method for
spatial environmental modeling.
2 To provide valid estimates of nitrogen dioxide and fine particulate
matter for Barcelona Metropolitan Region.
10 / 42
11. Kriging
Methods and data
Kriging is a spatial interpolation method. Provides a prediction of a factor
of interest in an unobserved point on the basis of a set of observed points.
Also provides an estimate of error variance (called “kriging variance”).
First introduced in 1951 by a South African engineer D.H. Krige in his
master work devoted to estimation of a mineral ore body. The method has
been further developed: nowadays the notion “kriging” stands for asset of
methods such as ordinary kriging, simple kriging, co-kriging, Bayesian
kriging etc.
In its simples form, a kriging estimate of the data at an unobserved
location is a linear combination of the observed data. The coefficients of
the equation depend on spatial structure of the data and on the spatial
covariance.
11 / 42
12. Kriging - 2
Methods and data
The most common kriging is ordinary kriging. It is used when the mean
of the second order stationary process is unknown. It is based on a
geostatistical concept of variogram, and its approach - covariance function.
Let there be n neighboring observed locations, x1 , . . . , xn , and an
unobserved location x0 , on a spatial domain D. Let Z (x) : x ∈ D denote
the process, and let it have a variogram γ(h). Then the ordinary kriging
∗
estimate ZOK (x0 ) at the unobserved point x0 will take the following
analytical form:
n
∗
ZOK (x0 ) = ωα Z (xα ), (1)
α=1
where ωα are the kriging weights. Ordinary kriging provides BLUE
estimates of a random field, together with an error variance estimate
(kriging variance.)
12 / 42
13. New methods. Conformal predictors
Methods and data
How it works? Provided: pairs of observations of (xi , yi ) where xi is an
object and yi is a label. Then
Z := X × Y (2)
denotes the example space. Z is a measurable space. Given an incomplete
data sequence (x1 , y1 ), (x2 , y2 ), . . . , (xn−1 , yn−1 ) ∈ Z∗ , the aim is to predict
a label yn for an object xn . An operator:
D : Z∗ × X → Y (3)
denotes then a simple predictor. (e.g., an ordinary kriging predictor).
13 / 42
14. New methods. Conformal predictors - 2
Methods and data
The prediction can be described as:
yn = D(x1 , y1 , x2 , y2 , . . . ; xn−1 ), Yn ∈ Y. (4)
Let us allow the predictor to output the prediction sets Yn large enough to
provide the confidence in prediction. This means, that the real value of yn
will fall in Yn with a given level of confidence, which is chosen and
provided to a predictor ad hoc.
A conformal predictor is a confidence predictor defined by some
nonconformity measure. Given the measure, a conformal predictor outputs
the prediction set assuming that the new example conforms with the
observed ones.
14 / 42
15. New methods. Conformal predictors - 3
Methods and data
Ridge regression confidence machine (RRCM) is a regression-based
conformal predictor. It makes use of the ridge regression procedure (A. E.
Hoerl, 1971) as an underlying algorithm.
Suppose Xn is the n × p matrix of objects (independent variables), and Yn
is the vector of labels (dependent variables). Then, a RRCM estimate of
parameters ω takes form:
ω = (Xn Xn + aIp )−1 Xn Yn , (5)
where a is a ridge factor. a = 0 yields a standard least squares estimate.
The nonconformity scores for this predictor are the regression residuals:
|ei | := |yi − yi |.
ˆ
15 / 42
16. New methods. Conformal predictors - 4
Methods and data
Based on a significance level for prediction introduced (roughly, a
probability of error not to exceed), a RRCM predictor outputs a set of
labels y for yn :
Si := {y : αi (y ) ≥ αn (y )} = {y : |ai + bi y | ≥ |an + bn y |}, (6)
where ai and bi are the components of the vectors A and B.
RRCM outputs prediction sets instead of point predictions (what kriging
does). These sets can be in form of a point, an interval, a ray, a union of
two rays, the whole real line, or empty. Usually, it is an interval.
16 / 42
17. New methods. Conformal predictors - 5
Methods and data
When the number of parameters p is large, computation is hard. “Kernel
trick” is a method that helps deal with hight-dimensional data. It allows to
consider nonlinearity in RRCM.
A kernel is a similarity measure that operates in a feature space. Provided
an input space X with a dot product, and an operator Φ that maps X to a
feature space H:
Φ:X →H
x → x := Φ(x)
a kernel will be defined as follows. For xα , xβ ∈ X :
k(xα , xβ ) = Φ(xα ), Φ(xβ ) (7)
17 / 42
18. New methods. Conformal predictors - 6
Methods and data
Any conventional covariance function for kriging can be taken up as
a kernel for RRCM. This research uses three (positive definite) kernels:
a dot product kernel (default)
a radial basis Gaussian kernel
an inhomogeneous polynomial kernel of a second degree
18 / 42
19. Computing
Methods and data
All computational work made with R.
- Kriging: geoR package. Function krige.conv
- RRCM: PredictiveRegression package. Function iidpred.
- “Kernel trick” self-developed (on the basis of the PredictiveRegression
:
package) functions for RRCM in “dual form” and for implementing the
kernels.
19 / 42
20. Data
Methods and data
The data for this study has been kindly provided by XVPCA (Network for
Monitoring and Forecasting of Air Pollution) of the Generalitat de
Catalunya.
Mean annual concentrations of two criteria pollutants, NO2 and PM10, are
provided for the Barcelona Metropolitan Region, together with the
geographical coordinates of the monitoring stations(Mercator, UTM 31).
Time frames:
- NO2: 1998 - 2009, ex. 2003
- PM10: 2001 - 2009, ex.2003
20 / 42
21. Data - 2
Methods and data
49 monitoring stations over the area in total.
Barcelona Metropolitan Region has a territory of about 3200 km2 and
accommodates over 5 million inhabitants.
In BMR, there happen about 107 million displacements weekly, 54.1% of
them - by means of motorized transport.
21 / 42
22. Data - 3
Methods and data
Table: 1. Data on mean annual nitrogen dioxide concentrations
Available observations for each year
1998 1999 2000 2001 2002 2004 2005 2006 2007 2008 2009
24 25 25 25 25 24 22 24 25 25 24
Table: 2. Data on mean annual particulate matter concentrations
Available observations for each year
2001 2002 2004 2005 2006 2007 2008 2009
22 24 28 28 29 30 33 36
22 / 42
23. Data - 4
Methods and data
Two major drawbacks, or limiting factors, of the data set:
Size: there was a small number of observations for each year and
pollutant,
Distribution: the measurement spots are situated quite far apart
from one another, and they are distributed, or placed, unevenly over
the geographic region.
Also, the data is the mean averages, and more frequent observations were
unavailable for this study.
23 / 42
36. Efficiency of predictions
Discussion
Kriging predictions are smooth and vary little, also made for mean annual
data. Error estimates, however, are huge in case of nitrogen dioxide, and
small in case of airborne particles - subject to properties of the substances:
NO2 is known to have a generally larger variability than PM10.
Kriging intervals can be derived, assuming the Gaussianity of data
distribution. This assumption is common, but not always correct. RRCM
makes no assumption on data distribution, apart from being iid.
Two factors help boost the efficiency of RRCM prediction: kernels and
ridge factor. The least is chosen by the brute force method (or the method
of consecutive approximations).
36 / 42
37. Conformal predictors and geostatistics
Conclusion
Table: Comparison of OK and RRCM
OK RRCM
point predictions prediction sets (usually intervals)
regression algorithm regression algorithm
Gaussianity assumption iid assumption
estimates error variance -
uses variogram and uses any appropriate
covariance function kernel
to approach it
- ridge factor
may lack confidence confidence level is
chosen and guaranteed
37 / 42
38. Future research
Conclusion
Extend the existing data set for BMR
Provide additional validation for the methods
Test these models on the data for other cities
Develop conformal predictors on the basis of other popular air
pollution exposure modeling algorithms (land use regression,
dispersion models etc.)
38 / 42
39. Selected references
V.Vovk, A.Gammerman, G.Shafer, Algorithmic learning in a random
world, Springer (2005).
V.Vovk, I.Nouretdinov, A. Gammerman, On-line predictive linear
regression, The Annals of Statistics (2009).
H. Wackernagel, Multivariate geostatistics: an introduction with
applications, Springer (2003).
B. Sch¨lkopf, J. Smola, Learning with kernels: support vector
o
machines, regularization, optimization, and beyond, MIT Press
(2002).
A. Lertxundi-Manterola, M. Saez, Modelling of nitrogen dioxide (NO2)
and fine particulate matter (PM10) air pollution in the metropolitan
areas of Barcelona and Bilbao, Spain, Environmetrics (2009).
39 / 42
40. Selected references - 2
A. Hoerl, R. Kennard, Ridge regression: Biased estimation for
nonorthogonal problems, Technometrics 12.1 (1970).
P. Diggle, P. Ribeiro Jr., Model-Based Geostatistics, Springer (2007).
P. Ribeiro Jr., P. Diggle, geoR: a package for geostatistical analysis,
R-NEWS 1.2 (2001).
N. Cressie, Statistics for spatial data, Wiley (1993).
M. Jerrett et al., A review and evaluation of intraurban air pollution
exposure models, Journal of exposure analysis and environmental
epidemiology (2005).
40 / 42