SlideShare a Scribd company logo
1 of 18
Download to read offline
Citation: Sibyan, H.; Svajlenka, J.;
Hermawan, H.; Faqih, N.; Arrizqi,
A.N. Thermal Comfort Prediction
Accuracy with Machine Learning
between Regression Analysis and
Naïve Bayes Classifier. Sustainability
2022, 14, 15663. https://doi.org/
10.3390/su142315663
Academic Editor: Hamid Khayyam
Received: 20 September 2022
Accepted: 21 November 2022
Published: 24 November 2022
Publisher’s Note: MDPI stays neutral
with regard to jurisdictional claims in
published maps and institutional affil-
iations.
Copyright: © 2022 by the authors.
Licensee MDPI, Basel, Switzerland.
This article is an open access article
distributed under the terms and
conditions of the Creative Commons
Attribution (CC BY) license (https://
creativecommons.org/licenses/by/
4.0/).
sustainability
Article
Thermal Comfort Prediction Accuracy with Machine Learning
between Regression Analysis and Naïve Bayes Classifier
Hidayatus Sibyan 1,* , Jozef Svajlenka 2 , Hermawan Hermawan 3 , Nasyiin Faqih 4
and Annisa Nabila Arrizqi 5
1 Department of Informatics Engineering, Qur’anic Science University (Universitas Sains Al-Qur’an),
Jl. Hasyim Asy’ari Km. 03, Wonosobo 56351, Indonesia
2 Department of Construction Technology and Management, Technical University of Košice,
042 00 Košice, Slovakia
3 Department of Architecture, Qur’anic Science University (Universitas Sains Al-Qur’an),
Jl. Hasyim Asy’ari Km. 03, Wonosobo 56351, Indonesia
4 Department of Civil Engineering, Qur’anic Science University (Universitas Sains Al-Qur’an),
Jl. Hasyim Asy’ari Km. 03, Wonosobo 56351, Indonesia
5 Department of Civil Engineering, Universitas Islam Indonesia, Jl. Kaliurang Km.14,5,
Yogyakarta 55584, Indonesia
* Correspondence: hsibyan@unsiq.ac.id; Tel.: +62-852-9104-1812
Abstract: Various data analysis methods can make thermal comfort prediction models. One method
that is often used is multiple linear regression statistical analysis. Regression analysis needs to be
checked for accuracy with other analytical methods. This study compares the making of a thermal
comfort prediction model with regression analysis and naïve Bayes analysis. The research method
used quantitative methods for data collection regarding thermal comfort. The thermal comfort
variable, consisting of eight independent variables and one dependent variable, was measured at
Wonosobo High School, Indonesia. The analysis to make the prediction model was carried out with
two different analyses: multiple linear regression analysis and naïve Bayes analysis. The results show
that naïve Bayes is more accurate than multiple linear regression analysis.
Keywords: machine learning; thermal comfort; regression; naïve Bayes; prediction
1. Introduction
The problem of energy and thermal comfort has become a global issue. Many studies
have tried to solve energy problems by predicting thermal comfort with variations in the
type and shape of the building. Thermal comfort predictions can reduce energy wastage in
buildings [1]. Thermal comfort cannot be separated from the scale of thermal sensation,
model, climate, and personal variables. Several studies have used a variety of thermal
sensation scales. The model is the result of an analysis of the thermal comfort variable,
which is used to predict human thermal comfort [2]. Climatic variables associated with
the thermal sensation of residents in naturally ventilated nursing homes can influence the
mathematical model. Regions with different climates will also produce different responses,
so research areas must pay attention to hot or cold areas [3]. A user’s thermal perception is
one of the main variables in thermal comfort research. Field tests can be used in thermal
comfort research Questionnaire based on the thermal perception standard from ASHRAE
55. The analysis uses regression analysis, producing a mathematical model or equation of
thermal comfort [4]. Airflow is one of the factors of thermal comfort. Human perception
of airflow is also essential to know in achieving thermal comfort. Using the laboratory as
an experimental room in research can obtain results that can be accounted for. The use of
regression analysis can be applied to airflow research using an experimental room [5].
Sustainability 2022, 14, 15663. https://doi.org/10.3390/su142315663 https://www.mdpi.com/journal/sustainability
Sustainability 2022, 14, 15663 2 of 18
The thermal comfort research model aims to predict building occupants’ thermal
comfort. The prediction model generated from research is used as a standard for mak-
ing building designs. The current prediction model uses an adaptive thermal comfort
approach. Research verifying the adaptive thermal comfort model has been conducted in
four Brazilian cities. The study used the Preferred Reporting Items for Systematic Review
and Meta-Analysis (PRISMA) method and found that the variation in the verified adaptive
thermal comfort model was more than 90% for the four cities in Brazil [6]. Predictively
modeling personal thermal comfort has become a trending topic in the improvement of
human comfort in rooms. Thermal comfort is closely related to the design and performance
of building systems, especially in sustainable and intelligent buildings [7]. Thermal comfort
modeling is manifold. A basis for modeling that is currently developing is the use of
stochastic algorithms and variables [8]
The model’s accuracy in predicting the thermal comfort of occupants is an important
aspect that must be continuously developed. Accurate models that predict the right results
are convincing [9]. Linear regression is also used in outdoor thermal comfort studies.
Thermal perception and other variables such as air temperature, wind, and sun exposure
have been analyzed using linear regression. The results showed a predictive model of
thermal comfort. In addition, equations about air temperature have also been found based
on wind and sun exposure [10]. In outdoor thermal comfort research, sun exposure is a
factor that is more considered than in indoor thermal comfort research. The temperature of
solar radiation in the room is not too significant compared with the temperature of solar
radiation outside. The mathematical model of thermal comfort in the interior still includes
the average solar radiation temperature according to the thermal comfort factor that has
been formulated [11].
Methods in thermal comfort research include simulation with software, modeling
in the lab, and field testing. Field tests are the most widely used in thermal comfort
research. The measurement of the thermal comfort variable also sometimes coincides with
the measurement of the acoustic and visual comfort variables. A study’s results will show
the relationship between the variables of thermal, acoustic, and visual comfort [12]. The
experimental method is also one of the methods used in thermal comfort research. Some
studies make the experimental space a tool to test a model. The developing technology
makes the experimental space more varied [13]. Simulation methods are often used to
validate a model found in research. Simulation using software is widely used in model
validation. One of the programs used for simulation is ENVI-NET.
Simulations are often combined with field measurements to obtain more valid results,
with both methods carried out in a study [14]. Measurements in the field coupled with
simulations can use building information modeling (BIM). Computational fluid dynamic
(CFD) analysis is often one of the analytical methods in BIM [15]. Several studies have
combined the two methods, but research comparing the two methods has not been widely
carried out.
The development of the database field has led to the emergence of increasingly sophis-
ticated data analysis tools. Machine learning algorithms are a new approach to analyzing
thermal or visual comfort data. Machine learning algorithms as analytical tools are widely
used for research on human comfort in buildings [16]. Machine learning algorithm data
analysis methods continue to be developed and used in various types of buildings. Numer-
ical computing is one of the strengths of machine learning algorithms [17]. The algorithm
can also be used in urban heat island research. Research can produce effective strategies to
reduce the urban heat island effect by avoiding overcrowding in infrastructure develop-
ment; increasing plantations, waterbodies, and roof gardens; and using a white roof color
in construction. Research findings will enable urban planners, policymakers, and local
governments to achieve environmentally friendly outcomes [18]. Machine learning (ML)-
based building models have gained popularity in building predictive control (MPC) models
for building energy management applications. However, ML-based building models are
usually nonlinear in capturing building dynamics, which causes a high computational load
Sustainability 2022, 14, 15663 3 of 18
for MPC models, prohibiting their application in real-time building control [19]. Analysis
of the hot water usage model for the thermal comfort of occupants can also be performed
with machine learning. The created model provides control performance for occupant
adaptation [20].
Developing convenience models using machine learning is inevitable with database
development and automated calculations. Determining the analytical method is essential
to finding an accurate thermal comfort model. The purpose of this study was to compare
the method of multiple linear regression analysis and naïve Bayes in making an accurate
thermal comfort model.
Measuring thermal comfort can be performed with objective and subjective measure-
ments. Objective measurements use a thermal measuring instrument that measures air
temperature, average solar radiation temperature, wind speed, and tire humidity. Subjec-
tive measurements include filling out a thermal sensation questionnaire from ASHRAE by
having the respondent sit for 15 min within the research object [21]. Gender is an important
factor in thermal comfort. The selection of respondents needs to consider gender. Different
sexes will produce different thermal sensations [22]. Thermal comfort measurements have
PMV (predicted mean vote) and PPD (predicted percent dissatisfied) indicators. These two
aspects are the keywords in the analysis of thermal comfort [23].
Thermal comfort prediction with machine learning is still being developed. Prediction
can be used to develop the science of building design. Not many studies predict thermal
comfort using machine learning [24]. Machine learning methods have various forms, often
using mathematical calculations and computational fluid dynamics (CFD). The use of
various machine learning methods means that research results vary [17].
Thermal comfort prediction using a neural network has also been carried out. The
thermoelectric airduct (TE-AD) cooling system is used to predict air temperature, PMV, and
PPD. The prediction model is accurate in predicting thermal comfort [25]. The development
of artificial neural networks for machine learning is still being carried out to find the right
predictions [26]. Machine learning in predicting indoor air quality is also still being
developed [27]. Thermal comfort prediction can also combine methods with thermal
comfort variables [28].
Regression analysis is widely used in thermal comfort research to find a predictive
model of thermal comfort. Currently, there are many models of thermal comfort that use
regression analysis. Many comparisons of machine learning methods to find predictive
models of thermal comfort have been carried out. One of the machine learning methods is
naïve Bayes. A comparison between naïve Bayes and other methods has also been carried
out [29]. Naïve Bayes can be an alternative to machine learning that does not require
complicated calculations. Research that compares the regression method with naïve Bayes
to form a predictive model of thermal comfort needs to be performed so that it is known
that the method is not complicated and that it produces a better predictive model.
Clothing is one of the variables that affects thermal comfort. The majority of students
in Wonosobo, Indonesia, wear closed clothes in carrying out learning activities at school.
Currently, there are few predictive models with respondents who wear closed clothes and
have a religious culture. Thus, this study has the novelty of finding a predictive thermal
comfort model using closed clothing variables in cold areas.
This research can contribute to computer science to find predictive models that are
simple and accurate. Contributions to architecture can be used as a basis for architectural
design by predicting thermal comfort in naturally ventilated buildings.
Some of the related works are shown in Table 1, below.
Sustainability 2022, 14, 15663 4 of 18
Table 1. Related work.
Author Related Work
J. J. Aguilera, J. Toftum,
and O. Berk Kazanci (2019)
The study “Predicting personal thermal preferences based on
data-driven methods” predicts thermal comfort by comparing
artificial neural network (ANN), naïve Bayes (NB), and fuzzy
logic (FL) machine learning algorithms. The results showed that
all methods performed well, with a 70% probability of guessing
correctly [30].
P. I. Benito, M. A. Sebastián,
and C. González-Gaya
(2021)
“Study and Application of Industrial Thermal Comfort
Parameters Using Bayesian Inference Techniques” focuses on
using Bayesian analysis for thermal comfort in the industry. It
compared the results with the linear regression method used for
thermal comfort. Bayesian analysis has a better ability to develop
intelligent and thermally comfortable systems [29].
B. Yang et al. (2022)
The study “Comparison of models for predicting individual
winter thermal comfort based on machine learning algorithms”
aims to develop a comparison of thermal comfort models based
on skin temperature and environmental factors. The comparative
study includes four models: support vector machine, decision
tree, ensemble algorithms, and K-nearest neighbor. The thermal
comfort prediction accuracy rate reaches 95.8% [31].
R. Zhang, D. Liu,
and L. Shi (2022)
The study “Thermal-comfort optimization design method for
semi-outdoor stadium using machine learning” aims to reveal the
relationship between stadium shape and thermal performance
using an artificial neural network approach and genetic
algorithms. The simulation results show that the simulation is
close to the actual measurement and can be used for stadium
optimization, which can be increased by 8.96% [17].
2. Materials and Methods
This research compares two data analysis methods, regression analysis and naïve
Bayes. The data used results from the measurement of thermal comfort in the field. Respon-
dents were students at two private high schools in Kejajar District, Wonosobo Regency. The
variables used are gender, age, height, weight, temperature, globe temperature, humidity,
velocity, and thermal sensation vote (TSV). The survey was conducted on two measurement
days in the morning, afternoon, and evening. Temperature and humidity were measured
using a measuring tool with the Extect brand. We measured the globe temperature using a
black copper ball with a diameter of 15 cm.
Respondents were asked to wait for 15 min when the initial measurements were taken.
Respondents were high school students, so respondents had gone through adaptation to
the room. The research subjects involved were male and female respondents. In thermal
comfort research, it is possible that there are differences in the results of thermal sensation
between men and women, although there are studies that say there are not many differ-
ences between men and women [32]. Research subjects were not selected using sampling.
Responses were taken from all students who became the object of the research. The number
of high school students was 252, and all of them were used as research subjects.
Data analysis used regression analysis with SPSS Statistics 25 and weka 3.8.6. The
results of data analysis were compared for accuracy so that more accurate analytical tools
could be found. Regression analysis was performed by making a mathematical equation
model as follows:
Y = α +
8
∑
i=1
βiXi (1)
where X1: gender, X2: age, X3: height, X4: weight, X5: temperature, X6: globe_temperature,
X7: relative_humidity, and X8: velocity.
Sustainability 2022, 14, 15663 5 of 18
Data analysis using naïve Bayes requires a reasonably long process starting with
determining the training data that will be the test data. The calculation is performed by
calculating the class probability P(Y), the probability of each P(X) criterion, and the final
probability. Naïve Bayes analysis will produce an output from Y or TSV predictions. The
resulting output can predict the TSV generated by building occupants with the value of the
independent variable set (Figure 1).
Y 𝛼 𝛽 X (1)
where X1: gender, X2: age, X3: height, X4: weight, X5: temperature, X6: globe_temperature,
X7: relative_humidity, and X8: velocity.
Data analysis using naïve Bayes requires a reasonably long process starting with de-
termining the training data that will be the test data. The calculation is performed by cal-
culating the class probability P(Y), the probability of each P(X) criterion, and the final
probability. Naïve Bayes analysis will produce an output from Y or TSV predictions. The
resulting output can predict the TSV generated by building occupants with the value of
the independent variable set (Figure 1).
Figure 1. Naïve Bayes flowchart.
3. Results
The data obtained amounted to 252 datasets. Respondents were 252 high school stu-
dents. Female respondents wear hijab school uniforms, while men do not wear head cov-
erings. Female students wear long and long-sleeved skirts, and male students wear shirts
and trousers. All students, both girls and boys, wear shoes and socks. The activities they
perform are sitting writing and sitting listening for 7–16 h. The total amount of data from
eight independent variables and one dependent variable is 2268. Respondents consisted
Start
training data
test data
Calculate Class Probability
Calculate Class P(Y) Probability
Calculate the Probability of
Each Criteria P(X)
Calculate Final Probability
P(X|Y) = P(X|Y).P(Y)
Prediction Result
Output Y(TSV)
End
Figure 1. Naïve Bayes flowchart.
3. Results
The data obtained amounted to 252 datasets. Respondents were 252 high school
students. Female respondents wear hijab school uniforms, while men do not wear head
coverings. Female students wear long and long-sleeved skirts, and male students wear
shirts and trousers. All students, both girls and boys, wear shoes and socks. The activities
they perform are sitting writing and sitting listening for 7–16 h. The total amount of
data from eight independent variables and one dependent variable is 2268. Respondents
consisted of 44% men and 56% women. Respondent ages ranged from 14 to 19 years, with
an average of 16.3 years. The respondents’ heights were between 140 and 177 cm, with an
average of 156 cm. The respondents’ body weights were between 30 and 82 kg, averaging
47.9 kg. The temperature in the class was between 22 and 25.5 ◦C, with an average of
23.75 ◦C. Globe temperature in the class was between 23 and 26.5 ◦C, with an average of
Sustainability 2022, 14, 15663 6 of 18
24.6 ◦C. Humidity was between 60 and 80%, with an average of 68.63%. Velocity did not
involve too much movement, so it shows more zeros. The most significant velocity was
1 m/s. The thermal sensation votes obtained ranged from −3 (very cold) to +3 (very hot),
with an average of −0.89 (near cool). Data measurement was performed by bringing the
measuring instrument closer to the respondent by placing the measuring instrument on
the classroom table (Figure 2).
of 44% men and 56% women. Respondent ages ranged from 14 to 19 years, with an aver-
age of 16.3 years. The respondents’ heights were between 140 and 177 cm, with an average
of 156 cm. The respondents’ body weights were between 30 and 82 kg, averaging 47.9 kg.
The temperature in the class was between 22 and 25.5 °C, with an average of 23.75 °C.
Globe temperature in the class was between 23 and 26.5 °C, with an average of 24.6 °C.
Humidity was between 60 and 80%, with an average of 68.63%. Velocity did not involve
too much movement, so it shows more zeros. The most significant velocity was 1 m/s. The
thermal sensation votes obtained ranged from −3 (very cold) to +3 (very hot), with an av-
erage of −0.89 (near cool). Data measurement was performed by bringing the measuring
instrument closer to the respondent by placing the measuring instrument on the class-
room table (Figure 2).
Figure 2. Data measuring.
Data analysis using multiple linear regression has several data test requirements,
namely, validity and reliability. In addition, the classical assumption test also needs to be
carried out to obtain data that can be used for multiple linear regression analysis. Analysis
using SPSS software resulted in a large amount of valid test data. The normality test was
part of the regression analysis and obtained a model that meets the assumption of nor-
mality (Figure 3).
Figure 3. Classic assumption test.
Multiple linear regression data analysis using SPSS produced a value of unstandard-
ized coefficients that can be used as the coefficient of the prediction model. Some of the
resulting values looked insignificant. This value indicates that the influence of the inde-
pendent variable on the dependent is less potent (Table 2). This value can still be used in
predicting thermal comfort because several models from other studies also obtained the
same results.
Figure 2. Data measuring.
Data analysis using multiple linear regression has several data test requirements,
namely, validity and reliability. In addition, the classical assumption test also needs to be
carried out to obtain data that can be used for multiple linear regression analysis. Analysis
using SPSS software resulted in a large amount of valid test data. The normality test was
part of the regression analysis and obtained a model that meets the assumption of normality
(Figure 3).
Figure 3. Classic assumption test.
Multiple linear regression data analysis using SPSS produced a value of unstandard-
ized coefficients that can be used as the coefficient of the prediction model. Some of the
resulting values looked insignificant. This value indicates that the influence of the inde-
pendent variable on the dependent is less potent (Table 2). This value can still be used in
predicting thermal comfort because several models from other studies also obtained the
same results.
Sustainability 2022, 14, 15663 7 of 18
Table 2. Regression Value.
Coefficients
Model
Unstandardized
Coefficients
Standardized
Coefficients
t Sig.
Correlations
Collinearity
Statistics
B
Std.
Error
Beta
Zero-
Order
Partial Part Tolerance VIF
1
(Constant) −8.796 3.231 −2.723 0.007
Gender −0.308 0.182 −0.138 −1.696 0.091 −0.179 −0.108 −0.097 0.493 2.030
Age −0.093 0.074 −0.078 −1.266 0.207 −0.226 −0.081 −0.072 0.854 1.171
Height −0.015 0.013 −0.106 −1.235 0.218 0.083 −0.079 −0.070 0.437 2.289
Weight 0.005 0.012 0.030 0.439 0.661 0.055 0.028 0.025 0.705 1.419
Temperature 0.265 0.124 0.238 2.139 0.033 0.359 0.136 0.122 0.263 3.808
Globe_temperature 0.183 0.119 0.182 1.534 0.126 0.334 0.098 0.087 0.231 4.332
Humidity 0.018 0.011 0.102 1.588 0.114 0.014 0.101 0.090 0.784 1.275
Velocity 1.420 0.433 0.198 3.282 0.001 0.144 0.206 0.187 0.891 1.123
Dependent variable: TSV.
Based on the value of the constants and regression coefficients obtained from the
regression analysis, it is known that the multiple linear regression equation based equation
(Equation (1)) is as follows:
Y = (−8.796) + (−0.308) × X1 + (−0.093) × X2 + (−0.015) × X3 + 0.005 × X4 + 0.265 ×
X5 + 0.183 × X6 + 0.018 × X7 + 1.420 × X8
Several variables show a high significance value. This indicates that several indepen-
dent variables did not strongly influence the dependent variable. This is possible because
of the type of clothing worn by the respondents.
The comfortable air temperature was calculated when Y = 0 (comfortable thermal
sensation condition), and a comfortable air temperature of 33.19 ◦C was obtained. The
comfortable air temperature produced was higher than the average comfortable air tem-
perature in the tropics, which is 27 ◦C. This is possible because the closed clothes worn by
respondents are hijab for women and trousers for men.
Based on Equation (1) that was generated, a prediction of thermal comfort can be
made for the sample data based on Table 3 as follows:
Y = (−8.796) + (−0.308) × 1 + (−0.093) × 18 + (−0.015) × 160 + 0.005 × 46 + 0.265 ×
22 + 0.183 × 24 + 0.018 × 62 + 1.420 × 0
Y = −1.61 ≈ −2
Table 3. Sample of testing data for prediction.
Gender Age Height Weight Temperature Globe_Temperature Relative_Humidity Velocity TSV
1 18 160 46 22 24 62 0 ?
Naïve Bayes analysis begins with determining the training data in as many as
252 datasets according to the data from the measurement results. The variables used
in predicting thermal comfort (TSV) are the same as the regression analysis, namely, gender,
age, height, weight, temperature, globe temperature, relative humidity, and velocity. The
training data is used as test data, which is the basis of our calculations.
Based on the training data used, there are 7 TSV classes, namely, class −3 with 3 data,
class −2 with 87 data, class 1 with 81 data, class 0 with 50 data, class 1 with 24 data, class 2
with 6 data, and class 3 with as much as 1 data. TSV class classification data in the Weka
software is shown in Figure 4.
Sustainability 2022, 14, 15663 8 of 18
Sustainability 2022, 14, x FOR PEER REVIEW 8 of 18
Figure 4. TSV class.
The probability value for each criterion is shown in Appendix A.
In this test data, a thermal comfort (TSV) prediction can be made if the data used are
as follows: gender: 1, age: 18, height: 160, weight: 46, temperature: 22, globe temperature:
24, relative humidity: 62, and velocity: 0.
Based on the test data in Table 3, a prediction calculation can be made for the data
above.
Based on the naïve Bayes algorithm calculations based on Table 4, the highest value
is 0.000026 for class −2. Thus, the prediction of the test data is class −2.
Table 4. Prediction result.
Variable
Probability
−3 −2 −1 0 1 2 3
3 87 81 50 24 6 1
Gender 0.00 0.2759 0.59 0.52 0.33 0.83 1.00
Age 0.33 0.1494 0.09 0.16 0.00 0.00 0.00
Height 0.00 0.0575 0.10 0.04 0.04 0.00 0.00
Weight 0.00 0.0460 0.00 0.04 0.04 0.00 0.00
Temperature 0.00 0.1149 0.04 0.00 0.00 0.00 0.00
Globe_temperature 0.00 0.2874 0.12 0.02 0.08 0.00 0.00
Relative_humidity 0.00 0.0805 0.12 0.06 0.00 0.00 0.00
Velocity 1.00 1.0000 0.94 0.98 0.88 1.00 1.00
Sum 0 0.000026 0 0 0 0 0
Prediction Result −2
The accuracy of the prediction results can be calculated based on the confusion ma-
trix. From a total of 252 available data, 0 data in class A were correctly predicted as class
A (−3), and 3 data were correctly predicted as class one. In total, 72 data in class B were
correctly predicted as class B (−2), and 15 data were correctly predicted as class B. In total,
49 data in class C were correctly predicted as class C (−1), and 32 data were predicted
incorrectly as class C. In total, 29 data in class D were correctly predicted as class D (0),
and 21 data were predicted incorrectly as class D. In total, 16 data in class E were correctly
predicted as class E (1), and 8 data were predicted incorrectly as class E. In total, three data
in class F were correctly predicted as class F (2), and three data were incorrectly predicted
as class F. Zero data in class G were correctly predicted as class G (3), and one datum was
predicted incorrectly as class G.
3
87
81
50
24
6
1
-3 -2 -1 0 1 2 3
Figure 4. TSV class.
The probability value for each criterion is shown in Appendix A.
In this test data, a thermal comfort (TSV) prediction can be made if the data used are
as follows: gender: 1, age: 18, height: 160, weight: 46, temperature: 22, globe temperature:
24, relative humidity: 62, and velocity: 0.
Based on the test data in Table 3, a prediction calculation can be made for the
data above.
Based on the naïve Bayes algorithm calculations based on Table 4, the highest value is
0.000026 for class −2. Thus, the prediction of the test data is class −2.
Table 4. Prediction result.
Variable
Probability
−3 −2 −1 0 1 2 3
3 87 81 50 24 6 1
Gender 0.00 0.2759 0.59 0.52 0.33 0.83 1.00
Age 0.33 0.1494 0.09 0.16 0.00 0.00 0.00
Height 0.00 0.0575 0.10 0.04 0.04 0.00 0.00
Weight 0.00 0.0460 0.00 0.04 0.04 0.00 0.00
Temperature 0.00 0.1149 0.04 0.00 0.00 0.00 0.00
Globe_temperature 0.00 0.2874 0.12 0.02 0.08 0.00 0.00
Relative_humidity 0.00 0.0805 0.12 0.06 0.00 0.00 0.00
Velocity 1.00 1.0000 0.94 0.98 0.88 1.00 1.00
Sum 0 0.000026 0 0 0 0 0
Prediction Result −2
The accuracy of the prediction results can be calculated based on the confusion matrix.
From a total of 252 available data, 0 data in class A were correctly predicted as class A (−3),
and 3 data were correctly predicted as class one. In total, 72 data in class B were correctly
predicted as class B (−2), and 15 data were correctly predicted as class B. In total, 49 data
in class C were correctly predicted as class C (−1), and 32 data were predicted incorrectly
as class C. In total, 29 data in class D were correctly predicted as class D (0), and 21 data
were predicted incorrectly as class D. In total, 16 data in class E were correctly predicted as
class E (1), and 8 data were predicted incorrectly as class E. In total, three data in class F
were correctly predicted as class F (2), and three data were incorrectly predicted as class F.
Sustainability 2022, 14, 15663 9 of 18
Zero data in class G were correctly predicted as class G (3), and one datum was predicted
incorrectly as class G.
The results show that the number of correctly predicted data (correctly classified
instances) was 169 data, or 67.06%, while the incorrectly predicted results (incorrectly
classified instances) amounted to 83 data, or 32.94%.
The comparison between regression analysis and naïve Bayes seen from the number of
TSVs shows that the regression analysis found the highest TSV predictions in the cool range
(−1), with as many as 147 data. In the naïve Bayes prediction, the highest TSV was found
in the cold range (−2), as much as 104. The highest TSV difference showed a difference in
results between the regression analysis and naïve Bayes (Figure 5).
Sustainability 2022, 14, x FOR PEER REVIEW 9 of 18
The results show that the number of correctly predicted data (correctly classified in-
stances) was 169 data, or 67.06%, while the incorrectly predicted results (incorrectly clas-
sified instances) amounted to 83 data, or 32.94%.
The comparison between regression analysis and naïve Bayes seen from the number
of TSVs shows that the regression analysis found the highest TSV predictions in the cool
range (-1), with as many as 147 data. In the naïve Bayes prediction, the highest TSV was
found in the cold range (-2), as much as 104. The highest TSV difference showed a differ-
ence in results between the regression analysis and naïve Bayes (Figure 5).
Figure 5. Differences in prediction results.
Higher data variation was found in the results of naïve Bayes analysis, which could
predict all TSV categories, while the data generated using linear regression look clustered
at a value of 0 to −2. If the prediction results using linear regression and the naïve Bayes
method are compared with the actual data in the field, then the naïve Bayes method has
a better level of accuracy because the results obtained from naïve Bayes can approach the
actual data in the field.
The value generated from naïve Bayes looks close to the actual data generated in field
testing. The value using regression analysis shows the greater the value, the higher the
value (Figure 6).
0
20
40
60
80
100
120
140
160
─3 ─2 ─1 0 1 2 3
Prediction Result
Linear Regression Naïve Bayes
Figure 5. Differences in prediction results.
Higher data variation was found in the results of naïve Bayes analysis, which could
predict all TSV categories, while the data generated using linear regression look clustered
at a value of 0 to −2. If the prediction results using linear regression and the naïve Bayes
method are compared with the actual data in the field, then the naïve Bayes method has a
better level of accuracy because the results obtained from naïve Bayes can approach the
actual data in the field.
The value generated from naïve Bayes looks close to the actual data generated in field
testing. The value using regression analysis shows the greater the value, the higher the
value (Figure 6).
Sustainability 2022, 14, x FOR PEER REVIEW 10 of 18
Figure 6. Prediction result comparison between actual TSV, linear regression, and naïve Bayes.
By comparing the prediction results with the initial data, the level of accuracy of both
methods can be found. The linear regression analysis has data accuracy in as many as 84
of 252 datasets, and naïve Bayes analysis makes correct predictions in 181 out of 252 da-
tasets. The accuracy of linear regression analysis is 33%, while the naïve Bayes analysis is
67%. Prediction results show that the accuracy of naïve Bayes is higher than the multiple
linear regression analysis (Table 5).
-20
0
20
40
60
80
100
120
140
160
-3 -2 -1 0 1 2 3
Prediction Result Comparison
Actual TSV Linear Regression Naïve Bayes
Figure 6. Prediction result comparison between actual TSV, linear regression, and naïve Bayes.
Sustainability 2022, 14, 15663 10 of 18
By comparing the prediction results with the initial data, the level of accuracy of both
methods can be found. The linear regression analysis has data accuracy in as many as 84 of
252 datasets, and naïve Bayes analysis makes correct predictions in 181 out of 252 datasets.
The accuracy of linear regression analysis is 33%, while the naïve Bayes analysis is 67%.
Prediction results show that the accuracy of naïve Bayes is higher than the multiple linear
regression analysis (Table 5).
Table 5. Comparison of data analysis accuracies.
Method Correct Prediction Count of Data Accuracy
Linear Regression 84 252 33%
Naïve Bayes 169 252 67%
4. Discussion
Thermal comfort data obtained based on age are still relevant as a basis for formulating
a thermal comfort model. Age is essential in thermal comfort, and other studies have
analyzed elderly respondents. Thermal comfort models built with different age data
will produce different findings. Research in Tibet in winter and summer with elderly
respondents found differences in the research results regarding the acceptance of thermal
comfort [33]. Individual thermal comfort response is inseparable from the microclimate
of each region. Solar radiation influences the individual’s thermal comfort response with
an influence on the average solar radiation temperature. The air content also influences
the thermal comfort response of each individual in an area [34]. Modeling with thermal
comfort data more often uses regression analysis. The use of machine learning has now
grown so that the methods applied are more varied [35].
The use of regression analysis is still carried out in modeling thermal comfort in
outdoor spaces. Research with regression analysis is still accurate in evaluating outdoor
thermal comfort by including physiological parameters. The model found that it can
provide a design basis for creating thermally comfortable open spaces in urban parks [36].
A comparison of methods in modeling thermal comfort using thermal sensation vote (TSV)
was carried out, but it still needs to be used to find the most accurate method. TSV is one of
the appropriate variables in accurately predicting thermal comfort at a rate of 95.8%. The
Bayesian optimization technique is considered an accurate method for making prediction
models. Algorithms in Bayesian optimization techniques can predict individual thermal
comfort [31]. The results of other studies show that linear discriminant analysis (LDA) is
better than linear regression (LA). Several algorithms show different results in different
cases. These findings can contribute to studying subjective and objective feelings of indoor
thermal comfort in public buildings, thereby guiding architectural design, the intelligent
control of ventilation systems, and realizing human–building interaction interfaces [37].
Naïve Bayes is better than regression analysis. The results showed that naïve Bayes
has a calculation accuracy of 67%. Another study compared naïve Bayes with artificial
neural network (ANN), fuzzy logic (FL), and PMV-based algorithms. Other results show
that the naïve Bayes calculation provides a prediction accuracy of 73% [30]. The difference
compared with the research conducted in other studies is 1%. Another study comparing
several machine learning methods in finding predictions of city thermal comfort found
that naïve Bayes resulted in a data accuracy of 40.43% [38]. The results of other studies
are quite different from the research that has been undertaken. Thermal comfort data in
urban areas may differ from indoor data. Research on energy consumption savings that
compares naïve Bayes and regression has also found results that are not different from the
research on thermal comfort that has been carried out. The results of the study of energy
consumption savings with regression resulted in a data accuracy of 41.43% and a naïve
Bayes accuracy of 73% [39]. The results of other research regressions compared with the
research that has been performed have a difference of 41.43 minus 33%, which is 8.43%.
The difference compared with naïve Bayes accuracy is 1%.
Sustainability 2022, 14, 15663 11 of 18
The prediction results obtained from linear regression and naïve Bayes are not precise
but instead are based on the closest value [40]. In linear regression, the results are obtained
by rounding the final grade to the nearest side of the class, while the results from naïve
Bayes are obtained from the class that has the largest final score.
PMV (predicted mean vote) and PPD (predicted percentage of dissatisfied) values
were obtained using the CBE Thermal Comfort Tool software from https://comfort.cbe.
berkeley.edu/ (accessed on 3 October 2022). Thermal variable data in the form of air tem-
perature, average solar radiation temperature (globe temperature), wind speed, humidity,
metabolism, and respondent activity were entered into the software, and PMV and PPD
values were obtained. A total of 252 respondents calculated their PMV and PPD. The
distribution of the PMV values was mostly in the range of −0.5 to −1, a value that indicates
that a respondent is almost cold (score: −1). In another value, respondents seem to obtain a
PMV value of 0.5, which indicates that some respondents feel close to warm (score: 1). The
overall PMV results show on Figure 7 that the respondents are still not too cold or too hot.
that the naïve Bayes calculation provides a prediction accuracy of 73% [30]. The difference
compared with the research conducted in other studies is 1%. Another study comparing
several machine learning methods in finding predictions of city thermal comfort found
that naïve Bayes resulted in a data accuracy of 40.43% [38]. The results of other studies are
quite different from the research that has been undertaken. Thermal comfort data in urban
areas may differ from indoor data. Research on energy consumption savings that com-
pares naïve Bayes and regression has also found results that are not different from the
research on thermal comfort that has been carried out. The results of the study of energy
consumption savings with regression resulted in a data accuracy of 41.43% and a naïve
Bayes accuracy of 73% [39]. The results of other research regressions compared with the
research that has been performed have a difference of 41.43 minus 33%, which is 8.43%.
The difference compared with naïve Bayes accuracy is 1%.
The prediction results obtained from linear regression and naïve Bayes are not pre-
cise but instead are based on the closest value [40]. In linear regression, the results are
obtained by rounding the final grade to the nearest side of the class, while the results from
naïve Bayes are obtained from the class that has the largest final score.
PMV (predicted mean vote) and PPD (predicted percentage of dissatisfied) values
were obtained using the CBE Thermal Comfort Tool software from https://com-
fort.cbe.berkeley.edu/ (accessed on 3 October 2022). Thermal variable data in the form of
air temperature, average solar radiation temperature (globe temperature), wind speed,
humidity, metabolism, and respondent activity were entered into the software, and PMV
and PPD values were obtained. A total of 252 respondents calculated their PMV and PPD.
The distribution of the PMV values was mostly in the range of −0.5 to −1, a value that
indicates that a respondent is almost cold (score: −1). In another value, respondents seem
to obtain a PMV value of 0.5, which indicates that some respondents feel close to warm
(score: 1). The overall PMV results show on Figure 7 that the respondents are still not too
cold or too hot.
Figure 7. PMV score.
The highest PPD value produced by the respondents reached 19%. The minimum
value is 5%, and the average PPD produced is 9%. Not many respondents reached the
19%value. The PPD value generated using the software from https://comfort.cbe.berke-
ley.edu/software (accessed on 3 October 2022) shows that respondents are predicted to
still be able to accept the existing thermal conditions. The PPD value is still below 25%,
which means that respondents are still relatively comfortable with the existing thermal
conditions (Figure 8).
-1
-0.5
0
0.5
1
1
11
21
31
41
51
61
71
81
91
101
111
121
131
141
151
161
171
181
191
201
211
221
231
241
251
PMV
Respondent
Figure 7. PMV score.
The highest PPD value produced by the respondents reached 19%. The minimum
value is 5%, and the average PPD produced is 9%. Not many respondents reached the
19%value. The PPD value generated using the software from https://comfort.cbe.berkeley.
edu/software (accessed on 3 October 2022) shows that respondents are predicted to still
be able to accept the existing thermal conditions. The PPD value is still below 25%, which
means that respondents are still relatively comfortable with the existing thermal conditions
(Figure 8).
Sustainability 2022, 14, x FOR PEER REVIEW 12 of 18
Figure 8. PPD score.
5. Conclusions
We studied the use of multiple linear regression analysis as a model for predicting
thermal comfort. Predicting thermal comfort using a regression model in a study at Won-
osobo High School showed results with a better range of coolness. Naïve Bayes analysis
is one of the analytical alternatives classified as machine learning. The use of machine
learning in thermal comfort is important because the analysis is expected to have accurate
predictions. Using naïve Bayes analysis in thermal comfort research at Wonosobo Senior
High School, Indonesia, we found differences compared with research results using mul-
tiple linear regression analysis. The difference in prediction results shows that naïve Bayes
analysis has a cooler TSV result than multiple linear regression analysis. The level of ac-
curacy in predictions using naïve Bayes is higher than the multiple linear regression anal-
0%
2%
4%
6%
8%
10%
12%
14%
16%
18%
20%
0 50 100 150 200 250 300
PPD
Respondent
Figure 8. PPD score.
5. Conclusions
We studied the use of multiple linear regression analysis as a model for predicting
thermal comfort. Predicting thermal comfort using a regression model in a study at
Wonosobo High School showed results with a better range of coolness. Naïve Bayes
Sustainability 2022, 14, 15663 12 of 18
analysis is one of the analytical alternatives classified as machine learning. The use of
machine learning in thermal comfort is important because the analysis is expected to have
accurate predictions. Using naïve Bayes analysis in thermal comfort research at Wonosobo
Senior High School, Indonesia, we found differences compared with research results using
multiple linear regression analysis. The difference in prediction results shows that naïve
Bayes analysis has a cooler TSV result than multiple linear regression analysis. The level
of accuracy in predictions using naïve Bayes is higher than the multiple linear regression
analysis method. A comparison between the predictions of he two analytical methods is
not too different, so it is still possible to use both methods in predicting the thermal comfort
of building occupants.
Author Contributions: Conceptualization, H.H.; Methodology, H.H.; Validation, H.S.; Data curation,
H.S.; Writing—original draft, N.F.; Writing—review & editing, J.S.; Visualization, A.N.A. All authors
have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.
Conflicts of Interest: The authors declare no conflict of interest.
Appendix A
Table A1. Gender criteria probability.
Gender
Count Probability
−3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3
1 0 24 48 26 8 5 1 0 0.28 0.59 0.52 0.33 0.83 1
2 3 63 33 24 16 1 0 1 0.72 0.41 0.48 0.67 0.17 0
3 87 81 50 24 6 1
Table A2. Age criteria probability.
Age
Count Probability
−3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3
14 0 0 1 0 0 0 0 0 0 0.01 0 0 0 0
15 0 9 16 10 7 3 0 0 0.1 0.2 0.2 0.29 0.5 0
16 0 29 36 18 12 2 0 0 0.33 0.44 0.36 0.5 0.33 0
17 2 35 21 14 5 1 1 0.67 0.4 0.26 0.28 0.21 0.17 1
18 1 13 7 8 0 0 0 0.33 0.15 0.09 0.16 0 0 0
19 0 1 0 0 0 0 0 0 0.01 0 0 0 0 0
3 87 81 50 24 6 1
Sustainability 2022, 14, 15663 13 of 18
Table A3. Height criteria probability.
Height
Count Probability
−3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3
140 0 0 1 1 0 0 0 0 0 0.01 0.02 0 0 0
142 0 0 1 0 0 0 0 0 0 0.01 0 0 0 0
143 0 5 2 1 0 0 0 0 0.06 0.02 0.02 0 0 0
144 0 2 1 0 0 0 0 0 0.02 0.01 0 0 0 0
145 0 3 0 0 1 0 0 0 0.03 0 0 0.04 0 0
146 0 4 2 0 0 0 0 0 0.05 0.02 0 0 0 0
147 0 4 2 1 1 0 0 0 0.05 0.02 0.02 0.04 0 0
148 0 3 1 0 1 0 0 0 0.03 0.01 0 0.04 0 0
149 0 1 2 1 1 0 0 0 0.01 0.02 0.02 0.04 0 0
150 2 10 6 6 2 0 1 0.67 0.11 0.07 0.12 0.08 0 1
151 0 5 2 2 2 1 0 0 0.06 0.02 0.04 0.08 0.17 0
152 0 3 2 1 2 0 0 0 0.03 0.02 0.02 0.08 0 0
153 0 6 3 3 2 0 0 0 0.07 0.04 0.06 0.08 0 0
154 1 1 6 3 1 0 0 0.33 0.01 0.07 0.06 0.04 0 0
155 0 5 5 3 2 2 0 0 0.06 0.06 0.06 0.08 0.33 0
156 0 5 2 3 1 1 0 0 0.06 0.02 0.06 0.04 0.17 0
157 0 3 1 2 1 1 0 0 0.03 0.01 0.04 0.04 0.17 0
158 0 3 7 3 1 0 0 0 0.03 0.09 0.06 0.04 0 0
159 0 0 4 2 0 0 0 0 0 0.05 0.04 0 0 0
160 0 5 8 2 1 0 0 0 0.06 0.1 0.04 0.04 0 0
161 0 0 1 2 0 0 0 0 0 0.01 0.04 0 0 0
162 0 3 2 0 0 1 0 0 0.03 0.02 0 0 0.17 0
163 0 1 1 1 1 0 0 0 0.01 0.01 0.02 0.04 0 0
164 0 3 2 2 1 0 0 0 0.03 0.02 0.04 0.04 0 0
165 0 3 4 3 0 0 0 0 0.03 0.05 0.06 0 0 0
166 0 2 1 1 1 0 0 0 0.02 0.01 0.02 0.04 0 0
167 0 4 2 1 1 0 0 0 0.05 0.02 0.02 0.04 0 0
168 0 0 2 2 0 0 0 0 0 0.02 0.04 0 0 0
169 0 0 1 0 0 0 0 0 0 0.01 0 0 0 0
170 0 0 5 1 1 0 0 0 0 0.06 0.02 0.04 0 0
171 0 1 0 1 0 0 0 0 0.01 0 0.02 0 0 0
172 0 1 0 0 0 0 0 0 0.01 0 0 0 0 0
174 0 0 0 1 0 0 0 0 0 0 0.02 0 0 0
176 0 1 1 1 0 0 0 0 0.01 0.01 0.02 0 0 0
177 0 0 1 0 0 0 0 0 0 0.01 0 0 0 0
3 87 81 50 24 6 1
Sustainability 2022, 14, 15663 14 of 18
Table A4. Weight criteria probability.
Weight
Count Probability
−3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3
30 0 0 0 1 0 1 0 0 0 0 0.02 0 0.17 0
36 0 0 0 0 0 1 0 0 0 0 0 0 0.17 0
37 0 1 0 0 0 0 0 0 0.01 0 0 0 0 0
38 0 1 0 0 0 0 0 0 0.01 0 0 0 0 0
39 0 3 1 2 0 0 0 0 0.03 0.01 0.04 0 0 0
40 0 14 9 2 1 0 1 0 0.16 0.11 0.04 0.04 0 1
41 0 1 1 0 1 0 0 0 0.01 0.01 0 0.04 0 0
42 0 7 3 1 1 1 0 0 0.08 0.04 0.02 0.04 0.17 0
43 2 3 5 1 0 0 0 0.67 0.03 0.06 0.02 0 0 0
44 0 4 2 0 0 0 0 0 0.05 0.02 0 0 0 0
45 0 13 9 9 5 0 0 0 0.15 0.11 0.18 0.21 0 0
46 0 4 0 2 1 0 0 0 0.05 0 0.04 0.04 0 0
47 0 1 2 2 0 0 0 0 0.01 0.02 0.04 0 0 0
48 1 5 6 0 1 1 0 0.33 0.06 0.07 0 0.04 0.17 0
49 0 1 8 1 2 1 0 0 0.01 0.1 0.02 0.08 0.17 0
50 0 9 7 13 4 0 0 0 0.1 0.09 0.26 0.17 0 0
51 0 2 3 1 1 0 0 0 0.02 0.04 0.02 0.04 0 0
52 0 5 1 3 4 0 0 0 0.06 0.01 0.06 0.17 0 0
53 0 4 4 1 0 0 0 0 0.05 0.05 0.02 0 0 0
54 0 4 3 2 2 0 0 0 0.05 0.04 0.04 0.08 0 0
55 0 1 5 3 1 0 0 0 0.01 0.06 0.06 0.04 0 0
56 0 1 1 1 0 0 0 0 0.01 0.01 0.02 0 0 0
57 0 0 2 1 0 1 0 0 0 0.02 0.02 0 0.17 0
58 0 1 2 0 0 0 0 0 0.01 0.02 0 0 0 0
59 0 0 1 2 0 0 0 0 0 0.01 0.04 0 0 0
60 0 0 3 1 0 0 0 0 0 0.04 0.02 0 0 0
61 0 0 1 1 0 0 0 0 0 0.01 0.02 0 0 0
65 0 0 2 0 0 0 0 0 0 0.02 0 0 0 0
69 0 1 0 0 0 0 0 0 0.01 0 0 0 0 0
82 0 1 0 0 0 0 0 0 0.01 0 0 0 0 0
3 87 81 50 24 6 1
Sustainability 2022, 14, 15663 15 of 18
Table A5. Temperature criteria probability.
Temperature
Count Probability
−3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3
22 0 10 3 0 0 0 0 0 0.11 0.04 0 0 0 0
23 1 12 11 3 1 0 0 0.33 0.14 0.14 0.06 0.04 0 0
24 1 12 21 7 4 1 0 0.33 0.14 0.26 0.14 0.17 0.17 0
25 0 9 11 9 4 2 0 0 0.1 0.14 0.18 0.17 0.33 0
22.5 1 19 13 8 4 0 0 0.33 0.22 0.16 0.16 0.17 0 0
22.8 0 3 0 0 0 0 0 0 0.03 0 0 0 0 0
23.5 0 12 7 7 0 0 0 0 0.14 0.09 0.14 0 0 0
24.5 0 9 12 15 4 2 1 0 0.1 0.15 0.3 0.17 0.33 1
25.5 0 1 3 1 7 1 0 0 0.01 0.04 0.02 0.29 0.17 0
3 87 81 50 24 6 1
Table A6. Globe temperature criteria probability.
Globe_Temperature
Count Probability
−3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3
23 1 20 9 1 4 0 0 0.33 0.23 0.11 0.02 0.17 0 0
24 0 25 10 1 2 0 0 0 0.29 0.12 0.02 0.08 0 0
25 1 11 9 10 1 0 0 0.33 0.13 0.11 0.2 0.04 0 0
26 0 8 28 10 9 3 1 0 0.09 0.35 0.2 0.38 0.5 1
23.5 1 12 15 11 0 0 0 0.33 0.14 0.19 0.22 0 0 0
24.5 0 4 4 11 4 0 0 0 0.05 0.05 0.22 0.17 0 0
25.5 0 6 6 5 1 2 0 0 0.07 0.07 0.1 0.04 0.33 0
26.5 0 1 0 1 3 1 0 0 0.01 0 0.02 0.13 0.17 0
3 87 81 50 24 6 1
Table A7. Relative humidity criteria probability.
Relative_Humidity
Count Probability
−3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3
60 1 1 2 0 0 0 0 0.33 0.01 0.02 0 0 0 0
61 0 3 7 6 0 1 1 0 0.03 0.09 0.12 0 0.17 1
62 0 7 10 3 0 0 0 0 0.08 0.12 0.06 0 0 0
63 0 11 11 0 4 1 0 0 0.13 0.14 0 0.17 0.17 0
64 0 1 0 8 5 0 0 0 0.01 0 0.16 0.21 0 0
65 1 11 2 3 1 0 0 0.33 0.13 0.02 0.06 0.04 0 0
66 1 7 4 5 2 0 0 0.33 0.08 0.05 0.1 0.08 0 0
67 0 7 7 7 1 0 0 0 0.08 0.09 0.14 0.04 0 0
Sustainability 2022, 14, 15663 16 of 18
Table A7. Cont.
Relative_Humidity
Count Probability
−3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3
68 0 0 1 3 0 0 0 0 0 0.01 0.06 0 0 0
69 0 4 0 0 0 0 0 0 0.05 0 0 0 0 0
70 0 7 6 3 0 0 0 0 0.08 0.07 0.06 0 0 0
71 0 3 8 1 3 0 0 0 0.03 0.1 0.02 0.13 0 0
72 0 2 2 0 0 2 0 0 0.02 0.02 0 0 0.33 0
73 0 2 2 0 0 0 0 0 0.02 0.02 0 0 0 0
74 0 1 5 1 1 1 0 0 0.01 0.06 0.02 0.04 0.17 0
75 0 2 1 0 0 0 0 0 0.02 0.01 0 0 0 0
77 0 1 0 1 3 1 0 0 0.01 0 0.02 0.13 0.17 0
79 0 12 10 6 0 0 0 0 0.14 0.12 0.12 0 0 0
80 0 5 3 3 4 0 0 0 0.06 0.04 0.06 0.17 0 0
3 87 81 50 24 6 1
Table A8. Velocity criteria probability.
Velocity
Count Probability
−3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3
0 3 87 76 49 21 6 1 1 1 0.94 0.98 0.88 1 1
0.3 0 0 3 0 0 0 0 0 0 0.04 0 0 0 0
1 0 0 2 1 3 0 0 0 0 0.02 0.02 0.13 0 0
3 87 81 50 24 6 1
Sustainability 2022, 14, x FOR PEER REVIEW 16 of 18
Figure A1. Prediction results using Weka software.
Figure A2. Confusion matrix using Weka software.
Figure A1. Prediction results using Weka software.
Sustainability 2022, 14, x FOR PEER REVIEW 16 of 18
Figure A1. Prediction results using Weka software.
Figure A2. Confusion matrix using Weka software.
Figure A2. Confusion matrix using Weka software.
Sustainability 2022, 14, 15663 17 of 18
Figure A2. Confusion matrix using Weka software.
Figure A3. Naïve Bayes prediction accuracy using Weka software.
References
1. Gong, P.; Cai, Y.; Zhou, Z.; Zhang, C.; Chen, B.; Sharples, S. Investigating spatial impact on indoor personal thermal comfort. J.
Build. Eng. 2021, 45, 103536. https://doi.org/10.1016/j.jobe.2021.103536.
2. Dzyuban, Y.; Ching, G.N.; Yik, S.K.; Tan, A.J.; Banerjee, S.; Crank, P.J.; Chow, W.T. Outdoor thermal comfort research in tran-
sient conditions: A narrative literature review. Landsc. Urban Plan. 2022, 226, 104496. https://doi.org/10.1016/j.landur-
bplan.2022.104496.
3. Larriva, M.T.B.; Mendes, A.S.; Forcada, N. The effect of climatic conditions on occupants’ thermal comfort in naturally venti-
lated nursing homes. Build. Environ. 2022, 214, 108930. https://doi.org/10.1016/j.buildenv.2022.108930.
4. Zang, X.; Liu, K.; Qian, Y.; Qu, G.; Yuan, Y.; Ren, L.; Liu, G. The influence of different functional areas on customers’ thermal
comfort—A Field study in shopping complexes of North China. Energy Built Environ. 2022, in press.
https://doi.org/10.1016/j.enbenv.2022.01.004.
5. Jia, X.; Wang, J.; Zhu, Y.; Ji, W.; Cao, B. Climate chamber study on thermal comfort of walking passengers with elevated ambient
air velocity. Build. Environ. 2022, 218, 109100. https://doi.org/10.1016/j.buildenv.2022.109100.
6. Niza, I.L.; Broday, E.E. Thermal comfort conditions in Brazil: A discriminant analysis through the ASHRAE Global Thermal
Comfort Database II. Build. Environ. 2022, 221, 109310. https://doi.org/10.1016/j.buildenv.2022.109310.
7. Feng, Y.; Liu, S.; Wang, J.; Yang, J.; Jao, Y.-L.; Wang, N. Data-driven personal thermal comfort prediction: A literature review.
Renew. Sustain. Energy Rev. 2022, 161, 112357. https://doi.org/10.1016/j.rser.2022.112357.
Figure A3. Naïve Bayes prediction accuracy using Weka software.
References
1. Gong, P.; Cai, Y.; Zhou, Z.; Zhang, C.; Chen, B.; Sharples, S. Investigating spatial impact on indoor personal thermal comfort. J.
Build. Eng. 2021, 45, 103536. [CrossRef]
2. Dzyuban, Y.; Ching, G.N.; Yik, S.K.; Tan, A.J.; Banerjee, S.; Crank, P.J.; Chow, W.T. Outdoor thermal comfort research in transient
conditions: A narrative literature review. Landsc. Urban Plan. 2022, 226, 104496. [CrossRef]
3. Larriva, M.T.B.; Mendes, A.S.; Forcada, N. The effect of climatic conditions on occupants’ thermal comfort in naturally ventilated
nursing homes. Build. Environ. 2022, 214, 108930. [CrossRef]
4. Zang, X.; Liu, K.; Qian, Y.; Qu, G.; Yuan, Y.; Ren, L.; Liu, G. The influence of different functional areas on customers’ thermal
comfort—A Field study in shopping complexes of North China. Energy Built Environ. 2022, in press. [CrossRef]
5. Jia, X.; Wang, J.; Zhu, Y.; Ji, W.; Cao, B. Climate chamber study on thermal comfort of walking passengers with elevated ambient
air velocity. Build. Environ. 2022, 218, 109100. [CrossRef]
6. Niza, I.L.; Broday, E.E. Thermal comfort conditions in Brazil: A discriminant analysis through the ASHRAE Global Thermal
Comfort Database II. Build. Environ. 2022, 221, 109310. [CrossRef]
7. Feng, Y.; Liu, S.; Wang, J.; Yang, J.; Jao, Y.-L.; Wang, N. Data-driven personal thermal comfort prediction: A literature review.
Renew. Sustain. Energy Rev. 2022, 161, 112357. [CrossRef]
8. Heidari, A.; Maréchal, F.; Khovalyg, D. Reinforcement Learning for proactive operation of residential energy systems by learning
stochastic occupant behavior and fluctuating solar energy: Balancing comfort, hygiene and energy use. Appl. Energy 2022,
318, 119206. [CrossRef]
9. Jia, M.; Choi, J.-H.; Liu, H.; Susman, G. Development of facial-skin temperature driven thermal comfort and sensation modeling
for a futuristic application. Build. Environ. 2022, 207, 108479. [CrossRef]
10. Liu, K.; Lian, Z.; Dai, X.; Lai, D. Comparing the effects of sun and wind on outdoor thermal comfort: A case study based on
longitudinal subject tests in cold climate region. Sci. Total Environ. 2022, 825, 154009. [CrossRef]
11. Ji, Y.; Song, J.; Shen, P. A review of studies and modelling of solar radiation on human thermal comfort in outdoor environment.
Build. Environ. 2022, 214, 108891. [CrossRef]
12. Geng, Y.; Hong, B.; Du, M.; Yuan, T.; Wang, Y. Combined effects of visual-acoustic-thermal comfort in campus open spaces: A
pilot study in China’s cold region. Build. Environ. 2022, 209, 108658. [CrossRef]
13. Dharmasastha, K.; Samuel, D.L.; Nagendra, S.S.; Maiya, M. Thermal comfort of a radiant cooling system in glass fiber reinforced
gypsum roof—An experimental study. Appl. Therm. Eng. 2022, 214, 118842. [CrossRef]
14. Ma, X.; Leung, T.; Chau, C.; Yung, E.H. Analyzing the influence of urban morphological features on pedestrian thermal comfort.
Urban Clim. 2022, 44, 101192. [CrossRef]
15. Cheng, J.C.; Kwok, H.H.; Li, A.T.; Tong, J.C.; Lau, A.K. BIM-supported sensor placement optimization based on genetic algorithm
for multi-zone thermal comfort and IAQ monitoring. Build. Environ. 2022, 216, 108997. [CrossRef]
16. Luo, Z.; Sun, C.; Dong, Q.; Qi, X. Key control variables affecting interior visual comfort for automated louver control in open-plan
office—A study using machine learning. Build. Environ. 2022, 207, 108565. [CrossRef]
17. Zhang, R.; Liu, D.; Shi, L. Thermal-comfort optimization design method for semi-outdoor stadium using machine learning. Build.
Environ. 2022, 215, 108890. [CrossRef]
18. Kafy, A.-A.; Saha, M.; Faisal, A.-A.; Rahaman, Z.A.; Rahman, M.T.; Liu, D.; Fattah, A.; Al Rakib, A.; AlDousari, A.E.; Rahaman,
S.N.; et al. Predicting the impacts of land use/land cover changes on seasonal urban thermal characteristics using machine
learning algorithms. Build. Environ. 2022, 217, 109066. [CrossRef]
19. Yang, S.; Wan, M.P. Machine-learning-based model predictive control with instantaneous linearization—A case study on an
air-conditioning and mechanical ventilation system. Appl. Energy 2022, 306, 118041. [CrossRef]
20. Heidari, A.; Maréchal, F.; Khovalyg, D. An occupant-centric control framework for balancing comfort, energy use and hygiene in
hot water systems: A model-free reinforcement learning approach. Appl. Energy 2022, 312, 118833. [CrossRef]
Sustainability 2022, 14, 15663 18 of 18
21. Irshad, K.; Habib, K.; Kareem, M.; Basrawi, F.; Saha, B.B. Evaluation of thermal comfort in a test room equipped with a
photovoltaic assisted thermo-electric air duct cooling system. Int. J. Hydrogen Energy 2017, 42, 26956–26972. [CrossRef]
22. Irshad, K.; Algarni, S.; Jamil, B.; Ahmad, M.T.; Khan, M.A. Effect of gender difference on sleeping comfort and building energy
utilization: Field study on test chamber with thermoelectric air-cooling system. Build. Environ. 2019, 152, 214–227. [CrossRef]
23. Yang, Z.; Du, C.; Xiao, H.; Li, B.; Shi, W.; Wang, B. A novel integrated index for simultaneous evaluation of the thermal comfort
and energy efficiency of air-conditioning systems. J. Build. Eng. 2022, 57, 104885. [CrossRef]
24. Zhang, W.; Wu, Y.; Calautit, J.K. A review on occupancy prediction through machine learning for enhancing energy efficiency, air
quality and thermal comfort in the built environment. Renew. Sustain. Energy Rev. 2022, 167, 112704. [CrossRef]
25. Irshad, K.; Khan, A.I.; Irfan, S.A.; Alam, M.; Almalawi, A.; Zahir, H. Utilizing Artificial Neural Network for Prediction of
Occupants Thermal Comfort: A Case Study of a Test Room Fitted With a Thermoelectric Air-Conditioning System. IEEE Access
2020, 8, 99709–99728. [CrossRef]
26. Elnour, M.; Himeur, Y.; Fadli, F.; Mohammedsherif, H.; Meskin, N.; Ahmad, A.M.; Petri, I.; Rezgui, Y.; Hodorog, A. Neural
network-based model predictive control system for optimizing building automation and management systems of sports facilities.
Appl. Energy 2022, 318, 119153. [CrossRef]
27. Esrafilian-Najafabadi, M.; Haghighat, F. Impact of predictor variables on the performance of future occupancy prediction: Feature
selection using genetic algorithms and machine learning. Build. Environ. 2022, 219, 109152. [CrossRef]
28. Rana, R.; Kusy, B.; Jurdak, R.; Wall, J.; Hu, W. Feasibility analysis of using humidex as an indoor thermal comfort predictor. Energy
Build. 2013, 64, 17–25. [CrossRef]
29. Benito, P.I.; Sebastián, M.A.; González-Gaya, C. Study and Application of Industrial Thermal Comfort Parameters by Using
Bayesian Inference Techniques. Appl. Sci. 2021, 11, 11979. [CrossRef]
30. Aguilera, J.J.; Toftum, J.; Kazanci, O.B. Predicting personal thermal preferences based on data-driven methods. E3S Web Conf.
2019, 111, 05015. [CrossRef]
31. Yang, B.; Li, X.; Liu, Y.; Chen, L.; Guo, R.; Wang, F.; Yan, K. Comparison of models for predicting winter individual thermal
comfort based on machine learning algorithms. Build. Environ. 2022, 215, 108970. [CrossRef]
32. Asif, A.; Zeeshan, M.; Khan, S.R.; Sohail, N.F. Investigating the gender differences in indoor thermal comfort perception for
summer and winter seasons and comparison of comfort temperature prediction methods. J. Therm. Biol. 2022, 110, 103357.
[CrossRef]
33. Yao, F.; Fang, H.; Han, J.; Zhang, Y. Study on the outdoor thermal comfort evaluation of the elderly in the Tibetan plateau. Sustain.
Cities Soc. 2021, 77, 103582. [CrossRef]
34. Wei, D.; Yang, L.; Bao, Z.; Lu, Y.; Yang, H. Variations in outdoor thermal comfort in an urban park in the hot-summer and
cold-winter region of China. Sustain. Cities Soc. 2022, 77, 103535. [CrossRef]
35. Qin, H.; Wang, X. A multi-discipline predictive intelligent control method for maintaining the thermal comfort on indoor
environment. Appl. Soft Comput. 2022, 116, 108299. [CrossRef]
36. Zhu, R.; Zhang, X.; Yang, L.; Liu, Y.; Cong, Y.; Gao, W. Correlation analysis of thermal comfort and physiological responses under
different microclimates of urban park. Case Stud. Therm. Eng. 2022, 34, 102044. [CrossRef]
37. Song, G.; Ai, Z.; Zhang, G.; Peng, Y.; Wang, W.; Yan, Y. Using machine learning algorithms to multidimensional analysis of
subjective thermal comfort in a library. Build. Environ. 2022, 212, 108790. [CrossRef]
38. Gao, N.; Shao, W.; Rahaman, M.S.; Zhai, J.; David, K.; Salim, F.D. Transfer learning for thermal comfort prediction in multiple
cities. Build. Environ. 2021, 195, 107725. [CrossRef]
39. Lin, C.-M.; Lin, S.-F.; Liu, H.-Y.; Tseng, K.-Y. Applying the naïve Bayes classifier to HVAC energy prediction using hourly data.
Microsyst. Technol. 2022, 28, 121–135. [CrossRef]
40. Pan, W.; Ming, H.; Yang, Z.; Wang, T. Comments on "Using k-core Decomposition on Class Dependency Networks to Improve
Bug Prediction Model’s Practical Performance". IEEE Trans. Softw. Eng. 2022. Early Access. [CrossRef]

More Related Content

Similar to Thermal_Comfort_Prediction_Accuracy_with_Machine_L.pdf

Measuring Device for Human Comfort Sensation by Converting Fanger Formula Usi...
Measuring Device for Human Comfort Sensation by Converting Fanger Formula Usi...Measuring Device for Human Comfort Sensation by Converting Fanger Formula Usi...
Measuring Device for Human Comfort Sensation by Converting Fanger Formula Usi...Basrah University for Oil and Gas
 
A novel hybrid modelling structure fabricated by using takagi sugeno fuzzy to...
A novel hybrid modelling structure fabricated by using takagi sugeno fuzzy to...A novel hybrid modelling structure fabricated by using takagi sugeno fuzzy to...
A novel hybrid modelling structure fabricated by using takagi sugeno fuzzy to...Basrah University for Oil and Gas
 
Constrained discrete model predictive control of a greenhouse system temperature
Constrained discrete model predictive control of a greenhouse system temperatureConstrained discrete model predictive control of a greenhouse system temperature
Constrained discrete model predictive control of a greenhouse system temperatureIJECEIAES
 
Review on the hvac system modeling types and the shortcomings of their applic...
Review on the hvac system modeling types and the shortcomings of their applic...Review on the hvac system modeling types and the shortcomings of their applic...
Review on the hvac system modeling types and the shortcomings of their applic...Basrah University for Oil and Gas
 
Green Building Awareness HVAC
Green Building Awareness HVAC Green Building Awareness HVAC
Green Building Awareness HVAC AKSHAY SHARMA
 
Environmental building design performance modelling and simulation
Environmental building design performance modelling and simulationEnvironmental building design performance modelling and simulation
Environmental building design performance modelling and simulationnagham ali hasan
 
Dynamic indoor thermal comfort model identification based on neural computing...
Dynamic indoor thermal comfort model identification based on neural computing...Dynamic indoor thermal comfort model identification based on neural computing...
Dynamic indoor thermal comfort model identification based on neural computing...Basrah University for Oil and Gas
 
Assignment 1 - Application of Numerical Analysis.pdf
Assignment 1 - Application of Numerical Analysis.pdfAssignment 1 - Application of Numerical Analysis.pdf
Assignment 1 - Application of Numerical Analysis.pdfSafeen Yaseen Ja'far
 
Green bim software - dr. Nagham Ali Hasan
Green bim software - dr. Nagham Ali HasanGreen bim software - dr. Nagham Ali Hasan
Green bim software - dr. Nagham Ali Hasannagham ali hasan
 
Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...
Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...
Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...Gurdal Ertek
 
climate-08-00072.pdf
climate-08-00072.pdfclimate-08-00072.pdf
climate-08-00072.pdfLilithlit
 
ArtificialCorona-InspiredOptimizationAlgorithmTheoreticalFoundationsAnalysisa...
ArtificialCorona-InspiredOptimizationAlgorithmTheoreticalFoundationsAnalysisa...ArtificialCorona-InspiredOptimizationAlgorithmTheoreticalFoundationsAnalysisa...
ArtificialCorona-InspiredOptimizationAlgorithmTheoreticalFoundationsAnalysisa...sravan66
 
Modeling of heat and moisture transfer in building using rlf method
Modeling of heat and moisture transfer in building using rlf methodModeling of heat and moisture transfer in building using rlf method
Modeling of heat and moisture transfer in building using rlf methodBasrah University for Oil and Gas
 
The Effect of Building Direction and Natural Airing on Yearly Power Utilization
The Effect of Building Direction and Natural Airing on Yearly Power UtilizationThe Effect of Building Direction and Natural Airing on Yearly Power Utilization
The Effect of Building Direction and Natural Airing on Yearly Power UtilizationYogeshIJTSRD
 
Functions of fuzzy logic based controllers used in smart building
Functions of fuzzy logic based controllers used in smart  buildingFunctions of fuzzy logic based controllers used in smart  building
Functions of fuzzy logic based controllers used in smart buildingIJECEIAES
 
Ensuring Thermal Comfort at Indoor Living Space in Building for Monsoon Climate
Ensuring Thermal Comfort at Indoor Living Space in Building for Monsoon ClimateEnsuring Thermal Comfort at Indoor Living Space in Building for Monsoon Climate
Ensuring Thermal Comfort at Indoor Living Space in Building for Monsoon ClimateEngr. Ananya Abduhu
 
Applications of Numerical Method
Applications of Numerical Method Applications of Numerical Method
Applications of Numerical Method MdOsmanAzizMinaj
 
Climatology Applied To Architecture: An Experimental Investigation about Inte...
Climatology Applied To Architecture: An Experimental Investigation about Inte...Climatology Applied To Architecture: An Experimental Investigation about Inte...
Climatology Applied To Architecture: An Experimental Investigation about Inte...IJERA Editor
 

Similar to Thermal_Comfort_Prediction_Accuracy_with_Machine_L.pdf (20)

Measuring Device for Human Comfort Sensation by Converting Fanger Formula Usi...
Measuring Device for Human Comfort Sensation by Converting Fanger Formula Usi...Measuring Device for Human Comfort Sensation by Converting Fanger Formula Usi...
Measuring Device for Human Comfort Sensation by Converting Fanger Formula Usi...
 
A novel hybrid modelling structure fabricated by using takagi sugeno fuzzy to...
A novel hybrid modelling structure fabricated by using takagi sugeno fuzzy to...A novel hybrid modelling structure fabricated by using takagi sugeno fuzzy to...
A novel hybrid modelling structure fabricated by using takagi sugeno fuzzy to...
 
V01 i030602
V01 i030602V01 i030602
V01 i030602
 
Constrained discrete model predictive control of a greenhouse system temperature
Constrained discrete model predictive control of a greenhouse system temperatureConstrained discrete model predictive control of a greenhouse system temperature
Constrained discrete model predictive control of a greenhouse system temperature
 
Review on the hvac system modeling types and the shortcomings of their applic...
Review on the hvac system modeling types and the shortcomings of their applic...Review on the hvac system modeling types and the shortcomings of their applic...
Review on the hvac system modeling types and the shortcomings of their applic...
 
Green Building Awareness HVAC
Green Building Awareness HVAC Green Building Awareness HVAC
Green Building Awareness HVAC
 
Environmental building design performance modelling and simulation
Environmental building design performance modelling and simulationEnvironmental building design performance modelling and simulation
Environmental building design performance modelling and simulation
 
Dynamic indoor thermal comfort model identification based on neural computing...
Dynamic indoor thermal comfort model identification based on neural computing...Dynamic indoor thermal comfort model identification based on neural computing...
Dynamic indoor thermal comfort model identification based on neural computing...
 
Assignment 1 - Application of Numerical Analysis.pdf
Assignment 1 - Application of Numerical Analysis.pdfAssignment 1 - Application of Numerical Analysis.pdf
Assignment 1 - Application of Numerical Analysis.pdf
 
Green bim software - dr. Nagham Ali Hasan
Green bim software - dr. Nagham Ali HasanGreen bim software - dr. Nagham Ali Hasan
Green bim software - dr. Nagham Ali Hasan
 
Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...
Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...
Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...
 
climate-08-00072.pdf
climate-08-00072.pdfclimate-08-00072.pdf
climate-08-00072.pdf
 
Anand2020
Anand2020Anand2020
Anand2020
 
ArtificialCorona-InspiredOptimizationAlgorithmTheoreticalFoundationsAnalysisa...
ArtificialCorona-InspiredOptimizationAlgorithmTheoreticalFoundationsAnalysisa...ArtificialCorona-InspiredOptimizationAlgorithmTheoreticalFoundationsAnalysisa...
ArtificialCorona-InspiredOptimizationAlgorithmTheoreticalFoundationsAnalysisa...
 
Modeling of heat and moisture transfer in building using rlf method
Modeling of heat and moisture transfer in building using rlf methodModeling of heat and moisture transfer in building using rlf method
Modeling of heat and moisture transfer in building using rlf method
 
The Effect of Building Direction and Natural Airing on Yearly Power Utilization
The Effect of Building Direction and Natural Airing on Yearly Power UtilizationThe Effect of Building Direction and Natural Airing on Yearly Power Utilization
The Effect of Building Direction and Natural Airing on Yearly Power Utilization
 
Functions of fuzzy logic based controllers used in smart building
Functions of fuzzy logic based controllers used in smart  buildingFunctions of fuzzy logic based controllers used in smart  building
Functions of fuzzy logic based controllers used in smart building
 
Ensuring Thermal Comfort at Indoor Living Space in Building for Monsoon Climate
Ensuring Thermal Comfort at Indoor Living Space in Building for Monsoon ClimateEnsuring Thermal Comfort at Indoor Living Space in Building for Monsoon Climate
Ensuring Thermal Comfort at Indoor Living Space in Building for Monsoon Climate
 
Applications of Numerical Method
Applications of Numerical Method Applications of Numerical Method
Applications of Numerical Method
 
Climatology Applied To Architecture: An Experimental Investigation about Inte...
Climatology Applied To Architecture: An Experimental Investigation about Inte...Climatology Applied To Architecture: An Experimental Investigation about Inte...
Climatology Applied To Architecture: An Experimental Investigation about Inte...
 

Recently uploaded

GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
chemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdfchemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdfTukamushabaBismark
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxMohamedFarag457087
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusNazaninKarimi6
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedDelhi Call girls
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxseri bangash
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Servicemonikaservice1
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai YoungDubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Youngkajalvid75
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptxryanrooker
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 

Recently uploaded (20)

GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
chemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdfchemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdf
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai YoungDubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 

Thermal_Comfort_Prediction_Accuracy_with_Machine_L.pdf

  • 1. Citation: Sibyan, H.; Svajlenka, J.; Hermawan, H.; Faqih, N.; Arrizqi, A.N. Thermal Comfort Prediction Accuracy with Machine Learning between Regression Analysis and Naïve Bayes Classifier. Sustainability 2022, 14, 15663. https://doi.org/ 10.3390/su142315663 Academic Editor: Hamid Khayyam Received: 20 September 2022 Accepted: 21 November 2022 Published: 24 November 2022 Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affil- iations. Copyright: © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https:// creativecommons.org/licenses/by/ 4.0/). sustainability Article Thermal Comfort Prediction Accuracy with Machine Learning between Regression Analysis and Naïve Bayes Classifier Hidayatus Sibyan 1,* , Jozef Svajlenka 2 , Hermawan Hermawan 3 , Nasyiin Faqih 4 and Annisa Nabila Arrizqi 5 1 Department of Informatics Engineering, Qur’anic Science University (Universitas Sains Al-Qur’an), Jl. Hasyim Asy’ari Km. 03, Wonosobo 56351, Indonesia 2 Department of Construction Technology and Management, Technical University of Košice, 042 00 Košice, Slovakia 3 Department of Architecture, Qur’anic Science University (Universitas Sains Al-Qur’an), Jl. Hasyim Asy’ari Km. 03, Wonosobo 56351, Indonesia 4 Department of Civil Engineering, Qur’anic Science University (Universitas Sains Al-Qur’an), Jl. Hasyim Asy’ari Km. 03, Wonosobo 56351, Indonesia 5 Department of Civil Engineering, Universitas Islam Indonesia, Jl. Kaliurang Km.14,5, Yogyakarta 55584, Indonesia * Correspondence: hsibyan@unsiq.ac.id; Tel.: +62-852-9104-1812 Abstract: Various data analysis methods can make thermal comfort prediction models. One method that is often used is multiple linear regression statistical analysis. Regression analysis needs to be checked for accuracy with other analytical methods. This study compares the making of a thermal comfort prediction model with regression analysis and naïve Bayes analysis. The research method used quantitative methods for data collection regarding thermal comfort. The thermal comfort variable, consisting of eight independent variables and one dependent variable, was measured at Wonosobo High School, Indonesia. The analysis to make the prediction model was carried out with two different analyses: multiple linear regression analysis and naïve Bayes analysis. The results show that naïve Bayes is more accurate than multiple linear regression analysis. Keywords: machine learning; thermal comfort; regression; naïve Bayes; prediction 1. Introduction The problem of energy and thermal comfort has become a global issue. Many studies have tried to solve energy problems by predicting thermal comfort with variations in the type and shape of the building. Thermal comfort predictions can reduce energy wastage in buildings [1]. Thermal comfort cannot be separated from the scale of thermal sensation, model, climate, and personal variables. Several studies have used a variety of thermal sensation scales. The model is the result of an analysis of the thermal comfort variable, which is used to predict human thermal comfort [2]. Climatic variables associated with the thermal sensation of residents in naturally ventilated nursing homes can influence the mathematical model. Regions with different climates will also produce different responses, so research areas must pay attention to hot or cold areas [3]. A user’s thermal perception is one of the main variables in thermal comfort research. Field tests can be used in thermal comfort research Questionnaire based on the thermal perception standard from ASHRAE 55. The analysis uses regression analysis, producing a mathematical model or equation of thermal comfort [4]. Airflow is one of the factors of thermal comfort. Human perception of airflow is also essential to know in achieving thermal comfort. Using the laboratory as an experimental room in research can obtain results that can be accounted for. The use of regression analysis can be applied to airflow research using an experimental room [5]. Sustainability 2022, 14, 15663. https://doi.org/10.3390/su142315663 https://www.mdpi.com/journal/sustainability
  • 2. Sustainability 2022, 14, 15663 2 of 18 The thermal comfort research model aims to predict building occupants’ thermal comfort. The prediction model generated from research is used as a standard for mak- ing building designs. The current prediction model uses an adaptive thermal comfort approach. Research verifying the adaptive thermal comfort model has been conducted in four Brazilian cities. The study used the Preferred Reporting Items for Systematic Review and Meta-Analysis (PRISMA) method and found that the variation in the verified adaptive thermal comfort model was more than 90% for the four cities in Brazil [6]. Predictively modeling personal thermal comfort has become a trending topic in the improvement of human comfort in rooms. Thermal comfort is closely related to the design and performance of building systems, especially in sustainable and intelligent buildings [7]. Thermal comfort modeling is manifold. A basis for modeling that is currently developing is the use of stochastic algorithms and variables [8] The model’s accuracy in predicting the thermal comfort of occupants is an important aspect that must be continuously developed. Accurate models that predict the right results are convincing [9]. Linear regression is also used in outdoor thermal comfort studies. Thermal perception and other variables such as air temperature, wind, and sun exposure have been analyzed using linear regression. The results showed a predictive model of thermal comfort. In addition, equations about air temperature have also been found based on wind and sun exposure [10]. In outdoor thermal comfort research, sun exposure is a factor that is more considered than in indoor thermal comfort research. The temperature of solar radiation in the room is not too significant compared with the temperature of solar radiation outside. The mathematical model of thermal comfort in the interior still includes the average solar radiation temperature according to the thermal comfort factor that has been formulated [11]. Methods in thermal comfort research include simulation with software, modeling in the lab, and field testing. Field tests are the most widely used in thermal comfort research. The measurement of the thermal comfort variable also sometimes coincides with the measurement of the acoustic and visual comfort variables. A study’s results will show the relationship between the variables of thermal, acoustic, and visual comfort [12]. The experimental method is also one of the methods used in thermal comfort research. Some studies make the experimental space a tool to test a model. The developing technology makes the experimental space more varied [13]. Simulation methods are often used to validate a model found in research. Simulation using software is widely used in model validation. One of the programs used for simulation is ENVI-NET. Simulations are often combined with field measurements to obtain more valid results, with both methods carried out in a study [14]. Measurements in the field coupled with simulations can use building information modeling (BIM). Computational fluid dynamic (CFD) analysis is often one of the analytical methods in BIM [15]. Several studies have combined the two methods, but research comparing the two methods has not been widely carried out. The development of the database field has led to the emergence of increasingly sophis- ticated data analysis tools. Machine learning algorithms are a new approach to analyzing thermal or visual comfort data. Machine learning algorithms as analytical tools are widely used for research on human comfort in buildings [16]. Machine learning algorithm data analysis methods continue to be developed and used in various types of buildings. Numer- ical computing is one of the strengths of machine learning algorithms [17]. The algorithm can also be used in urban heat island research. Research can produce effective strategies to reduce the urban heat island effect by avoiding overcrowding in infrastructure develop- ment; increasing plantations, waterbodies, and roof gardens; and using a white roof color in construction. Research findings will enable urban planners, policymakers, and local governments to achieve environmentally friendly outcomes [18]. Machine learning (ML)- based building models have gained popularity in building predictive control (MPC) models for building energy management applications. However, ML-based building models are usually nonlinear in capturing building dynamics, which causes a high computational load
  • 3. Sustainability 2022, 14, 15663 3 of 18 for MPC models, prohibiting their application in real-time building control [19]. Analysis of the hot water usage model for the thermal comfort of occupants can also be performed with machine learning. The created model provides control performance for occupant adaptation [20]. Developing convenience models using machine learning is inevitable with database development and automated calculations. Determining the analytical method is essential to finding an accurate thermal comfort model. The purpose of this study was to compare the method of multiple linear regression analysis and naïve Bayes in making an accurate thermal comfort model. Measuring thermal comfort can be performed with objective and subjective measure- ments. Objective measurements use a thermal measuring instrument that measures air temperature, average solar radiation temperature, wind speed, and tire humidity. Subjec- tive measurements include filling out a thermal sensation questionnaire from ASHRAE by having the respondent sit for 15 min within the research object [21]. Gender is an important factor in thermal comfort. The selection of respondents needs to consider gender. Different sexes will produce different thermal sensations [22]. Thermal comfort measurements have PMV (predicted mean vote) and PPD (predicted percent dissatisfied) indicators. These two aspects are the keywords in the analysis of thermal comfort [23]. Thermal comfort prediction with machine learning is still being developed. Prediction can be used to develop the science of building design. Not many studies predict thermal comfort using machine learning [24]. Machine learning methods have various forms, often using mathematical calculations and computational fluid dynamics (CFD). The use of various machine learning methods means that research results vary [17]. Thermal comfort prediction using a neural network has also been carried out. The thermoelectric airduct (TE-AD) cooling system is used to predict air temperature, PMV, and PPD. The prediction model is accurate in predicting thermal comfort [25]. The development of artificial neural networks for machine learning is still being carried out to find the right predictions [26]. Machine learning in predicting indoor air quality is also still being developed [27]. Thermal comfort prediction can also combine methods with thermal comfort variables [28]. Regression analysis is widely used in thermal comfort research to find a predictive model of thermal comfort. Currently, there are many models of thermal comfort that use regression analysis. Many comparisons of machine learning methods to find predictive models of thermal comfort have been carried out. One of the machine learning methods is naïve Bayes. A comparison between naïve Bayes and other methods has also been carried out [29]. Naïve Bayes can be an alternative to machine learning that does not require complicated calculations. Research that compares the regression method with naïve Bayes to form a predictive model of thermal comfort needs to be performed so that it is known that the method is not complicated and that it produces a better predictive model. Clothing is one of the variables that affects thermal comfort. The majority of students in Wonosobo, Indonesia, wear closed clothes in carrying out learning activities at school. Currently, there are few predictive models with respondents who wear closed clothes and have a religious culture. Thus, this study has the novelty of finding a predictive thermal comfort model using closed clothing variables in cold areas. This research can contribute to computer science to find predictive models that are simple and accurate. Contributions to architecture can be used as a basis for architectural design by predicting thermal comfort in naturally ventilated buildings. Some of the related works are shown in Table 1, below.
  • 4. Sustainability 2022, 14, 15663 4 of 18 Table 1. Related work. Author Related Work J. J. Aguilera, J. Toftum, and O. Berk Kazanci (2019) The study “Predicting personal thermal preferences based on data-driven methods” predicts thermal comfort by comparing artificial neural network (ANN), naïve Bayes (NB), and fuzzy logic (FL) machine learning algorithms. The results showed that all methods performed well, with a 70% probability of guessing correctly [30]. P. I. Benito, M. A. Sebastián, and C. González-Gaya (2021) “Study and Application of Industrial Thermal Comfort Parameters Using Bayesian Inference Techniques” focuses on using Bayesian analysis for thermal comfort in the industry. It compared the results with the linear regression method used for thermal comfort. Bayesian analysis has a better ability to develop intelligent and thermally comfortable systems [29]. B. Yang et al. (2022) The study “Comparison of models for predicting individual winter thermal comfort based on machine learning algorithms” aims to develop a comparison of thermal comfort models based on skin temperature and environmental factors. The comparative study includes four models: support vector machine, decision tree, ensemble algorithms, and K-nearest neighbor. The thermal comfort prediction accuracy rate reaches 95.8% [31]. R. Zhang, D. Liu, and L. Shi (2022) The study “Thermal-comfort optimization design method for semi-outdoor stadium using machine learning” aims to reveal the relationship between stadium shape and thermal performance using an artificial neural network approach and genetic algorithms. The simulation results show that the simulation is close to the actual measurement and can be used for stadium optimization, which can be increased by 8.96% [17]. 2. Materials and Methods This research compares two data analysis methods, regression analysis and naïve Bayes. The data used results from the measurement of thermal comfort in the field. Respon- dents were students at two private high schools in Kejajar District, Wonosobo Regency. The variables used are gender, age, height, weight, temperature, globe temperature, humidity, velocity, and thermal sensation vote (TSV). The survey was conducted on two measurement days in the morning, afternoon, and evening. Temperature and humidity were measured using a measuring tool with the Extect brand. We measured the globe temperature using a black copper ball with a diameter of 15 cm. Respondents were asked to wait for 15 min when the initial measurements were taken. Respondents were high school students, so respondents had gone through adaptation to the room. The research subjects involved were male and female respondents. In thermal comfort research, it is possible that there are differences in the results of thermal sensation between men and women, although there are studies that say there are not many differ- ences between men and women [32]. Research subjects were not selected using sampling. Responses were taken from all students who became the object of the research. The number of high school students was 252, and all of them were used as research subjects. Data analysis used regression analysis with SPSS Statistics 25 and weka 3.8.6. The results of data analysis were compared for accuracy so that more accurate analytical tools could be found. Regression analysis was performed by making a mathematical equation model as follows: Y = α + 8 ∑ i=1 βiXi (1) where X1: gender, X2: age, X3: height, X4: weight, X5: temperature, X6: globe_temperature, X7: relative_humidity, and X8: velocity.
  • 5. Sustainability 2022, 14, 15663 5 of 18 Data analysis using naïve Bayes requires a reasonably long process starting with determining the training data that will be the test data. The calculation is performed by calculating the class probability P(Y), the probability of each P(X) criterion, and the final probability. Naïve Bayes analysis will produce an output from Y or TSV predictions. The resulting output can predict the TSV generated by building occupants with the value of the independent variable set (Figure 1). Y 𝛼 𝛽 X (1) where X1: gender, X2: age, X3: height, X4: weight, X5: temperature, X6: globe_temperature, X7: relative_humidity, and X8: velocity. Data analysis using naïve Bayes requires a reasonably long process starting with de- termining the training data that will be the test data. The calculation is performed by cal- culating the class probability P(Y), the probability of each P(X) criterion, and the final probability. Naïve Bayes analysis will produce an output from Y or TSV predictions. The resulting output can predict the TSV generated by building occupants with the value of the independent variable set (Figure 1). Figure 1. Naïve Bayes flowchart. 3. Results The data obtained amounted to 252 datasets. Respondents were 252 high school stu- dents. Female respondents wear hijab school uniforms, while men do not wear head cov- erings. Female students wear long and long-sleeved skirts, and male students wear shirts and trousers. All students, both girls and boys, wear shoes and socks. The activities they perform are sitting writing and sitting listening for 7–16 h. The total amount of data from eight independent variables and one dependent variable is 2268. Respondents consisted Start training data test data Calculate Class Probability Calculate Class P(Y) Probability Calculate the Probability of Each Criteria P(X) Calculate Final Probability P(X|Y) = P(X|Y).P(Y) Prediction Result Output Y(TSV) End Figure 1. Naïve Bayes flowchart. 3. Results The data obtained amounted to 252 datasets. Respondents were 252 high school students. Female respondents wear hijab school uniforms, while men do not wear head coverings. Female students wear long and long-sleeved skirts, and male students wear shirts and trousers. All students, both girls and boys, wear shoes and socks. The activities they perform are sitting writing and sitting listening for 7–16 h. The total amount of data from eight independent variables and one dependent variable is 2268. Respondents consisted of 44% men and 56% women. Respondent ages ranged from 14 to 19 years, with an average of 16.3 years. The respondents’ heights were between 140 and 177 cm, with an average of 156 cm. The respondents’ body weights were between 30 and 82 kg, averaging 47.9 kg. The temperature in the class was between 22 and 25.5 ◦C, with an average of 23.75 ◦C. Globe temperature in the class was between 23 and 26.5 ◦C, with an average of
  • 6. Sustainability 2022, 14, 15663 6 of 18 24.6 ◦C. Humidity was between 60 and 80%, with an average of 68.63%. Velocity did not involve too much movement, so it shows more zeros. The most significant velocity was 1 m/s. The thermal sensation votes obtained ranged from −3 (very cold) to +3 (very hot), with an average of −0.89 (near cool). Data measurement was performed by bringing the measuring instrument closer to the respondent by placing the measuring instrument on the classroom table (Figure 2). of 44% men and 56% women. Respondent ages ranged from 14 to 19 years, with an aver- age of 16.3 years. The respondents’ heights were between 140 and 177 cm, with an average of 156 cm. The respondents’ body weights were between 30 and 82 kg, averaging 47.9 kg. The temperature in the class was between 22 and 25.5 °C, with an average of 23.75 °C. Globe temperature in the class was between 23 and 26.5 °C, with an average of 24.6 °C. Humidity was between 60 and 80%, with an average of 68.63%. Velocity did not involve too much movement, so it shows more zeros. The most significant velocity was 1 m/s. The thermal sensation votes obtained ranged from −3 (very cold) to +3 (very hot), with an av- erage of −0.89 (near cool). Data measurement was performed by bringing the measuring instrument closer to the respondent by placing the measuring instrument on the class- room table (Figure 2). Figure 2. Data measuring. Data analysis using multiple linear regression has several data test requirements, namely, validity and reliability. In addition, the classical assumption test also needs to be carried out to obtain data that can be used for multiple linear regression analysis. Analysis using SPSS software resulted in a large amount of valid test data. The normality test was part of the regression analysis and obtained a model that meets the assumption of nor- mality (Figure 3). Figure 3. Classic assumption test. Multiple linear regression data analysis using SPSS produced a value of unstandard- ized coefficients that can be used as the coefficient of the prediction model. Some of the resulting values looked insignificant. This value indicates that the influence of the inde- pendent variable on the dependent is less potent (Table 2). This value can still be used in predicting thermal comfort because several models from other studies also obtained the same results. Figure 2. Data measuring. Data analysis using multiple linear regression has several data test requirements, namely, validity and reliability. In addition, the classical assumption test also needs to be carried out to obtain data that can be used for multiple linear regression analysis. Analysis using SPSS software resulted in a large amount of valid test data. The normality test was part of the regression analysis and obtained a model that meets the assumption of normality (Figure 3). Figure 3. Classic assumption test. Multiple linear regression data analysis using SPSS produced a value of unstandard- ized coefficients that can be used as the coefficient of the prediction model. Some of the resulting values looked insignificant. This value indicates that the influence of the inde- pendent variable on the dependent is less potent (Table 2). This value can still be used in predicting thermal comfort because several models from other studies also obtained the same results.
  • 7. Sustainability 2022, 14, 15663 7 of 18 Table 2. Regression Value. Coefficients Model Unstandardized Coefficients Standardized Coefficients t Sig. Correlations Collinearity Statistics B Std. Error Beta Zero- Order Partial Part Tolerance VIF 1 (Constant) −8.796 3.231 −2.723 0.007 Gender −0.308 0.182 −0.138 −1.696 0.091 −0.179 −0.108 −0.097 0.493 2.030 Age −0.093 0.074 −0.078 −1.266 0.207 −0.226 −0.081 −0.072 0.854 1.171 Height −0.015 0.013 −0.106 −1.235 0.218 0.083 −0.079 −0.070 0.437 2.289 Weight 0.005 0.012 0.030 0.439 0.661 0.055 0.028 0.025 0.705 1.419 Temperature 0.265 0.124 0.238 2.139 0.033 0.359 0.136 0.122 0.263 3.808 Globe_temperature 0.183 0.119 0.182 1.534 0.126 0.334 0.098 0.087 0.231 4.332 Humidity 0.018 0.011 0.102 1.588 0.114 0.014 0.101 0.090 0.784 1.275 Velocity 1.420 0.433 0.198 3.282 0.001 0.144 0.206 0.187 0.891 1.123 Dependent variable: TSV. Based on the value of the constants and regression coefficients obtained from the regression analysis, it is known that the multiple linear regression equation based equation (Equation (1)) is as follows: Y = (−8.796) + (−0.308) × X1 + (−0.093) × X2 + (−0.015) × X3 + 0.005 × X4 + 0.265 × X5 + 0.183 × X6 + 0.018 × X7 + 1.420 × X8 Several variables show a high significance value. This indicates that several indepen- dent variables did not strongly influence the dependent variable. This is possible because of the type of clothing worn by the respondents. The comfortable air temperature was calculated when Y = 0 (comfortable thermal sensation condition), and a comfortable air temperature of 33.19 ◦C was obtained. The comfortable air temperature produced was higher than the average comfortable air tem- perature in the tropics, which is 27 ◦C. This is possible because the closed clothes worn by respondents are hijab for women and trousers for men. Based on Equation (1) that was generated, a prediction of thermal comfort can be made for the sample data based on Table 3 as follows: Y = (−8.796) + (−0.308) × 1 + (−0.093) × 18 + (−0.015) × 160 + 0.005 × 46 + 0.265 × 22 + 0.183 × 24 + 0.018 × 62 + 1.420 × 0 Y = −1.61 ≈ −2 Table 3. Sample of testing data for prediction. Gender Age Height Weight Temperature Globe_Temperature Relative_Humidity Velocity TSV 1 18 160 46 22 24 62 0 ? Naïve Bayes analysis begins with determining the training data in as many as 252 datasets according to the data from the measurement results. The variables used in predicting thermal comfort (TSV) are the same as the regression analysis, namely, gender, age, height, weight, temperature, globe temperature, relative humidity, and velocity. The training data is used as test data, which is the basis of our calculations. Based on the training data used, there are 7 TSV classes, namely, class −3 with 3 data, class −2 with 87 data, class 1 with 81 data, class 0 with 50 data, class 1 with 24 data, class 2 with 6 data, and class 3 with as much as 1 data. TSV class classification data in the Weka software is shown in Figure 4.
  • 8. Sustainability 2022, 14, 15663 8 of 18 Sustainability 2022, 14, x FOR PEER REVIEW 8 of 18 Figure 4. TSV class. The probability value for each criterion is shown in Appendix A. In this test data, a thermal comfort (TSV) prediction can be made if the data used are as follows: gender: 1, age: 18, height: 160, weight: 46, temperature: 22, globe temperature: 24, relative humidity: 62, and velocity: 0. Based on the test data in Table 3, a prediction calculation can be made for the data above. Based on the naïve Bayes algorithm calculations based on Table 4, the highest value is 0.000026 for class −2. Thus, the prediction of the test data is class −2. Table 4. Prediction result. Variable Probability −3 −2 −1 0 1 2 3 3 87 81 50 24 6 1 Gender 0.00 0.2759 0.59 0.52 0.33 0.83 1.00 Age 0.33 0.1494 0.09 0.16 0.00 0.00 0.00 Height 0.00 0.0575 0.10 0.04 0.04 0.00 0.00 Weight 0.00 0.0460 0.00 0.04 0.04 0.00 0.00 Temperature 0.00 0.1149 0.04 0.00 0.00 0.00 0.00 Globe_temperature 0.00 0.2874 0.12 0.02 0.08 0.00 0.00 Relative_humidity 0.00 0.0805 0.12 0.06 0.00 0.00 0.00 Velocity 1.00 1.0000 0.94 0.98 0.88 1.00 1.00 Sum 0 0.000026 0 0 0 0 0 Prediction Result −2 The accuracy of the prediction results can be calculated based on the confusion ma- trix. From a total of 252 available data, 0 data in class A were correctly predicted as class A (−3), and 3 data were correctly predicted as class one. In total, 72 data in class B were correctly predicted as class B (−2), and 15 data were correctly predicted as class B. In total, 49 data in class C were correctly predicted as class C (−1), and 32 data were predicted incorrectly as class C. In total, 29 data in class D were correctly predicted as class D (0), and 21 data were predicted incorrectly as class D. In total, 16 data in class E were correctly predicted as class E (1), and 8 data were predicted incorrectly as class E. In total, three data in class F were correctly predicted as class F (2), and three data were incorrectly predicted as class F. Zero data in class G were correctly predicted as class G (3), and one datum was predicted incorrectly as class G. 3 87 81 50 24 6 1 -3 -2 -1 0 1 2 3 Figure 4. TSV class. The probability value for each criterion is shown in Appendix A. In this test data, a thermal comfort (TSV) prediction can be made if the data used are as follows: gender: 1, age: 18, height: 160, weight: 46, temperature: 22, globe temperature: 24, relative humidity: 62, and velocity: 0. Based on the test data in Table 3, a prediction calculation can be made for the data above. Based on the naïve Bayes algorithm calculations based on Table 4, the highest value is 0.000026 for class −2. Thus, the prediction of the test data is class −2. Table 4. Prediction result. Variable Probability −3 −2 −1 0 1 2 3 3 87 81 50 24 6 1 Gender 0.00 0.2759 0.59 0.52 0.33 0.83 1.00 Age 0.33 0.1494 0.09 0.16 0.00 0.00 0.00 Height 0.00 0.0575 0.10 0.04 0.04 0.00 0.00 Weight 0.00 0.0460 0.00 0.04 0.04 0.00 0.00 Temperature 0.00 0.1149 0.04 0.00 0.00 0.00 0.00 Globe_temperature 0.00 0.2874 0.12 0.02 0.08 0.00 0.00 Relative_humidity 0.00 0.0805 0.12 0.06 0.00 0.00 0.00 Velocity 1.00 1.0000 0.94 0.98 0.88 1.00 1.00 Sum 0 0.000026 0 0 0 0 0 Prediction Result −2 The accuracy of the prediction results can be calculated based on the confusion matrix. From a total of 252 available data, 0 data in class A were correctly predicted as class A (−3), and 3 data were correctly predicted as class one. In total, 72 data in class B were correctly predicted as class B (−2), and 15 data were correctly predicted as class B. In total, 49 data in class C were correctly predicted as class C (−1), and 32 data were predicted incorrectly as class C. In total, 29 data in class D were correctly predicted as class D (0), and 21 data were predicted incorrectly as class D. In total, 16 data in class E were correctly predicted as class E (1), and 8 data were predicted incorrectly as class E. In total, three data in class F were correctly predicted as class F (2), and three data were incorrectly predicted as class F.
  • 9. Sustainability 2022, 14, 15663 9 of 18 Zero data in class G were correctly predicted as class G (3), and one datum was predicted incorrectly as class G. The results show that the number of correctly predicted data (correctly classified instances) was 169 data, or 67.06%, while the incorrectly predicted results (incorrectly classified instances) amounted to 83 data, or 32.94%. The comparison between regression analysis and naïve Bayes seen from the number of TSVs shows that the regression analysis found the highest TSV predictions in the cool range (−1), with as many as 147 data. In the naïve Bayes prediction, the highest TSV was found in the cold range (−2), as much as 104. The highest TSV difference showed a difference in results between the regression analysis and naïve Bayes (Figure 5). Sustainability 2022, 14, x FOR PEER REVIEW 9 of 18 The results show that the number of correctly predicted data (correctly classified in- stances) was 169 data, or 67.06%, while the incorrectly predicted results (incorrectly clas- sified instances) amounted to 83 data, or 32.94%. The comparison between regression analysis and naïve Bayes seen from the number of TSVs shows that the regression analysis found the highest TSV predictions in the cool range (-1), with as many as 147 data. In the naïve Bayes prediction, the highest TSV was found in the cold range (-2), as much as 104. The highest TSV difference showed a differ- ence in results between the regression analysis and naïve Bayes (Figure 5). Figure 5. Differences in prediction results. Higher data variation was found in the results of naïve Bayes analysis, which could predict all TSV categories, while the data generated using linear regression look clustered at a value of 0 to −2. If the prediction results using linear regression and the naïve Bayes method are compared with the actual data in the field, then the naïve Bayes method has a better level of accuracy because the results obtained from naïve Bayes can approach the actual data in the field. The value generated from naïve Bayes looks close to the actual data generated in field testing. The value using regression analysis shows the greater the value, the higher the value (Figure 6). 0 20 40 60 80 100 120 140 160 ─3 ─2 ─1 0 1 2 3 Prediction Result Linear Regression Naïve Bayes Figure 5. Differences in prediction results. Higher data variation was found in the results of naïve Bayes analysis, which could predict all TSV categories, while the data generated using linear regression look clustered at a value of 0 to −2. If the prediction results using linear regression and the naïve Bayes method are compared with the actual data in the field, then the naïve Bayes method has a better level of accuracy because the results obtained from naïve Bayes can approach the actual data in the field. The value generated from naïve Bayes looks close to the actual data generated in field testing. The value using regression analysis shows the greater the value, the higher the value (Figure 6). Sustainability 2022, 14, x FOR PEER REVIEW 10 of 18 Figure 6. Prediction result comparison between actual TSV, linear regression, and naïve Bayes. By comparing the prediction results with the initial data, the level of accuracy of both methods can be found. The linear regression analysis has data accuracy in as many as 84 of 252 datasets, and naïve Bayes analysis makes correct predictions in 181 out of 252 da- tasets. The accuracy of linear regression analysis is 33%, while the naïve Bayes analysis is 67%. Prediction results show that the accuracy of naïve Bayes is higher than the multiple linear regression analysis (Table 5). -20 0 20 40 60 80 100 120 140 160 -3 -2 -1 0 1 2 3 Prediction Result Comparison Actual TSV Linear Regression Naïve Bayes Figure 6. Prediction result comparison between actual TSV, linear regression, and naïve Bayes.
  • 10. Sustainability 2022, 14, 15663 10 of 18 By comparing the prediction results with the initial data, the level of accuracy of both methods can be found. The linear regression analysis has data accuracy in as many as 84 of 252 datasets, and naïve Bayes analysis makes correct predictions in 181 out of 252 datasets. The accuracy of linear regression analysis is 33%, while the naïve Bayes analysis is 67%. Prediction results show that the accuracy of naïve Bayes is higher than the multiple linear regression analysis (Table 5). Table 5. Comparison of data analysis accuracies. Method Correct Prediction Count of Data Accuracy Linear Regression 84 252 33% Naïve Bayes 169 252 67% 4. Discussion Thermal comfort data obtained based on age are still relevant as a basis for formulating a thermal comfort model. Age is essential in thermal comfort, and other studies have analyzed elderly respondents. Thermal comfort models built with different age data will produce different findings. Research in Tibet in winter and summer with elderly respondents found differences in the research results regarding the acceptance of thermal comfort [33]. Individual thermal comfort response is inseparable from the microclimate of each region. Solar radiation influences the individual’s thermal comfort response with an influence on the average solar radiation temperature. The air content also influences the thermal comfort response of each individual in an area [34]. Modeling with thermal comfort data more often uses regression analysis. The use of machine learning has now grown so that the methods applied are more varied [35]. The use of regression analysis is still carried out in modeling thermal comfort in outdoor spaces. Research with regression analysis is still accurate in evaluating outdoor thermal comfort by including physiological parameters. The model found that it can provide a design basis for creating thermally comfortable open spaces in urban parks [36]. A comparison of methods in modeling thermal comfort using thermal sensation vote (TSV) was carried out, but it still needs to be used to find the most accurate method. TSV is one of the appropriate variables in accurately predicting thermal comfort at a rate of 95.8%. The Bayesian optimization technique is considered an accurate method for making prediction models. Algorithms in Bayesian optimization techniques can predict individual thermal comfort [31]. The results of other studies show that linear discriminant analysis (LDA) is better than linear regression (LA). Several algorithms show different results in different cases. These findings can contribute to studying subjective and objective feelings of indoor thermal comfort in public buildings, thereby guiding architectural design, the intelligent control of ventilation systems, and realizing human–building interaction interfaces [37]. Naïve Bayes is better than regression analysis. The results showed that naïve Bayes has a calculation accuracy of 67%. Another study compared naïve Bayes with artificial neural network (ANN), fuzzy logic (FL), and PMV-based algorithms. Other results show that the naïve Bayes calculation provides a prediction accuracy of 73% [30]. The difference compared with the research conducted in other studies is 1%. Another study comparing several machine learning methods in finding predictions of city thermal comfort found that naïve Bayes resulted in a data accuracy of 40.43% [38]. The results of other studies are quite different from the research that has been undertaken. Thermal comfort data in urban areas may differ from indoor data. Research on energy consumption savings that compares naïve Bayes and regression has also found results that are not different from the research on thermal comfort that has been carried out. The results of the study of energy consumption savings with regression resulted in a data accuracy of 41.43% and a naïve Bayes accuracy of 73% [39]. The results of other research regressions compared with the research that has been performed have a difference of 41.43 minus 33%, which is 8.43%. The difference compared with naïve Bayes accuracy is 1%.
  • 11. Sustainability 2022, 14, 15663 11 of 18 The prediction results obtained from linear regression and naïve Bayes are not precise but instead are based on the closest value [40]. In linear regression, the results are obtained by rounding the final grade to the nearest side of the class, while the results from naïve Bayes are obtained from the class that has the largest final score. PMV (predicted mean vote) and PPD (predicted percentage of dissatisfied) values were obtained using the CBE Thermal Comfort Tool software from https://comfort.cbe. berkeley.edu/ (accessed on 3 October 2022). Thermal variable data in the form of air tem- perature, average solar radiation temperature (globe temperature), wind speed, humidity, metabolism, and respondent activity were entered into the software, and PMV and PPD values were obtained. A total of 252 respondents calculated their PMV and PPD. The distribution of the PMV values was mostly in the range of −0.5 to −1, a value that indicates that a respondent is almost cold (score: −1). In another value, respondents seem to obtain a PMV value of 0.5, which indicates that some respondents feel close to warm (score: 1). The overall PMV results show on Figure 7 that the respondents are still not too cold or too hot. that the naïve Bayes calculation provides a prediction accuracy of 73% [30]. The difference compared with the research conducted in other studies is 1%. Another study comparing several machine learning methods in finding predictions of city thermal comfort found that naïve Bayes resulted in a data accuracy of 40.43% [38]. The results of other studies are quite different from the research that has been undertaken. Thermal comfort data in urban areas may differ from indoor data. Research on energy consumption savings that com- pares naïve Bayes and regression has also found results that are not different from the research on thermal comfort that has been carried out. The results of the study of energy consumption savings with regression resulted in a data accuracy of 41.43% and a naïve Bayes accuracy of 73% [39]. The results of other research regressions compared with the research that has been performed have a difference of 41.43 minus 33%, which is 8.43%. The difference compared with naïve Bayes accuracy is 1%. The prediction results obtained from linear regression and naïve Bayes are not pre- cise but instead are based on the closest value [40]. In linear regression, the results are obtained by rounding the final grade to the nearest side of the class, while the results from naïve Bayes are obtained from the class that has the largest final score. PMV (predicted mean vote) and PPD (predicted percentage of dissatisfied) values were obtained using the CBE Thermal Comfort Tool software from https://com- fort.cbe.berkeley.edu/ (accessed on 3 October 2022). Thermal variable data in the form of air temperature, average solar radiation temperature (globe temperature), wind speed, humidity, metabolism, and respondent activity were entered into the software, and PMV and PPD values were obtained. A total of 252 respondents calculated their PMV and PPD. The distribution of the PMV values was mostly in the range of −0.5 to −1, a value that indicates that a respondent is almost cold (score: −1). In another value, respondents seem to obtain a PMV value of 0.5, which indicates that some respondents feel close to warm (score: 1). The overall PMV results show on Figure 7 that the respondents are still not too cold or too hot. Figure 7. PMV score. The highest PPD value produced by the respondents reached 19%. The minimum value is 5%, and the average PPD produced is 9%. Not many respondents reached the 19%value. The PPD value generated using the software from https://comfort.cbe.berke- ley.edu/software (accessed on 3 October 2022) shows that respondents are predicted to still be able to accept the existing thermal conditions. The PPD value is still below 25%, which means that respondents are still relatively comfortable with the existing thermal conditions (Figure 8). -1 -0.5 0 0.5 1 1 11 21 31 41 51 61 71 81 91 101 111 121 131 141 151 161 171 181 191 201 211 221 231 241 251 PMV Respondent Figure 7. PMV score. The highest PPD value produced by the respondents reached 19%. The minimum value is 5%, and the average PPD produced is 9%. Not many respondents reached the 19%value. The PPD value generated using the software from https://comfort.cbe.berkeley. edu/software (accessed on 3 October 2022) shows that respondents are predicted to still be able to accept the existing thermal conditions. The PPD value is still below 25%, which means that respondents are still relatively comfortable with the existing thermal conditions (Figure 8). Sustainability 2022, 14, x FOR PEER REVIEW 12 of 18 Figure 8. PPD score. 5. Conclusions We studied the use of multiple linear regression analysis as a model for predicting thermal comfort. Predicting thermal comfort using a regression model in a study at Won- osobo High School showed results with a better range of coolness. Naïve Bayes analysis is one of the analytical alternatives classified as machine learning. The use of machine learning in thermal comfort is important because the analysis is expected to have accurate predictions. Using naïve Bayes analysis in thermal comfort research at Wonosobo Senior High School, Indonesia, we found differences compared with research results using mul- tiple linear regression analysis. The difference in prediction results shows that naïve Bayes analysis has a cooler TSV result than multiple linear regression analysis. The level of ac- curacy in predictions using naïve Bayes is higher than the multiple linear regression anal- 0% 2% 4% 6% 8% 10% 12% 14% 16% 18% 20% 0 50 100 150 200 250 300 PPD Respondent Figure 8. PPD score. 5. Conclusions We studied the use of multiple linear regression analysis as a model for predicting thermal comfort. Predicting thermal comfort using a regression model in a study at Wonosobo High School showed results with a better range of coolness. Naïve Bayes
  • 12. Sustainability 2022, 14, 15663 12 of 18 analysis is one of the analytical alternatives classified as machine learning. The use of machine learning in thermal comfort is important because the analysis is expected to have accurate predictions. Using naïve Bayes analysis in thermal comfort research at Wonosobo Senior High School, Indonesia, we found differences compared with research results using multiple linear regression analysis. The difference in prediction results shows that naïve Bayes analysis has a cooler TSV result than multiple linear regression analysis. The level of accuracy in predictions using naïve Bayes is higher than the multiple linear regression analysis method. A comparison between the predictions of he two analytical methods is not too different, so it is still possible to use both methods in predicting the thermal comfort of building occupants. Author Contributions: Conceptualization, H.H.; Methodology, H.H.; Validation, H.S.; Data curation, H.S.; Writing—original draft, N.F.; Writing—review & editing, J.S.; Visualization, A.N.A. All authors have read and agreed to the published version of the manuscript. Funding: This research received no external funding. Conflicts of Interest: The authors declare no conflict of interest. Appendix A Table A1. Gender criteria probability. Gender Count Probability −3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3 1 0 24 48 26 8 5 1 0 0.28 0.59 0.52 0.33 0.83 1 2 3 63 33 24 16 1 0 1 0.72 0.41 0.48 0.67 0.17 0 3 87 81 50 24 6 1 Table A2. Age criteria probability. Age Count Probability −3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3 14 0 0 1 0 0 0 0 0 0 0.01 0 0 0 0 15 0 9 16 10 7 3 0 0 0.1 0.2 0.2 0.29 0.5 0 16 0 29 36 18 12 2 0 0 0.33 0.44 0.36 0.5 0.33 0 17 2 35 21 14 5 1 1 0.67 0.4 0.26 0.28 0.21 0.17 1 18 1 13 7 8 0 0 0 0.33 0.15 0.09 0.16 0 0 0 19 0 1 0 0 0 0 0 0 0.01 0 0 0 0 0 3 87 81 50 24 6 1
  • 13. Sustainability 2022, 14, 15663 13 of 18 Table A3. Height criteria probability. Height Count Probability −3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3 140 0 0 1 1 0 0 0 0 0 0.01 0.02 0 0 0 142 0 0 1 0 0 0 0 0 0 0.01 0 0 0 0 143 0 5 2 1 0 0 0 0 0.06 0.02 0.02 0 0 0 144 0 2 1 0 0 0 0 0 0.02 0.01 0 0 0 0 145 0 3 0 0 1 0 0 0 0.03 0 0 0.04 0 0 146 0 4 2 0 0 0 0 0 0.05 0.02 0 0 0 0 147 0 4 2 1 1 0 0 0 0.05 0.02 0.02 0.04 0 0 148 0 3 1 0 1 0 0 0 0.03 0.01 0 0.04 0 0 149 0 1 2 1 1 0 0 0 0.01 0.02 0.02 0.04 0 0 150 2 10 6 6 2 0 1 0.67 0.11 0.07 0.12 0.08 0 1 151 0 5 2 2 2 1 0 0 0.06 0.02 0.04 0.08 0.17 0 152 0 3 2 1 2 0 0 0 0.03 0.02 0.02 0.08 0 0 153 0 6 3 3 2 0 0 0 0.07 0.04 0.06 0.08 0 0 154 1 1 6 3 1 0 0 0.33 0.01 0.07 0.06 0.04 0 0 155 0 5 5 3 2 2 0 0 0.06 0.06 0.06 0.08 0.33 0 156 0 5 2 3 1 1 0 0 0.06 0.02 0.06 0.04 0.17 0 157 0 3 1 2 1 1 0 0 0.03 0.01 0.04 0.04 0.17 0 158 0 3 7 3 1 0 0 0 0.03 0.09 0.06 0.04 0 0 159 0 0 4 2 0 0 0 0 0 0.05 0.04 0 0 0 160 0 5 8 2 1 0 0 0 0.06 0.1 0.04 0.04 0 0 161 0 0 1 2 0 0 0 0 0 0.01 0.04 0 0 0 162 0 3 2 0 0 1 0 0 0.03 0.02 0 0 0.17 0 163 0 1 1 1 1 0 0 0 0.01 0.01 0.02 0.04 0 0 164 0 3 2 2 1 0 0 0 0.03 0.02 0.04 0.04 0 0 165 0 3 4 3 0 0 0 0 0.03 0.05 0.06 0 0 0 166 0 2 1 1 1 0 0 0 0.02 0.01 0.02 0.04 0 0 167 0 4 2 1 1 0 0 0 0.05 0.02 0.02 0.04 0 0 168 0 0 2 2 0 0 0 0 0 0.02 0.04 0 0 0 169 0 0 1 0 0 0 0 0 0 0.01 0 0 0 0 170 0 0 5 1 1 0 0 0 0 0.06 0.02 0.04 0 0 171 0 1 0 1 0 0 0 0 0.01 0 0.02 0 0 0 172 0 1 0 0 0 0 0 0 0.01 0 0 0 0 0 174 0 0 0 1 0 0 0 0 0 0 0.02 0 0 0 176 0 1 1 1 0 0 0 0 0.01 0.01 0.02 0 0 0 177 0 0 1 0 0 0 0 0 0 0.01 0 0 0 0 3 87 81 50 24 6 1
  • 14. Sustainability 2022, 14, 15663 14 of 18 Table A4. Weight criteria probability. Weight Count Probability −3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3 30 0 0 0 1 0 1 0 0 0 0 0.02 0 0.17 0 36 0 0 0 0 0 1 0 0 0 0 0 0 0.17 0 37 0 1 0 0 0 0 0 0 0.01 0 0 0 0 0 38 0 1 0 0 0 0 0 0 0.01 0 0 0 0 0 39 0 3 1 2 0 0 0 0 0.03 0.01 0.04 0 0 0 40 0 14 9 2 1 0 1 0 0.16 0.11 0.04 0.04 0 1 41 0 1 1 0 1 0 0 0 0.01 0.01 0 0.04 0 0 42 0 7 3 1 1 1 0 0 0.08 0.04 0.02 0.04 0.17 0 43 2 3 5 1 0 0 0 0.67 0.03 0.06 0.02 0 0 0 44 0 4 2 0 0 0 0 0 0.05 0.02 0 0 0 0 45 0 13 9 9 5 0 0 0 0.15 0.11 0.18 0.21 0 0 46 0 4 0 2 1 0 0 0 0.05 0 0.04 0.04 0 0 47 0 1 2 2 0 0 0 0 0.01 0.02 0.04 0 0 0 48 1 5 6 0 1 1 0 0.33 0.06 0.07 0 0.04 0.17 0 49 0 1 8 1 2 1 0 0 0.01 0.1 0.02 0.08 0.17 0 50 0 9 7 13 4 0 0 0 0.1 0.09 0.26 0.17 0 0 51 0 2 3 1 1 0 0 0 0.02 0.04 0.02 0.04 0 0 52 0 5 1 3 4 0 0 0 0.06 0.01 0.06 0.17 0 0 53 0 4 4 1 0 0 0 0 0.05 0.05 0.02 0 0 0 54 0 4 3 2 2 0 0 0 0.05 0.04 0.04 0.08 0 0 55 0 1 5 3 1 0 0 0 0.01 0.06 0.06 0.04 0 0 56 0 1 1 1 0 0 0 0 0.01 0.01 0.02 0 0 0 57 0 0 2 1 0 1 0 0 0 0.02 0.02 0 0.17 0 58 0 1 2 0 0 0 0 0 0.01 0.02 0 0 0 0 59 0 0 1 2 0 0 0 0 0 0.01 0.04 0 0 0 60 0 0 3 1 0 0 0 0 0 0.04 0.02 0 0 0 61 0 0 1 1 0 0 0 0 0 0.01 0.02 0 0 0 65 0 0 2 0 0 0 0 0 0 0.02 0 0 0 0 69 0 1 0 0 0 0 0 0 0.01 0 0 0 0 0 82 0 1 0 0 0 0 0 0 0.01 0 0 0 0 0 3 87 81 50 24 6 1
  • 15. Sustainability 2022, 14, 15663 15 of 18 Table A5. Temperature criteria probability. Temperature Count Probability −3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3 22 0 10 3 0 0 0 0 0 0.11 0.04 0 0 0 0 23 1 12 11 3 1 0 0 0.33 0.14 0.14 0.06 0.04 0 0 24 1 12 21 7 4 1 0 0.33 0.14 0.26 0.14 0.17 0.17 0 25 0 9 11 9 4 2 0 0 0.1 0.14 0.18 0.17 0.33 0 22.5 1 19 13 8 4 0 0 0.33 0.22 0.16 0.16 0.17 0 0 22.8 0 3 0 0 0 0 0 0 0.03 0 0 0 0 0 23.5 0 12 7 7 0 0 0 0 0.14 0.09 0.14 0 0 0 24.5 0 9 12 15 4 2 1 0 0.1 0.15 0.3 0.17 0.33 1 25.5 0 1 3 1 7 1 0 0 0.01 0.04 0.02 0.29 0.17 0 3 87 81 50 24 6 1 Table A6. Globe temperature criteria probability. Globe_Temperature Count Probability −3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3 23 1 20 9 1 4 0 0 0.33 0.23 0.11 0.02 0.17 0 0 24 0 25 10 1 2 0 0 0 0.29 0.12 0.02 0.08 0 0 25 1 11 9 10 1 0 0 0.33 0.13 0.11 0.2 0.04 0 0 26 0 8 28 10 9 3 1 0 0.09 0.35 0.2 0.38 0.5 1 23.5 1 12 15 11 0 0 0 0.33 0.14 0.19 0.22 0 0 0 24.5 0 4 4 11 4 0 0 0 0.05 0.05 0.22 0.17 0 0 25.5 0 6 6 5 1 2 0 0 0.07 0.07 0.1 0.04 0.33 0 26.5 0 1 0 1 3 1 0 0 0.01 0 0.02 0.13 0.17 0 3 87 81 50 24 6 1 Table A7. Relative humidity criteria probability. Relative_Humidity Count Probability −3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3 60 1 1 2 0 0 0 0 0.33 0.01 0.02 0 0 0 0 61 0 3 7 6 0 1 1 0 0.03 0.09 0.12 0 0.17 1 62 0 7 10 3 0 0 0 0 0.08 0.12 0.06 0 0 0 63 0 11 11 0 4 1 0 0 0.13 0.14 0 0.17 0.17 0 64 0 1 0 8 5 0 0 0 0.01 0 0.16 0.21 0 0 65 1 11 2 3 1 0 0 0.33 0.13 0.02 0.06 0.04 0 0 66 1 7 4 5 2 0 0 0.33 0.08 0.05 0.1 0.08 0 0 67 0 7 7 7 1 0 0 0 0.08 0.09 0.14 0.04 0 0
  • 16. Sustainability 2022, 14, 15663 16 of 18 Table A7. Cont. Relative_Humidity Count Probability −3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3 68 0 0 1 3 0 0 0 0 0 0.01 0.06 0 0 0 69 0 4 0 0 0 0 0 0 0.05 0 0 0 0 0 70 0 7 6 3 0 0 0 0 0.08 0.07 0.06 0 0 0 71 0 3 8 1 3 0 0 0 0.03 0.1 0.02 0.13 0 0 72 0 2 2 0 0 2 0 0 0.02 0.02 0 0 0.33 0 73 0 2 2 0 0 0 0 0 0.02 0.02 0 0 0 0 74 0 1 5 1 1 1 0 0 0.01 0.06 0.02 0.04 0.17 0 75 0 2 1 0 0 0 0 0 0.02 0.01 0 0 0 0 77 0 1 0 1 3 1 0 0 0.01 0 0.02 0.13 0.17 0 79 0 12 10 6 0 0 0 0 0.14 0.12 0.12 0 0 0 80 0 5 3 3 4 0 0 0 0.06 0.04 0.06 0.17 0 0 3 87 81 50 24 6 1 Table A8. Velocity criteria probability. Velocity Count Probability −3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3 0 3 87 76 49 21 6 1 1 1 0.94 0.98 0.88 1 1 0.3 0 0 3 0 0 0 0 0 0 0.04 0 0 0 0 1 0 0 2 1 3 0 0 0 0 0.02 0.02 0.13 0 0 3 87 81 50 24 6 1 Sustainability 2022, 14, x FOR PEER REVIEW 16 of 18 Figure A1. Prediction results using Weka software. Figure A2. Confusion matrix using Weka software. Figure A1. Prediction results using Weka software. Sustainability 2022, 14, x FOR PEER REVIEW 16 of 18 Figure A1. Prediction results using Weka software. Figure A2. Confusion matrix using Weka software. Figure A2. Confusion matrix using Weka software.
  • 17. Sustainability 2022, 14, 15663 17 of 18 Figure A2. Confusion matrix using Weka software. Figure A3. Naïve Bayes prediction accuracy using Weka software. References 1. Gong, P.; Cai, Y.; Zhou, Z.; Zhang, C.; Chen, B.; Sharples, S. Investigating spatial impact on indoor personal thermal comfort. J. Build. Eng. 2021, 45, 103536. https://doi.org/10.1016/j.jobe.2021.103536. 2. Dzyuban, Y.; Ching, G.N.; Yik, S.K.; Tan, A.J.; Banerjee, S.; Crank, P.J.; Chow, W.T. Outdoor thermal comfort research in tran- sient conditions: A narrative literature review. Landsc. Urban Plan. 2022, 226, 104496. https://doi.org/10.1016/j.landur- bplan.2022.104496. 3. Larriva, M.T.B.; Mendes, A.S.; Forcada, N. The effect of climatic conditions on occupants’ thermal comfort in naturally venti- lated nursing homes. Build. Environ. 2022, 214, 108930. https://doi.org/10.1016/j.buildenv.2022.108930. 4. Zang, X.; Liu, K.; Qian, Y.; Qu, G.; Yuan, Y.; Ren, L.; Liu, G. The influence of different functional areas on customers’ thermal comfort—A Field study in shopping complexes of North China. Energy Built Environ. 2022, in press. https://doi.org/10.1016/j.enbenv.2022.01.004. 5. Jia, X.; Wang, J.; Zhu, Y.; Ji, W.; Cao, B. Climate chamber study on thermal comfort of walking passengers with elevated ambient air velocity. Build. Environ. 2022, 218, 109100. https://doi.org/10.1016/j.buildenv.2022.109100. 6. Niza, I.L.; Broday, E.E. Thermal comfort conditions in Brazil: A discriminant analysis through the ASHRAE Global Thermal Comfort Database II. Build. Environ. 2022, 221, 109310. https://doi.org/10.1016/j.buildenv.2022.109310. 7. Feng, Y.; Liu, S.; Wang, J.; Yang, J.; Jao, Y.-L.; Wang, N. Data-driven personal thermal comfort prediction: A literature review. Renew. Sustain. Energy Rev. 2022, 161, 112357. https://doi.org/10.1016/j.rser.2022.112357. Figure A3. Naïve Bayes prediction accuracy using Weka software. References 1. Gong, P.; Cai, Y.; Zhou, Z.; Zhang, C.; Chen, B.; Sharples, S. Investigating spatial impact on indoor personal thermal comfort. J. Build. Eng. 2021, 45, 103536. [CrossRef] 2. Dzyuban, Y.; Ching, G.N.; Yik, S.K.; Tan, A.J.; Banerjee, S.; Crank, P.J.; Chow, W.T. Outdoor thermal comfort research in transient conditions: A narrative literature review. Landsc. Urban Plan. 2022, 226, 104496. [CrossRef] 3. Larriva, M.T.B.; Mendes, A.S.; Forcada, N. The effect of climatic conditions on occupants’ thermal comfort in naturally ventilated nursing homes. Build. Environ. 2022, 214, 108930. [CrossRef] 4. Zang, X.; Liu, K.; Qian, Y.; Qu, G.; Yuan, Y.; Ren, L.; Liu, G. The influence of different functional areas on customers’ thermal comfort—A Field study in shopping complexes of North China. Energy Built Environ. 2022, in press. [CrossRef] 5. Jia, X.; Wang, J.; Zhu, Y.; Ji, W.; Cao, B. Climate chamber study on thermal comfort of walking passengers with elevated ambient air velocity. Build. Environ. 2022, 218, 109100. [CrossRef] 6. Niza, I.L.; Broday, E.E. Thermal comfort conditions in Brazil: A discriminant analysis through the ASHRAE Global Thermal Comfort Database II. Build. Environ. 2022, 221, 109310. [CrossRef] 7. Feng, Y.; Liu, S.; Wang, J.; Yang, J.; Jao, Y.-L.; Wang, N. Data-driven personal thermal comfort prediction: A literature review. Renew. Sustain. Energy Rev. 2022, 161, 112357. [CrossRef] 8. Heidari, A.; Maréchal, F.; Khovalyg, D. Reinforcement Learning for proactive operation of residential energy systems by learning stochastic occupant behavior and fluctuating solar energy: Balancing comfort, hygiene and energy use. Appl. Energy 2022, 318, 119206. [CrossRef] 9. Jia, M.; Choi, J.-H.; Liu, H.; Susman, G. Development of facial-skin temperature driven thermal comfort and sensation modeling for a futuristic application. Build. Environ. 2022, 207, 108479. [CrossRef] 10. Liu, K.; Lian, Z.; Dai, X.; Lai, D. Comparing the effects of sun and wind on outdoor thermal comfort: A case study based on longitudinal subject tests in cold climate region. Sci. Total Environ. 2022, 825, 154009. [CrossRef] 11. Ji, Y.; Song, J.; Shen, P. A review of studies and modelling of solar radiation on human thermal comfort in outdoor environment. Build. Environ. 2022, 214, 108891. [CrossRef] 12. Geng, Y.; Hong, B.; Du, M.; Yuan, T.; Wang, Y. Combined effects of visual-acoustic-thermal comfort in campus open spaces: A pilot study in China’s cold region. Build. Environ. 2022, 209, 108658. [CrossRef] 13. Dharmasastha, K.; Samuel, D.L.; Nagendra, S.S.; Maiya, M. Thermal comfort of a radiant cooling system in glass fiber reinforced gypsum roof—An experimental study. Appl. Therm. Eng. 2022, 214, 118842. [CrossRef] 14. Ma, X.; Leung, T.; Chau, C.; Yung, E.H. Analyzing the influence of urban morphological features on pedestrian thermal comfort. Urban Clim. 2022, 44, 101192. [CrossRef] 15. Cheng, J.C.; Kwok, H.H.; Li, A.T.; Tong, J.C.; Lau, A.K. BIM-supported sensor placement optimization based on genetic algorithm for multi-zone thermal comfort and IAQ monitoring. Build. Environ. 2022, 216, 108997. [CrossRef] 16. Luo, Z.; Sun, C.; Dong, Q.; Qi, X. Key control variables affecting interior visual comfort for automated louver control in open-plan office—A study using machine learning. Build. Environ. 2022, 207, 108565. [CrossRef] 17. Zhang, R.; Liu, D.; Shi, L. Thermal-comfort optimization design method for semi-outdoor stadium using machine learning. Build. Environ. 2022, 215, 108890. [CrossRef] 18. Kafy, A.-A.; Saha, M.; Faisal, A.-A.; Rahaman, Z.A.; Rahman, M.T.; Liu, D.; Fattah, A.; Al Rakib, A.; AlDousari, A.E.; Rahaman, S.N.; et al. Predicting the impacts of land use/land cover changes on seasonal urban thermal characteristics using machine learning algorithms. Build. Environ. 2022, 217, 109066. [CrossRef] 19. Yang, S.; Wan, M.P. Machine-learning-based model predictive control with instantaneous linearization—A case study on an air-conditioning and mechanical ventilation system. Appl. Energy 2022, 306, 118041. [CrossRef] 20. Heidari, A.; Maréchal, F.; Khovalyg, D. An occupant-centric control framework for balancing comfort, energy use and hygiene in hot water systems: A model-free reinforcement learning approach. Appl. Energy 2022, 312, 118833. [CrossRef]
  • 18. Sustainability 2022, 14, 15663 18 of 18 21. Irshad, K.; Habib, K.; Kareem, M.; Basrawi, F.; Saha, B.B. Evaluation of thermal comfort in a test room equipped with a photovoltaic assisted thermo-electric air duct cooling system. Int. J. Hydrogen Energy 2017, 42, 26956–26972. [CrossRef] 22. Irshad, K.; Algarni, S.; Jamil, B.; Ahmad, M.T.; Khan, M.A. Effect of gender difference on sleeping comfort and building energy utilization: Field study on test chamber with thermoelectric air-cooling system. Build. Environ. 2019, 152, 214–227. [CrossRef] 23. Yang, Z.; Du, C.; Xiao, H.; Li, B.; Shi, W.; Wang, B. A novel integrated index for simultaneous evaluation of the thermal comfort and energy efficiency of air-conditioning systems. J. Build. Eng. 2022, 57, 104885. [CrossRef] 24. Zhang, W.; Wu, Y.; Calautit, J.K. A review on occupancy prediction through machine learning for enhancing energy efficiency, air quality and thermal comfort in the built environment. Renew. Sustain. Energy Rev. 2022, 167, 112704. [CrossRef] 25. Irshad, K.; Khan, A.I.; Irfan, S.A.; Alam, M.; Almalawi, A.; Zahir, H. Utilizing Artificial Neural Network for Prediction of Occupants Thermal Comfort: A Case Study of a Test Room Fitted With a Thermoelectric Air-Conditioning System. IEEE Access 2020, 8, 99709–99728. [CrossRef] 26. Elnour, M.; Himeur, Y.; Fadli, F.; Mohammedsherif, H.; Meskin, N.; Ahmad, A.M.; Petri, I.; Rezgui, Y.; Hodorog, A. Neural network-based model predictive control system for optimizing building automation and management systems of sports facilities. Appl. Energy 2022, 318, 119153. [CrossRef] 27. Esrafilian-Najafabadi, M.; Haghighat, F. Impact of predictor variables on the performance of future occupancy prediction: Feature selection using genetic algorithms and machine learning. Build. Environ. 2022, 219, 109152. [CrossRef] 28. Rana, R.; Kusy, B.; Jurdak, R.; Wall, J.; Hu, W. Feasibility analysis of using humidex as an indoor thermal comfort predictor. Energy Build. 2013, 64, 17–25. [CrossRef] 29. Benito, P.I.; Sebastián, M.A.; González-Gaya, C. Study and Application of Industrial Thermal Comfort Parameters by Using Bayesian Inference Techniques. Appl. Sci. 2021, 11, 11979. [CrossRef] 30. Aguilera, J.J.; Toftum, J.; Kazanci, O.B. Predicting personal thermal preferences based on data-driven methods. E3S Web Conf. 2019, 111, 05015. [CrossRef] 31. Yang, B.; Li, X.; Liu, Y.; Chen, L.; Guo, R.; Wang, F.; Yan, K. Comparison of models for predicting winter individual thermal comfort based on machine learning algorithms. Build. Environ. 2022, 215, 108970. [CrossRef] 32. Asif, A.; Zeeshan, M.; Khan, S.R.; Sohail, N.F. Investigating the gender differences in indoor thermal comfort perception for summer and winter seasons and comparison of comfort temperature prediction methods. J. Therm. Biol. 2022, 110, 103357. [CrossRef] 33. Yao, F.; Fang, H.; Han, J.; Zhang, Y. Study on the outdoor thermal comfort evaluation of the elderly in the Tibetan plateau. Sustain. Cities Soc. 2021, 77, 103582. [CrossRef] 34. Wei, D.; Yang, L.; Bao, Z.; Lu, Y.; Yang, H. Variations in outdoor thermal comfort in an urban park in the hot-summer and cold-winter region of China. Sustain. Cities Soc. 2022, 77, 103535. [CrossRef] 35. Qin, H.; Wang, X. A multi-discipline predictive intelligent control method for maintaining the thermal comfort on indoor environment. Appl. Soft Comput. 2022, 116, 108299. [CrossRef] 36. Zhu, R.; Zhang, X.; Yang, L.; Liu, Y.; Cong, Y.; Gao, W. Correlation analysis of thermal comfort and physiological responses under different microclimates of urban park. Case Stud. Therm. Eng. 2022, 34, 102044. [CrossRef] 37. Song, G.; Ai, Z.; Zhang, G.; Peng, Y.; Wang, W.; Yan, Y. Using machine learning algorithms to multidimensional analysis of subjective thermal comfort in a library. Build. Environ. 2022, 212, 108790. [CrossRef] 38. Gao, N.; Shao, W.; Rahaman, M.S.; Zhai, J.; David, K.; Salim, F.D. Transfer learning for thermal comfort prediction in multiple cities. Build. Environ. 2021, 195, 107725. [CrossRef] 39. Lin, C.-M.; Lin, S.-F.; Liu, H.-Y.; Tseng, K.-Y. Applying the naïve Bayes classifier to HVAC energy prediction using hourly data. Microsyst. Technol. 2022, 28, 121–135. [CrossRef] 40. Pan, W.; Ming, H.; Yang, Z.; Wang, T. Comments on "Using k-core Decomposition on Class Dependency Networks to Improve Bug Prediction Model’s Practical Performance". IEEE Trans. Softw. Eng. 2022. Early Access. [CrossRef]