SlideShare a Scribd company logo
International
OPEN ACCESS Journal
Of Modern Engineering Research (IJMER)
| IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 57 |
Data Analytics on Solar Energy Using Hadoop
K.Akila1
, B.Manasavani2
, Dr.Gopikrishnan M3
1
Department Of ComputerEngineering Prathyusha Engineering College Thiruvallur, B.E.,India.
2
Department Of ComputerEngineeringPrathyusha EngineeringCollege
INTRODUCTION
Missing data is the missing form of informationabout phenomena, which is important, and it is the
informationin which we are interested. The existence of missing data isone significant problem in data quality.
Data quality playsmajor role in machine learning, data mining andknowledge discovery from databases.
Machine learningalgorithmshandle missing data in a quite naive way. To avoid biasingin induced hypothesis
missing data treatment shouldbe carefullyhandled.Imputationisaprocessthatreplacesthe missing values in
instance by some reasonable values.The case substitution is the method developed for dealingwith
missingdataininstancesanditishavingsomedrawbackswhen applied to the data mining processes.
Themethods,such as substitution of missing values by the attributemean or mode should be cautiously handed to
avoid inclusionof bias.
1.1 Randomness of MissingData
Missing data randomness is classified [1] in threeclasses.
Missing completely at random (MCAR):Missing values are scattered randomly across all instances.
Inthistype of randomness, any missing data handling methodcan be applied without risk of introducing bias on
the data.It occurs when the probability of an instance having amissing
valueforanattributedoesnotdependoneithertheknown values or the missingdata.differences on attributes establish
that the two groups donot differsignificantly.
Missing at random (MAR):Missing atrandom (MAR) is a condition, which occurs when missing
valuesare not randomly distributed across all observations butare randomly distributed within one or more
classes (ex.missing more among whites than non-whites, but randomwithin each). The probability of an instance
with a missing valuefor an attribute may depend on the known values, and not onthe value of the missing
dataitself.
Not missing at random (NMAR):Not missingat random is the most challenging form, occurs
whenmissing values are not randomly distributed across observations. Itisalso called as non-ignorable
missingness. The probabilityof an instance with a missing value for an attributemight depend on the value of
thatattribute.
1.2 Handling MissingData
Missing data handling methods are categoriesed asfollowsIgnoring data:This method throw-outs all
instanceswith missingdata.Therearetwocoremethodstodiscarddata with missing values. The first one is known as
completecase analysis.Itisavailableineveryoneofstatisticalpackagesand is the default method in manyprograms.
The next method is recognized as discarding instancesor attributes. This method determines the level of
missingdata oneachinstanceandattribute,anddeletestheinstancesor attributes with high extents of missing data.
ABSRACT: Missing data is one of the major issues in data miningand pattern recognition. The
knowledge contains in attributeswith missing data values are important in improvingregression
correlationprocessofanorganization.Thelearningprocesson each instance is necessary as it may
containsome exceptional knowledge. There are various methods tohandle missing data in regression
correlation. Analysis of photovoltaic cell, Sunlight striking on different geographical location to know
the defective or connectionless photovoltaic cell plate. we mainly aims To showcasing the energy
produced at different geographical location and to find the defective plate.
And also analysis the data sets of energy produced along with current weather in particular area to
know the status of the photovoltaic plates. In this project, we used Hadoop Map-Reduce framework to
analyze the solar energy datasets.
Keywords: missing data, genetic algorithm, regression correlation.
Data Analytics On Solar Energy Using Hadoop
| IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 58 |
Prior todeletingany attribute, it is vital to evaluate its connotation tothe investigation. The methods, complete
case analysisand discarding is executed only if missing data aremissing completely at random. The missing data
that are notmissing completely at random contain non-random elements thatmay prejudice theresults.
Imputation:In imputation-based proceduresmissing
valuesareimputedwithreasonable,probablevaluesrather than being deleted totally. The objective is to useknown
associations that can be recognized in the valid rangevaluesofthedatasettofacilitateinestimatingthe missing
values
Data quality playsmajor role in machine learning, data mining andknowledge discovery from
databases. Machine learningalgorithmshandle missing data in a quite naive way. To avoid biasingin induced
hypothesis missing data treatment shouldbe carefullyhandled.Imputationisaprocessthatreplacesthe missing
values in instance by some reasonable values.In first method limitation of ∆T was adjusted, above equation was
used for ∆T<8o
C. Table 1 shows criteria for the decision of τ value. τ value of 0.6-0.7 are commonly used for
clear sky atmospheric transmittance coefficient value. In this study τ value of 0.69 was used for clear sky,
assumed that the clear sky condition occurred when RH<40% and ambient temperature more than 8o
C.
Calculation algorithm was built based on decision matrix and the τ value was locally determined using the
training of data set to get minimum error.
The second method used in this study is by finding the correlation between RH, clearness index and
beam transmittance. The data used to find correlation between beam transmittance and clearness index is
measured data from new radiometer set that was installed in 2010 on the rooftop of Block P,
UniversitiTeknologiPetronas, which located about 30 km from Ipoh city. About 1 month, 5 minutes time step
data of global, beam and diffuse radiation from June to July 2010 was used. Before find the correlation of beam
transmittance and clearness index, RH-clearness index correlation was obtained from Ipoh city available data as
can be seen in Figure 3.
Then beam transmittance-clearness index correlation can be obtained by scatter plot as can be seen in
Figure 4. To plot Figure 4 some data were rejected due to obvious error that can be analyzed from measurement
results, and the basic concept of terrestrial solar radiation characteristicsTherearetwocoremethodstodiscarddata
with missing values. The first one is known as completecase
analysis.Itisavailableineveryoneofstatisticalpackagesand is the default method in manyprograms
MultipleImputations:ItisamethodbyRubin[1]for making multiple simulated values for eachincomplete
information, and then iteratively examining datasets witheach simulated value substituted in every turn. The
intentionis,possibly, to generate estimates that better indicatetrue variance and uncertainty in the data than
doregression methods. This permits expert staff and software to be usedto create imputed datasets that can be
analyzed byrelatively naive users equipped with standard software. It can bevery effective particularly for small
to moderate levelsof missingness, where the missing data mechanism isorganized, and for datasets that are to be
placed in the publicdomain
Not missingat random is the most challenging form, occurs whenmissing values are not randomly
distributed across observations. Itisalso called as non-ignorable missingness. The probabilityof an instance with
a missing value for an attributemight depend on the value of thatattribute
Following constraint were used as data rejection criteria:
• Reject night data
• Reject data if clearness index >1
• Reject data if beam transmittance >1
• Reject data if beam transmittance > clearness index
• Reject data if clearness index >0.6 and beam transmittance <0.1
• Reject data if clearness index <0.2 and beam transmittance>0.15
Correlation between RH and beam transmittance was obtained from above correlation and plotted in
Figure 5. Balaras et al studied the relationship between beam transmittance and clearness index in Athens,
Greece [10], the results of the study was adopted to carry out second method in this study. Regression results
were presented as follows:
Therearevariousmethodstoestimatesolarradiation.
Satisfactoryresultforhourlysolarradiationestimationwas obtainedbyusingatmospherictransmittancemodel[1]while
otherauthorshaveuseddiffusefraction[2]andclearness indexmodels[3].Parametricoratmospherictransmittance
modelrequiresdetailsatmosphericcharacteristicinformation [4].Thismodelgiveshigh-
accuracyforclearsky/cloudless conditions,whichisleadingsomeauthor tousethis modelto
evaluatetheperformanceofanempiricalmodelunder cloudlessconditions[5].Therearenumerousauthors
Data Analytics On Solar Energy Using Hadoop
| IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 59 |
proposedthiskindofmodelasmentionedin[6].However, pureparametricmodelwasnotused in thisstudy,since there
is no detail atmospheric condition data for the site.
Meteorologicalparametersfrequentlyusedaspredictorsof
atmosphericparameterssinceacquiringdetailatmospheric conditionsrequireadvancemeasurement.Meteorological
parameterssuchassunshineduration,cloudcover,ambient temperature,relativehumidity,andprecipitationdatahave
beenusedtoestimateatmospherictransmittancecoefficient inparametricmodel.Thiskindofmodeliscalled
meteorological model
Therearevariouswaystoestimatesolarradiationon certainareaontheearth.Ambienttemperaturebased
estimation is widely used sinceambienttemperaturedataare measuredinmanyweatherstations.Inthisstudy,missing
datawereestimatedbasedonambienttemperature measurementandusedmeasuredRHdataasatmospheric
transmittance of the determination criteria.thatestimate hourlysolarradiationbasedondevelopedCampbelland
Norman method [2], was adapted in this study.
Results of the new model then compared with the existing temperature-based solar radiation prediction
model as follow:
A. H-S model
Hargreaves and Samani [14] conducted an initial study on using Tmax and Tmin to estimate solar radiation by the
following equation:
G T h = K r ( T m a x - T m i n ) 0 . 5
G 0 h ( 9 )
Kris an empirical coefficient, which was recommended tobe 0.16 for interior regions and 0.19 for coastal
regions. In this study Kr was locally determined using training data set.
B. H-S-A model
Annandale et al [15] modified H-S model by introducing correction factor as follow:
GT h =Kr (1+2.7 10- 5
Z) (Tm a x -Tm i n )0 . 5
G0 h (10 )
Z is elevation in m and Krwas locally determined.
V. STATISTICAL ANALYSIS FOR MODEL VALIDATION
Estimation results validated using statistical parameters. Pearson correlation coefficient was calculated
as routine correlation indicator. Residual error was calculated using RMSE (Root Mean Square Error) and also
presented in NRMSE (Normalized Root Mean Square Error) as follows:
R M S E = [ Ʃ { Y c - Y 0 } 2
/ n ] 0 . 5
( 1 1 )
N R M S E = R M S E / y m a x - y m i n ( 1 2 )
where, Yc is predicted variable Y0 is measured variable, n is number of data, ymax is maximum measured data ymin
is minimum measured data.
As an addition, index of agreement was calculated using equation below:
d = 1 – [Ʃ(xi-yi)2
/Ʃ(ǀxi-x̅ iǀ+ǀyi-y̅ iǀ)]0.5
( 1 3 )
where, xi is predicted variable yi is measured variable, x̅ i is averaged predicted variable and y̅ i is averaged
measured variable.
VI. RESULTS AND DISCUSSIONS
Calculation have been carried out using methods 1 and 2, statistical calculation analysis also has been
performed. Table 2 shows statistical analysis results of both methods and results of existing method (H-S and H-
S-A method).The most satisfactory results were obtained using method 1. Figure 3 shows scatter plot of
predicted and measured data for first method. The minimum RMSE value of 87.6 Watt/m2
was obtained with
0.95 correlation coefficient and 0.97 index of agreement value. Previous method which use precipitation data
obtained averaged index of agreement of 0.95, thus the model presented in this study also performed well.
Table II. Statistical analysis results
Themethodsarebasedonthedecompositionmodelthatiscalculatingeachofsolarradiationcomponents,which
dependonatmospherictransmittance.Twomethodstopredictatmospherictransmittancevalueusingavailablemeteorol
ogicaldatawereproposed.Inthefirstmethod,adecisionmatrixwasused,whileinthesecondmethod,regressioncorrelati
Data Analytics On Solar Energy Using Hadoop
| IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 60 |
onofmeteorologicalparameterswasused.Thecalculationsresultswereevaluatedusingstatisticalparameter.Thoughth
eresultshowsbothofthemethodsperformwell,moresatisfactoryresultswereobtainedfromfirstmethodwithRootMean
SquareErrorof87.6Watt/m2,NormalizedRootMeanSquareErrorof8.29%,correlationcoefficientof0.95andindexofa
greementof0.97.Furthermore,thefirstmethodonlyneedsambienttemperatureandrelativehumiditydatathatcommonl
ymeasuredinmeteorologicalstations.graphical comparison of measured and predicted solar radiation for random
dates. As can be seen most of the error occurred in low radiation or cloudy sky which often occurred in rainy
season (Fig. 7-b). Then after the results has been validated using available data, the predicted results in the
missing data days can be occupied in the measured data set. Now the time series composite data of solar
radiation is completed and can be used for other purpose such as prediction of PV system performance for
respected location.
Missing data is one of the major issues in data miningand pattern recognition. The knowledge contains
in attributeswith missing data values are important in improvingdecision-
makingprocessofanorganization.Thelearningprocesson each instance is necessary as it may containsome
exceptional knowledge. There are various methods tohandle missing data in decision tree learning. Analysis of
photovoltaic cell, Sunlight striking on different geographical location to know the defective or connectionless
photovoltaic cell plate. we mainly aims To showcasing the energy produced at different geographical location
and to find the defective plate.
And also analysis the data sets of energy produced along with current weather in particular area to
know the status of the photovoltaic plates. In this project, we used Hadoop Map-Reduce framework to analyze
the solar energy datasets.
Not missingat random is the most challenging form, occurs whenmissing values are not randomly
distributed across observations. Itisalso called as non-ignorable missingness. The probabilityof an instance with
a missing value for an attributemight depend on the value of thatattribute
(b) Rainy season
Figure 7: Graphical comparison between measured and predicted solarradiation for random dates (Method 1)
Figure 8 shows prediction results in the days which solar radiation measurement were absent. In our
case the amount of missing solar radiation data is 21 full days and 2 half days.
Data quality playsmajor role in machine learning, data mining andknowledge discovery from
databases. Machine learningalgorithmshandle missing data in a quite naive way. To avoid biasingin induced
hypothesis missing data treatment shouldbe carefullyhandled.Imputationisaprocessthatreplacesthe missing
values in instance by some reasonable values.The case substitution is the method developed for dealingwith
missingdataininstancesanditishavingsomedrawbackswhen applied to the data mining processesThe method
developed in this study may be applied for any location on the earth with notes, for first method the assignment
criteria of atmospheric transmittance using RH and ambient temperature should be adjusted to the available
solar radiation data of the area to get minimum error. To generate general criteria of atmospheric transmittance
assignment using RH and ambient temperature further research is required with sufficient large amount of data
for various areas. For the second method the correlation should be rebuild based on available measurement
nearest from the location to give satisfactory estimation results.
Data Analytics On Solar Energy Using Hadoop
| IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 61 |
VII. CONCLUSIONS
This paper represents to estimate solar radiation from available weather data. The methods presented in
this paper are based on atmospheric transmittance determination using available meteorological data. In this
method regression correlation of meteorological parameter was
used.Thekeyfortheaccuracyofabovemethodisinthe determinationofbeamatmospherictransmittance(τ).Beam
atmospherictransmittanceisthepercentageofthebeam (direct)radiationthatwillpenetratetheatmospherewithout
beingscattered.KurtandSpokasusedprecipitationdata to built decisionmatrix ofatmospheric transmittance
.temperature data. The second method is by using RH-clearness index, clearness index-beam atmospheric
transmission and beam atmospheric transmission-RH correlation. The result shows that both methods perform
well. Method 1 provided better results with minimum correlation coefficient of 0.95, RMSE of 87.6 Watt/m2
,
NRSME of 8.29% and index of agreement of 0.97. The prediction was intended to fill missing data in solar
radiation data set to get complete time series data. However, in this study only one year of one area data have
been used. Validation using sufficient large amount of data is required for wider application of the method.
REFERENCES
[1]. G. S. Campbell and J. M. Norman, “Introduction to Environmental Biophysics. 2nd ed. New York:
Springer-Verlag. Pp. 167–183, 1998
[2]. Guofeng Wu, et al. “Methods and strategy for modeling daily global solar radiation with measured
meteorological data – A case study in Nanchang station, China”, Energy Conversion and Management
48, 2447-2452, 2007
[3]. Yang K, Koike T. Estimating surface solar radiation from upper-airhumidity. Solar Energy, 72(2):177–
86, 2002
[4]. N. Mohan Kumar et al, “An empirical model for estimating hourly solar radiation over the Indian seas
during summer monsoon season”, Indian Journal of Marine Sciences, Vol. 30, pp 123-131, 2001
[5]. Reindl DT, Beckman WA, Duffie JA, “Diffuse fraction correlations”, Solar Energy 1990; 45:1-7
[6]. F.J. Batlles et al, “Empirical modeling of hourly direct irradiance by means of hourly global
irradiance”, Energy 25: 675-688, 2000
[7]. Kurt Spokas and Frank Forcella, “Estimating hourly incoming solar radiation from limited
meteorological data”, Weed Science, 54:182– 189, 2006
[8]. Hunt LA, Kuchar L, Swanton CJ. Estimation of solar radiation for use in crop modelling. Agric. Forest
Meteorol. 91(3–4):293–300, 1998
[9]. Louche et al, “Correlations for direct normal and global horizontal irradiation on French Mediterranean
site”, Solar Energy 46: 261-266, 1991 as cited by [8]
[10]. Athens, Greece. et al., “On the relationship of beam transmittance on clearness index for Athens,
Greece”, Int. J. Solar Energy, Vol. 7, pp 171-179, 1989

More Related Content

What's hot

TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...
TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...
TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...
IJDKP
 
A new clutering approach for anomaly intrusion detection
A new clutering approach for anomaly intrusion detectionA new clutering approach for anomaly intrusion detection
A new clutering approach for anomaly intrusion detection
IJDKP
 
A PROCESS OF LINK MINING
A PROCESS OF LINK MININGA PROCESS OF LINK MINING
A PROCESS OF LINK MINING
csandit
 
IRJET- Prediction of Heart Disease using RNN Algorithm
IRJET- Prediction of Heart Disease using RNN AlgorithmIRJET- Prediction of Heart Disease using RNN Algorithm
IRJET- Prediction of Heart Disease using RNN Algorithm
IRJET Journal
 
APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...
APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...
APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...
Zac Darcy
 
The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...
The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...
The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...
IJECEIAES
 
Sampling methods for counting temporal motifs
Sampling methods for counting temporal motifsSampling methods for counting temporal motifs
Sampling methods for counting temporal motifs
Austin Benson
 
Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...
Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...
Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...
IJECEIAES
 
A Magnified Application of Deficient Data Using Bolzano Classifier
A Magnified Application of Deficient Data Using Bolzano ClassifierA Magnified Application of Deficient Data Using Bolzano Classifier
A Magnified Application of Deficient Data Using Bolzano Classifier
journal ijrtem
 
An Extensive Review on Generative Adversarial Networks GAN’s
An Extensive Review on Generative Adversarial Networks GAN’sAn Extensive Review on Generative Adversarial Networks GAN’s
An Extensive Review on Generative Adversarial Networks GAN’s
ijtsrd
 
Evaluation of network intrusion detection using markov chain
Evaluation of network intrusion detection using markov chainEvaluation of network intrusion detection using markov chain
Evaluation of network intrusion detection using markov chain
IJCI JOURNAL
 
Neural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learningNeural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learning
Francisco E. Figueroa-Nigaglioni
 
Drug Discovery and Development Using AI
Drug Discovery and Development Using AIDrug Discovery and Development Using AI
Drug Discovery and Development Using AI
Databricks
 
Accounting for variance in machine learning benchmarks
Accounting for variance in machine learning benchmarksAccounting for variance in machine learning benchmarks
Accounting for variance in machine learning benchmarks
Devansh16
 
Two Layer k-means based Consensus Clustering for Rural Health Information System
Two Layer k-means based Consensus Clustering for Rural Health Information SystemTwo Layer k-means based Consensus Clustering for Rural Health Information System
Two Layer k-means based Consensus Clustering for Rural Health Information System
IRJET Journal
 
IRJET - Random Data Perturbation Techniques in Privacy Preserving Data Mi...
IRJET -  	  Random Data Perturbation Techniques in Privacy Preserving Data Mi...IRJET -  	  Random Data Perturbation Techniques in Privacy Preserving Data Mi...
IRJET - Random Data Perturbation Techniques in Privacy Preserving Data Mi...
IRJET Journal
 
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Devansh16
 
ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...
ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...
ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...
IJNSA Journal
 

What's hot (18)

TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...
TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...
TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...
 
A new clutering approach for anomaly intrusion detection
A new clutering approach for anomaly intrusion detectionA new clutering approach for anomaly intrusion detection
A new clutering approach for anomaly intrusion detection
 
A PROCESS OF LINK MINING
A PROCESS OF LINK MININGA PROCESS OF LINK MINING
A PROCESS OF LINK MINING
 
IRJET- Prediction of Heart Disease using RNN Algorithm
IRJET- Prediction of Heart Disease using RNN AlgorithmIRJET- Prediction of Heart Disease using RNN Algorithm
IRJET- Prediction of Heart Disease using RNN Algorithm
 
APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...
APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...
APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...
 
The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...
The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...
The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...
 
Sampling methods for counting temporal motifs
Sampling methods for counting temporal motifsSampling methods for counting temporal motifs
Sampling methods for counting temporal motifs
 
Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...
Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...
Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...
 
A Magnified Application of Deficient Data Using Bolzano Classifier
A Magnified Application of Deficient Data Using Bolzano ClassifierA Magnified Application of Deficient Data Using Bolzano Classifier
A Magnified Application of Deficient Data Using Bolzano Classifier
 
An Extensive Review on Generative Adversarial Networks GAN’s
An Extensive Review on Generative Adversarial Networks GAN’sAn Extensive Review on Generative Adversarial Networks GAN’s
An Extensive Review on Generative Adversarial Networks GAN’s
 
Evaluation of network intrusion detection using markov chain
Evaluation of network intrusion detection using markov chainEvaluation of network intrusion detection using markov chain
Evaluation of network intrusion detection using markov chain
 
Neural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learningNeural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learning
 
Drug Discovery and Development Using AI
Drug Discovery and Development Using AIDrug Discovery and Development Using AI
Drug Discovery and Development Using AI
 
Accounting for variance in machine learning benchmarks
Accounting for variance in machine learning benchmarksAccounting for variance in machine learning benchmarks
Accounting for variance in machine learning benchmarks
 
Two Layer k-means based Consensus Clustering for Rural Health Information System
Two Layer k-means based Consensus Clustering for Rural Health Information SystemTwo Layer k-means based Consensus Clustering for Rural Health Information System
Two Layer k-means based Consensus Clustering for Rural Health Information System
 
IRJET - Random Data Perturbation Techniques in Privacy Preserving Data Mi...
IRJET -  	  Random Data Perturbation Techniques in Privacy Preserving Data Mi...IRJET -  	  Random Data Perturbation Techniques in Privacy Preserving Data Mi...
IRJET - Random Data Perturbation Techniques in Privacy Preserving Data Mi...
 
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
 
ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...
ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...
ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...
 

Similar to Data Analytics on Solar Energy Using Hadoop

COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
ijiert bestjournal
 
IRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence ChainIRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence Chain
IRJET Journal
 
Unsupervised Distance Based Detection of Outliers by using Anti-hubs
Unsupervised Distance Based Detection of Outliers by using Anti-hubsUnsupervised Distance Based Detection of Outliers by using Anti-hubs
Unsupervised Distance Based Detection of Outliers by using Anti-hubs
IRJET Journal
 
G44093135
G44093135G44093135
G44093135
IJERA Editor
 
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IJDKP
 
LINK MINING PROCESS
LINK MINING PROCESSLINK MINING PROCESS
LINK MINING PROCESS
IJDKP
 
LINK MINING PROCESS
LINK MINING PROCESSLINK MINING PROCESS
LINK MINING PROCESS
IJDKP
 
A Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected OutliersA Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected Outliers
Zac Darcy
 
A MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERS
A MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERSA MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERS
A MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERS
Zac Darcy
 
A Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected OutliersA Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected Outliers
Zac Darcy
 
Outlier Detection Using Unsupervised Learning on High Dimensional Data
Outlier Detection Using Unsupervised Learning on High Dimensional DataOutlier Detection Using Unsupervised Learning on High Dimensional Data
Outlier Detection Using Unsupervised Learning on High Dimensional Data
IJERA Editor
 
Big Data with Rough Set Using Map- Reduce
Big Data with Rough Set Using Map- ReduceBig Data with Rough Set Using Map- Reduce
Big Data with Rough Set Using Map- Reduce
ijircee
 
Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...
nalini manogaran
 
Survey paper on Big Data Imputation and Privacy Algorithms
Survey paper on Big Data Imputation and Privacy AlgorithmsSurvey paper on Big Data Imputation and Privacy Algorithms
Survey paper on Big Data Imputation and Privacy Algorithms
IRJET Journal
 
IRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET- Probability based Missing Value Imputation Method and its AnalysisIRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET Journal
 
Analysis on different Data mining Techniques and algorithms used in IOT
Analysis on different Data mining Techniques and algorithms used in IOTAnalysis on different Data mining Techniques and algorithms used in IOT
Analysis on different Data mining Techniques and algorithms used in IOT
IJERA Editor
 
B0930610
B0930610B0930610
B0930610
IOSR Journals
 
Detection of Outliers in Large Dataset using Distributed Approach
Detection of Outliers in Large Dataset using Distributed ApproachDetection of Outliers in Large Dataset using Distributed Approach
Detection of Outliers in Large Dataset using Distributed Approach
Editor IJMTER
 
Choosing allowability boundaries for describing objects in subject areas
Choosing allowability boundaries for describing objects in subject areasChoosing allowability boundaries for describing objects in subject areas
Choosing allowability boundaries for describing objects in subject areas
IAESIJAI
 
IRJET- Credit Card Fraud Detection using Isolation Forest
IRJET- Credit Card Fraud Detection using Isolation ForestIRJET- Credit Card Fraud Detection using Isolation Forest
IRJET- Credit Card Fraud Detection using Isolation Forest
IRJET Journal
 

Similar to Data Analytics on Solar Energy Using Hadoop (20)

COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
 
IRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence ChainIRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence Chain
 
Unsupervised Distance Based Detection of Outliers by using Anti-hubs
Unsupervised Distance Based Detection of Outliers by using Anti-hubsUnsupervised Distance Based Detection of Outliers by using Anti-hubs
Unsupervised Distance Based Detection of Outliers by using Anti-hubs
 
G44093135
G44093135G44093135
G44093135
 
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
 
LINK MINING PROCESS
LINK MINING PROCESSLINK MINING PROCESS
LINK MINING PROCESS
 
LINK MINING PROCESS
LINK MINING PROCESSLINK MINING PROCESS
LINK MINING PROCESS
 
A Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected OutliersA Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected Outliers
 
A MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERS
A MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERSA MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERS
A MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERS
 
A Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected OutliersA Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected Outliers
 
Outlier Detection Using Unsupervised Learning on High Dimensional Data
Outlier Detection Using Unsupervised Learning on High Dimensional DataOutlier Detection Using Unsupervised Learning on High Dimensional Data
Outlier Detection Using Unsupervised Learning on High Dimensional Data
 
Big Data with Rough Set Using Map- Reduce
Big Data with Rough Set Using Map- ReduceBig Data with Rough Set Using Map- Reduce
Big Data with Rough Set Using Map- Reduce
 
Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...
 
Survey paper on Big Data Imputation and Privacy Algorithms
Survey paper on Big Data Imputation and Privacy AlgorithmsSurvey paper on Big Data Imputation and Privacy Algorithms
Survey paper on Big Data Imputation and Privacy Algorithms
 
IRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET- Probability based Missing Value Imputation Method and its AnalysisIRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET- Probability based Missing Value Imputation Method and its Analysis
 
Analysis on different Data mining Techniques and algorithms used in IOT
Analysis on different Data mining Techniques and algorithms used in IOTAnalysis on different Data mining Techniques and algorithms used in IOT
Analysis on different Data mining Techniques and algorithms used in IOT
 
B0930610
B0930610B0930610
B0930610
 
Detection of Outliers in Large Dataset using Distributed Approach
Detection of Outliers in Large Dataset using Distributed ApproachDetection of Outliers in Large Dataset using Distributed Approach
Detection of Outliers in Large Dataset using Distributed Approach
 
Choosing allowability boundaries for describing objects in subject areas
Choosing allowability boundaries for describing objects in subject areasChoosing allowability boundaries for describing objects in subject areas
Choosing allowability boundaries for describing objects in subject areas
 
IRJET- Credit Card Fraud Detection using Isolation Forest
IRJET- Credit Card Fraud Detection using Isolation ForestIRJET- Credit Card Fraud Detection using Isolation Forest
IRJET- Credit Card Fraud Detection using Isolation Forest
 

More from IJMERJOURNAL

Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...
Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...
Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...
IJMERJOURNAL
 
Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...
Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...
Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...
IJMERJOURNAL
 
A New Two-Dimensional Analytical Model of Small Geometry GaAs MESFET
A New Two-Dimensional Analytical Model of Small Geometry GaAs MESFETA New Two-Dimensional Analytical Model of Small Geometry GaAs MESFET
A New Two-Dimensional Analytical Model of Small Geometry GaAs MESFET
IJMERJOURNAL
 
Design a WSN Control System for Filter Backwashing Process
Design a WSN Control System for Filter Backwashing ProcessDesign a WSN Control System for Filter Backwashing Process
Design a WSN Control System for Filter Backwashing Process
IJMERJOURNAL
 
Application of Customer Relationship Management (Crm) Dimensions: A Critical ...
Application of Customer Relationship Management (Crm) Dimensions: A Critical ...Application of Customer Relationship Management (Crm) Dimensions: A Critical ...
Application of Customer Relationship Management (Crm) Dimensions: A Critical ...
IJMERJOURNAL
 
Comparisons of Shallow Foundations in Different Soil Condition
Comparisons of Shallow Foundations in Different Soil ConditionComparisons of Shallow Foundations in Different Soil Condition
Comparisons of Shallow Foundations in Different Soil Condition
IJMERJOURNAL
 
Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...
Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...
Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...
IJMERJOURNAL
 
Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...
Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...
Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...
IJMERJOURNAL
 
Investigating The Performance of A Steam Power Plant
Investigating The Performance of A Steam Power PlantInvestigating The Performance of A Steam Power Plant
Investigating The Performance of A Steam Power Plant
IJMERJOURNAL
 
Study of Time Reduction in Manufacturing of Screws Used in Twin Screw Pump
Study of Time Reduction in Manufacturing of Screws Used in Twin Screw PumpStudy of Time Reduction in Manufacturing of Screws Used in Twin Screw Pump
Study of Time Reduction in Manufacturing of Screws Used in Twin Screw Pump
IJMERJOURNAL
 
Mitigation of Voltage Imbalance in A Two Feeder Distribution System Using Iupqc
Mitigation of Voltage Imbalance in A Two Feeder Distribution System Using IupqcMitigation of Voltage Imbalance in A Two Feeder Distribution System Using Iupqc
Mitigation of Voltage Imbalance in A Two Feeder Distribution System Using Iupqc
IJMERJOURNAL
 
Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...
Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...
Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...
IJMERJOURNAL
 
Analytical Solutions of simultaneous Linear Differential Equations in Chemica...
Analytical Solutions of simultaneous Linear Differential Equations in Chemica...Analytical Solutions of simultaneous Linear Differential Equations in Chemica...
Analytical Solutions of simultaneous Linear Differential Equations in Chemica...
IJMERJOURNAL
 
Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...
Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...
Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...
IJMERJOURNAL
 
Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...
Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...
Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...
IJMERJOURNAL
 
An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...
An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...
An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...
IJMERJOURNAL
 
Nigerian Economy and the Impact of Alternative Energy.
Nigerian Economy and the Impact of Alternative Energy.Nigerian Economy and the Impact of Alternative Energy.
Nigerian Economy and the Impact of Alternative Energy.
IJMERJOURNAL
 
CASE STUDY
CASE STUDYCASE STUDY
CASE STUDY
IJMERJOURNAL
 
Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...
Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...
Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...
IJMERJOURNAL
 
li-fi: the future of wireless communication
li-fi: the future of wireless communicationli-fi: the future of wireless communication
li-fi: the future of wireless communication
IJMERJOURNAL
 

More from IJMERJOURNAL (20)

Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...
Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...
Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...
 
Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...
Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...
Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...
 
A New Two-Dimensional Analytical Model of Small Geometry GaAs MESFET
A New Two-Dimensional Analytical Model of Small Geometry GaAs MESFETA New Two-Dimensional Analytical Model of Small Geometry GaAs MESFET
A New Two-Dimensional Analytical Model of Small Geometry GaAs MESFET
 
Design a WSN Control System for Filter Backwashing Process
Design a WSN Control System for Filter Backwashing ProcessDesign a WSN Control System for Filter Backwashing Process
Design a WSN Control System for Filter Backwashing Process
 
Application of Customer Relationship Management (Crm) Dimensions: A Critical ...
Application of Customer Relationship Management (Crm) Dimensions: A Critical ...Application of Customer Relationship Management (Crm) Dimensions: A Critical ...
Application of Customer Relationship Management (Crm) Dimensions: A Critical ...
 
Comparisons of Shallow Foundations in Different Soil Condition
Comparisons of Shallow Foundations in Different Soil ConditionComparisons of Shallow Foundations in Different Soil Condition
Comparisons of Shallow Foundations in Different Soil Condition
 
Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...
Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...
Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...
 
Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...
Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...
Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...
 
Investigating The Performance of A Steam Power Plant
Investigating The Performance of A Steam Power PlantInvestigating The Performance of A Steam Power Plant
Investigating The Performance of A Steam Power Plant
 
Study of Time Reduction in Manufacturing of Screws Used in Twin Screw Pump
Study of Time Reduction in Manufacturing of Screws Used in Twin Screw PumpStudy of Time Reduction in Manufacturing of Screws Used in Twin Screw Pump
Study of Time Reduction in Manufacturing of Screws Used in Twin Screw Pump
 
Mitigation of Voltage Imbalance in A Two Feeder Distribution System Using Iupqc
Mitigation of Voltage Imbalance in A Two Feeder Distribution System Using IupqcMitigation of Voltage Imbalance in A Two Feeder Distribution System Using Iupqc
Mitigation of Voltage Imbalance in A Two Feeder Distribution System Using Iupqc
 
Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...
Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...
Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...
 
Analytical Solutions of simultaneous Linear Differential Equations in Chemica...
Analytical Solutions of simultaneous Linear Differential Equations in Chemica...Analytical Solutions of simultaneous Linear Differential Equations in Chemica...
Analytical Solutions of simultaneous Linear Differential Equations in Chemica...
 
Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...
Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...
Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...
 
Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...
Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...
Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...
 
An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...
An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...
An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...
 
Nigerian Economy and the Impact of Alternative Energy.
Nigerian Economy and the Impact of Alternative Energy.Nigerian Economy and the Impact of Alternative Energy.
Nigerian Economy and the Impact of Alternative Energy.
 
CASE STUDY
CASE STUDYCASE STUDY
CASE STUDY
 
Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...
Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...
Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...
 
li-fi: the future of wireless communication
li-fi: the future of wireless communicationli-fi: the future of wireless communication
li-fi: the future of wireless communication
 

Recently uploaded

4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
Gino153088
 
An Introduction to the Compiler Designss
An Introduction to the Compiler DesignssAn Introduction to the Compiler Designss
An Introduction to the Compiler Designss
ElakkiaU
 
Object Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOADObject Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOAD
PreethaV16
 
5G Radio Network Througput Problem Analysis HCIA.pdf
5G Radio Network Througput Problem Analysis HCIA.pdf5G Radio Network Througput Problem Analysis HCIA.pdf
5G Radio Network Througput Problem Analysis HCIA.pdf
AlvianRamadhani5
 
Digital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptxDigital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptx
aryanpankaj78
 
smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...
um7474492
 
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
Paris Salesforce Developer Group
 
Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
Divyanshu
 
Supermarket Management System Project Report.pdf
Supermarket Management System Project Report.pdfSupermarket Management System Project Report.pdf
Supermarket Management System Project Report.pdf
Kamal Acharya
 
AI-Based Home Security System : Home security
AI-Based Home Security System : Home securityAI-Based Home Security System : Home security
AI-Based Home Security System : Home security
AIRCC Publishing Corporation
 
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENTNATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
Addu25809
 
Software Engineering and Project Management - Software Testing + Agile Method...
Software Engineering and Project Management - Software Testing + Agile Method...Software Engineering and Project Management - Software Testing + Agile Method...
Software Engineering and Project Management - Software Testing + Agile Method...
Prakhyath Rai
 
Transformers design and coooling methods
Transformers design and coooling methodsTransformers design and coooling methods
Transformers design and coooling methods
Roger Rozario
 
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
upoux
 
Blood finder application project report (1).pdf
Blood finder application project report (1).pdfBlood finder application project report (1).pdf
Blood finder application project report (1).pdf
Kamal Acharya
 
一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理
uqyfuc
 
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
MadhavJungKarki
 
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by AnantLLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
Anant Corporation
 
2. protection of river banks and bed erosion protection works.ppt
2. protection of river banks and bed erosion protection works.ppt2. protection of river banks and bed erosion protection works.ppt
2. protection of river banks and bed erosion protection works.ppt
abdatawakjira
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
VANDANAMOHANGOUDA
 

Recently uploaded (20)

4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
 
An Introduction to the Compiler Designss
An Introduction to the Compiler DesignssAn Introduction to the Compiler Designss
An Introduction to the Compiler Designss
 
Object Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOADObject Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOAD
 
5G Radio Network Througput Problem Analysis HCIA.pdf
5G Radio Network Througput Problem Analysis HCIA.pdf5G Radio Network Througput Problem Analysis HCIA.pdf
5G Radio Network Througput Problem Analysis HCIA.pdf
 
Digital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptxDigital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptx
 
smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...
 
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
 
Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
 
Supermarket Management System Project Report.pdf
Supermarket Management System Project Report.pdfSupermarket Management System Project Report.pdf
Supermarket Management System Project Report.pdf
 
AI-Based Home Security System : Home security
AI-Based Home Security System : Home securityAI-Based Home Security System : Home security
AI-Based Home Security System : Home security
 
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENTNATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
 
Software Engineering and Project Management - Software Testing + Agile Method...
Software Engineering and Project Management - Software Testing + Agile Method...Software Engineering and Project Management - Software Testing + Agile Method...
Software Engineering and Project Management - Software Testing + Agile Method...
 
Transformers design and coooling methods
Transformers design and coooling methodsTransformers design and coooling methods
Transformers design and coooling methods
 
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
 
Blood finder application project report (1).pdf
Blood finder application project report (1).pdfBlood finder application project report (1).pdf
Blood finder application project report (1).pdf
 
一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理
 
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
 
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by AnantLLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
 
2. protection of river banks and bed erosion protection works.ppt
2. protection of river banks and bed erosion protection works.ppt2. protection of river banks and bed erosion protection works.ppt
2. protection of river banks and bed erosion protection works.ppt
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
 

Data Analytics on Solar Energy Using Hadoop

  • 1. International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) | IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 57 | Data Analytics on Solar Energy Using Hadoop K.Akila1 , B.Manasavani2 , Dr.Gopikrishnan M3 1 Department Of ComputerEngineering Prathyusha Engineering College Thiruvallur, B.E.,India. 2 Department Of ComputerEngineeringPrathyusha EngineeringCollege INTRODUCTION Missing data is the missing form of informationabout phenomena, which is important, and it is the informationin which we are interested. The existence of missing data isone significant problem in data quality. Data quality playsmajor role in machine learning, data mining andknowledge discovery from databases. Machine learningalgorithmshandle missing data in a quite naive way. To avoid biasingin induced hypothesis missing data treatment shouldbe carefullyhandled.Imputationisaprocessthatreplacesthe missing values in instance by some reasonable values.The case substitution is the method developed for dealingwith missingdataininstancesanditishavingsomedrawbackswhen applied to the data mining processes. Themethods,such as substitution of missing values by the attributemean or mode should be cautiously handed to avoid inclusionof bias. 1.1 Randomness of MissingData Missing data randomness is classified [1] in threeclasses. Missing completely at random (MCAR):Missing values are scattered randomly across all instances. Inthistype of randomness, any missing data handling methodcan be applied without risk of introducing bias on the data.It occurs when the probability of an instance having amissing valueforanattributedoesnotdependoneithertheknown values or the missingdata.differences on attributes establish that the two groups donot differsignificantly. Missing at random (MAR):Missing atrandom (MAR) is a condition, which occurs when missing valuesare not randomly distributed across all observations butare randomly distributed within one or more classes (ex.missing more among whites than non-whites, but randomwithin each). The probability of an instance with a missing valuefor an attribute may depend on the known values, and not onthe value of the missing dataitself. Not missing at random (NMAR):Not missingat random is the most challenging form, occurs whenmissing values are not randomly distributed across observations. Itisalso called as non-ignorable missingness. The probabilityof an instance with a missing value for an attributemight depend on the value of thatattribute. 1.2 Handling MissingData Missing data handling methods are categoriesed asfollowsIgnoring data:This method throw-outs all instanceswith missingdata.Therearetwocoremethodstodiscarddata with missing values. The first one is known as completecase analysis.Itisavailableineveryoneofstatisticalpackagesand is the default method in manyprograms. The next method is recognized as discarding instancesor attributes. This method determines the level of missingdata oneachinstanceandattribute,anddeletestheinstancesor attributes with high extents of missing data. ABSRACT: Missing data is one of the major issues in data miningand pattern recognition. The knowledge contains in attributeswith missing data values are important in improvingregression correlationprocessofanorganization.Thelearningprocesson each instance is necessary as it may containsome exceptional knowledge. There are various methods tohandle missing data in regression correlation. Analysis of photovoltaic cell, Sunlight striking on different geographical location to know the defective or connectionless photovoltaic cell plate. we mainly aims To showcasing the energy produced at different geographical location and to find the defective plate. And also analysis the data sets of energy produced along with current weather in particular area to know the status of the photovoltaic plates. In this project, we used Hadoop Map-Reduce framework to analyze the solar energy datasets. Keywords: missing data, genetic algorithm, regression correlation.
  • 2. Data Analytics On Solar Energy Using Hadoop | IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 58 | Prior todeletingany attribute, it is vital to evaluate its connotation tothe investigation. The methods, complete case analysisand discarding is executed only if missing data aremissing completely at random. The missing data that are notmissing completely at random contain non-random elements thatmay prejudice theresults. Imputation:In imputation-based proceduresmissing valuesareimputedwithreasonable,probablevaluesrather than being deleted totally. The objective is to useknown associations that can be recognized in the valid rangevaluesofthedatasettofacilitateinestimatingthe missing values Data quality playsmajor role in machine learning, data mining andknowledge discovery from databases. Machine learningalgorithmshandle missing data in a quite naive way. To avoid biasingin induced hypothesis missing data treatment shouldbe carefullyhandled.Imputationisaprocessthatreplacesthe missing values in instance by some reasonable values.In first method limitation of ∆T was adjusted, above equation was used for ∆T<8o C. Table 1 shows criteria for the decision of τ value. τ value of 0.6-0.7 are commonly used for clear sky atmospheric transmittance coefficient value. In this study τ value of 0.69 was used for clear sky, assumed that the clear sky condition occurred when RH<40% and ambient temperature more than 8o C. Calculation algorithm was built based on decision matrix and the τ value was locally determined using the training of data set to get minimum error. The second method used in this study is by finding the correlation between RH, clearness index and beam transmittance. The data used to find correlation between beam transmittance and clearness index is measured data from new radiometer set that was installed in 2010 on the rooftop of Block P, UniversitiTeknologiPetronas, which located about 30 km from Ipoh city. About 1 month, 5 minutes time step data of global, beam and diffuse radiation from June to July 2010 was used. Before find the correlation of beam transmittance and clearness index, RH-clearness index correlation was obtained from Ipoh city available data as can be seen in Figure 3. Then beam transmittance-clearness index correlation can be obtained by scatter plot as can be seen in Figure 4. To plot Figure 4 some data were rejected due to obvious error that can be analyzed from measurement results, and the basic concept of terrestrial solar radiation characteristicsTherearetwocoremethodstodiscarddata with missing values. The first one is known as completecase analysis.Itisavailableineveryoneofstatisticalpackagesand is the default method in manyprograms MultipleImputations:ItisamethodbyRubin[1]for making multiple simulated values for eachincomplete information, and then iteratively examining datasets witheach simulated value substituted in every turn. The intentionis,possibly, to generate estimates that better indicatetrue variance and uncertainty in the data than doregression methods. This permits expert staff and software to be usedto create imputed datasets that can be analyzed byrelatively naive users equipped with standard software. It can bevery effective particularly for small to moderate levelsof missingness, where the missing data mechanism isorganized, and for datasets that are to be placed in the publicdomain Not missingat random is the most challenging form, occurs whenmissing values are not randomly distributed across observations. Itisalso called as non-ignorable missingness. The probabilityof an instance with a missing value for an attributemight depend on the value of thatattribute Following constraint were used as data rejection criteria: • Reject night data • Reject data if clearness index >1 • Reject data if beam transmittance >1 • Reject data if beam transmittance > clearness index • Reject data if clearness index >0.6 and beam transmittance <0.1 • Reject data if clearness index <0.2 and beam transmittance>0.15 Correlation between RH and beam transmittance was obtained from above correlation and plotted in Figure 5. Balaras et al studied the relationship between beam transmittance and clearness index in Athens, Greece [10], the results of the study was adopted to carry out second method in this study. Regression results were presented as follows: Therearevariousmethodstoestimatesolarradiation. Satisfactoryresultforhourlysolarradiationestimationwas obtainedbyusingatmospherictransmittancemodel[1]while otherauthorshaveuseddiffusefraction[2]andclearness indexmodels[3].Parametricoratmospherictransmittance modelrequiresdetailsatmosphericcharacteristicinformation [4].Thismodelgiveshigh- accuracyforclearsky/cloudless conditions,whichisleadingsomeauthor tousethis modelto evaluatetheperformanceofanempiricalmodelunder cloudlessconditions[5].Therearenumerousauthors
  • 3. Data Analytics On Solar Energy Using Hadoop | IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 59 | proposedthiskindofmodelasmentionedin[6].However, pureparametricmodelwasnotused in thisstudy,since there is no detail atmospheric condition data for the site. Meteorologicalparametersfrequentlyusedaspredictorsof atmosphericparameterssinceacquiringdetailatmospheric conditionsrequireadvancemeasurement.Meteorological parameterssuchassunshineduration,cloudcover,ambient temperature,relativehumidity,andprecipitationdatahave beenusedtoestimateatmospherictransmittancecoefficient inparametricmodel.Thiskindofmodeliscalled meteorological model Therearevariouswaystoestimatesolarradiationon certainareaontheearth.Ambienttemperaturebased estimation is widely used sinceambienttemperaturedataare measuredinmanyweatherstations.Inthisstudy,missing datawereestimatedbasedonambienttemperature measurementandusedmeasuredRHdataasatmospheric transmittance of the determination criteria.thatestimate hourlysolarradiationbasedondevelopedCampbelland Norman method [2], was adapted in this study. Results of the new model then compared with the existing temperature-based solar radiation prediction model as follow: A. H-S model Hargreaves and Samani [14] conducted an initial study on using Tmax and Tmin to estimate solar radiation by the following equation: G T h = K r ( T m a x - T m i n ) 0 . 5 G 0 h ( 9 ) Kris an empirical coefficient, which was recommended tobe 0.16 for interior regions and 0.19 for coastal regions. In this study Kr was locally determined using training data set. B. H-S-A model Annandale et al [15] modified H-S model by introducing correction factor as follow: GT h =Kr (1+2.7 10- 5 Z) (Tm a x -Tm i n )0 . 5 G0 h (10 ) Z is elevation in m and Krwas locally determined. V. STATISTICAL ANALYSIS FOR MODEL VALIDATION Estimation results validated using statistical parameters. Pearson correlation coefficient was calculated as routine correlation indicator. Residual error was calculated using RMSE (Root Mean Square Error) and also presented in NRMSE (Normalized Root Mean Square Error) as follows: R M S E = [ Ʃ { Y c - Y 0 } 2 / n ] 0 . 5 ( 1 1 ) N R M S E = R M S E / y m a x - y m i n ( 1 2 ) where, Yc is predicted variable Y0 is measured variable, n is number of data, ymax is maximum measured data ymin is minimum measured data. As an addition, index of agreement was calculated using equation below: d = 1 – [Ʃ(xi-yi)2 /Ʃ(ǀxi-x̅ iǀ+ǀyi-y̅ iǀ)]0.5 ( 1 3 ) where, xi is predicted variable yi is measured variable, x̅ i is averaged predicted variable and y̅ i is averaged measured variable. VI. RESULTS AND DISCUSSIONS Calculation have been carried out using methods 1 and 2, statistical calculation analysis also has been performed. Table 2 shows statistical analysis results of both methods and results of existing method (H-S and H- S-A method).The most satisfactory results were obtained using method 1. Figure 3 shows scatter plot of predicted and measured data for first method. The minimum RMSE value of 87.6 Watt/m2 was obtained with 0.95 correlation coefficient and 0.97 index of agreement value. Previous method which use precipitation data obtained averaged index of agreement of 0.95, thus the model presented in this study also performed well. Table II. Statistical analysis results Themethodsarebasedonthedecompositionmodelthatiscalculatingeachofsolarradiationcomponents,which dependonatmospherictransmittance.Twomethodstopredictatmospherictransmittancevalueusingavailablemeteorol ogicaldatawereproposed.Inthefirstmethod,adecisionmatrixwasused,whileinthesecondmethod,regressioncorrelati
  • 4. Data Analytics On Solar Energy Using Hadoop | IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 60 | onofmeteorologicalparameterswasused.Thecalculationsresultswereevaluatedusingstatisticalparameter.Thoughth eresultshowsbothofthemethodsperformwell,moresatisfactoryresultswereobtainedfromfirstmethodwithRootMean SquareErrorof87.6Watt/m2,NormalizedRootMeanSquareErrorof8.29%,correlationcoefficientof0.95andindexofa greementof0.97.Furthermore,thefirstmethodonlyneedsambienttemperatureandrelativehumiditydatathatcommonl ymeasuredinmeteorologicalstations.graphical comparison of measured and predicted solar radiation for random dates. As can be seen most of the error occurred in low radiation or cloudy sky which often occurred in rainy season (Fig. 7-b). Then after the results has been validated using available data, the predicted results in the missing data days can be occupied in the measured data set. Now the time series composite data of solar radiation is completed and can be used for other purpose such as prediction of PV system performance for respected location. Missing data is one of the major issues in data miningand pattern recognition. The knowledge contains in attributeswith missing data values are important in improvingdecision- makingprocessofanorganization.Thelearningprocesson each instance is necessary as it may containsome exceptional knowledge. There are various methods tohandle missing data in decision tree learning. Analysis of photovoltaic cell, Sunlight striking on different geographical location to know the defective or connectionless photovoltaic cell plate. we mainly aims To showcasing the energy produced at different geographical location and to find the defective plate. And also analysis the data sets of energy produced along with current weather in particular area to know the status of the photovoltaic plates. In this project, we used Hadoop Map-Reduce framework to analyze the solar energy datasets. Not missingat random is the most challenging form, occurs whenmissing values are not randomly distributed across observations. Itisalso called as non-ignorable missingness. The probabilityof an instance with a missing value for an attributemight depend on the value of thatattribute (b) Rainy season Figure 7: Graphical comparison between measured and predicted solarradiation for random dates (Method 1) Figure 8 shows prediction results in the days which solar radiation measurement were absent. In our case the amount of missing solar radiation data is 21 full days and 2 half days. Data quality playsmajor role in machine learning, data mining andknowledge discovery from databases. Machine learningalgorithmshandle missing data in a quite naive way. To avoid biasingin induced hypothesis missing data treatment shouldbe carefullyhandled.Imputationisaprocessthatreplacesthe missing values in instance by some reasonable values.The case substitution is the method developed for dealingwith missingdataininstancesanditishavingsomedrawbackswhen applied to the data mining processesThe method developed in this study may be applied for any location on the earth with notes, for first method the assignment criteria of atmospheric transmittance using RH and ambient temperature should be adjusted to the available solar radiation data of the area to get minimum error. To generate general criteria of atmospheric transmittance assignment using RH and ambient temperature further research is required with sufficient large amount of data for various areas. For the second method the correlation should be rebuild based on available measurement nearest from the location to give satisfactory estimation results.
  • 5. Data Analytics On Solar Energy Using Hadoop | IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 61 | VII. CONCLUSIONS This paper represents to estimate solar radiation from available weather data. The methods presented in this paper are based on atmospheric transmittance determination using available meteorological data. In this method regression correlation of meteorological parameter was used.Thekeyfortheaccuracyofabovemethodisinthe determinationofbeamatmospherictransmittance(τ).Beam atmospherictransmittanceisthepercentageofthebeam (direct)radiationthatwillpenetratetheatmospherewithout beingscattered.KurtandSpokasusedprecipitationdata to built decisionmatrix ofatmospheric transmittance .temperature data. The second method is by using RH-clearness index, clearness index-beam atmospheric transmission and beam atmospheric transmission-RH correlation. The result shows that both methods perform well. Method 1 provided better results with minimum correlation coefficient of 0.95, RMSE of 87.6 Watt/m2 , NRSME of 8.29% and index of agreement of 0.97. The prediction was intended to fill missing data in solar radiation data set to get complete time series data. However, in this study only one year of one area data have been used. Validation using sufficient large amount of data is required for wider application of the method. REFERENCES [1]. G. S. Campbell and J. M. Norman, “Introduction to Environmental Biophysics. 2nd ed. New York: Springer-Verlag. Pp. 167–183, 1998 [2]. Guofeng Wu, et al. “Methods and strategy for modeling daily global solar radiation with measured meteorological data – A case study in Nanchang station, China”, Energy Conversion and Management 48, 2447-2452, 2007 [3]. Yang K, Koike T. Estimating surface solar radiation from upper-airhumidity. Solar Energy, 72(2):177– 86, 2002 [4]. N. Mohan Kumar et al, “An empirical model for estimating hourly solar radiation over the Indian seas during summer monsoon season”, Indian Journal of Marine Sciences, Vol. 30, pp 123-131, 2001 [5]. Reindl DT, Beckman WA, Duffie JA, “Diffuse fraction correlations”, Solar Energy 1990; 45:1-7 [6]. F.J. Batlles et al, “Empirical modeling of hourly direct irradiance by means of hourly global irradiance”, Energy 25: 675-688, 2000 [7]. Kurt Spokas and Frank Forcella, “Estimating hourly incoming solar radiation from limited meteorological data”, Weed Science, 54:182– 189, 2006 [8]. Hunt LA, Kuchar L, Swanton CJ. Estimation of solar radiation for use in crop modelling. Agric. Forest Meteorol. 91(3–4):293–300, 1998 [9]. Louche et al, “Correlations for direct normal and global horizontal irradiation on French Mediterranean site”, Solar Energy 46: 261-266, 1991 as cited by [8] [10]. Athens, Greece. et al., “On the relationship of beam transmittance on clearness index for Athens, Greece”, Int. J. Solar Energy, Vol. 7, pp 171-179, 1989