SlideShare a Scribd company logo
1 of 5
Download to read offline
International
OPEN ACCESS Journal
Of Modern Engineering Research (IJMER)
| IJMER | ISSN: 2249ā€“6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 57 |
Data Analytics on Solar Energy Using Hadoop
K.Akila1
, B.Manasavani2
, Dr.Gopikrishnan M3
1
Department Of ComputerEngineering Prathyusha Engineering College Thiruvallur, B.E.,India.
2
Department Of ComputerEngineeringPrathyusha EngineeringCollege
INTRODUCTION
Missing data is the missing form of informationabout phenomena, which is important, and it is the
informationin which we are interested. The existence of missing data isone significant problem in data quality.
Data quality playsmajor role in machine learning, data mining andknowledge discovery from databases.
Machine learningalgorithmshandle missing data in a quite naive way. To avoid biasingin induced hypothesis
missing data treatment shouldbe carefullyhandled.Imputationisaprocessthatreplacesthe missing values in
instance by some reasonable values.The case substitution is the method developed for dealingwith
missingdataininstancesanditishavingsomedrawbackswhen applied to the data mining processes.
Themethods,such as substitution of missing values by the attributemean or mode should be cautiously handed to
avoid inclusionof bias.
1.1 Randomness of MissingData
Missing data randomness is classified [1] in threeclasses.
Missing completely at random (MCAR):Missing values are scattered randomly across all instances.
Inthistype of randomness, any missing data handling methodcan be applied without risk of introducing bias on
the data.It occurs when the probability of an instance having amissing
valueforanattributedoesnotdependoneithertheknown values or the missingdata.differences on attributes establish
that the two groups donot differsignificantly.
Missing at random (MAR):Missing atrandom (MAR) is a condition, which occurs when missing
valuesare not randomly distributed across all observations butare randomly distributed within one or more
classes (ex.missing more among whites than non-whites, but randomwithin each). The probability of an instance
with a missing valuefor an attribute may depend on the known values, and not onthe value of the missing
dataitself.
Not missing at random (NMAR):Not missingat random is the most challenging form, occurs
whenmissing values are not randomly distributed across observations. Itisalso called as non-ignorable
missingness. The probabilityof an instance with a missing value for an attributemight depend on the value of
thatattribute.
1.2 Handling MissingData
Missing data handling methods are categoriesed asfollowsIgnoring data:This method throw-outs all
instanceswith missingdata.Therearetwocoremethodstodiscarddata with missing values. The first one is known as
completecase analysis.Itisavailableineveryoneofstatisticalpackagesand is the default method in manyprograms.
The next method is recognized as discarding instancesor attributes. This method determines the level of
missingdata oneachinstanceandattribute,anddeletestheinstancesor attributes with high extents of missing data.
ABSRACT: Missing data is one of the major issues in data miningand pattern recognition. The
knowledge contains in attributeswith missing data values are important in improvingregression
correlationprocessofanorganization.Thelearningprocesson each instance is necessary as it may
containsome exceptional knowledge. There are various methods tohandle missing data in regression
correlation. Analysis of photovoltaic cell, Sunlight striking on different geographical location to know
the defective or connectionless photovoltaic cell plate. we mainly aims To showcasing the energy
produced at different geographical location and to find the defective plate.
And also analysis the data sets of energy produced along with current weather in particular area to
know the status of the photovoltaic plates. In this project, we used Hadoop Map-Reduce framework to
analyze the solar energy datasets.
Keywords: missing data, genetic algorithm, regression correlation.
Data Analytics On Solar Energy Using Hadoop
| IJMER | ISSN: 2249ā€“6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 58 |
Prior todeletingany attribute, it is vital to evaluate its connotation tothe investigation. The methods, complete
case analysisand discarding is executed only if missing data aremissing completely at random. The missing data
that are notmissing completely at random contain non-random elements thatmay prejudice theresults.
Imputation:In imputation-based proceduresmissing
valuesareimputedwithreasonable,probablevaluesrather than being deleted totally. The objective is to useknown
associations that can be recognized in the valid rangevaluesofthedatasettofacilitateinestimatingthe missing
values
Data quality playsmajor role in machine learning, data mining andknowledge discovery from
databases. Machine learningalgorithmshandle missing data in a quite naive way. To avoid biasingin induced
hypothesis missing data treatment shouldbe carefullyhandled.Imputationisaprocessthatreplacesthe missing
values in instance by some reasonable values.In first method limitation of āˆ†T was adjusted, above equation was
used for āˆ†T<8o
C. Table 1 shows criteria for the decision of Ļ„ value. Ļ„ value of 0.6-0.7 are commonly used for
clear sky atmospheric transmittance coefficient value. In this study Ļ„ value of 0.69 was used for clear sky,
assumed that the clear sky condition occurred when RH<40% and ambient temperature more than 8o
C.
Calculation algorithm was built based on decision matrix and the Ļ„ value was locally determined using the
training of data set to get minimum error.
The second method used in this study is by finding the correlation between RH, clearness index and
beam transmittance. The data used to find correlation between beam transmittance and clearness index is
measured data from new radiometer set that was installed in 2010 on the rooftop of Block P,
UniversitiTeknologiPetronas, which located about 30 km from Ipoh city. About 1 month, 5 minutes time step
data of global, beam and diffuse radiation from June to July 2010 was used. Before find the correlation of beam
transmittance and clearness index, RH-clearness index correlation was obtained from Ipoh city available data as
can be seen in Figure 3.
Then beam transmittance-clearness index correlation can be obtained by scatter plot as can be seen in
Figure 4. To plot Figure 4 some data were rejected due to obvious error that can be analyzed from measurement
results, and the basic concept of terrestrial solar radiation characteristicsTherearetwocoremethodstodiscarddata
with missing values. The first one is known as completecase
analysis.Itisavailableineveryoneofstatisticalpackagesand is the default method in manyprograms
MultipleImputations:ItisamethodbyRubin[1]for making multiple simulated values for eachincomplete
information, and then iteratively examining datasets witheach simulated value substituted in every turn. The
intentionis,possibly, to generate estimates that better indicatetrue variance and uncertainty in the data than
doregression methods. This permits expert staff and software to be usedto create imputed datasets that can be
analyzed byrelatively naive users equipped with standard software. It can bevery effective particularly for small
to moderate levelsof missingness, where the missing data mechanism isorganized, and for datasets that are to be
placed in the publicdomain
Not missingat random is the most challenging form, occurs whenmissing values are not randomly
distributed across observations. Itisalso called as non-ignorable missingness. The probabilityof an instance with
a missing value for an attributemight depend on the value of thatattribute
Following constraint were used as data rejection criteria:
ā€¢ Reject night data
ā€¢ Reject data if clearness index >1
ā€¢ Reject data if beam transmittance >1
ā€¢ Reject data if beam transmittance > clearness index
ā€¢ Reject data if clearness index >0.6 and beam transmittance <0.1
ā€¢ Reject data if clearness index <0.2 and beam transmittance>0.15
Correlation between RH and beam transmittance was obtained from above correlation and plotted in
Figure 5. Balaras et al studied the relationship between beam transmittance and clearness index in Athens,
Greece [10], the results of the study was adopted to carry out second method in this study. Regression results
were presented as follows:
Therearevariousmethodstoestimatesolarradiation.
Satisfactoryresultforhourlysolarradiationestimationwas obtainedbyusingatmospherictransmittancemodel[1]while
otherauthorshaveuseddiffusefraction[2]andclearness indexmodels[3].Parametricoratmospherictransmittance
modelrequiresdetailsatmosphericcharacteristicinformation [4].Thismodelgiveshigh-
accuracyforclearsky/cloudless conditions,whichisleadingsomeauthor tousethis modelto
evaluatetheperformanceofanempiricalmodelunder cloudlessconditions[5].Therearenumerousauthors
Data Analytics On Solar Energy Using Hadoop
| IJMER | ISSN: 2249ā€“6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 59 |
proposedthiskindofmodelasmentionedin[6].However, pureparametricmodelwasnotused in thisstudy,since there
is no detail atmospheric condition data for the site.
Meteorologicalparametersfrequentlyusedaspredictorsof
atmosphericparameterssinceacquiringdetailatmospheric conditionsrequireadvancemeasurement.Meteorological
parameterssuchassunshineduration,cloudcover,ambient temperature,relativehumidity,andprecipitationdatahave
beenusedtoestimateatmospherictransmittancecoefficient inparametricmodel.Thiskindofmodeliscalled
meteorological model
Therearevariouswaystoestimatesolarradiationon certainareaontheearth.Ambienttemperaturebased
estimation is widely used sinceambienttemperaturedataare measuredinmanyweatherstations.Inthisstudy,missing
datawereestimatedbasedonambienttemperature measurementandusedmeasuredRHdataasatmospheric
transmittance of the determination criteria.thatestimate hourlysolarradiationbasedondevelopedCampbelland
Norman method [2], was adapted in this study.
Results of the new model then compared with the existing temperature-based solar radiation prediction
model as follow:
A. H-S model
Hargreaves and Samani [14] conducted an initial study on using Tmax and Tmin to estimate solar radiation by the
following equation:
G T h = K r ( T m a x - T m i n ) 0 . 5
G 0 h ( 9 )
Kris an empirical coefficient, which was recommended tobe 0.16 for interior regions and 0.19 for coastal
regions. In this study Kr was locally determined using training data set.
B. H-S-A model
Annandale et al [15] modified H-S model by introducing correction factor as follow:
GT h =Kr (1+2.7 10- 5
Z) (Tm a x -Tm i n )0 . 5
G0 h (10 )
Z is elevation in m and Krwas locally determined.
V. STATISTICAL ANALYSIS FOR MODEL VALIDATION
Estimation results validated using statistical parameters. Pearson correlation coefficient was calculated
as routine correlation indicator. Residual error was calculated using RMSE (Root Mean Square Error) and also
presented in NRMSE (Normalized Root Mean Square Error) as follows:
R M S E = [ Ę© { Y c - Y 0 } 2
/ n ] 0 . 5
( 1 1 )
N R M S E = R M S E / y m a x - y m i n ( 1 2 )
where, Yc is predicted variable Y0 is measured variable, n is number of data, ymax is maximum measured data ymin
is minimum measured data.
As an addition, index of agreement was calculated using equation below:
d = 1 ā€“ [Ę©(xi-yi)2
/Ę©(Ē€xi-xĢ… iĒ€+Ē€yi-yĢ… iĒ€)]0.5
( 1 3 )
where, xi is predicted variable yi is measured variable, xĢ… i is averaged predicted variable and yĢ… i is averaged
measured variable.
VI. RESULTS AND DISCUSSIONS
Calculation have been carried out using methods 1 and 2, statistical calculation analysis also has been
performed. Table 2 shows statistical analysis results of both methods and results of existing method (H-S and H-
S-A method).The most satisfactory results were obtained using method 1. Figure 3 shows scatter plot of
predicted and measured data for first method. The minimum RMSE value of 87.6 Watt/m2
was obtained with
0.95 correlation coefficient and 0.97 index of agreement value. Previous method which use precipitation data
obtained averaged index of agreement of 0.95, thus the model presented in this study also performed well.
Table II. Statistical analysis results
Themethodsarebasedonthedecompositionmodelthatiscalculatingeachofsolarradiationcomponents,which
dependonatmospherictransmittance.Twomethodstopredictatmospherictransmittancevalueusingavailablemeteorol
ogicaldatawereproposed.Inthefirstmethod,adecisionmatrixwasused,whileinthesecondmethod,regressioncorrelati
Data Analytics On Solar Energy Using Hadoop
| IJMER | ISSN: 2249ā€“6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 60 |
onofmeteorologicalparameterswasused.Thecalculationsresultswereevaluatedusingstatisticalparameter.Thoughth
eresultshowsbothofthemethodsperformwell,moresatisfactoryresultswereobtainedfromfirstmethodwithRootMean
SquareErrorof87.6Watt/m2,NormalizedRootMeanSquareErrorof8.29%,correlationcoefficientof0.95andindexofa
greementof0.97.Furthermore,thefirstmethodonlyneedsambienttemperatureandrelativehumiditydatathatcommonl
ymeasuredinmeteorologicalstations.graphical comparison of measured and predicted solar radiation for random
dates. As can be seen most of the error occurred in low radiation or cloudy sky which often occurred in rainy
season (Fig. 7-b). Then after the results has been validated using available data, the predicted results in the
missing data days can be occupied in the measured data set. Now the time series composite data of solar
radiation is completed and can be used for other purpose such as prediction of PV system performance for
respected location.
Missing data is one of the major issues in data miningand pattern recognition. The knowledge contains
in attributeswith missing data values are important in improvingdecision-
makingprocessofanorganization.Thelearningprocesson each instance is necessary as it may containsome
exceptional knowledge. There are various methods tohandle missing data in decision tree learning. Analysis of
photovoltaic cell, Sunlight striking on different geographical location to know the defective or connectionless
photovoltaic cell plate. we mainly aims To showcasing the energy produced at different geographical location
and to find the defective plate.
And also analysis the data sets of energy produced along with current weather in particular area to
know the status of the photovoltaic plates. In this project, we used Hadoop Map-Reduce framework to analyze
the solar energy datasets.
Not missingat random is the most challenging form, occurs whenmissing values are not randomly
distributed across observations. Itisalso called as non-ignorable missingness. The probabilityof an instance with
a missing value for an attributemight depend on the value of thatattribute
(b) Rainy season
Figure 7: Graphical comparison between measured and predicted solarradiation for random dates (Method 1)
Figure 8 shows prediction results in the days which solar radiation measurement were absent. In our
case the amount of missing solar radiation data is 21 full days and 2 half days.
Data quality playsmajor role in machine learning, data mining andknowledge discovery from
databases. Machine learningalgorithmshandle missing data in a quite naive way. To avoid biasingin induced
hypothesis missing data treatment shouldbe carefullyhandled.Imputationisaprocessthatreplacesthe missing
values in instance by some reasonable values.The case substitution is the method developed for dealingwith
missingdataininstancesanditishavingsomedrawbackswhen applied to the data mining processesThe method
developed in this study may be applied for any location on the earth with notes, for first method the assignment
criteria of atmospheric transmittance using RH and ambient temperature should be adjusted to the available
solar radiation data of the area to get minimum error. To generate general criteria of atmospheric transmittance
assignment using RH and ambient temperature further research is required with sufficient large amount of data
for various areas. For the second method the correlation should be rebuild based on available measurement
nearest from the location to give satisfactory estimation results.
Data Analytics On Solar Energy Using Hadoop
| IJMER | ISSN: 2249ā€“6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 61 |
VII. CONCLUSIONS
This paper represents to estimate solar radiation from available weather data. The methods presented in
this paper are based on atmospheric transmittance determination using available meteorological data. In this
method regression correlation of meteorological parameter was
used.Thekeyfortheaccuracyofabovemethodisinthe determinationofbeamatmospherictransmittance(Ļ„).Beam
atmospherictransmittanceisthepercentageofthebeam (direct)radiationthatwillpenetratetheatmospherewithout
beingscattered.KurtandSpokasusedprecipitationdata to built decisionmatrix ofatmospheric transmittance
.temperature data. The second method is by using RH-clearness index, clearness index-beam atmospheric
transmission and beam atmospheric transmission-RH correlation. The result shows that both methods perform
well. Method 1 provided better results with minimum correlation coefficient of 0.95, RMSE of 87.6 Watt/m2
,
NRSME of 8.29% and index of agreement of 0.97. The prediction was intended to fill missing data in solar
radiation data set to get complete time series data. However, in this study only one year of one area data have
been used. Validation using sufficient large amount of data is required for wider application of the method.
REFERENCES
[1]. G. S. Campbell and J. M. Norman, ā€œIntroduction to Environmental Biophysics. 2nd ed. New York:
Springer-Verlag. Pp. 167ā€“183, 1998
[2]. Guofeng Wu, et al. ā€œMethods and strategy for modeling daily global solar radiation with measured
meteorological data ā€“ A case study in Nanchang station, Chinaā€, Energy Conversion and Management
48, 2447-2452, 2007
[3]. Yang K, Koike T. Estimating surface solar radiation from upper-airhumidity. Solar Energy, 72(2):177ā€“
86, 2002
[4]. N. Mohan Kumar et al, ā€œAn empirical model for estimating hourly solar radiation over the Indian seas
during summer monsoon seasonā€, Indian Journal of Marine Sciences, Vol. 30, pp 123-131, 2001
[5]. Reindl DT, Beckman WA, Duffie JA, ā€œDiffuse fraction correlationsā€, Solar Energy 1990; 45:1-7
[6]. F.J. Batlles et al, ā€œEmpirical modeling of hourly direct irradiance by means of hourly global
irradianceā€, Energy 25: 675-688, 2000
[7]. Kurt Spokas and Frank Forcella, ā€œEstimating hourly incoming solar radiation from limited
meteorological dataā€, Weed Science, 54:182ā€“ 189, 2006
[8]. Hunt LA, Kuchar L, Swanton CJ. Estimation of solar radiation for use in crop modelling. Agric. Forest
Meteorol. 91(3ā€“4):293ā€“300, 1998
[9]. Louche et al, ā€œCorrelations for direct normal and global horizontal irradiation on French Mediterranean
siteā€, Solar Energy 46: 261-266, 1991 as cited by [8]
[10]. Athens, Greece. et al., ā€œOn the relationship of beam transmittance on clearness index for Athens,
Greeceā€, Int. J. Solar Energy, Vol. 7, pp 171-179, 1989

More Related Content

What's hot

TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...
TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...
TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...IJDKP
Ā 
A new clutering approach for anomaly intrusion detection
A new clutering approach for anomaly intrusion detectionA new clutering approach for anomaly intrusion detection
A new clutering approach for anomaly intrusion detectionIJDKP
Ā 
A PROCESS OF LINK MINING
A PROCESS OF LINK MININGA PROCESS OF LINK MINING
A PROCESS OF LINK MININGcsandit
Ā 
IRJET- Prediction of Heart Disease using RNN Algorithm
IRJET- Prediction of Heart Disease using RNN AlgorithmIRJET- Prediction of Heart Disease using RNN Algorithm
IRJET- Prediction of Heart Disease using RNN AlgorithmIRJET Journal
Ā 
APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...
APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...
APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...Zac Darcy
Ā 
The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...
The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...
The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...IJECEIAES
Ā 
Sampling methods for counting temporal motifs
Sampling methods for counting temporal motifsSampling methods for counting temporal motifs
Sampling methods for counting temporal motifsAustin Benson
Ā 
Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...
Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...
Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...IJECEIAES
Ā 
A Magnified Application of Deficient Data Using Bolzano Classifier
A Magnified Application of Deficient Data Using Bolzano ClassifierA Magnified Application of Deficient Data Using Bolzano Classifier
A Magnified Application of Deficient Data Using Bolzano Classifierjournal ijrtem
Ā 
An Extensive Review on Generative Adversarial Networks GANā€™s
An Extensive Review on Generative Adversarial Networks GANā€™sAn Extensive Review on Generative Adversarial Networks GANā€™s
An Extensive Review on Generative Adversarial Networks GANā€™sijtsrd
Ā 
Evaluation of network intrusion detection using markov chain
Evaluation of network intrusion detection using markov chainEvaluation of network intrusion detection using markov chain
Evaluation of network intrusion detection using markov chainIJCI JOURNAL
Ā 
Neural networks, naiĢˆve bayes and decision tree machine learning
Neural networks, naiĢˆve bayes and decision tree machine learningNeural networks, naiĢˆve bayes and decision tree machine learning
Neural networks, naiĢˆve bayes and decision tree machine learningFrancisco E. Figueroa-Nigaglioni
Ā 
Drug Discovery and Development Using AI
Drug Discovery and Development Using AIDrug Discovery and Development Using AI
Drug Discovery and Development Using AIDatabricks
Ā 
Accounting for variance in machine learning benchmarks
Accounting for variance in machine learning benchmarksAccounting for variance in machine learning benchmarks
Accounting for variance in machine learning benchmarksDevansh16
Ā 
Two Layer k-means based Consensus Clustering for Rural Health Information System
Two Layer k-means based Consensus Clustering for Rural Health Information SystemTwo Layer k-means based Consensus Clustering for Rural Health Information System
Two Layer k-means based Consensus Clustering for Rural Health Information SystemIRJET Journal
Ā 
IRJET - Random Data Perturbation Techniques in Privacy Preserving Data Mi...
IRJET -  	  Random Data Perturbation Techniques in Privacy Preserving Data Mi...IRJET -  	  Random Data Perturbation Techniques in Privacy Preserving Data Mi...
IRJET - Random Data Perturbation Techniques in Privacy Preserving Data Mi...IRJET Journal
Ā 
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...Devansh16
Ā 
ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...
ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...
ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...IJNSA Journal
Ā 

What's hot (18)

TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...
TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...
TUPLE VALUE BASED MULTIPLICATIVE DATA PERTURBATION APPROACH TO PRESERVE PRIVA...
Ā 
A new clutering approach for anomaly intrusion detection
A new clutering approach for anomaly intrusion detectionA new clutering approach for anomaly intrusion detection
A new clutering approach for anomaly intrusion detection
Ā 
A PROCESS OF LINK MINING
A PROCESS OF LINK MININGA PROCESS OF LINK MINING
A PROCESS OF LINK MINING
Ā 
IRJET- Prediction of Heart Disease using RNN Algorithm
IRJET- Prediction of Heart Disease using RNN AlgorithmIRJET- Prediction of Heart Disease using RNN Algorithm
IRJET- Prediction of Heart Disease using RNN Algorithm
Ā 
APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...
APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...
APPLICATION OF ARTIFICIAL NEURAL NETWORKS IN ESTIMATING PARTICIPATION IN ELEC...
Ā 
The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...
The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...
The Evaluated Measurement of a Combined Genetic Algorithm and Artificial Immu...
Ā 
Sampling methods for counting temporal motifs
Sampling methods for counting temporal motifsSampling methods for counting temporal motifs
Sampling methods for counting temporal motifs
Ā 
Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...
Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...
Improving IF Algorithm for Data Aggregation Techniques in Wireless Sensor Net...
Ā 
A Magnified Application of Deficient Data Using Bolzano Classifier
A Magnified Application of Deficient Data Using Bolzano ClassifierA Magnified Application of Deficient Data Using Bolzano Classifier
A Magnified Application of Deficient Data Using Bolzano Classifier
Ā 
An Extensive Review on Generative Adversarial Networks GANā€™s
An Extensive Review on Generative Adversarial Networks GANā€™sAn Extensive Review on Generative Adversarial Networks GANā€™s
An Extensive Review on Generative Adversarial Networks GANā€™s
Ā 
Evaluation of network intrusion detection using markov chain
Evaluation of network intrusion detection using markov chainEvaluation of network intrusion detection using markov chain
Evaluation of network intrusion detection using markov chain
Ā 
Neural networks, naiĢˆve bayes and decision tree machine learning
Neural networks, naiĢˆve bayes and decision tree machine learningNeural networks, naiĢˆve bayes and decision tree machine learning
Neural networks, naiĢˆve bayes and decision tree machine learning
Ā 
Drug Discovery and Development Using AI
Drug Discovery and Development Using AIDrug Discovery and Development Using AI
Drug Discovery and Development Using AI
Ā 
Accounting for variance in machine learning benchmarks
Accounting for variance in machine learning benchmarksAccounting for variance in machine learning benchmarks
Accounting for variance in machine learning benchmarks
Ā 
Two Layer k-means based Consensus Clustering for Rural Health Information System
Two Layer k-means based Consensus Clustering for Rural Health Information SystemTwo Layer k-means based Consensus Clustering for Rural Health Information System
Two Layer k-means based Consensus Clustering for Rural Health Information System
Ā 
IRJET - Random Data Perturbation Techniques in Privacy Preserving Data Mi...
IRJET -  	  Random Data Perturbation Techniques in Privacy Preserving Data Mi...IRJET -  	  Random Data Perturbation Techniques in Privacy Preserving Data Mi...
IRJET - Random Data Perturbation Techniques in Privacy Preserving Data Mi...
Ā 
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Ā 
ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...
ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...
ANALYSIS OF MACHINE LEARNING ALGORITHMS WITH FEATURE SELECTION FOR INTRUSION ...
Ā 

Similar to Data Analytics on Solar Energy Using Hadoop

COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...ijiert bestjournal
Ā 
IRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence ChainIRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence ChainIRJET Journal
Ā 
Unsupervised Distance Based Detection of Outliers by using Anti-hubs
Unsupervised Distance Based Detection of Outliers by using Anti-hubsUnsupervised Distance Based Detection of Outliers by using Anti-hubs
Unsupervised Distance Based Detection of Outliers by using Anti-hubsIRJET Journal
Ā 
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...IJDKP
Ā 
LINK MINING PROCESS
LINK MINING PROCESSLINK MINING PROCESS
LINK MINING PROCESSIJDKP
Ā 
LINK MINING PROCESS
LINK MINING PROCESSLINK MINING PROCESS
LINK MINING PROCESSIJDKP
Ā 
A Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected OutliersA Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected OutliersZac Darcy
Ā 
A MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERS
A MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERSA MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERS
A MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERSZac Darcy
Ā 
A Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected OutliersA Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected OutliersZac Darcy
Ā 
Outlier Detection Using Unsupervised Learning on High Dimensional Data
Outlier Detection Using Unsupervised Learning on High Dimensional DataOutlier Detection Using Unsupervised Learning on High Dimensional Data
Outlier Detection Using Unsupervised Learning on High Dimensional DataIJERA Editor
Ā 
Big Data with Rough Set Using Map- Reduce
Big Data with Rough Set Using Map- ReduceBig Data with Rough Set Using Map- Reduce
Big Data with Rough Set Using Map- Reduceijircee
Ā 
Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...nalini manogaran
Ā 
Survey paper on Big Data Imputation and Privacy Algorithms
Survey paper on Big Data Imputation and Privacy AlgorithmsSurvey paper on Big Data Imputation and Privacy Algorithms
Survey paper on Big Data Imputation and Privacy AlgorithmsIRJET Journal
Ā 
IRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET- Probability based Missing Value Imputation Method and its AnalysisIRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET- Probability based Missing Value Imputation Method and its AnalysisIRJET Journal
Ā 
Analysis on different Data mining Techniques and algorithms used in IOT
Analysis on different Data mining Techniques and algorithms used in IOTAnalysis on different Data mining Techniques and algorithms used in IOT
Analysis on different Data mining Techniques and algorithms used in IOTIJERA Editor
Ā 
Detection of Outliers in Large Dataset using Distributed Approach
Detection of Outliers in Large Dataset using Distributed ApproachDetection of Outliers in Large Dataset using Distributed Approach
Detection of Outliers in Large Dataset using Distributed ApproachEditor IJMTER
Ā 
Choosing allowability boundaries for describing objects in subject areas
Choosing allowability boundaries for describing objects in subject areasChoosing allowability boundaries for describing objects in subject areas
Choosing allowability boundaries for describing objects in subject areasIAESIJAI
Ā 
IRJET- Credit Card Fraud Detection using Isolation Forest
IRJET- Credit Card Fraud Detection using Isolation ForestIRJET- Credit Card Fraud Detection using Isolation Forest
IRJET- Credit Card Fraud Detection using Isolation ForestIRJET Journal
Ā 

Similar to Data Analytics on Solar Energy Using Hadoop (20)

COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
Ā 
IRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence ChainIRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence Chain
Ā 
Unsupervised Distance Based Detection of Outliers by using Anti-hubs
Unsupervised Distance Based Detection of Outliers by using Anti-hubsUnsupervised Distance Based Detection of Outliers by using Anti-hubs
Unsupervised Distance Based Detection of Outliers by using Anti-hubs
Ā 
G44093135
G44093135G44093135
G44093135
Ā 
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
Ā 
LINK MINING PROCESS
LINK MINING PROCESSLINK MINING PROCESS
LINK MINING PROCESS
Ā 
LINK MINING PROCESS
LINK MINING PROCESSLINK MINING PROCESS
LINK MINING PROCESS
Ā 
A Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected OutliersA Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected Outliers
Ā 
A MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERS
A MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERSA MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERS
A MIXTURE MODEL OF HUBNESS AND PCA FOR DETECTION OF PROJECTED OUTLIERS
Ā 
A Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected OutliersA Mixture Model of Hubness and PCA for Detection of Projected Outliers
A Mixture Model of Hubness and PCA for Detection of Projected Outliers
Ā 
Outlier Detection Using Unsupervised Learning on High Dimensional Data
Outlier Detection Using Unsupervised Learning on High Dimensional DataOutlier Detection Using Unsupervised Learning on High Dimensional Data
Outlier Detection Using Unsupervised Learning on High Dimensional Data
Ā 
Big Data with Rough Set Using Map- Reduce
Big Data with Rough Set Using Map- ReduceBig Data with Rough Set Using Map- Reduce
Big Data with Rough Set Using Map- Reduce
Ā 
Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...
Ā 
Survey paper on Big Data Imputation and Privacy Algorithms
Survey paper on Big Data Imputation and Privacy AlgorithmsSurvey paper on Big Data Imputation and Privacy Algorithms
Survey paper on Big Data Imputation and Privacy Algorithms
Ā 
IRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET- Probability based Missing Value Imputation Method and its AnalysisIRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET- Probability based Missing Value Imputation Method and its Analysis
Ā 
Analysis on different Data mining Techniques and algorithms used in IOT
Analysis on different Data mining Techniques and algorithms used in IOTAnalysis on different Data mining Techniques and algorithms used in IOT
Analysis on different Data mining Techniques and algorithms used in IOT
Ā 
B0930610
B0930610B0930610
B0930610
Ā 
Detection of Outliers in Large Dataset using Distributed Approach
Detection of Outliers in Large Dataset using Distributed ApproachDetection of Outliers in Large Dataset using Distributed Approach
Detection of Outliers in Large Dataset using Distributed Approach
Ā 
Choosing allowability boundaries for describing objects in subject areas
Choosing allowability boundaries for describing objects in subject areasChoosing allowability boundaries for describing objects in subject areas
Choosing allowability boundaries for describing objects in subject areas
Ā 
IRJET- Credit Card Fraud Detection using Isolation Forest
IRJET- Credit Card Fraud Detection using Isolation ForestIRJET- Credit Card Fraud Detection using Isolation Forest
IRJET- Credit Card Fraud Detection using Isolation Forest
Ā 

More from IJMERJOURNAL

Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...
Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...
Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...IJMERJOURNAL
Ā 
Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...
Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...
Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...IJMERJOURNAL
Ā 
A New Two-Dimensional Analytical Model of Small Geometry GaAs MESFET
A New Two-Dimensional Analytical Model of Small Geometry GaAs MESFETA New Two-Dimensional Analytical Model of Small Geometry GaAs MESFET
A New Two-Dimensional Analytical Model of Small Geometry GaAs MESFETIJMERJOURNAL
Ā 
Design a WSN Control System for Filter Backwashing Process
Design a WSN Control System for Filter Backwashing ProcessDesign a WSN Control System for Filter Backwashing Process
Design a WSN Control System for Filter Backwashing ProcessIJMERJOURNAL
Ā 
Application of Customer Relationship Management (Crm) Dimensions: A Critical ...
Application of Customer Relationship Management (Crm) Dimensions: A Critical ...Application of Customer Relationship Management (Crm) Dimensions: A Critical ...
Application of Customer Relationship Management (Crm) Dimensions: A Critical ...IJMERJOURNAL
Ā 
Comparisons of Shallow Foundations in Different Soil Condition
Comparisons of Shallow Foundations in Different Soil ConditionComparisons of Shallow Foundations in Different Soil Condition
Comparisons of Shallow Foundations in Different Soil ConditionIJMERJOURNAL
Ā 
Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...
Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...
Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...IJMERJOURNAL
Ā 
Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...
Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...
Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...IJMERJOURNAL
Ā 
Investigating The Performance of A Steam Power Plant
Investigating The Performance of A Steam Power PlantInvestigating The Performance of A Steam Power Plant
Investigating The Performance of A Steam Power PlantIJMERJOURNAL
Ā 
Study of Time Reduction in Manufacturing of Screws Used in Twin Screw Pump
Study of Time Reduction in Manufacturing of Screws Used in Twin Screw PumpStudy of Time Reduction in Manufacturing of Screws Used in Twin Screw Pump
Study of Time Reduction in Manufacturing of Screws Used in Twin Screw PumpIJMERJOURNAL
Ā 
Mitigation of Voltage Imbalance in A Two Feeder Distribution System Using Iupqc
Mitigation of Voltage Imbalance in A Two Feeder Distribution System Using IupqcMitigation of Voltage Imbalance in A Two Feeder Distribution System Using Iupqc
Mitigation of Voltage Imbalance in A Two Feeder Distribution System Using IupqcIJMERJOURNAL
Ā 
Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...
Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...
Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...IJMERJOURNAL
Ā 
Analytical Solutions of simultaneous Linear Differential Equations in Chemica...
Analytical Solutions of simultaneous Linear Differential Equations in Chemica...Analytical Solutions of simultaneous Linear Differential Equations in Chemica...
Analytical Solutions of simultaneous Linear Differential Equations in Chemica...IJMERJOURNAL
Ā 
Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...
Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...
Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...IJMERJOURNAL
Ā 
Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...
Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...
Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...IJMERJOURNAL
Ā 
An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...
An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...
An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...IJMERJOURNAL
Ā 
Nigerian Economy and the Impact of Alternative Energy.
Nigerian Economy and the Impact of Alternative Energy.Nigerian Economy and the Impact of Alternative Energy.
Nigerian Economy and the Impact of Alternative Energy.IJMERJOURNAL
Ā 
Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...
Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...
Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...IJMERJOURNAL
Ā 
li-fi: the future of wireless communication
li-fi: the future of wireless communicationli-fi: the future of wireless communication
li-fi: the future of wireless communicationIJMERJOURNAL
Ā 

More from IJMERJOURNAL (20)

Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...
Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...
Modeling And Simulation Swash Plate Pump Response Characteristics in Load Sen...
Ā 
Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...
Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...
Generation of Electricity Through A Non-Municipal Solid Waste Heat From An In...
Ā 
A New Two-Dimensional Analytical Model of Small Geometry GaAs MESFET
A New Two-Dimensional Analytical Model of Small Geometry GaAs MESFETA New Two-Dimensional Analytical Model of Small Geometry GaAs MESFET
A New Two-Dimensional Analytical Model of Small Geometry GaAs MESFET
Ā 
Design a WSN Control System for Filter Backwashing Process
Design a WSN Control System for Filter Backwashing ProcessDesign a WSN Control System for Filter Backwashing Process
Design a WSN Control System for Filter Backwashing Process
Ā 
Application of Customer Relationship Management (Crm) Dimensions: A Critical ...
Application of Customer Relationship Management (Crm) Dimensions: A Critical ...Application of Customer Relationship Management (Crm) Dimensions: A Critical ...
Application of Customer Relationship Management (Crm) Dimensions: A Critical ...
Ā 
Comparisons of Shallow Foundations in Different Soil Condition
Comparisons of Shallow Foundations in Different Soil ConditionComparisons of Shallow Foundations in Different Soil Condition
Comparisons of Shallow Foundations in Different Soil Condition
Ā 
Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...
Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...
Place of Power Sector in Public-Private Partnership: A Veritable Tool to Prom...
Ā 
Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...
Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...
Study of Part Feeding System for Optimization in Fms & Force Analysis Using M...
Ā 
Investigating The Performance of A Steam Power Plant
Investigating The Performance of A Steam Power PlantInvestigating The Performance of A Steam Power Plant
Investigating The Performance of A Steam Power Plant
Ā 
Study of Time Reduction in Manufacturing of Screws Used in Twin Screw Pump
Study of Time Reduction in Manufacturing of Screws Used in Twin Screw PumpStudy of Time Reduction in Manufacturing of Screws Used in Twin Screw Pump
Study of Time Reduction in Manufacturing of Screws Used in Twin Screw Pump
Ā 
Mitigation of Voltage Imbalance in A Two Feeder Distribution System Using Iupqc
Mitigation of Voltage Imbalance in A Two Feeder Distribution System Using IupqcMitigation of Voltage Imbalance in A Two Feeder Distribution System Using Iupqc
Mitigation of Voltage Imbalance in A Two Feeder Distribution System Using Iupqc
Ā 
Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...
Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...
Adsorption of Methylene Blue From Aqueous Solution with Vermicompost Produced...
Ā 
Analytical Solutions of simultaneous Linear Differential Equations in Chemica...
Analytical Solutions of simultaneous Linear Differential Equations in Chemica...Analytical Solutions of simultaneous Linear Differential Equations in Chemica...
Analytical Solutions of simultaneous Linear Differential Equations in Chemica...
Ā 
Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...
Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...
Experimental Investigation of the Effect of Injection of OxyHydrogen Gas on t...
Ā 
Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...
Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...
Hybrid Methods of Some Evolutionary Computations AndKalman Filter on Option P...
Ā 
An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...
An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...
An Efficient Methodology To Develop A Secured E-Learning System Using Cloud C...
Ā 
Nigerian Economy and the Impact of Alternative Energy.
Nigerian Economy and the Impact of Alternative Energy.Nigerian Economy and the Impact of Alternative Energy.
Nigerian Economy and the Impact of Alternative Energy.
Ā 
CASE STUDY
CASE STUDYCASE STUDY
CASE STUDY
Ā 
Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...
Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...
Validation of Maintenance Policy of Steel Plant Machine Shop By Analytic Hier...
Ā 
li-fi: the future of wireless communication
li-fi: the future of wireless communicationli-fi: the future of wireless communication
li-fi: the future of wireless communication
Ā 

Recently uploaded

Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
Ā 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
Ā 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
Ā 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
Ā 
Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”
Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”
Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”soniya singh
Ā 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
Ā 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
Ā 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
Ā 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
Ā 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
Ā 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
Ā 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
Ā 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
Ā 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
Ā 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
Ā 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxhumanexperienceaaa
Ā 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
Ā 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
Ā 

Recently uploaded (20)

Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
Ā 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
Ā 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
Ā 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
Ā 
Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”
Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”
Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”
Ā 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Ā 
ā˜… CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
ā˜… CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCRā˜… CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
ā˜… CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
Ā 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Ā 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
Ā 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
Ā 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
Ā 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Ā 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
Ā 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
Ā 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
Ā 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
Ā 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Ā 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
Ā 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
Ā 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
Ā 

Data Analytics on Solar Energy Using Hadoop

  • 1. International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) | IJMER | ISSN: 2249ā€“6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 57 | Data Analytics on Solar Energy Using Hadoop K.Akila1 , B.Manasavani2 , Dr.Gopikrishnan M3 1 Department Of ComputerEngineering Prathyusha Engineering College Thiruvallur, B.E.,India. 2 Department Of ComputerEngineeringPrathyusha EngineeringCollege INTRODUCTION Missing data is the missing form of informationabout phenomena, which is important, and it is the informationin which we are interested. The existence of missing data isone significant problem in data quality. Data quality playsmajor role in machine learning, data mining andknowledge discovery from databases. Machine learningalgorithmshandle missing data in a quite naive way. To avoid biasingin induced hypothesis missing data treatment shouldbe carefullyhandled.Imputationisaprocessthatreplacesthe missing values in instance by some reasonable values.The case substitution is the method developed for dealingwith missingdataininstancesanditishavingsomedrawbackswhen applied to the data mining processes. Themethods,such as substitution of missing values by the attributemean or mode should be cautiously handed to avoid inclusionof bias. 1.1 Randomness of MissingData Missing data randomness is classified [1] in threeclasses. Missing completely at random (MCAR):Missing values are scattered randomly across all instances. Inthistype of randomness, any missing data handling methodcan be applied without risk of introducing bias on the data.It occurs when the probability of an instance having amissing valueforanattributedoesnotdependoneithertheknown values or the missingdata.differences on attributes establish that the two groups donot differsignificantly. Missing at random (MAR):Missing atrandom (MAR) is a condition, which occurs when missing valuesare not randomly distributed across all observations butare randomly distributed within one or more classes (ex.missing more among whites than non-whites, but randomwithin each). The probability of an instance with a missing valuefor an attribute may depend on the known values, and not onthe value of the missing dataitself. Not missing at random (NMAR):Not missingat random is the most challenging form, occurs whenmissing values are not randomly distributed across observations. Itisalso called as non-ignorable missingness. The probabilityof an instance with a missing value for an attributemight depend on the value of thatattribute. 1.2 Handling MissingData Missing data handling methods are categoriesed asfollowsIgnoring data:This method throw-outs all instanceswith missingdata.Therearetwocoremethodstodiscarddata with missing values. The first one is known as completecase analysis.Itisavailableineveryoneofstatisticalpackagesand is the default method in manyprograms. The next method is recognized as discarding instancesor attributes. This method determines the level of missingdata oneachinstanceandattribute,anddeletestheinstancesor attributes with high extents of missing data. ABSRACT: Missing data is one of the major issues in data miningand pattern recognition. The knowledge contains in attributeswith missing data values are important in improvingregression correlationprocessofanorganization.Thelearningprocesson each instance is necessary as it may containsome exceptional knowledge. There are various methods tohandle missing data in regression correlation. Analysis of photovoltaic cell, Sunlight striking on different geographical location to know the defective or connectionless photovoltaic cell plate. we mainly aims To showcasing the energy produced at different geographical location and to find the defective plate. And also analysis the data sets of energy produced along with current weather in particular area to know the status of the photovoltaic plates. In this project, we used Hadoop Map-Reduce framework to analyze the solar energy datasets. Keywords: missing data, genetic algorithm, regression correlation.
  • 2. Data Analytics On Solar Energy Using Hadoop | IJMER | ISSN: 2249ā€“6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 58 | Prior todeletingany attribute, it is vital to evaluate its connotation tothe investigation. The methods, complete case analysisand discarding is executed only if missing data aremissing completely at random. The missing data that are notmissing completely at random contain non-random elements thatmay prejudice theresults. Imputation:In imputation-based proceduresmissing valuesareimputedwithreasonable,probablevaluesrather than being deleted totally. The objective is to useknown associations that can be recognized in the valid rangevaluesofthedatasettofacilitateinestimatingthe missing values Data quality playsmajor role in machine learning, data mining andknowledge discovery from databases. Machine learningalgorithmshandle missing data in a quite naive way. To avoid biasingin induced hypothesis missing data treatment shouldbe carefullyhandled.Imputationisaprocessthatreplacesthe missing values in instance by some reasonable values.In first method limitation of āˆ†T was adjusted, above equation was used for āˆ†T<8o C. Table 1 shows criteria for the decision of Ļ„ value. Ļ„ value of 0.6-0.7 are commonly used for clear sky atmospheric transmittance coefficient value. In this study Ļ„ value of 0.69 was used for clear sky, assumed that the clear sky condition occurred when RH<40% and ambient temperature more than 8o C. Calculation algorithm was built based on decision matrix and the Ļ„ value was locally determined using the training of data set to get minimum error. The second method used in this study is by finding the correlation between RH, clearness index and beam transmittance. The data used to find correlation between beam transmittance and clearness index is measured data from new radiometer set that was installed in 2010 on the rooftop of Block P, UniversitiTeknologiPetronas, which located about 30 km from Ipoh city. About 1 month, 5 minutes time step data of global, beam and diffuse radiation from June to July 2010 was used. Before find the correlation of beam transmittance and clearness index, RH-clearness index correlation was obtained from Ipoh city available data as can be seen in Figure 3. Then beam transmittance-clearness index correlation can be obtained by scatter plot as can be seen in Figure 4. To plot Figure 4 some data were rejected due to obvious error that can be analyzed from measurement results, and the basic concept of terrestrial solar radiation characteristicsTherearetwocoremethodstodiscarddata with missing values. The first one is known as completecase analysis.Itisavailableineveryoneofstatisticalpackagesand is the default method in manyprograms MultipleImputations:ItisamethodbyRubin[1]for making multiple simulated values for eachincomplete information, and then iteratively examining datasets witheach simulated value substituted in every turn. The intentionis,possibly, to generate estimates that better indicatetrue variance and uncertainty in the data than doregression methods. This permits expert staff and software to be usedto create imputed datasets that can be analyzed byrelatively naive users equipped with standard software. It can bevery effective particularly for small to moderate levelsof missingness, where the missing data mechanism isorganized, and for datasets that are to be placed in the publicdomain Not missingat random is the most challenging form, occurs whenmissing values are not randomly distributed across observations. Itisalso called as non-ignorable missingness. The probabilityof an instance with a missing value for an attributemight depend on the value of thatattribute Following constraint were used as data rejection criteria: ā€¢ Reject night data ā€¢ Reject data if clearness index >1 ā€¢ Reject data if beam transmittance >1 ā€¢ Reject data if beam transmittance > clearness index ā€¢ Reject data if clearness index >0.6 and beam transmittance <0.1 ā€¢ Reject data if clearness index <0.2 and beam transmittance>0.15 Correlation between RH and beam transmittance was obtained from above correlation and plotted in Figure 5. Balaras et al studied the relationship between beam transmittance and clearness index in Athens, Greece [10], the results of the study was adopted to carry out second method in this study. Regression results were presented as follows: Therearevariousmethodstoestimatesolarradiation. Satisfactoryresultforhourlysolarradiationestimationwas obtainedbyusingatmospherictransmittancemodel[1]while otherauthorshaveuseddiffusefraction[2]andclearness indexmodels[3].Parametricoratmospherictransmittance modelrequiresdetailsatmosphericcharacteristicinformation [4].Thismodelgiveshigh- accuracyforclearsky/cloudless conditions,whichisleadingsomeauthor tousethis modelto evaluatetheperformanceofanempiricalmodelunder cloudlessconditions[5].Therearenumerousauthors
  • 3. Data Analytics On Solar Energy Using Hadoop | IJMER | ISSN: 2249ā€“6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 59 | proposedthiskindofmodelasmentionedin[6].However, pureparametricmodelwasnotused in thisstudy,since there is no detail atmospheric condition data for the site. Meteorologicalparametersfrequentlyusedaspredictorsof atmosphericparameterssinceacquiringdetailatmospheric conditionsrequireadvancemeasurement.Meteorological parameterssuchassunshineduration,cloudcover,ambient temperature,relativehumidity,andprecipitationdatahave beenusedtoestimateatmospherictransmittancecoefficient inparametricmodel.Thiskindofmodeliscalled meteorological model Therearevariouswaystoestimatesolarradiationon certainareaontheearth.Ambienttemperaturebased estimation is widely used sinceambienttemperaturedataare measuredinmanyweatherstations.Inthisstudy,missing datawereestimatedbasedonambienttemperature measurementandusedmeasuredRHdataasatmospheric transmittance of the determination criteria.thatestimate hourlysolarradiationbasedondevelopedCampbelland Norman method [2], was adapted in this study. Results of the new model then compared with the existing temperature-based solar radiation prediction model as follow: A. H-S model Hargreaves and Samani [14] conducted an initial study on using Tmax and Tmin to estimate solar radiation by the following equation: G T h = K r ( T m a x - T m i n ) 0 . 5 G 0 h ( 9 ) Kris an empirical coefficient, which was recommended tobe 0.16 for interior regions and 0.19 for coastal regions. In this study Kr was locally determined using training data set. B. H-S-A model Annandale et al [15] modified H-S model by introducing correction factor as follow: GT h =Kr (1+2.7 10- 5 Z) (Tm a x -Tm i n )0 . 5 G0 h (10 ) Z is elevation in m and Krwas locally determined. V. STATISTICAL ANALYSIS FOR MODEL VALIDATION Estimation results validated using statistical parameters. Pearson correlation coefficient was calculated as routine correlation indicator. Residual error was calculated using RMSE (Root Mean Square Error) and also presented in NRMSE (Normalized Root Mean Square Error) as follows: R M S E = [ Ę© { Y c - Y 0 } 2 / n ] 0 . 5 ( 1 1 ) N R M S E = R M S E / y m a x - y m i n ( 1 2 ) where, Yc is predicted variable Y0 is measured variable, n is number of data, ymax is maximum measured data ymin is minimum measured data. As an addition, index of agreement was calculated using equation below: d = 1 ā€“ [Ę©(xi-yi)2 /Ę©(Ē€xi-xĢ… iĒ€+Ē€yi-yĢ… iĒ€)]0.5 ( 1 3 ) where, xi is predicted variable yi is measured variable, xĢ… i is averaged predicted variable and yĢ… i is averaged measured variable. VI. RESULTS AND DISCUSSIONS Calculation have been carried out using methods 1 and 2, statistical calculation analysis also has been performed. Table 2 shows statistical analysis results of both methods and results of existing method (H-S and H- S-A method).The most satisfactory results were obtained using method 1. Figure 3 shows scatter plot of predicted and measured data for first method. The minimum RMSE value of 87.6 Watt/m2 was obtained with 0.95 correlation coefficient and 0.97 index of agreement value. Previous method which use precipitation data obtained averaged index of agreement of 0.95, thus the model presented in this study also performed well. Table II. Statistical analysis results Themethodsarebasedonthedecompositionmodelthatiscalculatingeachofsolarradiationcomponents,which dependonatmospherictransmittance.Twomethodstopredictatmospherictransmittancevalueusingavailablemeteorol ogicaldatawereproposed.Inthefirstmethod,adecisionmatrixwasused,whileinthesecondmethod,regressioncorrelati
  • 4. Data Analytics On Solar Energy Using Hadoop | IJMER | ISSN: 2249ā€“6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 60 | onofmeteorologicalparameterswasused.Thecalculationsresultswereevaluatedusingstatisticalparameter.Thoughth eresultshowsbothofthemethodsperformwell,moresatisfactoryresultswereobtainedfromfirstmethodwithRootMean SquareErrorof87.6Watt/m2,NormalizedRootMeanSquareErrorof8.29%,correlationcoefficientof0.95andindexofa greementof0.97.Furthermore,thefirstmethodonlyneedsambienttemperatureandrelativehumiditydatathatcommonl ymeasuredinmeteorologicalstations.graphical comparison of measured and predicted solar radiation for random dates. As can be seen most of the error occurred in low radiation or cloudy sky which often occurred in rainy season (Fig. 7-b). Then after the results has been validated using available data, the predicted results in the missing data days can be occupied in the measured data set. Now the time series composite data of solar radiation is completed and can be used for other purpose such as prediction of PV system performance for respected location. Missing data is one of the major issues in data miningand pattern recognition. The knowledge contains in attributeswith missing data values are important in improvingdecision- makingprocessofanorganization.Thelearningprocesson each instance is necessary as it may containsome exceptional knowledge. There are various methods tohandle missing data in decision tree learning. Analysis of photovoltaic cell, Sunlight striking on different geographical location to know the defective or connectionless photovoltaic cell plate. we mainly aims To showcasing the energy produced at different geographical location and to find the defective plate. And also analysis the data sets of energy produced along with current weather in particular area to know the status of the photovoltaic plates. In this project, we used Hadoop Map-Reduce framework to analyze the solar energy datasets. Not missingat random is the most challenging form, occurs whenmissing values are not randomly distributed across observations. Itisalso called as non-ignorable missingness. The probabilityof an instance with a missing value for an attributemight depend on the value of thatattribute (b) Rainy season Figure 7: Graphical comparison between measured and predicted solarradiation for random dates (Method 1) Figure 8 shows prediction results in the days which solar radiation measurement were absent. In our case the amount of missing solar radiation data is 21 full days and 2 half days. Data quality playsmajor role in machine learning, data mining andknowledge discovery from databases. Machine learningalgorithmshandle missing data in a quite naive way. To avoid biasingin induced hypothesis missing data treatment shouldbe carefullyhandled.Imputationisaprocessthatreplacesthe missing values in instance by some reasonable values.The case substitution is the method developed for dealingwith missingdataininstancesanditishavingsomedrawbackswhen applied to the data mining processesThe method developed in this study may be applied for any location on the earth with notes, for first method the assignment criteria of atmospheric transmittance using RH and ambient temperature should be adjusted to the available solar radiation data of the area to get minimum error. To generate general criteria of atmospheric transmittance assignment using RH and ambient temperature further research is required with sufficient large amount of data for various areas. For the second method the correlation should be rebuild based on available measurement nearest from the location to give satisfactory estimation results.
  • 5. Data Analytics On Solar Energy Using Hadoop | IJMER | ISSN: 2249ā€“6645 | www.ijmer.com | Vol. 7 | Iss. 2 | Feb. 2017 | 61 | VII. CONCLUSIONS This paper represents to estimate solar radiation from available weather data. The methods presented in this paper are based on atmospheric transmittance determination using available meteorological data. In this method regression correlation of meteorological parameter was used.Thekeyfortheaccuracyofabovemethodisinthe determinationofbeamatmospherictransmittance(Ļ„).Beam atmospherictransmittanceisthepercentageofthebeam (direct)radiationthatwillpenetratetheatmospherewithout beingscattered.KurtandSpokasusedprecipitationdata to built decisionmatrix ofatmospheric transmittance .temperature data. The second method is by using RH-clearness index, clearness index-beam atmospheric transmission and beam atmospheric transmission-RH correlation. The result shows that both methods perform well. Method 1 provided better results with minimum correlation coefficient of 0.95, RMSE of 87.6 Watt/m2 , NRSME of 8.29% and index of agreement of 0.97. The prediction was intended to fill missing data in solar radiation data set to get complete time series data. However, in this study only one year of one area data have been used. Validation using sufficient large amount of data is required for wider application of the method. REFERENCES [1]. G. S. Campbell and J. M. Norman, ā€œIntroduction to Environmental Biophysics. 2nd ed. New York: Springer-Verlag. Pp. 167ā€“183, 1998 [2]. Guofeng Wu, et al. ā€œMethods and strategy for modeling daily global solar radiation with measured meteorological data ā€“ A case study in Nanchang station, Chinaā€, Energy Conversion and Management 48, 2447-2452, 2007 [3]. Yang K, Koike T. Estimating surface solar radiation from upper-airhumidity. Solar Energy, 72(2):177ā€“ 86, 2002 [4]. N. Mohan Kumar et al, ā€œAn empirical model for estimating hourly solar radiation over the Indian seas during summer monsoon seasonā€, Indian Journal of Marine Sciences, Vol. 30, pp 123-131, 2001 [5]. Reindl DT, Beckman WA, Duffie JA, ā€œDiffuse fraction correlationsā€, Solar Energy 1990; 45:1-7 [6]. F.J. Batlles et al, ā€œEmpirical modeling of hourly direct irradiance by means of hourly global irradianceā€, Energy 25: 675-688, 2000 [7]. Kurt Spokas and Frank Forcella, ā€œEstimating hourly incoming solar radiation from limited meteorological dataā€, Weed Science, 54:182ā€“ 189, 2006 [8]. Hunt LA, Kuchar L, Swanton CJ. Estimation of solar radiation for use in crop modelling. Agric. Forest Meteorol. 91(3ā€“4):293ā€“300, 1998 [9]. Louche et al, ā€œCorrelations for direct normal and global horizontal irradiation on French Mediterranean siteā€, Solar Energy 46: 261-266, 1991 as cited by [8] [10]. Athens, Greece. et al., ā€œOn the relationship of beam transmittance on clearness index for Athens, Greeceā€, Int. J. Solar Energy, Vol. 7, pp 171-179, 1989