SlideShare a Scribd company logo
1 of 8
• Environmentally friendly practices that can complement
technology-focused perspectives aiming to design more
energy efficient IT-based systems.
• Data science—data mining, can be described as
knowledge discovery from data or in terms of different
activities as collecting, cleaning, processing, analysing
and gaining useful insights from data [2].
• Principles derived by analysing the CRISP-DM data
mining process and literature on green IT and data
mining.
• First step, is to identify factors determining energy
consumption.
• Second step, identify individual steps of the CRISP-DM
process by investigating possibilities for reduction of each
factor.
• There are multiple data mining processes most of which share
common phases.
• CRISP-DM is arguably the most widely known and practiced
model attending to business and data understanding, data
preparation, modelling, evaluation and deployment.
• The business understanding phase clarifies project objectives
and business requirements, which are then translated into a
data mining problem.
• There are unsupervised data mining problems including
association pattern mining and clustering as well as
supervised approaches like classification.
• Generally, there is no clear consensus about which model is
best for a task. Consequently, some form of trial and error can
often not be avoided.
• To maximize the outcome of time invested into making
data mining more environmentally friendly, the focus
should be on the most energy consuming factors.
• Project objectives such as predictive accuracy or
required confidence in the analysis are very likely to have
a profound impact on energy consumption, since they
often indirectly influence the choice of computational
methods and data.
• Data preparation does often only require simple
techniques, but it might be dominating in terms of energy
consumption if complex computationally expensive
methods are needed to extract features from the data
that are used in later phases of the process.
• A data scientist might control make-or-buy decisions.
• From an environmental perspective, outsourcing can be
preferable if the contractor is more energy-efficient in
extracting the demanded information, for instance,
because of their prior experience and specialization,
more energy efficient infrastructure, or even possession
of relevant data.
• On a global scale, outsourcing of data analysis has the
potential to involve less computation and to save energy.
• Progress in the field of data science also relies on
publicly available data, models, and development
frameworks. Initiatives to make data available by
research institutions and by governments help create
entire ecosystems .
• The use of renewable (“green”) energy such as solar or
wind should be maximized. Conceptually, the idea is to
align computation with the availability of green energy.
• A system must predict the availability of green energy as
well as brown energy and derive a schedule to maximize
green energy use and to avoid using brown power at
peak demand times.

More Related Content

Similar to Principles of green data mining

Data Science for Building Energy Management a reviewMigue.docx
Data Science for Building Energy Management a reviewMigue.docxData Science for Building Energy Management a reviewMigue.docx
Data Science for Building Energy Management a reviewMigue.docx
randyburney60861
 
EDRG12_Re.doc
EDRG12_Re.docEDRG12_Re.doc
EDRG12_Re.doc
butest
 
EDRG12_Re.doc
EDRG12_Re.docEDRG12_Re.doc
EDRG12_Re.doc
butest
 
Green it at university of bahrain
Green it at university of bahrainGreen it at university of bahrain
Green it at university of bahrain
Greenology
 
bigdatalittledataspe-pd2aoct2012denesuk-140321031823-phpapp02
bigdatalittledataspe-pd2aoct2012denesuk-140321031823-phpapp02bigdatalittledataspe-pd2aoct2012denesuk-140321031823-phpapp02
bigdatalittledataspe-pd2aoct2012denesuk-140321031823-phpapp02
denesuk
 

Similar to Principles of green data mining (20)

Green it and audit
Green it and auditGreen it and audit
Green it and audit
 
Go Green to Save Green – Embracing Green Energy Practices
Go Green to Save Green – Embracing Green Energy PracticesGo Green to Save Green – Embracing Green Energy Practices
Go Green to Save Green – Embracing Green Energy Practices
 
Data Science for Building Energy Management a reviewMigue.docx
Data Science for Building Energy Management a reviewMigue.docxData Science for Building Energy Management a reviewMigue.docx
Data Science for Building Energy Management a reviewMigue.docx
 
EDRG12_Re.doc
EDRG12_Re.docEDRG12_Re.doc
EDRG12_Re.doc
 
EDRG12_Re.doc
EDRG12_Re.docEDRG12_Re.doc
EDRG12_Re.doc
 
MISO Info forum 072517
MISO Info forum 072517MISO Info forum 072517
MISO Info forum 072517
 
Translation techniques lav
Translation techniques lavTranslation techniques lav
Translation techniques lav
 
Datanomics: the value of research data.
 Datanomics: the value of research data.  Datanomics: the value of research data.
Datanomics: the value of research data.
 
Green ICT.pptx
Green ICT.pptxGreen ICT.pptx
Green ICT.pptx
 
Energy Efficient Technologies for Virtualized Cloud Data Center: A Systematic...
Energy Efficient Technologies for Virtualized Cloud Data Center: A Systematic...Energy Efficient Technologies for Virtualized Cloud Data Center: A Systematic...
Energy Efficient Technologies for Virtualized Cloud Data Center: A Systematic...
 
Green it at university of bahrain
Green it at university of bahrainGreen it at university of bahrain
Green it at university of bahrain
 
Big data, little data, whatever
Big data, little data, whateverBig data, little data, whatever
Big data, little data, whatever
 
bigdatalittledataspe-pd2aoct2012denesuk-140321031823-phpapp02
bigdatalittledataspe-pd2aoct2012denesuk-140321031823-phpapp02bigdatalittledataspe-pd2aoct2012denesuk-140321031823-phpapp02
bigdatalittledataspe-pd2aoct2012denesuk-140321031823-phpapp02
 
IAOS 2018 - Enhanced recommendations on step-by-step procedure and approach t...
IAOS 2018 - Enhanced recommendations on step-by-step procedure and approach t...IAOS 2018 - Enhanced recommendations on step-by-step procedure and approach t...
IAOS 2018 - Enhanced recommendations on step-by-step procedure and approach t...
 
Ch~2.pdf
Ch~2.pdfCh~2.pdf
Ch~2.pdf
 
Guidelines for OSTP Data Access Plans
Guidelines for OSTP Data Access PlansGuidelines for OSTP Data Access Plans
Guidelines for OSTP Data Access Plans
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run Time
 
Cleaner Production Management
Cleaner Production Management Cleaner Production Management
Cleaner Production Management
 
Energy resource management
Energy resource managementEnergy resource management
Energy resource management
 
Environmental Sound Technology Assessment
Environmental Sound Technology AssessmentEnvironmental Sound Technology Assessment
Environmental Sound Technology Assessment
 

Recently uploaded

Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Christo Ananth
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
rknatarajan
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Christo Ananth
 

Recently uploaded (20)

Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGMANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsRussian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 

Principles of green data mining

  • 1.
  • 2. • Environmentally friendly practices that can complement technology-focused perspectives aiming to design more energy efficient IT-based systems. • Data science—data mining, can be described as knowledge discovery from data or in terms of different activities as collecting, cleaning, processing, analysing and gaining useful insights from data [2].
  • 3. • Principles derived by analysing the CRISP-DM data mining process and literature on green IT and data mining. • First step, is to identify factors determining energy consumption. • Second step, identify individual steps of the CRISP-DM process by investigating possibilities for reduction of each factor.
  • 4. • There are multiple data mining processes most of which share common phases. • CRISP-DM is arguably the most widely known and practiced model attending to business and data understanding, data preparation, modelling, evaluation and deployment. • The business understanding phase clarifies project objectives and business requirements, which are then translated into a data mining problem. • There are unsupervised data mining problems including association pattern mining and clustering as well as supervised approaches like classification. • Generally, there is no clear consensus about which model is best for a task. Consequently, some form of trial and error can often not be avoided.
  • 5. • To maximize the outcome of time invested into making data mining more environmentally friendly, the focus should be on the most energy consuming factors. • Project objectives such as predictive accuracy or required confidence in the analysis are very likely to have a profound impact on energy consumption, since they often indirectly influence the choice of computational methods and data. • Data preparation does often only require simple techniques, but it might be dominating in terms of energy consumption if complex computationally expensive methods are needed to extract features from the data that are used in later phases of the process.
  • 6.
  • 7. • A data scientist might control make-or-buy decisions. • From an environmental perspective, outsourcing can be preferable if the contractor is more energy-efficient in extracting the demanded information, for instance, because of their prior experience and specialization, more energy efficient infrastructure, or even possession of relevant data. • On a global scale, outsourcing of data analysis has the potential to involve less computation and to save energy. • Progress in the field of data science also relies on publicly available data, models, and development frameworks. Initiatives to make data available by research institutions and by governments help create entire ecosystems .
  • 8. • The use of renewable (“green”) energy such as solar or wind should be maximized. Conceptually, the idea is to align computation with the availability of green energy. • A system must predict the availability of green energy as well as brown energy and derive a schedule to maximize green energy use and to avoid using brown power at peak demand times.