This paper highlights the significance of AI-powered maintenance strategies in modern industry for operational optimization and reduced downtime. It emphasizes the crucial role of sensor data analysis in identifying anomalies and predicting failures. The research specifically examines sensor data from an automotive press shop, addressing questions related to data selection, collection challenges, and knowledge generation. By utilizing unsupervised learning on compressed air data from a press line, the study identifies patterns, anomalies, and correlations. The results offer insights into the potential for implementing an effective predictive maintenance strategy. Additionally, a systematic literature review underscores the importance of data analysis in production systems, particularly in the context of maintenance.
The document presents a systematic literature review of research on big data analytics in supply chain management from 2011 to 2021. It analyzes the literature from both organizational and technical perspectives. From an organizational perspective, it examines the theoretical foundations and models used to explain how big data analytics improves sustainability and performance. From a technical perspective, it analyzes the types of analytics, techniques, algorithms, and features developed for supply chain functions. The review identifies gaps in the research and provides directions for future work.
Selection of Articles using Data Analytics for Behavioral Dissertation Resear...PhD Assistance
Outcomes in health-related issues including psychological, educational, Behavioral, environmental, and social are intended to sustain positive change by digital interferences. These changes may be delivered using any digital device like a phone or computer, and make them gainful for the provider. Complex and large-scale datasets that contain usage data can be yielded by testing a digital intervention. This data provides invaluable detail about how the users interact with these interventions and notify their knowledge of engagement, if they are analyzed properly. This paper recommends an innovative framework for the process of analyzing usage associated with a digital intervention .
PhD Assistance is an Academic The Best Dissertation Writing Service & Consulting Support Company established in 2001. specialiWeze in providing PhD Assignments, PhD Dissertation Writing Help , Statistical Analyses, and Programming Services to students in the USA, UK, Canada, UAE, Australia, New Zealand, Singapore and many more.
Website Visit: https://bit.ly/3dANXUD
Contact Us:
UK NO: +44-1143520021
India No: +91-8754446690
Email: info@phdassistance.com
A STUDY OF TRADITIONAL DATA ANALYSIS AND SENSOR DATA ANALYTICSijistjournal
The growth of smart and intelligent devices known as sensors generate large amount of data. These generated data over a time span takes such a large volume which is designated as big data. The data structure of repository holds unstructured data. The traditional data analytics methods well developed and used widely to analyze structured data and to limit extend the semi-structured data which involves additional processing over heads. The similar methods used to analyze unstructured data are different because of distributed computing approach where as there is a possibility of centralized processing in case of structured and semi-structured data. The under taken work is confined to analysis of both verities of methods. The result of this study is targeted to introduce methods available to analyze big data.
The document discusses tools and techniques for big data analytics, including A/B testing, crowdsourcing, machine learning, and data mining. It provides an overview of the big data analysis pipeline, including data acquisition, information extraction, integration and representation, query processing and analysis, and interpretation. The document also discusses fields where big data is relevant like industry, healthcare, and research. It analyzes tools like A/B testing, machine learning, and data mining techniques in more detail.
Remarks09222018 - Bold section headings enhance the works o.docxcarlt4
Remarks
09/22/2018 - Bold section headings enhance the work's overall organization. Recurring Professional Communication (Articulation) concerns are evident with parts of speech, varying issues such as verb form(s), subject-verb agreement, and sentence fluency, varying issues such as run-on sentences, comma splices. These writing concerns diminish the clarity of the response.
10/7/18 - Financial institutions' use of data is adequately presented as a real-world business situation, yet the summary is lacking details particularly how the business situation can be addressed by data collection and analysis. One specific question or scenario for data analysis could not be located.
For instruction on describing the relevant data collected, please revisit the section titled “The Case for Quantitative Analysis” in the study plan for this course by clicking on the link located in the top left in this rubric item’s name,“B1. Summary of Data.”
9/22/18 - The submission includes an adequate summary of a real-world business situation regarding financial institutions. The provided detail is insufficient as it is unclear how this business situation can be addressed by collecting and analyzing a set of data, and more specifically a research question is not clearly identified.
10/7/18 - The submission provides financial data as relevant data for the analysis. It is unclear whether the described data relates to a relevant business situation. Please review this response for alignment to the business situation once aspect A has been revised
9/22/18 - The submission provides a limited discussion on financial data as relevant data for the analysis. It is unclear whether the described data relates to a relevant business situation. Please review this response for alignment to the business situation once aspect A has been revised.
10/7/18 - The work provides graphics of processes and generalized financial information as a graphical display of the data collected. It is not clear whether this graphical display is appropriate,as the relevancy of the data collected to the business situation could not be verified. Please review the response to this aspect after revising the data collected in aspect B1.
9/22/18 - Two bar graphs are provided as a graphical display. It is unclear whether the provided graphical display is appropriate, as the description of the data collected in aspect B1 is insufficient.
10/7/18 - The submission describes several common techniques in the finance realm as the data analysis techniques. It is not clear whether this is an appropriate technique to be used to analyze the collected data. Please review this response for alignment to the data to be collected once aspect B1 has been revised
9/22/18 - Numerous possible techniques are described. However, it is unclear whether any of these techniques is appropriate for data that is relevant to the business situation.
10/7/18 - The submission provides several examples of how data could be used in fina.
Data Mining of Project Management Data: An Analysis of Applied Research Studies.Gurdal Ertek
Data collected and generated through and posterior to projects, such as data residing in project management software and post project review documents, can be a major source of actionable insights and competitive advantage. This paper presents a rigorous
methodological analysis of the applied research published in academic literature, on the application of data mining (DM) for project management (PM). The objective of the paper is to provide a comprehensive analysis and discussion of where and how data mining is applied for project management data and to provide practical insights for future research in the field.
https://dl.acm.org/citation.cfm?id=3176714
https://ertekprojects.com/ftp/papers/2017/ertek_et_al_2017_Data_Mining_of_Project_Management_Data.pdf
Novel holistic architecture for analytical operation on sensory data relayed...IJECEIAES
With increasing adoption of the sensor-based application, there is an exponential rise of the sensory data that eventually take the shape of the big data. However, the practicality of executing high end analytical operation over the resource-constrained big data has never being studied closely. After reviewing existing approaches, it is explored that there is no cost-effective schemes of big data analytics over large scale sensory data processiing that can be directly used as a service. Therefore, the propsoed system introduces a holistic architecture where streamed data after performing extraction of knowedge can be offered in the form of services. Implemented in MATLAB, the proposed study uses a very simplistic approach considering energy constrained of the sensor nodes to find that proposed system offers better accuracy, reduced mining duration (i.e. faster response time), and reduced memory dependencies to prove that it offers cost effective analytical solution in contrast to existing system.
Corporate bankruptcy prediction using Deep learning techniquesShantanu Deshpande
This document proposes using deep learning techniques like LSTM neural networks to predict corporate bankruptcy by integrating both financial ratio data and textual disclosures from annual reports. It notes that previous studies have largely relied on statistical models or used only financial data with machine learning. The researcher aims to determine if adding textual data to an LSTM model improves prediction performance over a CNN model using only financial ratios. The document outlines the research question, objectives, and provides an overview of previous bankruptcy prediction studies using statistical, machine learning and deep learning methods.
The document presents a systematic literature review of research on big data analytics in supply chain management from 2011 to 2021. It analyzes the literature from both organizational and technical perspectives. From an organizational perspective, it examines the theoretical foundations and models used to explain how big data analytics improves sustainability and performance. From a technical perspective, it analyzes the types of analytics, techniques, algorithms, and features developed for supply chain functions. The review identifies gaps in the research and provides directions for future work.
Selection of Articles using Data Analytics for Behavioral Dissertation Resear...PhD Assistance
Outcomes in health-related issues including psychological, educational, Behavioral, environmental, and social are intended to sustain positive change by digital interferences. These changes may be delivered using any digital device like a phone or computer, and make them gainful for the provider. Complex and large-scale datasets that contain usage data can be yielded by testing a digital intervention. This data provides invaluable detail about how the users interact with these interventions and notify their knowledge of engagement, if they are analyzed properly. This paper recommends an innovative framework for the process of analyzing usage associated with a digital intervention .
PhD Assistance is an Academic The Best Dissertation Writing Service & Consulting Support Company established in 2001. specialiWeze in providing PhD Assignments, PhD Dissertation Writing Help , Statistical Analyses, and Programming Services to students in the USA, UK, Canada, UAE, Australia, New Zealand, Singapore and many more.
Website Visit: https://bit.ly/3dANXUD
Contact Us:
UK NO: +44-1143520021
India No: +91-8754446690
Email: info@phdassistance.com
A STUDY OF TRADITIONAL DATA ANALYSIS AND SENSOR DATA ANALYTICSijistjournal
The growth of smart and intelligent devices known as sensors generate large amount of data. These generated data over a time span takes such a large volume which is designated as big data. The data structure of repository holds unstructured data. The traditional data analytics methods well developed and used widely to analyze structured data and to limit extend the semi-structured data which involves additional processing over heads. The similar methods used to analyze unstructured data are different because of distributed computing approach where as there is a possibility of centralized processing in case of structured and semi-structured data. The under taken work is confined to analysis of both verities of methods. The result of this study is targeted to introduce methods available to analyze big data.
The document discusses tools and techniques for big data analytics, including A/B testing, crowdsourcing, machine learning, and data mining. It provides an overview of the big data analysis pipeline, including data acquisition, information extraction, integration and representation, query processing and analysis, and interpretation. The document also discusses fields where big data is relevant like industry, healthcare, and research. It analyzes tools like A/B testing, machine learning, and data mining techniques in more detail.
Remarks09222018 - Bold section headings enhance the works o.docxcarlt4
Remarks
09/22/2018 - Bold section headings enhance the work's overall organization. Recurring Professional Communication (Articulation) concerns are evident with parts of speech, varying issues such as verb form(s), subject-verb agreement, and sentence fluency, varying issues such as run-on sentences, comma splices. These writing concerns diminish the clarity of the response.
10/7/18 - Financial institutions' use of data is adequately presented as a real-world business situation, yet the summary is lacking details particularly how the business situation can be addressed by data collection and analysis. One specific question or scenario for data analysis could not be located.
For instruction on describing the relevant data collected, please revisit the section titled “The Case for Quantitative Analysis” in the study plan for this course by clicking on the link located in the top left in this rubric item’s name,“B1. Summary of Data.”
9/22/18 - The submission includes an adequate summary of a real-world business situation regarding financial institutions. The provided detail is insufficient as it is unclear how this business situation can be addressed by collecting and analyzing a set of data, and more specifically a research question is not clearly identified.
10/7/18 - The submission provides financial data as relevant data for the analysis. It is unclear whether the described data relates to a relevant business situation. Please review this response for alignment to the business situation once aspect A has been revised
9/22/18 - The submission provides a limited discussion on financial data as relevant data for the analysis. It is unclear whether the described data relates to a relevant business situation. Please review this response for alignment to the business situation once aspect A has been revised.
10/7/18 - The work provides graphics of processes and generalized financial information as a graphical display of the data collected. It is not clear whether this graphical display is appropriate,as the relevancy of the data collected to the business situation could not be verified. Please review the response to this aspect after revising the data collected in aspect B1.
9/22/18 - Two bar graphs are provided as a graphical display. It is unclear whether the provided graphical display is appropriate, as the description of the data collected in aspect B1 is insufficient.
10/7/18 - The submission describes several common techniques in the finance realm as the data analysis techniques. It is not clear whether this is an appropriate technique to be used to analyze the collected data. Please review this response for alignment to the data to be collected once aspect B1 has been revised
9/22/18 - Numerous possible techniques are described. However, it is unclear whether any of these techniques is appropriate for data that is relevant to the business situation.
10/7/18 - The submission provides several examples of how data could be used in fina.
Data Mining of Project Management Data: An Analysis of Applied Research Studies.Gurdal Ertek
Data collected and generated through and posterior to projects, such as data residing in project management software and post project review documents, can be a major source of actionable insights and competitive advantage. This paper presents a rigorous
methodological analysis of the applied research published in academic literature, on the application of data mining (DM) for project management (PM). The objective of the paper is to provide a comprehensive analysis and discussion of where and how data mining is applied for project management data and to provide practical insights for future research in the field.
https://dl.acm.org/citation.cfm?id=3176714
https://ertekprojects.com/ftp/papers/2017/ertek_et_al_2017_Data_Mining_of_Project_Management_Data.pdf
Novel holistic architecture for analytical operation on sensory data relayed...IJECEIAES
With increasing adoption of the sensor-based application, there is an exponential rise of the sensory data that eventually take the shape of the big data. However, the practicality of executing high end analytical operation over the resource-constrained big data has never being studied closely. After reviewing existing approaches, it is explored that there is no cost-effective schemes of big data analytics over large scale sensory data processiing that can be directly used as a service. Therefore, the propsoed system introduces a holistic architecture where streamed data after performing extraction of knowedge can be offered in the form of services. Implemented in MATLAB, the proposed study uses a very simplistic approach considering energy constrained of the sensor nodes to find that proposed system offers better accuracy, reduced mining duration (i.e. faster response time), and reduced memory dependencies to prove that it offers cost effective analytical solution in contrast to existing system.
Corporate bankruptcy prediction using Deep learning techniquesShantanu Deshpande
This document proposes using deep learning techniques like LSTM neural networks to predict corporate bankruptcy by integrating both financial ratio data and textual disclosures from annual reports. It notes that previous studies have largely relied on statistical models or used only financial data with machine learning. The researcher aims to determine if adding textual data to an LSTM model improves prediction performance over a CNN model using only financial ratios. The document outlines the research question, objectives, and provides an overview of previous bankruptcy prediction studies using statistical, machine learning and deep learning methods.
Learning Resources Week 2
Frankfort-Nachmias, C., & Leon-Guerrero, A. (2020). Social statistics for a diverse society (9th ed.). Sage Publications.
· Chapter 2, “The Organization and Graphic Presentation Data” (pp. 27-74)
Wagner, III, W. E. (2020). Using IBM® SPSS® statistics for research methods and social science statistics (7th ed.). Thousand Oaks, CA: Sage Publications.
· Chapter 5, “Charts and Graphs”
· Chapter 11, “Editing Output”
Walden University Writing Center. (n.d.). General guidance on data displays. Retrieved from http://waldenwritingcenter.blogspot.com/2013/02/general-guidance-on-data-displays.html
Use this website to guide you as you provide appropriate APA formatting and citations for data displays.
Laureate Education (Producer). (2016j). Visual displays of data [Video file]. Baltimore, MD: Author.
Note: The approximate length of this media piece is 9 minutes.
In this media program, Dr. Matt Jones discusses frequency distributions. Focus on how his explanation might support your analysis in this week’s Assignment. (video attached separately)
sustainability
Case Report
Integrated Understanding of Big Data, Big Data
Analysis, and Business Intelligence: A Case Study
of Logistics
Dong-Hui Jin and Hyun-Jung Kim *
Seoul Business School, aSSIST, 46 Ewhayeodae 2-gil, Seodaemun-gu, Seoul 03767, Korea; [email protected]
* Correspondence: [email protected]; Tel.: +82-70-7012-2722
Received: 5 October 2018; Accepted: 17 October 2018; Published: 19 October 2018
����������
�������
Abstract: Efficient decision making based on business intelligence (BI) is essential to ensure
competitiveness for sustainable growth. The rapid development of information and communication
technology has made collection and analysis of big data essential, resulting in a considerable increase
in academic studies on big data and big data analysis (BDA). However, many of these studies are
not linked to BI, as companies do not understand and utilize the concepts in an integrated way.
Therefore, the purpose of this study is twofold. First, we review the literature on BI, big data,
and BDA to show that they are not separate methods but an integrated decision support system.
Second, we explore how businesses use big data and BDA practically in conjunction with BI through
a case study of sorting and logistics processing of a typical courier enterprise. We focus on the
company’s cost efficiency as regards to data collection, data analysis/simulation, and the results from
actual application. Our findings may enable companies to achieve management efficiency by utilizing
big data through efficient BI without investing in additional infrastructure. It could also give them
indirect experience, thereby reducing trial and error in order to maintain or increase competitiveness.
Keywords: business application; big data; big data analysis; business intelligence; logistics;
courier service
1. Introduction
A growing number of corporations depend on various and cont.
Learning Resources Week 2
Frankfort-Nachmias, C., & Leon-Guerrero, A. (2020). Social statistics for a diverse society (9th ed.). Sage Publications.
· Chapter 2, “The Organization and Graphic Presentation Data” (pp. 27-74)
Wagner, III, W. E. (2020). Using IBM® SPSS® statistics for research methods and social science statistics (7th ed.). Thousand Oaks, CA: Sage Publications.
· Chapter 5, “Charts and Graphs”
· Chapter 11, “Editing Output”
Walden University Writing Center. (n.d.). General guidance on data displays. Retrieved from http://waldenwritingcenter.blogspot.com/2013/02/general-guidance-on-data-displays.html
Use this website to guide you as you provide appropriate APA formatting and citations for data displays.
Laureate Education (Producer). (2016j). Visual displays of data [Video file]. Baltimore, MD: Author.
Note: The approximate length of this media piece is 9 minutes.
In this media program, Dr. Matt Jones discusses frequency distributions. Focus on how his explanation might support your analysis in this week’s Assignment. (video attached separately)
sustainability
Case Report
Integrated Understanding of Big Data, Big Data
Analysis, and Business Intelligence: A Case Study
of Logistics
Dong-Hui Jin and Hyun-Jung Kim *
Seoul Business School, aSSIST, 46 Ewhayeodae 2-gil, Seodaemun-gu, Seoul 03767, Korea; [email protected]
* Correspondence: [email protected]; Tel.: +82-70-7012-2722
Received: 5 October 2018; Accepted: 17 October 2018; Published: 19 October 2018
����������
�������
Abstract: Efficient decision making based on business intelligence (BI) is essential to ensure
competitiveness for sustainable growth. The rapid development of information and communication
technology has made collection and analysis of big data essential, resulting in a considerable increase
in academic studies on big data and big data analysis (BDA). However, many of these studies are
not linked to BI, as companies do not understand and utilize the concepts in an integrated way.
Therefore, the purpose of this study is twofold. First, we review the literature on BI, big data,
and BDA to show that they are not separate methods but an integrated decision support system.
Second, we explore how businesses use big data and BDA practically in conjunction with BI through
a case study of sorting and logistics processing of a typical courier enterprise. We focus on the
company’s cost efficiency as regards to data collection, data analysis/simulation, and the results from
actual application. Our findings may enable companies to achieve management efficiency by utilizing
big data through efficient BI without investing in additional infrastructure. It could also give them
indirect experience, thereby reducing trial and error in order to maintain or increase competitiveness.
Keywords: business application; big data; big data analysis; business intelligence; logistics;
courier service
1. Introduction
A growing number of corporations depend on various and cont.
Data Science for Building Energy Management a reviewMigue.docxrandyburney60861
Data Science for Building Energy Management: a review
Miguel Molina-Solanaa,b, Maŕıa Rosa,∗, M. Dolores Ruiza, Juan Gómez-Romeroa, M.J. Martin-Bautistaa
aDepartment of Computer Science and Artificial Intelligence, Universidad de Granada
bData Science Institute, Imperial College London
Abstract
The energy consumption of residential and commercial buildings has risen steadily in recent years, an
increase largely due to their HVAC systems. Expected energy loads, transportation, and storage as well
as user behavior influence the quantity and quality of the energy consumed daily in buildings. However,
technology is now available that can accurately monitor, collect, and store the huge amount of data involved
in this process. Furthermore, this technology is capable of analyzing and exploiting such data in meaningful
ways. Not surprisingly, the use of data science techniques to increase energy efficiency is currently attracting
a great deal of attention and interest. This paper reviews how Data Science has been applied to address the
most difficult problems faced by practitioners in the field of Energy Management, especially in the building
sector. The work also discusses the challenges and opportunities that will arise with the advent of fully
connected devices and new computational technologies.
1. Introduction
There is a general consensus in the world today that human activities are having a negative impact
on the environment and have accelerated both global warming and climate change. These environmental
threats have been intensified by the emissions produced by the energy required for the lighting and HVAC
(heating, ventilation and air-conditioning) systems in building constructions. According to the International
Energy Agency (IEA), residential and commercial buildings are responsible for up to 32% of the total final
energy consumption. In fact, in most IEA countries, they account for approximately 40% of the primary
energy consumption. Similar statistics are given by the World Business Council for Sustainable Development
(WBCSD) within the framework of its Energy Efficiency in Buildings (EEB) project1. Also provided is a
comprehensive review [1] of the state of the art in building energy use (with a primary focus on energy
demand).
These data indicate that inefficient energy management in aging buildings combined with rising construc-
tion activity in developed countries will cause energy consumption to soar in the near future and heighten the
negative impacts associated with this consumption. Moreover, variable energy costs call for the implemen-
tation of more intelligent strategies to adapt and reduce energy consumption as well as to find alternative
and sustainable energy sources. The relevance of these issues is clearly reflected in the research priorities of
the European Union, as stated in its Horizon2020 Societal Challenge “Secure, Clean and Efficient Energy”.
This work program targets a significant reduction in energy consu.
IRJET- Fault Detection and Prediction of Failure using Vibration AnalysisIRJET Journal
This document discusses fault detection and prediction of failures in rotating equipment using vibration analysis. It begins by introducing vibration analysis as a method to monitor machines and detect faults in rotating components that may cause failures. It then discusses how motor vibration is measured and analyzed using techniques like spectrum analysis to identify faults like unbalance, bearing issues, or broken rotor bars. The document proposes decomposing vibration signals using intrinsic mode functions and calculating the Gabor representation's frequency marginal to identify fault types using classifiers like support vector machines or random forests. It provides context on data mining techniques relevant to this type of fault prediction problem.
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...ijaia
In the era of the fourth industrial revolution, measuring and ensuring the reliability, efficiency and safety of the industrial systems and components are one of the uppermost key concern. In addition, predicting performance degradation or remaining useful life (RUL) of an equipment over time based on its historical sensor data enables companies to greatly reduce their maintenance cost. In this way, companies can prevent costly unexpected breakdown and become more profitable and competitive in the marketplace. This paper introduces a deep learning-based method by combining CNN(Convolutional Neural Networks) and LSTM (Long Short-Term Memory)neural networks to predict RUL for industrial equipment. The proposed method does not depend upon any degradation trend assumptions and it can learn complex temporal representative and distinguishing patterns in the sensor data. In order to evaluate the efficiency and effectiveness of the proposed method, we evaluated it on two different experiment: RUL estimation and predicting the status of the IoT devices in 2-week period. Experiments are conducted on a publicly available NASA’s turbo fan-engine dataset. Based on the experiment results, the deep learning-based approach achieved high prediction accuracy. Moreover, the results show that the method outperforms standard well-accepted machine learning algorithms and accomplishes competitive performance when compared to the state-of-the art methods
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...gerogepatton
In the era of the fourth industrial revolution, measuring and ensuring the reliability, efficiency and safety of the industrial systems and components are one of the uppermost key concern. In addition, predicting performance degradation or remaining useful life (RUL) of an equipment over time based on its historical sensor data enables companies to greatly reduce their maintenance cost. In this way, companies can prevent costly unexpected breakdown and become more profitable and competitive in the marketplace. This paper introduces a deep learning-based method by combining CNN(Convolutional Neural Networks) and LSTM (Long Short-Term Memory)neural networks to predict RUL for industrial equipment. The proposed method does not depend upon any degradation trend assumptions and it can learn complex temporal representative and distinguishing patterns in the sensor data. In order to evaluate the efficiency and effectiveness of the proposed method, we evaluated it on two different experiment: RUL estimation and predicting the status of the IoT devices in 2-week period. Experiments are conducted on a publicly available NASA’s turbo fan-engine dataset. Based on the experiment results, the deep learning-based approach achieved high prediction accuracy. Moreover, the results show that the method outperforms standard well-accepted machine learning algorithms and accomplishes competitive performance when compared to the state-of-the art methods
This document reviews the use of data mining and neural network techniques for stock market prediction. It discusses how data mining can extract hidden patterns from large datasets and neural networks can handle nonlinear and uncertain financial data. Specifically, it examines how a combination of data mining and neural networks may improve the reliability of stock predictions by leveraging their complementary strengths. The document also provides an overview of common data mining and neural network methods used for this purpose, such as statistical data mining, neural network-based data processing, clustering, and fuzzy logic. It reviews several previous studies that found neural networks and other nonlinear techniques often outperform traditional statistical models at predicting stock prices and indices.
This document reviews the use of data mining and neural network techniques for stock market prediction. It discusses how data mining can extract hidden patterns from large datasets and make predictions about future trends. Neural networks are also effective for stock prediction due to their ability to handle uncertain and changing data. The document examines different data mining methods like statistical analysis, neural networks, clustering and fuzzy sets. It suggests that combining data mining and neural networks could improve the reliability of stock market predictions by uncovering the nonlinear patterns in stock price data.
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...ertekg
İndirmek için Bağlantı > https://ertekprojects.com/gurdal-ertek-publications/blog/re-mining-association-mining-results-through-visualization-data-envelopment-analysis-and-decision-trees/
Re-mining is a general framework which suggests the execution of additional data mining steps based on the results of an original data mining process. This study investigates the multi-faceted re-mining of association mining results, develops and presents a practical methodology, and shows the applicability of the developed methodology through real world data. The methodology suggests re-mining using data visualization, data envelopment analysis, and decision trees. Six hypotheses, regarding how re-mining can be carried out on association mining results, are answered in the case study through empirical analysis.
ANOMALY DETECTION AND ATTRIBUTION USING AUTO FORECAST AND DIRECTED GRAPHSIJDKP
In the business world, decision makers rely heavily on data to back their decisions. With the quantum of
data increasing rapidly, traditional methods used to generate insights from reports and dashboards will
soon become intractable. This creates a need for efficient systems which can substitute human intelligence
and reduce time latency in decision making. This paper describes an approach to process time series data
with multiple dimensions such as geographies, verticals, products, efficiently, and to detect anomalies in
the data and further, to explain potential reasons for the occurrence of the anomalies. The algorithm
implements auto selection of forecast models to make reliable forecasts and detect such anomalies. Depth
First Search (DFS) is applied to analyse each of these anomalies and find its root causes. The algorithm
filters the redundant causes and reports the insights to the stakeholders. Apart from being a hair-trigger
KPI tracking mechanism, this algorithm can also be customized for problems lke A/B testing, campaign
tracking and product evaluations.
This document discusses challenges and outlooks related to big data. It begins with an introduction describing how big data is being collected and analyzed in various fields such as science, education, healthcare, urban planning, and more. It then outlines the key phases in big data analysis: data acquisition and recording, information extraction and cleaning, data integration and representation, query processing and analysis, and result interpretation. For each phase, it discusses challenges and how existing techniques can be applied or extended to address big data issues. Some of the major challenges discussed are data scale, heterogeneity, lack of structure, privacy, timeliness, provenance, and visualization across the entire big data analysis pipeline.
Due to the arrival of new technologies, devices, and communication means, the amount of data produced by mankind is growing rapidly every year. This gives rise to the era of big data. The term big data comes with the new challenges to input, process and output the data. The paper focuses on limitation of traditional approach to manage the data and the components that are useful in handling big data. One of the approaches used in processing big data is Hadoop framework, the paper presents the major components of the framework and working process within the framework.
Selection of Articles Using Data Analytics for Behavioral Dissertation Resear...PhD Assistance
Outcomes in health-related issues including psychological, educational, Behavioral, environmental, and social are intended to sustain positive change by digital interferences. These changes may be delivered using any digital device like a phone or computer, and make them gainful for the provider. Complex and large-scale datasets that contain usage data can be yielded by testing a digital intervention. This data provides invaluable detail about how the users interact with these interventions and notify their knowledge of engagement, if they are analyzed properly. This paper recommends an innovative framework for the process of analyzing usage associated with a digital intervention .
PhD Assistance is an Academic The Best Dissertation Writing Service & Consulting Support Company established in 2001. specialiWeze in providing PhD Assignments, PhD Dissertation Writing Help , Statistical Analyses, and Programming Services to students in the USA, UK, Canada, UAE, Australia, New Zealand, Singapore and many more.
Website Visit: https://bit.ly/3dANXUD
Contact Us:
UK NO: +44-1143520021
India No: +91-8754446690
Email: info@phdassistance.com
IRJET- Survey of Estimation of Crop Yield using Agriculture DataIRJET Journal
This document discusses using data mining techniques to analyze agricultural data and estimate crop yields. It begins with an introduction to big data and data preprocessing techniques. The main goals of data cleaning are discussed, including filtering noise from datasets. Common data mining tasks like classification, regression, and clustering are also overviewed. Previous studies on crop yield prediction using clustering algorithms and weather data are summarized. The proposed system would use the DBSCAN clustering algorithm to analyze various agricultural datasets to more optimally and accurately estimate crop yields. This could help farmers make better decisions to increase production.
This document discusses medical data mining and classification techniques. It begins with an introduction to data mining and its applications in healthcare to improve treatment. Medical data mining can help discover patterns in medical data to aid diagnosis. Classification algorithms like decision trees can categorize medical records and help predict outcomes. Specifically, the document discusses the J48 decision tree algorithm available in the WEKA data mining tool, which implements the C4.5 algorithm for classification. Decision trees work by recursively splitting the data into subsets based on attribute values, forming a tree structure. The document concludes that while data mining can help with medical analysis, results from small medical datasets should be interpreted cautiously.
Mining Social Media Data for Understanding Drugs UsageIRJET Journal
This document discusses mining social media data to understand drug usage. It proposes using big data techniques like Hadoop and MapReduce to extract and analyze data from social networks about drug abuse. The methodology involves collecting data from platforms using crawlers, storing it in Hadoop, filtering it, then applying complex analysis using cloud computing. Prior work on extracting health information from social media and multi-scale community detection in networks is reviewed. The challenges of privacy preservation and scalability when anonymizing big healthcare datasets are also discussed.
STORAGE GROWING FORECAST WITH BACULA BACKUP SOFTWARE CATALOG DATA MININGcsandit
Backup software information is a potential source for data mining: not only the unstructured
stored data from all other backed-up servers, but also backup jobs metadata, which is stored in
a formerly known catalog database. Data mining this database, in special, could be used in
order to improve backup quality, automation, reliability, predict bottlenecks, identify risks,
failure trends, and provide specific needed report information that could not be fetched from
closed format property stock property backup software database. Ignoring this data mining
project might be costly, with lots of unnecessary human intervention, uncoordinated work and
pitfalls, such as having backup service disruption, because of insufficient planning. The specific
goal of this practical paper is using Knowledge Discovery in Database Time Series, Stochastic
Models and R scripts in order to predict backup storage data growth. This project could not be
done with traditional closed format proprietary solutions, since it is generally impossible to
read their database data from third party software because of vendor lock-in deliberate
overshadow. Nevertheless, it is very feasible with Bacula: the current third most popular backup
software worldwide, and open source. This paper is focused on the backup storage demand
prediction problem, using the most popular prediction algorithms. Among them, Holt-Winters
Model had the highest success rate for the tested data sets.
IRJET- Contradicting the Hypothesis of Data Analytics with the Help of a Use-...IRJET Journal
This document discusses how data analytics techniques can be used to analyze manufacturing industry data and help with decision making. It presents a case study analyzing expenses data from a manufacturing company from 1950 to 2020. Descriptive analytics on the data show trends in number of employees, working hours, costs of raw materials, machinery, overhead, labor, and profit over time. Diagnostic analytics provide reasons for these trends, such as increases in employees and costs correlating with new technologies and production increases. Predictive analytics are not discussed in the summary. The document suggests prescriptive analytics using advanced Industry 4.0 technologies like ultrafast 3D printing could help maximize profit and minimize employees and costs.
IRJET- A Survey on Mining of Tweeter Data for Predicting User BehaviorIRJET Journal
This document discusses mining and analyzing social media data using big data techniques to predict user behavior. It proposes using tools like Hadoop and HDFS to capture trends in areas like drug abuse from large amounts of Twitter data. A framework is presented that involves gathering Twitter data using APIs, applying big data mining techniques, and using the results for more sophisticated analysis applications to help address issues like monitoring public health. Challenges around managing large social media datasets are also discussed.
Hierarchical Forecasting and Reconciliation in The Context of Temporal HierarchyIRJET Journal
This document discusses hierarchical forecasting and reconciliation for temporally aggregated data that exhibits seasonal patterns. It analyzes 10 years of monthly foreign tourist visitation data to Kerala, India aggregated into monthly, quarterly, half-yearly, and annual levels. Different forecasting strategies are evaluated, including bottom-up, top-down, and optimal combination approaches using exponential smoothing techniques. The mean absolute percentage error is used to compare the accuracy of forecasts from each strategy. Preliminary results suggest the bottom-up approach outperforms other strategies on average and across all levels of the data hierarchy.
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...IJECEIAES
Medical image analysis has witnessed significant advancements with deep learning techniques. In the domain of brain tumor segmentation, the ability to
precisely delineate tumor boundaries from magnetic resonance imaging (MRI)
scans holds profound implications for diagnosis. This study presents an ensemble convolutional neural network (CNN) with transfer learning, integrating
the state-of-the-art Deeplabv3+ architecture with the ResNet18 backbone. The
model is rigorously trained and evaluated, exhibiting remarkable performance
metrics, including an impressive global accuracy of 99.286%, a high-class accuracy of 82.191%, a mean intersection over union (IoU) of 79.900%, a weighted
IoU of 98.620%, and a Boundary F1 (BF) score of 83.303%. Notably, a detailed comparative analysis with existing methods showcases the superiority of
our proposed model. These findings underscore the model’s competence in precise brain tumor localization, underscoring its potential to revolutionize medical
image analysis and enhance healthcare outcomes. This research paves the way
for future exploration and optimization of advanced CNN models in medical
imaging, emphasizing addressing false positives and resource efficiency.
More Related Content
Similar to Steps and Challenges in Analyzing Real Sensor Data from a Productive Press Shop and its Value for Predictive Maintenance Application
Learning Resources Week 2
Frankfort-Nachmias, C., & Leon-Guerrero, A. (2020). Social statistics for a diverse society (9th ed.). Sage Publications.
· Chapter 2, “The Organization and Graphic Presentation Data” (pp. 27-74)
Wagner, III, W. E. (2020). Using IBM® SPSS® statistics for research methods and social science statistics (7th ed.). Thousand Oaks, CA: Sage Publications.
· Chapter 5, “Charts and Graphs”
· Chapter 11, “Editing Output”
Walden University Writing Center. (n.d.). General guidance on data displays. Retrieved from http://waldenwritingcenter.blogspot.com/2013/02/general-guidance-on-data-displays.html
Use this website to guide you as you provide appropriate APA formatting and citations for data displays.
Laureate Education (Producer). (2016j). Visual displays of data [Video file]. Baltimore, MD: Author.
Note: The approximate length of this media piece is 9 minutes.
In this media program, Dr. Matt Jones discusses frequency distributions. Focus on how his explanation might support your analysis in this week’s Assignment. (video attached separately)
sustainability
Case Report
Integrated Understanding of Big Data, Big Data
Analysis, and Business Intelligence: A Case Study
of Logistics
Dong-Hui Jin and Hyun-Jung Kim *
Seoul Business School, aSSIST, 46 Ewhayeodae 2-gil, Seodaemun-gu, Seoul 03767, Korea; [email protected]
* Correspondence: [email protected]; Tel.: +82-70-7012-2722
Received: 5 October 2018; Accepted: 17 October 2018; Published: 19 October 2018
����������
�������
Abstract: Efficient decision making based on business intelligence (BI) is essential to ensure
competitiveness for sustainable growth. The rapid development of information and communication
technology has made collection and analysis of big data essential, resulting in a considerable increase
in academic studies on big data and big data analysis (BDA). However, many of these studies are
not linked to BI, as companies do not understand and utilize the concepts in an integrated way.
Therefore, the purpose of this study is twofold. First, we review the literature on BI, big data,
and BDA to show that they are not separate methods but an integrated decision support system.
Second, we explore how businesses use big data and BDA practically in conjunction with BI through
a case study of sorting and logistics processing of a typical courier enterprise. We focus on the
company’s cost efficiency as regards to data collection, data analysis/simulation, and the results from
actual application. Our findings may enable companies to achieve management efficiency by utilizing
big data through efficient BI without investing in additional infrastructure. It could also give them
indirect experience, thereby reducing trial and error in order to maintain or increase competitiveness.
Keywords: business application; big data; big data analysis; business intelligence; logistics;
courier service
1. Introduction
A growing number of corporations depend on various and cont.
Learning Resources Week 2
Frankfort-Nachmias, C., & Leon-Guerrero, A. (2020). Social statistics for a diverse society (9th ed.). Sage Publications.
· Chapter 2, “The Organization and Graphic Presentation Data” (pp. 27-74)
Wagner, III, W. E. (2020). Using IBM® SPSS® statistics for research methods and social science statistics (7th ed.). Thousand Oaks, CA: Sage Publications.
· Chapter 5, “Charts and Graphs”
· Chapter 11, “Editing Output”
Walden University Writing Center. (n.d.). General guidance on data displays. Retrieved from http://waldenwritingcenter.blogspot.com/2013/02/general-guidance-on-data-displays.html
Use this website to guide you as you provide appropriate APA formatting and citations for data displays.
Laureate Education (Producer). (2016j). Visual displays of data [Video file]. Baltimore, MD: Author.
Note: The approximate length of this media piece is 9 minutes.
In this media program, Dr. Matt Jones discusses frequency distributions. Focus on how his explanation might support your analysis in this week’s Assignment. (video attached separately)
sustainability
Case Report
Integrated Understanding of Big Data, Big Data
Analysis, and Business Intelligence: A Case Study
of Logistics
Dong-Hui Jin and Hyun-Jung Kim *
Seoul Business School, aSSIST, 46 Ewhayeodae 2-gil, Seodaemun-gu, Seoul 03767, Korea; [email protected]
* Correspondence: [email protected]; Tel.: +82-70-7012-2722
Received: 5 October 2018; Accepted: 17 October 2018; Published: 19 October 2018
����������
�������
Abstract: Efficient decision making based on business intelligence (BI) is essential to ensure
competitiveness for sustainable growth. The rapid development of information and communication
technology has made collection and analysis of big data essential, resulting in a considerable increase
in academic studies on big data and big data analysis (BDA). However, many of these studies are
not linked to BI, as companies do not understand and utilize the concepts in an integrated way.
Therefore, the purpose of this study is twofold. First, we review the literature on BI, big data,
and BDA to show that they are not separate methods but an integrated decision support system.
Second, we explore how businesses use big data and BDA practically in conjunction with BI through
a case study of sorting and logistics processing of a typical courier enterprise. We focus on the
company’s cost efficiency as regards to data collection, data analysis/simulation, and the results from
actual application. Our findings may enable companies to achieve management efficiency by utilizing
big data through efficient BI without investing in additional infrastructure. It could also give them
indirect experience, thereby reducing trial and error in order to maintain or increase competitiveness.
Keywords: business application; big data; big data analysis; business intelligence; logistics;
courier service
1. Introduction
A growing number of corporations depend on various and cont.
Data Science for Building Energy Management a reviewMigue.docxrandyburney60861
Data Science for Building Energy Management: a review
Miguel Molina-Solanaa,b, Maŕıa Rosa,∗, M. Dolores Ruiza, Juan Gómez-Romeroa, M.J. Martin-Bautistaa
aDepartment of Computer Science and Artificial Intelligence, Universidad de Granada
bData Science Institute, Imperial College London
Abstract
The energy consumption of residential and commercial buildings has risen steadily in recent years, an
increase largely due to their HVAC systems. Expected energy loads, transportation, and storage as well
as user behavior influence the quantity and quality of the energy consumed daily in buildings. However,
technology is now available that can accurately monitor, collect, and store the huge amount of data involved
in this process. Furthermore, this technology is capable of analyzing and exploiting such data in meaningful
ways. Not surprisingly, the use of data science techniques to increase energy efficiency is currently attracting
a great deal of attention and interest. This paper reviews how Data Science has been applied to address the
most difficult problems faced by practitioners in the field of Energy Management, especially in the building
sector. The work also discusses the challenges and opportunities that will arise with the advent of fully
connected devices and new computational technologies.
1. Introduction
There is a general consensus in the world today that human activities are having a negative impact
on the environment and have accelerated both global warming and climate change. These environmental
threats have been intensified by the emissions produced by the energy required for the lighting and HVAC
(heating, ventilation and air-conditioning) systems in building constructions. According to the International
Energy Agency (IEA), residential and commercial buildings are responsible for up to 32% of the total final
energy consumption. In fact, in most IEA countries, they account for approximately 40% of the primary
energy consumption. Similar statistics are given by the World Business Council for Sustainable Development
(WBCSD) within the framework of its Energy Efficiency in Buildings (EEB) project1. Also provided is a
comprehensive review [1] of the state of the art in building energy use (with a primary focus on energy
demand).
These data indicate that inefficient energy management in aging buildings combined with rising construc-
tion activity in developed countries will cause energy consumption to soar in the near future and heighten the
negative impacts associated with this consumption. Moreover, variable energy costs call for the implemen-
tation of more intelligent strategies to adapt and reduce energy consumption as well as to find alternative
and sustainable energy sources. The relevance of these issues is clearly reflected in the research priorities of
the European Union, as stated in its Horizon2020 Societal Challenge “Secure, Clean and Efficient Energy”.
This work program targets a significant reduction in energy consu.
IRJET- Fault Detection and Prediction of Failure using Vibration AnalysisIRJET Journal
This document discusses fault detection and prediction of failures in rotating equipment using vibration analysis. It begins by introducing vibration analysis as a method to monitor machines and detect faults in rotating components that may cause failures. It then discusses how motor vibration is measured and analyzed using techniques like spectrum analysis to identify faults like unbalance, bearing issues, or broken rotor bars. The document proposes decomposing vibration signals using intrinsic mode functions and calculating the Gabor representation's frequency marginal to identify fault types using classifiers like support vector machines or random forests. It provides context on data mining techniques relevant to this type of fault prediction problem.
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...ijaia
In the era of the fourth industrial revolution, measuring and ensuring the reliability, efficiency and safety of the industrial systems and components are one of the uppermost key concern. In addition, predicting performance degradation or remaining useful life (RUL) of an equipment over time based on its historical sensor data enables companies to greatly reduce their maintenance cost. In this way, companies can prevent costly unexpected breakdown and become more profitable and competitive in the marketplace. This paper introduces a deep learning-based method by combining CNN(Convolutional Neural Networks) and LSTM (Long Short-Term Memory)neural networks to predict RUL for industrial equipment. The proposed method does not depend upon any degradation trend assumptions and it can learn complex temporal representative and distinguishing patterns in the sensor data. In order to evaluate the efficiency and effectiveness of the proposed method, we evaluated it on two different experiment: RUL estimation and predicting the status of the IoT devices in 2-week period. Experiments are conducted on a publicly available NASA’s turbo fan-engine dataset. Based on the experiment results, the deep learning-based approach achieved high prediction accuracy. Moreover, the results show that the method outperforms standard well-accepted machine learning algorithms and accomplishes competitive performance when compared to the state-of-the art methods
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...gerogepatton
In the era of the fourth industrial revolution, measuring and ensuring the reliability, efficiency and safety of the industrial systems and components are one of the uppermost key concern. In addition, predicting performance degradation or remaining useful life (RUL) of an equipment over time based on its historical sensor data enables companies to greatly reduce their maintenance cost. In this way, companies can prevent costly unexpected breakdown and become more profitable and competitive in the marketplace. This paper introduces a deep learning-based method by combining CNN(Convolutional Neural Networks) and LSTM (Long Short-Term Memory)neural networks to predict RUL for industrial equipment. The proposed method does not depend upon any degradation trend assumptions and it can learn complex temporal representative and distinguishing patterns in the sensor data. In order to evaluate the efficiency and effectiveness of the proposed method, we evaluated it on two different experiment: RUL estimation and predicting the status of the IoT devices in 2-week period. Experiments are conducted on a publicly available NASA’s turbo fan-engine dataset. Based on the experiment results, the deep learning-based approach achieved high prediction accuracy. Moreover, the results show that the method outperforms standard well-accepted machine learning algorithms and accomplishes competitive performance when compared to the state-of-the art methods
This document reviews the use of data mining and neural network techniques for stock market prediction. It discusses how data mining can extract hidden patterns from large datasets and neural networks can handle nonlinear and uncertain financial data. Specifically, it examines how a combination of data mining and neural networks may improve the reliability of stock predictions by leveraging their complementary strengths. The document also provides an overview of common data mining and neural network methods used for this purpose, such as statistical data mining, neural network-based data processing, clustering, and fuzzy logic. It reviews several previous studies that found neural networks and other nonlinear techniques often outperform traditional statistical models at predicting stock prices and indices.
This document reviews the use of data mining and neural network techniques for stock market prediction. It discusses how data mining can extract hidden patterns from large datasets and make predictions about future trends. Neural networks are also effective for stock prediction due to their ability to handle uncertain and changing data. The document examines different data mining methods like statistical analysis, neural networks, clustering and fuzzy sets. It suggests that combining data mining and neural networks could improve the reliability of stock market predictions by uncovering the nonlinear patterns in stock price data.
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...ertekg
İndirmek için Bağlantı > https://ertekprojects.com/gurdal-ertek-publications/blog/re-mining-association-mining-results-through-visualization-data-envelopment-analysis-and-decision-trees/
Re-mining is a general framework which suggests the execution of additional data mining steps based on the results of an original data mining process. This study investigates the multi-faceted re-mining of association mining results, develops and presents a practical methodology, and shows the applicability of the developed methodology through real world data. The methodology suggests re-mining using data visualization, data envelopment analysis, and decision trees. Six hypotheses, regarding how re-mining can be carried out on association mining results, are answered in the case study through empirical analysis.
ANOMALY DETECTION AND ATTRIBUTION USING AUTO FORECAST AND DIRECTED GRAPHSIJDKP
In the business world, decision makers rely heavily on data to back their decisions. With the quantum of
data increasing rapidly, traditional methods used to generate insights from reports and dashboards will
soon become intractable. This creates a need for efficient systems which can substitute human intelligence
and reduce time latency in decision making. This paper describes an approach to process time series data
with multiple dimensions such as geographies, verticals, products, efficiently, and to detect anomalies in
the data and further, to explain potential reasons for the occurrence of the anomalies. The algorithm
implements auto selection of forecast models to make reliable forecasts and detect such anomalies. Depth
First Search (DFS) is applied to analyse each of these anomalies and find its root causes. The algorithm
filters the redundant causes and reports the insights to the stakeholders. Apart from being a hair-trigger
KPI tracking mechanism, this algorithm can also be customized for problems lke A/B testing, campaign
tracking and product evaluations.
This document discusses challenges and outlooks related to big data. It begins with an introduction describing how big data is being collected and analyzed in various fields such as science, education, healthcare, urban planning, and more. It then outlines the key phases in big data analysis: data acquisition and recording, information extraction and cleaning, data integration and representation, query processing and analysis, and result interpretation. For each phase, it discusses challenges and how existing techniques can be applied or extended to address big data issues. Some of the major challenges discussed are data scale, heterogeneity, lack of structure, privacy, timeliness, provenance, and visualization across the entire big data analysis pipeline.
Due to the arrival of new technologies, devices, and communication means, the amount of data produced by mankind is growing rapidly every year. This gives rise to the era of big data. The term big data comes with the new challenges to input, process and output the data. The paper focuses on limitation of traditional approach to manage the data and the components that are useful in handling big data. One of the approaches used in processing big data is Hadoop framework, the paper presents the major components of the framework and working process within the framework.
Selection of Articles Using Data Analytics for Behavioral Dissertation Resear...PhD Assistance
Outcomes in health-related issues including psychological, educational, Behavioral, environmental, and social are intended to sustain positive change by digital interferences. These changes may be delivered using any digital device like a phone or computer, and make them gainful for the provider. Complex and large-scale datasets that contain usage data can be yielded by testing a digital intervention. This data provides invaluable detail about how the users interact with these interventions and notify their knowledge of engagement, if they are analyzed properly. This paper recommends an innovative framework for the process of analyzing usage associated with a digital intervention .
PhD Assistance is an Academic The Best Dissertation Writing Service & Consulting Support Company established in 2001. specialiWeze in providing PhD Assignments, PhD Dissertation Writing Help , Statistical Analyses, and Programming Services to students in the USA, UK, Canada, UAE, Australia, New Zealand, Singapore and many more.
Website Visit: https://bit.ly/3dANXUD
Contact Us:
UK NO: +44-1143520021
India No: +91-8754446690
Email: info@phdassistance.com
IRJET- Survey of Estimation of Crop Yield using Agriculture DataIRJET Journal
This document discusses using data mining techniques to analyze agricultural data and estimate crop yields. It begins with an introduction to big data and data preprocessing techniques. The main goals of data cleaning are discussed, including filtering noise from datasets. Common data mining tasks like classification, regression, and clustering are also overviewed. Previous studies on crop yield prediction using clustering algorithms and weather data are summarized. The proposed system would use the DBSCAN clustering algorithm to analyze various agricultural datasets to more optimally and accurately estimate crop yields. This could help farmers make better decisions to increase production.
This document discusses medical data mining and classification techniques. It begins with an introduction to data mining and its applications in healthcare to improve treatment. Medical data mining can help discover patterns in medical data to aid diagnosis. Classification algorithms like decision trees can categorize medical records and help predict outcomes. Specifically, the document discusses the J48 decision tree algorithm available in the WEKA data mining tool, which implements the C4.5 algorithm for classification. Decision trees work by recursively splitting the data into subsets based on attribute values, forming a tree structure. The document concludes that while data mining can help with medical analysis, results from small medical datasets should be interpreted cautiously.
Mining Social Media Data for Understanding Drugs UsageIRJET Journal
This document discusses mining social media data to understand drug usage. It proposes using big data techniques like Hadoop and MapReduce to extract and analyze data from social networks about drug abuse. The methodology involves collecting data from platforms using crawlers, storing it in Hadoop, filtering it, then applying complex analysis using cloud computing. Prior work on extracting health information from social media and multi-scale community detection in networks is reviewed. The challenges of privacy preservation and scalability when anonymizing big healthcare datasets are also discussed.
STORAGE GROWING FORECAST WITH BACULA BACKUP SOFTWARE CATALOG DATA MININGcsandit
Backup software information is a potential source for data mining: not only the unstructured
stored data from all other backed-up servers, but also backup jobs metadata, which is stored in
a formerly known catalog database. Data mining this database, in special, could be used in
order to improve backup quality, automation, reliability, predict bottlenecks, identify risks,
failure trends, and provide specific needed report information that could not be fetched from
closed format property stock property backup software database. Ignoring this data mining
project might be costly, with lots of unnecessary human intervention, uncoordinated work and
pitfalls, such as having backup service disruption, because of insufficient planning. The specific
goal of this practical paper is using Knowledge Discovery in Database Time Series, Stochastic
Models and R scripts in order to predict backup storage data growth. This project could not be
done with traditional closed format proprietary solutions, since it is generally impossible to
read their database data from third party software because of vendor lock-in deliberate
overshadow. Nevertheless, it is very feasible with Bacula: the current third most popular backup
software worldwide, and open source. This paper is focused on the backup storage demand
prediction problem, using the most popular prediction algorithms. Among them, Holt-Winters
Model had the highest success rate for the tested data sets.
IRJET- Contradicting the Hypothesis of Data Analytics with the Help of a Use-...IRJET Journal
This document discusses how data analytics techniques can be used to analyze manufacturing industry data and help with decision making. It presents a case study analyzing expenses data from a manufacturing company from 1950 to 2020. Descriptive analytics on the data show trends in number of employees, working hours, costs of raw materials, machinery, overhead, labor, and profit over time. Diagnostic analytics provide reasons for these trends, such as increases in employees and costs correlating with new technologies and production increases. Predictive analytics are not discussed in the summary. The document suggests prescriptive analytics using advanced Industry 4.0 technologies like ultrafast 3D printing could help maximize profit and minimize employees and costs.
IRJET- A Survey on Mining of Tweeter Data for Predicting User BehaviorIRJET Journal
This document discusses mining and analyzing social media data using big data techniques to predict user behavior. It proposes using tools like Hadoop and HDFS to capture trends in areas like drug abuse from large amounts of Twitter data. A framework is presented that involves gathering Twitter data using APIs, applying big data mining techniques, and using the results for more sophisticated analysis applications to help address issues like monitoring public health. Challenges around managing large social media datasets are also discussed.
Hierarchical Forecasting and Reconciliation in The Context of Temporal HierarchyIRJET Journal
This document discusses hierarchical forecasting and reconciliation for temporally aggregated data that exhibits seasonal patterns. It analyzes 10 years of monthly foreign tourist visitation data to Kerala, India aggregated into monthly, quarterly, half-yearly, and annual levels. Different forecasting strategies are evaluated, including bottom-up, top-down, and optimal combination approaches using exponential smoothing techniques. The mean absolute percentage error is used to compare the accuracy of forecasts from each strategy. Preliminary results suggest the bottom-up approach outperforms other strategies on average and across all levels of the data hierarchy.
Similar to Steps and Challenges in Analyzing Real Sensor Data from a Productive Press Shop and its Value for Predictive Maintenance Application (20)
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...IJECEIAES
Medical image analysis has witnessed significant advancements with deep learning techniques. In the domain of brain tumor segmentation, the ability to
precisely delineate tumor boundaries from magnetic resonance imaging (MRI)
scans holds profound implications for diagnosis. This study presents an ensemble convolutional neural network (CNN) with transfer learning, integrating
the state-of-the-art Deeplabv3+ architecture with the ResNet18 backbone. The
model is rigorously trained and evaluated, exhibiting remarkable performance
metrics, including an impressive global accuracy of 99.286%, a high-class accuracy of 82.191%, a mean intersection over union (IoU) of 79.900%, a weighted
IoU of 98.620%, and a Boundary F1 (BF) score of 83.303%. Notably, a detailed comparative analysis with existing methods showcases the superiority of
our proposed model. These findings underscore the model’s competence in precise brain tumor localization, underscoring its potential to revolutionize medical
image analysis and enhance healthcare outcomes. This research paves the way
for future exploration and optimization of advanced CNN models in medical
imaging, emphasizing addressing false positives and resource efficiency.
International Conference on NLP, Artificial Intelligence, Machine Learning an...gerogepatton
International Conference on NLP, Artificial Intelligence, Machine Learning and Applications (NLAIM 2024) offers a premier global platform for exchanging insights and findings in the theory, methodology, and applications of NLP, Artificial Intelligence, Machine Learning, and their applications. The conference seeks substantial contributions across all key domains of NLP, Artificial Intelligence, Machine Learning, and their practical applications, aiming to foster both theoretical advancements and real-world implementations. With a focus on facilitating collaboration between researchers and practitioners from academia and industry, the conference serves as a nexus for sharing the latest developments in the field.
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTjpsjournal1
The rivalry between prominent international actors for dominance over Central Asia's hydrocarbon
reserves and the ancient silk trade route, along with China's diplomatic endeavours in the area, has been
referred to as the "New Great Game." This research centres on the power struggle, considering
geopolitical, geostrategic, and geoeconomic variables. Topics including trade, political hegemony, oil
politics, and conventional and nontraditional security are all explored and explained by the researcher.
Using Mackinder's Heartland, Spykman Rimland, and Hegemonic Stability theories, examines China's role
in Central Asia. This study adheres to the empirical epistemological method and has taken care of
objectivity. This study analyze primary and secondary research documents critically to elaborate role of
china’s geo economic outreach in central Asian countries and its future prospect. China is thriving in trade,
pipeline politics, and winning states, according to this study, thanks to important instruments like the
Shanghai Cooperation Organisation and the Belt and Road Economic Initiative. According to this study,
China is seeing significant success in commerce, pipeline politics, and gaining influence on other
governments. This success may be attributed to the effective utilisation of key tools such as the Shanghai
Cooperation Organisation and the Belt and Road Economic Initiative.
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Sinan KOZAK
Sinan from the Delivery Hero mobile infrastructure engineering team shares a deep dive into performance acceleration with Gradle build cache optimizations. Sinan shares their journey into solving complex build-cache problems that affect Gradle builds. By understanding the challenges and solutions found in our journey, we aim to demonstrate the possibilities for faster builds. The case study reveals how overlapping outputs and cache misconfigurations led to significant increases in build times, especially as the project scaled up with numerous modules using Paparazzi tests. The journey from diagnosing to defeating cache issues offers invaluable lessons on maintaining cache integrity without sacrificing functionality.
Introduction- e - waste – definition - sources of e-waste– hazardous substances in e-waste - effects of e-waste on environment and human health- need for e-waste management– e-waste handling rules - waste minimization techniques for managing e-waste – recycling of e-waste - disposal treatment methods of e- waste – mechanism of extraction of precious metal from leaching solution-global Scenario of E-waste – E-waste in India- case studies.
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...IJECEIAES
Climate change's impact on the planet forced the United Nations and governments to promote green energies and electric transportation. The deployments of photovoltaic (PV) and electric vehicle (EV) systems gained stronger momentum due to their numerous advantages over fossil fuel types. The advantages go beyond sustainability to reach financial support and stability. The work in this paper introduces the hybrid system between PV and EV to support industrial and commercial plants. This paper covers the theoretical framework of the proposed hybrid system including the required equation to complete the cost analysis when PV and EV are present. In addition, the proposed design diagram which sets the priorities and requirements of the system is presented. The proposed approach allows setup to advance their power stability, especially during power outages. The presented information supports researchers and plant owners to complete the necessary analysis while promoting the deployment of clean energy. The result of a case study that represents a dairy milk farmer supports the theoretical works and highlights its advanced benefits to existing plants. The short return on investment of the proposed approach supports the paper's novelty approach for the sustainable electrical system. In addition, the proposed system allows for an isolated power setup without the need for a transmission line which enhances the safety of the electrical network
artificial intelligence and data science contents.pptxGauravCar
What is artificial intelligence? Artificial intelligence is the ability of a computer or computer-controlled robot to perform tasks that are commonly associated with the intellectual processes characteristic of humans, such as the ability to reason.
› ...
Artificial intelligence (AI) | Definitio
Batteries -Introduction – Types of Batteries – discharging and charging of battery - characteristics of battery –battery rating- various tests on battery- – Primary battery: silver button cell- Secondary battery :Ni-Cd battery-modern battery: lithium ion battery-maintenance of batteries-choices of batteries for electric vehicle applications.
Fuel Cells: Introduction- importance and classification of fuel cells - description, principle, components, applications of fuel cells: H2-O2 fuel cell, alkaline fuel cell, molten carbonate fuel cell and direct methanol fuel cells.
Advanced control scheme of doubly fed induction generator for wind turbine us...IJECEIAES
This paper describes a speed control device for generating electrical energy on an electricity network based on the doubly fed induction generator (DFIG) used for wind power conversion systems. At first, a double-fed induction generator model was constructed. A control law is formulated to govern the flow of energy between the stator of a DFIG and the energy network using three types of controllers: proportional integral (PI), sliding mode controller (SMC) and second order sliding mode controller (SOSMC). Their different results in terms of power reference tracking, reaction to unexpected speed fluctuations, sensitivity to perturbations, and resilience against machine parameter alterations are compared. MATLAB/Simulink was used to conduct the simulations for the preceding study. Multiple simulations have shown very satisfying results, and the investigations demonstrate the efficacy and power-enhancing capabilities of the suggested control system.
Comparative analysis between traditional aquaponics and reconstructed aquapon...bijceesjournal
The aquaponic system of planting is a method that does not require soil usage. It is a method that only needs water, fish, lava rocks (a substitute for soil), and plants. Aquaponic systems are sustainable and environmentally friendly. Its use not only helps to plant in small spaces but also helps reduce artificial chemical use and minimizes excess water use, as aquaponics consumes 90% less water than soil-based gardening. The study applied a descriptive and experimental design to assess and compare conventional and reconstructed aquaponic methods for reproducing tomatoes. The researchers created an observation checklist to determine the significant factors of the study. The study aims to determine the significant difference between traditional aquaponics and reconstructed aquaponics systems propagating tomatoes in terms of height, weight, girth, and number of fruits. The reconstructed aquaponics system’s higher growth yield results in a much more nourished crop than the traditional aquaponics system. It is superior in its number of fruits, height, weight, and girth measurement. Moreover, the reconstructed aquaponics system is proven to eliminate all the hindrances present in the traditional aquaponics system, which are overcrowding of fish, algae growth, pest problems, contaminated water, and dead fish.
Steps and Challenges in Analyzing Real Sensor Data from a Productive Press Shop and its Value for Predictive Maintenance Application
1. International Journal on Cybernetics & Informatics (IJCI) Vol.13, No.3, June 2024
Bibhu Dash et al: SCDD -2024
pp. 01-10, 2024. IJCI – 2024 DOI:10.5121/ijci.2024.130301
STEPS AND CHALLENGES IN ANALYZING
REAL SENSOR DATA FROM A PRODUCTIVE
PRESS SHOP AND ITS VALUE FOR
PREDICTIVE MAINTENANCE APPLICATION
Safa Evirgen1
and Maylin Wartenberg2
1
Volkswagen AG, Berliner Ring 2, 38426 Wolfsburg, Germany
2
Hochschule Hannover, Recklinger Stadtweg 120,
30459 Hannover, Germany
ABSTRACT
This paper highlights the significance of AI-powered maintenance strategies in modern
industry for operational optimization and reduced downtime. It emphasizes the crucial role
of sensor data analysis in identifying anomalies and predicting failures. The research
specifically examines sensor data from an automotive press shop, addressing questions
related to data selection, collection challenges, and knowledge generation. By utilizing
unsupervised learning on compressed air data from a press line, the study identifies
patterns, anomalies, and correlations. The results offer insights into the potential for
implementing an effective predictive maintenance strategy. Additionally, a systematic
literature review underscores the importance of data analysis in production systems,
particularly in the context of maintenance.
KEYWORDS
Predictive Maintenance, smart database, sensor data analytics, data science, production.
1. INTRODUCTION
In today's increasingly connected industrial landscape, AI-powered maintenance strategies of
production facilities are gaining importance as companies strive to minimize downtime and
ensure more efficient operations. A key component of these maintenance strategies is the use of
sensor data to detect anomalies and predict potential failures[1]. In this paper, we focus on the
analysis of sensor data from a production press shop of an automotive manufacturer. Essentially,
the following research questions, the answers to which are structured according to the principle of
the core process of organizational data processing as described by Wölfl et al. (2019) (see
Figure1) are listed in detail below[2]:
2. International Journal on Cybernetics & Informatics (IJCI) Vol.13, No.3, June 2024
2
Figure 1. Research question structured based on core process of organizational data processing as described
by Wölfl et al. (2019).
The press shop is a central part of the manufacturing process in the automotive industry. This is
where car body parts are manufactured, formed and machined [3]. Through the use of various
sensors, valuable information can be obtained about the condition of the machines and tools. This
information can serve as the basis for implementing AI-supported maintenance strategies to avoid
unplanned downtime and increase productivity [4]. The aim of this study is to propose an
approach for handling and analyzing large volumes of productive data, specifically compressed
air data from a press line, for predictive maintenance. The process involves data volume
assessment, data selection, and the application of unsupervised learning techniques to identify
data patterns, anomalies, and correlations. The results will determine the feasibility and
effectiveness of a predictive maintenance system for the press shop. This research will assist
companies in making informed decisions to enhance their predictive maintenance strategies and
maximize the value of sensor data. The paper begins with a comprehensive literature review to
underscore the topic's relevance and identify important prior works. It then delves into the
specific case of compressed air data and presents the results. Additionally, it outlines the
approach's applicability in a predictive maintenance strategy. Finally, the paper provides a
summary of the findings and future research directions.
2. LITERATURE REVIEW – DATA ANALYSIS IN PRODUCTION SYSTEMS
RELATED TO MAINTENANCE
Considering the subject of the paper, this chapter identifies and examines existing literature. The
goal is to confirm the scientific relevance of the topic as well as to clarify the need for further
investigation in the context of using data mining methods in relation to production data.
Production data is narrowed down to production data in the context of maintenance. The
procedure of the systematic literature research is mainly based on the systematics according to
vom Brocke (2009) [5]. The search engines Google Scholar, IEEE Xplore and Science Direct
were used for the search. The main search terms "maintenance", "production" and "data science"
were linked by means of logical operators. In order not to distort the core content of the paper,
the individual steps up to finding the results according to vom Brocke (2009) are not presented in
detail. Instead, an overview of the procedure can be seen in Figure 2. According to this, the main
search criteria are first used to search the search engines under consideration, as just described. In
order to reduce the number of hits found to the essentials, duplicates are not considered, and
filtering is done by year and type of literature. Subsequently, the abstract of the respective
publication was screened before further filtering in the analysis phase. Specific search terms such
as Condition Monitoring or Knowledge Based Maintenance were used. The remaining
3. International Journal on Cybernetics & Informatics (IJCI) Vol.13, No.3, June 2024
3
publications were then analyzed in more detail. The result of the analysis can be seen in Table 1.
The literature was clustered into the main topics "Maintenance Strategy" and "Data Science". The
"Maintenance Strategy" cluster is divided into the current trend strategies Condition Monitoring
(CM), Predictive Maintenance (PM) and Knowledge Based Maintenance (KBM). The topic
"Data Science" was considered with a wider spectrum. The main classification falls under
Supervised Learning (S), Unsupervised Learning (US), Machine Learning (ML) as well as Deep
Learning (DL). Further, the amount of data considered in the paper was separated into Database
Small (DBS) and Database Large (DBL). Case studies in the papers with recorded data over
multiple cycles and weeks, and/or a set of data points greater than 100,000 are assigned to DBL.
Other-wise, or in cases where the amount of data is not precisely described in the papers, the
assumption is made that small amounts of data were used in a case study. Thus, these papers are
classified in DBS. In addition, the application domain in the considered papers was specifically
added. Here, we distinguish between a case study on production data (CSPD), case study on real
data (CSRD) and case study on simulation data (CSSD). It was also considered the point whether
different data sources and data types were used in the papers. These were assigned to the Various
Data Sources (VDS) category.
Figure 2. Outline of the stepwise approach to the literature search and filtering strategy.
In Figure 3, the authors explore maintenance strategies with a data analytics perspective. Among
the 27 authors, 20 concentrate on applying data mining methods in maintenance. Out of these, 12
use production data, and 6 employ real data from outside production machines, as seen in the
case study of Kammerer et al. (2020) involving the packaging industry [6]. 2 of the authors use
simulated data in their case studies, such as Feng et al. (2013) [7]. Here, simulated data for gear
acceleration is used to develop automatic time series models based on prediction errors and
provide the proof of function. Moreover, it can be noted that 15 out of 20 authors use small data
sets for the case studies. Only 5 of the papers considered use large data sets. For example, Han
and Chi(2016) use about 37,000 data points and thus are attributed to DBS, while Shi et al.
(2019) [8, 9] use about 32,500,000 with the considered vibration data and thus are attributed to
DBL. If in addition the component VDS is considered, a total of 7 authors can be summarized,
which were simultaneously attributed to DBS by means of which a case study was conducted and
related to maintenance strategies. For example, Sang et al. (2021) provided a decision support
model based on the use of various data sources. The data sources were mainly composed of
machine, process and sensor data. These data are used to develop a LSTM model to estimate the
RUL (remaining useful life). Real data from three robots and carrier plates with one bearing on
the one hand and from CNC machines on the other hand were used. As a result, a sufficient
statement about the planning of maintenance activities could be made, but Sang et al. emphasize
the complexity of Industry 4.0. Accordingly, the aspects of domain knowledge and the limitation
of the amount of data should be considered [10]. Tangible knowledge is needed especially for the
KBM strategy. 8 out of 27 authors deal with this strategy, whereas only 5 authors use production
4. International Journal on Cybernetics & Informatics (IJCI) Vol.13, No.3, June 2024
4
data and obtain data from different sources at the same time. Furthermore, the data sets of the just
mentioned authors are to be defined on DBS. This leads to the conclusion that in the field of
KBM a compilation of larger amounts of data is difficult. However, it should also be noted that
large data volumes are not synonymous with high data quality. Rather, the collection of,
according to Compare et al. (2020), smart data is of great importance [11]. Zenkert et al. (2021)
take the idea of knowledge utilization further in the context of manufacturing and highlight the
importance of knowledge for manufacturing companies [12]. Complementing this, Tao et al.
(2019), whose work is cited by Zenkert et al. (2021), emphasizes that the transformation of
manufacturing companies from knowledge-based intelligent production to data-driven and
knowledge-enabled smart production is in full swing [13]. However, challenges arise because
knowledge, in particular, is often only available verbally or in documents. This complicates the
integration into data-driven infrastructures that serve to build and use data analysis methods, such
as supervised learning [12]. Consequently, the need for an intelligent methodology in handling
large amounts of productive data (DBL) for the use of data analysis methods which lead to a
predictive maintenance application results.
Table 1. Analysis of the publications.
3. CASE STUDY – COMPRESSED AIR SYSTEM
A case study was conducted in an automotive manufacturer's press shop with a press line
equipped with numerous sensors producing terabytes of data weekly. Due to storage limitations,
historical sensor data isn't available, and data selection for predictive maintenance is crucial.
Maintenance
Strategy
Data Science
Author CM PM KBM S US ML DL DBS DBL CSPD CSRD CSSD VDS
[14] X X - X X X - X - X - - -
[15] - X - X X X - - - - - - X
[6] X X - - - X - X - - X - X
[16] - X - X - X - X - - X - -
[17] - X - - - X - - X X - - -
[11] - X - X X X X - - - - - -
[18] - X - X X X X - - - - - -
[10] - X - X - X - X - - X - X
[19] - X - - X X - X - - X - -
[12] - X X X X X X - - - - - X
[20] - - X X - - X X - X - - X
[21] - X - - - X - - - - - - X
[22] X - - - X X - X - X - - -
[7] X - - - X X - X - - - X -
[23] - - X X - X - X - X - - X
[9] X X - X - X - X - X - - -
[24] X X - X - X - - X X - - -
[25] X X X - - - - X - X - - X
[26] X X - X X X X X - X - - -
[27] X X X - - X X - - - - - -
[28] - X - X X X X X - X - - -
[8] X X - X X X X - X - X - X
[29] X X - X - X - - X - X - -
[30] X X - - X X - - X - - X X
[31] - - X - - X X - - - - - X
[32] X X X X X X X X - X - - X
[33] - - X X - X - X - X - - X
5. International Journal on Cybernetics & Informatics (IJCI) Vol.13, No.3, June 2024
5
Predictive maintenance aims to reduce downtime by detecting component failures early. To
achieve this, the root causes of unplanned downtimes must be identified. Asset and maintenance
logbooks are valuable sources in this context, recording detailed information about downtime
events. An analysis of these logbooks over ten years in the press shop revealed that many
unplanned downtimes were linked to compressed air loss. This insight was derived from a de-
tailed examination of textual data within technical documentation and confirmed through
interviews with experienced employees. The paper's focus is on analyzing compressed air data to
detect anomalies that may indicate such events, ultimately aiding in predictive maintenance. The
press line in question is a transfer press responsible for automatically moving components
between press stages using a bar as a carrier. Gripper arms with suction cups are used for
handling components, and compressed air is created by a compressor. This air is accelerated and
compressed by a special nozzle, creating a vacuum that helps lift components with the suction
cups. Once a component reaches the next press stage, the vacuum is released, and the component
is inserted into the press die. From a sensor data perspective, the process is illustrated in Figure 3.
The data includes the compressed air flow sensor values over time and the crankshaft position of
the press line. These two sets of data are then correlated, allowing the tracking of the forming
process at each press line step. For instance, the graph on the right in Fig. 4 illustrates the
relationship between compressed air flow sensor values and shaft positions, showing when
components are picked up and released by the suction cups.
Figure 3. Transfer compressed air system.
3.1. Data Preparation
For the case study, data from 16 compressed air flow sensors, three total air consumption sensors,
three compressor pressure sensors and a drive shaft sensor were recorded over a period of one
month. In addition to the sensors, the tool number and crankshaft position, also called press
position, were also recorded. This data is especially important to provide a reference to a specific
tool and or press position. Following the CRISP-DM model, the ETL (Extract, Transform, Load)
process is performed.
The data, recorded using Siemens AG's xTools software in XTS format, can only be read with
xTools. To make it more accessible, the data is converted from XTS to CSV using an extended
function in xTools. This conversion takes as long as the original recording time, e.g., 24 hours to
convert 24 hours of data. To expedite this process, every tenth data point from the XTS file is
converted, reducing the measurement frequency from 100 Hz to 10 Hz. This lower sampling rate
is sufficient for analysis and mitigates Big Data issues while requiring less computing power due
to the reduced data volume. The data for each sensor is stored in twelve-minute CSV packets, a
result of xTools' data packet size limit setting. The case study generated a total of 81,675 CSV
6. International Journal on Cybernetics & Informatics (IJCI) Vol.13, No.3, June 2024
6
files, amounting to 243 GB. Each file includes a timestamp in the first column, formatted as
"date:time.nanoseconds." A Python script was used to merge the CSV files by sensor and day.
To establish an initial understanding of the data and sensor data relationships, a preliminary
visualization was generated using the first 3000 data points, as depicted in Figure 5. Here a cyclic
characteristic of the sensor values L1-L16, as well as the press position becomes clear. The press
position moves within the range of 0° and 360°, which can be attributed to the rotation of the
crankshaft. From this, it can be deduced that a stroke occurs at the press line during one rotation.
Likewise, a cyclical movement over timecan be seen in the compressed air flow data. In addition,
it can be seen that the compressed air consumption first increases and then falls again in the
course of the press position. When the respective sensor values are compared, a clear difference
in the consumptions can be identified. For example, sensor 16, the front-of-line sensor, has a
significantly higher consumption than all the other sensors. Between timestamp 1000 and 2100,
there are no noticeable spikes in the data. This is because the press line is standing still and the
crankshaft is not moving. The assumption can be made here that the compressed air consumption
should decrease drastically. However, in Figure4, it can be seen that sensors L5, L9, L11 and L16
continue to have high consumption. This is an indication of leakage in the compressed air system.
In addition, the tool number is constant, so no tool change has taken place in the period
considered. In order to identify a dependency on the installed tool, the average amount of
compressed air consumption for three different tools was considered for sensor L1. In Figure 5, a
comparison of sensor L1 for tools 1, 9, and 69 reveals notable differences, which were confirmed
by experts. These differences are due to varying press settings for each tool, specifying when a
vacuum is generated on the suction cups at different press positions. As each press stage has
unique settings, the sensors are independent of one another.
Figure 4. Plot all sensor values over time.
Therefore, the focus for further consideration in all data points was placed on the sensor L1 with
the tool number 10. In addition, periods of planned downtime were filtered out of the data set,
leaving a total of 1651488 data points. This data set is used for the further anomaly analysis.
First, the data points were divided into the respective strokes so that the strokes of a press line can
be compared with each other. The result is a data set in which each row represents a stroke and
each column a press position. This results in a total of 26857 strokes for the data set under
consideration. This results in a total of 26857 strokes for the data set under consideration.
3.2. Unsupervised Machine Learning
The prepared data is used in the following for the application of unsupervised machine learning
models. The goal is to let the models learn patterns and correlations independently and thus
identify outliers. One known algorithm is the DBSCAN (Density-Based Spatial Clustering of
7. International Journal on Cybernetics & Informatics (IJCI) Vol.13, No.3, June 2024
7
Applications with Noise). This works density-based and has the ability to detect multiple clusters,
where noise points are returned separately. In the following, we consider the application of
DBSCAN to a time series and to a scatter plot, using DBSCAN from the scikit-learn library. The
algorithm assigns the label "-1" to all data points that cannot be assigned to a cluster. Thus, these
data points can be identified as outliers and the time of occurrence can be determined. For the
application on a time series, the result can be seen in Figure5.
Figure 5. Clustering with DBSCAN.
Since there is a correlation between the compressed air values of a sensor and the press position,
density-based clustering was further performed using DBSCAN algorithm for a scatter plot
consisting of the compressed air values of a sensor and the press position. To apply the algorithm,
several data preparation steps are required first. As a first step, unneeded columns are removed
from the data frame so that it only consists of the columns "Press position" and "Sensor value", as
well as "Datetime" as an index column. The data is then normalized using min-max normal-
ization in order to be able to use uniform parameters. After this data preparation, the actual
algorithm is performed. The algorithm forms clusters of points that are close together. Points that
cannot be assigned to a cluster are marked as outliers. Figure6 shows that the algorithm can
already identify the outliers clearly without perfectly optimized parameters. However, the use of
the results to create added value for production needs to be discussed in more detail.
Figure 6. Clustering with DBSCAN on a scatter plot.
3.3. Predictive Maintenance Approach
The analysis reveals clear anomalies in the press rotation curve, raising the question of how to
apply this information in a predictive maintenance strategy. For predictive maintenance to predict
possible defects or sudden shutdowns, the analysis should signal potential damage to production
machinery. The unsupervised learning analysis established a pattern using historical data and
identified outliers in the same time frame. The next step is realtime monitoring of each press
cycle, but manual monitoring of every cycle is impractical, and setting static alarm limits for
outliers is unreliable. To address these issues, supervised learning models, developed with
8. International Journal on Cybernetics & Informatics (IJCI) Vol.13, No.3, June 2024
8
domain experts, can be employed to assign real-world states to outliers based on sensor data.
However, this approach requires substantial effort to gather domain knowledge through
interviews or text evaluation. This use case provides valuable insight for future research into AI-
supported maintenance strategies.To fully implement a holistic predictive maintenance strategy,
the ability to predict damage before it occurs is essential. This necessitates integrating machine
knowledge, sensor data, and specialist empirical values. In essence, robust damage detection on
machines requires a solid link between data analysis and domain expertise, and standalone
anomaly detection in sensor data alone is not sufficient.
4. CONCLUSIONS
In conclusion, this paper has provided comprehensive guidelines for analyzing sensor data from a
productive press shop in order to determine its value for a predictive maintenance database. The
research questions addressed in this study revolved around the selection of sensor data,
challenges in data collection and analysis, and the creation of conditions to generate knowledge
from the data analyses. The analysis focused on sensor data from a production press shop in the
automotive industry, with the goal of supporting AI-powered maintenance strategies to minimize
downtime and increase productivity. Various data analysis techniques, particularly unsu-pervised
learning, were employed to identify patterns, anomalies, and correlations in the collected
compressed air data from a press line. By evaluating the results of the analysis, conclusions can
be drawn regarding the feasibility and effectiveness of a predictive maintenance strategy for the
press shop. The literature review conducted as part of this research highlighted the relevance of
data analysis in production systems related to maintenance. It revealed the application of data
mining methods in the context of maintenance strategies, including CM, PM and KBM. The
review also emphasized the importance of knowledge utilization and data-driven infrastructures
for effective data analysis in manufacturing companies. The case study on the compressed air
system of a press line demonstrated the practical implementation of the research approach. By
analyzing asset and maintenance logbooks, it was determined that a significant number of
unplanned downtimes were caused by compressed air loss. This finding guided the analysis of the
compressed air data, further validating the relevance of the study. The DBSCAN provided
acceptable results for outlier analysis, but it didn't fully meet predictive maintenance
requirements. Nevertheless, this work offers valuable insights for sensor data analysis in a press
shop. The findings can aid companies in making informed decisions about maintenance strategies
and maximizing sensor data potential. As Industry 4.0 evolves, future work should address the
integration of knowledge and data-driven infrastructures, key challenges in this field.
REFERENCES
[1] A. Hildebrandt und W. Landhäußer, Hg.,CSR und Digitalisierung: Der digitale Wandel als Chance
und Herausforderung für Wirtschaft und Gesellschaft. Berlin, Heidelberg: Springer Gabler, 2017.
[Online]. Verfügbar unter: http://www.springer.com/
[2] S. Wölfl, A. Leischnig, B. Ivens und D. Hein, „From Big Data to Smart Data – Problemfelder der
systematischen Nutzung von Daten in Unternehmen“ inGeschäftsmodelle in der digitalen Welt, W.
Becker et al., Hg., Wiesbaden: Springer Fachmedien Wiesbaden, 2019, S. 213–231, doi: 10.1007/978-
3-658-22129-4_11.
[3] M. Kropik, Produktionsleitsysteme für die Automobilindustrie: Digitalisierung des Shop-Floors in der
Automobilproduktion, 2. Aufl. Berlin, Heidelberg: Springer Vieweg, 2021.
[4] E. Traini, G. Bruno, G. D’Antonio und F. Lombardi, „Machine Learning Framework for Predictive
Maintenance in Milling“,IFAC-PapersOnLine, Jg. 52, Nr. 13, S. 177–182, 2019, doi:
10.1016/j.ifacol.2019.11.172.
[5] J. vom Brocke, A. Simons, B. Niehaves, K. Riemer, R. Plattfaut und A. Cleven, Reconstructing the
giant: On the importance of rigour in documenting the literature search process, 2009. [Online].
Verfügbar unter: https://aisel.aisnet.org/cgi/viewcontent.cgi?article=1145&context=ecis2009
9. International Journal on Cybernetics & Informatics (IJCI) Vol.13, No.3, June 2024
9
[6] K. Kammerer, R. Pryss, B. Hoppenstedt, K. Sommer und M. Reichert, „Process-Driven and Flow-
Based Processing of Industrial Sensor Data“ (eng),Sensors (Basel, Switzerland), Jg. 20, Nr. 18, 2020,
doi: 10.3390/s20185245.
[7] W. Feng, L. Xia, Z. Bing und W. Xiang-yu, „A study on time series models and criterion rules based
on condition monitoring of the tracking test system“ in2013 International Conference on Mechatronic
Sciences, Electric Engineering and Computer (MEC), Shengyang, China, 122013, S. 663–667, doi:
10.1109/MEC.2013.6885146.
[8] J.-H. Han und S.-Y. Chi, „Consideration of manufacturing data to apply machine learning methods
for predictive manufacturing“ in2016 Eighth International Conference on Ubiquitous and Future
Networks (ICUFN), Vienna, Austria, 72016, S. 109–113, doi: 10.1109/ICUFN.2016.7536995.
[9] C. Shi, G. Panoutsos, B. Luo, H. Liu, B. Li und X. Lin, „Using Multiple-Feature-Spaces-Based Deep
Learning for Tool Condition Monitoring in Ultraprecision Manufacturing“,IEEE Trans. Ind.
Electron., Jg. 66, Nr. 5, S. 3794–3803, 2019, doi: 10.1109/TIE.2018.2856193.
[10] G. M. Sang, L. Xu und P. de Vrieze, „A Predictive Maintenance Model for Flexible Manufacturing in
the Context of Industry 4.0“ (eng), Frontiers in big data, Jg. 4, S. 663466, 2021, doi:
10.3389/fdata.2021.663466.
[11] M. Compare, P. Baraldi und E. Zio, „Challenges to IoT-Enabled Predictive Maintenance for Industry
4.0“,IEEE Internet Things J., Jg. 7, Nr. 5, S. 4585–4597, 2020, doi: 10.1109/JIOT.2019.2957029.
[12] J. Zenkert, C. Weber, M. Dornhöfer, H. Abu-Rasheed und M. Fathi, „Knowledge Integration in Smart
Factories“, Encyclopedia, Jg. 1, Nr. 3, S. 792–811, 2021, doi: 10.3390/encyclopedia1030061.
[13] F. Tao, Q. Qi, L. Wang und A. Nee, „Digital Twins and Cyber–Physical Systems toward Smart
Manufacturing and Industry 4.0: Correlation and Comparison“, Engineering, Jg. 5, Nr. 4, S. 653–661,
2019, doi: 10.1016/j.eng.2019.01.014.
[14] Susana Ferreiro, Egoitz Konde, Santiago Fernandez und Augustin Prado, INDUSTRY 4.0: Predictive
Intelligent Maintenance for Production Equipment, 2016. [Online]. Verfügbar unter:
http://www.papers.phmsociety.org/index.php/phme/article/view/1667
[15] G. M. Sang, L. Xu, P. de Vrieze, Y. Bai und F. Pan, „Predictive Maintenance in Industry 4.0“ in
ICIST '20: 10th International Conference on Information Systems and Technologies, Lecce Italy,
06042020, S. 1–11, doi: 10.1145/3447568.3448537.
[16] M. Paolanti, L. Romeo, A. Felicetti, A. Mancini, E. Frontoni und J. Loncarski, „Machine Learning
approach for Predictive Maintenance in Industry 4.0“ in 2018 14th IEEE/ASME International
Conference on Mechatronic and Embedded Systems and Applications (MESA), Oulu, 72018, S. 1–6,
doi: 10.1109/MESA.2018.8449150.
[17] L. Spendla, M. Kebisek, P. Tanuska und L. Hrcka, „Concept of predictive maintenance of production
systems in accordance with industry 4.0“ in2017 IEEE 15th International Symposium on Applied
Machine Intelligence and Informatics (SAMI), Herl'any, Slovakia, 12017, S. 405–410, doi:
10.1109/SAMI.2017.7880343.
[18] Z. M. Çınar, A. AbdussalamNuhu, Q. Zeeshan, O. Korhan, M. Asmael und B. Safaei, „Machine
Learning in Predictive Maintenance towards Sustainable Smart Manufacturing in Industry
4.0“,Sustainability, Jg. 12, Nr. 19, S. 8211, 2020, doi: 10.3390/su12198211.
[19] F. Morselli, L. Bedogni, U. Mirani, M. Fantoni und S. Galasso, „Anomaly Detection and
Classification in Predictive Maintenance Tasks with Zero Initial Training“, IoT, Jg. 2, Nr. 4, S. 590–
609, 2021, doi: 10.3390/iot2040030.
[20] S. Ren, Y. Zhang, T. Sakao, Y. Liu und R. Cai, „An Advanced Operation Mode with Product-Service
System Using Lifecycle Big Data and Deep Learning“, Int. J. of Precis. Eng. and Manuf.-Green
Tech., Jg. 9, Nr. 1, S. 287–303, 2022, doi: 10.1007/s40684-021-00354-3.
[21] M. Pech, J. Vrchota und J. Bednář, „Predictive Maintenance and Intelligent Sensors in Smart Factory:
Review“ (eng),Sensors (Basel, Switzerland), Jg. 21, Nr. 4, 2021, doi: 10.3390/s21041470.
[22] D. Kateris, D. Moshou, X.-E. Pantazi, I. Gravalos, N. Sawalhi und S. Loutridis, „A machine learning
approach for the condition monitoring of rotating machinery“, J Mech Sci Technol, Jg. 28, Nr. 1, S.
61–71, 2014, doi: 10.1007/s12206-013-1102-y.
[23] B. Denkena, J. Schmidt und M. Krüger, „Data Mining Approach for Knowledge-based Process
Planning“, Procedia Technology, Jg. 15, S. 406–415, 2014, doi: 10.1016/j.protcy.2014.09.095.
[24] A. Kanawaday und A. Sane, „Machine learning for predictive maintenance of industrial machines
using IoT sensor data“ in 2017 8th IEEE International Conference on Software Engineering and
Service Science (ICSESS), Beijing, China, 112017, S. 87–90, doi: 10.1109/ICSESS.2017.8342870.
10. International Journal on Cybernetics & Informatics (IJCI) Vol.13, No.3, June 2024
10
[25] B. Schmidt und L. Wang, „Cloud-enhanced predictive maintenance“, Int J Adv Manuf Technol, Jg.
99, 1-4, S. 5–13, 2018, doi: 10.1007/s00170-016-8983-8.
[26] M. Zekveld und G. P. Hancke, „Vibration Condition Monitoring Using Machine Learning“ inIECON
2018 - 44th Annual Conference of the IEEE Industrial Electronics Society, Washington, DC, 102018,
S. 4742–4747, doi: 10.1109/IECON.2018.8591167.
[27] O. O. Aremu, A. S. Palau, A. K. Parlikad, D. Hyland-Wood und P. R. McAree, „Structuring Data for
Intelligent Predictive Maintenance in Asset Management“, IFAC-PapersOnLine, Jg. 51, Nr. 11, S.
514–519, 2018, doi: 10.1016/j.ifacol.2018.08.370.
[28] S. Butte, A. R. Prashanth und S. Patil, „Machine Learning Based Predictive Maintenance Strategy: A
Super Learning Approach with Deep Neural Networks“ in2018 IEEE Workshop on Microelectronics
and Electron Devices (WMED), Boise, ID, 42018, S. 1–5, doi: 10.1109/WMED.2018.8360836.
[29] H. Wang, A. Nunez, Z. Liu, D. Zhang und R. Dollevoet, „ A Bayesian Network Approach for
Condition Monitoring of High-Speed Railway Catenaries“, IEEE Trans. Intell. Transport. Syst., Jg.
21, Nr. 10, S. 4037–4051, 2020, doi: 10.1109/TITS.2019.2934346.
[30] J. Sossenheimer, J. Walther, J. Fleddermann und E. Abele, „A Sensor Reduced Machine Learning
Approach for Condition-based Energy Monitoring for Machine Tools“, Procedia CIRP, Jg. 81, S.
570–575, 2019, doi: 10.1016/j.procir.2019.03.157.
[31] T. Nemeth, F. Ansari, W. Sihn, B. Haslhofer und A. Schindler, „PriMa-X: A reference model for
realizing prescriptive maintenance and assessing its maturity enhanced by machine learning“,
Procedia CIRP, Jg. 72, S. 1039–1044, 2018, doi: 10.1016/j.procir.2018.03.280.
[32] F. Ansari, R. Glawar und T. Nemeth, „PriMa: a prescriptive maintenance model for cyber-physical
production systems“, International Journal of Computer Integrated Manufacturing, Jg. 32, 4-5, S.
482–503, 2019, doi: 10.1080/0951192X.2019.1571236.
[33] K. Schenkelberg, U. Seidenberg und F. Ansari, „Analyzing the impact of maintenance on profitability
using dynamic bayesian networks“, Procedia CIRP, Jg. 88, S. 42–47, 2020, doi:
10.1016/j.procir.2020.05.008.
AUTHORS
M.Sc. Safa Evirgen, In his current position, he works as IT Service Manager in
Volkswagen AG's IT department at the Wolfsburg plant, focusing on increasing IT
security in production and standardizing processes. This position was followed by
several years in the press shop, where he mainly managed IT and digitalization
projects, including a project with the aim of implementing predictive maintenance in
the press shop. His academic career began with a bachelor's and master's degree in
industrial engineering at TU Dortmund University, including a semester abroad at
California State University Long Beach. His doctorate began with a position as a
doctoral student at Volkswagen AG in the Wolfsburg press shop with the aim of
designing an intelligent data pipeline on which data mining methods can be applied with the aim of
implementing a prescriptive maintenance strategy in production.
Prof. Dr. Maylin Wartenberg, Professor of business informatics with a focus on
data science at Hanover University of Applied Sciences and Arts since 2018. Her
research interests cover various topics in the field of data analytics and its interaction
with computer-aided simulation, with a particular focus on sustainable mobility and
logistics (see www.das-hub.de). In addition, she gives (popular) scientific lectures in
the field of artificial intelligence. In teaching, she is characterized by the use of
innovative concepts and the testing of new formats. Since 2023, she has been a
member of the board of the Institute for Applied Data Science at Hannover
University of Applied Sciences and Arts, Data|H. Her academic career led her to a degree and doctorate in
mathematics at the TU Braunschweig. Prior to her position at Hanover University of Applied Sciences and
Arts, she gained almost 10 years of professional experience at the Volkswagen Group, at VW Financial
Services. There she held various positions, from system analyst to project and team leader to sub-
department head.