This document describes a proposed method for rolling bearing fault diagnosis using a hybrid deep learning model. The method constructs a model combining a gated recurrent unit (GRU) and sparse autoencoder (SAE) to directly extract fault features from raw vibration signals. The GRU is used to extract time-series features from the signals, which are then input to the SAE to obtain more robust representations. Key parameters of the hybrid model are optimized using a grey wolf optimizer algorithm. The features extracted by the model are input to a classifier to obtain the final diagnosis results. The method aims to accurately and steadily diagnose rolling bearing faults without requiring advanced signal processing or feature selection.
Fault Diagnostics of Rolling Bearing based on Improve Time and Frequency Doma...ijsrd.com
The neural network based approaches a feed forward neural network trained with Back Propagation technique was used for automatic diagnosis of defects in bearings. Vibration time domain signals were collected from a normal bearing and defective bearings under various speed conditions. The signals were processed to obtain various statistical parameters, which are good indicators of bearing condition, then best features are selected from graphical method and these inputs were used to train the neural network and the output represented the bearing states. The trained neural networks were used for the recognition of bearing states. The results showed that the trained neural networks were able to distinguish a normal bearing from defective bearings with 83.33 % reliability. Moreover, the network was able to classify the bearings into different states with success rates better than those achieved with the best among the state-of-the-art techniques.
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, VOL. 65, NO. 3, M.docxsheronlewthwaite
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, VOL. 65, NO. 3, MARCH 2018 2727
Electric Locomotive Bearing Fault Diagnosis
Using a Novel Convolutional
Deep Belief Network
Haidong Shao, Hongkai Jiang , Member, IEEE, Haizhou Zhang, and Tianchen Liang
Abstract—Bearing fault diagnosis is of significance to
enhance the reliability and security of electric locomotive.
In this paper, a novel convolutional deep belief network
(CDBN) is proposed for bearing fault diagnosis. First, an
auto-encoder is used to compress data and reduce the di-
mension. Second, a novel CDBN is constructed with Gaus-
sian visible units to learn the representative features. Third,
exponential moving average is employed to improve the
performance of the constructed deep model. The proposed
method is applied to analyze experimental signals collected
from electric locomotive bearings. The results show that
the proposed method is more effective than the traditional
methods and standard deep learning methods.
Index Terms—Convolutional deep belief network (CDBN),
electric locomotive bearing, exponential moving average
(EMA), fault diagnosis, feature learning.
NOMENCLATURE
ANFIS Adaptive neuro fuzzy inference system.
ANN Artificial neural network.
BPNN Back propagation neural network.
CDBN Convolutional deep belief network.
CNN Convolutional neural network.
CRBM Convolutional restricted Boltzmann machine.
DAE Deep auto-encoder.
DBN Deep belief network.
EMA Exponential moving average.
FD Frequency domain.
PCA Principal component analysis.
RBM Restricted Boltzmann machine.
SVM Support vector machine.
TD Time domain.
Manuscript received January 13, 2017; revised April 24, 2017 and
June 26, 2017; accepted August 5, 2017. Date of publication August
25, 2017; date of current version December 15, 2017. This work was
supported in part by the National Natural Science Foundation of China
under Grant 51475368, in part by the Shanghai Engineering Research
Center of Civil Aircraft Health Monitoring Foundation of China under
Grant GCZX-2015-02, and in part by the Innovation Foundation for Doc-
tor Dissertation of Northwestern Polytechnical University under Grant
CX201710. (Corresponding author: Hongkai Jiang.)
The authors are with the School of Aeronautics, Northwestern Poly-
technical University, Xi’an 710072, China (e-mail: [email protected]
edu.cn; [email protected]; [email protected];
[email protected]).
Color versions of one or more of the figures in this paper are available
online at http://ieeexplore.ieee.org.
Digital Object Identifier 10.1109/TIE.2017.2745473
I. INTRODUCTION
E LECTRIC locomotive is playing a more and more impor-tant role in the modern transportation. The key parts of
electric locomotive usually get various faults due to the harsh
operating conditions, which may result in great catastrophes.
Bearing is one of the most widely used components in elec-
tric locomotive [1]; thus, automatic and accurate fault diagnosis
techniques are critically needed to ensure the sa ...
In this paper, the artificial neural network (ANN) has been utilized for rotating machinery faults detection and classification. First, experiments were performed to measure the lateral vibration signals of laboratory test rigs for rotor-disk-blade when the blades are defective. A rotor-disk-blade system with 6 regular blades and 5 blades with various defects was constructed. Second, the ANN was applied to classify the different x- and y-axis lateral vibrations due to different blade faults. The results based on training and testing with different data samples of the fault types indicate that the ANN is robust and can effectively identify and distinguish different blade faults caused by lateral vibrations in a rotor. As compared to the literature, the present paper presents a novel work of identifying and classifying various rotating blade faults commonly encountered in rotating machines using ANN. Experimental data of lateral vibrations of the rotor-disk-blade system in both x- and y-directions are used for the training and testing of the network.
Induction Motor Bearing Health Condition Classification Using Machine Learnin...Niloy Sikder
The document presents a seminar on classifying induction motor bearing health conditions using machine learning algorithms. It introduces the topic, discusses electric motors and bearings, and reviews several studies that classify bearing faults using techniques like adaptive deep convolutional neural networks, k-nearest neighbors, general-purpose graphics processing units, convolutional neural networks, and random forests. The studies are compared based on their classification accuracy, precision, recall, and F1-score. The conclusion is that researchers are making progress toward achieving perfect classification accuracy and developing an online bearing fault detection module.
This document discusses using artificial neural networks (ANN) and Daubechies wavelet transforms to diagnose faults in induction motor bearings based on vibration signal analysis. It presents the following key points:
1) Vibration signals were collected from a test rig under healthy and faulty bearing conditions. Statistical features were extracted from the signals using different Daubechies wavelet transforms.
2) These statistical features were used as input for an ANN to classify the bearing conditions. The Db4 wavelet produced the most accurate fault classifications by the ANN.
3) The methodology involved feature extraction from raw vibration signals using Daubechies wavelets, selecting the best wavelet based on classification accuracy, and using
1) The document discusses using discrete wavelet transforms to analyze vibration signals from roller bearings to detect faults. It proposes a new feature - summing the squared wavelet decomposition coefficients at each level - and compares it to the traditional energy-based feature.
2) An experiment is described where vibration signals are collected from a test rig under normal conditions and with introduced inner race, outer race, and combined faults. The signals are decomposed using discrete wavelet transforms.
3) Features are then extracted from the wavelet decompositions using both the proposed summed squared coefficient feature and the traditional energy-based feature. A decision tree is used to classify the features and determine which feature performs better at detecting the faults.
Survey on deep learning applied to predictive maintenance IJECEIAES
Prognosis health monitoring (PHM) plays an increasingly important role in the management of machines and manufactured products in today’s industry, and deep learning plays an important part by establishing the optimal predictive maintenance policy. However, traditional learning methods such as unsupervised and supervised learning with standard architectures face numerous problems when exploiting existing data. Therefore, in this essay, we review the significant improvements in deep learning made by researchers over the last 3 years in solving these difficulties. We note that researchers are striving to achieve optimal performance in estimating the remaining useful life (RUL) of machine health by optimizing each step from data to predictive diagnostics. Specifically, we outline the challenges at each level with the type of improvement that has been made, and we feel that this is an opportunity to try to select a state-of-the-art architecture that incorporates these changes so each researcher can compare with his or her model. In addition, post-RUL reasoning and the use of distributed computing with cloud technology is presented, which will potentially improve the classification accuracy in maintenance activities. Deep learning will undoubtedly prove to have a major impact in upgrading companies at the lowest cost in the new industrial revolution, Industry 4.0.
Fault Diagnostics of Rolling Bearing based on Improve Time and Frequency Doma...ijsrd.com
The neural network based approaches a feed forward neural network trained with Back Propagation technique was used for automatic diagnosis of defects in bearings. Vibration time domain signals were collected from a normal bearing and defective bearings under various speed conditions. The signals were processed to obtain various statistical parameters, which are good indicators of bearing condition, then best features are selected from graphical method and these inputs were used to train the neural network and the output represented the bearing states. The trained neural networks were used for the recognition of bearing states. The results showed that the trained neural networks were able to distinguish a normal bearing from defective bearings with 83.33 % reliability. Moreover, the network was able to classify the bearings into different states with success rates better than those achieved with the best among the state-of-the-art techniques.
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, VOL. 65, NO. 3, M.docxsheronlewthwaite
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, VOL. 65, NO. 3, MARCH 2018 2727
Electric Locomotive Bearing Fault Diagnosis
Using a Novel Convolutional
Deep Belief Network
Haidong Shao, Hongkai Jiang , Member, IEEE, Haizhou Zhang, and Tianchen Liang
Abstract—Bearing fault diagnosis is of significance to
enhance the reliability and security of electric locomotive.
In this paper, a novel convolutional deep belief network
(CDBN) is proposed for bearing fault diagnosis. First, an
auto-encoder is used to compress data and reduce the di-
mension. Second, a novel CDBN is constructed with Gaus-
sian visible units to learn the representative features. Third,
exponential moving average is employed to improve the
performance of the constructed deep model. The proposed
method is applied to analyze experimental signals collected
from electric locomotive bearings. The results show that
the proposed method is more effective than the traditional
methods and standard deep learning methods.
Index Terms—Convolutional deep belief network (CDBN),
electric locomotive bearing, exponential moving average
(EMA), fault diagnosis, feature learning.
NOMENCLATURE
ANFIS Adaptive neuro fuzzy inference system.
ANN Artificial neural network.
BPNN Back propagation neural network.
CDBN Convolutional deep belief network.
CNN Convolutional neural network.
CRBM Convolutional restricted Boltzmann machine.
DAE Deep auto-encoder.
DBN Deep belief network.
EMA Exponential moving average.
FD Frequency domain.
PCA Principal component analysis.
RBM Restricted Boltzmann machine.
SVM Support vector machine.
TD Time domain.
Manuscript received January 13, 2017; revised April 24, 2017 and
June 26, 2017; accepted August 5, 2017. Date of publication August
25, 2017; date of current version December 15, 2017. This work was
supported in part by the National Natural Science Foundation of China
under Grant 51475368, in part by the Shanghai Engineering Research
Center of Civil Aircraft Health Monitoring Foundation of China under
Grant GCZX-2015-02, and in part by the Innovation Foundation for Doc-
tor Dissertation of Northwestern Polytechnical University under Grant
CX201710. (Corresponding author: Hongkai Jiang.)
The authors are with the School of Aeronautics, Northwestern Poly-
technical University, Xi’an 710072, China (e-mail: [email protected]
edu.cn; [email protected]; [email protected];
[email protected]).
Color versions of one or more of the figures in this paper are available
online at http://ieeexplore.ieee.org.
Digital Object Identifier 10.1109/TIE.2017.2745473
I. INTRODUCTION
E LECTRIC locomotive is playing a more and more impor-tant role in the modern transportation. The key parts of
electric locomotive usually get various faults due to the harsh
operating conditions, which may result in great catastrophes.
Bearing is one of the most widely used components in elec-
tric locomotive [1]; thus, automatic and accurate fault diagnosis
techniques are critically needed to ensure the sa ...
In this paper, the artificial neural network (ANN) has been utilized for rotating machinery faults detection and classification. First, experiments were performed to measure the lateral vibration signals of laboratory test rigs for rotor-disk-blade when the blades are defective. A rotor-disk-blade system with 6 regular blades and 5 blades with various defects was constructed. Second, the ANN was applied to classify the different x- and y-axis lateral vibrations due to different blade faults. The results based on training and testing with different data samples of the fault types indicate that the ANN is robust and can effectively identify and distinguish different blade faults caused by lateral vibrations in a rotor. As compared to the literature, the present paper presents a novel work of identifying and classifying various rotating blade faults commonly encountered in rotating machines using ANN. Experimental data of lateral vibrations of the rotor-disk-blade system in both x- and y-directions are used for the training and testing of the network.
Induction Motor Bearing Health Condition Classification Using Machine Learnin...Niloy Sikder
The document presents a seminar on classifying induction motor bearing health conditions using machine learning algorithms. It introduces the topic, discusses electric motors and bearings, and reviews several studies that classify bearing faults using techniques like adaptive deep convolutional neural networks, k-nearest neighbors, general-purpose graphics processing units, convolutional neural networks, and random forests. The studies are compared based on their classification accuracy, precision, recall, and F1-score. The conclusion is that researchers are making progress toward achieving perfect classification accuracy and developing an online bearing fault detection module.
This document discusses using artificial neural networks (ANN) and Daubechies wavelet transforms to diagnose faults in induction motor bearings based on vibration signal analysis. It presents the following key points:
1) Vibration signals were collected from a test rig under healthy and faulty bearing conditions. Statistical features were extracted from the signals using different Daubechies wavelet transforms.
2) These statistical features were used as input for an ANN to classify the bearing conditions. The Db4 wavelet produced the most accurate fault classifications by the ANN.
3) The methodology involved feature extraction from raw vibration signals using Daubechies wavelets, selecting the best wavelet based on classification accuracy, and using
1) The document discusses using discrete wavelet transforms to analyze vibration signals from roller bearings to detect faults. It proposes a new feature - summing the squared wavelet decomposition coefficients at each level - and compares it to the traditional energy-based feature.
2) An experiment is described where vibration signals are collected from a test rig under normal conditions and with introduced inner race, outer race, and combined faults. The signals are decomposed using discrete wavelet transforms.
3) Features are then extracted from the wavelet decompositions using both the proposed summed squared coefficient feature and the traditional energy-based feature. A decision tree is used to classify the features and determine which feature performs better at detecting the faults.
Survey on deep learning applied to predictive maintenance IJECEIAES
Prognosis health monitoring (PHM) plays an increasingly important role in the management of machines and manufactured products in today’s industry, and deep learning plays an important part by establishing the optimal predictive maintenance policy. However, traditional learning methods such as unsupervised and supervised learning with standard architectures face numerous problems when exploiting existing data. Therefore, in this essay, we review the significant improvements in deep learning made by researchers over the last 3 years in solving these difficulties. We note that researchers are striving to achieve optimal performance in estimating the remaining useful life (RUL) of machine health by optimizing each step from data to predictive diagnostics. Specifically, we outline the challenges at each level with the type of improvement that has been made, and we feel that this is an opportunity to try to select a state-of-the-art architecture that incorporates these changes so each researcher can compare with his or her model. In addition, post-RUL reasoning and the use of distributed computing with cloud technology is presented, which will potentially improve the classification accuracy in maintenance activities. Deep learning will undoubtedly prove to have a major impact in upgrading companies at the lowest cost in the new industrial revolution, Industry 4.0.
Vibration Analysis of Industrial Drive for Broken Bearing Detection Using Pro...IAES-IJPEDS
The document describes a proposed Biorthogonal Posterior Vibration Signal-Data Probabilistic Wavelet Neural Network (BPPVS-WNN) system for detecting broken bearings in induction motors used in industrial drives. The system uses biorthogonal wavelet transform on vibration signal data to localize time and frequency domains and identify transient disturbances. It extracts detailed coefficients up to the fifth derivative form. A Posterior Probabilistic Neural Network then detects fault levels faster using the fifth derivative, achieving detection at a constant frequency with minimal execution time. The system aims to reduce current flow and identify faults earlier compared to existing methods. An experiment using Simulink detects healthy and unhealthy motors based on fault detection rate, current flow rate,
This document summarizes a 3-month internship project involving programming tasks and plant visits. The internship focused on developing MATLAB applications for vibration data acquisition and analysis. Two key projects involved (1) creating a module to display real-time shaft centerline movement from proximity probe data and (2) designing a low-cost webcam-based system to remotely monitor analog meters. The webcam system used computer vision techniques to identify needle positions in images and calculate readings, allowing remote monitoring of meters located up to 100 meters from the control room. Overall, the internship provided hands-on engineering experience in condition monitoring.
This document describes a low-cost system for detecting cracks in railway tracks using an Arduino microcontroller, LEDs, LDRs, a GPS module, and a GSM module. The system works by placing an LED on one side of the tracks and an LDR on the other. In the presence of a crack, light from the LED will be obstructed from reaching the LDR, causing its resistance to change. This resistance change is detected by the Arduino. If a crack is detected, the GPS module provides location data and the GSM module sends an SMS alert with the coordinates to relevant authorities. The system aims to provide a low-cost solution for continuous, unmanned rail track monitoring.
A Survey on Ultrasound Beamforming StrategiesIRJET Journal
This document summarizes different strategies for ultrasound beamforming. Beamforming is the crucial step in ultrasound imaging where sound waves are focused on a specific point or area. The strategies are different in aspects like the type of signals used, imaging region size, time and computational costs. Several strategies are discussed including plane wave beamforming using the Fourier transform, software-based beamforming using data compression techniques, and FPGA-based modular digital beamforming. Beamforming strategies also differ in image resolution, information loss, and ability to reduce clutter from unwanted signals. Strict timing architectures can guarantee timing coherence for applications like ultrasound beamforming.
Conditioning Monitoring of Gearbox Using Different Methods: A ReviewIJMER
Gears are important element in a variety of industrial applications such as machine tool
and gearboxes. An unexpected failure of the gear may cause significant economic losses. For that
reason, fault diagnosis in gears has been the subject of intensive research. Vibration signal analysis
has been widely used in the fault detection of rotation machinery. Fault diagnosis plays an important
role in condition monitoring to enhance the machine time. In view of this, the present investigation
focused on the development of Fault diagnosis system of gearboxes based on the vibration signatures
and Artificial Neural Networks. In the present investigation to generate the vibration signatures an
experimental set-up has been fabricated with sensing and measuring equipment. The prominent faults,
wear, crack, broken tooth and insufficient lubrication of the gear were practically induced in the
present investigation. Vibration signatures of the gearbox were collected by transmitting the motion at
constant speed with gears having no fault, without applying any load. By inducing one fault at a time,
vibration signatures were collected with different degrees of wear on a gear tooth, a gear with a
broken tooth, tooth with crack and with insufficient lubrication. As the vibration data of maximum
amplitudes was found to be inseparable, fault diagnosis based on this data was not possible. Five
prominent statistical features were extracted based on data pertaining to maximum amplitudes of
vibration and used fault diagnosis. Due overlapping of this data, it was decided to use ANN based
fault diagnosis system for the present investigation. The set of statistical features were extracted based
on data pertaining to maximum amplitudes of vibration and used them as input parameters to the
ANN based fault diagnosis system designed.
IRJET- In-Situ Monitoring for Fatigue Crack Detection using Control System an...IRJET Journal
This document describes a proposed system for automatic in-situ monitoring of specimens to detect fatigue cracks using image processing and a control system. A camera and Raspberry Pi controller are used to capture images of a specimen under cyclic loading. An image processing algorithm analyzes the images to identify any cracks present based on area and isolate the crack from the background. The algorithm then measures the dimensions of detected cracks. The goal is to alert the user as soon as a crack is found and display the crack dimensions to reduce manual inspection time during fatigue testing. A literature review discusses previous research on fatigue crack detection using techniques like vibration analysis, stroboscopic illumination, and digital image correlation.
Report on Fault Diagnosis of Ball Bearing SystemSridhara R
This document discusses fault detection in bearings using signal processing in MATLAB. It begins with an introduction that outlines common bearing faults like improper design, manufacturing, lubrication, or overloading. Chapter 1 then discusses using vibration monitoring and condition monitoring techniques like principal component analysis to detect faults early. Chapter 2 reviews literature on fault diagnosis techniques including using wavelet transforms, artificial intelligence, and simulation. Chapter 3 defines common bearing fault problems like excessive loads, overheating, loose fits, roller ball faults, and inner race faults. Chapter 4 then outlines the methodology used, which involves building a test rig with an electric motor, pulleys, belts, and accelerometers to collect vibration data from bearings under different fault conditions.
Fault Detection and Condition Monitoring of Rolling Contact Bearings using Vi...IRJET Journal
1) The document discusses using vibration signature analysis to detect faults in rolling contact bearings. An experimental setup was developed to induce defects into bearings and acquire vibration signals.
2) Kurtosis value and continuous wavelet transforms were used to analyze the signals in the time and time-frequency domains. Higher kurtosis values indicate bearing defects. Vibration signatures were found to be unique for different defect types, allowing detection and condition monitoring.
3) A literature review found that vibration analysis is better than acoustic analysis for bearing fault detection. Time-frequency domain techniques like wavelet transforms are effective for both stationary and non-stationary signals in identifying weak fault signals. Statistical parameters and defect frequencies can provide indications of bearing
Artificial Neural Network Based Graphical User Interface for Estimation of Fa...ijsrd.com
This document describes the development of an artificial neural network (ANN) and graphical user interface (GUI) to estimate fabrication time in rig construction projects. The ANN was trained on data from 960 completed fabrication jobs. It uses height, plate thickness, and inspection criteria as inputs to predict fabrication time in days as the output. Eleven different ANN architectures were tested and the model with 3 input nodes, 50 hidden nodes, and 1 output node performed best with a mean squared error of 1.35337e-2. A GUI was created allowing users to input job parameters and receive a fabrication time prediction without ANN expertise. The developed ANN and GUI provide a data-driven method for fabrication time estimation in rig construction project
Artificial Neural Network Based Graphical User Interface for Estimation of Fa...ijsrd.com
This document describes research using an artificial neural network (ANN) and graphical user interface (GUI) to estimate fabrication time for rig construction projects. The ANN was trained on data from 960 completed fabrication jobs. It used 3 input parameters (height, plate thickness, inspection criteria) to predict fabrication time in days. Eleven different ANN architectures were tested, with a 3-50-1 network achieving the best results. A GUI was developed to allow users to simulate the best ANN model without ANN expertise and obtain fabrication time estimates. The research demonstrated the potential of ANNs for construction time forecasting when extensive historical data is available.
1. The document discusses a method for detecting distracted drivers using computer vision and machine learning techniques. It proposes using a convolutional neural network (CNN), specifically modifying the VGG-16 architecture, to classify images and identify different types of driver distractions or safe driving behaviors.
2. The CNN would take images of the driver as input to extract features, which would then be classified by the network to determine if the driver is distracted or driving safely. The researchers evaluated their proposed system using the StateFarm distracted driver detection dataset.
3. Previous work on detecting distracted driving is discussed, including using features like hands, face, and mouth to identify cell phone use, as well as developing datasets and classifiers to detect other dist
Optimizing Facility Layout Through SimulationIRJET Journal
This document discusses optimizing facility layout through simulation. It begins by introducing the research goals, which include developing a simulation model of a manufacturing system using Arena software, analyzing system behavior under different conditions, identifying problems, and suggesting improvements. The document then provides details on the data collection and analysis methods used to build an accurate simulation model. Primary, secondary, and tertiary data were gathered on factors like processing times and resource availability. The document explains how Arena was used to create a model of the manufacturing system and analyze key performance indicators. The overall aim was to evaluate different layout options and identify ways to optimize efficiency and productivity.
IRJET- Automated Measurement of AVR Feature in Fundus Images using Image ...IRJET Journal
This document summarizes an ongoing project to automatically measure the arteriolar-to-venular ratio (AVR) in fundus images using image processing and machine learning techniques. The project involves six main stages: preprocessing, vessel segmentation, region of interest detection, vessel width measurement, vessel classification into arteries and veins, and AVR calculation. So far, the team has completed the first four stages using image processing in MATLAB and Python. They are now working on the vessel classification stage, evaluating both unsupervised k-means clustering and supervised naive Bayes classification approaches. The goal of the project is to develop a fully automated method without any user input to accurately measure AVR, which is important for predicting cardiovascular and other diseases.
This document summarizes a study that used artificial neural networks (ANN) and fuzzy logic to predict drill wear. Acoustic emission, vibration velocity, and drill chatter measured via machine vision were used as input parameters. An ANN using backpropagation and a Mamdani fuzzy inference system were developed and compared. When predicting actual tool wear measured with a microscope, the ANN model produced better correlations and is selected for wear prediction under the present work conditions.
Predictive maintenance of rotational machinery using deep learningIJECEIAES
This paper describes an implementation of a deep learning-based predictive maintenance (PdM) system for industrial rotational machinery, built upon the foundation of a long short-term memory (LSTM) autoencoder and regression analysis. The autoencoder identifies anomalous patterns, while the latter, based on the autoencoder’s output, estimates the machine’s remaining useful life (RUL). Unlike prior PdM systems dependent on labelled historical data, the developed system doesn’t require it as it’s based on an unsupervised deep learning model, enhancing its adaptability. The paper also explores a robust condition monitoring system that collects machine operational data, including vibration and current parameters, and transmits them to a database via a Bluetooth low energy (BLE) network. Additionally, the study demonstrates the integration of this PdM system within a web-based framework, promoting its adoption across various industrial settings. Tests confirm the system's ability to accurately identify faults, highlighting its potential to reduce unexpected downtime and enhance machinery reliability.
This document describes modeling and simulation of a shock absorber test rig. It discusses the need to minimize vibrations in vehicles for improved comfort and handling. The objective is to design a universal process to measure shock absorber performance through mathematical modeling, FEM analysis, physical tests on a test rig, and Python analysis. The test rig was designed to test shock absorbers up to 3000kg with 40mm of displacement. Simulation results from Python analysis showed displacement and load values within 3-15% of physical test rig values, demonstrating the model's ability to simulate the rig. The conclusion is that this process can help evaluate damping system performance for research and development.
This document discusses a study that uses a hybrid CNN-LSTM attention model with quantile regression to predict faults in electrical machines by analyzing time series sensor data. The model aims to better manage uncertainties in the data compared to traditional models. Researchers collected vibration data from sensors on a real electrical machine measuring variations in three axes. They preprocessed the data using empirical wavelet transform and Savitzky-Golay filtering to extract relevant features and reduce noise. The hybrid deep learning model was trained on this data and used with quantile regression and anomaly detection algorithms to predict faults and provide probability levels to machine operators. The study aims to help optimize maintenance scheduling and improve electrical machine performance.
TONGUE DRIVE SYSTEM (TDS) OPERATED PATIENT FRIENDLY WHEEL CHAIRIJARIDEA Journal
This document describes a tongue-operated wheelchair system called the Tongue Drive System (TDS). The system uses a small permanent magnet attached to the tongue and Hall effect sensors mounted outside the mouth to detect tongue movements. When the tongue moves toward a sensor, it sends a signal to the wheelchair's microcontroller to move in a corresponding direction. The system is intended to allow disabled individuals to freely move about using only their tongue. It discusses the design of the TDS and wheelchair, including the hardware components like sensors, motors, and microcontrollers. The goal is to provide a low-cost assistive technology solution to help disabled people navigate and complete daily tasks.
The document discusses stress analysis and durability studies of spur gears using finite element analysis tools. It outlines how FEA can be used to model contact stresses and bending stresses in gears to better understand gear failure from factors like pitting. The analysis aims to reduce transmission error and thereby noise generated by more accurately predicting stresses, stiffness, and life of gears.
Gesture Recognition System using Computer VisionIRJET Journal
This document presents a gesture recognition system using computer vision and convolutional neural networks. It discusses developing classifiers to recognize hand gestures and facial expressions. A dataset of 87,000 images is used to train models to classify 26 letters of the American Sign Language alphabet, as well as additional classes for space, delete and nothing. The models are trained using transfer learning with MobileNet, achieving validation accuracies of over 90% for hand gesture classification and implementing a system that recognizes and translates gestures in real-time. It concludes the paper developed robust models for American Sign Language translation and facial expression recognition using CNNs.
Advanced control scheme of doubly fed induction generator for wind turbine us...IJECEIAES
This paper describes a speed control device for generating electrical energy on an electricity network based on the doubly fed induction generator (DFIG) used for wind power conversion systems. At first, a double-fed induction generator model was constructed. A control law is formulated to govern the flow of energy between the stator of a DFIG and the energy network using three types of controllers: proportional integral (PI), sliding mode controller (SMC) and second order sliding mode controller (SOSMC). Their different results in terms of power reference tracking, reaction to unexpected speed fluctuations, sensitivity to perturbations, and resilience against machine parameter alterations are compared. MATLAB/Simulink was used to conduct the simulations for the preceding study. Multiple simulations have shown very satisfying results, and the investigations demonstrate the efficacy and power-enhancing capabilities of the suggested control system.
Vibration Analysis of Industrial Drive for Broken Bearing Detection Using Pro...IAES-IJPEDS
The document describes a proposed Biorthogonal Posterior Vibration Signal-Data Probabilistic Wavelet Neural Network (BPPVS-WNN) system for detecting broken bearings in induction motors used in industrial drives. The system uses biorthogonal wavelet transform on vibration signal data to localize time and frequency domains and identify transient disturbances. It extracts detailed coefficients up to the fifth derivative form. A Posterior Probabilistic Neural Network then detects fault levels faster using the fifth derivative, achieving detection at a constant frequency with minimal execution time. The system aims to reduce current flow and identify faults earlier compared to existing methods. An experiment using Simulink detects healthy and unhealthy motors based on fault detection rate, current flow rate,
This document summarizes a 3-month internship project involving programming tasks and plant visits. The internship focused on developing MATLAB applications for vibration data acquisition and analysis. Two key projects involved (1) creating a module to display real-time shaft centerline movement from proximity probe data and (2) designing a low-cost webcam-based system to remotely monitor analog meters. The webcam system used computer vision techniques to identify needle positions in images and calculate readings, allowing remote monitoring of meters located up to 100 meters from the control room. Overall, the internship provided hands-on engineering experience in condition monitoring.
This document describes a low-cost system for detecting cracks in railway tracks using an Arduino microcontroller, LEDs, LDRs, a GPS module, and a GSM module. The system works by placing an LED on one side of the tracks and an LDR on the other. In the presence of a crack, light from the LED will be obstructed from reaching the LDR, causing its resistance to change. This resistance change is detected by the Arduino. If a crack is detected, the GPS module provides location data and the GSM module sends an SMS alert with the coordinates to relevant authorities. The system aims to provide a low-cost solution for continuous, unmanned rail track monitoring.
A Survey on Ultrasound Beamforming StrategiesIRJET Journal
This document summarizes different strategies for ultrasound beamforming. Beamforming is the crucial step in ultrasound imaging where sound waves are focused on a specific point or area. The strategies are different in aspects like the type of signals used, imaging region size, time and computational costs. Several strategies are discussed including plane wave beamforming using the Fourier transform, software-based beamforming using data compression techniques, and FPGA-based modular digital beamforming. Beamforming strategies also differ in image resolution, information loss, and ability to reduce clutter from unwanted signals. Strict timing architectures can guarantee timing coherence for applications like ultrasound beamforming.
Conditioning Monitoring of Gearbox Using Different Methods: A ReviewIJMER
Gears are important element in a variety of industrial applications such as machine tool
and gearboxes. An unexpected failure of the gear may cause significant economic losses. For that
reason, fault diagnosis in gears has been the subject of intensive research. Vibration signal analysis
has been widely used in the fault detection of rotation machinery. Fault diagnosis plays an important
role in condition monitoring to enhance the machine time. In view of this, the present investigation
focused on the development of Fault diagnosis system of gearboxes based on the vibration signatures
and Artificial Neural Networks. In the present investigation to generate the vibration signatures an
experimental set-up has been fabricated with sensing and measuring equipment. The prominent faults,
wear, crack, broken tooth and insufficient lubrication of the gear were practically induced in the
present investigation. Vibration signatures of the gearbox were collected by transmitting the motion at
constant speed with gears having no fault, without applying any load. By inducing one fault at a time,
vibration signatures were collected with different degrees of wear on a gear tooth, a gear with a
broken tooth, tooth with crack and with insufficient lubrication. As the vibration data of maximum
amplitudes was found to be inseparable, fault diagnosis based on this data was not possible. Five
prominent statistical features were extracted based on data pertaining to maximum amplitudes of
vibration and used fault diagnosis. Due overlapping of this data, it was decided to use ANN based
fault diagnosis system for the present investigation. The set of statistical features were extracted based
on data pertaining to maximum amplitudes of vibration and used them as input parameters to the
ANN based fault diagnosis system designed.
IRJET- In-Situ Monitoring for Fatigue Crack Detection using Control System an...IRJET Journal
This document describes a proposed system for automatic in-situ monitoring of specimens to detect fatigue cracks using image processing and a control system. A camera and Raspberry Pi controller are used to capture images of a specimen under cyclic loading. An image processing algorithm analyzes the images to identify any cracks present based on area and isolate the crack from the background. The algorithm then measures the dimensions of detected cracks. The goal is to alert the user as soon as a crack is found and display the crack dimensions to reduce manual inspection time during fatigue testing. A literature review discusses previous research on fatigue crack detection using techniques like vibration analysis, stroboscopic illumination, and digital image correlation.
Report on Fault Diagnosis of Ball Bearing SystemSridhara R
This document discusses fault detection in bearings using signal processing in MATLAB. It begins with an introduction that outlines common bearing faults like improper design, manufacturing, lubrication, or overloading. Chapter 1 then discusses using vibration monitoring and condition monitoring techniques like principal component analysis to detect faults early. Chapter 2 reviews literature on fault diagnosis techniques including using wavelet transforms, artificial intelligence, and simulation. Chapter 3 defines common bearing fault problems like excessive loads, overheating, loose fits, roller ball faults, and inner race faults. Chapter 4 then outlines the methodology used, which involves building a test rig with an electric motor, pulleys, belts, and accelerometers to collect vibration data from bearings under different fault conditions.
Fault Detection and Condition Monitoring of Rolling Contact Bearings using Vi...IRJET Journal
1) The document discusses using vibration signature analysis to detect faults in rolling contact bearings. An experimental setup was developed to induce defects into bearings and acquire vibration signals.
2) Kurtosis value and continuous wavelet transforms were used to analyze the signals in the time and time-frequency domains. Higher kurtosis values indicate bearing defects. Vibration signatures were found to be unique for different defect types, allowing detection and condition monitoring.
3) A literature review found that vibration analysis is better than acoustic analysis for bearing fault detection. Time-frequency domain techniques like wavelet transforms are effective for both stationary and non-stationary signals in identifying weak fault signals. Statistical parameters and defect frequencies can provide indications of bearing
Artificial Neural Network Based Graphical User Interface for Estimation of Fa...ijsrd.com
This document describes the development of an artificial neural network (ANN) and graphical user interface (GUI) to estimate fabrication time in rig construction projects. The ANN was trained on data from 960 completed fabrication jobs. It uses height, plate thickness, and inspection criteria as inputs to predict fabrication time in days as the output. Eleven different ANN architectures were tested and the model with 3 input nodes, 50 hidden nodes, and 1 output node performed best with a mean squared error of 1.35337e-2. A GUI was created allowing users to input job parameters and receive a fabrication time prediction without ANN expertise. The developed ANN and GUI provide a data-driven method for fabrication time estimation in rig construction project
Artificial Neural Network Based Graphical User Interface for Estimation of Fa...ijsrd.com
This document describes research using an artificial neural network (ANN) and graphical user interface (GUI) to estimate fabrication time for rig construction projects. The ANN was trained on data from 960 completed fabrication jobs. It used 3 input parameters (height, plate thickness, inspection criteria) to predict fabrication time in days. Eleven different ANN architectures were tested, with a 3-50-1 network achieving the best results. A GUI was developed to allow users to simulate the best ANN model without ANN expertise and obtain fabrication time estimates. The research demonstrated the potential of ANNs for construction time forecasting when extensive historical data is available.
1. The document discusses a method for detecting distracted drivers using computer vision and machine learning techniques. It proposes using a convolutional neural network (CNN), specifically modifying the VGG-16 architecture, to classify images and identify different types of driver distractions or safe driving behaviors.
2. The CNN would take images of the driver as input to extract features, which would then be classified by the network to determine if the driver is distracted or driving safely. The researchers evaluated their proposed system using the StateFarm distracted driver detection dataset.
3. Previous work on detecting distracted driving is discussed, including using features like hands, face, and mouth to identify cell phone use, as well as developing datasets and classifiers to detect other dist
Optimizing Facility Layout Through SimulationIRJET Journal
This document discusses optimizing facility layout through simulation. It begins by introducing the research goals, which include developing a simulation model of a manufacturing system using Arena software, analyzing system behavior under different conditions, identifying problems, and suggesting improvements. The document then provides details on the data collection and analysis methods used to build an accurate simulation model. Primary, secondary, and tertiary data were gathered on factors like processing times and resource availability. The document explains how Arena was used to create a model of the manufacturing system and analyze key performance indicators. The overall aim was to evaluate different layout options and identify ways to optimize efficiency and productivity.
IRJET- Automated Measurement of AVR Feature in Fundus Images using Image ...IRJET Journal
This document summarizes an ongoing project to automatically measure the arteriolar-to-venular ratio (AVR) in fundus images using image processing and machine learning techniques. The project involves six main stages: preprocessing, vessel segmentation, region of interest detection, vessel width measurement, vessel classification into arteries and veins, and AVR calculation. So far, the team has completed the first four stages using image processing in MATLAB and Python. They are now working on the vessel classification stage, evaluating both unsupervised k-means clustering and supervised naive Bayes classification approaches. The goal of the project is to develop a fully automated method without any user input to accurately measure AVR, which is important for predicting cardiovascular and other diseases.
This document summarizes a study that used artificial neural networks (ANN) and fuzzy logic to predict drill wear. Acoustic emission, vibration velocity, and drill chatter measured via machine vision were used as input parameters. An ANN using backpropagation and a Mamdani fuzzy inference system were developed and compared. When predicting actual tool wear measured with a microscope, the ANN model produced better correlations and is selected for wear prediction under the present work conditions.
Predictive maintenance of rotational machinery using deep learningIJECEIAES
This paper describes an implementation of a deep learning-based predictive maintenance (PdM) system for industrial rotational machinery, built upon the foundation of a long short-term memory (LSTM) autoencoder and regression analysis. The autoencoder identifies anomalous patterns, while the latter, based on the autoencoder’s output, estimates the machine’s remaining useful life (RUL). Unlike prior PdM systems dependent on labelled historical data, the developed system doesn’t require it as it’s based on an unsupervised deep learning model, enhancing its adaptability. The paper also explores a robust condition monitoring system that collects machine operational data, including vibration and current parameters, and transmits them to a database via a Bluetooth low energy (BLE) network. Additionally, the study demonstrates the integration of this PdM system within a web-based framework, promoting its adoption across various industrial settings. Tests confirm the system's ability to accurately identify faults, highlighting its potential to reduce unexpected downtime and enhance machinery reliability.
This document describes modeling and simulation of a shock absorber test rig. It discusses the need to minimize vibrations in vehicles for improved comfort and handling. The objective is to design a universal process to measure shock absorber performance through mathematical modeling, FEM analysis, physical tests on a test rig, and Python analysis. The test rig was designed to test shock absorbers up to 3000kg with 40mm of displacement. Simulation results from Python analysis showed displacement and load values within 3-15% of physical test rig values, demonstrating the model's ability to simulate the rig. The conclusion is that this process can help evaluate damping system performance for research and development.
This document discusses a study that uses a hybrid CNN-LSTM attention model with quantile regression to predict faults in electrical machines by analyzing time series sensor data. The model aims to better manage uncertainties in the data compared to traditional models. Researchers collected vibration data from sensors on a real electrical machine measuring variations in three axes. They preprocessed the data using empirical wavelet transform and Savitzky-Golay filtering to extract relevant features and reduce noise. The hybrid deep learning model was trained on this data and used with quantile regression and anomaly detection algorithms to predict faults and provide probability levels to machine operators. The study aims to help optimize maintenance scheduling and improve electrical machine performance.
TONGUE DRIVE SYSTEM (TDS) OPERATED PATIENT FRIENDLY WHEEL CHAIRIJARIDEA Journal
This document describes a tongue-operated wheelchair system called the Tongue Drive System (TDS). The system uses a small permanent magnet attached to the tongue and Hall effect sensors mounted outside the mouth to detect tongue movements. When the tongue moves toward a sensor, it sends a signal to the wheelchair's microcontroller to move in a corresponding direction. The system is intended to allow disabled individuals to freely move about using only their tongue. It discusses the design of the TDS and wheelchair, including the hardware components like sensors, motors, and microcontrollers. The goal is to provide a low-cost assistive technology solution to help disabled people navigate and complete daily tasks.
The document discusses stress analysis and durability studies of spur gears using finite element analysis tools. It outlines how FEA can be used to model contact stresses and bending stresses in gears to better understand gear failure from factors like pitting. The analysis aims to reduce transmission error and thereby noise generated by more accurately predicting stresses, stiffness, and life of gears.
Gesture Recognition System using Computer VisionIRJET Journal
This document presents a gesture recognition system using computer vision and convolutional neural networks. It discusses developing classifiers to recognize hand gestures and facial expressions. A dataset of 87,000 images is used to train models to classify 26 letters of the American Sign Language alphabet, as well as additional classes for space, delete and nothing. The models are trained using transfer learning with MobileNet, achieving validation accuracies of over 90% for hand gesture classification and implementing a system that recognizes and translates gestures in real-time. It concludes the paper developed robust models for American Sign Language translation and facial expression recognition using CNNs.
Advanced control scheme of doubly fed induction generator for wind turbine us...IJECEIAES
This paper describes a speed control device for generating electrical energy on an electricity network based on the doubly fed induction generator (DFIG) used for wind power conversion systems. At first, a double-fed induction generator model was constructed. A control law is formulated to govern the flow of energy between the stator of a DFIG and the energy network using three types of controllers: proportional integral (PI), sliding mode controller (SMC) and second order sliding mode controller (SOSMC). Their different results in terms of power reference tracking, reaction to unexpected speed fluctuations, sensitivity to perturbations, and resilience against machine parameter alterations are compared. MATLAB/Simulink was used to conduct the simulations for the preceding study. Multiple simulations have shown very satisfying results, and the investigations demonstrate the efficacy and power-enhancing capabilities of the suggested control system.
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Sinan KOZAK
Sinan from the Delivery Hero mobile infrastructure engineering team shares a deep dive into performance acceleration with Gradle build cache optimizations. Sinan shares their journey into solving complex build-cache problems that affect Gradle builds. By understanding the challenges and solutions found in our journey, we aim to demonstrate the possibilities for faster builds. The case study reveals how overlapping outputs and cache misconfigurations led to significant increases in build times, especially as the project scaled up with numerous modules using Paparazzi tests. The journey from diagnosing to defeating cache issues offers invaluable lessons on maintaining cache integrity without sacrificing functionality.
International Conference on NLP, Artificial Intelligence, Machine Learning an...gerogepatton
International Conference on NLP, Artificial Intelligence, Machine Learning and Applications (NLAIM 2024) offers a premier global platform for exchanging insights and findings in the theory, methodology, and applications of NLP, Artificial Intelligence, Machine Learning, and their applications. The conference seeks substantial contributions across all key domains of NLP, Artificial Intelligence, Machine Learning, and their practical applications, aiming to foster both theoretical advancements and real-world implementations. With a focus on facilitating collaboration between researchers and practitioners from academia and industry, the conference serves as a nexus for sharing the latest developments in the field.
ACEP Magazine edition 4th launched on 05.06.2024Rahul
This document provides information about the third edition of the magazine "Sthapatya" published by the Association of Civil Engineers (Practicing) Aurangabad. It includes messages from current and past presidents of ACEP, memories and photos from past ACEP events, information on life time achievement awards given by ACEP, and a technical article on concrete maintenance, repairs and strengthening. The document highlights activities of ACEP and provides a technical educational article for members.
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...University of Maribor
Slides from talk presenting:
Aleš Zamuda: Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapter and Networking.
Presentation at IcETRAN 2024 session:
"Inter-Society Networking Panel GRSS/MTT-S/CIS
Panel Session: Promoting Connection and Cooperation"
IEEE Slovenia GRSS
IEEE Serbia and Montenegro MTT-S
IEEE Slovenia CIS
11TH INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONIC AND COMPUTING ENGINEERING
3-6 June 2024, Niš, Serbia
Comparative analysis between traditional aquaponics and reconstructed aquapon...bijceesjournal
The aquaponic system of planting is a method that does not require soil usage. It is a method that only needs water, fish, lava rocks (a substitute for soil), and plants. Aquaponic systems are sustainable and environmentally friendly. Its use not only helps to plant in small spaces but also helps reduce artificial chemical use and minimizes excess water use, as aquaponics consumes 90% less water than soil-based gardening. The study applied a descriptive and experimental design to assess and compare conventional and reconstructed aquaponic methods for reproducing tomatoes. The researchers created an observation checklist to determine the significant factors of the study. The study aims to determine the significant difference between traditional aquaponics and reconstructed aquaponics systems propagating tomatoes in terms of height, weight, girth, and number of fruits. The reconstructed aquaponics system’s higher growth yield results in a much more nourished crop than the traditional aquaponics system. It is superior in its number of fruits, height, weight, and girth measurement. Moreover, the reconstructed aquaponics system is proven to eliminate all the hindrances present in the traditional aquaponics system, which are overcrowding of fish, algae growth, pest problems, contaminated water, and dead fish.
Using recycled concrete aggregates (RCA) for pavements is crucial to achieving sustainability. Implementing RCA for new pavement can minimize carbon footprint, conserve natural resources, reduce harmful emissions, and lower life cycle costs. Compared to natural aggregate (NA), RCA pavement has fewer comprehensive studies and sustainability assessments.
Embedded machine learning-based road conditions and driving behavior monitoringIJECEIAES
Car accident rates have increased in recent years, resulting in losses in human lives, properties, and other financial costs. An embedded machine learning-based system is developed to address this critical issue. The system can monitor road conditions, detect driving patterns, and identify aggressive driving behaviors. The system is based on neural networks trained on a comprehensive dataset of driving events, driving styles, and road conditions. The system effectively detects potential risks and helps mitigate the frequency and impact of accidents. The primary goal is to ensure the safety of drivers and vehicles. Collecting data involved gathering information on three key road events: normal street and normal drive, speed bumps, circular yellow speed bumps, and three aggressive driving actions: sudden start, sudden stop, and sudden entry. The gathered data is processed and analyzed using a machine learning system designed for limited power and memory devices. The developed system resulted in 91.9% accuracy, 93.6% precision, and 92% recall. The achieved inference time on an Arduino Nano 33 BLE Sense with a 32-bit CPU running at 64 MHz is 34 ms and requires 2.6 kB peak RAM and 139.9 kB program flash memory, making it suitable for resource-constrained embedded systems.
Batteries -Introduction – Types of Batteries – discharging and charging of battery - characteristics of battery –battery rating- various tests on battery- – Primary battery: silver button cell- Secondary battery :Ni-Cd battery-modern battery: lithium ion battery-maintenance of batteries-choices of batteries for electric vehicle applications.
Fuel Cells: Introduction- importance and classification of fuel cells - description, principle, components, applications of fuel cells: H2-O2 fuel cell, alkaline fuel cell, molten carbonate fuel cell and direct methanol fuel cells.
2. An optimal deep sparse autoencoder with gated recurrent
unit for rolling bearing fault diagnosis
Ke Zhao, Hongkai Jiang1
, Xingqiu Li, Ruixin Wang
School of Aeronautics, Northwestern Polytechnical University, 710072 Xi’an, China
Abstract: Effective fault diagnosis of rolling bearings are of great importance in guaranteeing the
normal operation of rotating machinery. However, the measured rolling bearing vibration signals
are highly nonlinear and interrupted by background noise, making it hard to obtain the representative
fault features. Based on this, an optimal fault diagnosis method is proposed to accurately and
steadily diagnose the rolling bearing faults in this paper. The proposed method mainly contains the
following stages. Firstly, gated recurrent unit and sparse autoencoder are constructed as a novel
hybrid deep learning model to directly and effectively mine the fault information of rolling bearing
vibration signals. Secondly, the key parameters of the constructed model are optimized by grey wolf
optimizer algorithm to achieve better diagnosis performance. Finally, the features obtained by the
constructed model are input into the classifier to get the final diagnosis results. The proposed method
is validated using the experimental and practical engineering bearing data and the results confirm
the diagnosis performance of the developed method is more effective and robust than other methods.
Keywords: Rolling bearing fault diagnosis; Hybrid deep learning model; Gated recurrent unit;
Sparse autoencoder; Grey wolf optimizer
1. Introduction
With the rapid development of society and technology, the worsening working environment
and increasing working hours lead to various failures of rotating machinery [1]. As the key part of
rotating machinery, the failures of rolling bearing may result in immeasurable losses and
catastrophic damage. However, the difficulties in rolling bearing fault diagnosis are mainly caused
by high-intensity working conditions, and the vibration characteristics of rolling bearing are affected
by local defects, including edge shape and size [2-4]. Consequently, accurate and stable diagnosis
of rolling bearing faults are realistic and urgent in practical engineering.
For decades, vibration mechanism analysis has played a major role in machinery fault
diagnosis [5-7]. The machine equipment are becoming increasingly complex, and the measured
bearing vibration signals are highly nonlinear and non-stationary with much noise. Therefore, how
to effectively obtain the fault features from the measured bearing vibration signals is the crux of
bearing fault diagnosis [8]. Currently, intelligent diagnosis methods have been widely used in rolling
bearings for the advantages of non-requirement for abundant expertise and automatically presenting
diagnosis results [9, 10]. Artificial neural network (ANN) and support vector machine (SVM) are
two most prevalent intelligent diagnosis methods in bearing fault diagnosis [11]. Unal et al.
extracted the features of vibration signals with Hilbert Transform and then used artificial neural
network (ANN) to classify the processed features [12]. Zarei et al. obtained the domain features of
bearing data and applied artificial neural network (ANN) to get the diagnosis results [13]. Yan et al.
captured the multi-domain features of bearing signals and developed optimized support vector
1 Corresponding author
Email address: jianghk@nwpu.edu.cn
Page 1 of 23 AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
3. machine (SVM) to detect faults [14]. Zheng et al. extracted the features of raw vibration signals and
used support vector machine (SVM) to classify bearing fault conditions [15]. To summarize, ANN
and SVM have made progress in the field of bearing fault diagnosis, but the raw vibration signals
need to extract representative features and the extracted features need to select sensitive features
before diagnosis. Moreover, the diagnosis accuracy heavily relies on the extracted and selected
features. However, it needs advanced signal processing techniques to extract the features of raw
vibration signals and the quality of the selected features is very dependent on engineering experience
[16, 17]. All of these mentioned factors limit the wide application of ANN an SVM, so it is essential
to develop a novel method that could directly and effectively extracts the fault features of raw
vibration signals and doesn’t require abundant engineering experience.
Deep learning has been the focus of research in recent years, assembling multi-layer data
processing units into deep architectures to extract multiple levels of data abstraction [18]. So far
deep learning has made great achievements in natural language processing, speech recognition,
medical image analysis and so on [19-21]. In other words, deep learning methods hold great
potential to get rid of the reliance on various advanced signal processing techniques and manual
feature extraction. [22]. Up to now, autoencoder (AE), convolutional neural network (CNN), deep
belief network (DBN) and recurrent neural network (RNN) are four most commonly used deep
learning methods. Shao et al. constructed a novel deep autoencoder (AE) for rotating machinery
fault diagnosis [23]. Meng et al. proposed an enhancement denoising autoencoder (AE) for rolling
bearing fault diagnosis [24]. Lu et al. designed a hierarchical convolutional neural network (CNN)
for rolling bearing fault diagnosis [25]. Huang et al. adopted an improved convolutional neural
network (CNN) for bearing diagnosis [26]. Shao et al. applied a deep belief network (DBN) with
dual-tree complex wavelet packet for bearing fault diagnosis [27]. Tang et al. developed an adaptive
deep belief network (DBN) for rotating machinery fault diagnosis [28]. Jiang et al. used recurrent
neural network (RNN) to classify bearing fault conditions [29]. Zhao et al. proposed a novel
recurrent neural network (RNN) for machine health monitoring [30]. According to the above
literature review, the previous fault diagnosis methods mainly focus on single deep learning models.
However, single deep learning models are hard to deal with increasingly complex diagnosis issues.
Thus, advanced signal processing techniques or some other model improvement methods are
essential for single deep learning models. This paper is devoted to developing a novel method to
tackle the increasingly complex diagnosis issues, which only focuses on deep learning models,
without considering advanced signal processing techniques and model improvement methods. Up
to now, AE has been the most prevalent deep learning method for rolling bearing fault diagnosis
because of its simple structure, easy to expand and powerful feature learning ability [31]. Sparse
autoencoder (SAE) as a variant of AE could learns more robust feature representations than basic
AE [32]. The measured bearing vibration signals are time series data, and gated recurrent unit (GRU)
as a novel variant of RNN shows extraordinary ability in extracting the time relevance of sequential
signals [33]. Thus, to maximize the advantages of GRU and SAE, a hybrid deep learning model that
combines GRU and SAE is constructed in this paper. GRU is first used for extracting the features
of bearing sequential signals, and then the extracted features are input into SAE to obtain more
robust feature representations. At last, the robust features are input into the classifier to obtain the
final diagnosis results. As all is known, the process of tuning parameters for deep learning models
is really a time-consuming and laborious work, so it is essential and meaningful to obtain the key
parameters of the constructed model automatically and quickly [34]. Grey wolf optimizer (GWO)
Page 2 of 23
AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
4. algorithm is a novel optimization algorithm with flexibility, simplicity, robustness and simple
implementation, and it has been successfully applied to bearing fault diagnosis [35, 36]. Thus, GWO
algorithm is applied to automatically and quickly obtain the key parameters of constructed model.
In this paper, an optimal deep sparse autoencoder (SAE) with gated recurrent unit (GRU) is
proposed for rolling bearing fault diagnosis. The proposed method is validated using the
experimental and practical engineering bearing data and the results confirm the diagnosis
performance of the developed method is more effective and robust than other methods. The main
contributions of our work can be summarized as follows.
(1) Our proposed framework could be regarded as a hybrid model of automatic feature learning
based on deep learning models. The hybrid model is constructed by GRU and SAE to directly
extract the representative fault features of raw vibration signals. Then the extracted fault
features will input into the classifier to obtain the final diagnosis results.
(2) Due to the hybrid model has many parameters to tune, and the tuning parameters process is
really a time-consuming and laborious work. Thus, the key parameters of the hybrid model are
obtained by GWO algorithm to save time and to achieve better diagnosis performance.
(3) Comprehensive experimental studies contain experiment bearing fault diagnosis and practical
engineering bearing fault detection. The effectiveness and generalization capability of the
proposed method have been verified.
The organization of the remainder is as follows: The basic theory of the constructed model is
described in Section 2. Section 3 introduces the proposed method in detail. The proposed model is
verified by the experimental bearing data in Section 4. Section 5 gives the practical engineering
application of the proposed method. The general conclusion is given in Section 6.
2. The basic theory of the constructed model
This part is mainly to illustrate the basic theory of the constructed model. Section 2.1
introduces the basic theory of RNN and Section 2.2 describes the basic theory of GRU. The principle
of AE is illustrated in Section 2.3.
2.1 The basic theory of recurrent neural network
Unlike other deep learning models, RNN builds dependencies between its hidden units by a
directed cycle [37]. In other words, the output of a hidden layer at time t-1 will input into itself at
time t. Fig. 1 (a) shows the basic architecture of RNN and Fig. 1 (b) shows the architecture of RNN
across a time step.
Output layer
Hidden layer
t-1 t
Input layer
(a) (b)
Fig. 1 (a) the basic architecture of RNN, (b) the architecture of RNN across a time step
The above-mentioned procedure is presented mathematically by Eq. (1) and Eq. (2):
Page 3 of 23 AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
5. 𝐻𝑡 = 𝑓𝐻(𝑈𝑖𝐻𝑋𝑡 + 𝑈𝐻𝐻𝐻𝑡−1 + 𝑏𝐻) (1)
𝑦𝐻 = 𝑓0(𝑈𝐻0𝐻𝑡 + 𝑏0) (2)
where 𝑓𝐻 and 𝑓0 are the activation functions of hidden layer and output layer, 𝑈𝑖𝐻, 𝑈𝐻𝐻 and
𝑈𝐻0 are the weight matrixes. 𝑏𝐻 and 𝑏0 are bias vectors of hidden layer and output layer. 𝐻𝑡
and 𝑦𝐻 are the output of hidden layer and output layer.
2.2 The basic theory of GRU
The basic theory of RNN is presented in Section 2.1. RNN has powerful ability in extracting
the time relevance of sequential signals. However, the gradient vanishing or exploding problem
greatly limit the application of RNN [30]. Thus, GRU as a novel variant of RNN, which could solves
the problem by using a gating mechanism, is used for extracting the features of raw vibration signals
in this paper [38]. Fig. 2 shows the structure of GRU.
× +
× 1-
σ σ
tanh
×
Zt
Ht
Ht-1
Rt
Ct
Xt
Fig. 2 The structure of GRU [38].
Known from Fig. 2, the most difference between GRU and RNN is GRU has two gates, reset
gate R and update gate Z. Reset gate relates to how the inputs and the previously stored information
are integrated. Update gate controls the retention of the previously stored information. The formula
is as follows:
𝑍𝑡 = 𝜎(𝑈𝑍𝑋𝑡 + 𝑉𝑍𝐻𝑡−1 + 𝑏𝑍) (3)
𝑅𝑡 = 𝜎(𝑈𝑅𝑋𝑡 + 𝑉𝑅𝐻𝑡−1 + 𝑏𝑅) (4)
𝐶𝑡 = 𝑡𝑎𝑛ℎ(𝑈𝑋𝑡 + 𝑉(𝑅𝑡 𝐻𝑡−1 ) + 𝑏) (5)
𝐻𝑡 = (1 − 𝑍𝑡) 𝐻𝑡−1 + 𝑍𝑡 𝐶𝑡 (6)
where 𝐻𝑡 and 𝐶𝑡 are an activation and a candidate activation at time t. 𝑍𝑡 and 𝑅𝑡 denote update
and reset gates. and 𝑡𝑎𝑛ℎ are Sigmoid and hyperbolic tangent functions. 𝑈𝑍, 𝑈𝑅, 𝑈, 𝑉𝑍, 𝑉𝑅
and 𝑉 are weight matrices, respectively. 𝑏𝑍, 𝑏𝑅 and 𝑏 are bias parameters, respectively. is
the dot product.
2.3 The principle of standard autoencoder
Compared with RNN, AE is a type of unsupervised neural network and the goal is to make the
input equal to the output. The basic structure of AE is shown in Fig. 3. It can be seen that the input
is encoded firstly, and then the encoded data are processed by the activation function. At last, the
processed data are decoded as the output. In addition, the output 𝑌𝑖 is approximately equal to the
input 𝑋𝑖. Due to the number of hidden layers are always less than the dimension of input, thus, the
Page 4 of 23
AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
6. hidden units are regard as the high-dimensional of input. For a given input 𝑋𝑖(𝑋𝑖 ∈ 𝑅𝑙∗1
), the
hidden representation 𝐻(𝑋𝑖) could be presented mathematically by Eq. (7).
𝐻(𝑋𝑖) = 𝑓𝑆(𝑉𝑖𝑗𝑋𝑖 + 𝑏1) (7)
where 𝑓𝑆(∙) denotes the Sigmoid activation function, 𝑓𝑆(𝑡) = 1 (1 + 𝑒−𝑡
)
⁄ . 𝑉𝑖𝑗 ∈ 𝑅𝑛∗𝑙 presents
weight matrix, 𝑏1 ∈ 𝑅𝑛∗1 is bias vector.
After that, transforming vector H into reconstructed vector 𝑌𝑖(𝑌𝑖 ∈ 𝑅𝑙∗1
).
𝑌𝑖 = 𝑓𝑆(𝑊
𝑗𝑖𝐻(𝑋𝑖) + 𝑏2) (8)
where 𝑊
𝑗𝑖 ∈ 𝑅𝑙∗𝑛 denotes weight matrix, 𝑏2 ∈ 𝑅𝑙∗1 is bias vector.
The loss function of standard AE is mean square error (MSE), which is to realize the
minimization of the reconstruction error by optimizing the parameters.
𝐿(𝜃) =
1
𝑛
(∑ (
1
2
‖𝑌𝑖 − 𝑋𝑖‖2
)
𝑛
𝑖=1 ) (9)
where 𝜃 denotes the parameters.
Fig. 3 The basic structure of AE
3. The proposed method.
An optimal method is proposed for rolling bearing fault diagnosis. This part is a detailed
illustration of the proposed method. Section 3.1 details the construction of the hybrid deep learning
model. Section 3.2 describes the optimization process of the constructed model. Section 3.3 shows
the general process of the proposed method.
3. 1 The model construction
The worsening environment and increasing working hours contribute to various failures of
rolling bearings. Consequently, accurate and stable diagnosis of bearing faults are realistic and
urgent. The measured bearing vibration signals are sequential signals, which are complex and
nonlinear. However, GRU has powerful ability in extracting the time relevance of sequential signals
and SAE could learns more robust feature representations. Thus, to maximize the advantages of
GRU and SAE, a hybrid deep learning model that combines GRU and SAE is constructed in this
paper and the constructed model is shown in Fig. 4. The raw bearing signals are firstly processed
by GRU layer to obtain the Feature 1, the Feature 1 are input into to the first SAE to get the Feature
2, and then the Feature 2 becomes the input of second SAE for obtaining the Feature 3 (the final
features). Finally, the final features are entered into the classifier to obtain the final diagnosis results.
The cross-entropy loss function is applied for GRU to realize the minimization of the reconstruction
error by optimizing parameters, and the formula is as follows:
Sigmoid
x1
x2
x3
xn
Sigmoid
Sigmoid
/
/
/
/
Y1
Y2
Y3
Yn
Vij
b1 b2
Wji
encoding decoding
Page 5 of 23 AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
7. 𝐿(𝜃, 𝑀, 𝐹) = −
1
𝑛
∑ 𝑚𝑗 𝑙𝑜𝑔2 𝑓𝑗
𝑛
𝑗=1 (10)
where 𝑛 is the number of trained samples, 𝜃 is the optimized parameters, 𝑀 = {𝑚𝑗|𝑗 = 1, ⋯ , 𝑛}
denotes the actual output set of trained samples and 𝐹 = {𝑓𝑗|𝑗 = 1, ⋯ , 𝑛} is the corresponding
label set. 𝑗 means the jth trained sample. GRU is trained by back propagation through time (BPTT).
Adaptive gradient is used to update the weight matrices and as follows:
∆𝜃 = −
𝜀
√∑ (
𝜕𝐿(𝜃,𝑀,𝐹)
𝜕𝜃
)𝑗
2
𝑡
𝑗=1
∙ (
𝜕𝐿(𝜃,𝑀,𝐹)
𝜕𝜃
)𝑡 (11)
𝜃𝑡 = 𝜃𝑡−1 + ∆𝜃 (12)
where ε is the learning rate, (
𝜕𝐿(𝜃,𝑀,𝐹)
𝜕𝜃
)𝑡 denotes the gradient at step t. 𝜃𝑡 is the parameters at
step t and ∆θ is the updated values.
Compared with standardAE, SAE has more robustness and inference ability that could let SAE
learn more reliable and effective features [32]. The most difference between SAE and AE is their
loss functions are different. Regularization term and a sparsity constraint are added to the cost
function to realize the sparse representation of features. And the whole loss function of SAE is as
follows:
𝐿𝑆𝐴𝐸 = 𝐿𝐴𝐸 +
𝜏
2
∑(𝑤𝑖𝑗)2
+ 𝛽(∑ 𝜌 log
𝜌
𝜌𝑗
̂
+ (1 − 𝜌)
𝑝
𝑗=1 log
1−𝜌
1−𝜌𝑗
̂
) (13)
where τ is the regularization term parameter that adjusts the weight 𝑤. The third is the Kullback–
Leibler divergence function that is to measure the difference between 𝜌 and 𝜌𝑗
̂ . 𝜌 is a predefined
sparse parameter, 𝜌𝑗
̂ is the average activation value of hidden unit j and β is the sparse penalty
factor.
3.2 The constructed model optimization
3.2.1 GWO algorithm
In GWO algorithm, is the best result, is the second best and is the third best result.
The formula is as follows:
1
2
D a r a
= − (14)
( 1) ( ) ( ) ( )
m m
Y k Y k D E Y k Y k
+ = − − (15)
where k is the current iteration, Y and m
Y are the position vectors of wolf and prey. E and 1
r
are random vectors. a linearly decreases from 2 to 0 in the iterative process. The other wolves
update their positions by 、 and .
1 1
( )
A
Y Y D E Y Y
= − − (16)
2 2
( )
B
Y Y D E Y Y
= − − (17)
3 3
( )
C
Y Y D E Y Y
= − − (18)
( 1)
3
A B C
Y Y Y
Y k
+ +
+ = (19)
where 1
D , 2
D and 3
D are similar to D , 1
E , 2
E and 3
E are similar to E . The details of
GWO algorithm can be seen in Ref. [39]. GWO algorithm is applied to obtain the optimal
parameters of the hybrid deep learning model.
3.2.2 The constructed model optimization
Page 6 of 23
AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
8. A hybrid deep learning model that combines GRU and SAE is constructed for rolling bearing
fault diagnosis. The mentioned process in section 3.1 seems quite easy. However, it always requires
a lot of tunings for getting a satisfactory diagnosis result, which means a lot of time. The values of
ε in Eq. (11), τ, β and ρ in Eq. (13) greatly impact the diagnosis performance of the constructed
model. Thus, GWO as a novel optimization algorithm is applied to obtain the optimal values of the
mentioned parameters. The optimization process are as follows:
⚫ Step 1: Construct the proposed model.
⚫ Step 2: GWO algorithm initialization and set the number of wolves N and iterative steps K. 𝑐𝑖 =
[𝜀𝑖; 𝜏𝑖; 𝛽𝑖; 𝜌𝑖] (𝑖 = 1,2,3 ⋯ 𝑁) is the optimized parameter set. The error rate of classification is
taken as the fitness function of GWO.
⚫ Step 3: Initialize the original state of each search agent by randomly generating between ranges.
Update the positions of search agents by Eq. (19). The fitness of each search agent is calculated,
the minimum fitness and the optimal state of search agent are all saved at each iteration of GWO.
⚫ Step 4: Finish the optimization process if the iterative step reaches K and obtain the optimized
parameter set.
3.3 The general step of the proposed method
An optimal bearing fault diagnosis method is proposed in this paper. The framework is shown
in Fig. 5, and the general process is as follows.
◼ Step 1: Use data acquisition system to measure the bearing vibration signals.
◼ Step 2: Construct the proposed model.
◼ Step 3: Use GWO to obtain the key parameters of the constructed model.
◼ Step 4: Verify the effectiveness of the optimization process and output the final diagnosis
result.
Page 7 of 23 AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
9. Vibration signals acquisition
Rolling bearings
The proposed model construction
Sample 1 Sample N
Sample 2
Measured vibration signals
The proposed method
Raw vibration signals
GRU
layer
Feature 1
Feature 1
SAE1
layer
Feature 2
Feature 3
SAE2
layer
Feature 2
Diagnosis result
Softmax
classifier
Feature 3
The optimization
parameters
Application of the proposed method
Diagnosis results Visualize the learned features
Fig. 5 The framework of the proposed method [22].
Page 8 of 23
AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
10. GRU
layer
Raw vibration signals
Feature 1
SAE1
layer
Feature 1
Feature 2
SAE2
layer
Feature 3
Feature 2
Softmax
classifier
Diagnosis result
Feature 3
Fig. 4 The constructed model
4. Experimental verification
4.1 Data description
The verified bearing data is from Case Western Reserve University (CWRU) [40]. The
experimental device is shown in Fig 6 (a), and the schematic illustration is shown in Fig 6 (b). The
experimental platform contains a torque sensor, a motor, electronic control equipment and a power
meter. The fault diameters are 0.007, 0.014, 0.021 (1 inch = 2.54 cm) and the frequency is 12 kHz.
(a) (b)
Fig. 6 (a) Experimental device, (b) schematic illustration
The drive-end data used in this paper are measured at 1797 rpm. In this case study, 10 kinds of
working conditions were designed. The introduction of the data are shown in Table1, the fault
conditions include ball (B), outer race (OR) and inner race (IR) fault. Each condition includes 150
sample and each includes 800 points. The first 100 samples are for training and the rest are for
testing. The raw vibration signals of the 10 conditions is shown in Fig. 7
Page 9 of 23 AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
11. Table1
Introduction of training sample and testing sample
Health condition Fault diameter (in.) Training/testing samples Condition
Normal 0 100/50 1
B 0.007 100/50 2
B 0.014 100/50 3
B 0.021 100/50 4
IR 0.007 100/50 5
IR 0.014 100/50 6
IR 0.021 100/50 7
OR 0.007 100/50 8
OR 0.014 100/50 9
OR 0.021 100/50 10
Amplitude
(m/𝑠
2
)
(1)
(2)
(3)
(4)
(5)
(6)
Page 10 of 23
AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
12. Fig. 7 The vibration signals of ten bearing conditions and the number corresponds to the
corresponding condition [9].
4.2 Diagnosis results and analysis
This part is mainly to estimate the diagnosis ability of the proposed method for raw vibration
signals, the constructed hybrid model without GWO, Deep GRU, SAE, standard AE, ANN and
SVM are compared with the proposed method. And the only inputs to all methods are raw vibration
signals. GWO is used for obtaining the main parameters of the constructed model, the parameters
are illustrated in Table 2 and the iteration process is shown in Fig. 8. A rule similar to [41] is
followed in deciding the structure of the constructed model and the structure is chosen as 800-400-
200-100-10.
Each method runs 10 times under its respective parameters to estimate the stability of all
methods. The results of all methods are shown in Fig. 9 and the confusion matrix (the first trial),
which detailed describes the fault condition, is shown in Fig. 10. Table3 shows the detailed results
in the experiment.
Table 2
The parameters in the experiment
Description Symbol value
The learning rate of GRU ε 0.0531 (given by GWO)
Weight regularization of SAE τ 4.6023 (given by GWO)
Sparsity proportion of SAE ρ 0.2901 (given by GWO)
Sparsity weight of SAE β 0.3755 (given by GWO)
The number of wolves of GWO N 10
The iterative steps of GWO K 20
Time (s)
(7)
(8)
(9)
(10)
)
Page 11 of 23 AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
13. Table 3
Average diagnosis accuracy and standard deviation of the 10 trials
Fig. 8 The GWO optimization process for bearing vibration signals
Categories Methods Average accuracy (%) Standard deviation (%)
Deep learning
The proposed method 97.130 0.625
Constructed model 95.425 0.787
Deep GRU 91.313 1.095
SAE 86.647 1.612
Deep AE 81.673 2.108
Shallow learning SVM 70.546 2.993
ANN 59.589 3.473
Accuracy
(%)
Iterative number
Page 12 of 23
AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
14. Fig. 9 Diagnosis results of the 10 trials for each method
Fig. 10 The confusion matrix of the propsed method (the first trial)
As Table 3 shows, the average accuracy of the proposed method, the constructed hybrid model
without GWO, Deep GRU, SAE ,Deep AE, SVM and ANN are 97.130%, 95.425%, 91.313%,
86.647%, 81.673%, 70.546% and 59.589%, respectively. Obviously, the diagnosis performance of
the proposed method is much better than others. The standard deviation of the proposed method is
only 0.625% that is much smaller than that of the constructed hybrid model without GWO, Deep
GRU, SAE, Deep AE, SVM and ANN, which are 0.787%, 1.095%, 1.612%, 2.108%, 2.993% and
Accuracy
(%)
Actual
label
Predict number
Trial number
Page 13 of 23 AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
15. 3.473%, respectively. According to the Table 3 and Fig. 9, we can conclude that: (1) Deep learning
methods could achieve better diagnosis performance than shawllow learning methods, the major is
the adoption of deep architectures could let deep learning methods extract more representive fault
features from the raw bearing vibration signals. (2) The proposed method has more effective and
robust performace than Deep GRU, SAE and Deep AE, the main reason is the proposed method
could effectively extracts the time relevance of sequential signals and learns more robust feature
representations. (3) The only difference bettween the proposed method and the construted model is
whether GWO is used or not. The comparison results show that the proposed method could achives
more effective and robust results, the major is the application of GWO could let the constructed
model has an optimazation set of parameters. These conparisons prove the proposed method is more
effective and robust than other methods.
The specific parameters are as follows: (1) the constructed hybrid model without GWO (the
parameters are obtained by repeated experiments): the structure is 800-400-200-100-10, the learning
rate of GRU is 0.1, the weight regularization of SAE is 4.0, the sparsity proportion of SAE is 0.3
and the sparsity weight of SAE is 0.3. (2) Deep GRU: the structure is 400-200-100-10, the learning
rate is 0.06 and the iterative steps is 100. (3) SAE: the structure is 400-200-100-10, the weight
regularization, the sparsity proportion and the sparsity weight are 3, 0.3 and 0.4, respectively. (4)
Deep AE: the structure is 400-200-100-10, the learning rate and momentum are 0.12 and 0.6. (5)
SVM: the RBF kernel is applied, the penalty factor is 2 and the radius of the kernel function is 0.25.
(6) ANN: the structure is 600-100-10, the learning rate is 0.05 and the iterative step is 550.
The following part is mainly to estimate the feature learning ability of the constructed model
for raw bearing vibration signals. The learned features at each level are visualized by t-SNE
algorithm, which include the raw bearing vibration signals, the learned features at GRU layer, the
learned features at first layer and second layer of SAE. Fig. 11 shows the visualized pictures.
(a) (b)
Page 14 of 23
AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
16. (c) (d)
Fig. 11 3D visualization of the features at each level. (a) The raw vibration signals. (b) The learned
features at GRU layer. (c) The learned features at SAE first layer. (d) The learned features at SAE
second layer.
The features at each level are shown in Fig. 11. It can knows that the raw vibration signals of
ten categories are mixed seriously, which makes it impossible to distinguish them. The learned
features at GRU layer are more recognizable that proves the effectiveness of GRU in extracting the
time relevance of sequential signals. The learned features at first layer and second layer of SAE, the
ten categories can be distinguished clearly, which proves SAE could learn robust feature
representations. Consequently, this case illustrates the proposed method could effectively and
adaptively learns the features of raw vibration signals.
Besides, some published deep learning methods are applied to compare with the
proposed method to prove its superiority and the specific comparison results are shown
in Table 4.
Table 4
The specific comparison results
References Accuracy (Raw CWRU data sets) Accuracy (Processed CWRU data sets)
[9] 95.20% /
[42] 96.36% /
[43] / 99.98%
[44] 96.75% 97.47%
In reference [9], the author constructed a deep wavelet autoencoder to effectively
capture the signal characters of raw CWRU data, and the captured characters are input
into extreme learning machine to obtain the final diagnosis result. Comparing the final
diagnosis result of reference [9] with the diagnosis result of the proposed method, it can
be clearly seen that the proposed method could achieves higher diagnostic accuracy
than the reference [9]. In reference [42], a hybrid model that combines denoising
autoencoder and contractive autoencoder is constructed to automatically extract the
features of raw CWRU data. Then there are two different ways to process the extracted
Page 15 of 23 AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
17. features. The first way is the extracted features are directly input into the softmax
classifier and the final diagnosis accuracy is 93.18%. The other way is the extracted
features are reduced by a modified t-distributed stochastic neighbor embedding to
achieve higher diagnosis accuracy and the final diagnosis accuracy is 96.36 %. The
final diagnosis accuracy of the proposed method is also higher than the method in
reference [42]. In reference [43], the author proposed a novel batch-normalized stacked
autoencoder for machine fault diagnosis. The diagnosis accuracy of the method in
reference [43] is 99.98% that is higher than our method which is 97.13%. However,
one point to emphasize is the inputs in reference [9] are frequency domain signals. As
is known, compared with the raw vibration signals, the frequency domain signals are
easy to obtain good diagnostic results for deep learning models. In reference [44], the
author combined compressed sensing and deep learning for rolling bearing fault
diagnosis. For raw CWRU data, the diagnosis accuracy is 96.75%, which is lower than
our result. After the raw data are processed by compressed sensing, the diagnosis result
is 97.47% that is higher than our results. However, there are two points that cannot be
ignored. The raw CWRU data need to be processed by compressed sensing, which not
only increases the complexity of the method, but also the successful application of
compressed sensing requires engineering experience and puts higher demands for
operators. In addition, the classify fault conditions in reference [44] only has 7
conditions, and ours has 10 conditions. The above comparisons could proves the
superiority of the proposed method and the existing result is also rationality.
5. Engineering verification
5.1 Data description
The actual locomotive bearing vibration signals are apllied to evaluate the reliability of the
proposed method in practical engineering. The experimental device is shown in Fig. 12. The signals
are collected at frequency of 12.8 kHz. More details can be seen in reference [16]. Table 5 lists the
nine conditions of bearing vibration signals. Each condition includes 300 samples and each sample
has 800 points. The first 200 samples are applied for training the model, the others are for testing.
The 8192 data points of each condition are shown in Fig. 13
Accelerometer
Fig. 12 The experimental device of electrical locomotive
Page 16 of 23
AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
18. Figure. 13 The collected vibration signals of the nine conditions [23]. (a) Normal condition. (b)
Slight outer race damage. (c) Serious out race damage. (d) Roller damage. (e) Inner race damage.
(f) Compound faults (out race and inner race). (g) Compound faults (out race and roller). (h)
Compound faults (roller and inner race). (i) Compound faults (outer race、inner race and roller).
Table 5
Amplitude
(m/𝑠
2
)
(a)
(b)
(c)
(d)
(e)
(f)
(g)
(h)
(i)
Time (s)
Page 17 of 23 AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
19. Introduction of training sample A and testing sample B
Health condition Motor speed (rpm)
Trained/tested
samples
Label
Normal 490 200/100 1
Slight outer race damage 490 200/100 2
Serious out race demage 481 200/100 3
Roller damage 531 200/100 4
Inner race damage 498 200/100 5
Compound faults (out race and inner race) 525 200/100 6
Compound faults (out race and roller) 521 200/100 7
Compound faults (roller and inner race) 640 200/100 8
Compound faults (outer race、inner race and roller) 549 200/100 9
5.2 Results and analysis
This part is mainly to estimate the diagnosis ability of the proposed method for actual
engineering data. The constructed hybrid model without GWO, Deep GRU, SAE, standard AE,
ANN and SVM are compared with the proposed method, and the only inputs to all methods are raw
vibration signals. GWO is used for obtaining the main parameters, same as Table 2, ε is 0.1658,
τ is 3.5329, ρ is 0.1258 and β is 0.3254. The structure is selected as 800-400-200-100-9.
Each method runs 10 times under its respective parameters to estimate the stability of the
proposed method. The results of each method are shown in Fig. 14. Table 6 shows the detailed
results in the experiment.
Fig. 14 Diagnosis results of the 10 trials for each method
Known from Table 6, the average accuracy of the proposed meBthod is 93.834%, that is
obviously higher than other methods, which are 91.304 %, 88.647%, 83.084%, 79.208%, 64.656%,
90.084%, 89.109%, 57.814%, 89.109% and 87.192%, respectively. The standard deviation of the
proposed method is 1.154% that is obviously less than others, which are 1.392%, 1.631%, 1.495%,
Trial number
Accuracy
(%)
Page 18 of 23
AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
20. 1.971%, 2.595% and 2.853%, respectively. Obviously, the diagnosis performance of the proposed
method is more effective and robust than other methods. The results confirm the diagnosis ability
of the proposed method is effective and robust and the proposed method is realiabe in practical
engineering.
Table 6
Average diagnosis accuracy and standard deviation of the 10 trial
The specific parameters are as follows: (1) the constructed hybrid model without GWO (the
parameters are obtained by repeated experiments): the structure is 800-400-200-100-9, the learning
rate of GRU is 0.1, the weight regularization of SAE is 3.0, the sparsity proportion of SAE is 0.1
and the sparsity weight of SAE is 0.4. (2) Deep GRU: the structure is 400-200-100-9, the learning
rate is 0.1 and the iterative steps is 80. (3) SAE: the structure is 400-200-100-9, the weight
regularization, the sparsity proportion and the sparsity weight are 3.5, 0.18 and 0.32, respectively.
(4) Deep AE: the structure is 400-200-100-9, the learning rate and momentum are 0.15 and 0.5,
respectively. (5) SVM: the RBF kernel is applied, the penalty factor is 2 and the radius of the kernel
function is 0.25. (6) ANN: the structure is 400-150-9, the learning rate is 0.06 and the iterative step
is 800.
The learned features at each level are visualized by t-SNE algorithm, which include raw bearing
vibration signals, the learned features at GRU layer, the learned features at first layer and second
layer of SAE. Fig. 15 shows the visualized pictures. The visulization results confirm the proposed
method could effectively and and adaptively learns the features of raw vibration signals.
(1) (2)
Methods Average accuracy (%) Standard deviation (%)
The proposed method 93.834 1.154
The constructed model (without GWO) 91.304 1.392
Deep GRU 88.647 1.631
SAE 83.084 1.495
Deep AE 79.208 1.971
SVM 64.656 2.595
ANN 57.814 2.853
Page 19 of 23 AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
21. (3) (4)
Fig. 15 3D visulization of features at each level. (1) The raw vibration signals. (2) The learned
features at GRU layer. (3) The learned features at first SAE layer. (3) The learned features at second
SAE layer.
6. Conclusion
In this paper, an optimal hybrid deep learning model based on gated recurrent unit (GRU) and
sparse autoencoder (SAE) is proposed for rolling bearing fault diagnosis. The key paprameters of
the hybrid deep learning model can be obtained adaptively and the hybrid deep learning model could
directly processes the raw bearing vibration signals. The proposed method is verified by the
experimental and practical engineering bearing data and the results prove the proposed method
could achieves more effective and robust diagnosis performance than other method. In addition, the
results also prove the extraordinary ability of GRU in extracting the time relevance of sequential
signals. Consequently, GRU is a promising tool for bearing fault diagnosis.
However, Compared with some of the latest bearing fault diagnosis articles, the accuracy of
the diagnosis in this paper has no advantage. The author will continue to investigate this topic in
future study to fully mine the feature extraction ability of GRU in processing sequential signals.
Acknowledgement
This research is supported by the major research plan of the National Natural Science
Foundation of China (No. 91860124), the National Natural Science Foundation of China (No.
51875459) and the Aeronautical Science Foundation of China (No. 20170253003), the Synergy
Innovation Foundation of the University and Enterprise for Graduate Students in Northwestern
Polytechnical University (No. XQ201901).
Reference
[1] H.K. Jiang, C.L. Li, H.X. Li, An improved EEMD with multiwavelet packet for rotating
machinery multi-fault diagnosis, Mech. Syst. Signal Process., 36 (2013) 225-239.
[2] Y. Zhang, B.P. Wang, Y. Han, L. Deng, Bearing performance degradation assessment based on
time-frequency code features and SOM network, Meas. Sci. Technol., 28 (2017) 045601.
[3] Y.Y. Zhang, X.Y. Li, L. Gao, L.H. Wang, L. Wen, Imbalanced data fault diagnosis of rotating
machinery using synthetic oversampling and feature learning, J. Manuf. Syst., 48 (2018) 34-50.
[4] J.L. Chen, Y.Y. Zi, Z.J. He, J. Yuan, Improved spectral kurtosis with adaptive redundant
multiwavelet packet and its applications for rotating machinery fault detection, Meas. Sci. Technol.,
23 (2012) 045608.
Page 20 of 23
AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
22. [5] Y.G. Lei, J. Lin, Z.J. He, M.J. Zuo, A review on empirical mode decomposition in fault diagnosis
of rotating machinery, Mech. Syst. Signal Process., 1-2 (2013) 108-126. Meas. Sci. Technol., 23
(2012) 045608.
[6] Y.Y. Zhang, X.Y. Li, L. Gao, P.G. Li, A new subset based deep feature learning method for
intelligent fault diagnosis of bearing, Expert Syst. Appl., 110 (2018) 125-142.
[7] H.R. Cao, L.K. Niu, S.T. Xi, X.F. Chen, Mechanical model development of rolling bearing-rotor
systems: A review, Mech. Syst. Signal Process., 102 (2018) 37-58.
[8] J.D. Zheng, J.S. Cheng, Y. Yu, A rolling bearing fault diagnosis approach based on LCD and
fuzzy entropy, Mech. Mach. Theory, 70 (2013) 441-453.
[9] H.D. Shao, H.K. Jiang, X.Q. Li, S.P. Wu, Intelligent fault diagnosis of rolling bearing using deep
wavelet auto-encoder with extreme learning machine, Knowl.-Based Syst., 140 (2018) 1-14.
[10] S. Ma, F.L. Chu, Ensemble deep learning-based fault diagnosis of rotor bearing systems,
Comput. Ind., 105 (2019) 143-152.
[11] H.D. Shao, H.K. Jiang, K. Zhao, D.D. Wei, X.Q. Li, A novel tracking deep wavelet auto-
encoder method for intelligent fault diagnosis of electric locomotive bearings, Mech. Syst. Signal
Process., 110 (2018) 193–209.
[12] M. Unal, M. Onat, M. Demetgul, H. Kucuk, Fault diagnosis of rolling bearings using a genetic
algorithm optimized neural network. Measurement, 58 (2014) 187-196.
[13] J. Zarei, Induction motors bearing fault detection using pattern recognition techniques. Expert
Syst. Appl., 39 (2012) 68-73.
[14] X.A. Yan, M.P. Jia, A novel optimized SVM classification algorithm with multi-domain feature
and its application to fault diagnosis of rolling bearing. Neurocomputing, 313 (2018) 47-64.
[15] J.D. Zheng, H.Y. Pan, S.B. Yang, J.S. Chen, Generalized composite multiscale permutation
entropy and laplacian score based rolling bearing fault diagnosis. Mech. Syst. Signal Process., 99
(2018) 229-243.
[16] H.D. Shao, H.K. Jiang, H.Z Zhang, T.C. Liang, Electric Locomotive Bearing Fault Diagnosis
Using a Novel Convolutional Deep Belief Network, IEEE T. Ind. Electron. 65 (2017) 2727-2736.
[17] H.D. Shao, H.K. Jiang, X. Zhang, M.G. Niu. Rolling bearing fault diagnosis using an
optimization deep belief network. Meas. Sci. Technol., 26 (2015) 115002.
[18] S.H. Wang, J.W. Xiang, Y.T. Zhong, Y.Q. Zhou, Convolutional neural network-based hidden
Markov models for rolling element bearing fault identification. Knowl.-Based Syst., 144 (2018) 65-
76.
[19] J.H. Dou, J.Y. Qin, Z.X. Jin, Z. Li, Knowledge graph based on domain ontology and natural
language processing technology for Chinese intangible cultural heritage. J. Visual. Lang. Comput.,
48 (2018) 19-28.
[20] H.M. Fayek, M. Lech, L. Cavendon, Evaluating deep learning architectures for Speech
Emotion Recognition. Neural Networks, 92 (2017) 60-68.
[21] Z.L. Hu, J.S. Tang, Z.M. Wang, K. Zhang, L. Zhang, Q.L. Sun, Deep learning for image-based
cancer detection and diagnosis − A survey. Pattern Recognit., 83 (2018) 134-149.
[22] H.D. Shao, H.K. Jiang. F.A. Wang, H.W. Zhao, An enhancement deep feature fusion method
for rotating machinery fault diagnosis, Knowl.-Based Syst., 119 (2017) 200-220.
[23] H.D. Shao, H.K. Jiang, H.W. Zhao, F.A. Wang, A novel deep autoencoder feature learning
method for rotating machinery fault diagnosis, Mech. Syst. Signal Process., 95 (2017) 187-204.
Page 21 of 23 AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
23. [24] Z. Meng, X.Y. Zhan, J. Li, Z.Z. Pan,An enhancement denoising autoencoder for rolling bearing
fault diagnosis, Measurement, 130 (2018) 448-454.
[25] C. Lu, Z.Y. Wang, B. Zhou, Intelligent fault diagnosis of rolling bearing using hierarchical
convolutional network based health state classification, Adv. Eng. Inform., 32 (2017) 139-151.
[26] W.Y. Huang, J.S. Cheng, Y. Yang, G.Y. Guo, An improved deep convolutional neural network
with multi-scale information for bearing fault diagnosis, Neurocomputing, 2019.
[27] S.H. Tang, C.Q. Shen, D. Wang, S. Li, W.G. Huang, Z.K. Zhu, Adaptive deep feature learning
network with Nesterov momentum and its application to rotating machinery fault diagnosis,
Neurocomputing, 305 (2018) 1-14.
[28] H.D. Shao, H.K. Jiang, F.A. Wang, Y.N. Wang, Rolling bearing fault diagnosis using adaptive
deep belief network with dual-tree complex wavelet packet, ISA T., 69 (2017) 187-201.
[29] H.K. Jiang, X.Q. Li, H.D. Shao, K. Zhao, Intelligent fault diagnosis of rolling bearing using
improved deep recurrent neural network, Meas. Sci. Technol., 29 (2018) 065107.
[30] R. Zhao, D.Z. Wang, R.Q. Yan, K.Z. Mao, F. Shen, J.J. Wang, Machine Health Monitoring
Using Local Feature-based Gated Recurrent Unit Networks, J. IEEE T. Ind. Electron., 99 (2017),
1539 – 1548.
[31] D.T. Hoang, H.J. Kang, A survey on Deep Learning based bearing fault diagnosis,
Neurocomputing, 335 (2019) 327-335.
[32] L. Xu, M.Y. Cao, B.Y. Song, J.S. Zhang, Y.R. Liu, F.E. Alsaadi, Open-circuit fault diagnosis of
power rectifier using sparse autoencoder based deep neural network, Neurocomputing, 311 (2018)
1-10.
[33] H. Liu, J.Z. Zhou, Y. Zheng, W. Jiang, Y.C. Zhang, Fault diagnosis of rolling bearings with
recurrent neural network-based autoencoders. ISA T. 77 (2018) 167-178.
[34] F.A. Wang, H.K. Jiang, H.D. Shao, W.J. Duan, S.P. Wu, An adaptive deep convolutional neural
network for rolling bearing fault diagnosis, Meas. Sci. Technol., 28 (2017) 095005.
[35] X. Zhang, Z.W. Liu, Q. Miao, L. Wang, An optimized time varying filtering based empirical
mode decomposition method with grey wolf optimizer for machinery fault diagnosis, J. Sound Vib.,
418 (2018) 55-78.
[36] X. Zhang, Q. Miao, Z.W. Liu, Z.J. He, An adaptive stochastic resonance method based on grey
wolf optimizer algorithm and its application to machinery fault diagnosis, ISA T. 71 (2017) 206-
214.
[37] A. Graves. Supervised Sequence Labelling with Recurrent Neural Networks [M], Springer
Berlin Heidelberg, 2012.
[38] X.Q. Li, H.K. Jiang, X. Xiong, H.D. Shao, Rolling bearing health prognosis using a modified
health index based hierarchical gated recurrent unit network, Mech. Mach. Theory, 133 (2019) 229-
249.
[39] S. Mirjalili, S.M. Mirjalili, A. Lewis, Grey Wolf Optimizer, Adv. Eng. Softw., 69 (2014) 46-61.
[40] W.A. Smith, R.B. Randall, Rolling element bearing diagnostics using the Case Western Reserve
University data: a benchmark study, Mech. Syst. Signal Process., 64–65 (2015) 100-131.
[41] F. Jia, Y.G. Lei, J. Lin, X. Zhou, N. Lu, Deep neural networks: a promising tool for fault
characteristic mining and intelligent diagnosis of rotating machinery with massive data, Mech. Syst.
Signal Process. 72–73 (2016) 303–315.
[42] W. Jiang, J.Z. Zhou, H. Liu, Y.H. Shan, A multi-step progressive fault diagnosis method for
rolling element bearing based on energy entropy theory and hybrid ensemble auto-encoder, ISA T.,
Page 22 of 23
AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t
24. 87 (2019) 235-250.
[43] J.R. Wang, S.M. Li, Z.H. An, X.X. Jiang, W.W. Qian, S.S. Ji, Batch-normalized deep neural
networks for achieving fast intelligent fault diagnosis of machines, Neurocomputing, 329 (2019)
53-65.
[44] J.D. Sun, C.H. Yan, J.T. Wen, Intelligent Bearing Fault Diagnosis Method Combining
Compressed DataAcquisition and Deep Learning, IEEE T. INSTRUM. MEAS., 67 (2018) 185-195.
Page 23 of 23 AUTHOR SUBMITTED MANUSCRIPT - MST-108890.R1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A
c
c
e
p
t
e
d
M
a
n
u
s
c
r
i
p
t