Zum Inhalt springen

Enhancement of Bearing Fault Diagnosis Using Optimized Variational Decomposition, Entropy-Based Modal Reconstruction, and Evolutionary Bidirectional Fusion Network

Prometheus Redaktion

Open AccessArticle Enhancement of Bearing Fault Diagnosis Using Optimized Variational Decomposition, Entropy-Based Modal Reconstruction, and Evolutionary Bidirectional Fusion Network 1 Changlongshan Pumped Storage Power Plant, China Yangtze Power Renewables Co., Ltd., Huzhou 310009, China 2 Longxu Youth Innovation Studio, China Yangtze Power Renewables Co., Ltd., Huzhou 310009, China 3 PowerChina HuaDong Engineering Co., Ltd., Hangzhou 311122, China * Author to whom correspondence should be addressed. Processes 2026, 14(12), 1861; https://doi.org/10.3390/pr14121861 (registering DOI) Submission received: 18 May 2026 / Revised: 4 June 2026 / Accepted: 5 June 2026 / Published: 9 June 2026 Abstract Rolling bearing vibration signals often exhibit strong nonstationarity and are susceptible to noise interference, which makes fault feature extraction and accurate diagnosis challenging under complex operating conditions. To address these issues, this paper proposes a fault diagnosis pipeline that sequentially combines an improved snow ablation optimizer (ISAO), variational generalized nonlinear mode decomposition (VGNMD), and a bidirectional temporal sequence fusion network (BiTSF-Net). Firstly, ISAO is used to optimize the key parameters of VGNMD, including the bandwidth penalty parameter and smoothing constraint parameter, with minimum envelope entropy as the fitness function. Secondly, the optimized VGNMD decomposes raw vibration signals into modal components, and the modal component with the minimum envelope entropy is selected to highlight fault-related impulsive characteristics. Thirdly, 11-dimensional time-domain statistical features are extracted from the selected optimal modal component to characterize bearing health states. Finally, these extracted features are used as the input to BiTSF-Net, which combines bidirectional temporal convolutional networks and bidirectional long short-term memory networks in a parallel structure to learn local transient features and temporal dependencies for fault classification. Experimental validation is conducted on the Case Western Reserve University dataset. Comparative results with convolutional neural networks, gated recurrent units, and long short-term memory networks demonstrate that the proposed pipeline achieves superior diagnostic performance, with an average accuracy of 99.63% and a maximum accuracy of 100%. These results confirm the effectiveness and robustness of the proposed ISAO-VGNMD feature extraction and BiTSF-Net classification pipeline for bearing fault diagnosis under complex nonstationary conditions. 1. Introduction 1.1. Research Background As crucial supporting components in rotating machinery systems, rolling bearings are extensively employed in power equipment, rail transportation, aerospace, and industrial manufacturing industries. Their operational state directly influences the safety and reliability of mechanical systems [ 1]. In the event of bearing failure, it may, at the least, result in a decrease in equipment operational efficiency, or, at the most, lead to severe mechanical damage and major safety incidents. Therefore, efficient and accurate fault diagnosis of rolling bearings has significant engineering implications and research value [ 2]. In recent years, with the escalating complexity of industrial equipment, traditional fault diagnosis methods based on empirical rules have proven insufficient for intelligent diagnosis under complex operating conditions. Achieving high-precision fault identification through advanced signal processing techniques and intelligent algorithms has become a major research focus in the field of mechanical fault diagnosis. Among diverse monitoring techniques, vibration signals are extensively employed in bearing fault diagnosis research, as they directly mirror the operating conditions of mechanical structures. Nevertheless, bearing vibration signals generally display prominent non-stationarity, nonlinearity, and multi-scale coupling characteristics. Under complex operational circumstances and interference from environmental noise, fault impact features are frequently concealed within background signals, substantially augmenting the difficulties in feature extraction and fault identification [ 3]. Consequently, the effective extraction of discriminative feature information from complex vibration signals has become a crucial challenge in the research of rolling bearing fault diagnosis. In recent years, with the progress of signal processing methods and artificial intelligence technologies, fault diagnosis approaches that combine signal decomposition with deep learning have emerged as a research focus. This integrated approach provides an effective framework for feature extraction and condition recognition in complex nonstationary signals. [ 4]. 1.2. Literature Review To address the nonstationary characteristics of bearing vibration signals, numerous signal decomposition methods have been developed for fault feature extraction. Among them, Empirical Mode Decomposition (EMD) [ 5] is regarded as one of the most representative adaptive signal analysis approaches. Lei et al. [ 6] systematically reviewed the application of EMD in rotating machinery fault diagnosis and demonstrated its capability in separating signal components at different scales. However, EMD-based methods remain highly susceptible to mode mixing, end effects, and decomposition instability under noisy operating conditions, which may reduce the reliability of extracted fault features in practical applications. To overcome these limitations, Dragomiretskiy and Zosso [ 7] proposed Variational Mode Decomposition (VMD), which formulates signal decomposition as a variational optimization problem and effectively suppresses modal aliasing. Compared with EMD, VMD exhibits better mathematical robustness and decomposition stability. Nevertheless, the decomposition performance of VMD remains highly dependent on manually selected parameters, particularly the modal number and bandwidth penalty factor. Improper parameter settings may lead to under-decomposition or redundant modal components, thereby affecting fault feature extraction accuracy. Subsequently, Nazari et al. [ 8] proposed Successive Variational Mode Decomposition (SVMD), which further improves decomposition stability through successive extraction strategies. Although SVMD alleviates parameter sensitivity to some extent, its adaptability to highly nonlinear and strongly nonstationary vibration signals remains limited under complex operating conditions. In recent years, Wang et al. [ 9] proposed Variational Generalized Nonlinear Mode Decomposition (VGNMD), which integrates adaptive time–frequency clustering with variational optimization to simultaneously process nonlinear frequency modulation modes and dispersive modes. Compared with conventional decomposition methods, VGNMD demonstrates stronger capability in analyzing complex signal structures and extracting physically meaningful modal information. However, the decomposition performance of VGNMD is still highly sensitive to key parameters, such as the bandwidth penalty parameter and frequency smoothing constraint coefficient [ 10]. Inappropriate parameter selection may result in insufficient modal separation, information leakage, or redundant modal components, thereby affecting subsequent fault diagnosis performance. Therefore, adaptive parameter optimization has become an important research direction for improving decomposition quality and feature representation capability. With the development of swarm intelligence optimization techniques, various optimization algorithms have been introduced into mechanical fault diagnosis, including Particle Swarm Optimization (PSO) [ 11], Gray Wolf Optimizer (GWO) [ 12], and Differential Evolution (DE) [ 13]. These methods have demonstrated effectiveness in parameter optimization and feature selection tasks. Recently, Deng et al. [ 14] proposed the Snow Ablation Optimizer (SAO), which exhibits strong global optimization capability in engineering applications. However, similar to many swarm intelligence algorithms, the original SAO still suffers from insufficient population diversity and premature convergence when solving high-dimensional nonlinear optimization problems. Consequently, improving the exploration–exploitation balance and convergence stability of optimization algorithms remains a critical issue. Meanwhile, deep learning techniques have been extensively applied in intelligent fault diagnosis and remaining useful life prediction [ 15, 16]. In particular, Zhang et al. [ 17] proposed a cross-working-condition bearing remaining useful life prediction method based on SPW-SVDD health indicators and a temporal self-attention mechanism, demonstrating the effectiveness of deep learning architectures in capturing temporal degradation characteristics under varying operating conditions. Ji et al. [ 18] established a swarm intelligence-based deep learning model combining the improved whale optimization algorithm and bidirectional long short-term memory to realize fault diagnosis of chemical processes. However, CNN-based models mainly focus on local receptive fields and lack sufficient capability for capturing long-term temporal dependencies [ 19]. Long Short-Term Memory (LSTM) networks [ 20] can effectively model temporal sequence information and have demonstrated advantages in time-series prediction and fault diagnosis tasks. Nevertheless, LSTM architectures often suffer from high computational complexity, limited parallelization capability, and gradient vanishing problems when modeling long sequences. Temporal Convolutional Networks (TCNs) [ 21], benefiting from dilated causal convolution structures, can effectively expand receptive fields while maintaining parallel computation capability. However, TCN-based models still exhibit limitations in modeling complex nonlinear temporal dynamics. To further improve feature learning capability, many studies have attempted to combine multiple network architectures. Existing serial hybrid models, such as TCN-LSTM frameworks, can partially integrate local feature extraction and temporal dependency modeling. However, serial structures may still introduce information loss during feature transmission and fail to fully preserve multi-scale temporal characteristics. In addition, the increasing network depth may further reduce training efficiency and model robustness. Overall, existing studies have achieved significant progress in bearing fault diagnosis. Nevertheless, several challenges remain unresolved, including parameter-sensitive signal decomposition, insufficient robustness under complex operating conditions, limited adaptive feature extraction capability, and inadequate joint modeling of local transient characteristics and long-term temporal dependencies. Therefore, developing an intelligent fault diagnosis framework with adaptive decomposition capability, robust parameter optimization performance, and efficient multi-scale temporal feature learning remains an important research topic. 1.3. Research Gaps and Innovations Overall, existing studies have achieved considerable progress in bearing fault diagnosis. However, several critical challenges remain unresolved, including parameter-sensitive signal decomposition, insufficient robustness under complex operating conditions, limited adaptive feature extraction capability, and inadequate joint modeling of local transient characteristics and long-term temporal dependencies. Traditional signal decomposition methods may still suffer from mode mixing and strong parameter sensitivity when processing complex nonlinear vibration signals, which undermines the accuracy of fault feature extraction. Although VGNMD can achieve high-precision decomposition of complex non-stationary signals, its key parameters are highly sensitive to the decomposition results. Inappropriate parameter settings may deteriorate the quality of decomposition. During fault diagnosis, single deep-learning models often struggle to simultaneously capture local impulsive features and long-term temporal dependencies, thus restricting their capacity to effectively represent complex vibration signals. To tackle the aforementioned challenges, this paper presents a bearing fault diagnosis method that integrates an improved snow ablation optimizer algorithm with variational generalized nonlinear mode decomposition. The key innovations are as follows: An ISAO algorithm combining Latin hypercube sampling (LHS) and Tent chaotic mapping, which performs adaptive optimization of key parameters in VGNMD to improve mode decomposition quality; A modal selection strategy based on minimum envelope entropy that adaptively selects optimal modal components from decomposed signals while extracting 11-dimensional time-domain statistical features to enhance fault feature representation; A bidirectional time-series parallel fusion network (BiTSF-Net) integrating bidirectional time convolution networks (BiTCN) and bidirectional long short-term memory networks (BiLSTM) for multi-scale temporal feature learning, thereby improving fault recognition performance for complex vibration signals. 3. BiTSF-Net Construction In the domain of mechanical fault diagnosis, the commonly employed deep-learning models mainly encompass architectures like CNN, LSTM/GRU, and TCN. Nevertheless, all these models present certain limitations in feature extraction. CNNs are effective in extracting local features from signals through convolutional kernels with local receptive fields, and they showcase remarkable performance in image recognition and one-dimensional signal processing. However, their restricted receptive fields encounter difficulties in modeling long-term dependencies and are unable to capture the global evolutionary patterns of vibration signals. LSTM and GRU architectures, although demonstrating strong capabilities in time-series modeling for representing long-term dependencies, have inherent limitations. These include challenges in parallelization during training, sub-optimal computational efficiency, weak localization of transient impact features, and potential gradient decay issues in extended sequences. TCN architectures partially overcome the shortcomings of CNNs by expanding convolutional structures, enabling effective long-term sequence modeling with improved parallel processing capabilities. Nevertheless, as convolutional architectures at their core, TCNs still exhibit limitations in modeling complex nonlinear dynamic patterns, making it arduous to fully characterize the evolutionary dynamics of vibration signals. Serial TCN-LSTM models: Current research generally adopts a tandem structure (TCN→LSTM) to integrate the advantages of both architectures. However, such architectures still manifest the following problems: information loss during feature transfer; biased learning of features across different scales; and an increase in network depth due to the serial structure, which undermines training efficiency. To tackle the aforementioned challenges, this study proposes the BiTSF-Net model with a parallel dual-branch architecture. By simultaneously applying BiTCN and BiLSTM to input features, the model realizes collaborative learning of multi-scale features. 3.1. BiTCN BiTCN utilizes one-dimensional causal convolution to conduct time-series modeling. The output of BiTCN at any specific moment is solely contingent upon the current and historical inputs, thus precluding information leakage. Regarding the input sequence X, the one-dimensional convolution operation within TCN integrates dilated convolution to augment the modeling capacity for long-term dependencies, which can be formulated as y ( t ) = ∑ k = 0 K − 1 w ( k ) x ( t − d · k ) (16) where K denotes the size of the convolution kernel, w ( k ) represents the kernel weight, and d is the dilation factor. By incrementally increasing the expansion factor layer by layer, TCN can effectively extract features from long time series without significantly increasing network depth or parameter scale. Moreover, BiTCN employs forward TCN (Forward TCN) and backward TCN (Backward TCN) modeling for input sequences. The output features of the two TCN pathways are represented as follows: H f = TCN ( x ) (17) H b = TCN ( reverse ( x ) ) (18) 3.2. BiLSTM BiLSTM combines forward and backward LSTM layers to encode sequences from two directions, allowing the model to concurrently capture bidirectional contextual information that includes historical and future contexts. This bidirectional architecture substantially enhances the model’s capacity to perceive global sequence features. BiLSTM is capable of processing both forward and backward time-series data simultaneously, facilitating more comprehensive prediction results and notably improving the accuracy of soft measurement forecasting. The following presents the hidden state equation of BiLSTM at time t: h t → = LSTM → ( h t − 1 , x t , g t − 1 ) h t ← = LSTM ← ( h t + 1 , x t , g t + 1 ) (19) H t = h t → , h t ← (20) where h t → denotes the forward-propagation hidden layer state, while h t ← represents the backward-propagation hidden layer state. 3.3. Feature Extraction To effectively characterize the dynamic variations in bearing vibration signals under different fault conditions, 11-dimensional time-domain statistical features are extracted from the selected modal component in this study. These features include mean value, standard deviation, variance, root mean square, peak value, peak-to-peak value, skewness, kurtosis, crest factor, impulse factor, and margin factor. Specifically, the mean value reflects the average vibration level of the selected modal component, while the standard deviation and variance describe the dispersion degree and fluctuation intensity of the signal. The root mean square represents the effective energy level of the vibration signal and is sensitive to changes in bearing operating states. The peak value and peak-to-peak value characterize the maximum impact amplitude and overall vibration range, which are closely related to transient shock components caused by local bearing defects. In addition, skewness reflects the asymmetry of the signal amplitude distribution, whereas kurtosis is highly sensitive to impulsive components and is commonly used to identify early bearing faults. The crest factor evaluates the ratio between the peak amplitude and the effective signal energy, which can indicate the existence of abnormal impact responses. The impulse factor further measures the intensity of sudden shocks relative to the average signal amplitude, while the margin factor emphasizes the sensitivity of extreme impact components and is useful for detecting localized fault-induced vibration impulses. Compared with high-dimensional frequency-domain and time–frequency-domain features, these time-domain statistical indicators have lower computational complexity, stronger physical interpretability, and higher computational efficiency. More importantly, bearing fault signals usually exhibit impulsive, nonstationary, and amplitude-fluctuation characteristics, which can be effectively described by the selected 11-dimensional feature set. Therefore, these features can balance diagnostic information representation and computational cost, providing reliable input information for subsequent BiTSF-Net fault classification. 3.4. Feature Fusion and Overall Network Architecture of BiTSF-Net To comprehensively leverage the multi-level temporal features intrinsic to mechanical fault signals, this research puts forward a BiTSF-Net model. Through the effective integration of complementary features extracted by BiTCN and BiLSTM, the network realizes the collaborative modeling of local impact characteristics and long-term dependency features. The comparative architecture between BiTSF-Net and serial TCN-LSTM is presented in Figure 3. In the BiTSF-Net, the input feature sequence is initially fed in parallel into two sequential modeling branches, namely the BiTCN and the BiLSTM. The BiTCN is dedicated to extracting local temporal patterns and multi-scale impact features from signals via expanded convolutional architectures, which facilitates the efficient capture of short-term mutations and periodic information. Simultaneously, the BiLSTM utilizes a bidirectional recursive structure to model sequences, highlighting the long-term dependency relationships and nonlinear dynamic evolution characteristics of signals. To effectively utilize the temporal features extracted from different branches, this research adopts a feature-level fusion strategy to integrate the outputs of the BiTCN and the BiLSTM. Specifically, a joint feature representation is constructed by concatenating the two feature streams along the feature dimension. H fusion = [ H BiTCN / / H BiLSTM ] (21) Following feature fusion, a fully connected layer is incorporated to conduct nonlinear mapping and dimensionality reduction on the integrated features. This process not only strengthens feature coupling but also suppresses redundancy. Regularization methods, such as Dropout, are utilized to enhance the model’s generalization ability. Subsequently, the fused features are input into the classification layer for fault pattern recognition. In this layer, the Softmax function yields probability distributions for each fault category. The mathematical expression is as follows: P ( y = i ∣ H fusion ) = exp ( z i ) ∑ j = 1 C exp ( z j ) (22) where C denotes the number of fault categories, and z i represents the network output corresponding to the i-th category. By minimizing the cross-entropy loss function, end-to-end training of network parameters is conducted to achieve accurate discrimination of different fault states. 3.5. Bearing Fault Diagnosis Method for ISAO-VGNMD and BiTSF-Net The bearing fault diagnosis model based on ISAO-VGNMD and BiTSF-Net follows the workflow illustrated in Figure 4. The specific diagnostic procedure is illustrated as follows: Step 1: The original dataset is first divided into independent training and test sets. Subsequently, samples used for ISAO-based VGNMD parameter optimization are randomly selected only from the training set to avoid potential data leakage during the optimization process. Step 2: Using envelope entropy as the fitness function and combining it with the ISAO algorithm, the parameters ( α, β) of VGNMD are optimized to obtain the optimal IMF component index value. Step 3: The optimal parameters are substituted back into VGNMD, the original data are decomposed, and the optimal IMF components are output. Step 4: The corresponding optimal IMF components and their 11-dimensional time-domain features are extracted; then, a feature set is constructed after data augmentation. Step 5: The extracted feature set is used to construct the BiTSF-Net fault diagnosis model based on the predefined training and test sets, where the test data remain completely independent throughout the optimization and training stages. 4. Experimental Verification and Analysis 4.1. Data Introduction The collected signals are saved in MAT format for subsequent data processing and analysis. These data will be utilized in bearing fault diagnosis research, enabling accurate fault diagnosis through feature extraction and classification recognition of vibration signals. In this experiment, the dataset consists of 1200 samples divided into training and testing sets at a 3:1 ratio. To ensure fair evaluation and avoid information leakage, the test set was not used in the parameter optimization, modal selection, or model training procedures. Standard data labels are assigned as follows: Label 1 for normal data, Label 2 for inner ring fault 1, Label 3 for inner ring fault 2, Label 4 for inner ring fault 3, Label 5 for outer ring fault 1, Label 6 for outer ring fault 2, Label 7 for outer ring fault 3, Label 8 for rolling element fault 1, Label 9 for rolling element fault 2, and Label 10 for rolling element fault 3. Table 3 details the composition of the experimental dataset. 4.3. Comparison of Diagnostic Results in the CWRU Dataset To validate the diagnostic performance of the model described in Section 3.4, this study comprehensively evaluates its adaptability and robustness in VGNMD-decomposed data environments using the CWRU dataset. Cross-validation experiments are conducted with recently proposed fault diagnosis models, including CNN, LSTM, GRU, and ISAO algorithms. The experiments were performed independently 20 times with repeated runs. Comparative results of all control models are presented in Table 4. The population size NP of the SAO and ISAO algorithms is set to 20, and the iteration count T is set to 25. Then, fault diagnosis experiments are conducted for M1 through M6. The confusion matrix diagram obtained from one independent experiment is shown in Figure 9. As shown in Figure 9, the overall diagnostic performance of models M1, M2, and M3 falls within the “usable but unstable” range. Specifically, M1 achieves an accuracy rate of 86.67%, M3 reaches 88.67%, and M2 demonstrates slightly better performance at 91.33%. These results indicate that under identical feature and input conditions, directly integrating basic network architectures like CNN, LSTM, or GRU after VGNMD alone results in limited discrimination capability for similar fault categories. This leads to more pronounced “off-diagonal misclassification” phenomena in the confusion matrix, where certain fault categories are erroneously assigned to other similar categories, thereby reducing overall classification accuracy. Within the internal comparison of M1–M3, the GRU-based model (M3) outperforms LSTM (M2) and CNN (M1) in accuracy, reflecting GRU’s superior balance between fitting performance and generalization ability through its simplified gate structure and moderate parameter scale in this experimental dataset. However, even with these advantages, the overall accuracy rate remains below 92%, indicating persistent challenges in completely eliminating confusion between different fault categories. This highlights the need for significant improvements in adaptability and robustness for existing models when handling complex data environments. Further analysis of performance metrics from M4 to M6 reveals a stepwise significant improvement. M4 achieves an accuracy rate of 98%, demonstrating that the BiTSF-Net architecture effectively integrates long-term temporal dependencies with discriminative features in time-series data. This optimization concentrates energy distribution along the principal diagonal in the confusion matrix, resulting in a substantial reduction in inter-class misclassifications. Building upon this foundation, the introduction of SAO and the improved ISAO algorithm for automatic parameter optimization further enhanced the diagnostic performance. Specifically, M5 achieved an accuracy of 98.33%, while M6 obtained a maximum diagnostic accuracy of 100% in one independent experimental run. In Figure 9g, 10 categories correspond to ten fault types, and the diagnostic accuracy for each fault reaches 100%. Detailed performance metrics and stability data are presented in Table 5. It should be noted that the time reported in Table 5 refers to the testing time, namely, the runtime required to perform fault diagnosis on the test set. It does not include the model training time or the offline ISAO-based VGNMD parameter optimization time. Table 5 reveals that traditional models (CNN, GRU, and LSTM) exhibit noticeable performance limitations in fault diagnosis tasks, with accuracy rates consistently below 92%. Regarding diagnostic efficiency, the reported time corresponds to the average testing time required for fault classification rather than model training time. Among the traditional models, GRU achieves a relatively favorable balance between diagnostic accuracy and testing efficiency, whereas LSTM requires slightly longer testing time without providing substantial improvements in diagnostic performance. In contrast, the proposed BiTSF-Net model (M4) achieves a diagnostic accuracy of 98.00% while maintaining a relatively low testing time, demonstrating that the parallel fusion of BiTCN and BiLSTM effectively enhances feature representation capability without significantly increasing diagnostic latency. Building upon this architecture, the introduction of SAO optimization (M5) further improves the average diagnostic accuracy to 98.53%, indicating that parameter optimization enhances the robustness and stability of the fault diagnosis model. By implementing the enhanced ISAO algorithm (M6), the proposed model achieved a maximum diagnostic accuracy of 100% and an average diagnostic accuracy of 99.63% across 20 independent experiments. The results indicate that the proposed framework maintains high diagnostic stability and strong generalization performance under repeated evaluations. Overall, the proposed ISAO-BiTSF-Net model demonstrates significant advantages in accuracy, stability, and computational efficiency, validating the effectiveness and superiority of this approach for bearing fault diagnosis under complex operating conditions. It should be noted that the CWRU dataset used in this study was collected under relatively controlled laboratory conditions with stable operating parameters and limited environmental interference. Therefore, the fault characteristics are more distinguishable than those in real industrial environments. In practical applications, variable operating conditions, strong background noise, and coupled interference may increase the difficulty of fault diagnosis. Although the proposed method achieved excellent performance on the benchmark dataset, further validation under real industrial scenarios is still required to evaluate its generalization capability and engineering applicability. The results indicate that the design of BiTSF-Net and optimization algorithms is key to performance improvement, with ISAO exhibiting stronger global search and convergence capabilities compared to SAO. To further demonstrate ISAO’s convergence performance, it was tested against the SAOE algorithm on four benchmark functions, as shown in Figure 10. The convergence curves in Figure 10a–d demonstrate that ISAO outperforms SAO across all test functions, achieving faster convergence to optimal solutions during initial iterations with lower final fitness values. Its convergence process exhibits smoother stability without significant oscillations or premature convergence, indicating the algorithm strikes a more balanced equilibrium between global exploration and local optimization. To further verify the contribution of the proposed ISAO optimization strategy and the BiTSF-Net architecture to bearing fault diagnosis performance, additional ablation experiments were conducted under identical experimental conditions. The comparative results are summarized in Table 6. As shown in Table 6, the proposed BiTSF-Net achieves higher diagnostic accuracy than the conventional TCN-LSTM architecture, indicating that the parallel dual-branch structure can more effectively capture both local transient features and long-term temporal dependencies of bearing vibration signals. Compared with the original BiTSF-Net, the incorporation of SAO-based parameter optimization further improves the diagnostic performance, demonstrating the importance of adaptive VGNMD parameter optimization for modal decomposition quality and fault feature representation. Furthermore, the proposed ISAO-BiTSF-Net framework achieves the highest optimal accuracy and average accuracy among all compared models. These results demonstrate that the proposed ISAO strategy can effectively enhance decomposition parameter optimization capability, thereby improving modal separation quality and providing more discriminative fault information for subsequent BiTSF-Net classification. 5. Conclusions and Further Research 5.1. Conclusions To address the challenges of strong nonstationarity, noise interference, and insufficient adaptive feature extraction in rolling bearing vibration signals under complex operating conditions, this study proposes an intelligent fault diagnosis framework integrating ISAO-optimized VGNMD with BiTSF-Net. The proposed method combines adaptive signal decomposition, entropy-based modal selection, and multi-scale temporal feature learning to improve fault feature representation and diagnostic performance. First, the proposed ISAO algorithm effectively enhances the parameter optimization capability of VGNMD by introducing Latin hypercube sampling and Tent chaotic mapping. Compared with conventional optimization strategies, ISAO improves population diversity, global exploration capability, and convergence stability, thereby enabling more accurate extraction of informative modal components from complex nonstationary signals. Second, the minimum envelope entropy criterion enables adaptive selection of modal components containing significant fault-related impact information. Combined with multidimensional time-domain feature extraction, the proposed framework effectively suppresses redundant and noise-dominated information while improving feature representation capability. Third, experimental results verified the effectiveness and superiority of the proposed framework. On the CWRU bearing dataset containing 10 fault categories, the proposed ISAO-VGNMD-BiTSF-Net method achieved a maximum diagnostic accuracy of 100% and an average diagnostic accuracy of 99.63%, outperforming comparative deep learning models such as CNN, GRU, LSTM, and conventional TCN-LSTM networks. These results confirm that the proposed method possesses excellent feature extraction capability, high diagnostic accuracy, and strong robustness for intelligent bearing fault diagnosis under nonstationary conditions. From an industrial perspective, the proposed framework provides a promising solution for intelligent condition monitoring and predictive maintenance of rotating machinery. By effectively extracting fault-sensitive information from complex vibration signals, the method can potentially be applied to critical industrial equipment such as wind turbines, induction motors, gearboxes, railway traction systems, and power generation machinery. The high diagnostic accuracy and adaptive decomposition capability of the proposed framework may contribute to early fault detection, maintenance decision support, reduced unplanned downtime, and improved operational reliability in industrial environments. It should also be noted that the present study was validated using the CWRU benchmark dataset collected under relatively controlled laboratory conditions. Although the proposed framework demonstrated excellent diagnostic performance, further validation using real industrial field data is still necessary to comprehensively evaluate its robustness and generalization capability under practical operating environments involving variable working conditions, strong background noise, sensor uncertainty, and coupled fault scenarios. Such investigations will further support the practical deployment of the proposed framework in industrial intelligent maintenance systems. 5.2. Further Research Although the proposed method achieves excellent diagnostic performance on the benchmark dataset, several research directions deserve further investigation: Future work will integrate vibration, acoustic, temperature, current, and other heterogeneous sensor signals to construct a multi-modal fault diagnosis framework, thereby improving diagnostic reliability and robustness under complex industrial conditions. Future studies will explore domain adaptation and transfer learning techniques to enhance model adaptability under varying loads, rotational speeds, and environmental disturbances, thereby improving cross-condition diagnostic performance. To facilitate practical industrial implementation, lightweight network architectures and edge-computing deployment strategies will be investigated to enable real-time fault monitoring and intelligent maintenance applications. Although ISAO demonstrates strong optimization capability, hybrid optimization mechanisms combining multiple swarm intelligence algorithms and adaptive parameter control strategies can be further explored to improve convergence efficiency and solution accuracy in high-dimensional optimization problems. Author Contributions Conceptualization, J.L. and X.C.; methodology, H.L.; software, J.L.; validation, X.Z., X.H.; formal analysis, J.L.; investigation, X.C.; resources, H.L.; data curation, X.Z.; writing—original draft preparation, J.L.; writing—review and editing, X.C.; visualization, T.P.; supervision, X.C.; project administration, X.C. All authors have read and agreed to the published version of the manuscript. Funding This research was funded by the Construction Project of Changdian Xinneng Pumped Storage Electrical Secondary Innovation Laboratory, China Yangtze Power Co., Ltd., grant number Z152401006. Data Availability Statement The data supporting the reported results are available from the corresponding author upon reasonable request. Acknowledgments Special thanks are given to the Longxu Youth Innovation Studio for its support. Conflicts of Interest Authors Xupeng Chen, Huiyin Li, Xu Zhang and Jianling Lai were employed by the company China Yangtze Power Renewables Co., Ltd. Authors Xin Hu and Tian Peng were employed by the company PowerChina HuaDong Engineering Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. References Framework of VGNMD. Framework of VGNMD. Flowchart of the ISAO algorithm. Flowchart of the ISAO algorithm. The comparative architecture of BiTSF-Net and serial TCN-LSTM. The comparative architecture of BiTSF-Net and serial TCN-LSTM. Flowchart of Diagnostic Process. Flowchart of Diagnostic Process. Experimental platform of Western Reserve University. Experimental platform of Western Reserve University. Line Chart of Envelope Entropy. Line Chart of Envelope Entropy. Plot of Optimal IMF. Plot of Optimal IMF. Line Chart of Feature Data. Line Chart of Feature Data. Plot of Diagnostic Results for M1–M6. Plot of Diagnostic Results for M1–M6. Comparison of Convergence Curves between ISAO and SAO. Comparison of Convergence Curves between ISAO and SAO. Table 1. Model parameters of different fault states. Table 1. Model parameters of different fault states. Status Fault Diameter (Inches) Rolling Element Fault 0.007/0.014/0.021 Inner Race Fault 0.007/0.014/0.021 Outer Race Fault 0.007/0.014/0.021 Normal Condition None Table 2. Parameter settings of bearing data. Table 2. Parameter settings of bearing data. Parameters Settings Load 0 HP Model SKF6025 Frequency 12 kHz Rotational Speed 1797 rpm Sampling Points 2048 Table 3. Composition of the experimental bearing fault states dataset. Table 3. Composition of the experimental bearing fault states dataset. Dataset Label Bearing Fault Type Fault Diameter (Inches) 1 Normal 0 2 Inner Race Fault 0.007 3 Rolling Element Fault 0.007 4 Outer Race Fault 0.007 5 Inner Race Fault 0.014 6 Rolling Element Fault 0.014 7 Outer Race Fault 0.014 8 Inner Race Fault 0.021 9 Rolling Element Fault 0.021 10 Outer Race Fault 0.021 Table 4. Comparison of Various Control Group Models. Table 4. Comparison of Various Control Group Models. Number Model Number Model M1 CNN M4 BiTSF-Net M2 GRU M5 SAO-BiTSF-Net M3 LSTM M6 ISAO-BiTSF-Net Table 5. Comparison of diagnostic results for the CWRU dataset. Table 5. Comparison of diagnostic results for the CWRU dataset. Diagnostic Model Time Optimal Accuracy Rate Average Accuracy Rate M1 2.04 86.67% 85.33% M2 1.43 91.33% 90.79% M3 2.44 88.67% 87.67% M4 1.65 98% 97.33% M5 2.57 98.53% 98.33% M6 3.09 100.0% 99.63% Table 6. Ablation analysis of different optimization strategies and network architectures. Table 6. Ablation analysis of different optimization strategies and network architectures. Method Time Optimal Accuracy Rate Average Accuracy Rate TCN-LSTM 1.51 97.12% 96.78% BiTSF-Net 1.65 98% 97.33% SAO-BiTSF-Net 2.57 98.33% 98.53% ISAO-BiTSF-Net 3.09 100.0% 99.63% Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. © 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. Share and Cite MDPI and ACS Style Chen, X.; Li, H.; Zhang, X.; Lai, J.; Hu, X.; Peng, T. Enhancement of Bearing Fault Diagnosis Using Optimized Variational Decomposition, Entropy-Based Modal Reconstruction, and Evolutionary Bidirectional Fusion Network. Processes 2026, 14, 1861. https://doi.org/10.3390/pr14121861 AMA Style Chen X, Li H, Zhang X, Lai J, Hu X, Peng T. Enhancement of Bearing Fault Diagnosis Using Optimized Variational Decomposition, Entropy-Based Modal Reconstruction, and Evolutionary Bidirectional Fusion Network. Processes. 2026; 14(12):1861. https://doi.org/10.3390/pr14121861 Chicago/Turabian Style Chen, Xupeng, Huiyin Li, Xu Zhang, Jianling Lai, Xin Hu, and Tian Peng. 2026. "Enhancement of Bearing Fault Diagnosis Using Optimized Variational Decomposition, Entropy-Based Modal Reconstruction, and Evolutionary Bidirectional Fusion Network" Processes 14, no. 12: 1861. https://doi.org/10.3390/pr14121861 APA Style Chen, X., Li, H., Zhang, X., Lai, J., Hu, X., & Peng, T. (2026). Enhancement of Bearing Fault Diagnosis Using Optimized Variational Decomposition, Entropy-Based Modal Reconstruction, and Evolutionary Bidirectional Fusion Network. Processes, 14(12), 1861. https://doi.org/10.3390/pr14121861 Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here. Article Metrics Article metric data becomes available approximately 24 hours after publication online.

www.mdpi.com

Zum Originalartikel