A novel approach to aeroengine performance diagnosis based on physical model coupling data-driven using low-rank multimodal fusion method

Aeroengine health assessment plays a pivotal role in ensuring flight safety and reliability. Traditionally, this process involves diagnosing the performance of the aeroengine gas path. However, owing to the intricacies of operating conditions, non-linear performance, and the interplay of gas path performance fault characteristics, determining the aeroengine health condition directly from engine monitoring information poses a significant challenge, particularly in cases of insufficient sensor data. To address these challenges, a novel digital twin method for aeroengine performance diagnosis has been proposed. This method integrates data-driven and performance models, employing a low-rank multimodal fusion approach. By digitizing the physical system or process through mathematical models and simulation technology, this approach presents distinct advantages compared to previous methods relying solely on models or data. At the aeroengine component level, an adaptive model was implemented, and the data-driven model was constructed using flight data. Gas path fault classification employed support vector machines. The engine digital twin was established through low-order multimodal fusion. Results indicate that the proposed method attains excellent diagnostic accuracy under both steady and transient conditions. It can be harnessed to enhance engine performance monitoring and evaluation, thereby improving the reliability, availability, and efficiency of the engine.

Introduction

Aeroengines, acting as the primary power source for aircraft, undergo rigorous operating conditions throughout their lifespan, resulting in the degradation of gas path performance (Shuang et al., 2021 & Chen et al., 2021). This deterioration can give rise to gas path failures, and in extreme cases, lead to system collapse. Consequently, the creation of precise and dependable aeroengine performance diagnosis systems is imperative to guarantee flight safety, reliability, and the extension of engine service life (Xu et al., 2022, Sun et al., 2020). Presently, the majority of engine performance diagnosis systems are grounded in the theoretical framework of gas path analysis (GPA), utilizing two primary approaches: model-based and data-driven methods.

The model-based approach focuses on establishing a precise mechanism model for quantitatively analyzing the variations in the engine's performance (Urban, 1973). Several researchers, including Gulati et al. (2000) and Aretakis et al. (2003), have conducted engine performance evaluation and fault diagnosis by deriving multiple fault equations from diverse steady-state points. Zedda and Singh (2003) have addressed the diagnosis of sensor bias in sexual energy and employed a two-level combination search of sensor and gas path components to isolate these components from each other. Song et al. (2015) have developed an engine performance prediction model and validated its accuracy in predicting the engine gas path performance status. Kim et al. (2020) have proposed a mechanism model method to predict engine performance changes in both steady-state and transient modes. However, to enhance the accuracy of the mechanism model, parameter modifications are necessary. Typically, steady-state working points are extracted from a significant amount of experimental and operational data for verification. Nevertheless, this process can be relatively complex and may result in information loss (Tsoutsanis et al., 2014).

The data-driven approach relies on utilizing existing information, experience, and data to quantitatively analyze the engine's health without the need for complex mechanism models (Wang and Zhao, 2023). To enhance detection quality and reduce setup time, Viharos and Kis (2015) proposed a fuzzy logic method that combines neural network learning with fuzzy logic and ideal solution similarity ranking technology based on user rules. Pu et al. (2013) investigated a directed graph Bayesian belief network that employs probabilistic reasoning and expert systems for knowledge representation and reasoning, demonstrating good performance in handling uncertain information. Kumar et al. (2018) introduced a method that combines fuzzy logic with support vector machines, which proves valuable not only for engine performance analysis and fault diagnosis but also for estimating the remaining life of engine components. Fentaye et al. (2019) utilized neural networks and support vector machines to quantify and classify engine component performance degradation under standard white noise. Lu et al. (2019) proposed a decentralized DKELM algorithm that significantly improves real-time performance while maintaining classification accuracy. Lu et al. (2020) presented the GPKELM algorithm, which reduces computing time without sacrificing accuracy, effectively enhancing the real-time performance of fault diagnosis. However, the data-driven model is essentially a “black box” that lacks detailed engine performance information, posing challenges for subsequent engine performance diagnostics.

With the advancements in physical model methods and artificial intelligence, the concept of digital twins has emerged as an innovative approach. A digital twin is essentially a virtual replica that is connected to a physical system or process. It collects and integrates data from the physical system using sensors, simulators, and other technologies to create an accurate visual model. Digital twin models have the capability to analyse, optimize, and predict the performance of physical systems, providing real-time information about the system's operational status and health (Bondarenko and Fukuda, 2020). However, there is currently a lack of research on digital twinning specifically in the field of aeroengines. Aeroengines are complex physical systems, and effectively integrating engine mechanism models with data-driven models to achieve digital twinning remains a challenging task (Huang et al., 2023).

This paper introduces a digital twinning framework for aeroengines that employs low-rank multimodal fusion methods to combine mechanism models with data-driven models. The proposed framework demonstrates improved accuracy in performance prediction and fault diagnosis compared to using model-based and data-driven methods separately. The mechanism model is constructed at the component level, while the data-driven model utilizes recurrent neural networks. These models are coupled through low-rank multimodal fusion methods to create the proposed data twin model. To achieve the engine performance diagnosis function of the digital twin framework, support vector machines are utilized for fault diagnosis of the rotating components in the engine gas path.

This paper presents a digital twinning framework for aeroengines, utilizing low-rank multimodal fusion methods to integrate mechanism models with data-driven models. The proposed framework showcases enhanced accuracy in performance prediction and fault diagnosis compared to employing model-based and data-driven methods independently. The mechanism model is developed at the component level, with the data-driven model incorporating recurrent neural networks. These models are interconnected through low-rank multimodal fusion techniques to form the proposed data twin model. To fulfilling the engine performance diagnosis function of the digital twin framework, support vector machines are employed for fault diagnosis of the rotating components in the engine gas path.

Methodology

Component-level model

Aeroengines are complex systems that operate in harsh environments for extended periods, leading to degradation of their gas path components and resulting changes in their performance. Consequently, developing an accurate engine performance model is vital for analysing engine gas path performance and diagnosing faults effectively (Talaat et al., 2020). With the advancement of model technology and software, aeroengine performance simulation technology has become a fast and reliable tool for engine engineers to evaluate engine performance. Common engine performance models can be broadly classified into two categories: mathematical theoretical models and component-level models (Li et al., 2012). Component-level models are preferred over mathematical theoretical models as they provide a more comprehensive description of engine gas path components' characteristics and offer better guidance and support for engine performance and structural design. Therefore, subsequent research in this field has focused on utilizing engine component-level models as the foundation for analysis and development.

The engine model investigated in this study focuses on the CFM56-5B engine, which represents a typical high bypass ratio turbofan engine with two spools and a booster. The structure of the turbofan engine is depicted in Figure 1. The engine's component-level model developed for this research comprises various modules, including the intake, fan, low-pressure compressor, high-pressure compressor, combustion chamber, high-pressure turbine, low-pressure turbine, outer bypass, and nozzle. The model environment employed in this study is Matlab2020a.

Figure 1.

Schematic diagram of engine component-level model.

https://journal.gpps.global/f/fulltexts/191169/JGPPS-00213-2023-01.01_min.jpg

In Figure 1, each block represents a specific module within the engine model. The Intake block represents the intake port, FAN represents the fan module, LPC represents the low-pressure compressor module, HPC represents the high-pressure compressor module, Combustion represents the combustion chamber module, HPT represents the high-pressure turbine module, LPT represents the low-pressure turbine module, Nozzle represents the nozzle module, and Bypass represents the outer duct module. The model receives inputs such as flight altitude, Mach number, and fuel flow, and solves the equilibrium equations under both steady-state and transient conditions using the Newton-Raphson method (Li et al., 2018). It calculates the corresponding gas path parameters for each station of the engine during equilibrium. The component-level model possesses both steady-state and transient capabilities. Table 1 presents the design point performance of the engine, while Table 2 displays the relative absolute error (RE) results of relevant station parameters for the component-level model compared to GasTurb13 under design performance conditions. The table demonstrates that the maximum error for the same engine station is only 0.012%, indicating the accuracy of the performance calculation achieved by the developed model. The component-level model undergoes modifications through an adaptive method using component characteristic diagrams, with further details provided in reference (Li and Nilkitsaranont, 2009).

Table 1.

Design point performance.

Parameter	Value	Unit
Thrust	95.31	kN
Total flow rate	322.65	kg/s
Total Pressure ratio	33.8
SFC	10.17	g/(kN*s)

Table 2.

Design point performance validation.

Symbol	Definition	Units	Engine Model	GasTurb	RE (%)
T₁₃	Bypass exit temperature	K	327.84	327.85	0.006
W₁₃	Bypass exit mass flow	kg/s	270.79	270.78	0.004
T₃	LPC exit temperature	K	389.33	389.34	0.003
W₃	LPC exit mass flow	kg/s	51.301	51.297	0.004
T₄	HPC exit temperature	K	802.79	802.83	0.006
T₅	CC exit temperature	K	1.6703 × 10³	1.6704 × 10³	0.007
W₅	CC exit mass flow	kg/s	52.071	52.066	0.008
T₆	LPT exit temperature	K	1.1138 × 10³	1.1139 × 10³	0.011
T₈	Nozzle exit temperature	K	834.51	834.61	0.012

Data-driven model

Data-driven methods differ from model-based methods as they utilize existing information, experience, and data to quantitatively analyse engine performance changes. These methods do not require the establishment of additional complex mechanism models and possess characteristics such as dynamic learning and self-optimization. The objective of this study is to develop an accurate digital twin model for aviation engines that can monitor their health in real-time. However, component-level models often involve complex iterations, leading to low computational efficiency and difficulty in meeting engineering requirements for real-time monitoring. While data-driven models are sometimes viewed as black boxes, they can dynamically predict the future performance degradation trend of aeroengines through historical data with high prediction accuracy. Moreover, they can promptly respond to changes in engine operating conditions. Flight data typically consists of time-series data with strong timeliness, and data-driven models can only predict real-time data based on historical flight data. Therefore, extracting relevant features from historical flight data and ensuring the predictive performance of real-time data pose challenges that require the predictive ability of machine learning algorithms.

Recurrent Neural Networks (RNN) are a type of algorithm particularly suitable for learning continuous time series data. The architecture of an RNN incorporates the ability to propagate information from previous time steps to the current time step, enabling it to capture the temporal characteristics of the data. Figure 2 illustrates the structure of an RNN, which resembles that of a conventional multi-layer feedforward neural network. However, in an RNN, the output of the hidden layer neurons is fed back and utilized as input for the subsequent time step, alongside the input signals from the input layer neurons. This feedback loop allows certain neuron outputs to serve as inputs, facilitating the RNN's ability to handle time-dependent dynamics. The circular structure of an RNN allows information to flow not only from the input at time t but also from the network state at time t − 1, affecting the output state of the RNN at time t. This characteristic enhances the RNN's capability to capture temporal dependencies and handle time-related dynamic changes (Asrav and Aydin, 2023). In this study, the RNN employs a sigmoid activation function and is trained using the Backpropagation (BP) algorithm.

Figure 2.

Network Schematic Structure of the RNN.

https://journal.gpps.global/f/fulltexts/191169/JGPPS-00213-2023-01.02_min.jpg

Low-rank multimodal musion

Multimodal musion using tensor representations

In general, modality refers to the manner in which things occur or exist, while multimodality refers to the combination of two or more modes. Modes represent different sources or forms of information, such as text, images, or voice. The current research field primarily focuses on processing three modes: image, text, and voice. The rationale behind fusing these modes is that each mode provides unique representations and perspectives on things. Consequently, there may be overlapping and complementary phenomena, as well as multiple interactions between different modes of information. Effectively processing multimodal information can result in the extraction of rich feature information and enhance prediction accuracy (Wörtwein and Scherer, 2017).

The goal of multimodal fusion is to integrate unimodal representations into a compact multimodal representation for downstream tasks. Tensor representations have gained significant attention for their ability to capture multimodal interactions. The tensor representation accomplishes this by converting the input representation to a high-dimensional tensor and then transforming it to a low-dimensional output vector space. The tensor is constructed by taking the external product of the input modes (Zadeh et al., 2017). The input tensor Z is expressed as follows:

(1)

Z=⊗m=1Mzm,zm∈Rdm

where, ⊗m=1M represents the tensor outer product of a set of vectors indexed by M, and z_m is the input representation. The input tensor Z is transformed by a linear layer g to produce a vector, which is expressed as follows:

(2)

h=g(Z;W,b)=W∗Z+b,h,b∈Rdy

where, W represents weight and b represents offset. Figure 3 is a schematic diagram of tensor fusion under dual mode conditions.

Figure 3.

Schematic diagram of bimodal tensor fusion.

https://journal.gpps.global/f/fulltexts/191169/JGPPS-00213-2023-01.03_min.jpg

One of the main drawbacks of tensor fusion is the need to calculate correlations between elements of different modes through tensor outer products. This process can increase the dimensionality of the feature vector significantly when the number of modes is large. This can result in a large amount of computation that can be difficult to train and may lead to overfitting.

Low-rank multimodal fusion with modality-specific factors

To address the limitations of tensor-based multimodal fusion methods when processing large amounts of modal data, this paper proposes a low-rank multimodal fusion method (LMF). LMF decomposes the weight of a tensor representation into tensor representations, utilizing the parallel decomposition of low-rank weight tensors and input tensors to calculate tensor-based fusion. This reduces the number of parameters, improves computational efficiency, and better adapts to a large number of modal application scenarios. The core idea is to perform a multi-dimensional dot product after each mode is individually linearly transformed. This involves summing the results of multiple low-rank vectors, thereby reducing the number of parameters in the model (Liu et al., 2018). The tensor of order M (M is the number of input modes) can be decomposed into vectors in the following form:

(3)

Wk=∑i=1R⊗m=1Mwm,ki,wm,ki∈Rdm

In the formula (3), the rank of the tensor is the smallest R obtained by decomposition, and the decomposition factor of the rank R of the original tensor is {{wim,k}m=1M}i=1R. The low rank weight tensor can be modified to:

(4)

W=∑i=1r⊗m=1Mwmi

The LMF method utilizes each rank r to reconstruct low rank W_k, and recombines and concatenates these vectors into the low rank factors of M modes. Equation (2) can be rewritten as:

(5)

h=(∑i=1r⊗m=1Mwmi)∗Z

By introducing a low rank factor to reconstruct the calculation of the weight W, formula (5) can be further rewritten as:

(6)

h=∑i=1r⁡[200m=1M⁡[wm1,wm2,…,wmr]∗zm∧]i

In the formula, ΛM_m = 1 represents the meta product on a quantity sequence. Figure 4 shows a flow diagram for decomposing the weight tensor W into low-rank factors in a dual-mode situation. As shown in Figure 4, the low-rank factors are connected to form M-order tensors, which are used for element multiplication and summation along the first dimension of the bounding matrix. Instead of representing h using vector sets, it is calculated using modal-specific decomposition factors and parameterized by M-order tensors. This approach greatly reduces the dimensionality of the tensor Z and weight W, and prevents excessive computation and difficulty in training.

Figure 4.

Schematic diagram of tensor fusion under dual-mode condition.

https://journal.gpps.global/f/fulltexts/191169/JGPPS-00213-2023-01.04_min.jpg

Support vector machine

SVM (Support Vector Machine) is a widely used machine learning algorithm for classification and regression analysis. The basic concept behind SVM is to find a hyperplane or decision boundary in a high-dimensional space to separate samples of different categories. In classification problems, SVM represents samples as vectors, and aims to find a hyperplane that places samples of the same category on the same side of the hyperplane, while samples of different categories are on opposite sides. If the samples cannot be perfectly separated, SVM allows a certain degree of classification error while minimizing both the classification error and the distance from the hyperplane to the sample points. This is the optimization goal of SVM. Figure 5 illustrates a schematic diagram of a linearly separable classification support vector machine (Jana et al., 2023).

Figure 5.

Schematic diagram of linearly separable classification support vector machine.

https://journal.gpps.global/f/fulltexts/191169/JGPPS-00213-2023-01.05_min.jpg

SVM has several characteristics that make it a popular machine learning algorithm, including:

SVM can handle both linear and nonlinear classification problems by using kernel functions to map the input space to a high-dimensional space. This allows SVM to transform the nonlinear classification problem into a linear classification problem, making it easier to find a hyperplane to separate different categories.
SVM performs well when processing high-dimensional data. Unlike other machine learning algorithms, SVM is not affected by the curse of dimensionality, meaning that increasing the sample dimensions will not significantly impact its performance.
SVM can be effective in small sample situations, as it only uses a subset of the samples for training. This subset is called support vectors, which reduces the risk of overfitting and improves the generalization ability of the model.

The classification effect of SVM can be optimized by adjusting hyperparameters such as the kernel function, error tolerance, and regularization coefficient. This allows the user to fine-tune the model and achieve better classification results. Therefore, SVM is a useful tool for fault identification and classification in engine digital twins, as it can handle complex classification problems, is not affected by high-dimensional data, and can be optimized to improve classification performance.

Process framework

Aeroengine digital twin technology has the potential to assist operators in identifying engine issues and conducting predictive maintenance by analyzing real-time engine data. This can result in enhanced engine reliability, safety, as well as reduced maintenance costs and risks. However, achieving accurate and efficient monitoring and diagnosis of aeroengine performance using digital twin technology remains a significant challenge. To tackle this challenge, a digital twin framework has been proposed based on an engine mechanism model and a data-driven model. This framework accurately simulates the engine's operational state and assesses its health status in real-time by analyzing gas path measurement information obtained from sensors. Figure 6 illustrates the flowchart of this digital twin framework. Through the utilization of this framework, operators can effectively monitor and diagnose the performance of aeroengines with improved accuracy and efficiency, ultimately enhancing engine reliability and safety.

Figure 6.

Flow Chart of Digital Twin Framework.

https://journal.gpps.global/f/fulltexts/191169/JGPPS-00213-2023-01.06_min.jpg

Firstly, a component-level model of the aeroengine mechanism is established to simulate the engine's gas path performance. This model can simulate the measured parameters of each gas path station's inlet and outlet sections based on the engine's operating state, effectively reflecting the engine's performance changes. The mechanism model can also be self-optimized using adaptive methods to better represent the engine's health status. Secondly, an RNN-based data-driven model is developed. Historical flight data is utilized to train the data-driven model, and the internal parameters of the RNN model are adjusted to enable real-time monitoring of flight data. Subsequently, the mechanism model and the data-driven model are integrated through LMF method, forming an engine digital twin model. In particular, the weights assigned to the two modalities are computed through the fusion of mechanism model features and data-driven model features. These weights are then concatenated with the two modal features to calculate a new fusion feature vector. This integration leverages the advantages of both models to simulate the measured gas path parameters more accurately. Finally, an engine performance diagnostic model is constructed using SVM to monitor the engine's health status. This is achieved by comparing the deviation between the simulated gas path measurement parameters and the actual monitoring parameters. The improved accuracy of the generated gas path measurement parameters obtained from the engine digital twin enables SVM to extract features more effectively, leading to more reliable performance diagnosis results.

Results and discussion

As described in Section 2.1, the developed component-level mechanistic model enables precise simulation of the engine's gas path channel and can adapt itself to track the engine's performance degradation. However, this mechanistic model assumes relatively ideal conditions and does not account for the influence of measurement noise and external environmental factors in the data. Consequently, accurately assessing the engine's real-time health status becomes challenging. In contrast, as outlined in Section 2.2, data-driven models have the ability to dynamically learn from historical and current data, providing stronger real-time capabilities. Moreover, these models can capture the effects of measurement noise and external environmental factors during the learning process. However, data-driven models act as black boxes and do not reveal the trends of unmeasurable data, such as changes in component characteristic parameters, within the engine. To address this limitation, LMF method is employed to integrate the two models and establish an engine digital twin framework that harnesses the strengths of both approaches. To assess the reliability of the developed digital twin framework, the proposed method is tested following the process framework depicted in Figure 6. After obtaining separate prediction results from the mechanistic model and the data-driven model, the LMF technique is used to fuse these results. Subsequently, SVM are employed for engine performance diagnosis. By combining the insights from both models through the LMF method and using SVM for diagnosis, the reliability of the engine digital twin framework is verified. This approach enables more accurate and comprehensive assessment of the engine's health status and performance.

Due to the unavailability of actual flight data, simulated flight data generated by the engine component-level model described in Section 2.1 is utilized in the subsequent research. The simulated flight data generates one data point per second to mimic the real sensor acquisition scenario. In order to enhance the realism of the engine data, measurement noise is added to the simulated data to better simulate the engine's flight conditions. Assuming that the measurement noise follows a normal distribution, Table 3 presents the significance of the engine gas path data and their corresponding noise levels. In this study, we have intentionally introduced a small deviation as simulated noise after applying noise reduction techniques. However, it is important to note that the primary focus of this study does not encompass the investigation of the impact of measurement noise on model accuracy. The input to the engine digital twin framework consists of flight control data, including flight conditions such as flight altitude (H) and Mach number (Ma), as well as control regulations such as fuel flow rate (W_f). Figure 7 illustrates the input data for the engine digital twin framework in this study. The engine component-level mechanistic model computes the engine's gas path measurement parameters based on the input flight control data. These missions comprise three main stages: climb, cruise, and landing. Throughout the climb and landing processes, the engine is treated as transient, meaning it experiences dynamic changes in its operating conditions. The simulations aim to demonstrate the effectiveness and reliability of the proposed digital twin method in capturing and analyzing the engine's behavior under real-world flight scenarios.

Table 3.

Measurement parameters and noise level description

Parameters	Definition	Reference noise level
N₁	LP spool speed	0.25%
N₂	HP spool speed	0.25%
T₂	Fan outer exit temperature	0.75%
P₂	Fan outer exit pressure	0.5%
T₃	LPC exit temperature	0.75%
P₃	LPC exit pressure	0.5%
T₄	HPC inlet temperature	0.75%
P₄	HPC inlet pressure	0.5%
T₈	LPT exit temperature	0.75%
P₈	LPT exit pressure	0.5%

Figure 7.

(a–c). Flight Control Data. (a) Control law of H. (b) Control law of W_f. (c) Control law of Ma.

https://journal.gpps.global/f/fulltexts/191169/JGPPS-00213-2023-01.07_min.jpg

Similarly, the data-driven model, trained with sensor data, predicts the engine's gas path measurement parameters based on the flight control data. To account for the measurement issues encountered with actual sensors, the output of the digital twin framework corresponds to the gas path measurement parameters presented in Table 2. Figure 8 demonstrates the predicted values of the total temperature (T₈) at the exit of the low-pressure compressor, as generated by the mechanistic model, the data-driven model, and the digital twin framework with the LMF method. From the figure, it can be observed that while the mechanistic model accurately predicts the engine's performance changes based on the flight control data, it struggles to account for the influence of measurement noise. In comparison, the data-driven model exhibits better prediction accuracy and can incorporate noise, but it may suffer from local overfitting and inaccuracies in some data points. The digital twin framework with the LMF method generates predicted results that exhibit the best fit with the actual values and are closer to the real data.

Figure 8.

(a–c). The T₈ output results of each model. (a) T₈ prediction distribution diagram of mechanism model. (b) T₈ prediction distribution diagram of data-driven model. (c) T₈ prediction distribution diagram of digital twin model.

https://journal.gpps.global/f/fulltexts/191169/JGPPS-00213-2023-01.08_min.jpg

In order to better demonstrate the advantages of the proposed digital twin framework, we compared the mean absolute error (MAE) of the predicted values of different models. The calculation formula for MAE is as follows:

(7)

MAE=1n∑in⁡|yi−yi∗yi|∗100%

where y_i and y*_i represent the actual value and predicted value of the i-th data sample, respectively.

Figure 9 displays the Mean Absolute Error (MAE) of the gas path measurement parameters obtained using different methods. It is evident from Figure 9 that the digital twin method yields lower errors compared to both the physics-based model and the data-driven model, demonstrating that the digital twin method significantly improves prediction performance. Although the MAE of the data-driven model is higher than that of the digital twin method, its prediction results are considerably better than those of the physics-based model. This discrepancy arises because the physics-based model captures the trend of aeroengine performance changes but cannot account for small measurement deviations present in actual measurements. On the other hand, the data-driven model can incorporate the influence of noise during the learning process, but being a black box, it fails to capture the degradation of unmeasurable parameters such as flow rate, pressure ratio, and efficiency in the aeroengine. In terms of overall MAE, the physics-based model has an MAE of 0.55%, the data-driven model has an MAE of 0.31%, and the digital twin model achieves the lowest overall MAE of only 0.24%.

Figure 9.

MAE of each model.

https://journal.gpps.global/f/fulltexts/191169/JGPPS-00213-2023-01.09_min.jpg

Through the aforementioned case studies, we have demonstrated the predictive accuracy of the digital twin framework in accurately simulating actual gas path measurement parameters for engine performance diagnosis. However, obtaining actual fault data for aviation engines can be challenging. Therefore, in this study, a mechanistic model was employed to simulate actual engine gas path fault data. This simulated data, along with Support Vector Machines (SVM), was utilized to diagnose engine gas path performance faults, completing the overall digital twin framework. The study assessed the degree of degradation by incorporating an engine fault rule library. Based on the engine gas path analysis theory, the mechanistic model can simulate the degradation of gas path measurement parameters by modifying component characteristic parameters. This, in turn, enables the simulation of degradation in single or multiple gas path components. Common fault scenarios in the engine gas path include issues such as dirt, erosion, and corrosion. Table 4 provides the relationship between physical faults and component characteristic parameters, outlining how the degradation of specific components affects the gas path. Table 5 presents nine single and multi-type fault scenario cases studied in this research, as they have a higher probability of occurrence. The gas path fault conditions in Table 5 are represented by the corresponding component degradation conditions outlined in Table 4. Utilizing the degradation rates specified in Table 4, different gas path fault conditions are generated by randomly generating the degradation amounts of each gas path component's performance parameters. The developed mechanistic model is then used to generate gas path measurement parameters based on the gas path fault scenarios.

Table 4.

Relationship between the physical faults and health parameters

Physical fault	Flow capacity change (A)	Isentropic efficiency change (B)	Ratio A:B	Range
Compressor fouling	↓	↓	∼3:1	(0, −7.5%), (0, 2.5%)
Compressor erosion	↓	↓	∼2:1	(0, −4%), (0, −2%)
Turbine fouling	↓	↓	∼2:1	(0, −4%), (0, −2%)
Turbine erosion	↑	↓	∼2:1	(0, 4%), (0, −2%)

Table 5.

Cases based on the cause-effect scenario.

Case	FAN fouling	LPC fouling	HPC fouling	LPC erosion	HPT fouling	LPT fouling	LPT erosion
1	×
2		×
3			×
4				×
5					×
6						×
7	×	×			×		×
8	×	×	×	×		×
9	×	×	×		×		×

To validate the effectiveness of the proposed digital twin framework, the engine performance diagnosis function within the framework was performed using SVM, as described in Section 2.4. Table 5 presents the corresponding nine types and combinations of faults considered in the study. A total of 2,700 data samples were collected, with 300 samples for each type of fault. The confusion matrix of the predicted fault classification results is illustrated in Figure 10. Table 6 provides the diagnostic accuracy for different fault cases. The results indicate that the digital twin framework can accurately distinguish all nine fault cases, with only a small number of samples being misclassified. Notably, Case 9 exhibited the highest error rate, with a total of 19 samples being misclassified, with 9 fault samples mistakenly classified as Case 7 and 10 fault samples mistakenly classified as Case 7. Despite these misclassifications, the digital twin framework still demonstrates high accuracy, with an overall classification accuracy of 97.3%. In conclusion, the case study presented in this paper demonstrates that the proposed digital twin framework is well-suited for aeroengine performance diagnosis tasks.

Figure 10.

Fuzzy Matrix of Diagnostic Results.

https://journal.gpps.global/f/fulltexts/191169/JGPPS-00213-2023-01.10_min.jpg

Table 6.

Diagnostic accuracy of each fault case.

Case	Accuracy
Case1	99%
Case2	98.3%
Case3	98.7%
Case4	99%
Case5	99%
Case6	99%
Case7	95%
Case8	94%
Case9	93.7%

The proposed method runs on a computer equipped with an AMD Ryzen 9 3900X CPU and 32GB of memory, allowing it to process the diagnostics of 2,700 engine operating points in just 98.7 seconds. In other words, it takes only 0.0366 seconds to analyze a single engine operating point. This impressive performance demonstrates that the proposed digital twin method is highly suitable for aeroengine performance diagnosis tasks.

Conclusions

In order to enhance the real-time performance diagnosis of aeroengines, this paper introduces a digital twin method that combines a low-rank multimodal fusion mechanism model with a data-driven model. By leveraging the different modal engine information provided by the mechanism model and data-driven model, the LWF method fuses their respective features to jointly represent them as the corresponding engine digital twin model. The engine digital twin model based on the LWF method achieves high-precision performance prediction. Additionally, by incorporating SVM for diagnosing engine gas path performance faults, the overall digital twin framework is established to accomplish engine performance diagnosis.

The feasibility of the proposed method is demonstrated through a comprehensive case study. The main conclusions derived from this study are as follows:

The proposed digital twin method outperforms the mechanism model and data-driven model in simulating the actual health status of the engine. The overall MAE of the digital twin model is only 0.24%. This significant improvement in simulation accuracy enhances the representation of gas path parameters under full flight tasks and accurately reflects the performance changes of engine gas path components.
The proposed digital twin method achieves accurate diagnosis of engine gas path faults, with an overall classification accuracy of 97.3%.
The proposed digital twin method enables real-time performance diagnosis of aeroengines, providing effective data support for engine health management.

In summary, we proposed a new approach to the difficulty of integrating actual physical data and virtual simulation data in the digital twin scheme of aeroengines. The findings of this paper validate the effectiveness of the proposed digital twin method for aeroengine performance diagnosis. The method's capability to accurately simulate engine health, diagnose faults, and facilitate real-time monitoring contributes to enhanced engine health management.

Nomenclature

LPC

low pressure compressor

HPC

high pressure compressor

HPT

high pressure turbine

LPT

low pressure turbine

mass flow

rotor rotational speed

temperature

pressure

RNN

recurrent neural networks

SVM

support vector machine

LWF

low-rank multimodal fusion

MAE

mean absolute error

FUNDING

This research was co-funded by Research Start-up Fund of Fudan University (Grant nos. FDU41051) and the AECC Commercial Aircraft Engine Co., Ltd. (No. AR0973.00RW.001).

COMPETING INTERESTS

Zepeng Wang declares that he has no conflict of interest. Ye Wang declares that he has no conflict of interest. Xizhen Wang declares that he has no conflict of interest. Bokun Zhao declares that he has no conflict of interest. Yongjun Zhao declares that he has no conflict of interest.

REFERENCES (29)

Aretakis N., Associate R., Mathioudakis K., Professor A., Stamatis A., and Associate R. (2003). Nonlinear engine component fault diagnosis from a limited number of measurements using a combinatorial approach. Journal of Engineering for Gas Turbines & Power. 125: 642–650. 10.1115/1.1582494.

CrossRef

Google Scholar

Asrav T. and Aydin E. (2023). Physics-informed recurrent neural networks and hyper-parameter optimization for dynamic process systems. Computers & Chemical Engineering. 173: 108195. 10.1016/j.compchemeng.2023.108195.

CrossRef

Google Scholar

Bondarenko O. and Fukuda T. (2020). Development of a diesel engine’s digital twin for predicting propulsion system dynamics. Energy. 196: 117126. 10.1016/j.energy.2020.117126.

CrossRef

Google Scholar

Chen Y.-Z., Li Y.-G., Tsoutsanis E., Newby M., and Zhao X.-D. (2021). Techno-economic evaluation and optimization of CCGT power plant: a multi-criteria decision support system. Energy Convers Manage. 237: 114107. 10.1016/j.enconman.2021.114107.

CrossRef

Google Scholar

Fentaye A. D., Ul-Haq Gilani S. I., Baheta A. T., and Li Y.-G. (2019). Performance-based fault diagnosis of a gas turbine engine using an integrated support vector machine and artificial neural network method. Proceedings of the Institution of Mechanical Engineers. 233 (6): 786–802. 10.1177/0957650918812510.

CrossRef

Google Scholar

Gulati A., Zedda M., and Singh R. (2000). Gas turbine engine and sensor multiple operating point analysis using optimization techniques. In AIAA 2000 3716. USA: AIAA. 10.2514/6.2000-3716.

CrossRef

Google Scholar

Huang Y., Tao J., Sun G., Wu T., Yu L., and Zhao X. (2023). A novel digital twin approach based on deep multimodal information fusion for aero-engine fault diagnosis. Energy. 270: 126894. 10.1016/j.energy.2023.126894.

CrossRef

Google Scholar

Jana D. K., Bhunia P., Adhikary S. D., and Mishra A. (2023). Analyzing of salient features and classification of wine type based on quality through various neural network and support vector machine classifiers. Results in Control and Optimization. 11: 100219. 10.1016/j.rico.2023.100219.

CrossRef

Google Scholar

Kim S., Kim K., and Son C. (2020). A new transient performance adaptation method for an aero gas turbine engine. Energy 193: 116752. 10.1016/j.energy.2019.116752.

CrossRef

Google Scholar

10.

Kumar A., Shankar R., and Thakur L. S. (2018). A big data driven sustainable manufacturing framework for condition-based maintenance prediction. Journal of Computational Science. 27: 428–39. 10.1016/j.jocs.2017.06.006.

CrossRef

Google Scholar

11.

Li Y. G. and Nilkitsaranont P. (2009). Gas turbine performance prognostic for condition-based maintenance. Applied Energy. 86 (10): 2152–2161. 10.1016/j.apenergy.2009.02.011.

CrossRef

Google Scholar

12.

Li Y. G., Abdul Ghafir M. F., Wang L., Singh R., Huang K., et al. (2012). Improved multiple point nonlinear genetic algorithm based performance adaptation using least square method. Journal of Engineering for Gas Turbines and Power. 134 (3): 031701.1-031701.10. 10.1115/1.4004395.

CrossRef

Google Scholar

13.

Li Y., Zhang H., Han J., and Sun Q. (2018). Distributed multi-agent optimization via event-triggered based continuous-time Newton-Raphson algorithm. Neurocomputing. 275 (JAN.31): 1416–1425. 10.1016/j.neucom.2017.09.079.

CrossRef

Google Scholar

14.

Liu Z., Shen Y., Lakshminarasimhan V. B., Liang P. P., Zadeh A., and Morency L.-P. (2018). Efficient Low-rank Multimodal Fusion with Modality-Specific Factors. (2018). 10.48550/arXiv.1806.00064.

CrossRef

Google Scholar

15.

Lu J., Huang J., and Lu F. (2019). Distributed kernel extreme learning machines for aircraft engine failure diagnostics. Applied Sciences. 9 (8): 1707. 10.3390/app9081707.

CrossRef

Google Scholar

16.

Lu J., Huang J., and Lu F. (2020). Kernel extreme learning machine with iterative picking scheme for failure diagnosis of a turbofan engine. Aerospace Science and Technology. 96: 105539. 10.1016/j.ast.2019.105539.

CrossRef

Google Scholar

17.

Pu X., Liu S., Jiang H., and Yu D. (2013). Sparse bayesian learning for gas path diagnostics. Journal of Engineering for Gas Turbines and Power. 135: 071601. 10.1115/1.4023608.

CrossRef

Google Scholar

18.

Shuang S., Ze-peng W., Xiao-peng S., Hong-li Z., and Zhi-ping W. (2021). An adaptive compressor characteristic map method based on the Bézier curve. Case Studies in Thermal Engineering. 28: 101512. 10.1016/j.csite.2021.101512.

CrossRef

Google Scholar

19.

Song Y., Gu C.-W., and Ji X.-X. (2015). Development and validation of a full-range performance analysis model for a three-spool gas turbine with turbine cooling. Energy. 89: 545–557. 10.1016/j.energy.2015.06.015.

CrossRef

Google Scholar

20.

Sun R., Shi L., Yang X., Wang Y., and Zhao Q. (2020). A coupling diagnosis method of sensors faults in gas turbine control system. Energy. 205: 117999. 10.1016/j.energy.2020.117999.

CrossRef

Google Scholar

21.

Talaat M., Farahat M. A., Mansour N., and Hatata A. Y. (2020). Load forecasting based on grasshopper optimization and a multilayer feed-forward neural network using regressive approach. Energy. 196: 117087.1-117087.12. 10.1016/j.energy.2020.117087.

CrossRef

Google Scholar

22.

Tsoutsanis E., Meskin N., Benammar M., and Khorasani K. (2014). A component map tuning method for performance prediction and diagnostics of gas turbine compressors. Applied Energy. 135: 572–585. 10.1016/j.apenergy.2014.08.115.

CrossRef

Google Scholar

23.

Urban L. A. (1973). Gas path analysis applied to turbine engine condition monitoring. Journal of Aircraft. 10 (7): 400–406.

Google Scholar

24.

Viharos Z. J. and Kis K. B. (2015). Survey on neuro-fuzzy systems and their applications in technical diagnostics and measurement. Journal of the International Measurement Confederation. 67: 126–36. 10.1016/j.measurement.2015.02.001.

CrossRef

Google Scholar

25.

Wang Z. and Zhao Y. (2023). Data-driven exhaust gas temperature baseline predictions for aeroengine based on machine learning algorithms. Aerospace. 10: 17. 10.3390/aerospace10010017.

CrossRef

Google Scholar

26.

Wörtwein T. and Scherer S. (2017). What really matters — An information gain analysis of questions and reactions in automated PTSD screenings. In International Conference on Affective Computing & Intelligent Interaction. IEEE Computer Society, pp. 15–20. 10.1155/2020/8843186.

CrossRef

Google Scholar

27.

Xu M., Liu J., Li M., Geng J., Wu Y., and Song Z. (2022). Improved hybrid modeling method with input and output self-tuning for gas turbine engine. Energy. 238: 121672. 10.1016/j.energy.2021.121672.

CrossRef

Google Scholar

28.

Zadeh A., Chen M., Poria S., Cambria E., and Morency L.-P. (2017). Tensor Fusion Network for Multimodal Sentiment Analysis: arXiv,10.48550/arXiv.1707.07250. 2017. 10.48550/arXiv.1707.07250.

CrossRef

Google Scholar

29.

Zedda M. and Singh R. (2003). Gas turbine engine and sensor fault diagnosis using optimisation techniques. Journal of Propulsion and Power. 18 (5): 1019–1025. 10.2514/2.6050.

Journal Issues

Parameter selection for aeroengine transient state gas path analysis

Estimating RANS model uncertainty using machine learning

Indexes

Keywords index

Topics index

Authors index

Table of contents